Talk page

Title:
Provably learning a multi-head attention layer

Speaker:
Sitan Chen

Link:
http://www.birs.ca/events/2024/5-day-workshops/24w5214/videos/watch/202402261618-Chen.html

Workshop:
Birs- 24w5214: Computational Complexity of Statistical Inference