VideoArxiv
Talk page
Title:
Provably learning a multi-head attention layer
Speaker:
Sitan Chen
Link:
http://www.birs.ca/events/2024/5-day-workshops/24w5214/videos/watch/202402261618-Chen.html
Workshop:
BIRS 24w5214: Computational Complexity of Statistical Inference