VideoArxiv
Talk page
Title:
Gradient Descent and Attention Models: Challenges Posed by the Softmax Function
Speaker:
Salma Tarmoun
Link:
http://www.birs.ca/events/2024/5-day-workshops/24w5297/videos/watch/202406111630-Tarmoun.html
Workshop:
BIRS 24w5297: Mathematics of Deep Learning