Talk page

Title:
Gradient Descent and Attention Models: Challenges Posed by the Softmax Function

Speaker:
Salma Tarmoun

Link:
http://www.birs.ca/events/2024/5-day-workshops/24w5297/videos/watch/202406111630-Tarmoun.html

Workshop:
Birs- 24w5297: Mathematics of Deep Learning