Sitan Chen

  1. Title:
    Provably learning a multi-head attention layer

    Speaker:

    Link: