

Optimizing Attention with Mirror Descent: Generalized Max-Margin Token Selection

Aaron Alvarado Kristanto Julistiono ⋅ Davoud Ataee Tarzanagh ⋅ Navid Azizan

Abstract
