Skip to yearly menu bar Skip to main content


Poster

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Tri Dao ⋅ Dan Fu ⋅ Stefano Ermon ⋅ Atri Rudra ⋅ Christopher Ré
2022 Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.