Skip to yearly menu bar Skip to main content


Poster

FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness

Tri Dao · Dan Fu · Stefano Ermon · Atri Rudra · Christopher RĂ©
2022 Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.