Skip to yearly menu bar Skip to main content


Spotlight Poster

MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention

Huiqiang Jiang ⋅ Yucheng LI ⋅ Chengruidong Zhang ⋅ Qianhui Wu ⋅ Xufang Luo ⋅ Surin Ahn ⋅ Zhenhua Han ⋅ Amir Abdi ⋅ Dongsheng Li ⋅ Chin-Yew Lin ⋅ Yuqing Yang ⋅ Lili Qiu
2024 Spotlight Poster

Abstract

Video

Chat is not available.