Sirius: Contextual Sparsity with Correction for Efficient LLMs

Yang Zhou ⋅ Zhuoming Chen ⋅ Zhaozhuo Xu ⋅ Victoria Lin ⋅ Beidi Chen

Abstract
