

Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Mengzhou Xia ⋅ Tianyu Gao ⋅ Zhiyuan Zeng ⋅ Danqi Chen
Poster

Abstract
