Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Mengzhou Xia · Tianyu Gao · Zhiyuan Zeng · Danqi Chen

Abstract
