Approximations may be all you need: Towards Pre-training LLMs with Low-Rank Decomposition and Optimizers

Namrata Shivagunde · Mayank Kulkarni · Giannis Karamanolakis · Jack FitzGerald · Yannick Versley · Saleh Soltan · Volkan Cevher · Jianhua Lu · Anna Rumshisky

Abstract

