Skip to yearly menu bar Skip to main content


Poster Fri, Dec 5, 2025 • 11:00 AM – 2:00 PM PST

GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling

Tianhao Chen ⋅ Xin Xu ⋅ Zijing Liu ⋅ Pengxiang Li ⋅ Xinyuan Song ⋅ AJAY JAISWAL ⋅ Fan Zhang ⋅ Jishan Hu ⋅ Yang Wang ⋅ Hao CHEN ⋅ Shizhe Diao ⋅ Shiwei Liu ⋅ Yu Li ⋅ Lu Yin ⋅ Can Yang

Abstract

Video

Chat is not available.