Skip to yearly menu bar Skip to main content


Spotlight Poster Thu, Dec 4, 2025 • 4:30 PM – 7:30 PM PST

Nemotron-CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Shizhe Diao ⋅ Yu Yang ⋅ Yonggan Fu ⋅ Xin Dong ⋅ Dan SU ⋅ Markus Kliegl ⋅ ZIJIA CHEN ⋅ Peter Belcak ⋅ Yoshi Suhara ⋅ Hongxu Yin ⋅ Mostofa Patwary ⋅ Yingyan (Celine) Lin ⋅ Jan Kautz ⋅ Pavlo Molchanov

Abstract

Video

Chat is not available.