Skip to yearly menu bar Skip to main content


Spotlight Poster

DoReMi: Optimizing Data Mixtures Speeds Up Language Model Pretraining

Sang Michael Xie ⋅ Hieu Pham ⋅ Xuanyi Dong ⋅ Nan Du ⋅ Hanxiao Liu ⋅ Yifeng Lu ⋅ Percy Liang ⋅ Quoc V Le ⋅ Tengyu Ma ⋅ Adams Wei Yu
2023 Spotlight Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.