Skip to yearly menu bar Skip to main content


Poster

Diversity as a Reward: Fine-Tuning LLMs on a Mixture of Domain-Undetermined Data

Zhenqing Ling ⋅ Daoyuan Chen ⋅ Liuyi Yao ⋅ Qianli Shen ⋅ Yaliang Li ⋅ Ying Shen
2025 Poster

Abstract

Video

Chat is not available.