Poster

Triplets Better Than Pairs: Towards Stable and Effective Self-Play Fine-Tuning for LLMs

Yibo Wang ⋅ Hai-Long Sun ⋅ Guangda Huzhang ⋅ Qingguo Chen ⋅ Zhao Xu ⋅ Weihua Luo ⋅ Kaifu Zhang ⋅ Lijun Zhang
2025 Poster

Abstract
