Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Language Gamification

S2L-RM: Short-to-Long Reward Modeling

Changyu CHEN ⋅ Zichen Liu ⋅ Haonan Wang ⋅ Chao Du ⋅ Tianyu Pang ⋅ Qian Liu ⋅ Arunesh Sinha ⋅ Pradeep Varakantham ⋅ Min Lin

Abstract

Chat is not available.