

Poster

Avoiding exp(R) scaling in RLHF through Preference-based Exploration

Mingyu Chen ⋅ Yiding Chen ⋅ Wen Sun ⋅ Xuezhou Zhang
2025 Poster
