Skip to yearly menu bar Skip to main content


Poster
in
Workshop: System-2 Reasoning at Scale

Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning

Yuxi Xie ⋅ Anirudh Goyal ⋅ Wenyue Zheng ⋅ Min-Yen Kan ⋅ Timothy Lillicrap ⋅ Kenji Kawaguchi ⋅ Michael Qizhe Shieh

Abstract

Chat is not available.