Skip to yearly menu bar Skip to main content


Poster

Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning

Yurun Yuan ⋅ Fan Chen ⋅ Zeyu Jia ⋅ Alexander Rakhlin ⋅ Tengyang Xie
2025 Poster

Abstract

Video

Chat is not available.