Poster

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

Xiaoyuan Liu ⋅ Tian Liang ⋅ Zhiwei He ⋅ Jiahao Xu ⋅ Wenxuan Wang ⋅ Pinjia He ⋅ Zhaopeng Tu ⋅ Haitao Mi ⋅ Dong Yu
2025 Poster
