Poster Fri, Dec 5, 2025 • 11:00 AM – 2:00 PM PST

Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards

Xiaoyuan Liu ⋅ Tian Liang ⋅ Zhiwei He ⋅ Jiahao Xu ⋅ Wenxuan Wang ⋅ Pinjia He ⋅ Zhaopeng Tu ⋅ Haitao Mi ⋅ Dong Yu
