Skip to yearly menu bar Skip to main content


Poster

Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning

Wenlin Zhang ⋅ Xiangyang Li ⋅ Kuicai Dong ⋅ Yichao Wang ⋅ Pengyue Jia ⋅ Xiaopeng Li ⋅ Yingyi Zhang ⋅ Derong Xu ⋅ Zhaocheng Du ⋅ Huifeng Guo ⋅ Ruiming Tang ⋅ Xiangyu Zhao
2025 Poster

Abstract

Video

Chat is not available.