Skip to yearly menu bar Skip to main content


Poster Wed, Dec 3, 2025 • 4:30 PM – 7:30 PM PST

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

Shutong Ding ⋅ Ke Hu ⋅ Shan Zhong ⋅ Haoyang Luo ⋅ Weinan Zhang ⋅ Jingya Wang ⋅ Jun Wang ⋅ Ye Shi

Abstract

Video

Chat is not available.