SPG: Sandwiched Policy Gradient for Mask Diffusion Language Models

Chenyu Wang ⋅ Paria Rashidinejad ⋅ DiJia Su ⋅ Song Jiang ⋅ Sid Wang ⋅ Siyan Zhao ⋅ Cai Zhou ⋅ Zejiang Shen ⋅ Feiyu Chen ⋅ Tommi Jaakkola ⋅ Yuandong Tian ⋅ Bo Liu

Abstract
