Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Language Gamification

Multi-Step Preference Optimization via Two-Player Markov Games

Yongtao Wu · Luca Viano · Yihang Chen · Zhenyu Zhu · Quanquan Gu · Volkan Cevher

Abstract

Chat is not available.