Skip to yearly menu bar Skip to main content


Offline Reinforcement Learning with Closed-Form Policy Improvement Operators

Jiachen Li ⋅ Edwin Zhang ⋅ Ming Yin ⋅ Qinxun Bai ⋅ Yu-Xiang Wang ⋅ William Yang Wang

Abstract

Video

Chat is not available.