Offline Reinforcement Learning with Closed-Form Policy Improvement Operators

Jiachen Li · Edwin Zhang · Ming Yin · Qinxun Bai · Yu-Xiang Wang · William Yang Wang
