Skip to yearly menu bar Skip to main content


Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism

Tianwei Ni ⋅ Esther Derman ⋅ Vineet Jain ⋅ Vincent Taboga ⋅ Siamak Ravanbakhsh ⋅ Pierre-Luc Bacon

Abstract

Chat is not available.