Skip to yearly menu bar Skip to main content


Q-Ensemble for Offline RL: Don't Scale the Ensemble, Scale the Batch Size

Alexander Nikulin ⋅ Vladislav Kurenkov ⋅ Denis Tarasov ⋅ Dmitry Akimov ⋅ Sergey Kolesnikov

Abstract

Video

Chat is not available.