Skip to yearly menu bar Skip to main content


Offline evaluation in RL: soft stability weighting to combine fitted Q-learning and model-based methods

Briton Park ⋅ Xian Wu ⋅ Bin Yu ⋅ Angela Zhou

Abstract

Video

Chat is not available.