Skip to yearly menu bar Skip to main content


Offline evaluation in RL: soft stability weighting to combine fitted Q-learning and model-based methods

Briton Park · Xian Wu · Bin Yu · Angela Zhou

Abstract

Video

Chat is not available.