Skip to yearly menu bar Skip to main content


Poster

Weighted importance sampling for off-policy learning with linear function approximation

Rupam Mahmood ⋅ Hado P van Hasselt ⋅ Richard Sutton
2014 Poster
[ PDF

Abstract

Chat is not available.