

Poster

Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples

Tengyu Xu ⋅ Shaofeng Zou ⋅ Yingbin Liang
2019 Poster

Abstract
