

Poster

Two Time-scale Off-Policy TD Learning: Non-asymptotic Analysis over Markovian Samples

Tengyu Xu · Shaofeng Zou · Yingbin Liang
2019 Poster

