Reinforcement Learning in Factored MDPs: Oracle-Efficient Algorithms and Tighter Regret Bounds for the Non-Episodic Setting

Ziping Xu · Ambuj Tewari
2020 Poster

Abstract
