Skip to yearly menu bar Skip to main content


Poster

Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

Christoph Dann ⋅ Teodor Vanislavov Marinov ⋅ Mehryar Mohri ⋅ Julian Zimmert
2021 Poster

Abstract

Video

Chat is not available.