Spotlight

Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

Christoph Dann ⋅ Teodor Vanislavov Marinov ⋅ Mehryar Mohri ⋅ Julian Zimmert

Abstract
