Spotlight

Beyond Value-Function Gaps: Improved Instance-Dependent Regret Bounds for Episodic Reinforcement Learning

Christoph Dann · Teodor Vanislavov Marinov · Mehryar Mohri · Julian Zimmert

Abstract
