Skip to yearly menu bar Skip to main content


Spotlight

Posterior sampling for reinforcement learning: worst-case regret bounds

Shipra Agrawal · Randy Jia
2017 Spotlight

Abstract

Chat is not available.