Skip to yearly menu bar Skip to main content


Adaptive Trust Region Policy Optimization: Convergence and Faster Rates of regularized MDPs

Lior Shani · Yonathan Efroni · Shie Mannor

Abstract

Chat is not available.