NIPS 2007 Stable Dual Dynamic Programming Spotlight

Spotlight

Stable Dual Dynamic Programming

Tao Wang · Daniel Lizotte · Michael Bowling · Dale Schuurmans

[ Abstract ] [ Visit Spotlights ]

Abstract:

Recently, a novel approach to dynamic programming and reinforcement learning has been proposed based on maintaining explicit representations of stationary distributions instead of value functions. The convergence properties and practical effectiveness of these algorithms have not been previously studied however. In this paper, we investigate the convergence properties of these dual algorithms both theoretically and empirically, and show how they can be scaled up by incorporating function approximation.

Chat is not available.