Timezone: »
Learning to make decisions in an uncertain and dynamic environment is a task of fundamental performance in a number of domains. This paper concerns the problem of learning control policies for an unknown linear dynamical system so as to minimize a quadratic cost function. We present a method, based on convex optimization, that accomplishes this task ‘robustly’, i.e., the worst-case cost, accounting for system uncertainty given the observed data, is minimized. The method balances exploitation and exploration, exciting the system in such a way so as to reduce uncertainty in the model parameters to which the worst-case cost is most sensitive. Numerical simulations and application to a hardware-in-the-loop servo-mechanism are used to demonstrate the approach, with appreciable performance and robustness gains over alternative methods observed in both.
Author Information
Jack Umenberger (Uppsala University)
Mina Ferizbegovic (KTH Royal Institute of Technology)
Thomas Schön (Uppsala University)
Håkan Hjalmarsson (KTH)
Related Events (a corresponding poster, oral, or spotlight)
-
2019 Poster: Robust exploration in linear quadratic reinforcement learning »
Wed. Dec 11th 01:30 -- 03:30 AM Room East Exhibition Hall B + C #177
More from the Same Authors
-
2021 Poster: Stabilizing Dynamical Systems via Policy Gradient Methods »
Juan Perdomo · Jack Umenberger · Max Simchowitz -
2018 Poster: Learning convex bounds for linear quadratic control policy synthesis »
Jack Umenberger · Thomas Schön -
2018 Spotlight: Learning convex bounds for linear quadratic control policy synthesis »
Jack Umenberger · Thomas Schön -
2017 Poster: Linearly constrained Gaussian processes »
Carl Jidling · Niklas Wahlström · Adrian Wills · Thomas Schön -
2014 Poster: Sequential Monte Carlo for Graphical Models »
Christian Andersson Naesseth · Fredrik Lindsten · Thomas Schön -
2013 Poster: Bayesian Inference and Learning in Gaussian Process State-Space Models with Particle MCMC »
Roger Frigola · Fredrik Lindsten · Thomas Schön · Carl Edward Rasmussen -
2012 Poster: Ancestor Sampling for Particle Gibbs »
Fredrik Lindsten · Michael Jordan · Thomas Schön