Timezone: »

Nonlinear Inverse Reinforcement Learning with Gaussian Processes
Sergey Levine · Zoran Popovic · Vladlen Koltun

Mon Dec 12 10:00 AM -- 02:59 PM (PST) @

We present a probabilistic algorithm for nonlinear inverse reinforcement learning. The goal of inverse reinforcement learning is to learn the reward function in a Markov decision process from expert demonstrations. While most prior inverse reinforcement learning algorithms represent the reward as a linear combination of a set of features, we use Gaussian processes to learn the reward as a nonlinear function, while also determining the relevance of each feature to the expert's policy. Our probabilistic algorithm allows complex behaviors to be captured from suboptimal stochastic demonstrations, while automatically balancing the simplicity of the learned reward structure against its consistency with the observed actions.

Author Information

Sergey Levine (Stanford University)
Zoran Popovic (University of Washington)
Vladlen Koltun (Adobe Research)

More from the Same Authors