Timezone: »
For many years, the reinforcement learning community primarily focused on sequential decision making in fully observable but unknown domains while the planning under uncertainty community focused on known but partially observable domains. Since most problems are both partially observable and (at least partially) unknown, recent years have seen a surge of interest in combining the related, but often different, algorithmic machineries developed in the two communities. The time thus seems ripe for a symposium that brings these two communities together and reviews recent advances in this convergence.
A reinforcement learning agent for a partially observable environment is often broken into two parts: 1) the inference of an environment model from data; and 2) the solution of the associated control/planning problem. There has been significant progress on both these fronts in recent years. Both linear and non-linear models of various forms can now be learned from history data. Modern POMDP solvers can also now handle some models with millions of states. This symposium brings together five active researchers in PORL research to present some state-of-the-art developments.
Author Information
Marcus Hutter (Australian National University)
Will Uther (NICTA Neville Road Research Lab)
Pascal Poupart (University of Waterloo)
More from the Same Authors
-
2012 Poster: Cost-Sensitive Exploration in Bayesian Reinforcement Learning »
Dongho Kim · Kee-Eung Kim · Pascal Poupart -
2012 Poster: Symbolic Dynamic Programming for Continuous State and Observation POMDPs »
Zahra Zamani · Scott Sanner · Pascal Poupart · Kristian Kersting -
2011 Poster: Automated Refinement of Bayes Networks' Parameters based on Test Ordering Constraints »
Omar Z Khan · Pascal Poupart · John Agosta -
2010 Workshop: Machine Learning for Assistive Technologies »
Jesse Hoey · Pascal Poupart · Thomas Ploetz -
2010 Session: Spotlights Session 8 »
Pascal Poupart -
2010 Session: Oral Session 9 »
Pascal Poupart -
2008 Workshop: Model Uncertainty and Risk in Reinforcement Learning »
Yaakov Engel · Mohammad Ghavamzadeh · Shie Mannor · Pascal Poupart -
2007 Poster: Temporal Difference with Eligibility Traces Derived from First Principles »
Marcus Hutter · Shane Legg -
2006 Poster: Automated Hierarchy Discovery for Planning in Partially Observable Domains »
Laurent Charlin · Pascal Poupart · Romy Shioda