Infer to Control: Probabilistic Reinforcement Learning and Structured Control

Workshop

Infer to Control: Probabilistic Reinforcement Learning and Structured Control

Leslie Kaelbling · Martin Riedmiller · Marc Toussaint · Igor Mordatch · Roy Fox · Tuomas Haarnoja

Sat 8 Dec, 5 a.m. PST

[ Abstract ] Workshop Website

Reinforcement learning and imitation learning are effective paradigms for learning controllers of dynamical systems from experience. These fields have been empowered by recent success in deep learning of differentiable parametric models, allowing end-to-end training of highly nonlinear controllers that encompass perception, memory, prediction, and decision making. The aptitude of these models to represent latent dynamics, high-level goals, and long-term outcomes is unfortunately curbed by the poor sample complexity of many current algorithms for learning these models from experience.

Probabilistic reinforcement learning and inference of control structure are emerging as promising approaches for avoiding prohibitive amounts of controller–system interactions. These methods leverage informative priors on useful behavior, as well as controller structure such as hierarchy and modularity, as useful inductive biases that reduce the effective size of policy search space and shape the optimization landscape. Intrinsic and self-supervised signals can further guide the training process of distinct internal components — such as perceptual embeddings, predictive models, exploration policies, and inter-agent communication — to break down the hard holistic problem of control into more efficiently learnable parts.

Effective inference methods are crucial for probabilistic approaches to reinforcement learning and structured control. Approximate control and model-free reinforcement learning exploit latent system structure and priors on policy structure, that are not directly evident in the controller–system interactions, and must be inferred by the learning algorithm. The growing interest of the reinforcement learning and optimal control community in the application of inference methods is synchronized well with the development by the probabilistic learning community of powerful inference techniques, such as probabilistic programming, variational inference, Gaussian processes, and nonparametric regression.

This workshop is a venue for the inference and reinforcement learning communities to come together in discussing recent advances, developing insights, and future potential in inference methods and their application to probabilistic reinforcement learning and structured control. The goal of this workshop is to catalyze tighter collaboration within and between the communities, that will be leveraged in upcoming years to rise to the challenges of real-world control problems.

=== Intel AI is proud to sponsor Infer2Control @ NeurIPS 2018 ===
Early detection of tumors. Predicting equipment failures before they happen. Having a natural conversation with your home or car. Making retail more personal than ever. This is Artificial Intelligence powered by Intel, and companies around the globe are using it to make money, save money, and advance the future of their industry. At Intel, we’re using decades of expertise in silicon, software, communications, memory and storage to create the new technologies that AI demands. Technologies that break barriers between data center and edge, server and network, training and inference, model and reality – maximizing the economics of AI to take data from theory to real-world success. Learn more: ai.intel.com

Chat is not available.

Timezone: America/Los_Angeles

Schedule

Sat 5:20 a.m. - 5:30 a.m.	Opening Remarks ( Introduction ) >	Roy Fox 🔗
Sat 5:30 a.m. - 6:00 a.m.	Control as Inference and Soft Deep RL (Sergey Levine) ( Invited Talk ) >	Sergey Levine 🔗
Sat 6:00 a.m. - 6:10 a.m.	Unsupervised Learning of Image Embedding for Continuous Control (Carlos Florensa) ( Contributed Talk ) >	Carlos Florensa 🔗
Sat 6:10 a.m. - 6:20 a.m.	Variational Inference Techniques for Sequential Decision Making in Generative Models (Igor Kiselev) ( Contributed Talk ) >	Igor Kiselev 🔗
Sat 6:20 a.m. - 6:30 a.m.	Probabilistic Planning with Sequential Monte Carlo (Alexandre Piché) ( Contributed Talk ) >	Alexandre Piche 🔗
Sat 6:30 a.m. - 7:00 a.m.	Inference and control of rules in human hierarchical reinforcement learning (Anne Collins) ( Invited Talk ) >	Anne Collins 🔗
Sat 7:00 a.m. - 7:30 a.m.	Hierarchical RL: From Prior Knowledge to Policies (Shie Mannor) ( Invited Talk ) >	Shie Mannor 🔗
Sat 7:30 a.m. - 8:00 a.m.	-- Coffee Break 1 --	🔗
Sat 8:00 a.m. - 8:30 a.m.	Off-policy Policy Optimization (Dale Schuurmans) ( Invited Talk ) >	Dale Schuurmans 🔗
Sat 8:30 a.m. - 8:45 a.m.	Spotlights 1 ( Spotlights ) >	11 presenters Ming-Xu Huang · Hao(Jackson) Cui · Arash Mehrjou · Yaqi Duan · Sharad Vikram · Angelina Wang · Karan Goel · Jonathan Hunt · Zhengwei Wu · Dinghan Shen · Mattie Fellows 🔗
Sat 8:45 a.m. - 9:15 a.m.	Poster Session 1 ( Poster Session ) >	35 presenters Kyle H Ambert · Brandon Araki · Xiya Cao · Sungjoon Choi · Hao(Jackson) Cui · Jonas Degrave · Yaqi Duan · Mattie Fellows · Carlos Florensa · Karan Goel · Aditya Gopalan · Ming-Xu Huang · Jonathan Hunt · Cyril Ibrahim · Brian Ichter · Maximilian Igl · Zheng Tracy Ke · Igor Kiselev · Anuj Mahajan · Arash Mehrjou · Karl Pertsch · Alexandre Piche · Nicholas Rhinehart · Thomas Ringstrom · Reazul Hasan Russel · Oleh Rybkin · Ion Stoica · Sharad Vikram · Angelina Wang · Ting-Han Wei · Abigail H Wen · I-Chen Wu · Zhengwei Wu · Linhai Xie · Dinghan Shen 🔗
Sat 9:15 a.m. - 10:45 a.m.	-- Lunch Break --	🔗
Sat 10:45 a.m. - 11:15 a.m.	Solving inference and control problems with the same machinery (Emo Todorov) ( Invited Talk ) >	Emo Todorov 🔗
Sat 11:15 a.m. - 11:30 a.m.	Spotlights 2 ( Spotlights ) >	Aditya Gopalan · Sungjoon Choi · Thomas Ringstrom · Roy Fox · Jonas Degrave · Xiya Cao · Karl Pertsch · Maximilian Igl · Brian Ichter 🔗
Sat 11:30 a.m. - 12:00 p.m.	Inference and Control of Learning Behavior in Rodents (Ryan Adams) ( Invited Talk ) >	Ryan Adams 🔗
Sat 12:00 p.m. - 12:30 p.m.	-- Coffee Break 2 --	🔗
Sat 12:30 p.m. - 1:00 p.m.	On the Value of Knowing What You Don't Know: Learning to Sample and Sampling to Learn for Robot Planning (Leslie Kaelbling) ( Invited Talk ) >	Leslie Kaelbling 🔗
Sat 1:00 p.m. - 1:10 p.m.	Learning to Plan with Logical Automata (Brandon Araki) ( Contributed Talk ) >	Brandon Araki 🔗
Sat 1:10 p.m. - 1:20 p.m.	Tight Bayesian Ambiguity Sets for Robust MDPs (Reazul Hasan Russel) ( Contributed Talk ) >	Reazul Hasan Russel 🔗
Sat 1:20 p.m. - 1:30 p.m.	Deep Imitative Models for Flexible Inference, Planning, and Control (Nicholas Rhinehart) ( Contributed Talk ) >	Nicholas Rhinehart 🔗
Sat 1:30 p.m. - 2:00 p.m.	Probabilistic Reasoning for Reinforcement Learning (Nicolas Heess) ( Invited Talk ) >	Nicolas Heess 🔗
Sat 2:00 p.m. - 3:00 p.m.	Discussion Panel: Ryan Adams, Nicolas Heess, Leslie Kaelbling, Shie Mannor, Emo Todorov (moderator: Roy Fox) ( Discussion Panel ) >	Ryan Adams · Nicolas Heess · Leslie Kaelbling · Shie Mannor · Emo Todorov · Roy Fox 🔗
Sat 3:00 p.m. - 3:30 p.m.	Poster Session 2 ( Poster Session ) >	🔗