This is the public, feature-limited version of the conference webpage. After Registration and login please visit the full version.

Workshop: Deep Reinforcement Learning

Pieter Abbeel, Chelsea Finn, Joelle Pineau, David Silver, Satinder Singh, Coline Devin, Misha Laskin, Kimin Lee, Janarthanan Rajendran, Vivek Veeriah

2020-12-11T08:30:00-08:00 - 2020-12-11T19:00:00-08:00
Abstract: In recent years, the use of deep neural networks as function approximators has enabled researchers to extend reinforcement learning techniques to solve increasingly complex control tasks. The emerging field of deep reinforcement learning has led to remarkable empirical results in rich and varied domains like robotics, strategy games, and multiagent interactions. This workshop will bring together researchers working at the intersection of deep learning and reinforcement learning, and it will help interested researchers outside of the field gain a high-level view about the current state of the art and potential directions for future contributions.

Chat

To ask questions please use rocketchat, available only upon registration and login.

Schedule

2020-12-11T08:29:00-08:00 - 2020-12-11T08:30:00-08:00
Welcome and Introduction
2020-12-11T08:30:00-08:00 - 2020-12-11T09:00:00-08:00
Invited talk: PierreYves Oudeyer "Machines that invent their own problems: Towards open-ended learning of skills"
Pierre-Yves Oudeyer
2020-12-11T09:00:00-08:00 - 2020-12-11T09:15:00-08:00
Contributed Talk: Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Sammy Christen, Lukas Jendele, Emre Aksan, Otmar Hilliges
2020-12-11T09:15:00-08:00 - 2020-12-11T09:30:00-08:00
Contributed Talk: Maximum Reward Formulation In Reinforcement Learning
Sai Krishna Gottipati, Yashaswi Pathak, Rohan Nuttall, Sahir ., Ravi Chunduru, Ahmed Touati, Sriram Ganapathi, Matthew Taylor , Sarath Chandar
2020-12-11T09:30:00-08:00 - 2020-12-11T09:45:00-08:00
Contributed Talk: Accelerating Reinforcement Learning with Learned Skill Priors
Karl Pertsch, Youngwoon Lee, Joseph Lim
2020-12-11T09:45:00-08:00 - 2020-12-11T10:00:00-08:00
Contributed Talk: Asymmetric self-play for automatic goal discovery in robotic manipulation
OpenAI Robotics, Matthias Plappert, Raul Sampedro, Tao Xu , Ilge Akkaya, Vineet Kosaraju, Peter Welinder, Ruben D'Sa, Arthur Petron, Henrique Ponde, Alex Paino, Hyeonwoo Noh  Noh , Lilian Weng, Qiming Yuan, Casey Chu , Wojciech Zaremba
2020-12-11T10:00:00-08:00 - 2020-12-11T10:30:00-08:00
Invited talk: Marc Bellemare "Autonomous navigation of stratospheric balloons using reinforcement learning"
Marc Bellemare
2020-12-11T10:30:00-08:00 - 2020-12-11T11:00:00-08:00
Break
2020-12-11T10:59:00-08:00 - 2020-12-11T11:00:00-08:00
Introduction
2020-12-11T11:00:00-08:00 - 2020-12-11T11:30:00-08:00
Invited talk: Peter Stone "Grounded Simulation Learning for Sim2Real with Connections to Off-Policy Reinforcement Learning"
Peter Stone
2020-12-11T11:30:00-08:00 - 2020-12-11T11:45:00-08:00
Contributed Talk: Mirror Descent Policy Optimization
Manan Tomar, Lior Shani, Yonathan Efroni, Mohammad Ghavamzadeh
2020-12-11T11:45:00-08:00 - 2020-12-11T12:00:00-08:00
Contributed Talk: Planning from Pixels using Inverse Dynamics Models
Keiran Paster, Sheila McIlraith, Jimmy Ba
2020-12-11T12:00:00-08:00 - 2020-12-11T12:30:00-08:00
Invited talk: Matt Botvinick "Alchemy: A Benchmark Task Distribution for Meta-Reinforcement Learning Research"
Matt Botvinick
2020-12-11T12:30:00-08:00 - 2020-12-11T13:30:00-08:00
Poster session 1
2020-12-11T13:29:00-08:00 - 2020-12-11T13:30:00-08:00
Introduction
2020-12-11T13:30:00-08:00 - 2020-12-11T14:00:00-08:00
Invited talk: Susan Murphy "We used RL but…. Did it work?!"
Susan Murphy
2020-12-11T14:00:00-08:00 - 2020-12-11T14:15:00-08:00
Contributed Talk: MaxEnt RL and Robust Control
Benjamin Eysenbach, Sergey Levine
2020-12-11T14:15:00-08:00 - 2020-12-11T14:30:00-08:00
Contributed Talk: Reset-Free Lifelong Learning with Skill-Space Planning
Kevin Lu, Aditya Grover, Pieter Abbeel, Igor Mordatch
2020-12-11T14:30:00-08:00 - 2020-12-11T15:00:00-08:00
Invited talk: Anusha Nagabandi "Model-based Deep Reinforcement Learning for Robotic Systems"
Anusha Nagabandi
2020-12-11T15:00:00-08:00 - 2020-12-11T15:30:00-08:00
Break
2020-12-11T15:29:00-08:00 - 2020-12-11T15:30:00-08:00
Introduction
2020-12-11T15:30:00-08:00 - 2020-12-11T16:00:00-08:00
Invited talk: Ashley Edwards "Learning Offline from Observation"
Ashley Edwards
2020-12-11T16:00:00-08:00 - 2020-12-11T16:07:00-08:00
NeurIPS RL Competitions: Flatland challenge
Sharada Mohanty
2020-12-11T16:07:00-08:00 - 2020-12-11T16:15:00-08:00
NeurIPS RL Competitions: Learning to run a power network
Antoine Marot
2020-12-11T16:15:00-08:00 - 2020-12-11T16:22:00-08:00
NeurIPS RL Competitions: Procgen challenge
Sharada Mohanty
2020-12-11T16:22:00-08:00 - 2020-12-11T16:30:00-08:00
NeurIPS RL Competitions: MineRL
William Guss, Stephanie Milani
2020-12-11T16:30:00-08:00 - 2020-12-11T17:00:00-08:00
Invited talk: Karen Liu "Deep Reinforcement Learning for Physical Human-Robot Interaction"
Karen Liu
2020-12-11T17:00:00-08:00 - 2020-12-11T18:00:00-08:00
Panel discussion
Pierre-Yves Oudeyer, Marc Bellemare, Peter Stone, Matt Botvinick, Susan Murphy, Anusha Nagabandi, Ashley Edwards, Karen Liu, Pieter Abbeel
2020-12-11T18:00:00-08:00 - 2020-12-11T19:00:00-08:00
Poster session 2
Poster: How to make Deep RL work in Practice
Poster: Safety Aware Reinforcement Learning
Poster: Greedy Multi-Step Off-Policy Reinforcement Learning
Poster: Randomized Ensembled Double Q-Learning: Learning Fast Without a Model
Poster: Combating False Negatives in Adversarial Imitation Learning
Poster: Exploring Zero-Shot Emergent Communication in Embodied Multi-Agent Populations
Poster: Interactive Visualization for Debugging RL
Poster: Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization
Poster: D2RL: Deep Dense Architectures in Reinforcement Learning
Poster: Domain Adversarial Reinforcement Learning
Poster: Reinforcement Learning with Bayesian Classifiers: Efficient Skill Learning from Outcome Examples
Poster: Unified View of Inference-based Off-policy RL: Decoupling Algorithmic and Implemental Source of Performance Gaps
Poster: PettingZoo: Gym for Multi-Agent Reinforcement Learning
Poster: Continual Model-Based Reinforcement Learning withHypernetworks
Poster: Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies
Poster: Deep Bayesian Quadrature Policy Gradient
Poster: Accelerating Reinforcement Learning with Learned Skill Priors
Poster: Semantic State Representation for Reinforcement Learning
Poster: Regularized Inverse Reinforcement Learning
Poster: Decoupling Exploration and Exploitation in Meta-Reinforcement Learning without Sacrifices
Poster: Model-Based Reinforcement Learning via Latent-Space Collocation
Poster: Conservative Safety Critics for Exploration
Poster: Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning
Poster: Autoregressive Dynamics Models for Offline Policy Evaluation and Optimization
Poster: Online Safety Assurance for Deep Reinforcement Learning
Poster: FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance
Poster: Dream and Search to Control: Latent Space Planning for Continuous Control
Poster: DREAM: Deep Regret minimization with Advantage baselines and Model-free learning
Poster: Learning to Reach Goals via Iterated Supervised Learning
Poster: Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning
Poster: Modular Training, Integrated Planning Deep Reinforcement Learning for Mobile Robot Navigation
Poster: Preventing Value Function Collapse in Ensemble Q-Learning by Maximizing Representation Diversity
Poster: Diverse Exploration via InfoMax Options
Poster: Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking
Poster: Lyapunov Barrier Policy Optimization
Poster: Explanation Augmented Feedback in Human-in-the-Loop Reinforcement Learning
Poster: FactoredRL: Leveraging Factored Graphs for Deep Reinforcement Learning
Poster: SCC: an efficient deep reinforcement learning agent mastering the game of StarCraft II
Poster: Parrot: Data-driven Behavioral Priors for Reinforcement Learning
Poster: C-Learning: Learning to Achieve Goals via Recursive Classification
Poster: Maximum Mutation Reinforcement Learning for Scalable Control
Poster: A Policy Gradient Method for Task-Agnostic Exploration
Poster: Evolving Reinforcement Learning Algorithms
Poster: Learning Functionally Decomposed Hierarchies for Continuous Control Tasks with Path Planning
Poster: Sample Efficient Training in Multi-Agent AdversarialGames with Limited Teammate Communication
Poster: A Deep Value-based Policy Search Approach for Real-world Vehicle Repositioning on Mobility-on-Demand Platforms
Poster: Parameter-based Value Functions
Poster: Bringing order into Actor-Critic Algorithms usingStackelberg Games
Poster: Skill Transfer via Partially Amortized Hierarchical Planning
Poster: XLVIN: eXecuted Latent Value Iteration Nets
Poster: Latent State Models for Meta-Reinforcement Learning from Images
Poster: Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments
Poster: Curriculum Learning through Distilled Discriminators
Poster: Decoupling Representation Learning from Reinforcement Learning
Poster: Amortized Variational Deep Q Network
Poster: Abstract Value Iteration for Hierarchical Deep Reinforcement Learning
Poster: Average Reward Reinforcement Learning with Monotonic Policy Improvement
Poster: Harnessing Distribution Ratio Estimators for Learning Agents with Quality and Diversity
Poster: Contrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
Poster: Model-Based Visual Planning with Self-Supervised Functional Distances
Poster: Adversarial Environment Generation for Learning to Navigate the Web
Poster: On Effective Parallelization of Monte Carlo Tree Search
Poster: Emergent Road Rules In Multi-Agent Driving Environments
Poster: Motion Planner Augmented Reinforcement Learning for Robot Manipulation in Obstructed Environments
Poster: What Matters for On-Policy Deep Actor-Critic Methods? A Large-Scale Study
Poster: Utilizing Skipped Frames in Action Repeats via Pseudo-Actions
Poster: Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers
Poster: Action and Perception as Divergence Minimization
Poster: PixL2R: Guiding Reinforcement Learning Using Natural Language by Mapping Pixels to Rewards
Poster: Targeted Query-based Action-Space Adversarial Policies on Deep Reinforcement Learning Agents
Poster: Disentangled Planning and Control in Vision Based Robotics via Reward Machines
Poster: Hyperparameter Auto-tuning in Self-Supervised Robotic Learning
Poster: Policy Guided Planning in Learned Latent Space
Poster: Model-Based Meta-Reinforcement Learning for Flight with Suspended Payloads
Poster: Learning Intrinsic Symbolic Rewards in Reinforcement Learning
Poster: GRAC: Self-Guided and Self-Regularized Actor-Critic
Poster: DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies
Poster: Structure and randomness in planning and reinforcement learning
Poster: Trust, but verify: model-based exploration in sparse reward environments
Poster: Learning to Represent Action Values as a Hypergraph on the Action Vertices
Poster: Value Generalization among Policies: Improving Value Function with Policy Representation
Poster: Unsupervised Domain Adaptation for Visual Navigation
Poster: Data-Efficient Reinforcement Learning with Self-Predictive Representations
Poster: Inter-Level Cooperation in Hierarchical Reinforcement Learning
Poster: Safe Reinforcement Learning with Natural Language Constraints
Poster: Multi-Agent Option Critic Architecture
Poster: Chaining Behaviors from Data with Model-Free Reinforcement Learning
Poster: An Examination of Preference-based Reinforcement Learning for Treatment Recommendation
Poster: Addressing reward bias in Adversarial Imitation Learning with neutral reward functions
Poster: Unsupervised Task Clustering for Multi-Task Reinforcement Learning
Poster: Policy Learning Using Weak Supervision
Poster: Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning
Poster: Efficient Competitive Self-Play Policy Optimization
Poster: Beyond Exponentially Discounted Sum: Automatic Learning of Return Function
Poster: Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning
Poster: Variational Empowerment as Representation Learning for Goal-Based Reinforcement Learning
Poster: Influence-aware Memory for Deep Reinforcement Learning in POMDPs
Poster: Measuring Visual Generalization in Continuous Control from Pixels
Poster: R-LAtte: Visual Control via Deep Reinforcement Learning with Attention Network
Poster: OPAL: Offline Primitive Discovery for Accelerating Offline Reinforcement Learning
Poster: Revisiting Rainbow: Promoting more insightful and inclusive deep reinforcement learning research
Poster: XT2: Training an X-to-Text Typing Interface with Online Learning from Implicit Feedback
Poster: Self-Supervised Policy Adaptation during Deployment
Poster: Model-based Navigation in Environments with Novel Layouts Using Abstract $n$-D Maps
Poster: Provably Efficient Policy Optimization via Thompson Sampling
Poster: Discovery of Options via Meta-Gradients
Poster: Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates
Poster: Causal Curiosity: RL Agents Discovering Self-supervised Experiments for Causal Representation Learning
Poster: Energy-based Surprise Minimization for Multi-Agent Value Factorization
Poster: Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms
Poster: An Algorithmic Causal Model of Credit Assignment in Reinforcement Learning
Poster: Pairwise Weights for Temporal Credit Assignment
Poster: Multi-task Reinforcement Learning with a Planning Quasi-Metric
Poster: Reinforcement Learning with Latent Flow
Poster: Which Mutual-Information Representation Learning Objectives are Sufficient for Control?
Poster: Successor Landmarks for Efficient Exploration and Long-Horizon Navigation
Poster: Backtesting Optimal Trade Execution Policies in Agent-Based Market Simulator
Poster: Deep Q-Learning with Low Switching Cost
Poster: Learning Markov State Abstractions for Deep Reinforcement Learning
Poster: Robust Domain Randomised Reinforcement Learning through Peer-to-Peer Distillation
Poster: A Variational Inference Perspective on Goal-Directed Behavior in Reinforcement Learning
Poster: BeBold: Exploration Beyond the Boundary of Explored Regions
Poster: Visual Imitation with Reinforcement Learning using Recurrent Siamese Networks
Poster: Maximum Reward Formulation In Reinforcement Learning
Poster: Planning from Pixels using Inverse Dynamics Models
Poster: Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Poster: Super-Human Performance in Gran Turismo Sport Using Deep Reinforcement Learning
Poster: Asymmetric self-play for automatic goal discovery in robotic manipulation
Poster: DERAIL: Diagnostic Environments for Reward And Imitation Learning
Poster: MaxEnt RL and Robust Control
Poster: Solving Compositional Reinforcement Learning Problems via Task Reduction
Poster: Unlocking the Potential of Deep Counterfactual Value Networks
Poster: Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets
Poster: Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning
Poster: Evaluating Agents Without Rewards
Poster: ReaPER: Improving Sample Efficiency in Model-Based Latent Imagination
Poster: Compute- and Memory-Efficient Reinforcement Learning with Latent Experience Replay
Poster: Optimizing Traffic Bottleneck Throughput using Cooperative, Decentralized Autonomous Vehicles
Poster: TACTO: A Simulator for Learning Control from Touch Sensing
Poster: Mastering Atari with Discrete World Models
Poster: Learning to Sample with Local and Global Contexts in Experience Replay Buffer
Poster: C-Learning: Horizon-Aware Cumulative Accessibility Estimation
Poster: AWAC: Accelerating Online Reinforcement Learning With Offline Datasets
Poster: Understanding Learned Reward Functions
Poster: Correcting Momentum in Temporal Difference Learning
Poster: Goal-Conditioned Reinforcement Learning in the Presence of an Adversary
Poster: Reset-Free Lifelong Learning with Skill-Space Planning
Poster: Model-Based Reinforcement Learning: A Compressed Survey
Poster: Mirror Descent Policy Optimization
Poster: Learning Latent Landmarks for Generalizable Planning
Poster: Reusability and Transferability of Macro Actions for Reinforcement Learning
Poster: Quantifying Differences in Reward Functions
Poster: Learning to Weight Imperfect Demonstrations