Deep Reinforcement Learning

Mon 8:55 a.m. - 9:00 a.m.

Welcome and Introduction ( Welcoming Notes ) >
SlidesLive Video

🔗

Mon 9:00 a.m. - 9:12 a.m.

Implicit Behavioral Cloning ( Oral ) > link
SlidesLive Video

Pete Florence · Corey Lynch · Andy Zeng · Oscar Ramirez · Ayzaan Wahid · Laura Downs · Adrian Wong · Igor Mordatch · Jonathan Tompson 🔗

Mon 9:12 a.m. - 9:15 a.m.

Implicit Behavioral Cloning Q&A ( Q&A ) > link

Link

Pete Florence · Corey Lynch · Andy Zeng · Oscar Ramirez · Ayzaan Wahid · Laura Downs · Adrian Wong · Igor Mordatch · Jonathan Tompson 🔗

Mon 9:15 a.m. - 9:27 a.m.

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization ( Oral ) > link
SlidesLive Video

Link

Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine 🔗

Mon 9:27 a.m. - 9:30 a.m.

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization Q&A ( Q&A ) > link

Link

Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine 🔗

Mon 9:30 a.m. - 9:42 a.m.

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation ( Oral ) > link
SlidesLive Video

Link

Boyan Li · Hongyao Tang · YAN ZHENG · Jianye Hao · Pengyi Li · Zhaopeng Meng · LI Wang 🔗

Mon 9:42 a.m. - 9:45 a.m.

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation Q&A ( Q&A ) > link

Link

Boyan Li · Hongyao Tang · YAN ZHENG · Jianye Hao · Pengyi Li · Zhaopeng Meng · LI Wang 🔗

Mon 9:45 a.m. - 9:57 a.m.

Benchmarking the Spectrum of Agent Capabilities ( Oral ) > link
SlidesLive Video

Link

Danijar Hafner 🔗

Mon 9:57 a.m. - 10:00 a.m.

Benchmarking the Spectrum of Agent Capabilities Q&A ( Q&A ) > link

Link

Danijar Hafner 🔗

Mon 10:00 a.m. - 10:25 a.m.

Invited Talk: Laura Schulz - In praise of folly: Goals, play, and human cognition ( Talk ) >
SlidesLive Video

Laura Schulz 🔗

Mon 10:25 a.m. - 10:30 a.m.

Laura Schulz Talk Q&A ( Q&A ) >

Laura Schulz 🔗

Mon 10:30 a.m. - 11:00 a.m.

Break

🔗

Mon 11:00 a.m. - 11:25 a.m.

Opinion Contributed Talk: Wilka Carvalho ( Talk ) >
SlidesLive Video

Wilka Carvalho Carvalho 🔗

Mon 11:25 a.m. - 11:30 a.m.

Wilka Carvalho Talk Q&A ( Q&A ) >

Wilka Carvalho Carvalho 🔗

Mon 11:30 a.m. - 11:42 a.m.

Adaptive Scheduling of Data Augmentation for Deep Reinforcement Learning ( Oral ) > link
SlidesLive Video

Link

Byungchan Ko · Jungseul Ok 🔗

Mon 11:42 a.m. - 11:45 a.m.

Adaptive Scheduling of Data Augmentation for Deep Reinforcement Learning Q&A ( Oral ) > link

Link

Byungchan Ko · Jungseul Ok 🔗

Mon 11:45 a.m. - 11:57 a.m.

Offline Meta-Reinforcement Learning with Online Self-Supervision ( Oral ) > link
SlidesLive Video

Link

Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine 🔗

Mon 11:57 a.m. - 12:00 p.m.

Offline Meta-Reinforcement Learning with Online Self-Supervision Q&A ( Q&A ) > link

Link

Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine 🔗

Mon 12:00 p.m. - 12:25 p.m.

Invited Talk: George Konidaris - Signal to Symbol (via Skills) ( Talk ) >
SlidesLive Video

George Konidaris 🔗

Mon 12:25 p.m. - 12:30 p.m.

George Konidaris Talk Q&A ( Q&A ) >

George Konidaris 🔗

Mon 12:30 p.m. - 1:30 p.m.

Poster Session (in Gather Town) ( Poster Session ) >

🔗

Mon 1:30 p.m. - 1:55 p.m.

Opinion Contributed Talk: Sergey Levine ( Talk ) >
SlidesLive Video

Sergey Levine 🔗

Mon 1:55 p.m. - 2:00 p.m.

Sergey Levine Talk Q&A ( Q&A ) >

Sergey Levine 🔗

Mon 2:00 p.m. - 2:30 p.m.

Panel Discussion 1 ( Panel Discussion ) >
SlidesLive Video

🔗

Mon 2:30 p.m. - 2:55 p.m.

Invited Talk: Dale Schuurmans - Understanding Deep Value Estimation ( Talk ) >
SlidesLive Video

Dale Schuurmans 🔗

Mon 2:55 p.m. - 3:00 p.m.

Dale Schuurmans Talk Q&A ( Q&A ) >

Dale Schuurmans 🔗

Mon 3:00 p.m. - 3:30 p.m.

Break

🔗

Mon 3:30 p.m. - 3:57 p.m.

Invited Talk: Karol Hausman - Reinforcement Learning as a Data Sponge ( Talk ) >
SlidesLive Video

Karol Hausman 🔗

Mon 3:55 p.m. - 4:00 p.m.

Karol Hausman Talk Q&A ( Q&A ) >

Karol Hausman 🔗

Mon 4:00 p.m. - 4:30 p.m.

NeurIPS RL Competitions Results Presentations ( Presentations ) >
SlidesLive Video

Rohin Shah · Liam Paull · Tabitha Lee · Tim Rocktäschel · Heinrich Küttler · Sharada Mohanty · Manuel Wuethrich 🔗

Mon 4:30 p.m. - 4:55 p.m.

Invited Talk: Kenji Doya - Natural and Artificial Reinforcement Learning ( Talk ) >
SlidesLive Video

Kenji Doya 🔗

Mon 4:55 p.m. - 5:00 p.m.

Kenji Doya Talk Q&A ( Q&A ) >

Kenji Doya 🔗

Mon 5:00 p.m. - 6:00 p.m.

Panel Discussion 2 ( Panel Discussion ) >
SlidesLive Video

🔗

-

Self-Imitation Learning from Demonstrations ( Poster ) > link
SlidesLive Video

Link

Georgiy Pshikhachev · Dmitry Ivanov · Vladimir Egorov · Aleksei Shpilman 🔗

-

Understanding and Preventing Capacity Loss in Reinforcement Learning ( Poster ) > link

Link

Clare Lyle · Mark Rowland · Will Dabney 🔗

-

Variance-Seeking Meta-Exploration to Handle Out-of-Distribution Tasks ( Poster ) > link

Link

Yashvir Singh Grewal · Sarah Goodwin 🔗

-

A Closer Look at Gradient Estimators with Reinforcement Learning as Inference ( Poster ) > link
SlidesLive Video

Link

Jonathan Lavington · Michael Teng · Mark Schmidt · Frank Wood 🔗

-

From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation ( Poster ) > link
SlidesLive Video

Link

Yuzhe Qin · Hao Su · Xiaolong Wang 🔗

-

Attention-based Partial Decoupling of Policy and Value for Generalization in Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Nasik Muhammad Nafi · Creighton Glasscock · William Hsu 🔗

-

Imitation Learning from Observations under Transition Model Disparity ( Poster ) > link

Link

Tanmay Gangwani · Yuan Zhou · Jian Peng 🔗

-

Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization ( Poster ) > link

Link

Minghao Zhang · Ruihan Yang · Yuzhe Qin · Xiaolong Wang 🔗

-

Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling ( Poster ) > link
SlidesLive Video

Link

Jesús Bujalance Martín · Raphael Chekroun · Fabien Moutarde 🔗

-

Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification ( Poster ) > link
SlidesLive Video

Link

Ling Pan · Longbo Huang · Tengyu Ma · Huazhe Xu 🔗

-

Generalisation in Lifelong Reinforcement Learning through Logical Composition ( Poster ) > link
SlidesLive Video

Link

Geraud Nangue Tasse · Steven James · Benjamin Rosman 🔗

-

DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations ( Poster ) > link
SlidesLive Video

Link

Fei Deng · Ingook Jang · Sungjin Ahn 🔗

-

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates ( Poster ) > link
SlidesLive Video

Link

Litian Liang · Yaosheng Xu · Stephen McAleer · Dailin Hu · Alexander Ihler · Pieter Abbeel · Roy Fox 🔗

-

Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method ( Poster ) > link

Link

Duo XU 🔗

-

Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers ( Poster ) > link
SlidesLive Video

Link

Ruihan Yang · Minghao Zhang · Nicklas Hansen · Huazhe Xu · Xiaolong Wang 🔗

-

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation ( Poster ) > link

Link

Rishabh Jangir · Nicklas Hansen · Xiaolong Wang 🔗

-

Learning Value Functions from Undirected State-only Experience ( Poster ) > link
SlidesLive Video

Link

Matthew Chang · Arjun Gupta · Saurabh Gupta 🔗

-

Target Entropy Annealing for Discrete Soft Actor-Critic ( Poster ) > link
SlidesLive Video

Link

Yaosheng Xu · Dailin Hu · Litian Liang · Stephen McAleer · Pieter Abbeel · Roy Fox 🔗

-

Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks ( Poster ) > link
SlidesLive Video

Link

Yijie Guo · Qiucheng Wu · Honglak Lee 🔗

-

Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals ( Poster ) > link

Link

Ozsel Kilinc · Giovanni Montana 🔗

-

The Reflective Explorer: Online Meta-Exploration from Offline Data in Realistic Robotic Tasks ( Poster ) > link
SlidesLive Video

Link

Rafael Rafailov · · Tianhe Yu · Avi Singh · Mariano Phielipp · Chelsea Finn 🔗

-

BLAST: Latent Dynamics Models from Bootstrapping ( Poster ) > link
SlidesLive Video

Link

Keiran Paster · Lev McKinney · Sheila McIlraith · Jimmy Ba 🔗

-

Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning ( Poster ) > link
SlidesLive Video

Link

Dhruv Shah · Ted Xiao · Alexander Toshev · Sergey Levine · brian ichter 🔗

-

Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Dailin Hu · Pieter Abbeel · Roy Fox 🔗

-

Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning ( Poster ) > link

Link

Tianhe Yu · Aviral Kumar · Yevgen Chebotar · Chelsea Finn · Sergey Levine · Karol Hausman 🔗

-

StarCraft II Unplugged: Large Scale Offline Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

20 presenters

Michael Mathieu · Sherjil Ozair · Srivatsan Srinivasan · Caglar Gulcehre · Shangtong Zhang · Ray Jiang · Tom Paine · Konrad Żołna · Julian Schrittwieser · David Choi · Petko I Georgiev · Daniel Toyama · Roman Ring · Igor Babuschkin · Timo Ewalds · · Aaron van den Oord · Wojciech Czarnecki · Nando de Freitas · Oriol Vinyals

🔗

-

Learning Robust Dynamics through Variational Sparse Gating ( Poster ) > link
SlidesLive Video

Link

Arnav Kumar Jain · Shivakanth Sujit · Shruti Joshi · Vincent Michalski · Danijar Hafner · Samira Ebrahimi Kahou 🔗

-

Should I Run Offline Reinforcement Learning or Behavioral Cloning? ( Poster ) > link

Link

Aviral Kumar · Joey Hong · Anikait Singh · Sergey Levine 🔗

-

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization ( Poster ) > link

Link

Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine 🔗

-

Deep RePReL--Combining Planning and Deep RL for acting in relational domains ( Poster ) > link
SlidesLive Video

Link

Harsha Kokel · Arjun Manoharan · Sriraam Natarajan · Balaraman Ravindran · Prasad Tadepalli 🔗

-

Fast Inference and Transfer of Compositional Task for Few-shot Task Generalization ( Poster ) > link
SlidesLive Video

Link

Sungryull Sohn · Hyunjae Woo · Jongwook Choi · Izzeddin Gur · Aleksandra Faust · Honglak Lee 🔗

-

Benchmark for Out-of-Distribution Detection in Deep Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Aaqib Parvez Mohammed · Matias Valdenegro-Toro 🔗

-

Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning ( Poster ) > link
SlidesLive Video

Link

Trevor Ablett · Bryan Chan · Jonathan Kelly 🔗

-

Off-Policy Correction For Multi-Agent Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Michał Zawalski · Błażej Osiński · Henryk Michalewski · Piotr Miłoś 🔗

-

Bayesian Exploration for Lifelong Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Haotian Fu · Shangqun Yu · Michael Littman · George Konidaris 🔗

-

A Modern Self-Referential Weight Matrix That Learns to Modify Itself ( Poster ) > link
SlidesLive Video

Link

Kazuki Irie · Imanol Schlag · Róbert Csordás · Jürgen Schmidhuber 🔗

-

Distributional Decision Transformer for Offline Hindsight Information Matching ( Poster ) > link
SlidesLive Video

Link

Hiroki Furuta · Yutaka Matsuo · Shixiang (Shane) Gu 🔗

-

Offline Policy Selection under Uncertainty ( Poster ) > link

Link

Mengjiao (Sherry) Yang · Bo Dai · Ofir Nachum · George Tucker · Dale Schuurmans 🔗

-

Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies ( Poster ) > link
SlidesLive Video

Link

11 presenters

Dushyant Rao · Fereshteh Sadeghi · Leonard Hasenclever · Markus Wulfmeier · Martina Zambelli · Giulia Vezzani · Dhruva Tirumala · Yusuf Aytar · Josh Merel · Nicolas Heess · Raia Hadsell

🔗

-

Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation ( Poster ) > link
SlidesLive Video

Link

Todor Davchev · Oleg Sushkov · Jean-Baptiste Regli · Stefan Schaal · Yusuf Aytar · Markus Wulfmeier · Jonathan Scholz 🔗

-

Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization ( Poster ) > link
SlidesLive Video

Link

Zihan Zhou · Wei Fu · Bingliang Zhang · Yi Wu 🔗

-

Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks ( Poster ) > link
SlidesLive Video

Link

Ryan Sander · Wilko Schwarting · Tim Seyde · Igor Gilitschenski · Sertac Karaman · Daniela Rus 🔗

-

Cross-Domain Imitation Learning via Optimal Transport ( Poster ) > link
SlidesLive Video

Link

Arnaud Fickinger · Samuel Cohen · Stuart Russell · Brandon Amos 🔗

-

Lifting the veil on hyper-parameters for value-baseddeep reinforcement learning ( Poster ) > link
SlidesLive Video

Link

João Madeira Araújo · Johan Obando Ceron · Pablo Samuel Castro 🔗

-

Reward Uncertainty for Exploration in Preference-based Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Xinran Liang · Katherine Shu · Kimin Lee · Pieter Abbeel 🔗

-

TransDreamer: Reinforcement Learning with Transformer World Models ( Poster ) > link
SlidesLive Video

Link

· Jaesik Yoon · Yi-Fu Wu · Sungjin Ahn 🔗

-

Learning Parameterized Task Structure for Generalization to Unseen Entities ( Poster ) > link
SlidesLive Video

Link

Anthony Liu · Sungryull Sohn · Honglak Lee 🔗

-

The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models ( Poster ) > link
SlidesLive Video

Link

Alexander Pan · Kush Bhatia · Jacob Steinhardt 🔗

-

Learning a Subspace of Policies for Online Adaptation in Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Jean-Baptiste Gaya · Laure Soulier · Ludovic Denoyer 🔗

-

Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Nicolai Dorka · Joschka Boedecker · Wolfram Burgard 🔗

-

Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations ( Poster ) > link
SlidesLive Video

Link

Haoran Xu · Xianyuan Zhan · Honglei Yin · 🔗

-

Task-driven Discovery of Perceptual Schemas for Generalization in Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Wilka Carvalho · Andrew Lampinen · Kyriacos Nikiforou · Felix Hill · Murray Shanahan 🔗

-

Meta Arcade: A Configurable Environment Suite for Deep Reinforcement Learning and Meta-Learning ( Poster ) > link
SlidesLive Video

Link

Edward Staley · Jared Markowitz · Kapil Katyal 🔗

-

Hindsight Foresight Relabeling for Meta-Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Michael Wan · Jian Peng · Tanmay Gangwani 🔗

-

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ( Poster ) > link
SlidesLive Video

Link

Misha Laskin · Hao Liu · Xue Bin Peng · Denis Yarats · Aravind Rajeswaran · Pieter Abbeel 🔗

-

Continuous Control With Ensemble Deep Deterministic Policy Gradients ( Poster ) > link
SlidesLive Video

Link

Piotr Januszewski · Mateusz Olko · Michał Królikowski · Jakub Swiatkowski · Marcin Andrychowicz · Łukasz Kuciński · Piotr Miłoś 🔗

-

What Would the Expert $do(\cdot)$ ?: Causal Imitation Learning ( Poster ) > link
SlidesLive Video

Link

Gokul Swamy · Sanjiban Choudhury · James Bagnell · Steven Wu 🔗

-

Grounding Aleatoric Uncertainty in Unsupervised Environment Design ( Poster ) > link
SlidesLive Video

Link

Minqi Jiang · Michael Dennis · Jack Parker-Holder · Andrei Lupu · Heinrich Kuttler · Edward Grefenstette · Tim Rocktäschel · Jakob Foerster 🔗

-

SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Jongjin Park · Younggyo Seo · Jinwoo Shin · Honglak Lee · Pieter Abbeel · Kimin Lee 🔗

-

Task-Induced Representation Learning ( Poster ) > link
SlidesLive Video

Link

Jun Yamada · Karl Pertsch · Anisha Gunjal · Joseph Lim 🔗

-

OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Jinyi Liu · Zhi Wang · YAN ZHENG · Jianye Hao · Junjie Ye · Chenjia Bai · Pengyi Li 🔗

-

GrASP: Gradient-Based Affordance Selection for Planning ( Poster ) > link

Link

Vivek Veeriah · Zeyu Zheng · Richard L Lewis · Satinder Singh 🔗

-

Beyond Target Networks: Improving Deep $Q$ -learning with Functional Regularization ( Poster ) > link
SlidesLive Video

Link

Alexandre Piche · Joseph Marino · Gian Maria Marconi · Valentin Thomas · Chris Pal · Mohammad Emtiyaz Khan 🔗

-

No DICE: An Investigation of the Bias-Variance Tradeoff in Meta-Gradients ( Poster ) > link
SlidesLive Video

Link

Risto Vuorio · Jacob Beck · Greg Farquhar · Jakob Foerster · Shimon Whiteson 🔗

-

Block Contextual MDPs for Continual Learning ( Poster ) > link
SlidesLive Video

Link

Shagun Sodhani · Franziska Meier · Joelle Pineau · Amy Zhang 🔗

-

PFPN: Continuous Control of Physically Simulated Characters using Particle Filtering Policy Network ( Poster ) > link
SlidesLive Video

Link

Pei Xu · Ioannis Karamouzas 🔗

-

Recurrent Off-policy Baselines for Memory-based Continuous Control ( Poster ) > link
SlidesLive Video

Link

Zhihan Yang · Nguyen Hai 🔗

-

A Framework for Efficient Robotic Manipulation ( Poster ) > link
SlidesLive Video

Link

Albert Zhan · Ruihan Zhao · Lerrel Pinto · Pieter Abbeel · Misha Laskin 🔗

-

Transfer RL across Observation Feature Spaces via Model-Based Regularization ( Poster ) > link
SlidesLive Video

Link

Yanchao Sun · Ruijie Zheng · Xiyao Wang · Andrew Cohen · Furong Huang 🔗

-

Embodiment perspective of reward definition for behavioural homeostasis ( Poster ) > link
SlidesLive Video

Link

Naoto Yoshida · Yasuo Kuniyoshi 🔗

-

Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games ( Poster ) > link
SlidesLive Video

Link

Dingyang Chen · Yile Li · Qi Zhang 🔗

-

URLB: Unsupervised Reinforcement Learning Benchmark ( Poster ) > link
SlidesLive Video

Link

Misha Laskin · Denis Yarats · Hao Liu · Kimin Lee · Albert Zhan · Kevin Lu · Catherine Cang · Lerrel Pinto · Pieter Abbeel 🔗

-

Offline Reinforcement Learning with In-sample Q-Learning ( Poster ) > link

Link

Ilya Kostrikov · Ashvin Nair · Sergey Levine 🔗

-

Wasserstein Distance Maximizing Intrinsic Control ( Poster ) > link
SlidesLive Video

Link

Ishan Durugkar · Steven Hansen · Stephen Spencer · Volodymyr Mnih · Ishan Durugkar 🔗

-

Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks ( Poster ) > link
SlidesLive Video

Link

Soroush Nasiriany · Huihan Liu · Yuke Zhu 🔗

-

Strength Through Diversity: Robust Behavior Learning via Mixture Policies ( Poster ) > link
SlidesLive Video

Link

Tim Seyde · Wilko Schwarting · Igor Gilitschenski · Markus Wulfmeier · Daniela Rus 🔗

-

Long-Term Credit Assignment via Model-based Temporal Shortcuts ( Poster ) > link
SlidesLive Video

Link

Michel Ma · Pierluca D'Oro · Yoshua Bengio · Pierre-Luc Bacon 🔗

-

C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks ( Poster ) > link
SlidesLive Video

Link

Tianjun Zhang · Ben Eysenbach · Russ Salakhutdinov · Sergey Levine · Joseph Gonzalez 🔗

-

General Characterization of Agents by States they Visit ( Poster ) > link
SlidesLive Video

Link

Anssi Kanervisto · Ville Hautamäki 🔗

-

TARGETED ENVIRONMENT DESIGN FROM OFFLINE DATA ( Poster ) > link
SlidesLive Video

Link

Izzeddin Gur · Ofir Nachum · Aleksandra Faust 🔗

-

GPU-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Xiao-Yang Liu · Zhuoran Yang · Zhaoran Wang · Anwar Walid · Jian Guo · Michael Jordan 🔗

-

Behavior Predictive Representations for Generalization in Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Siddhant Agarwal · Aaron Courville · Rishabh Agarwal 🔗

-

Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari ( Poster ) > link
SlidesLive Video

Link

Dominik Schmidt · Thomas Schmied 🔗

-

Implicit Behavioral Cloning ( Poster ) > link
SlidesLive Video

Link

Pete Florence · Corey Lynch · Andy Zeng · Oscar Ramirez · Ayzaan Wahid · Laura Downs · Adrian Wong · Igor Mordatch · Jonathan Tompson 🔗

-

Policy Gradients Incorporating the Future ( Poster ) > link
SlidesLive Video

Link

David Venuto · Elaine Lau · Doina Precup · Ofir Nachum 🔗

-

TempoRL: Temporal Priors for Exploration in Off-Policy Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Marco Bagatella · Sammy Christen · Otmar Hilliges 🔗

-

Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning ( Poster ) > link
SlidesLive Video

Link

Utkarsh A Mishra · Soumya Samineni · Shalabh Bhatnagar · Shishir N Y 🔗

-

Exploring through Random Curiosity with General Value Functions ( Poster ) > link
SlidesLive Video

Link

Aditya Ramesh · Louis Kirsch · Sjoerd van Steenkiste · Jürgen Schmidhuber 🔗

-

Maximum Entropy Model-based Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Oleg Svidchenko · Aleksei Shpilman 🔗

-

Exponential Family Model-Based Reinforcement Learning via Score Matching ( Poster ) > link
SlidesLive Video

Link

Gene Li · Junbo Li · Nathan Srebro · Zhaoran Wang · Zhuoran Yang 🔗

-

Imitation Learning from Pixel Observations for Continuous Control ( Poster ) > link
SlidesLive Video

Link

Samuel Cohen · Brandon Amos · Marc Deisenroth · Mikael Henaff · Eugene Vinitsky · Denis Yarats 🔗

-

Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Guy Tennenholtz · Assaf Hallak · Gal Dalal · Shie Mannor · Gal Chechik · Uri Shalit 🔗

-

Latent Geodesics of Model Dynamics for Offline Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Guy Tennenholtz · Nir Baram · Shie Mannor 🔗

-

An Empirical Study of Non-Uniform Sampling in Off-Policy Reinforcement Learning for Continuous Control ( Poster ) > link
SlidesLive Video

Link

Nicholas Ioannidis · Jonathan Lavington · Mark Schmidt 🔗

-

On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension ( Poster ) > link
SlidesLive Video

Link

Udari Madhushani · Biswadip Dey · Naomi Leonard · Amit Chakraborty 🔗

-

Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback ( Poster ) > link
SlidesLive Video

Link

Xiaofei Wang · Kimin Lee · Kourosh Hakhamaneshi · Pieter Abbeel · Misha Laskin 🔗

-

That Escalated Quickly: Compounding Complexity by Editing Levels at the Frontier of Agent Capabilities ( Poster ) > link
SlidesLive Video

Link

Jack Parker-Holder · Minqi Jiang · Michael Dennis · Mikayel Samvelyan · Jakob Foerster · Edward Grefenstette · Tim Rocktäschel 🔗

-

The Information Geometry of Unsupervised Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Ben Eysenbach · Russ Salakhutdinov · Sergey Levine 🔗

-

Mismatched No More: Joint Model-Policy Optimization for Model-Based RL ( Poster ) > link
SlidesLive Video

Link

Ben Eysenbach · Alexander Khazatsky · Sergey Levine · Russ Salakhutdinov 🔗

-

Graph Backup: Data Efficient Backup Exploiting Markovian Data ( Poster ) > link
SlidesLive Video

Link

zhengyao Jiang · Tianjun Zhang · Robert Kirk · Tim Rocktäschel · Edward Grefenstette 🔗

-

Offline Meta-Reinforcement Learning with Online Self-Supervision ( Poster ) > link

Link

Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine 🔗

-

Unsupervised Learning of Temporal Abstractions using Slot-based Transformers ( Poster ) > link

Link

Anand Gopalakrishnan · Kazuki Irie · Jürgen Schmidhuber · Sjoerd van Steenkiste 🔗

-

Modern Hopfield Networks for Return Decomposition for Delayed Rewards ( Poster ) > link
SlidesLive Video

Link

Michael Widrich · Markus Hofmarcher · Vihang Patil · Angela Bitto · Sepp Hochreiter 🔗

-

Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium ( Poster ) > link

Link

Chris Junchi Li · Dongruo Zhou · Quanquan Gu · Michael Jordan 🔗

-

Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning ( Poster ) > link

Link

Videh Nema · Balaraman Ravindran 🔗

-

Stability Analysis in Mixed-Autonomous Traffic with Deep Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Dongsu Lee · Minhae Kwon 🔗

-

Understanding the Effects of Dataset Composition on Offline Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Kajetan Schweighofer · Markus Hofmarcher · Marius-Constantin Dinu · Philipp Renz · Angela Bitto · Vihang Patil · Sepp Hochreiter 🔗

-

Learning Efficient Multi-Agent Cooperative Visual Exploration ( Poster ) > link
SlidesLive Video

Link

Chao Yu · Jiaxuan Gao · Huazhong Yang · Yu Wang · Yi Wu 🔗

-

Mean-Variance Efficient Reinforcement Learning by Expected Quadratic Utility Maximization ( Poster ) > link
SlidesLive Video

Link

Masahiro Kato · Kei Nakagawa · Kenshi Abe · Tetsuro Morimura 🔗

-

Learning compositional tasks from language instructions ( Poster ) > link
SlidesLive Video

Link

Lajanugen Logeswaran · Wilka Carvalho · Honglak Lee 🔗

-

Large Scale Coordination Transfer for Cooperative Multi-Agent Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Ethan Wang · Binghong Chen · Le Song 🔗

-

Return Dispersion as an Estimator of Learning Potential for Prioritized Level Replay ( Poster ) > link

Link

Iryna Korshunova · Minqi Jiang · Jack Parker-Holder · Tim Rocktäschel · Edward Grefenstette 🔗

-

Status-quo policy gradient in Multi-Agent Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Pinkesh Badjatiya · Mausoom Sarkar · Nikaash Puri · Jayakumar Subramanian · Abhishek Sinha · Siddharth Singh · Balaji Krishnamurthy 🔗

-

Deep Reinforcement Learning Explanation via Model Transforms ( Poster ) > link
SlidesLive Video

Link

Sarah Keren · Yoav Kolumbus · Jeffrey S Rosenschein · David Parkes · Mira Finkelstein 🔗

-

A Meta-Gradient Approach to Learning Cooperative Multi-Agent Communication Topology ( Poster ) > link

Link

Qi Zhang · Dingyang Chen 🔗

-

A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Adrian Brasoveanu · Rohan Pandey · Maximilian Alfano-Smith 🔗

-

OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion ( Poster ) > link
SlidesLive Video

Link

Vittorio La Barbera · Fabio Pardo · Yuval Tassa · Petar Kormushev · John Hutchinson 🔗

-

Hybrid Imitative Planning with Geometric and Predictive Costs in Offroad Environments ( Poster ) > link
SlidesLive Video

Link

Daniel Shin · Dhruv Shah · Ali Agha · Nicholas Rhinehart · Sergey Levine 🔗

-

Accelerated Deep Reinforcement Learning of Terrain-Adaptive Locomotion Skills ( Poster ) > link

Link

Khaled Refaat · Kai Ding 🔗

-

CoMPS: Continual Meta Policy Search ( Poster ) > link
SlidesLive Video

Link

Glen Berseth · Zhiwei Zhang · Grace Zhang · Chelsea Finn · Sergey Levine 🔗

-

Continuous Control with Action Quantization from Demonstrations ( Poster ) > link

Link

Robert Dadashi · Leonard Hussenot · Damien Vincent · Anton Raichuk · Matthieu Geist · Olivier Pietquin 🔗

-

Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments ( Poster ) > link
SlidesLive Video

Link

Ken Ming Lee · Sriram Ganapathi · Mark Crowley 🔗

-

Expert Human-Level Driving in Gran Turismo Sport Using Deep Reinforcement Learning with Image-based Representation ( Poster ) > link
SlidesLive Video

Link

Ryuji Imamura · Takuma Seno · Kenta Kawamoto · Michael Spranger 🔗

-

MHER: Model-based Hindsight Experience Replay ( Poster ) > link
SlidesLive Video

Link

Yang Rui · Meng Fang · Lei Han · Yali Du · Feng Luo · Xiu Li 🔗

-

On the Transferability of Deep-Q Networks ( Poster ) > link
SlidesLive Video

Link

Matthia Sabatelli · Pierre Geurts 🔗

-

Adaptive Scheduling of Data Augmentation for Deep Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Byungchan Ko · Jungseul Ok 🔗

-

Skill-based Meta-Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Taewook Nam · Shao-Hua Sun · Karl Pertsch · Sung Ju Hwang · Joseph Lim 🔗

-

Introducing Symmetries to Black Box Meta Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Louis Kirsch · Sebastian Flennerhag · Hado van Hasselt · Abram Friesen · Junhyuk Oh · Yutian Chen 🔗

-

A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems ( Poster ) > link
SlidesLive Video

Link

Xian Yeow Lee · Soumik Sarkar 🔗

-

Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models ( Poster ) > link
SlidesLive Video

Link

Nitish Srivastava · Walter Talbott · Shuangfei Zhai · Joshua Susskind 🔗

-

Component Transfer Learning for Deep RL Based on Abstract Representations ( Poster ) > link
SlidesLive Video

Link

Geoffrey Driessel · Vincent Francois-Lavet 🔗

-

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives ( Poster ) > link
SlidesLive Video

Link

Toshinori Kitamura · Ryo Yonetani 🔗

-

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation ( Poster ) > link

Link

Boyan Li · Hongyao Tang · YAN ZHENG · Jianye Hao · Pengyi Li · Zhaopeng Meng · LI Wang 🔗

-

Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL ( Poster ) > link

Link

Catherine Cang · Aravind Rajeswaran · Pieter Abbeel · Misha Laskin 🔗

-

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Jason Yecheng Ma · Andrew Shen · Osbert Bastani · Dinesh Jayaraman 🔗

-

Math Programming based Reinforcement Learning for Multi-Echelon Inventory Management ( Poster ) > link
SlidesLive Video

Link

Pavithra Harsha · Ashish Jagmohan · Jayant Kalagnanam · Brian Quanz · Divya Singhvi 🔗

-

Implicitly Regularized RL with Implicit Q-values ( Poster ) > link
SlidesLive Video

Link

Nino Vieillard · Marcin Andrychowicz · Anton Raichuk · Olivier Pietquin · Matthieu Geist 🔗

-

Towards Automatic Actor-Critic Solutions to Continuous Control ( Poster ) > link
SlidesLive Video

Link

Jake Grigsby · Jin Yong Yoo · Yanjun Qi 🔗

-

Transferring Dexterous Manipulation from GPU Simulation to a Remote Real-World Trifinger ( Poster ) > link
SlidesLive Video

Link

Arthur Allshire · Mayank Mittal · Varun Lodaya · Viktor Makoviychuk · Denys Makoviichuk · Felix Widmaier · Manuel Wuethrich · Stefan Bauer · Ankur Handa · Animesh Garg 🔗

-

Hierarchical Few-Shot Imitation with Skill Transition Models ( Poster ) > link
SlidesLive Video

Link

Kourosh Hakhamaneshi · Ruihan Zhao · Albert Zhan · Pieter Abbeel · Misha Laskin 🔗

-

Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives ( Poster ) > link
SlidesLive Video

Link

Murtaza Dalal · Deepak Pathak · Russ Salakhutdinov 🔗

-

Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL ( Poster ) > link
SlidesLive Video

Link

Yanchao Sun · Ruijie Zheng · Yongyuan Liang · Furong Huang 🔗

-

Automatic Curricula via Expert Demonstrations ( Poster ) > link
SlidesLive Video

Link

Siyu Dai · Andreas Hofmann · Brian Williams 🔗

-

Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Denis Yarats · Rob Fergus · Alessandro Lazaric · Lerrel Pinto 🔗

-

Benchmarking the Spectrum of Agent Capabilities ( Poster ) > link

Link

Danijar Hafner 🔗

-

Policy Optimization via Optimal Policy Evaluation ( Poster ) > link
SlidesLive Video

Link

Alberto Maria Metelli · Samuele Meta · Marcello Restelli 🔗

-

A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Mingde Zhao · Zhen Liu · Sitao Luan · Shuyuan Zhang · Doina Precup · Yoshua Bengio 🔗

-

Discriminator Augmented Model-Based Reinforcement Learning ( Poster ) > link
SlidesLive Video

Link

Allan Zhou · Archit Sharma · Chelsea Finn 🔗

Main Navigation

Workshop

Deep Reinforcement Learning

Pieter Abbeel · Chelsea Finn · David Silver · Matthew Taylor · Martha White · Srijita Das · Yuqing Du · Andrew Patterson · Manan Tomar · Olivia Watkins

Schedule