Mon 8:55 a.m. - 9:00 a.m.
|
Welcome and Introduction
(
Welcoming Notes
)
>
SlidesLive Video
|
🔗
|
Mon 9:00 a.m. - 9:12 a.m.
|
Implicit Behavioral Cloning
(
Oral
)
>
link
SlidesLive Video
|
Pete Florence · Corey Lynch · Andy Zeng · Oscar Ramirez · Ayzaan Wahid · Laura Downs · Adrian Wong · Igor Mordatch · Jonathan Tompson
🔗
|
Mon 9:12 a.m. - 9:15 a.m.
|
Implicit Behavioral Cloning Q&A
(
Q&A
)
>
link
|
Pete Florence · Corey Lynch · Andy Zeng · Oscar Ramirez · Ayzaan Wahid · Laura Downs · Adrian Wong · Igor Mordatch · Jonathan Tompson
🔗
|
Mon 9:15 a.m. - 9:27 a.m.
|
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
(
Oral
)
>
link
SlidesLive Video
|
Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine
🔗
|
Mon 9:27 a.m. - 9:30 a.m.
|
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization Q&A
(
Q&A
)
>
link
|
Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine
🔗
|
Mon 9:30 a.m. - 9:42 a.m.
|
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
(
Oral
)
>
link
SlidesLive Video
|
Boyan Li · Hongyao Tang · YAN ZHENG · Jianye Hao · Pengyi Li · Zhaopeng Meng · LI Wang
🔗
|
Mon 9:42 a.m. - 9:45 a.m.
|
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation Q&A
(
Q&A
)
>
link
|
Boyan Li · Hongyao Tang · YAN ZHENG · Jianye Hao · Pengyi Li · Zhaopeng Meng · LI Wang
🔗
|
Mon 9:45 a.m. - 9:57 a.m.
|
Benchmarking the Spectrum of Agent Capabilities
(
Oral
)
>
link
SlidesLive Video
|
Danijar Hafner
🔗
|
Mon 9:57 a.m. - 10:00 a.m.
|
Benchmarking the Spectrum of Agent Capabilities Q&A
(
Q&A
)
>
link
|
Danijar Hafner
🔗
|
Mon 10:00 a.m. - 10:25 a.m.
|
Invited Talk: Laura Schulz - In praise of folly: Goals, play, and human cognition
(
Talk
)
>
SlidesLive Video
|
Laura Schulz
🔗
|
Mon 10:25 a.m. - 10:30 a.m.
|
Laura Schulz Talk Q&A
(
Q&A
)
>
|
Laura Schulz
🔗
|
Mon 10:30 a.m. - 11:00 a.m.
|
Break
|
🔗
|
Mon 11:00 a.m. - 11:25 a.m.
|
Opinion Contributed Talk: Wilka Carvalho
(
Talk
)
>
SlidesLive Video
|
Wilka Carvalho Carvalho
🔗
|
Mon 11:25 a.m. - 11:30 a.m.
|
Wilka Carvalho Talk Q&A
(
Q&A
)
>
|
Wilka Carvalho Carvalho
🔗
|
Mon 11:30 a.m. - 11:42 a.m.
|
Adaptive Scheduling of Data Augmentation for Deep Reinforcement Learning
(
Oral
)
>
link
SlidesLive Video
|
Byungchan Ko · Jungseul Ok
🔗
|
Mon 11:42 a.m. - 11:45 a.m.
|
Adaptive Scheduling of Data Augmentation for Deep Reinforcement Learning Q&A
(
Oral
)
>
link
|
Byungchan Ko · Jungseul Ok
🔗
|
Mon 11:45 a.m. - 11:57 a.m.
|
Offline Meta-Reinforcement Learning with Online Self-Supervision
(
Oral
)
>
link
SlidesLive Video
|
Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine
🔗
|
Mon 11:57 a.m. - 12:00 p.m.
|
Offline Meta-Reinforcement Learning with Online Self-Supervision Q&A
(
Q&A
)
>
link
|
Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine
🔗
|
Mon 12:00 p.m. - 12:25 p.m.
|
Invited Talk: George Konidaris - Signal to Symbol (via Skills)
(
Talk
)
>
SlidesLive Video
|
George Konidaris
🔗
|
Mon 12:25 p.m. - 12:30 p.m.
|
George Konidaris Talk Q&A
(
Q&A
)
>
|
George Konidaris
🔗
|
Mon 12:30 p.m. - 1:30 p.m.
|
Poster Session (in Gather Town)
(
Poster Session
)
>
|
🔗
|
Mon 1:30 p.m. - 1:55 p.m.
|
Opinion Contributed Talk: Sergey Levine
(
Talk
)
>
SlidesLive Video
|
Sergey Levine
🔗
|
Mon 1:55 p.m. - 2:00 p.m.
|
Sergey Levine Talk Q&A
(
Q&A
)
>
|
Sergey Levine
🔗
|
Mon 2:00 p.m. - 2:30 p.m.
|
Panel Discussion 1
(
Panel Discussion
)
>
SlidesLive Video
|
🔗
|
Mon 2:30 p.m. - 2:55 p.m.
|
Invited Talk: Dale Schuurmans - Understanding Deep Value Estimation
(
Talk
)
>
SlidesLive Video
|
Dale Schuurmans
🔗
|
Mon 2:55 p.m. - 3:00 p.m.
|
Dale Schuurmans Talk Q&A
(
Q&A
)
>
|
Dale Schuurmans
🔗
|
Mon 3:00 p.m. - 3:30 p.m.
|
Break
|
🔗
|
Mon 3:30 p.m. - 3:57 p.m.
|
Invited Talk: Karol Hausman - Reinforcement Learning as a Data Sponge
(
Talk
)
>
SlidesLive Video
|
Karol Hausman
🔗
|
Mon 3:55 p.m. - 4:00 p.m.
|
Karol Hausman Talk Q&A
(
Q&A
)
>
|
Karol Hausman
🔗
|
Mon 4:00 p.m. - 4:30 p.m.
|
NeurIPS RL Competitions Results Presentations
(
Presentations
)
>
SlidesLive Video
|
Rohin Shah · Liam Paull · Tabitha Lee · Tim Rocktäschel · Heinrich Küttler · Sharada Mohanty · Manuel Wuethrich
🔗
|
Mon 4:30 p.m. - 4:55 p.m.
|
Invited Talk: Kenji Doya - Natural and Artificial Reinforcement Learning
(
Talk
)
>
SlidesLive Video
|
Kenji Doya
🔗
|
Mon 4:55 p.m. - 5:00 p.m.
|
Kenji Doya Talk Q&A
(
Q&A
)
>
|
Kenji Doya
🔗
|
Mon 5:00 p.m. - 6:00 p.m.
|
Panel Discussion 2
(
Panel Discussion
)
>
SlidesLive Video
|
🔗
|
-
|
Self-Imitation Learning from Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Georgiy Pshikhachev · Dmitry Ivanov · Vladimir Egorov · Aleksei Shpilman
🔗
|
-
|
Understanding and Preventing Capacity Loss in Reinforcement Learning
(
Poster
)
>
link
|
Clare Lyle · Mark Rowland · Will Dabney
🔗
|
-
|
Variance-Seeking Meta-Exploration to Handle Out-of-Distribution Tasks
(
Poster
)
>
link
|
Yashvir Singh Grewal · Sarah Goodwin
🔗
|
-
|
A Closer Look at Gradient Estimators with Reinforcement Learning as Inference
(
Poster
)
>
link
SlidesLive Video
|
Jonathan Lavington · Michael Teng · Mark Schmidt · Frank Wood
🔗
|
-
|
From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation
(
Poster
)
>
link
SlidesLive Video
|
Yuzhe Qin · Hao Su · Xiaolong Wang
🔗
|
-
|
Attention-based Partial Decoupling of Policy and Value for Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Nasik Muhammad Nafi · Creighton Glasscock · William Hsu
🔗
|
-
|
Imitation Learning from Observations under Transition Model Disparity
(
Poster
)
>
link
|
Tanmay Gangwani · Yuan Zhou · Jian Peng
🔗
|
-
|
Vision-Guided Quadrupedal Locomotion in the Wild with Multi-Modal Delay Randomization
(
Poster
)
>
link
|
Minghao Zhang · Ruihan Yang · Yuzhe Qin · Xiaolong Wang
🔗
|
-
|
Learning from demonstrations with SACR2: Soft Actor-Critic with Reward Relabeling
(
Poster
)
>
link
SlidesLive Video
|
Jesús Bujalance Martín · Raphael Chekroun · Fabien Moutarde
🔗
|
-
|
Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
(
Poster
)
>
link
SlidesLive Video
|
Ling Pan · Longbo Huang · Tengyu Ma · Huazhe Xu
🔗
|
-
|
Generalisation in Lifelong Reinforcement Learning through Logical Composition
(
Poster
)
>
link
SlidesLive Video
|
Geraud Nangue Tasse · Steven James · Benjamin Rosman
🔗
|
-
|
DreamerPro: Reconstruction-Free Model-Based Reinforcement Learning with Prototypical Representations
(
Poster
)
>
link
SlidesLive Video
|
Fei Deng · Ingook Jang · Sungjin Ahn
🔗
|
-
|
Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates
(
Poster
)
>
link
SlidesLive Video
|
Litian Liang · Yaosheng Xu · Stephen McAleer · Dailin Hu · Alexander Ihler · Pieter Abbeel · Roy Fox
🔗
|
-
|
Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method
(
Poster
)
>
link
|
Duo XU
🔗
|
-
|
Learning Vision-Guided Quadrupedal Locomotion End-to-End with Cross-Modal Transformers
(
Poster
)
>
link
SlidesLive Video
|
Ruihan Yang · Minghao Zhang · Nicklas Hansen · Huazhe Xu · Xiaolong Wang
🔗
|
-
|
Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation
(
Poster
)
>
link
|
Rishabh Jangir · Nicklas Hansen · Xiaolong Wang
🔗
|
-
|
Learning Value Functions from Undirected State-only Experience
(
Poster
)
>
link
SlidesLive Video
|
Matthew Chang · Arjun Gupta · Saurabh Gupta
🔗
|
-
|
Target Entropy Annealing for Discrete Soft Actor-Critic
(
Poster
)
>
link
SlidesLive Video
|
Yaosheng Xu · Dailin Hu · Litian Liang · Stephen McAleer · Pieter Abbeel · Roy Fox
🔗
|
-
|
Learning Action Translator for Meta Reinforcement Learning on Sparse-Reward Tasks
(
Poster
)
>
link
SlidesLive Video
|
Yijie Guo · Qiucheng Wu · Honglak Lee
🔗
|
-
|
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals
(
Poster
)
>
link
|
Ozsel Kilinc · Giovanni Montana
🔗
|
-
|
The Reflective Explorer: Online Meta-Exploration from Offline Data in Realistic Robotic Tasks
(
Poster
)
>
link
SlidesLive Video
|
Rafael Rafailov · · Tianhe Yu · Avi Singh · Mariano Phielipp · Chelsea Finn
🔗
|
-
|
BLAST: Latent Dynamics Models from Bootstrapping
(
Poster
)
>
link
SlidesLive Video
|
Keiran Paster · Lev McKinney · Sheila McIlraith · Jimmy Ba
🔗
|
-
|
Value Function Spaces: Skill-Centric State Abstractions for Long-Horizon Reasoning
(
Poster
)
>
link
SlidesLive Video
|
Dhruv Shah · Ted Xiao · Alexander Toshev · Sergey Levine · brian ichter
🔗
|
-
|
Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Dailin Hu · Pieter Abbeel · Roy Fox
🔗
|
-
|
Data Sharing without Rewards in Multi-Task Offline Reinforcement Learning
(
Poster
)
>
link
|
Tianhe Yu · Aviral Kumar · Yevgen Chebotar · Chelsea Finn · Sergey Levine · Karol Hausman
🔗
|
-
|
StarCraft II Unplugged: Large Scale Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
20 presenters
Michael Mathieu · Sherjil Ozair · Srivatsan Srinivasan · Caglar Gulcehre · Shangtong Zhang · Ray Jiang · Tom Paine · Konrad Żołna · Julian Schrittwieser · David Choi · Petko I Georgiev · Daniel Toyama · Roman Ring · Igor Babuschkin · Timo Ewalds · · Aaron van den Oord · Wojciech Czarnecki · Nando de Freitas · Oriol Vinyals
🔗
|
-
|
Learning Robust Dynamics through Variational Sparse Gating
(
Poster
)
>
link
SlidesLive Video
|
Arnav Kumar Jain · Shivakanth Sujit · Shruti Joshi · Vincent Michalski · Danijar Hafner · Samira Ebrahimi Kahou
🔗
|
-
|
Should I Run Offline Reinforcement Learning or Behavioral Cloning?
(
Poster
)
>
link
|
Aviral Kumar · Joey Hong · Anikait Singh · Sergey Levine
🔗
|
-
|
DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
(
Poster
)
>
link
|
Aviral Kumar · Rishabh Agarwal · Tengyu Ma · Aaron Courville · George Tucker · Sergey Levine
🔗
|
-
|
Deep RePReL--Combining Planning and Deep RL for acting in relational domains
(
Poster
)
>
link
SlidesLive Video
|
Harsha Kokel · Arjun Manoharan · Sriraam Natarajan · Balaraman Ravindran · Prasad Tadepalli
🔗
|
-
|
Fast Inference and Transfer of Compositional Task for Few-shot Task Generalization
(
Poster
)
>
link
SlidesLive Video
|
Sungryull Sohn · Hyunjae Woo · Jongwook Choi · Izzeddin Gur · Aleksandra Faust · Honglak Lee
🔗
|
-
|
Benchmark for Out-of-Distribution Detection in Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Aaqib Parvez Mohammed · Matias Valdenegro-Toro
🔗
|
-
|
Learning from Guided Play: A Scheduled Hierarchical Approach for Improving Exploration in Adversarial Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Trevor Ablett · Bryan Chan · Jonathan Kelly
🔗
|
-
|
Off-Policy Correction For Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Michał Zawalski · Błażej Osiński · Henryk Michalewski · Piotr Miłoś
🔗
|
-
|
Bayesian Exploration for Lifelong Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Haotian Fu · Shangqun Yu · Michael Littman · George Konidaris
🔗
|
-
|
A Modern Self-Referential Weight Matrix That Learns to Modify Itself
(
Poster
)
>
link
SlidesLive Video
|
Kazuki Irie · Imanol Schlag · Róbert Csordás · Jürgen Schmidhuber
🔗
|
-
|
Distributional Decision Transformer for Offline Hindsight Information Matching
(
Poster
)
>
link
SlidesLive Video
|
Hiroki Furuta · Yutaka Matsuo · Shixiang (Shane) Gu
🔗
|
-
|
Offline Policy Selection under Uncertainty
(
Poster
)
>
link
|
Mengjiao (Sherry) Yang · Bo Dai · Ofir Nachum · George Tucker · Dale Schuurmans
🔗
|
-
|
Learning Transferable Motor Skills with Hierarchical Latent Mixture Policies
(
Poster
)
>
link
SlidesLive Video
|
11 presenters
Dushyant Rao · Fereshteh Sadeghi · Leonard Hasenclever · Markus Wulfmeier · Martina Zambelli · Giulia Vezzani · Dhruva Tirumala · Yusuf Aytar · Josh Merel · Nicolas Heess · Raia Hadsell
🔗
|
-
|
Wish you were here: Hindsight Goal Selection for long-horizon dexterous manipulation
(
Poster
)
>
link
SlidesLive Video
|
Todor Davchev · Oleg Sushkov · Jean-Baptiste Regli · Stefan Schaal · Yusuf Aytar · Markus Wulfmeier · Jonathan Scholz
🔗
|
-
|
Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization
(
Poster
)
>
link
SlidesLive Video
|
Zihan Zhou · Wei Fu · Bingliang Zhang · Yi Wu
🔗
|
-
|
Neighborhood Mixup Experience Replay: Local Convex Interpolation for Improved Sample Efficiency in Continuous Control Tasks
(
Poster
)
>
link
SlidesLive Video
|
Ryan Sander · Wilko Schwarting · Tim Seyde · Igor Gilitschenski · Sertac Karaman · Daniela Rus
🔗
|
-
|
Cross-Domain Imitation Learning via Optimal Transport
(
Poster
)
>
link
SlidesLive Video
|
Arnaud Fickinger · Samuel Cohen · Stuart Russell · Brandon Amos
🔗
|
-
|
Lifting the veil on hyper-parameters for value-baseddeep reinforcement learning
(
Poster
)
>
link
SlidesLive Video
|
João Madeira Araújo · Johan Obando Ceron · Pablo Samuel Castro
🔗
|
-
|
Reward Uncertainty for Exploration in Preference-based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Xinran Liang · Katherine Shu · Kimin Lee · Pieter Abbeel
🔗
|
-
|
TransDreamer: Reinforcement Learning with Transformer World Models
(
Poster
)
>
link
SlidesLive Video
|
· Jaesik Yoon · Yi-Fu Wu · Sungjin Ahn
🔗
|
-
|
Learning Parameterized Task Structure for Generalization to Unseen Entities
(
Poster
)
>
link
SlidesLive Video
|
Anthony Liu · Sungryull Sohn · Honglak Lee
🔗
|
-
|
The Effects of Reward Misspecification: Mapping and Mitigating Misaligned Models
(
Poster
)
>
link
SlidesLive Video
|
Alexander Pan · Kush Bhatia · Jacob Steinhardt
🔗
|
-
|
Learning a Subspace of Policies for Online Adaptation in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Jean-Baptiste Gaya · Laure Soulier · Ludovic Denoyer
🔗
|
-
|
Adaptively Calibrated Critic Estimates for Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Nicolai Dorka · Joschka Boedecker · Wolfram Burgard
🔗
|
-
|
Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Haoran Xu · Xianyuan Zhan · Honglei Yin ·
🔗
|
-
|
Task-driven Discovery of Perceptual Schemas for Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Wilka Carvalho · Andrew Lampinen · Kyriacos Nikiforou · Felix Hill · Murray Shanahan
🔗
|
-
|
Meta Arcade: A Configurable Environment Suite for Deep Reinforcement Learning and Meta-Learning
(
Poster
)
>
link
SlidesLive Video
|
Edward Staley · Jared Markowitz · Kapil Katyal
🔗
|
-
|
Hindsight Foresight Relabeling for Meta-Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Michael Wan · Jian Peng · Tanmay Gangwani
🔗
|
-
|
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
(
Poster
)
>
link
SlidesLive Video
|
Misha Laskin · Hao Liu · Xue Bin Peng · Denis Yarats · Aravind Rajeswaran · Pieter Abbeel
🔗
|
-
|
Continuous Control With Ensemble Deep Deterministic Policy Gradients
(
Poster
)
>
link
SlidesLive Video
|
Piotr Januszewski · Mateusz Olko · Michał Królikowski · Jakub Swiatkowski · Marcin Andrychowicz · Łukasz Kuciński · Piotr Miłoś
🔗
|
-
|
What Would the Expert $do(\cdot)$?: Causal Imitation Learning
(
Poster
)
>
link
SlidesLive Video
|
Gokul Swamy · Sanjiban Choudhury · James Bagnell · Steven Wu
🔗
|
-
|
Grounding Aleatoric Uncertainty in Unsupervised Environment Design
(
Poster
)
>
link
SlidesLive Video
|
Minqi Jiang · Michael Dennis · Jack Parker-Holder · Andrei Lupu · Heinrich Kuttler · Edward Grefenstette · Tim Rocktäschel · Jakob Foerster
🔗
|
-
|
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Jongjin Park · Younggyo Seo · Jinwoo Shin · Honglak Lee · Pieter Abbeel · Kimin Lee
🔗
|
-
|
Task-Induced Representation Learning
(
Poster
)
>
link
SlidesLive Video
|
Jun Yamada · Karl Pertsch · Anisha Gunjal · Joseph Lim
🔗
|
-
|
OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Jinyi Liu · Zhi Wang · YAN ZHENG · Jianye Hao · Junjie Ye · Chenjia Bai · Pengyi Li
🔗
|
-
|
GrASP: Gradient-Based Affordance Selection for Planning
(
Poster
)
>
link
|
Vivek Veeriah · Zeyu Zheng · Richard L Lewis · Satinder Singh
🔗
|
-
|
Beyond Target Networks: Improving Deep $Q$-learning with Functional Regularization
(
Poster
)
>
link
SlidesLive Video
|
Alexandre Piche · Joseph Marino · Gian Maria Marconi · Valentin Thomas · Chris Pal · Mohammad Emtiyaz Khan
🔗
|
-
|
No DICE: An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
(
Poster
)
>
link
SlidesLive Video
|
Risto Vuorio · Jacob Beck · Greg Farquhar · Jakob Foerster · Shimon Whiteson
🔗
|
-
|
Block Contextual MDPs for Continual Learning
(
Poster
)
>
link
SlidesLive Video
|
Shagun Sodhani · Franziska Meier · Joelle Pineau · Amy Zhang
🔗
|
-
|
PFPN: Continuous Control of Physically Simulated Characters using Particle Filtering Policy Network
(
Poster
)
>
link
SlidesLive Video
|
Pei Xu · Ioannis Karamouzas
🔗
|
-
|
Recurrent Off-policy Baselines for Memory-based Continuous Control
(
Poster
)
>
link
SlidesLive Video
|
Zhihan Yang · Nguyen Hai
🔗
|
-
|
A Framework for Efficient Robotic Manipulation
(
Poster
)
>
link
SlidesLive Video
|
Albert Zhan · Ruihan Zhao · Lerrel Pinto · Pieter Abbeel · Misha Laskin
🔗
|
-
|
Transfer RL across Observation Feature Spaces via Model-Based Regularization
(
Poster
)
>
link
SlidesLive Video
|
Yanchao Sun · Ruijie Zheng · Xiyao Wang · Andrew Cohen · Furong Huang
🔗
|
-
|
Embodiment perspective of reward definition for behavioural homeostasis
(
Poster
)
>
link
SlidesLive Video
|
Naoto Yoshida · Yasuo Kuniyoshi
🔗
|
-
|
Communication-Efficient Actor-Critic Methods for Homogeneous Markov Games
(
Poster
)
>
link
SlidesLive Video
|
Dingyang Chen · Yile Li · Qi Zhang
🔗
|
-
|
URLB: Unsupervised Reinforcement Learning Benchmark
(
Poster
)
>
link
SlidesLive Video
|
Misha Laskin · Denis Yarats · Hao Liu · Kimin Lee · Albert Zhan · Kevin Lu · Catherine Cang · Lerrel Pinto · Pieter Abbeel
🔗
|
-
|
Offline Reinforcement Learning with In-sample Q-Learning
(
Poster
)
>
link
|
Ilya Kostrikov · Ashvin Nair · Sergey Levine
🔗
|
-
|
Wasserstein Distance Maximizing Intrinsic Control
(
Poster
)
>
link
SlidesLive Video
|
Ishan Durugkar · Steven Hansen · Stephen Spencer · Volodymyr Mnih · Ishan Durugkar
🔗
|
-
|
Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks
(
Poster
)
>
link
SlidesLive Video
|
Soroush Nasiriany · Huihan Liu · Yuke Zhu
🔗
|
-
|
Strength Through Diversity: Robust Behavior Learning via Mixture Policies
(
Poster
)
>
link
SlidesLive Video
|
Tim Seyde · Wilko Schwarting · Igor Gilitschenski · Markus Wulfmeier · Daniela Rus
🔗
|
-
|
Long-Term Credit Assignment via Model-based Temporal Shortcuts
(
Poster
)
>
link
SlidesLive Video
|
Michel Ma · Pierluca D'Oro · Yoshua Bengio · Pierre-Luc Bacon
🔗
|
-
|
C-Planning: An Automatic Curriculum for Learning Goal-Reaching Tasks
(
Poster
)
>
link
SlidesLive Video
|
Tianjun Zhang · Ben Eysenbach · Russ Salakhutdinov · Sergey Levine · Joseph Gonzalez
🔗
|
-
|
General Characterization of Agents by States they Visit
(
Poster
)
>
link
SlidesLive Video
|
Anssi Kanervisto · Ville Hautamäki
🔗
|
-
|
TARGETED ENVIRONMENT DESIGN FROM OFFLINE DATA
(
Poster
)
>
link
SlidesLive Video
|
Izzeddin Gur · Ofir Nachum · Aleksandra Faust
🔗
|
-
|
GPU-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Xiao-Yang Liu · Zhuoran Yang · Zhaoran Wang · Anwar Walid · Jian Guo · Michael Jordan
🔗
|
-
|
Behavior Predictive Representations for Generalization in Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Siddhant Agarwal · Aaron Courville · Rishabh Agarwal
🔗
|
-
|
Fast and Data-Efficient Training of Rainbow: an Experimental Study on Atari
(
Poster
)
>
link
SlidesLive Video
|
Dominik Schmidt · Thomas Schmied
🔗
|
-
|
Implicit Behavioral Cloning
(
Poster
)
>
link
SlidesLive Video
|
Pete Florence · Corey Lynch · Andy Zeng · Oscar Ramirez · Ayzaan Wahid · Laura Downs · Adrian Wong · Igor Mordatch · Jonathan Tompson
🔗
|
-
|
Policy Gradients Incorporating the Future
(
Poster
)
>
link
SlidesLive Video
|
David Venuto · Elaine Lau · Doina Precup · Ofir Nachum
🔗
|
-
|
TempoRL: Temporal Priors for Exploration in Off-Policy Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Marco Bagatella · Sammy Christen · Otmar Hilliges
🔗
|
-
|
Dynamic Mirror Descent based Model Predictive Control for Accelerating Robot Learning
(
Poster
)
>
link
SlidesLive Video
|
Utkarsh A Mishra · Soumya Samineni · Shalabh Bhatnagar · Shishir N Y
🔗
|
-
|
Exploring through Random Curiosity with General Value Functions
(
Poster
)
>
link
SlidesLive Video
|
Aditya Ramesh · Louis Kirsch · Sjoerd van Steenkiste · Jürgen Schmidhuber
🔗
|
-
|
Maximum Entropy Model-based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Oleg Svidchenko · Aleksei Shpilman
🔗
|
-
|
Exponential Family Model-Based Reinforcement Learning via Score Matching
(
Poster
)
>
link
SlidesLive Video
|
Gene Li · Junbo Li · Nathan Srebro · Zhaoran Wang · Zhuoran Yang
🔗
|
-
|
Imitation Learning from Pixel Observations for Continuous Control
(
Poster
)
>
link
SlidesLive Video
|
Samuel Cohen · Brandon Amos · Marc Deisenroth · Mikael Henaff · Eugene Vinitsky · Denis Yarats
🔗
|
-
|
Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Guy Tennenholtz · Assaf Hallak · Gal Dalal · Shie Mannor · Gal Chechik · Uri Shalit
🔗
|
-
|
Latent Geodesics of Model Dynamics for Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Guy Tennenholtz · Nir Baram · Shie Mannor
🔗
|
-
|
An Empirical Study of Non-Uniform Sampling in Off-Policy Reinforcement Learning for Continuous Control
(
Poster
)
>
link
SlidesLive Video
|
Nicholas Ioannidis · Jonathan Lavington · Mark Schmidt
🔗
|
-
|
On Using Hamiltonian Monte Carlo Sampling for Reinforcement Learning Problems in High-dimension
(
Poster
)
>
link
SlidesLive Video
|
Udari Madhushani · Biswadip Dey · Naomi Leonard · Amit Chakraborty
🔗
|
-
|
Skill Preferences: Learning to Extract and Execute Robotic Skills from Human Feedback
(
Poster
)
>
link
SlidesLive Video
|
Xiaofei Wang · Kimin Lee · Kourosh Hakhamaneshi · Pieter Abbeel · Misha Laskin
🔗
|
-
|
That Escalated Quickly: Compounding Complexity by Editing Levels at the Frontier of Agent Capabilities
(
Poster
)
>
link
SlidesLive Video
|
Jack Parker-Holder · Minqi Jiang · Michael Dennis · Mikayel Samvelyan · Jakob Foerster · Edward Grefenstette · Tim Rocktäschel
🔗
|
-
|
The Information Geometry of Unsupervised Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Ben Eysenbach · Russ Salakhutdinov · Sergey Levine
🔗
|
-
|
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
(
Poster
)
>
link
SlidesLive Video
|
Ben Eysenbach · Alexander Khazatsky · Sergey Levine · Russ Salakhutdinov
🔗
|
-
|
Graph Backup: Data Efficient Backup Exploiting Markovian Data
(
Poster
)
>
link
SlidesLive Video
|
zhengyao Jiang · Tianjun Zhang · Robert Kirk · Tim Rocktäschel · Edward Grefenstette
🔗
|
-
|
Offline Meta-Reinforcement Learning with Online Self-Supervision
(
Poster
)
>
link
|
Vitchyr Pong · Ashvin Nair · Laura Smith · Catherine Huang · Sergey Levine
🔗
|
-
|
Unsupervised Learning of Temporal Abstractions using Slot-based Transformers
(
Poster
)
>
link
|
Anand Gopalakrishnan · Kazuki Irie · Jürgen Schmidhuber · Sjoerd van Steenkiste
🔗
|
-
|
Modern Hopfield Networks for Return Decomposition for Delayed Rewards
(
Poster
)
>
link
SlidesLive Video
|
Michael Widrich · Markus Hofmarcher · Vihang Patil · Angela Bitto · Sepp Hochreiter
🔗
|
-
|
Learning Two-Player Mixture Markov Games: Kernel Function Approximation and Correlated Equilibrium
(
Poster
)
>
link
|
Chris Junchi Li · Dongruo Zhou · Quanquan Gu · Michael Jordan
🔗
|
-
|
Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning
(
Poster
)
>
link
|
Videh Nema · Balaraman Ravindran
🔗
|
-
|
Stability Analysis in Mixed-Autonomous Traffic with Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Dongsu Lee · Minhae Kwon
🔗
|
-
|
Understanding the Effects of Dataset Composition on Offline Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Kajetan Schweighofer · Markus Hofmarcher · Marius-Constantin Dinu · Philipp Renz · Angela Bitto · Vihang Patil · Sepp Hochreiter
🔗
|
-
|
Learning Efficient Multi-Agent Cooperative Visual Exploration
(
Poster
)
>
link
SlidesLive Video
|
Chao Yu · Jiaxuan Gao · Huazhong Yang · Yu Wang · Yi Wu
🔗
|
-
|
Mean-Variance Efficient Reinforcement Learning by Expected Quadratic Utility Maximization
(
Poster
)
>
link
SlidesLive Video
|
Masahiro Kato · Kei Nakagawa · Kenshi Abe · Tetsuro Morimura
🔗
|
-
|
Learning compositional tasks from language instructions
(
Poster
)
>
link
SlidesLive Video
|
Lajanugen Logeswaran · Wilka Carvalho · Honglak Lee
🔗
|
-
|
Large Scale Coordination Transfer for Cooperative Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Ethan Wang · Binghong Chen · Le Song
🔗
|
-
|
Return Dispersion as an Estimator of Learning Potential for Prioritized Level Replay
(
Poster
)
>
link
|
Iryna Korshunova · Minqi Jiang · Jack Parker-Holder · Tim Rocktäschel · Edward Grefenstette
🔗
|
-
|
Status-quo policy gradient in Multi-Agent Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Pinkesh Badjatiya · Mausoom Sarkar · Nikaash Puri · Jayakumar Subramanian · Abhishek Sinha · Siddharth Singh · Balaji Krishnamurthy
🔗
|
-
|
Deep Reinforcement Learning Explanation via Model Transforms
(
Poster
)
>
link
SlidesLive Video
|
Sarah Keren · Yoav Kolumbus · Jeffrey S Rosenschein · David Parkes · Mira Finkelstein
🔗
|
-
|
A Meta-Gradient Approach to Learning Cooperative Multi-Agent Communication Topology
(
Poster
)
>
link
|
Qi Zhang · Dingyang Chen
🔗
|
-
|
A Family of Cognitively Realistic Parsing Environments for Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Adrian Brasoveanu · Rohan Pandey · Maximilian Alfano-Smith
🔗
|
-
|
OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion
(
Poster
)
>
link
SlidesLive Video
|
Vittorio La Barbera · Fabio Pardo · Yuval Tassa · Petar Kormushev · John Hutchinson
🔗
|
-
|
Hybrid Imitative Planning with Geometric and Predictive Costs in Offroad Environments
(
Poster
)
>
link
SlidesLive Video
|
Daniel Shin · Dhruv Shah · Ali Agha · Nicholas Rhinehart · Sergey Levine
🔗
|
-
|
Accelerated Deep Reinforcement Learning of Terrain-Adaptive Locomotion Skills
(
Poster
)
>
link
|
Khaled Refaat · Kai Ding
🔗
|
-
|
CoMPS: Continual Meta Policy Search
(
Poster
)
>
link
SlidesLive Video
|
Glen Berseth · Zhiwei Zhang · Grace Zhang · Chelsea Finn · Sergey Levine
🔗
|
-
|
Continuous Control with Action Quantization from Demonstrations
(
Poster
)
>
link
|
Robert Dadashi · Leonard Hussenot · Damien Vincent · Anton Raichuk · Matthieu Geist · Olivier Pietquin
🔗
|
-
|
Investigation of Independent Reinforcement Learning Algorithms in Multi-Agent Environments
(
Poster
)
>
link
SlidesLive Video
|
Ken Ming Lee · Sriram Ganapathi · Mark Crowley
🔗
|
-
|
Expert Human-Level Driving in Gran Turismo Sport Using Deep Reinforcement Learning with Image-based Representation
(
Poster
)
>
link
SlidesLive Video
|
Ryuji Imamura · Takuma Seno · Kenta Kawamoto · Michael Spranger
🔗
|
-
|
MHER: Model-based Hindsight Experience Replay
(
Poster
)
>
link
SlidesLive Video
|
Yang Rui · Meng Fang · Lei Han · Yali Du · Feng Luo · Xiu Li
🔗
|
-
|
On the Transferability of Deep-Q Networks
(
Poster
)
>
link
SlidesLive Video
|
Matthia Sabatelli · Pierre Geurts
🔗
|
-
|
Adaptive Scheduling of Data Augmentation for Deep Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Byungchan Ko · Jungseul Ok
🔗
|
-
|
Skill-based Meta-Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Taewook Nam · Shao-Hua Sun · Karl Pertsch · Sung Ju Hwang · Joseph Lim
🔗
|
-
|
Introducing Symmetries to Black Box Meta Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Louis Kirsch · Sebastian Flennerhag · Hado van Hasselt · Abram Friesen · Junhyuk Oh · Yutian Chen
🔗
|
-
|
A Graph Policy Network Approach for Volt-Var Control in Power Distribution Systems
(
Poster
)
>
link
SlidesLive Video
|
Xian Yeow Lee · Soumik Sarkar
🔗
|
-
|
Robust Robotic Control from Pixels using Contrastive Recurrent State-Space Models
(
Poster
)
>
link
SlidesLive Video
|
Nitish Srivastava · Walter Talbott · Shuangfei Zhai · Joshua Susskind
🔗
|
-
|
Component Transfer Learning for Deep RL Based on Abstract Representations
(
Poster
)
>
link
SlidesLive Video
|
Geoffrey Driessel · Vincent Francois-Lavet
🔗
|
-
|
ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives
(
Poster
)
>
link
SlidesLive Video
|
Toshinori Kitamura · Ryo Yonetani
🔗
|
-
|
HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation
(
Poster
)
>
link
|
Boyan Li · Hongyao Tang · YAN ZHENG · Jianye Hao · Pengyi Li · Zhaopeng Meng · LI Wang
🔗
|
-
|
Behavioral Priors and Dynamics Models: Improving Performance and Domain Transfer in Offline RL
(
Poster
)
>
link
|
Catherine Cang · Aravind Rajeswaran · Pieter Abbeel · Misha Laskin
🔗
|
-
|
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Jason Yecheng Ma · Andrew Shen · Osbert Bastani · Dinesh Jayaraman
🔗
|
-
|
Math Programming based Reinforcement Learning for Multi-Echelon Inventory Management
(
Poster
)
>
link
SlidesLive Video
|
Pavithra Harsha · Ashish Jagmohan · Jayant Kalagnanam · Brian Quanz · Divya Singhvi
🔗
|
-
|
Implicitly Regularized RL with Implicit Q-values
(
Poster
)
>
link
SlidesLive Video
|
Nino Vieillard · Marcin Andrychowicz · Anton Raichuk · Olivier Pietquin · Matthieu Geist
🔗
|
-
|
Towards Automatic Actor-Critic Solutions to Continuous Control
(
Poster
)
>
link
SlidesLive Video
|
Jake Grigsby · Jin Yong Yoo · Yanjun Qi
🔗
|
-
|
Transferring Dexterous Manipulation from GPU Simulation to a Remote Real-World Trifinger
(
Poster
)
>
link
SlidesLive Video
|
Arthur Allshire · Mayank Mittal · Varun Lodaya · Viktor Makoviychuk · Denys Makoviichuk · Felix Widmaier · Manuel Wuethrich · Stefan Bauer · Ankur Handa · Animesh Garg
🔗
|
-
|
Hierarchical Few-Shot Imitation with Skill Transition Models
(
Poster
)
>
link
SlidesLive Video
|
Kourosh Hakhamaneshi · Ruihan Zhao · Albert Zhan · Pieter Abbeel · Misha Laskin
🔗
|
-
|
Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives
(
Poster
)
>
link
SlidesLive Video
|
Murtaza Dalal · Deepak Pathak · Russ Salakhutdinov
🔗
|
-
|
Who Is the Strongest Enemy? Towards Optimal and Efficient Evasion Attacks in Deep RL
(
Poster
)
>
link
SlidesLive Video
|
Yanchao Sun · Ruijie Zheng · Yongyuan Liang · Furong Huang
🔗
|
-
|
Automatic Curricula via Expert Demonstrations
(
Poster
)
>
link
SlidesLive Video
|
Siyu Dai · Andreas Hofmann · Brian Williams
🔗
|
-
|
Mastering Visual Continuous Control: Improved Data-Augmented Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Denis Yarats · Rob Fergus · Alessandro Lazaric · Lerrel Pinto
🔗
|
-
|
Benchmarking the Spectrum of Agent Capabilities
(
Poster
)
>
link
|
Danijar Hafner
🔗
|
-
|
Policy Optimization via Optimal Policy Evaluation
(
Poster
)
>
link
SlidesLive Video
|
Alberto Maria Metelli · Samuele Meta · Marcello Restelli
🔗
|
-
|
A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Mingde Zhao · Zhen Liu · Sitao Luan · Shuyuan Zhang · Doina Precup · Yoshua Bengio
🔗
|
-
|
Discriminator Augmented Model-Based Reinforcement Learning
(
Poster
)
>
link
SlidesLive Video
|
Allan Zhou · Archit Sharma · Chelsea Finn
🔗
|