firstbacksecondback
Filter by Keyword:
590 Results
Spotlight
|
Offline Reinforcement Learning as One Big Sequence Modeling Problem Michael Janner · Qiyang Li · Sergey Levine |
||
Workshop
|
SURF: Semi-supervised Reward Learning with Data Augmentation for Feedback-efficient Preference-based Reinforcement Learning Jongjin Park · Younggyo Seo · Jinwoo Shin · Honglak Lee · Pieter Abbeel · Kimin Lee |
||
Workshop
|
Execute Order 66: Targeted Data Poisoning for Reinforcement Learning via Minuscule Perturbations Harrison Foley · Liam Fowl · Tom Goldstein · Gavin Taylor |
||
Datasets and Benchmarks
|
URLB: Unsupervised Reinforcement Learning Benchmark Misha Laskin · Denis Yarats · Hao Liu · Kimin Lee · Albert Zhan · Kevin Lu · Catherine Cang · Lerrel Pinto · Pieter Abbeel |
||
Workshop
|
URLB: Unsupervised Reinforcement Learning Benchmark Misha Laskin · Denis Yarats · Hao Liu · Kimin Lee · Albert Zhan · Kevin Lu · Catherine Cang · Lerrel Pinto · Pieter Abbeel |
||
Workshop
|
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning Jason Yecheng Ma · Andrew Shen · Osbert Bastani · Dinesh Jayaraman |
||
Workshop
|
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning Clément Bonnet · Paul Caron · Thomas D Barrett · Ian Davies · Alexandre Laterre |
||
Workshop
|
Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning Videh Nema · Balaraman Ravindran |
||
Workshop
|
Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning Videh Nema · Balaraman Ravindran |
||
Workshop
|
Interactive Robust Policy Optimization for Multi-Agent Reinforcement Learning Videh Nema · Balaraman Ravindran |
||
Spotlight
|
Learning MDPs from Features: Predict-Then-Optimize for Sequential Decision Making by Reinforcement Learning Kai Wang · Sanket Shah · Haipeng Chen · Andrew Perrault · Finale Doshi-Velez · Milind Tambe |
||
Workshop
|
Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning Jason Yecheng Ma · Andrew Shen · Osbert Bastani · Dinesh Jayaraman |