100 Results
Workshop
|
Hybrid RL: Using both offline and online data can make RL efficient Yuda Song · Yifei Zhou · Ayush Sekhari · J. Bagnell · Akshay Krishnamurthy · Wen Sun |
||
Workshop
|
State Advantage Weighting for Offline RL Jiafei Lyu · Aicheng Gong · Le Wan · Zongqing Lu · Xiu Li |
||
Workshop
|
Offline evaluation in RL: soft stability weighting to combine fitted Q-learning and model-based methods Briton Park · Xian Wu · Bin Yu · Angela Zhou |
||
Workshop
|
Using Confounded Data in Offline RL Maxime Gasse · Damien GRASSET · Guillaume Gaudron · Pierre-Yves Oudeyer |
||
Workshop
|
Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective Raj Ghugare · Homanga Bharadhwaj · Benjamin Eysenbach · Sergey Levine · Ruslan Salakhutdinov |
||
Poster
|
Wed 9:00 |
Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds Joshua Albrecht · Abraham Fetterman · Bryden Fogelman · Ellie Kitanidis · Bartosz Wróblewski · Nicole Seo · Michael Rosenthal · Maksis Knutins · Zack Polizzi · James Simon · Kanjun Qiu |
|
Workshop
|
The Paradox of Choice: On the Role of Attention in Hierarchical Reinforcement Learning Andrei Nica · Khimya Khetarpal · Doina Precup |
||
Poster
|
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Yuanpei Chen · Tianhao Wu · Shengjie Wang · Xidong Feng · Jiechuan Jiang · Zongqing Lu · Stephen McAleer · Hao Dong · Song-Chun Zhu · Yaodong Yang |
||
Poster
|
Thu 14:00 |
Dungeons and Data: A Large-Scale NetHack Dataset Eric Hambro · Roberta Raileanu · Danielle Rothermel · Vegard Mella · Tim Rocktäschel · Heinrich Küttler · Naila Murray |
|
Poster
|
Tue 9:00 |
Provably sample-efficient RL with side information about latent dynamics Yao Liu · Dipendra Misra · Miro Dudik · Robert Schapire |
|
Poster
|
Wed 9:00 |
Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning Dieqiao Feng · Carla Gomes · Bart Selman |
|
Poster
|
Discrete Compositional Representations as an Abstraction for Goal Conditioned Reinforcement Learning Riashat Islam · Hongyu Zang · Anirudh Goyal · Alex Lamb · Kenji Kawaguchi · Xin Li · Romain Laroche · Yoshua Bengio · Remi Tachet des Combes |