NeurIPS 2022

Workshop

Hybrid RL: Using both offline and online data can make RL efficient
Yuda Song · Yifei Zhou · Ayush Sekhari · J. Bagnell · Akshay Krishnamurthy · Wen Sun

Workshop

State Advantage Weighting for Offline RL
Jiafei Lyu · aicheng Gong · Le Wan · Zongqing Lu · Xiu Li

Workshop

Offline evaluation in RL: soft stability weighting to combine fitted Q-learning and model-based methods
Briton Park · Xian Wu · Bin Yu · Angela Zhou

Workshop

Using Confounded Data in Offline RL
Maxime Gasse · Damien GRASSET · Guillaume Gaudron · Pierre-Yves Oudeyer

Workshop

Simplifying Model-based RL: Learning Representations, Latent-space Models, and Policies with One Objective
Raj Ghugare · Homanga Bharadhwaj · Benjamin Eysenbach · Sergey Levine · Ruslan Salakhutdinov

Poster

Wed 9:00

Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds
Joshua Albrecht · Abraham Fetterman · Bryden Fogelman · Ellie Kitanidis · Bartosz Wróblewski · Nicole Seo · Michael Rosenthal · Maksis Knutins · Zack Polizzi · James Simon · Kanjun Qiu

Workshop

The Paradox of Choice: On the Role of Attention in Hierarchical Reinforcement Learning
Andrei Nica · Khimya Khetarpal · Doina Precup

Poster

Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen · Tianhao Wu · Shengjie Wang · Xidong Feng · Jiechuan Jiang · Zongqing Lu · Stephen McAleer · Hao Dong · Song-Chun Zhu · Yaodong Yang

Poster

Thu 14:00

Dungeons and Data: A Large-Scale NetHack Dataset
Eric Hambro · Roberta Raileanu · Danielle Rothermel · Vegard Mella · Tim Rocktäschel · Heinrich Küttler · Naila Murray

Poster

Tue 9:00

Provably sample-efficient RL with side information about latent dynamics
Yao Liu · Dipendra Misra · Miro Dudik · Robert Schapire

Poster

Wed 9:00

Left Heavy Tails and the Effectiveness of the Policy and Value Networks in DNN-based best-first search for Sokoban Planning
Dieqiao Feng · Carla Gomes · Bart Selman

Poster

Discrete Compositional Representations as an Abstraction for Goal Conditioned Reinforcement Learning
Riashat Islam · Hongyu Zang · Anirudh Goyal · Alex Lamb · Kenji Kawaguchi · Xin Li · Romain Laroche · Yoshua Bengio · Remi Tachet des Combes

Main Navigation

100 Results