591 Results
Poster
|
Tue 9:00 |
Continuous MDP Homomorphisms and Homomorphic Policy Gradient Sahand Rezaei-Shoshtari · Rosie Zhao · Prakash Panangaden · David Meger · Doina Precup |
|
Poster
|
Tue 14:00 |
Explain My Surprise: Learning Efficient Long-Term Memory by predicting uncertain outcomes Artyom Sorokin · Nazar Buzun · Leonid Pugachev · Mikhail Burtsev |
|
Workshop
|
Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization Kaixuan Huang · Yu Wu · Xuezhou Zhang · Shenyinying Tu · Qingyun Wu · Mengdi Wang · Huazheng Wang |
||
Poster
|
Wed 9:00 |
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation Cansu Sancaktar · Sebastian Blaes · Georg Martius |
|
Poster
|
Thu 9:00 |
IMED-RL: Regret optimal learning of ergodic Markov decision processes Fabien Pesquerel · Odalric-Ambrym Maillard |
|
Workshop
|
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning Mikayel Samvelyan · Akbir Khan · Michael Dennis · Minqi Jiang · Jack Parker-Holder · Jakob Foerster · Roberta Raileanu · Tim Rocktäschel |
||
Poster
|
Wed 14:00 |
NeoRL: A Near Real-World Benchmark for Offline Reinforcement Learning Rong-Jun Qin · Xingyuan Zhang · Songyi Gao · Xiong-Hui Chen · Zewen Li · Weinan Zhang · Yang Yu |
|
Poster
|
Tue 14:00 |
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge Linxi Fan · Guanzhi Wang · Yunfan Jiang · Ajay Mandlekar · Yuncong Yang · Haoyi Zhu · Andrew Tang · De-An Huang · Yuke Zhu · Anima Anandkumar |
|
Workshop
|
Offline Model-Based Reinforcement Learning for Tokamak Control Ian Char · Joseph Abbate · Laszlo Bardoczi · Mark Boyer · Youngseog Chung · Rory Conlin · Keith Erickson · Viraj Mehta · Nathan Richner · Egemen Kolemen · Jeff Schneider |
||
Poster
|
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning Runze Liu · Fengshuo Bai · Yali Du · Yaodong Yang |
||
Poster
|
Tue 14:00 |
RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning Marc Rigter · Bruno Lacerda · Nick Hawes |