firstbacksecondback
584 Results
Poster
|
Wed 9:00 |
Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation Cansu Sancaktar · Sebastian Blaes · Georg Martius |
|
Workshop
|
MAESTRO: Open-Ended Environment Design for Multi-Agent Reinforcement Learning Mikayel Samvelyan · Akbir Khan · Michael Dennis · Minqi Jiang · Jack Parker-Holder · Jakob Foerster · Roberta Raileanu · Tim Rocktäschel |
||
Workshop
|
Offline Model-Based Reinforcement Learning for Tokamak Control Ian Char · Joseph Abbate · Laszlo Bardoczi · Mark Boyer · Youngseog Chung · Rory Conlin · Keith Erickson · Viraj Mehta · Nathan Richner · Egemen Kolemen · Jeff Schneider |
||
Poster
|
Tue 14:00 |
RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning Marc Rigter · Bruno Lacerda · Nick Hawes |
|
Workshop
|
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc Bellemare · Aaron Courville |
||
Poster
|
Meta-Reward-Net: Implicitly Differentiable Reward Learning for Preference-based Reinforcement Learning Runze Liu · Fengshuo Bai · Yali Du · Yaodong Yang |
||
Workshop
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
||
Poster
|
Tue 9:00 |
A Unified Framework for Deep Symbolic Regression Mikel Landajuela · Chak Shing Lee · Jiachen Yang · Ruben Glatt · Claudio P Santiago · Ignacio Aravena · Terrell Mundhenk · Garrett Mulcahy · Brenden K Petersen |
|
Workshop
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
||
Poster
|
Thu 9:00 |
Learning General World Models in a Handful of Reward-Free Deployments Yingchen Xu · Jack Parker-Holder · Aldo Pacchiano · Philip Ball · Oleh Rybkin · S Roberts · Tim Rocktäschel · Edward Grefenstette |
|
Poster
|
Wed 14:00 |
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP Fan Chen · Junyu Zhang · Zaiwen Wen |
|
Poster
|
Thu 14:00 |
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance Can Chang · Ni Mu · Jiajun Wu · Ling Pan · Huazhe Xu |