701 Results
Poster
|
Tue 14:00 |
RAMBO-RL: Robust Adversarial Model-Based Offline Reinforcement Learning Marc Rigter · Bruno Lacerda · Nick Hawes |
|
Workshop
|
Offline Model-Based Reinforcement Learning for Tokamak Control Ian Char · Joseph Abbate · Laszlo Bardoczi · Mark Boyer · Youngseog Chung · Rory Conlin · Keith Erickson · Viraj Mehta · Nathan Richner · Egemen Kolemen · Jeff Schneider |
||
Workshop
|
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc Bellemare · Aaron Courville |
||
Workshop
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning Zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
||
Poster
|
Tue 9:00 |
A Unified Framework for Deep Symbolic Regression Mikel Landajuela · Chak Shing Lee · Jiachen Yang · Ruben Glatt · Claudio P Santiago · Ignacio Aravena · Terrell Mundhenk · Garrett Mulcahy · Brenden K Petersen |
|
Poster
|
Thu 9:00 |
Learning General World Models in a Handful of Reward-Free Deployments Yingchen Xu · Jack Parker-Holder · Aldo Pacchiano · Philip Ball · Oleh Rybkin · S Roberts · Tim Rocktäschel · Edward Grefenstette |
|
Workshop
|
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry |
||
Poster
|
Thu 14:00 |
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance Can Chang · Ni Mu · Jiajun Wu · Ling Pan · Huazhe Xu |
|
Poster
|
Tue 14:00 |
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models Rui Miao · Zhengling Qi · Xiaoke Zhang |
|
Poster
|
Wed 14:00 |
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP Fan Chen · Junyu Zhang · Zaiwen Wen |
|