591 Results
Workshop
|
Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier Pierluca D'Oro · Max Schwarzer · Evgenii Nikishin · Pierre-Luc Bacon · Marc Bellemare · Aaron Courville |
||
Workshop
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning Zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang |
||
Workshop
|
Safe Real-World Autonomous Driving by Learning to Predict and Plan with a Mixture of Experts Stefano Pini · Christian Perone · Aayush Ahuja · Ana Sofia Rufino Ferreira · Moritz Niendorf · Sergey Zagoruyko |
||
Poster
|
Tue 9:00 |
A Unified Framework for Deep Symbolic Regression Mikel Landajuela · Chak Shing Lee · Jiachen Yang · Ruben Glatt · Claudio P Santiago · Ignacio Aravena · Terrell Mundhenk · Garrett Mulcahy · Brenden K Petersen |
|
Poster
|
Thu 9:00 |
Learning General World Models in a Handful of Reward-Free Deployments Yingchen Xu · Jack Parker-Holder · Aldo Pacchiano · Philip Ball · Oleh Rybkin · S Roberts · Tim Rocktäschel · Edward Grefenstette |
|
Workshop
|
Multi-Agent Reinforcement Learning for Fast-Timescale Demand Response Vincent Mai · Philippe Maisonneuve · Tianyu Zhang · Jorge Montalvo Arvizu · Liam Paull · Antoine Lesage-Landry |
||
Poster
|
Thu 14:00 |
E-MAPP: Efficient Multi-Agent Reinforcement Learning with Parallel Program Guidance Can Chang · Ni Mu · Jiajun Wu · Ling Pan · Huazhe Xu |
|
Poster
|
Tue 14:00 |
Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models Rui Miao · Zhengling Qi · Xiaoke Zhang |
|
Poster
|
Tue 9:00 |
Multi-agent Dynamic Algorithm Configuration Ke Xue · Jiacheng Xu · Lei Yuan · Miqing Li · Chao Qian · Zongzhang Zhang · Yang Yu |
|
Affinity Workshop
|
Mon 8:25 |
Oral Presentation 7: Adapting the Function Approximation Architecture in Online Reinforcement Learning Fatima Davelouis |