firstbacksecondback
29 Results
Workshop
|
Eureka: Human-Level Reward Design via Coding Large Language Models Jason Ma · William Liang · Guanzhi Wang · De-An Huang · Osbert Bastani · Dinesh Jayaraman · Yuke Zhu · Linxi Fan · Animashree Anandkumar |
||
Workshop
|
Contextual Pre-Planning on Reward Machine Abstractions for Enhanced Transfer in Deep Reinforcement Learning Guy Azran · Mohamad Hosein Danesh · Stefano Albrecht · Sarah Keren |
||
Workshop
|
Vision-Language Models as a Source of Rewards Harris Chan · Volodymyr Mnih · Feryal Behbahani · Michael Laskin · Luyu Wang · Fabio Pardo · Maxime Gazeau · Himanshu Sahni · Daniel Horgan · Kate Baumli · Yannick Schroecker · Stephen Spencer · Richie Steigerwald · John Quan · Gheorghe Comanici · Sebastian Flennerhag · Alexander Neitz · Lei Zhang · Tom Schaul · Satinder Singh · Clare Lyle · Tim Rocktäschel · Jack Parker-Holder · Kristian Holsheimer |
||
Workshop
|
Regularity as Intrinsic Reward for Free Play Cansu Sancaktar · Justus Piater · Georg Martius |
||
Poster
|
Wed 8:45 |
Direct Preference-based Policy Optimization without Reward Modeling Gaon An · Junhyeok Lee · Xingdong Zuo · Norio Kosaka · Kyung-Min Kim · Hyun Oh Song |