firstbacksecondback
643 Results
Poster
|
Wed 8:45 |
Direct Preference-based Policy Optimization without Reward Modeling Gaon An · Junhyeok Lee · Xingdong Zuo · Norio Kosaka · Kyung-Min Kim · Hyun Oh Song |
|
Workshop
|
Bridging State and History Representations: Understanding Self-Predictive RL Tianwei Ni · Benjamin Eysenbach · Erfan Seyedsalehi · Michel Ma · Clement Gehring · Aditya Mahajan · Pierre-Luc Bacon |
||
Workshop
|
Sat 12:05 |
Addressing Long-Horizon Tasks by Integrating Program Synthesis and State Machines Yu-An Lin · Chen-Tao Lee · Guan-Ting Liu · Pu-Jen Cheng · Shao-Hua Sun |
|
Workshop
|
Vision-Language Models as a Source of Rewards Harris Chan · Volodymyr Mnih · Feryal Behbahani · Michael Laskin · Luyu Wang · Fabio Pardo · Maxime Gazeau · Himanshu Sahni · Daniel Horgan · Kate Baumli · Yannick Schroecker · Stephen Spencer · Richie Steigerwald · John Quan · Gheorghe Comanici · Sebastian Flennerhag · Alexander Neitz · Lei Zhang · Tom Schaul · Satinder Singh · Clare Lyle · Tim Rocktäschel · Jack Parker-Holder · Kristian Holsheimer |
||
Workshop
|
Relating Goal and Environmental Complexity for Improved Task Transfer: Initial Results Sunandita Patra · Paul Rademacher · Kristen Jacobson · Kyle Hassold · Onur Kulaksizoglu · Laura Hiatt · Mark Roberts · Dana Nau |
||
Workshop
|
Sat 9:36 |
[Paper-Oral 7] MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning Dong-Ki Kim · Sungryull Sohn · Lajanugen Logeswaran · Dongsub Shim · Honglak Lee |
|
Poster
|
Thu 15:00 |
Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning Mathias Lechner · lianhao yin · Tim Seyde · Tsun-Hsuan Johnson Wang · Wei Xiao · Ramin Hasani · Joshua Rountree · Daniela Rus |
|
Workshop
|
Sat 9:15 |
Hierarchical Reinforcement Learning with AI Planning Models Junkyu Lee · Michael Katz · Don Joven Agravante · Miao Liu · Geraud Nangue Tasse · Tim Klinger · Shirin Sohrabi Araghi |
|
Poster
|
Thu 8:45 |
Unified Off-Policy Learning to Rank: a Reinforcement Learning Perspective Zeyu Zhang · Yi Su · Hui Yuan · Yiran Wu · Rishab Balasubramanian · Qingyun Wu · Huazheng Wang · Mengdi Wang |
|
Poster
|
Wed 8:45 |
Describe, Explain, Plan and Select: Interactive Planning with LLMs Enables Open-World Multi-Task Agents Zihao Wang · Shaofei Cai · Guanzhou Chen · Anji Liu · Xiaojian (Shawn) Ma · Yitao Liang |
|
Poster
|
Wed 15:00 |
On Imitation in Mean-field Games Giorgia Ramponi · Pavel Kolev · Olivier Pietquin · Niao He · Mathieu Lauriere · Matthieu Geist |
|
Poster
|
Thu 8:45 |
Distributional Pareto-Optimal Multi-Objective Reinforcement Learning Xin-Qiang Cai · Pushi Zhang · Li Zhao · Jiang Bian · Masashi Sugiyama · Ashley Llorens |