790 Results
Workshop
|
Sat 14:43 |
Real-time Carbon Footprint Minimization in Sustainable Data Centers with Reinforcement Learning Soumyendu Sarkar · Avisek Naug · Ricardo Luna Gutierrez · Antonio Guillen-Perez · Vineet Gundecha · Ashwin Ramesh Babu |
|
Workshop
|
Discovering Quantum Circuits for Logical State Preparation with Deep Reinforcement Learning Remmy Zen · Jan Olle · Matteo Puviani · Florian Marquardt |
||
Poster
|
Tue 8:45 |
Optimal Convergence Rate for Exact Policy Mirror Descent in Discounted Markov Decision Processes Emmeran Johnson · Ciara Pike-Burke · Patrick Rebeschini |
|
Workshop
|
RAVL: Reach-Aware Value Learning for the Edge-of-Reach Problem in Offline Model-Based Reinforcement Learning Anya Sims · Cong Lu · Yee Whye Teh |
||
Workshop
|
Vision-Language Models as a Source of Rewards Harris Chan · Volodymyr Mnih · Feryal Behbahani · Michael Laskin · Luyu Wang · Fabio Pardo · Maxime Gazeau · Himanshu Sahni · Daniel Horgan · Kate Baumli · Yannick Schroecker · Stephen Spencer · Richie Steigerwald · John Quan · Gheorghe Comanici · Sebastian Flennerhag · Alexander Neitz · Lei Zhang · Tom Schaul · Satinder Singh · Clare Lyle · Tim Rocktäschel · Jack Parker-Holder · Kristian Holsheimer |
||
Poster
|
Wed 8:45 |
Direct Preference-based Policy Optimization without Reward Modeling Gaon An · Junhyeok Lee · Xingdong Zuo · Norio Kosaka · Kyung-Min Kim · Hyun Oh Song |
|
Workshop
|
Bridging State and History Representations: Understanding Self-Predictive RL Tianwei Ni · Benjamin Eysenbach · Erfan Seyedsalehi · Michel Ma · Clement Gehring · Aditya Mahajan · Pierre-Luc Bacon |
||
Workshop
|
Sat 12:05 |
Addressing Long-Horizon Tasks by Integrating Program Synthesis and State Machines Yu-An Lin · Chen-Tao Lee · Guan-Ting Liu · Pu-Jen Cheng · Shao-Hua Sun |
|
Workshop
|
Relating Goal and Environmental Complexity for Improved Task Transfer: Initial Results Sunandita Patra · Paul Rademacher · Kristen Jacobson · Kyle Hassold · Onur Kulaksizoglu · Laura Hiatt · Mark Roberts · Dana Nau |
||
Workshop
|
Sat 9:36 |
[Paper-Oral 7] MultiPrompter: Cooperative Prompt Optimization with Multi-Agent Reinforcement Learning Dong-Ki Kim · Sungryull Sohn · Lajanugen Logeswaran · Dongsub Shim · Honglak Lee |
|
Poster
|
Thu 15:00 |
Gigastep - One Billion Steps per Second Multi-agent Reinforcement Learning Mathias Lechner · Lianhao Yin · Tim Seyde · Tsun-Hsuan Johnson Wang · Wei Xiao · Ramin Hasani · Joshua Rountree · Daniela Rus |
|
Workshop
|
Sat 9:15 |
Hierarchical Reinforcement Learning with AI Planning Models Junkyu Lee · Michael Katz · Don Joven Agravante · Miao Liu · Geraud Nangue Tasse · Tim Klinger · Shirin Sohrabi Araghi |