firstbacksecondback
496 Results
Poster
|
Wed 16:30 |
Near-Optimal Distributionally Robust Reinforcement Learning with General Lp Norms Pierre Clavier · Laixi Shi · Erwan Le Pennec · Eric Mazumdar · Adam Wierman · Matthieu Geist |
|
Workshop
|
OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code Maxence Faldor · Jenny Zhang · Antoine Cully · Jeff Clune |
||
Poster
|
Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer Ke Xue · Ruo-Tong Chen · Xi Lin · Yunqi Shi · Shixiong Kai · Siyuan Xu · Chao Qian |
||
Poster
|
Fri 16:30 |
Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning Hao Ma · Tianyi Hu · Zhiqiang Pu · Liu Boyin · Xiaolin Ai · Yanyan Liang · Min Chen |
|
Workshop
|
Using adaptive intrinsic motivation in RL to model learning across development Kai Sandbrink · Brian Christian · Linas Nasvytis · Christian Schroeder de Witt · Patrick Butlin |
||
Poster
|
Wed 16:30 |
Multi-turn Reinforcement Learning with Preference Human Feedback Lior Shani · Aviv Rosenberg · Asaf Cassel · Oran Lang · Daniele Calandriello · Avital Zipori · Hila Noga · Orgad Keller · Bilal Piot · Idan Szpektor · Avinatan Hassidim · Yossi Matias · Remi Munos |
|
Poster
|
Thu 11:00 |
Pre-Trained Multi-Goal Transformers with Prompt Optimization for Efficient Online Adaptation Haoqi Yuan · Yuhui Fu · Feiyang Xie · Zongqing Lu |
|
Poster
|
Fri 11:00 |
Bigger, Regularized, Optimistic: scaling for compute and sample efficient continuous control Michal Nauman · Mateusz Ostaszewski · Krzysztof Jankowski · Piotr Miłoś · Marek Cygan |
|
Poster
|
Wed 11:00 |
Diffusion-Reward Adversarial Imitation Learning Chun-Mao Lai · Hsiang-Chun Wang · Ping-Chun Hsieh · Frank Wang · Min-Hung Chen · Shao-Hua Sun |
|
Poster
|
Thu 16:30 |
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model Mark Rowland · Kevin Li · Remi Munos · Clare Lyle · Yunhao Tang · Will Dabney |
|
Workshop
|
Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning Bryan Lincoln Marques de Oliveira · Bruno Brandão · Murilo da Luz · Luana Guedes Barros Martins · Telma de Lima Soares · Luckeciano Carvalho Melo |
||
Poster
|
Fri 16:30 |
Randomized Exploration for Reinforcement Learning with Multinomial Logistic Function Approximation Wooseong Cho · Taehyun Hwang · Joongkyu Lee · Min-hwan Oh |