firstbacksecondback
790 Results
Poster
|
Wed 15:00 |
Learning to Modulate pre-trained Models in RL Thomas Schmied · Markus Hofmarcher · Fabian Paischer · Razvan Pascanu · Sepp Hochreiter |
|
Poster
|
Tue 8:45 |
Provably Efficient Algorithm for Nonstationary Low-Rank MDPs Yuan Cheng · Jing Yang · Yingbin Liang |
|
Poster
|
Wed 15:00 |
Evaluating Cognitive Maps and Planning in Large Language Models with CogEval Ida Momennejad · Hosein Hasanbeig · Felipe Vieira Frujeri · Hiteshi Sharma · Nebojsa Jojic · Hamid Palangi · Robert Ness · Jonathan Larson |
|
Poster
|
Wed 8:45 |
Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning Mitsuhiko Nakamoto · Simon Zhai · Anikait Singh · Max Sobol Mark · Yi Ma · Chelsea Finn · Aviral Kumar · Sergey Levine |
|
Poster
|
Wed 8:45 |
Multi-Player Zero-Sum Markov Games with Networked Separable Interactions Chanwoo Park · Kaiqing Zhang · Asuman Ozdaglar |
|
Poster
|
Tue 8:45 |
Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals Yue Wu · Yewen Fan · Paul Pu Liang · Amos Azaria · Yuanzhi Li · Tom Mitchell |
|
Poster
|
Thu 15:00 |
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment Tianwei Ni · Michel Ma · Benjamin Eysenbach · Pierre-Luc Bacon |
|
Workshop
|
Sat 9:05 |
Cooperative AI via Decentralized Commitment Devices Xyn Sun · Davide Crapis · Matt Stephenson · Jonathan Passerat-Palmbach |
|
Workshop
|
Sat 13:50 |
Robustness to Multi-Modal Environment Uncertainty in MARL using Curriculum Learning Aakriti Agrawal · Rohith Aralikatti · Yanchao Sun · Furong Huang |
|
Workshop
|
Cooperative AI via Decentralized Commitment Devices Xyn Sun · Davide Crapis · Matt Stephenson · Jonathan Passerat-Palmbach |
||
Workshop
|
Coupling Semi-supervised Learning with Reinforcement Learning for Better Decision Making --- An application to Cryo-EM Data Collection Ziping Xu · Quanfu Fan · Yilai Li · Emma Lee · john cohn · Ambuj Tewari · Seychelle Vos · Michael Cianfrocco |
||
Oral
|
Thu 13:20 |
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment Tianwei Ni · Michel Ma · Benjamin Eysenbach · Pierre-Luc Bacon |