firstbacksecondback
135 Results
Workshop
|
Faster, More Efficient RLHF through Off-Policy Asynchronous Learning Michael Noukhovitch · Shengyi Huang · Sophie Xhonneux · Arian Hosseini · Rishabh Agarwal · Aaron Courville |
||
Workshop
|
Improved Off-policy Reinforcement Learning in Biological Sequence Design Hyeonah Kim · Minsu Kim · Taeyoung Yun · Sanghyeok Choi · Emmanuel Bengio · Alex Hernandez-Garcia · Jinkyoo Park |
||
Poster
|
Fri 11:00 |
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning Miles Hutson · Isaac Kauvar · Nick Haber |
|
Poster
|
Wed 16:30 |
Off-Policy Selection for Initiating Human-Centric Experimental Design Ge Gao · Xi Yang · Qitong Gao · Song Ju · Miroslav Pajic · Min Chi |
|
Poster
|
Fri 11:00 |
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation Jeongyeol Kwon · Shie Mannor · Constantine Caramanis · Yonathan Efroni |
|
Poster
|
Fri 16:30 |
Improved off-policy training of diffusion samplers Marcin Sendera · Minsu Kim · Sarthak Mittal · Pablo Lemos · Luca Scimeca · Jarrid Rector-Brooks · Alexandre Adam · Yoshua Bengio · Nikolay Malkin |
|
Poster
|
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate Fan-Ming Luo · Zuolin Tu · Zefang Huang · Yang Yu |
||
Oral
|
Wed 15:30 |
RL-GPT: Integrating Reinforcement Learning and Code-as-policy Shaoteng Liu · Haoqi Yuan · Minda Hu · Yanwei Li · Yukang Chen · Shu Liu · Zongqing Lu · Jiaya Jia |
|
Poster
|
Wed 16:30 |
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes Andrew Bennett · Nathan Kallus · Miruna Oprescu · Wen Sun · Kaiwen Wang |
|
Poster
|
A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning Arthur Juliani · Jordan Ash |
||
Poster
|
Wed 11:00 |
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning Ruoqi Zhang · Ziwei Luo · Jens Sjölund · Thomas Schön · Per Mattsson |
|
Poster
|
Wed 11:00 |
Efficient Contextual LLM Cascades through Budget-Constrained Policy Learning Xuechen Zhang · Zijian Huang · Ege Onur Taga · Carlee Joe-Wong · Samet Oymak · Jiasi Chen |