Poster
|
Fri 11:00
|
Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
Miles Hutson · Isaac Kauvar · Nick Haber
|
|
Poster
|
Wed 16:30
|
Off-policy estimation with adaptively collected data: the power of online learning
Jeonghwan Lee · Cong Ma
|
|
Poster
|
Fri 11:00
|
Logarithmic Smoothing for Pessimistic Off-Policy Evaluation, Selection and Learning
Otmane Sakhi · Imad Aouali · Pierre Alquier · Nicolas Chopin
|
|
Poster
|
Thu 11:00
|
On the Curses of Future and History in Future-dependent Value Functions for Off-policy Evaluation
Yuheng Zhang · Nan Jiang
|
|
Affinity Event
|
|
MLSherlock: An Audit Lens for Policy-Compliance Machine Learning Systems
Ismat Jarin · Tu Le · Athina Markopoulou
|
|
Poster
|
Fri 11:00
|
RL in Latent MDPs is Tractable: Online Guarantees via Off-Policy Evaluation
Jeongyeol Kwon · Shie Mannor · Constantine Caramanis · Yonathan Efroni
|
|
Poster
|
Wed 16:30
|
Off-Policy Selection for Initiating Human-Centric Experimental Design
Ge Gao · Xi Yang · Qitong Gao · Song Ju · Miroslav Pajic · Min Chi
|
|
Poster
|
Fri 16:30
|
Improved off-policy training of diffusion samplers
Marcin Sendera · Minsu Kim · Sarthak Mittal · Pablo Lemos · Luca Scimeca · Jarrid Rector-Brooks · Alexandre Adam · Yoshua Bengio · Nikolay Malkin
|
|
Poster
|
|
Efficient Recurrent Off-Policy RL Requires a Context-Encoder-Specific Learning Rate
Fan-Ming Luo · Zuolin Tu · Zefang Huang · Yang Yu
|
|
Poster
|
Wed 16:30
|
Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes
Andrew Bennett · Nathan Kallus · Miruna Oprescu · Wen Sun · Kaiwen Wang
|
|
Poster
|
Wed 16:30
|
RL-GPT: Integrating Reinforcement Learning and Code-as-policy
Shaoteng Liu · Haoqi Yuan · Minda Hu · Yanwei Li · Yukang Chen · Shu Liu · Zongqing Lu · Jiaya Jia
|
|
Poster
|
Wed 16:30
|
Rethinking Model-based, Policy-based, and Value-based Reinforcement Learning via the Lens of Representation Complexity
Guhao Feng · Han Zhong
|
|