Poster
|
Wed 11:00
|
RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning
Yujie Zhao · Jose Aguilar Escamilla · Weyl Lu · Huazheng Wang
|
|
Workshop
|
|
Learning Robust Representations for Transfer in Reinforcement Learning
Faisal Ahmed Abdelrahman Mohamed · Roger Creus Castanyer · Hongyao Tang · Zahra Sheikhbahaee · Glen Berseth
|
|
Workshop
|
|
Efficient Design-and-Control Automation with Reinforcement Learning and Adaptive Exploration
Jiajun Fan · Hongyao Tang · Michael Przystupa · Mariano Phielipp · Santiago Miret · Glen Berseth
|
|
Poster
|
Wed 11:00
|
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen · Aaron Kirtland · Ruo Yu Tao · Sam Lobel · Daniel Scott · Nicholas Petrocelli · Omer Gottesman · Ronald Parr · Michael Littman · George Konidaris
|
|
Workshop
|
|
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack
Leo McKee-Reid · Christoph Sträter · Maria Martinez · Joe Needham · Mikita Balesni
|
|
Poster
|
Fri 16:30
|
Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
Jian Qian · Haichen Hu · David Simchi-Levi
|
|
Workshop
|
|
Convergence Rates of Bayesian Network Policy Gradient for Cooperative Multi-Agent Reinforcement Learning
Dingyang Chen · Zhenyu Zhang · Xiaolong Kuang · Xinyang Shen · Ozalp Ozer · Qi Zhang
|
|
Workshop
|
|
Optimizing Reward Models with Proximal Policy Exploration in Preference-Based Reinforcement Learning
Yiwen Zhu · Jinyi Liu · Yifu Yuan · Wenya Wei · Zhenxing Ge · Qianyi Fu · Zhou Fang · Yujing Hu · Bo An
|
|
Poster
|
Wed 11:00
|
BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Haohong Lin · Wenhao Ding · Jian Chen · Laixi Shi · Jiacheng Zhu · Bo Li · Ding Zhao
|
|
Poster
|
Wed 11:00
|
NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation
Momin Haider · Ming Yin · Menglei Zhang · Arpit Gupta · Jing Zhu · Yu-Xiang Wang
|
|
Poster
|
Fri 11:00
|
The Ladder in Chaos: Improving Policy Learning by Harnessing the Parameter Evolving Path in A Low-dimensional Space
Hongyao Tang · Min Zhang · Chen Chen · Jianye Hao
|
|
Poster
|
Fri 16:30
|
Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm
Qinbo Bai · Washim Mondal · Vaneet Aggarwal
|
|