Search All 2024 Events
496 Results

Poster
Wed 11:00 RA-PbRL: Provably Efficient Risk-Aware Preference-Based Reinforcement Learning
Yujie Zhao · Jose Aguilar Escamilla · Weyl Lu · Huazheng Wang
Workshop
Learning Robust Representations for Transfer in Reinforcement Learning
Faisal Ahmed Abdelrahman Mohamed · Roger Creus Castanyer · Hongyao Tang · Zahra Sheikhbahaee · Glen Berseth
Workshop
Efficient Design-and-Control Automation with Reinforcement Learning and Adaptive Exploration
Jiajun Fan · Hongyao Tang · Michael Przystupa · Mariano Phielipp · Santiago Miret · Glen Berseth
Poster
Wed 11:00 Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
Cameron Allen · Aaron Kirtland · Ruo Yu Tao · Sam Lobel · Daniel Scott · Nicholas Petrocelli · Omer Gottesman · Ronald Parr · Michael Littman · George Konidaris
Workshop
Honesty to Subterfuge: In-Context Reinforcement Learning Can Make Honest Models Reward Hack
Leo McKee-Reid · Christoph Sträter · Maria Martinez · Joe Needham · Mikita Balesni
Poster
Fri 16:30 Offline Oracle-Efficient Learning for Contextual MDPs via Layerwise Exploration-Exploitation Tradeoff
Jian Qian · Haichen Hu · David Simchi-Levi
Workshop
Convergence Rates of Bayesian Network Policy Gradient for Cooperative Multi-Agent Reinforcement Learning
Dingyang Chen · Zhenyu Zhang · Xiaolong Kuang · Xinyang Shen · Ozalp Ozer · Qi Zhang
Workshop
Optimizing Reward Models with Proximal Policy Exploration in Preference-Based Reinforcement Learning
Yiwen Zhu · Jinyi Liu · Yifu Yuan · Wenya Wei · Zhenxing Ge · Qianyi Fu · Zhou Fang · Yujing Hu · Bo An
Poster
Wed 11:00 BECAUSE: Bilinear Causal Representation for Generalizable Offline Model-based Reinforcement Learning
Haohong Lin · Wenhao Ding · Jian Chen · Laixi Shi · Jiacheng Zhu · Bo Li · Ding Zhao
Poster
Wed 11:00 NetworkGym: Reinforcement Learning Environments for Multi-Access Traffic Management in Network Simulation
Momin Haider · Ming Yin · Menglei Zhang · Arpit Gupta · Jing Zhu · Yu-Xiang Wang
Poster
Fri 11:00 The Ladder in Chaos: Improving Policy Learning by Harnessing the Parameter Evolving Path in A Low-dimensional Space
Hongyao Tang · Min Zhang · Chen Chen · Jianye Hao
Poster
Fri 16:30 Learning General Parameterized Policies for Infinite Horizon Average Reward Constrained MDPs via Primal-Dual Policy Gradient Algorithm
Qinbo Bai · Washim Mondal · Vaneet Aggarwal