135 Results
Poster
|
Wed 11:00 |
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforcement Learning Ruoqi Zhang · Ziwei Luo · Jens Sjölund · Thomas Schön · Per Mattsson |
|
Poster
|
Wed 11:00 |
Variational Distillation of Diffusion Policies into Mixture of Experts Hongyi Zhou · Denis Blessing · Ge Li · Onur Celik · Xiaogang Jia · Gerhard Neumann · Rudolf Lioutikov |
|
Poster
|
Wed 16:30 |
Constrained Latent Action Policies for Model-Based Offline Reinforcement Learning Marvin Alles · Philip Becker-Ehmck · Patrick van der Smagt · Maximilian Karl |
|
Poster
|
Thu 16:30 |
Scalable Constrained Policy Optimization for Safe Multi-agent Reinforcement Learning Lijun Zhang · Lin Li · Wei Wei · Huizhong Song · Yaodong Yang · Jiye Liang |
|
Poster
|
Fri 11:00 |
Meta-Reinforcement Learning with Universal Policy Adaptation: Provable Near-Optimality under All-task Optimum Comparator Siyuan Xu · Minghui Zhu |
|
Poster
|
Thu 11:00 |
Graph Diffusion Policy Optimization Yijing Liu · Chao Du · Tianyu Pang · Chongxuan LI · Min Lin · Wei Chen |
|
Poster
|
Thu 11:00 |
Discovering Creative Behaviors through DUPLEX: Diverse Universal Features for Policy Exploration Borja G. Leon · Francesco Riccio · Kaushik Subramanian · Peter Wurman · Peter Stone |
|
Poster
|
Wed 16:30 |
Policy Learning from Tutorial Books via Understanding, Rehearsing and Introspecting Xiong-Hui Chen · Ziyan Wang · Yali Du · Shengyi Jiang · Meng Fang · Yang Yu · Jun Wang |
|
Poster
|
Efficient Multi-task Reinforcement Learning with Cross-Task Policy Guidance Jinmin He · Kai Li · Yifan Zang · Haobo Fu · Qiang Fu · Junliang Xing · Jian Cheng |
|
Poster
|
Wed 16:30 |
Policy Aggregation Parand A. Alamdari · Soroush Ebadian · Ariel Procaccia |
|
Poster
|
Thu 11:00 |
Reinforcing LLM Agents via Policy Optimization with Action Decomposition Muning Wen · Ziyu Wan · Jun Wang · Weinan Zhang · Ying Wen |
|
Poster
|
Thu 11:00 |
Efficient Policy Evaluation Across Multiple Different Experimental Datasets Yonghan Jung · Alexis Bellot |