Poster
|
Thu 9:00
|
A consistently adaptive trust-region method
Fadi Hamad · Oliver Hinder
|
|
Workshop
|
|
Fully Stochastic Trust-Region Sequential Quadratic Programming for Equality-Constrained Optimization Problems
Yuchen Fang · Sen Na · Mladen Kolar
|
|
Workshop
|
|
Trust-Region Sequential Quadratic Programming for Stochastic Optimization with Random Models: First-Order Stationarity
Yuchen Fang · Sen Na · Mladen Kolar
|
|
Poster
|
Wed 14:00
|
A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Fan Chen · Junyu Zhang · Zaiwen Wen
|
|
Poster
|
|
Learning to Constrain Policy Optimization with Virtual Trust Region
Thai Hung Le · Thommen Karimpanal George · Majid Abdolshah · Dung Nguyen · Kien Do · Sunil Gupta · Svetha Venkatesh
|
|
Poster
|
Thu 14:00
|
Off-Policy Evaluation with Policy-Dependent Optimization Response
Wenshuo Guo · Michael Jordan · Angela Zhou
|
|
Poster
|
Thu 14:00
|
Trust Region Policy Optimization with Optimal Transport Discrepancies: Duality and Algorithm for Continuous Actions
Antonio Terpin · Nicolas Lanzetti · Batuhan Yardim · Florian Dorfler · Giorgia Ramponi
|
|
Poster
|
Wed 9:00
|
Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe · Rei Sato · Kazuto Fukuchi · Jun Sakuma · Youhei Akimoto
|
|
Poster
|
Wed 14:00
|
Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach · Alexander Khazatsky · Sergey Levine · Russ Salakhutdinov
|
|
Poster
|
|
On the role of overparameterization in off-policy Temporal Difference learning with linear function approximation
Valentin Thomas
|
|
Poster
|
Wed 9:00
|
The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design
Ruoyu Cheng · Xianglong Lyu · Yang Li · Junjie Ye · Jianye Hao · Junchi Yan
|
|
Workshop
|
|
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning
zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang
|
|