Poster
|
Wed 9:00
|
DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh · I-Hong Hou
|
|
Workshop
|
Fri 8:20
|
Online Policy Optimization for Robust MDP
Jing Dong · Jingwei Li · Baoxiang Wang · Jingzhao Zhang
|
|
Poster
|
Thu 14:00
|
Batch size-invariance for policy optimization
Jacob Hilton · Karl Cobbe · John Schulman
|
|
Poster
|
Thu 9:00
|
A Simple and Optimal Policy Design for Online Learning with Safety against Heavy-tailed Risk
David Simchi-Levi · Zeyu Zheng · Feng Zhu
|
|
Poster
|
|
LAPO: Latent-Variable Advantage-Weighted Policy Optimization for Offline Reinforcement Learning
Xi Chen · Ali Ghadirzadeh · Tianhe Yu · Jianhao Wang · Alex Yuan Gao · Wenzhe Li · Liang Bin · Chelsea Finn · Chongjie Zhang
|
|
Poster
|
Thu 9:00
|
Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits
Tong Mu · Yash Chandak · Tatsunori Hashimoto · Emma Brunskill
|
|
Poster
|
Tue 14:00
|
Truly Deterministic Policy Optimization
Ehsan Saleh · Saba Ghaffari · Tim Bretl · Matthew West
|
|
Poster
|
|
Supported Policy Optimization for Offline Reinforcement Learning
Jialong Wu · Haixu Wu · Zihan Qiu · Jianmin Wang · Mingsheng Long
|
|
Poster
|
Wed 9:00
|
Conservative Dual Policy Optimization for Efficient Model-Based Reinforcement Learning
Shenao Zhang
|
|
Poster
|
Thu 9:00
|
Global Convergence of Direct Policy Search for State-Feedback H∞ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Xingang Guo · Bin Hu
|
|
Workshop
|
|
Efficient Offline Policy Optimization with a Learned Model
Zichen Liu · Siyi Li · Wee Sun Lee · Shuicheng Yan · Zhongwen Xu
|
|