Skip to yearly menu bar Skip to main content


Search All 2022 Events
 

40 Results

<<   <   Page 1 of 4   >   >>
Poster
Wed 14:00 A Near-Optimal Primal-Dual Method for Off-Policy Learning in CMDP
Fan Chen · Junyu Zhang · Zaiwen Wen
Poster
Thu 14:00 Off-Policy Evaluation with Policy-Dependent Optimization Response
Wenshuo Guo · Michael Jordan · Angela Zhou
Poster
Wed 9:00 Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification
Takumi Tanabe · Rei Sato · Kazuto Fukuchi · Jun Sakuma · Youhei Akimoto
Poster
Wed 14:00 Mismatched No More: Joint Model-Policy Optimization for Model-Based RL
Benjamin Eysenbach · Alexander Khazatsky · Sergey Levine · Russ Salakhutdinov
Poster
On the role of overparameterization in off-policy Temporal Difference learning with linear function approximation
Valentin Thomas
Poster
Wed 9:00 The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design
Ruoyu Cheng · Xianglong Lyu · Yang Li · Junjie Ye · Jianye Hao · Junchi Yan
Workshop
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning
zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang
Workshop
Pareto-Optimal Diagnostic Policy Learning in Clinical Applications via Semi-Model-Based Deep Reinforcement Learning
zheng Yu · Yikuan Li · Joseph Kim · Kaixuan Huang · Yuan Luo · Mengdi Wang
Poster
Tue 9:00 Instance-Dependent Near-Optimal Policy Identification in Linear MDPs via Online Experiment Design
Andrew Wagenmaker · Kevin Jamieson
Poster
Wed 9:00 DeepTOP: Deep Threshold-Optimal Policy for MDPs and RMABs
Khaled Nakhleh · I-Hong Hou
Poster
Thu 14:00 Batch size-invariance for policy optimization
Jacob Hilton · Karl Cobbe · John Schulman
Poster
Thu 9:00 Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits
Tong Mu · Yash Chandak · Tatsunori Hashimoto · Emma Brunskill