firstbacksecondback
40 Results
Poster
|
Wed 14:00 |
The Role of Baselines in Policy Gradient Optimization Jincheng Mei · Wesley Chung · Valentin Thomas · Bo Dai · Csaba Szepesvari · Dale Schuurmans |
|
Poster
|
Tue 9:00 |
Policy Optimization with Linear Temporal Logic Constraints Cameron Voloshin · Hoang Le · Swarat Chaudhuri · Yisong Yue |
|
Poster
|
Tue 9:00 |
Policy Optimization with Advantage Regularization for Long-Term Fairness in Decision Systems Eric Yu · Zhizhen Qin · Min Kyung Lee · Sicun Gao |
|
Poster
|
Learning to Constrain Policy Optimization with Virtual Trust Region Thai Hung Le · Thommen Karimpanal George · Majid Abdolshah · Dung Nguyen · Kien Do · Sunil Gupta · Svetha Venkatesh |
||
Poster
|
Thu 14:00 |
When to Intervene: Learning Optimal Intervention Policies for Critical Events Niranjan Damera Venkata · Chiranjib Bhattacharyya |
|
Poster
|
Wed 9:00 |
Policy Optimization for Markov Games: Unified Framework and Faster Convergence Runyu Zhang · Qinghua Liu · Huan Wang · Caiming Xiong · Na Li · Yu Bai |
|
Poster
|
Thu 14:00 |
Bellman Residual Orthogonalization for Offline Reinforcement Learning Andrea Zanette · Martin J Wainwright |
|
Workshop
|
Novel Policy Seeking with Constrained Optimization Hao Sun · Zhenghao Peng · Bolei Zhou |
||
Poster
|
Tue 14:00 |
Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning Ruida Zhou · Tao Liu · Dileep Kalathil · P. R. Kumar · Chao Tian |
|
Poster
|
Wed 14:00 |
DNA: Proximal Policy Optimization with a Dual Network Architecture Matthew Aitchison · Penny Sweetser |
|
Poster
|
Tue 9:00 |
Continuous MDP Homomorphisms and Homomorphic Policy Gradient Sahand Rezaei-Shoshtari · Rosie Zhao · Prakash Panangaden · David Meger · Doina Precup |
|
Workshop
|
Understanding Curriculum Learning in Policy Optimization for Online Combinatorial Optimization Runlong Zhou · Yuandong Tian · YI WU · Simon Du |