19 Results
Poster
|
Tue 14:00 |
Truly Deterministic Policy Optimization Ehsan Saleh · Saba Ghaffari · Tim Bretl · Matthew West |
|
Poster
|
Wed 9:00 |
Chaotic Regularization and Heavy-Tailed Limits for Deterministic Gradient Descent Soon Hoe Lim · Yijun Wan · Umut Simsekli |
|
Workshop
|
SoftTreeMax: Policy Gradient with Tree Search Gal Dalal · Assaf Hallak · Shie Mannor · Gal Chechik |
||
Workshop
|
On All-Action Policy Gradients Michal Nauman · Marek Cygan |
||
Poster
|
Wed 9:00 |
Gradient-Free Methods for Deterministic and Stochastic Nonsmooth Nonconvex Optimization Tianyi Lin · Zeyu Zheng · Michael Jordan |
|
Poster
|
Wed 14:00 |
Alleviating "Posterior Collapse" in Deep Topic Models via Policy Gradient Yewen Li · Chaojie Wang · Zhibin Duan · Dongsheng Wang · Bo Chen · Bo An · Mingyuan Zhou |
|
Poster
|
Wed 14:00 |
The Role of Baselines in Policy Gradient Optimization Jincheng Mei · Wesley Chung · Valentin Thomas · Bo Dai · Csaba Szepesvari · Dale Schuurmans |
|
Workshop
|
Policy gradient finds global optimum of nearly linear-quadratic control systems Yinbin Han · Meisam Razaviyayn · Renyuan Xu |
||
Poster
|
Wed 14:00 |
Gradient Descent Is Optimal Under Lower Restricted Secant Inequality And Upper Error Bound Charles Guille-Escuret · Adam Ibrahim · Baptiste Goujaud · Ioannis Mitliagkas |
|
Workshop
|
Training graph neural networks with policy gradients to perform tree search Matthew Macfarlane · Diederik Roijers · Herke van Hoof |
||
Poster
|
Wed 9:00 |
Policy Gradient With Serial Markov Chain Reasoning Edoardo Cetin · Oya Celiktutan |
|
Poster
|
Wed 14:00 |
DNA: Proximal Policy Optimization with a Dual Network Architecture Matthew Aitchison · Penny Sweetser |