Poster | Thu 14:00 | Batch size-invariance for policy optimization | Jacob Hilton · Karl Cobbe · John Schulman
Poster | Thu 9:00 | Global Convergence of Direct Policy Search for State-Feedback H∞ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential | Xingang Guo · Bin Hu
Poster | Wed 14:00 | On the convergence of policy gradient methods to Nash equilibria in general stochastic games | Angeliki Giannou · Kyriakos Lotidis · Panayotis Mertikopoulos · Emmanouil-Vasileios Vlatakis-Gkaragkounis
Poster | Tue 9:00 | On the Global Convergence Rates of Decentralized Softmax Gradient Play in Markov Potential Games | Runyu Zhang · Jincheng Mei · Bo Dai · Dale Schuurmans · Na Li
Poster | Tue 9:00 | Continuous MDP Homomorphisms and Homomorphic Policy Gradient | Sahand Rezaei-Shoshtari · Rosie Zhao · Prakash Panangaden · David Meger · Doina Precup
Poster | Tue 14:00 | Anchor-Changing Regularized Natural Policy Gradient for Multi-Objective Reinforcement Learning | Ruida Zhou · Tao Liu · Dileep Kalathil · P. R. Kumar · Chao Tian
Poster | Wed 9:00 | The Policy-gradient Placement and Generative Routing Neural Networks for Chip Design | Ruoyu Cheng · Xianglong Lyu · Yang Li · Junjie Ye · Jianye Hao · Junchi Yan