firstbacksecondback
2 Results
Poster
|
Thu 9:00 |
Continuous Deep Q-Learning in Optimal Control Problems: Normalized Advantage Functions Analysis Anton Plaksin · Stepan Martyanov |
|
Poster
|
Thu 9:00 |
Direct Advantage Estimation Hsiao-Ru Pan · Nico Gürtler · Alexander Neitz · Bernhard Schölkopf |