firstbacksecondback
91 Results
Poster
|
Wed 18:30 |
Dynamic Safe Interruptibility for Decentralized Multi-Agent Reinforcement Learning El Mahdi El-Mhamdi · Rachid Guerraoui · Hadrien Hendrikx · Alexandre Maurer |
|
Spotlight
|
Wed 17:15 |
Repeated Inverse Reinforcement Learning Kareem Amin · Nan Jiang · Satinder Singh |
|
Spotlight
|
Wed 17:30 |
EX2: Exploration with Exemplar Models for Deep Reinforcement Learning Justin Fu · John Co-Reyes · Sergey Levine |
|
Poster
|
Wed 18:30 |
Robust and Efficient Transfer Learning with Hidden Parameter Markov Decision Processes Taylor Killian · Samuel Daulton · Finale Doshi-Velez · George Konidaris |
|
Workshop
|
Sat 13:30 |
Hierarchical Imitation and Reinforcement Learning for Robotics (Jan Peters) Jan Peters |
|
Workshop
|
Sat 8:40 |
Learning to optimize with reinforcement learning Jitendra Malik |
|
Poster
|
Wed 18:30 |
Regret Minimization in MDPs with Options without Prior Knowledge Ronan Fruit · Matteo Pirotta · Alessandro Lazaric · Emma Brunskill |
|
Poster
|
Mon 18:30 |
Interpolated Policy Gradient: Merging On-Policy and Off-Policy Gradient Estimation for Deep Reinforcement Learning Shixiang (Shane) Gu · Timothy Lillicrap · Richard Turner · Zoubin Ghahramani · Bernhard Schölkopf · Sergey Levine |
|
Poster
|
Tue 18:30 |
Hindsight Experience Replay Marcin Andrychowicz · Filip Wolski · Alex Ray · Jonas Schneider · Rachel Fong · Peter Welinder · Bob McGrew · Josh Tobin · OpenAI Pieter Abbeel · Wojciech Zaremba |
|
Poster
|
Tue 18:30 |
Finite sample analysis of the GTD Policy Evaluation Algorithms in Markov Setting Yue Wang · Wei Chen · Yuting Liu · Zhi-Ming Ma · Tie-Yan Liu |
|
Workshop
|
Sat 16:50 |
POSTER: Curiosity-driven reinforcement learning with hoemostatic regulation Ildefons Magrans de Abril |
|
Spotlight
|
Tue 17:05 |
Posterior sampling for reinforcement learning: worst-case regret bounds Shipra Agrawal · Randy Jia |