firstbacksecondback
Filter by Keyword:
27 Results
Poster
|
Tue 17:30 |
Explicit Planning for Efficient Exploration in Reinforcement Learning Liangpeng Zhang · Ke Tang · Xin Yao |
|
Poster
|
Tue 10:45 |
Non-Asymptotic Gap-Dependent Regret Bounds for Tabular MDPs Max Simchowitz · Kevin Jamieson |
|
Poster
|
Tue 17:30 |
Learning Mean-Field Games Xin Guo · Anran Hu · Renyuan Xu · Junzi Zhang |
|
Poster
|
Tue 17:30 |
Real-Time Reinforcement Learning Simon Ramstedt · Chris Pal |
|
Poster
|
Tue 10:45 |
Generalized Off-Policy Actor-Critic Shangtong Zhang · Wendelin Boehmer · Shimon Whiteson |
|
Poster
|
Wed 10:45 |
A Geometric Perspective on Optimal Representations for Reinforcement Learning Marc Bellemare · Will Dabney · Robert Dadashi · Adrien Ali Taiga · Pablo Samuel Castro · Nicolas Le Roux · Dale Schuurmans · Tor Lattimore · Clare Lyle |
|
Poster
|
Tue 17:30 |
Hindsight Credit Assignment Anna Harutyunyan · Will Dabney · Thomas Mesnard · Mohammad Gheshlaghi Azar · Bilal Piot · Nicolas Heess · Hado van Hasselt · Gregory Wayne · Satinder Singh · Doina Precup · Remi Munos |
|
Poster
|
Thu 10:45 |
Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates Carlos Riquelme · Hugo Penedones · Damien Vincent · Hartmut Maennel · Sylvain Gelly · Timothy A Mann · Andre Barreto · Gergely Neu |
|
Poster
|
Tue 10:45 |
A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment Felix Leibfried · Sergio Pascual-Díaz · Jordi Grau-Moya |
|
Poster
|
Tue 10:45 |
Regret Bounds for Learning State Representations in Reinforcement Learning Ronald Ortner · Matteo Pirotta · Alessandro Lazaric · Ronan Fruit · Odalric-Ambrym Maillard |
|
Poster
|
Thu 10:45 |
Non-Stationary Markov Decision Processes, a Worst-Case Approach using Model-Based Reinforcement Learning Erwan Lecarpentier · Emmanuel Rachelson |
|
Poster
|
Tue 17:30 |
Non-Cooperative Inverse Reinforcement Learning Xiangyuan Zhang · Kaiqing Zhang · Erik Miehling · Tamer Basar |