

(30 events)
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #200
A Geometric Perspective on Optimal Representations for Reinforcement Learning
Marc Bellemare · Will Dabney · Robert Dadashi · Adrien Ali Taiga · Pablo Samuel Castro · Nicolas Le Roux · Dale Schuurmans · Tor Lattimore · Clare Lyle
[Paper] [Slides]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #201
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Wenhao Yang · Xiang Li · Zhihua Zhang
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #202
Constrained Reinforcement Learning Has Zero Duality Gap
Santiago Paternain · Luiz Chamon · Miguel Calvo-Fullana · Alejandro Ribeiro
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #203
Distributional Reward Decomposition for Reinforcement Learning
Zichuan Lin · Li Zhao · Derek Yang · Tao Qin · Tie-Yan Liu · Guangwen Yang
[Paper] [Slides]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #204
Divergence-Augmented Policy Optimization
Qing Wang · Yingru Li · Jiechao Xiong · Tong Zhang
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #205
DualDICE: Behavior-Agnostic Estimation of Discounted Stationary Distribution Corrections
Ofir Nachum · Yinlam Chow · Bo Dai · Lihong Li
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #206
Fast Efficient Hyperparameter Tuning for Policy Gradient Methods
Supratik Paul · Vitaly Kurin · Shimon Whiteson
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #207
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning
Harsh Gupta · R. Srikant · Lei Ying
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #208
Fully Parameterized Quantile Function for Distributional Reinforcement Learning
Derek Yang · Li Zhao · Zichuan Lin · Tao Qin · Jiang Bian · Tie-Yan Liu
[Paper] [Slides]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #209
Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
Nathan Kallus · Masatoshi Uehara
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #210
Learning Reward Machines for Partially Observable Reinforcement Learning
Rodrigo Toro Icarte · Ethan Waldie · Toryn Klassen · Rick Valenzano · Margarita Castro · Sheila McIlraith
[Paper] [Poster] [Slides]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #211
Off-Policy Evaluation via Off-Policy Classification
Alexander Irpan · Kanishka Rao · Konstantinos Bousmalis · Chris Harris · Julian Ibarz · Sergey Levine
[Paper] [Poster] [Slides]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #212
SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies
Kamyar Ghasemipour · Shixiang (Shane) Gu · Richard Zemel
[Paper] [Slides]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #213
Variance Reduced Policy Evaluation with Smooth Function Approximation
Hoi-To Wai · Mingyi Hong · Zhuoran Yang · Zhaoran Wang · Kexin Tang
[Paper] [Poster]
Poster
Wed Dec 11 10:45 AM -- 12:45 PM (PST) @ East Exhibition Hall B + C #214
VIREL: A Variational Inference Framework for Reinforcement Learning
Mattie Fellows · Anuj Mahajan · Tim G. J. Rudner · Shimon Whiteson
[Paper] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #202
Budgeted Reinforcement Learning in Continuous State Space
Nicolas Carrara · Edouard Leurent · Romain Laroche · Tanguy Urvoy · Odalric-Ambrym Maillard · Olivier Pietquin
[Paper] [Poster]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #203
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory
Bin Hu · Usman Syed
[Paper] [Poster]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #204
From Complexity to Simplicity: Adaptive ES-Active Subspaces for Blackbox Optimization
Krzysztof M Choromanski · Aldo Pacchiano · Jack Parker-Holder · Yunhao Tang · Vikas Sindhwani
[Paper] [Poster]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #205
Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards
Alexander Trott · Stephan Zheng · Caiming Xiong · Richard Socher
[Paper] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #206
Learning from Trajectories via Subgoal Discovery
Sujoy Paul · Jeroen Vanbaar · Amit Roy-Chowdhury
[Paper] [Poster]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #207
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning
Gregory Farquhar · Shimon Whiteson · Jakob Foerster
[Paper] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #208
Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling
Tengyang Xie · Yifei Ma · Yu-Xiang Wang
[Paper] [Poster] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #209
Meta-Inverse Reinforcement Learning with Probabilistic Context Variables
Lantao Yu · Tianhe Yu · Chelsea Finn · Stefano Ermon
[Paper] [Poster]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #210
Neural Proximal/Trust Region Policy Optimization Attains Globally Optimal Policy
Boyi Liu · Qi Cai · Zhuoran Yang · Zhaoran Wang
[Paper] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #211
Neural Temporal-Difference Learning Converges to Global Optima
Qi Cai · Zhuoran Yang · Jason Lee · Zhaoran Wang
[Paper] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #212
Provably Global Convergence of Actor-Critic: A Case for Linear Quadratic Regulator with Ergodic Cost
Zhuoran Yang · Yongxin Chen · Mingyi Hong · Zhaoran Wang
[Paper] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #213
Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning
Wenjie Shi · Shiji Song · Hui Wu · Ya-Chu Hsu · Cheng Wu · Gao Huang
[Paper] [Poster]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #214
Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction
Aviral Kumar · Justin Fu · George Tucker · Sergey Levine
[Paper] [Slides]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #215
Surrogate Objectives for Batch Policy Optimization in One-step Decision Making
Minmin Chen · Ramki Gummadi · Chris Harris · Dale Schuurmans
[Paper] [Poster]
Poster
Wed Dec 11 05:00 PM -- 07:00 PM (PST) @ East Exhibition Hall B + C #216
Discovery of Useful Questions as Auxiliary Tasks
Vivek Veeriah · Matteo Hessel · Zhongwen Xu · Janarthanan Rajendran · Richard L Lewis · Junhyuk Oh · Hado van Hasselt · David Silver · Satinder Singh
[Paper] [Slides]