NeurIPS 2019

27 Results

Poster	Wed 17:00	Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory	Bin Hu · Usman Syed
Poster	Tue 17:30	Worst-Case Regret Bounds for Exploration via Randomized Value Functions	Daniel Russo
Poster	Tue 10:45	Value Function in Frequency Domain and the Characteristic Value Iteration Algorithm	Amir-massoud Farahmand
Poster	Wed 17:00	Large Scale Markov Decision Processes with Changing Rewards	Adrian Rivera Cardoso · He Wang · Huan Xu
Poster	Tue 10:45	Limiting Extrapolation in Linear Approximate Value Iteration	Andrea Zanette · Alessandro Lazaric · Mykel J Kochenderfer · Emma Brunskill
Poster	Wed 10:45	Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning	Harsh Gupta · R. Srikant · Lei Ying
Poster	Tue 10:45	Maximum Expected Hitting Cost of a Markov Decision Process and Informativeness of Rewards	Falcon Dai · Matthew Walter
Poster	Tue 17:30	Exploration Bonus for Regret Minimization in Discrete and Continuous Average Reward MDPs	Jian QIAN · Ronan Fruit · Matteo Pirotta · Alessandro Lazaric
Poster	Tue 10:45	Finite-Sample Analysis for SARSA with Linear Function Approximation	Shaofeng Zou · Tengyu Xu · Yingbin Liang
Poster	Wed 10:45	A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning	Wenhao Yang · Xiang Li · Zhihua Zhang
Poster	Tue 17:30	Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model	Andrea Zanette · Mykel J Kochenderfer · Emma Brunskill
Poster	Tue 17:30	Explicit Planning for Efficient Exploration in Reinforcement Learning	Liangpeng Zhang · Ke Tang · Xin Yao