Poster
|
Wed 17:00
|
Characterizing the Exact Behaviors of Temporal Difference Learning Algorithms Using Markov Jump Linear System Theory
Bin Hu · Usman Syed
|
|
Poster
|
Tue 17:30
|
Worst-Case Regret Bounds for Exploration via Randomized Value Functions
Daniel Russo
|
|
Poster
|
Tue 10:45
|
Value Function in Frequency Domain and the Characteristic Value Iteration Algorithm
Amir-massoud Farahmand
|
|
Poster
|
Wed 17:00
|
Large Scale Markov Decision Processes with Changing Rewards
Adrian Rivera Cardoso · He Wang · Huan Xu
|
|
Poster
|
Tue 10:45
|
Limiting Extrapolation in Linear Approximate Value Iteration
Andrea Zanette · Alessandro Lazaric · Mykel J Kochenderfer · Emma Brunskill
|
|
Poster
|
Wed 10:45
|
Finite-Time Performance Bounds and Adaptive Learning Rate Selection for Two Time-Scale Reinforcement Learning
Harsh Gupta · R. Srikant · Lei Ying
|
|
Poster
|
Tue 10:45
|
Maximum Expected Hitting Cost of a Markov Decision Process and Informativeness of Rewards
Falcon Dai · Matthew Walter
|
|
Poster
|
Tue 17:30
|
Exploration Bonus for Regret Minimization in Discrete and Continuous Average Reward MDPs
Jian QIAN · Ronan Fruit · Matteo Pirotta · Alessandro Lazaric
|
|
Poster
|
Tue 10:45
|
Finite-Sample Analysis for SARSA with Linear Function Approximation
Shaofeng Zou · Tengyu Xu · Yingbin Liang
|
|
Poster
|
Wed 10:45
|
A Regularized Approach to Sparse Optimal Policy in Reinforcement Learning
Wenhao Yang · Xiang Li · Zhihua Zhang
|
|
Poster
|
Tue 17:30
|
Almost Horizon-Free Structure-Aware Best Policy Identification with a Generative Model
Andrea Zanette · Mykel J Kochenderfer · Emma Brunskill
|
|
Poster
|
Tue 17:30
|
Explicit Planning for Efficient Exploration in Reinforcement Learning
Liangpeng Zhang · Ke Tang · Xin Yao
|
|