Poster
|
Tue 17:30
|
Provably Efficient Q-learning with Function Approximation via Distribution Shift Error Checking Oracle
Simon Du · Yuping Luo · Ruosong Wang · Hanrui Zhang
|
|
Poster
|
Wed 17:00
|
Loaded DiCE: Trading off Bias and Variance in Any-Order Score Function Gradient Estimators for Reinforcement Learning
Gregory Farquhar · Shimon Whiteson · Jakob Foerster
|
|
Poster
|
Wed 17:00
|
Modelling the Dynamics of Multiagent Q-Learning in Repeated Symmetric Games: a Mean Field Theoretic Approach
Shuyue Hu · Chin-wing Leung · Ho-fung Leung
|
|
Poster
|
Tue 17:30
|
Learning Multiple Markov Chains via Adaptive Allocation
Mohammad Sadegh Talebi · Odalric-Ambrym Maillard
|
|
Poster
|
Wed 17:00
|
MAVEN: Multi-Agent Variational Exploration
Anuj Mahajan · Tabish Rashid · Mikayel Samvelyan · Shimon Whiteson
|
|
Poster
|
Tue 17:30
|
Exploration via Hindsight Goal Generation
Zhizhou Ren · Kefan Dong · Yuan Zhou · Qiang Liu · Jian Peng
|
|
Poster
|
Tue 17:30
|
Reconciling λ-Returns with Experience Replay
Brett Daley · Christopher Amato
|
|
Poster
|
Tue 10:45
|
A Meta-MDP Approach to Exploration for Lifelong Reinforcement Learning
Francisco Garcia · Philip Thomas
|
|
Poster
|
Wed 10:45
|
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards
Siyuan Li · Rui Wang · Minxue Tang · Chongjie Zhang
|
|
Poster
|
Thu 10:45
|
Privacy-Preserving Q-Learning with Functional Noise in Continuous Spaces
Baoxiang Wang · Nidhi Hegde
|
|
Poster
|
Tue 10:45
|
Value Function in Frequency Domain and the Characteristic Value Iteration Algorithm
Amir-massoud Farahmand
|
|
Poster
|
Tue 17:30
|
Mapping State Space using Landmarks for Universal Goal Reaching
Zhiao Huang · Fangchen Liu · Hao Su
|
|