firstbacksecondback
5 Results
Workshop
|
The Emphatic Approach to Average-Reward Policy Evaluation Jiamin He · Yi Wan · Rupam Mahmood |
||
Workshop
|
On Convergence of Average-Reward Off-Policy Control Algorithms in Weakly-Communicating MDPs Yi Wan · Richard Sutton |
||
Poster
|
Wed 9:00 |
Learning Infinite-Horizon Average-Reward Restless Multi-Action Bandits via Index Awareness GUOJUN XIONG · Shufan Wang · Jian Li |
|
Poster
|
Thu 9:00 |
IMED-RL: Regret optimal learning of ergodic Markov decision processes Fabien Pesquerel · Odalric-Ambrym Maillard |
|
Poster
|
Tue 9:00 |
Influencing Long-Term Behavior in Multiagent Reinforcement Learning Dong-Ki Kim · Matthew Riemer · Miao Liu · Jakob Foerster · Michael Everett · Chuangchuang Sun · Gerald Tesauro · Jonathan How |