Poster
|
Tue 9:00 |
Uplifting Bandits Yu-Guan Hsieh · Shiva Kasiviswanathan · Branislav Kveton |
|
Workshop
|
Clairvoyant Regret Minimization: Equivalence with Nemirovski’s Conceptual Prox Method and Extension to General Convex Games Gabriele Farina · Christian Kroer · Chung-Wei Lee · Haipeng Luo |
||
Poster
|
Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret Jiawei Huang · Li Zhao · Tao Qin · Wei Chen · Nan Jiang · Tie-Yan Liu |
||
Poster
|
Wed 14:00 |
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback Tiancheng Jin · Tal Lancewicki · Haipeng Luo · Yishay Mansour · Aviv Rosenberg |