Poster
|
Tue 9:00
|
Uplifting Bandits
Yu-Guan Hsieh · Shiva Kasiviswanathan · Branislav Kveton
|
|
Workshop
|
|
Clairvoyant Regret Minimization: Equivalence with Nemirovski’s Conceptual Prox Method and Extension to General Convex Games
Gabriele Farina · Christian Kroer · Chung-Wei Lee · Haipeng Luo
|
|
Poster
|
|
Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret
Jiawei Huang · Li Zhao · Tao Qin · Wei Chen · Nan Jiang · Tie-Yan Liu
|
|
Poster
|
Wed 14:00
|
Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback
Tiancheng Jin · Tal Lancewicki · Haipeng Luo · Yishay Mansour · Aviv Rosenberg
|
|
Poster
|
|
Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach
Peng Mi · Li Shen · Tianhe Ren · Yiyi Zhou · Xiaoshuai Sun · Rongrong Ji · Dacheng Tao
|
|
Poster
|
|
How and Why to Manipulate Your Own Agent: On the Incentives of Users of Learning Agents
Yoav Kolumbus · Noam Nisan
|
|
Poster
|
Thu 14:00
|
Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent
Yu Bai · Chi Jin · Song Mei · Ziang Song · Tiancheng Yu
|
|
Poster
|
Thu 9:00
|
IMED-RL: Regret optimal learning of ergodic Markov decision processes
Fabien Pesquerel · Odalric-Ambrym Maillard
|
|
Poster
|
Thu 14:00
|
Near-Optimal Goal-Oriented Reinforcement Learning in Non-Stationary Environments
Liyu Chen · Haipeng Luo
|
|
Poster
|
Thu 14:00
|
Adapting to Online Label Shift with Provable Guarantees
Yong Bai · Yu-Jie Zhang · Peng Zhao · Masashi Sugiyama · Zhi-Hua Zhou
|
|
Poster
|
Wed 9:00
|
Queue Up Your Regrets: Achieving the Dynamic Capacity Region of Multiplayer Bandits
Ilai Bistritz · Nicholas Bambos
|
|
Poster
|
Wed 14:00
|
Learn to Match with No Regret: Reinforcement Learning in Markov Matching Markets
Yifei Min · Tianhao Wang · Ruitu Xu · Zhaoran Wang · Michael Jordan · Zhuoran Yang
|
|