Timezone: »
This paper examines the equilibrium convergence properties of no-regret learning with exponential weights in potential games. To establish convergence with minimal information requirements on the players' side, we focus on two frameworks: the semi-bandit case (where players have access to a noisy estimate of their payoff vectors, including strategies they did not play), and the bandit case (where players are only able to observe their in-game, realized payoffs). In the semi-bandit case, we show that the induced sequence of play converges almost surely to a Nash equilibrium at a quasi-exponential rate. In the bandit case, the same result holds for approximate Nash equilibria if we introduce a constant exploration factor that guarantees that action choice probabilities never become arbitrarily small. In particular, if the algorithm is run with a suitably decreasing exploration factor, the sequence of play converges to a bona fide Nash equilibrium with probability 1.
Author Information
Amélie Héliou (Criteo)
Johanne Cohen (LRI-CNRS)
Panayotis Mertikopoulos (CNRS (French National Center for Scientific Research))
More from the Same Authors
-
2023 Poster: Riemannian stochastic optimization methods avoid strict saddle points »
Ya-Ping Hsieh · Mohammad Reza Karimi Jaghargh · Andreas Krause · Panayotis Mertikopoulos -
2023 Poster: Strategic Stability under Regularized Learning in Games »
Victor Boone · Panayotis Mertikopoulos -
2023 Poster: Payoff-based Learning with Matrix Multiplicative Weights in Quantum Games »
Kyriakos Lotidis · Panayotis Mertikopoulos · Nicholas Bambos · Jose Blanchet -
2023 Poster: Exploiting hidden structures in non-convex games for convergence to Nash equilibrium »
Iosif Sakos · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Panayotis Mertikopoulos · Georgios Piliouras -
2022 Poster: No-regret learning in games with noisy feedback: Faster rates and adaptivity via learning rate separation »
Yu-Guan Hsieh · Kimon Antonakopoulos · Volkan Cevher · Panayotis Mertikopoulos -
2022 Poster: On the convergence of policy gradient methods to Nash equilibria in general stochastic games »
Angeliki Giannou · Kyriakos Lotidis · Panayotis Mertikopoulos · Emmanouil-Vasileios Vlatakis-Gkaragkounis -
2021 Poster: Fast Routing under Uncertainty: Adaptive Learning in Congestion Games via Exponential Weights »
Dong Quan Vu · Kimon Antonakopoulos · Panayotis Mertikopoulos -
2021 Poster: Sifting through the noise: Universal first-order methods for stochastic variational inequalities »
Kimon Antonakopoulos · Thomas Pethick · Ali Kavis · Panayotis Mertikopoulos · Volkan Cevher -
2021 Poster: Adaptive First-Order Methods Revisited: Convex Minimization without Lipschitz Requirements »
Kimon Antonakopoulos · Panayotis Mertikopoulos -
2021 Poster: On the Rate of Convergence of Regularized Learning in Games: From Bandits and Uncertainty to Optimism and Beyond »
Angeliki Giannou · Emmanouil-Vasileios Vlatakis-Gkaragkounis · Panayotis Mertikopoulos -
2020 Poster: No-Regret Learning and Mixed Nash Equilibria: They Do Not Mix »
Emmanouil-Vasileios Vlatakis-Gkaragkounis · Lampros Flokas · Thanasis Lianeas · Panayotis Mertikopoulos · Georgios Piliouras -
2020 Spotlight: No-Regret Learning and Mixed Nash Equilibria: They Do Not Mix »
Emmanouil-Vasileios Vlatakis-Gkaragkounis · Lampros Flokas · Thanasis Lianeas · Panayotis Mertikopoulos · Georgios Piliouras -
2020 Poster: Explore Aggressively, Update Conservatively: Stochastic Extragradient Methods with Variable Stepsize Scaling »
Yu-Guan Hsieh · Franck Iutzeler · Jérôme Malick · Panayotis Mertikopoulos -
2020 Poster: Online Non-Convex Optimization with Imperfect Feedback »
Amélie Héliou · Matthieu Martin · Panayotis Mertikopoulos · Thibaud Rahier -
2020 Poster: On the Almost Sure Convergence of Stochastic Gradient Descent in Non-Convex Problems »
Panayotis Mertikopoulos · Nadav Hallak · Ali Kavis · Volkan Cevher -
2020 Spotlight: Explore Aggressively, Update Conservatively: Stochastic Extragradient Methods with Variable Stepsize Scaling »
Yu-Guan Hsieh · Franck Iutzeler · Jérôme Malick · Panayotis Mertikopoulos -
2019 Poster: On the convergence of single-call stochastic extra-gradient methods »
Yu-Guan Hsieh · Franck Iutzeler · Jérôme Malick · Panayotis Mertikopoulos -
2019 Poster: An adaptive Mirror-Prox method for variational inequalities with singular operators »
Kimon Antonakopoulos · Veronica Belmega · Panayotis Mertikopoulos -
2018 : Poster spotlight »
Tianbao Yang · Pavel Dvurechenskii · Panayotis Mertikopoulos · Hugo Berard -
2018 Poster: Bandit Learning in Concave N-Person Games »
Mario Bravo · David Leslie · Panayotis Mertikopoulos -
2018 Poster: Learning in Games with Lossy Feedback »
Zhengyuan Zhou · Panayotis Mertikopoulos · Susan Athey · Nicholas Bambos · Peter W Glynn · Yinyu Ye -
2017 Poster: Countering Feedback Delays in Multi-Agent Learning »
Zhengyuan Zhou · Panayotis Mertikopoulos · Nicholas Bambos · Peter W Glynn · Claire Tomlin -
2017 Poster: Stochastic Mirror Descent in Variationally Coherent Optimization Problems »
Zhengyuan Zhou · Panayotis Mertikopoulos · Nicholas Bambos · Stephen Boyd · Peter W Glynn