Timezone: »
Poster
An Analysis of Ensemble Sampling
Chao Qin · Zheng Wen · Xiuyuan Lu · Benjamin Van Roy
Ensemble sampling serves as a practical approximation to Thompson sampling when maintaining an exact posterior distribution over model parameters is computationally intractable. In this paper, we establish a regret bound that ensures desirable behavior when ensemble sampling is applied to the linear bandit problem. This represents the first rigorous regret analysis of ensemble sampling and is made possible by leveraging information-theoretic concepts and novel analytic techniques that may prove useful beyond the scope of this paper.
Author Information
Chao Qin (Columbia University)
Zheng Wen (DeepMind)
Xiuyuan Lu (DeepMind)
Benjamin Van Roy (Stanford University)
More from the Same Authors
-
2021 : On Adaptivity and Confounding in Contextual Bandit Experiments »
Chao Qin · Daniel Russo -
2021 : On Adaptivity and Confounding in Contextual Bandit Experiments »
Chao Qin · Daniel Russo -
2023 Poster: Epistemic Neural Networks »
Ian Osband · Zheng Wen · Seyed Mohammad Asghari · Vikranth Dwaracherla · MORTEZA IBRAHIMI · Xiuyuan Lu · Benjamin Van Roy -
2023 Poster: A Definition of Continual Reinforcement Learning »
David Abel · Andre Barreto · Benjamin Van Roy · Doina Precup · Hado van Hasselt · Satinder Singh -
2022 Spotlight: An Analysis of Ensemble Sampling »
Chao Qin · Zheng Wen · Xiuyuan Lu · Benjamin Van Roy -
2022 Poster: The Neural Testbed: Evaluating Joint Predictions »
Ian Osband · Zheng Wen · Seyed Mohammad Asghari · Vikranth Dwaracherla · Xiuyuan Lu · MORTEZA IBRAHIMI · Dieterich Lawson · Botao Hao · Brendan O'Donoghue · Benjamin Van Roy -
2021 : On Adaptivity and Confounding in Contextual Bandit Experiments »
Chao Qin · Daniel Russo -
2020 Poster: On Efficiency in Hierarchical Reinforcement Learning »
Zheng Wen · Doina Precup · Morteza Ibrahimi · Andre Barreto · Benjamin Van Roy · Satinder Singh -
2020 Spotlight: On Efficiency in Hierarchical Reinforcement Learning »
Zheng Wen · Doina Precup · Morteza Ibrahimi · Andre Barreto · Benjamin Van Roy · Satinder Singh -
2019 Poster: Bootstrapping Upper Confidence Bound »
Botao Hao · Yasin Abbasi Yadkori · Zheng Wen · Guang Cheng