The proliferation of automated inference algorithms in Bayesian statistics has provided practitioners newfound access to fast, reproducible data analysis and powerful statistical models. Designing automated methods that are also both computationally scalable and theoretically sound, however, remains a significant challenge. Recent work on Bayesian coresets takes the approach of compressing the dataset before running a standard inference algorithm, providing both scalability and guarantees on posterior approximation error. But the automation of past coreset methods is limited because they depend on the availability of a reasonable coarse posterior approximation, which is difficult to specify in practice. In the present work we remove this requirement by formulating coreset construction as sparsity-constrained variational inference within an exponential family. This perspective leads to a novel construction via greedy optimization, and also provides a unifying information-geometric view of present and past methods. The proposed Riemannian coreset construction algorithm is fully automated, requiring no problem-specific inputs aside from the probabilistic model and dataset. The proposed algorithm is significantly easier to use than past methods. Experiments further demonstrate that past coreset constructions are fundamentally limited by the fixed coarse posterior approximation; in contrast, the proposed algorithm is able to continually improve the coreset, providing state-of-the-art Bayesian dataset summarization with orders-of-magnitude reduction in KL divergence to the exact posterior.
Author Information
Trevor Campbell (University of British Columbia)
Boyan Beronov (University of British Columbia)
More from the Same Authors
2023 Poster: Embracing the chaos: analysis and diagnosis of numerical instability in variational flows
Zuheng Xu · Trevor Campbell
2022 Poster: Bayesian inference via sparse Hamiltonian flows
Naitong Chen · Zuheng Xu · Trevor Campbell
2022 Poster: Fast Bayesian Coresets via Subsampling and Quasi-Newton Refinement
Cian Naik · Judith Rousseau · Trevor Campbell
2022 Poster: Parallel Tempering With a Variational Reference
Nikola Surjanovic · Saifuddin Syed · Alexandre Bouchard-Côté · Trevor Campbell
2021 Workshop: Your Model is Wrong: Robustness and misspecification in probabilistic modeling
Diana Cai · Sameer Deshpande · Michael Hughes · Tamara Broderick · Trevor Campbell · Nick Foti · Barbara Engelhardt · Sinead Williamson
2020 Poster: Bayesian Pseudocoresets
Dionysis Manousakas · Zuheng Xu · Cecilia Mascolo · Trevor Campbell
2019 Poster: Universal Boosting Variational Inference
Trevor Campbell · Xinglong Li