Timezone: »
In reasoning about sequential events it is natural to pose probabilistic queries such as “when will event A occur next” or “what is the probability of A occurring before B”, with applications in areas such as user modeling, language models, medicine, and finance. These types of queries are complex to answer compared to next-event prediction, particularly for neural autoregressive models such as recurrent neural networks and transformers. This is in part due to the fact that future querying involves marginalization over large path spaces, which is not straightforward to do efficiently in such models. In this paper we introduce a general typology for predictive queries in neural autoregressive sequence models and show that such queries can be systematically represented by sets of elementary building blocks. We leverage this typology to develop new query estimation methods based on beam search, importance sampling, and hybrids. Across four large-scale sequence datasets from different application domains, as well as for the GPT-2 language model, we demonstrate the ability to make query answering tractable for arbitrary queries in exponentially-large predictive path-spaces, and find clear differences in cost-accuracy tradeoffs between search and sampling methods.
Author Information
Alex Boyd (UC Irvine)
Samuel Showalter (University of California, Irvine)
Stephan Mandt (University of California, Irvine)
Padhraic Smyth (University of California, Irvine)
More from the Same Authors
-
2021 : Analyzing High-Resolution Clouds and Convection using Multi-Channel VAEs »
Harshini Mangipudi · Griffin Mooers · Mike Pritchard · Tom Beucler · Stephan Mandt -
2021 : Structured Stochastic Gradient MCMC: a hybrid VI and MCMC approach »
Antonios Alexos · Alex Boyd · Stephan Mandt -
2022 : Probabilistic Querying of Continuous-Time Sequential Events »
Alex Boyd · Yuxin Chang · Stephan Mandt · Padhraic Smyth -
2022 : An Unsupervised Learning Perspective on the Dynamic Contribution to Extreme Precipitation Changes »
Griffin Mooers · Tom Beucler · Mike Pritchard · Stephan Mandt -
2022 Panel: Panel 5B-4: Predictive Querying for… & On the difficulty… »
Alex Boyd · Jonas Mikhaeil -
2022 : Q & A »
Karen Ullrich · Yibo Yang · Stephan Mandt -
2022 Tutorial: Data Compression with Machine Learning »
Karen Ullrich · Yibo Yang · Stephan Mandt -
2022 : Tutorial part 1 »
Yibo Yang · Karen Ullrich · Stephan Mandt -
2021 Poster: Detecting and Adapting to Irregular Distribution Shifts in Bayesian Online Learning »
Aodong Li · Alex Boyd · Padhraic Smyth · Stephan Mandt -
2021 Poster: Combining Human Predictions with Model Probabilities via Confusion Matrices and Calibration »
Gavin Kerrigan · Padhraic Smyth · Mark Steyvers -
2020 : Q/A and Discussion for ML Theory Session »
Karthik Kashinath · Mayur Mudigonda · Stephan Mandt · Rose Yu -
2020 : Stephan Mandt »
Stephan Mandt -
2020 Poster: Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference »
Disi Ji · Padhraic Smyth · Mark Steyvers -
2020 Poster: User-Dependent Neural Sequence Models for Continuous-Time Event Data »
Alex Boyd · Robert Bamler · Stephan Mandt · Padhraic Smyth -
2017 : Coffee break and Poster Session II »
Mohamed Kane · Albert Haque · Vagelis Papalexakis · John Guibas · Peter Li · Carlos Arias · Eric Nalisnick · Padhraic Smyth · Frank Rudzicz · Xia Zhu · Theodore Willke · Noemie Elhadad · Hans Raffauf · Harini Suresh · Paroma Varma · Yisong Yue · Ognjen (Oggi) Rudovic · Luca Foschini · Syed Rameel Ahmad · Hasham ul Haq · Valerio Maggio · Giuseppe Jurman · Sonali Parbhoo · Pouya Bashivan · Jyoti Islam · Mirco Musolesi · Chris Wu · Alexander Ratner · Jared Dunnmon · Cristóbal Esteban · Aram Galstyan · Greg Ver Steeg · Hrant Khachatrian · Marc Górriz · Mihaela van der Schaar · Anton Nemchenko · Manasi Patwardhan · Tanay Tandon -
2016 Workshop: Towards an Artificial Intelligence for Data Science »
Charles Sutton · James Geddes · Zoubin Ghahramani · Padhraic Smyth · Chris Williams -
2012 Workshop: Algorithmic and Statistical Approaches for Large Social Network Data Sets »
Michael Goodrich · Pavel N Krivitsky · David M Mount · Christopher DuBois · Padhraic Smyth -
2011 Oral: Continuous-Time Regression Models for Longitudinal Networks »
Duy Q Vu · Arthur Asuncion · David Hunter · Padhraic Smyth -
2011 Poster: Continuous-Time Regression Models for Longitudinal Networks »
Duy Q Vu · Arthur Asuncion · David Hunter · Padhraic Smyth -
2010 Spotlight: Learning concept graphs from text with stick-breaking priors »
America Chambers · Padhraic Smyth · Mark Steyvers -
2010 Poster: Learning concept graphs from text with stick-breaking priors »
America Chambers · Padhraic Smyth · Mark Steyvers -
2009 Poster: Particle-based Variational Inference for Continuous Systems »
Alexander Ihler · Andrew Frank · Padhraic Smyth -
2008 Poster: Asynchronous Distributed Learning of Topic Models »
Arthur Asuncion · Padhraic Smyth · Max Welling -
2007 Spotlight: Distributed Inference for Latent Dirichlet Allocation »
David Newman · Arthur Asuncion · Padhraic Smyth · Max Welling -
2007 Poster: Distributed Inference for Latent Dirichlet Allocation »
David Newman · Arthur Asuncion · Padhraic Smyth · Max Welling -
2006 Poster: Modeling General and Specific Aspects of Documents with a Probabilistic Topic Model »
Chaitanya Chemudugunta · Padhraic Smyth · Mark Steyvers -
2006 Poster: Learning Time-Intensity Profiles of Human Activity using Non-Parametric Bayesian Models »
Alexander Ihler · Padhraic Smyth -
2006 Poster: Hierarchical Dirichlet Processes with Random Effects »
Seyoung Kim · Padhraic Smyth