Timezone: »
One of the promises of Big Data is its potential to answer “what if?” questions in digital, natural and social systems. Whether we speak of genetic interactions in a cell, passengers commuting in railways and roads, recommender systems matching users to ads, or understanding contagion in social networks, such systems are composed of many interacting components that suggest that learning to control them or understanding the effect of shocks to a system is not an easy task. What if some railways are closed, what will passengers do? What if we incentivize a member of a social network to propagate an idea, how influential can they be? What if some genes in a cell are knockedout, which phenotypes can we expect?
Such questions need to be addressed via a combination of experimental and observational data, and require a careful approach to modelling heterogeneous datasets and structural assumptions concerning the causal relations among the components of the system. The workshop is aimed at bringing together research expertise from a variety of communities in machine learning, statistics, engineering, and the social, medical and natural sciences. It is an opportunity for methods for causal inference, reinforcement learning and game theory to be crossfertilized with more traditional research in statistics and the realworld constraints found in practical applications. Ultimately, this can lead to new research platforms to aid the assessment of policies, shocks and experimental design methods in the discovery of breakthroughs in a variety of domains.
Fri 11:20 p.m.  11:30 p.m.

Welcome
(
Opening remarks
)

🔗 
Fri 11:30 p.m.  12:15 a.m.

The DataFusion Problem: Causal Inference and Reinforcement Learning
(
Invited Talk
)

Elias Bareinboim 🔗 
Sat 12:15 a.m.  1:00 a.m.

A Contextual Research Program
(
Invited Talk
)
A theory of contextual interventions has developed and matured to the point where contextual bandits can be routinely deployed to solve appropriate problems. A more general theory of contextual interventions in complex settings appears desirable and is under development leading to developments in two new areas: 1) Sequential decision making around deviations from existing solutions 
John Langford 🔗 
Sat 1:00 a.m.  2:00 a.m.

Poster Session I
(
Poster Session
)
The first of two sessions. Each session will include all posters. 
🔗 
Sat 2:00 a.m.  2:30 a.m.

Optimal and Adaptive Offpolicy Evaluation in Contextual Bandits
(
Contributed Talk
)
We consider the problem of offpolicy evaluation—estimating the value of a target policy using data collected by another policy—under the contextual bandit model. We establish a minimax lower bound on the mean squared error (MSE), and show that it is matched up to constant factors by the inverse propensity scoring (IPS) estimator. Since in the multiarmed bandit problem the IPS is suboptimal, our result highlights the difficulty of the contextual setting with nondegenerate context distributions. We further consider improvements on this minimax MSE bound, given access to a reward model. We show that the existing doubly robust approach, which utilizes such a reward model, may continue to suffer from high variance even when the reward model is perfect. We propose a new estimator called SWITCH which more effectively uses the reward model and achieves a superior biasvariance tradeoff compared with prior work. We prove an upper bound on its MSE and demonstrate its benefits empirically on a diverse collection of datasets, often seeing orders of magnitude improvements over a number of baselines. 
YuXiang Wang 🔗 
Sat 2:30 a.m.  3:00 a.m.

Joint Causal Inference on Observational and Experimental Datasets
(
Contributed Talk
)
We introduce joint causal inference, a powerful formulation of causal discovery over multiple datasets in which we jointly learn both the causal structure and targets of interventions from independence test results. While offering many advantages, joint causal inference induces faithfulness violations due to deterministic relations, so we extend a recently proposed constraintbased method to deal with this type of violations. A preliminary evaluation shows the benefits of joint causal inference. 
Sara Magliacane 🔗 
Sat 4:45 a.m.  5:30 a.m.

Extracting Templates from Media Event Sequences
(
Invited Talk
)
In this preliminary research we'll present early results on extracting repeatable probabilistic templates from global mediaevent sequences. Such patterns could hint on some weak forms of causality in the global social dynamics. As a basis, we are using the evolving graph of interlinked events generated by the "Event Registry" system (eventregistry.org), where each event is represented as an object composed from three main components: social, topical and temporal. In the analysis we will show early results on the structure of the problem and the spectrum of extracted templates from simple to hard ones. 
Marko Grobelnik 🔗 
Sat 5:30 a.m.  6:00 a.m.

Estimating Whatif Outcomes for Targeting Interventions in a Clinical Setting
(
Contributed Talk
)
Individuals have heterogeneous outcomes from interventions. In a clinical setting, estimating how patients will respond to different treatments is critical for targeted care. Clinicians constantly ask themselves, given a patient’s history, what would happen to the patient’s clinical trajectory if they were given one treatment versus another. However, in practice it is often unknown how the patient’s signals will change in response to treatment until that treatment is actually administered. Even then, it is impossible to observe the counterfactual from real data, i.e., what would have happened to the patient if the doctor had made a different choice. In order to solve this causal question, we use the gformula with proper assumptions to estimate physiologic trajectories and treatment responses from observed data. To demonstrate this we model blood pressure and heart rate for patients in the intensive care unit (ICU) and estimate their responses to six types of treatments that are used in their management. These two signals are among the most commonly used vital signs in the ICU and are critical for identifying lifethreatening conditions like septic and hemorrhagic shock. To model the signal with treatment response from observed data, we use two different Bayesian nonparametric (BNP) methods to build the estimator. BNP are known to have an extremely flexible functional form, which helps to overcome the model misspecification problem and makes the estimator more robust. 
Suchi Saria 🔗 
Sat 6:00 a.m.  7:00 a.m.

Poster Session II
(
Poster Session
)

🔗 
Sat 7:00 a.m.  7:30 a.m.

Longterm Causal Effects in Policy Analysis
(
Contributed Talk
)
In most causal problems we want to evaluate the longterm effects of policy changes but only have access to shortterm experimental data. For example, for the longterm effects of minimum wage increase we may only have access to oneyear worth of employment data. In this technical note we argue that such conceptual gap between what is to be estimated and what is in the data has not been adequately addressed. To make our criticism constructive we describe our approach in studying multiagent systems and the longterm effects of interventions in such systems. Central to our approach is behavioral game theory, where a behavioral model of how agents act conditional on their latent behaviors is combined with a temporal model of how behaviors evolve. 
Panagiotis Toulis 🔗 
Sat 7:30 a.m.  8:15 a.m.

Causal Inference for Recommendation Systems
(
Invited Talk
)
We develop a causal inference approach to recommender systems. Observational recommendation data contains two sources of information: which items each user decided to look at and which of those items each user liked. We assume these two types of information come from different modelsthe exposure data comes from a model by which users discover items to consider; the click data comes from a model by which users decide which items they like. Traditionally, recommender systems use the click data alone (or ratings data) to infer the user preferences. But this inference is biased by the exposure data, i.e., that users do not consider each item independently at random. We use causal inference to correct for this bias. On realworld data, we demonstrate that causal inference for recommender systems leads to improved generalization to new data. (Joint work with Dawen Liang and Laurent Charlin) 
David Blei 🔗 
Sat 8:15 a.m.  9:00 a.m.

Panel & Closing
(
Panel Discussion and Closing Remarks
)

🔗 
Author Information
Ricardo Silva (University College London)
John ShaweTaylor (University College London)
Adith Swaminathan (Cornell University)
Thorsten Joachims (Cornell)
More from the Same Authors

2022 : Pragmatic Fairness: Optimizing Policies with Outcome Disparity Control »
Limor Gultchin · Siyuan Guo · Alan Malek · Silvia Chiappa · Ricardo Silva 
2022 : Evaluating the Impact of Geometric and Statistical Skews on OutOfDistribution Generalization Performance »
Aengus Lynch · Jean Kaddour · Ricardo Silva 
2022 : Evaluating the Impact of Geometric and Statistical Skews on OutOfDistribution Generalization Performance »
Aengus Lynch · Jean Kaddour · Ricardo Silva 
2022 : Partial identification without distributional assumptions »
Kirtan Padh · Jakob Zeitler · David Watson · Matt Kusner · Ricardo Silva · Niki Kilbertus 
2022 Poster: When Do Flat Minima Optimizers Work? »
Jean Kaddour · Linqing Liu · Ricardo Silva · Matt Kusner 
2021 : Ricardo Silva  The Road to Causal Programming »
Ricardo Silva 
2021 Poster: Fairness in Ranking under Uncertainty »
Ashudeep Singh · David Kempe · Thorsten Joachims 
2021 Poster: Causal Effect Inference for Structured Treatments »
Jean Kaddour · Yuchen Zhu · Qi Liu · Matt Kusner · Ricardo Silva 
2020 : Invited Talk: On Prediction, Action and Interference »
Ricardo Silva 
2020 Poster: A Class of Algorithms for General Instrumental Variable Models »
Niki Kilbertus · Matt Kusner · Ricardo Silva 
2020 Poster: MOReL: ModelBased Offline Reinforcement Learning »
Rahul Kidambi · Aravind Rajeswaran · Praneeth Netrapalli · Thorsten Joachims 
2019 : Opening Remarks »
Thorsten Joachims · Nathan Kallus · Michele Santacatterina · Adith Swaminathan · David Sontag · Angela Zhou 
2019 Workshop: Machine Learning with Guarantees »
Ben London · Gintare Karolina Dziugaite · Daniel Roy · Thorsten Joachims · Aleksander Madry · John ShaweTaylor 
2019 Workshop: “Do the right thing”: machine learning and causal inference for improved decision making »
Michele Santacatterina · Thorsten Joachims · Nathan Kallus · Adith Swaminathan · David Sontag · Angela Zhou 
2019 : Thorsten Joachim: Fair Ranking with Biased Data »
Thorsten Joachims 
2019 Poster: Policy Learning for Fairness in Ranking »
Ashudeep Singh · Thorsten Joachims 
2018 Poster: Bayesian Semisupervised Learning with Graph Gaussian Processes »
Yin Cheng Ng · Nicolò Colombo · Ricardo Silva 
2017 : An Efficient Method to Impose Fairness in Linear Models »
Massimiliano Pontil · John ShaweTaylor 
2017 : Equality of Opportunity in Rankings »
Thorsten Joachims · Ashudeep Singh 
2017 Workshop: From 'What If?' To 'What Next?' : Causal Inference and Machine Learning for Intelligent Decision Making »
Ricardo Silva · Panagiotis Toulis · John ShaweTaylor · Alexander Volfovsky · Thorsten Joachims · Lihong Li · Nathan Kallus · Adith Swaminathan 
2017 Poster: Counterfactual Fairness »
Matt Kusner · Joshua Loftus · Chris Russell · Ricardo Silva 
2017 Oral: Counterfactual Fairness »
Matt Kusner · Joshua Loftus · Chris Russell · Ricardo Silva 
2017 Poster: Tomography of the London Underground: a Scalable Model for OriginDestination Data »
Nicolò Colombo · Ricardo Silva · Soong Moon Kang 
2017 Poster: When Worlds Collide: Integrating Different Counterfactual Assumptions in Fairness »
Chris Russell · Matt Kusner · Joshua Loftus · Ricardo Silva 
2016 : Panel Discussion »
Gisbert Schneider · Ross E Goodwin · Simon Colton · Russ Salakhutdinov · Thorsten Joachims · Florian Pinel 
2016 : Structured Prediction with Logged Bandit Feedback »
Thorsten Joachims 
2016 Poster: ObservationalInterventional Priors for DoseResponse Learning »
Ricardo Silva 
2016 Poster: Scaling Factorial Hidden Markov Models: Stochastic Variational Inference without Messages »
Yin Cheng Ng · Pawel M Chilinski · Ricardo Silva 
2015 Poster: The SelfNormalized Estimator for Counterfactual Learning »
Adith Swaminathan · Thorsten Joachims 
2015 Spotlight: The SelfNormalized Estimator for Counterfactual Learning »
Adith Swaminathan · Thorsten Joachims 
2014 Poster: Causal Inference through a Witness Protection Program »
Ricardo Silva · Robin Evans 
2013 Poster: Learning Trajectory Preferences for Manipulators via Iterative Improvement »
Ashesh Jain · Brian Wojcik · Thorsten Joachims · Ashutosh Saxena 
2013 Poster: Flexible sampling of discrete data correlations without the marginal distributions »
Alfredo Kalaitzis · Ricardo Silva 
2011 Poster: Semantic Labeling of 3D Point Clouds for Indoor Scenes »
Hema Koppula · Abhishek Anand · Thorsten Joachims · Ashutosh Saxena 
2011 Poster: Thinning Measurement Models and Questionnaire Design »
Ricardo Silva 
2007 Workshop: Machine Learning for Web Search »
Denny Zhou · Olivier Chapelle · Thorsten Joachims · Thomas Hofmann 
2007 Poster: Hidden Common Cause Relations in Relational Learning »
Ricardo Silva · Wei Chu · Zoubin Ghahramani 
2007 Spotlight: Hidden Common Cause Relations in Relational Learning »
Ricardo Silva · Wei Chu · Zoubin Ghahramani