Workshop

Causal Inference Challenges in Sequential Decision Making: Bridging Theory and Practice

Aurelien Bibaut ⋅ Maria Dimakopoulou ⋅ Nathan Kallus ⋅ Xinkun Nie ⋅ Masatoshi Uehara ⋅ Kelly Zhang

Abstract

Sequential decision-making problems arise in settings as varied as healthcare, e-commerce, operations management, and policymaking, and their features differ widely from one context to the next. Problems can involve online learning or offline data, known cost structures or unknown counterfactuals, continuous actions (with or without constraints) or finite or combinatorial actions, stationary environments or environments with dynamic agents, and utilitarian considerations or fairness and equity considerations. Increasingly, causal inference, causal discovery, and adjacent statistical theories have been brought to bear on such problems, from the early work on longitudinal causal inference in the last millennium to recent developments in bandit algorithms and inference, dynamic treatment regimes, online and offline reinforcement learning, interventions in general causal graphs and the discovery of those graphs, and more. While the interaction between these theories has grown, expertise remains spread across many disciplines, including CS/ML, (bio)statistics, econometrics, ethics/law, and operations research.

The primary purpose of this workshop is to convene experts, practitioners, and interested young researchers from a wide range of backgrounds to discuss recent developments in causal inference for sequential decision making and the avenues forward on the topic, especially those that bring together ideas from different fields. The all-virtual format of this year's NeurIPS workshop makes it particularly well suited to such an assembly. The workshop will combine invited talks and panels by a diverse group of researchers and practitioners from both academia and industry with contributed talks and a town-hall Q&A that will particularly seek to draw in younger researchers new to the area.

Schedule

Timezone: America/Los_Angeles

10:50 AM

Opening Remarks

11:00 AM

TBD (Elias Bareinboim)

Elias Bareinboim

11:30 AM

Sequential Adaptive Designs for Learning Optimal Individualized Treatment Rules with Formal Inference (Mark van der Laan)

Mark van der Laan

12:00 PM

Confident Off-Policy Evaluation and Selection through Self-Normalized Importance Weighting (Claire Vernade)

Claire Vernade

12:30 PM

Panel Discussion

Elias Bareinboim ⋅ Mark van der Laan ⋅ Claire Vernade

1:20 PM

Poster Presentation

2:20 PM

Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning (Guy Tennenholtz)

Guy Tennenholtz

2:40 PM

MAGNET: Multi-Agent Graph Cooperative Bandits (Hengrui Cai)

Hengrui Cai

3:00 PM

(Un)fairness in Sequential Decision Making as a Challenge (Razieh Nabi)

Razieh Nabi

3:30 PM

Off-Policy Confidence Interval Estimation with Confounded Markov Decision Process (Rui Song)

Rui Song

4:00 PM

TALK (Susan Athey)

Susan Athey

4:30 PM

Panel Discussion

Susan Athey ⋅ Rui Song ⋅ Razieh Nabi

5:30 PM

What Would the Expert $do(\cdot)$?: Causal Imitation Learning (Gokul Swamy)

Gokul Swamy

5:50 PM

The Limits to Learning a Diffusion Model (Andrew Zheng)

Andrew Zheng

6:10 PM

Deviation-Based Learning (Junpei Komiyama)

Junpei Komiyama

6:30 PM

Closing Remarks

6:40 PM

Poster Presentation

Bandits with Partially Observable Confounded Data

Guy Tennenholtz ⋅ Uri Shalit ⋅ Shie Mannor ⋅ Yonathan Efroni

Deviation-Based Learning

Junpei Komiyama ⋅ Shunya Noda

MAGNET: Multi-Agent Graph Cooperative Bandits

Hengrui Cai ⋅ Rui Song

On Adaptivity and Confounding in Contextual Bandit Experiments

Chao Qin ⋅ Daniel Russo

Doubly robust confidence sequences

Ian Waudby-Smith ⋅ David Arbour ⋅ Ritwik Sinha ⋅ Edward Kennedy ⋅ Aaditya Ramdas

A Causality-based Graphical Test to obtain an Optimal Blocking Set for Randomized Experiments

Abhishek Kumar Umrawal

Kernel Methods for Multistage Causal Inference: Mediation Analysis and Dynamic Treatment Effects

Rahul Singh ⋅ Ritsugen Jo ⋅ Arthur Gretton

Reinforcement Learning in Reward-Mixing MDPs

Jeongyeol Kwon ⋅ Yonathan Efroni ⋅ Constantine Caramanis ⋅ Shie Mannor

Chronological Causal Bandit

Neil Dhir

Causal Multi-Agent Reinforcement Learning: Review and Open Problems

St John Grimbly ⋅ Jonathan Shock ⋅ Arnu Pretorius

Covariate Shift of Latent Confounders in Imitation and Reinforcement Learning

Guy Tennenholtz ⋅ Assaf Hallak ⋅ Gal Dalal ⋅ Shie Mannor ⋅ Gal Chechik ⋅ Uri Shalit

Multiple imputation via state space model for missing data in non-stationary multivariate time series

Xiaoxuan Cai ⋅ Linda Valeri

Practical Policy Optimization with Personalized Experimentation

Mia Garrard ⋅ Hanson Wang ⋅ Ben Letham ⋅ Zehui Wang ⋅ Yin Huang ⋅ Yichun Hu ⋅ Chad Zhou ⋅ Norm Zhou ⋅ Eytan Bakshy

Dynamic Causal Discovery in Imitation Learning

Wenchao Yu

A Validation Tool for Designing Reinforcement Learning Environments

Ruiyang Xu ⋅ Zhengxing Chen

What Would the Expert $do(\cdot)$?: Causal Imitation Learning

Gokul Swamy ⋅ Sanjiban Choudhury ⋅ James Bagnell ⋅ Steven Wu

Learning Treatment Effects in Panels with General Intervention Patterns

Vivek Farias ⋅ Andrew Li ⋅ Tianyi Peng

Partition-based Local Independence Discovery

Inwoo Hwang ⋅ Byoung-Tak Zhang ⋅ Sanghack Lee

Understanding User Podcast Consumption Using Sequential Treatment Effect Estimation

Vishwali Mhasawade ⋅ Praveen Chandar ⋅ Ghazal Fazelnia ⋅ Benjamin Carterette

A Variational Information Bottleneck Principle for Recurrent Neural Networks

Adrian Tovar ⋅ Varun Jog

Off-Policy Evaluation with Embedded Spaces

Jaron Jia Rong Lee ⋅ David Arbour ⋅ Georgios Theocharous

Algorithms for Adaptive Experiments that Trade-off Statistical Analysis with Reward: Combining Uniform Random Assignment and Reward Maximization

Jacob Nogas ⋅ Arghavan Modiri ⋅ ⋅ Sofia Villar ⋅ Audrey Durand ⋅ Anna Rafferty ⋅ Joseph Williams

The Limits to Learning a Diffusion Model

Jackie Baek ⋅ Vivek Farias ⋅ Andreea Georgescu ⋅ Retsef Levi ⋅ Tianyi Peng ⋅ Joshua Wilde ⋅ Andrew Zheng

Beyond Ads: Sequential Decision-Making Algorithms in Public Policy

Peter Henderson ⋅ Brandon Anderson ⋅ Daniel Ho

Double/Debiased Machine Learning for Dynamic Treatment Effects via $g$-Estimation

Greg Lewis ⋅ Vasilis Syrgkanis

Estimating the Long-Term Effects of Novel Treatments

Keith Battocchi ⋅ Maggie Hei ⋅ Greg Lewis ⋅ Miruna Oprescu ⋅ Vasilis Syrgkanis
