One of the common ways children learn is by mimicking adults. Imitation learning focuses on learning policies with suitable performance from demonstrations generated by an expert, when the performance measure is unspecified and the reward signal is unobserved. Popular methods for imitation learning either directly mimic the behavior policy of the expert (behavior cloning) or learn a reward function that prioritizes the observed expert trajectories (inverse reinforcement learning). However, these methods rely on the assumption that the covariates used by the expert to determine their actions are fully observed. In this paper, we relax this assumption and study imitation learning when the sensory inputs of the learner and the expert differ. First, we provide a non-parametric, graphical criterion that is complete (both necessary and sufficient) for determining the feasibility of imitation from the combination of demonstration data and qualitative assumptions about the underlying environment, represented in the form of a causal model. We then show that when this criterion does not hold, imitation could still be feasible by exploiting quantitative knowledge of the expert trajectories. Finally, we develop an efficient procedure for learning the imitating policy from experts' trajectories.
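As a rough illustration of why the full-observability assumption matters, the toy sketch below computes, by exact enumeration, the expected reward of an expert who sees a binary covariate and of a learner who clones the expert's action distribution without seeing it. This is not the paper's method or one of its examples: the model (a fair binary covariate `U`, an expert that plays `X = U`, a reward of 1 when the action matches `U`) and all function names are hypothetical choices made for illustration only.

```python
# Toy model (all names and numbers are illustrative assumptions):
#   U ~ Bernoulli(0.5) is seen by the expert but NOT by the learner.
#   The expert plays X = U, and the reward is 1 when X matches U.

P_U = {0: 0.5, 1: 0.5}  # distribution of the unobserved covariate


def expert_policy(u):
    """Expert's action distribution P(x | u): it copies U deterministically."""
    return {u: 1.0, 1 - u: 0.0}


def reward(x, u):
    """Reward is 1 when the action matches the covariate, else 0."""
    return 1.0 if x == u else 0.0


def expert_value():
    """Expected reward of the expert, computed by exact enumeration."""
    return sum(
        P_U[u] * p * reward(x, u)
        for u in P_U
        for x, p in expert_policy(u).items()
    )


def cloned_policy():
    """Behavior cloning without access to U: the learner can only match
    the marginal action distribution P(x) seen in the demonstrations."""
    marginal = {0: 0.0, 1: 0.0}
    for u in P_U:
        for x, p in expert_policy(u).items():
            marginal[x] += P_U[u] * p
    return marginal


def learner_value(policy):
    """Expected reward of a policy that picks X independently of U."""
    return sum(
        P_U[u] * policy[x] * reward(x, u)
        for u in P_U
        for x in policy
    )


if __name__ == "__main__":
    print("expert value: ", expert_value())
    print("cloned policy:", cloned_policy())
    print("learner value:", learner_value(cloned_policy()))
```

In this toy setting the cloned policy reproduces the expert's action frequencies exactly, yet its expected reward is half the expert's, because the learner's actions are no longer tied to the covariate that determines the reward.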
Author Information
Junzhe Zhang (Columbia University)
Daniel Kumor (Purdue University)
Elias Bareinboim (Columbia University)
Related Events (a corresponding poster, oral, or spotlight)
- 2020 Poster: Causal Imitation Learning With Unobserved Confounders
  Wed. Dec 9th 05:00 -- 07:00 PM Room Poster Session 3 #875
More from the Same Authors
- 2021 Spotlight: Double Machine Learning Density Estimation for Local Treatment Effects with Instruments
  Yonghan Jung · Jin Tian · Elias Bareinboim
- 2022 Poster: Causal Identification under Markov equivalence: Calculus, Algorithm, and Completeness
  Amin Jaber · Adele Ribeiro · Jiji Zhang · Elias Bareinboim
- 2022 Poster: Online Reinforcement Learning for Mixed Policy Scopes
  Junzhe Zhang · Elias Bareinboim
- 2022 Poster: Finding and Listing Front-door Adjustment Sets
  Hyunchai Jeong · Jin Tian · Elias Bareinboim
- 2021 : Panel Discussion
  Elias Bareinboim · Mark van der Laan · Claire Vernade
- 2021 : TBD (Elias Bareinboim)
  Elias Bareinboim
- 2021 : Invited Talk: Causality and Fairness
  Elias Bareinboim
- 2021 Workshop: Causal Inference & Machine Learning: Why now?
  Elias Bareinboim · Bernhard Schölkopf · Terrence Sejnowski · Yoshua Bengio · Judea Pearl
- 2021 Oral: Sequential Causal Imitation Learning with Unobserved Confounders
  Daniel Kumor · Junzhe Zhang · Elias Bareinboim
- 2021 Poster: Causal Identification with Matrix Equations
  Sanghack Lee · Elias Bareinboim
- 2021 Poster: Nested Counterfactual Identification from Arbitrary Surrogate Experiments
  Juan Correa · Sanghack Lee · Elias Bareinboim
- 2021 Poster: Sequential Causal Imitation Learning with Unobserved Confounders
  Daniel Kumor · Junzhe Zhang · Elias Bareinboim
- 2021 Poster: The Causal-Neural Connection: Expressiveness, Learnability, and Inference
  Kevin Xia · Kai-Zhan Lee · Yoshua Bengio · Elias Bareinboim
- 2021 Poster: Double Machine Learning Density Estimation for Local Treatment Effects with Instruments
  Yonghan Jung · Jin Tian · Elias Bareinboim
- 2021 Oral: Causal Identification with Matrix Equations
  Sanghack Lee · Elias Bareinboim
- 2020 Workshop: Causal Discovery and Causality-Inspired Machine Learning
  Biwei Huang · Sara Magliacane · Kun Zhang · Danielle Belgrave · Elias Bareinboim · Daniel Malinsky · Thomas Richardson · Christopher Meek · Peter Spirtes · Bernhard Schölkopf
- 2020 Poster: Characterizing Optimal Mixed Policies: Where to Intervene and What to Observe
  Sanghack Lee · Elias Bareinboim
- 2020 Poster: Causal Discovery from Soft Interventions with Unknown Targets: Characterization and Learning
  Amin Jaber · Murat Kocaoglu · Karthikeyan Shanmugam · Elias Bareinboim
- 2020 Poster: General Transportability of Soft Interventions: Completeness Results
  Juan Correa · Elias Bareinboim
- 2020 Poster: Learning Causal Effects via Weighted Empirical Risk Minimization
  Yonghan Jung · Jin Tian · Elias Bareinboim
- 2019 Poster: Efficient Identification in Linear Structural Causal Models with Instrumental Cutsets
  Daniel Kumor · Bryant Chen · Elias Bareinboim