Concept bottleneck models (CBMs) are a class of interpretable neural network models that predict the target label of a given input from its high-level concepts. Unlike other end-to-end deep learning models, CBMs let domain experts intervene on the predicted concepts at test time so that the target predictions become more accurate and reliable. While this intervenability provides a powerful avenue of control, many aspects of the intervention procedure remain underexplored. In this work, we inspect the current intervention practice for its efficiency and reliability. Specifically, we first present an array of new intervention methods that significantly improve the target prediction accuracy for a given intervention budget. We also draw attention to nontrivial, previously unreported issues related to the reliability and fairness of intervention and discuss how to fix these problems in practice.
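To make the idea of test-time intervention concrete, here is a minimal sketch in plain Python. It is not the authors' implementation. The linear label head, the function names `predict_target` and `intervene`, and all numbers are hypothetical and chosen only for illustration. It shows how replacing a few predicted concept values with expert-provided ones can change the downstream target prediction.

```python
from typing import Dict, List, Sequence


def predict_target(concepts: Sequence[float], weights: List[List[float]]) -> int:
    """Hypothetical linear label head: return the index of the highest-scoring class."""
    scores = [sum(w * c for w, c in zip(row, concepts)) for row in weights]
    return max(range(len(scores)), key=scores.__getitem__)


def intervene(predicted: Sequence[float], corrections: Dict[int, float]) -> List[float]:
    """Return a copy of the predicted concepts with selected entries
    overwritten by expert-provided values (the intervention)."""
    fixed = list(predicted)
    for idx, value in corrections.items():
        fixed[idx] = value
    return fixed


# Toy setup (all values are assumptions): three concepts, two target classes.
predicted_concepts = [0.9, 0.2, 0.4]   # output of a hypothetical concept predictor
label_weights = [
    [1.0, 0.0, 0.0],                   # class 0 relies on concept 0
    [0.0, 1.0, 1.0],                   # class 1 relies on concepts 1 and 2
]

before = predict_target(predicted_concepts, label_weights)

# An expert corrects concepts 0 and 1; the budget here is two interventions.
corrected_concepts = intervene(predicted_concepts, {0: 0.0, 1: 1.0})
after = predict_target(corrected_concepts, label_weights)

print(before, after)
```

In this toy case the prediction switches from class 0 to class 1 after two concepts are corrected. Which concepts to correct first under a fixed budget is the kind of question the intervention methods in this work address.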
Author Information
Sungbin Shin (POSTECH)
Yohan Jo (Korea Advanced Institute of Science and Technology)
Sungsoo Ahn (POSTECH)
Namhoon Lee (POSTECH)
More from the Same Authors
- 2022 : Substructure-Atom Cross Attention for Molecular Representation Learning
  Jiye Kim · Seungbeom Lee · Dongwoo Kim · Sungsoo Ahn · Jaesik Park
- 2023 Poster: Multi-resolution Spectral Coherence for Graph Generation with Score-based Diffusion
  Hyuna Cho · Minjae Jeong · Sooyeon Jeon · Sungsoo Ahn · Won Hwa Kim
- 2023 Poster: Bootstrapped Training of Score-Conditioned Generator for Offline Design of Biological Sequences
  Minsu Kim · Federico Berto · Sungsoo Ahn · Jinkyoo Park
- 2023 Poster: Diffusion Probabilistic Models for Structured Node Classification
  Hyosoon Jang · Seonghyun Park · Sangwoo Mo · Sungsoo Ahn
- 2022 Poster: Learning Debiased Classifier with Biased Committee
  Nayeong Kim · Sehyun Hwang · Sungsoo Ahn · Jaesik Park · Suha Kwak