Statistical Inference for Responsiveness Verification
Abstract
Many safety failures in machine learning arise when a model assigns a prediction to an individual without accounting for how that individual's features can change. In this work, we introduce a formal verification procedure for the responsiveness of predictions with respect to interventions on their features. Our machinery uses mixed-integer programming to construct reachable sets over both discrete and continuous features, enabling black-box estimation with exact statistical guarantees. Because we optimize to certify, our approach supports falsification and failure-probability estimation at scale: it runs many small mixed-integer feasibility checks, reuses sampled reachable sets across models, and surfaces concrete counterexamples. We demonstrate safety benefits in recidivism prediction, organ transplant prioritization, and content moderation.
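To make the verification task concrete, the sketch below checks whether a prediction is responsive to interventions on a designated set of actionable features. It is a minimal illustration, not the paper's method: the toy model, feature names, and thresholds are hypothetical, and brute-force enumeration over a discrete reachable set stands in for the mixed-integer feasibility checks described above.

```python
from itertools import product

# Hypothetical toy classifier: flags an individual as high risk when an
# illustrative score reaches 2. All names and thresholds are assumptions.
def model(x):
    age_group, prior_count, employed = x
    score = prior_count - employed + (1 if age_group == 0 else 0)
    return 1 if score >= 2 else 0

def reachable_set(x, actionable):
    """Enumerate points reachable from x by intervening on actionable features.

    `actionable` maps a feature index to the values it may take; all other
    features stay fixed. Brute force stands in for the MIP formulation.
    """
    domains = [actionable.get(i, [v]) for i, v in enumerate(x)]
    return [tuple(p) for p in product(*domains)]

def verify_responsiveness(model, x, actionable):
    """Return (responsive?, witness intervention or None)."""
    y = model(x)
    for xp in reachable_set(x, actionable):
        if model(xp) != y:
            return True, xp   # concrete counterexample: prediction changes
    return False, None        # prediction is fixed over the whole reachable set

x = (0, 2, 0)          # age_group=0, prior_count=2, unemployed
actions = {2: [0, 1]}  # only employment status is actionable
responsive, witness = verify_responsiveness(model, x, actions)
# Here no intervention on employment flips the prediction, so the
# individual's prediction is unresponsive -- the failure mode the
# verification procedure is designed to detect.
```

Replacing the enumeration with a solver query (e.g., "does there exist a reachable point with a different label?") turns each check into the kind of small feasibility problem the abstract references.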