What is Flagged in Uncertainty Quantification? Latent Density Models for Uncertainty Categorization

Hao Sun · Boris van Breugel · Jonathan CrabbĂ© · Nabeel Seedat · Mihaela van der Schaar

Great Hall & Hall B1+B2 (level 1) #1519
[ ]
Thu 14 Dec 3 p.m. PST — 5 p.m. PST


Uncertainty quantification (UQ) is essential for creating trustworthy machine learning models. Recent years have seen a steep rise in UQ methods that can flag suspicious examples, however, it is often unclear what exactly these methods identify. In this work, we propose a framework for categorizing uncertain examples flagged by UQ methods. We introduce the confusion density matrix---a kernel-based approximation of the misclassification density---and use this to categorize suspicious examples identified by a given uncertainty method into three classes: out-of-distribution (OOD) examples, boundary (Bnd) examples, and examples in regions of high in-distribution misclassification (IDM). Through extensive experiments, we show that our framework provides a new and distinct perspective for assessing differences between uncertainty quantification methods, thereby forming a valuable assessment benchmark.

Chat is not available.