Emergent Mind

Conformal prediction under ambiguous ground truth

(2307.09302)
Published Jul 18, 2023 in cs.LG , cs.CV , stat.ME , and stat.ML

Abstract

Conformal Prediction (CP) allows to perform rigorous uncertainty quantification by constructing a prediction set $C(X)$ satisfying $\mathbb{P}(Y \in C(X))\geq 1-\alpha$ for a user-chosen $\alpha \in [0,1]$ by relying on calibration data $(X1,Y1),...,(Xn,Yn)$ from $\mathbb{P}=\mathbb{P}{X} \otimes \mathbb{P}{Y|X}$. It is typically implicitly assumed that $\mathbb{P}{Y|X}$ is the "true" posterior label distribution. However, in many real-world scenarios, the labels $Y1,...,Yn$ are obtained by aggregating expert opinions using a voting procedure, resulting in a one-hot distribution $\mathbb{P}{vote}{Y|X}$. For such ``voted'' labels, CP guarantees are thus w.r.t. $\mathbb{P}{vote}=\mathbb{P}X \otimes \mathbb{P}{vote}{Y|X}$ rather than the true distribution $\mathbb{P}$. In cases with unambiguous ground truth labels, the distinction between $\mathbb{P}{vote}$ and $\mathbb{P}$ is irrelevant. However, when experts do not agree because of ambiguous labels, approximating $\mathbb{P}{Y|X}$ with a one-hot distribution $\mathbb{P}{vote}{Y|X}$ ignores this uncertainty. In this paper, we propose to leverage expert opinions to approximate $\mathbb{P}{Y|X}$ using a non-degenerate distribution $\mathbb{P}{agg}{Y|X}$. We develop Monte Carlo CP procedures which provide guarantees w.r.t. $\mathbb{P}{agg}=\mathbb{P}X \otimes \mathbb{P}{agg}{Y|X}$ by sampling multiple synthetic pseudo-labels from $\mathbb{P}{agg}{Y|X}$ for each calibration example $X1,...,Xn$. In a case study of skin condition classification with significant disagreement among expert annotators, we show that applying CP w.r.t. $\mathbb{P}{vote}$ under-covers expert annotations: calibrated for $72\%$ coverage, it falls short by on average $10\%$; our Monte Carlo CP closes this gap both empirically and theoretically.

We're not able to analyze this paper right now due to high demand.

Please check back later (sorry!).

Generate a summary of this paper on our Pro plan:

We ran into a problem analyzing this paper.

Newsletter

Get summaries of trending comp sci papers delivered straight to your inbox:

Unsubscribe anytime.