Conformal prediction under ambiguous ground truth
(2307.09302)Abstract
Conformal Prediction (CP) allows to perform rigorous uncertainty quantification by constructing a prediction set $C(X)$ satisfying $\mathbb{P}(Y \in C(X))\geq 1-\alpha$ for a user-chosen $\alpha \in [0,1]$ by relying on calibration data $(X1,Y1),...,(Xn,Yn)$ from $\mathbb{P}=\mathbb{P}{X} \otimes \mathbb{P}{Y|X}$. It is typically implicitly assumed that $\mathbb{P}{Y|X}$ is the "true" posterior label distribution. However, in many real-world scenarios, the labels $Y1,...,Yn$ are obtained by aggregating expert opinions using a voting procedure, resulting in a one-hot distribution $\mathbb{P}{vote}{Y|X}$. For such ``voted'' labels, CP guarantees are thus w.r.t. $\mathbb{P}{vote}=\mathbb{P}X \otimes \mathbb{P}{vote}{Y|X}$ rather than the true distribution $\mathbb{P}$. In cases with unambiguous ground truth labels, the distinction between $\mathbb{P}{vote}$ and $\mathbb{P}$ is irrelevant. However, when experts do not agree because of ambiguous labels, approximating $\mathbb{P}{Y|X}$ with a one-hot distribution $\mathbb{P}{vote}{Y|X}$ ignores this uncertainty. In this paper, we propose to leverage expert opinions to approximate $\mathbb{P}{Y|X}$ using a non-degenerate distribution $\mathbb{P}{agg}{Y|X}$. We develop Monte Carlo CP procedures which provide guarantees w.r.t. $\mathbb{P}{agg}=\mathbb{P}X \otimes \mathbb{P}{agg}{Y|X}$ by sampling multiple synthetic pseudo-labels from $\mathbb{P}{agg}{Y|X}$ for each calibration example $X1,...,Xn$. In a case study of skin condition classification with significant disagreement among expert annotators, we show that applying CP w.r.t. $\mathbb{P}{vote}$ under-covers expert annotations: calibrated for $72\%$ coverage, it falls short by on average $10\%$; our Monte Carlo CP closes this gap both empirically and theoretically.
We're not able to analyze this paper right now due to high demand.
Please check back later (sorry!).
Generate a summary of this paper on our Pro plan:
We ran into a problem analyzing this paper.