Vacuity Metric Flaws in Evidential Deep Learning for OOD Detection

other · 2026-05-09

A new paper on arXiv (2605.06382) reveals that vacuity, or Uncertainty Mass (UM), a common metric for Out-of-Distribution (OOD) detection in Evidential Deep Learning (EDL), is fundamentally flawed due to its sensitivity to the number of classes (K). UM is calculated by dividing K by the total strength of belief (S), derived from Dirichlet parameters. The authors argue that there is no linear relationship between K and S as both increase, because EDL suppresses incorrectly assigned evidence. This causes significant discrepancies when comparing In Distribution (ID) and OOD results if K differs between them. Empirical demonstrations show AUROC and AUPR can vary substantially when class cardinality differs by just 1, with AUROC differing by as much as [value not specified in source]. The study emphasizes that K_ID and K_OOD must be equal for valid comparisons, a condition often unmet in practice.

Key facts

Vacuity (Uncertainty Mass) is used for OOD detection in EDL.
UM = K / S, where K is number of classes and S is total strength of belief.
S is derived from summing Dirichlet parameters.
No linear relationship between K and S due to EDL suppressing incorrect evidence.
K_ID and K_OOD must be equal for valid comparisons.
AUROC and AUPR can differ substantially when K differs by 1.
Paper provides empirical demonstration of metric sensitivity.
Source: arXiv:2605.06382.

Vacuity Metric Flaws in Evidential Deep Learning for OOD Detection

Key facts

Entities

Institutions

Sources