Computer Science > Machine Learning

arXiv:2502.10533 (cs)

[Submitted on 14 Feb 2025 (v1), last revised 2 Mar 2026 (this version, v3)]

Title:Identity-Free Deferral For Unseen Experts

Authors:Joshua Strong, Pramit Saha, Yasin Ibrahim, Cheng Ouyang, Alison Noble

View PDF

Abstract:Learning to Defer (L2D) improves AI reliability in decision-critical environments by training AI to either make its own prediction or defer the decision to a human expert. A key challenge is adapting to unseen experts at test time, whose competence can differ from the training population. Current methods for this task, however, can falter when unseen experts are out-of-distribution (OOD) relative to the training population. We identify a core architectural flaw as the cause: they learn identity-conditioned policies by processing class-indexed signals in fixed coordinates, creating shortcuts that violate the problem's inherent permutation symmetry. We introduce Identity-Free Deferral (IFD), an architecture that enforces this symmetry by construction. From a few-shot context, IFD builds a query-independent Bayesian competence profile for each expert. It then supplies the deferral rejector with a low-dimensional, role-indexed state containing only structural information, such as the model's confidence in its top-ranked class and the expert's estimated skill for that same role, which obscures absolute class identities. We train IFD using an uncertainty-aware, context-only objective that removes the need for expensive query-time expert labels. We formally prove the permutation invariance of our approach, contrasting it with the generic non-invariance of standard population encoders. Experiments on medical imaging benchmarks and ImageNet-16H with real human annotators show that IFD consistently improves generalisation to unseen experts, with gains in OOD settings, all while using fewer annotations than alternative methods.

Comments:	Fourteenth International Conference on Learning Representations (ICLR) 2026
Subjects:	Machine Learning (cs.LG); Human-Computer Interaction (cs.HC)
Cite as:	arXiv:2502.10533 [cs.LG]
	(or arXiv:2502.10533v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2502.10533

Submission history

From: Joshua Strong [view email]
[v1] Fri, 14 Feb 2025 19:59:25 UTC (1,349 KB)
[v2] Sat, 24 May 2025 17:50:44 UTC (8,592 KB)
[v3] Mon, 2 Mar 2026 12:59:23 UTC (1,105 KB)

Computer Science > Machine Learning

Title:Identity-Free Deferral For Unseen Experts

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Identity-Free Deferral For Unseen Experts

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators