Computer Science > Computer Vision and Pattern Recognition

arXiv:2508.17283 (cs)

[Submitted on 24 Aug 2025]

Title:Quickly Tuning Foundation Models for Image Segmentation

Authors:Breenda Das, Lennart Purucker, Timur Carstensen, Frank Hutter

Abstract:Foundation models like SAM (Segment Anything Model) exhibit strong zero-shot image segmentation performance, but often fall short on domain-specific tasks. Fine-tuning these models typically requires significant manual effort and domain expertise. In this work, we introduce QTT-SEG, a meta-learning-driven approach for automating and accelerating the fine-tuning of SAM for image segmentation. Built on the Quick-Tune hyperparameter optimization framework, QTT-SEG predicts high-performing configurations using meta-learned cost and performance models, efficiently navigating a search space of over 200 million possibilities. We evaluate QTT-SEG on eight binary and five multiclass segmentation datasets under tight time constraints. Our results show that QTT-SEG consistently improves upon SAM's zero-shot performance and surpasses AutoGluon Multimodal, a strong AutoML baseline, on most binary tasks within three minutes. On multiclass datasets, QTT-SEG delivers consistent gains as well. These findings highlight the promise of meta-learning in automating model adaptation for specialized segmentation tasks. Code available at: this https URL

Comments:	Accepted as a short paper at the non-archival content track of AutoML 2025
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2508.17283 [cs.CV]
	(or arXiv:2508.17283v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2508.17283

Submission history

From: Lennart Purucker [view email]
[v1] Sun, 24 Aug 2025 10:06:02 UTC (8,507 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Quickly Tuning Foundation Models for Image Segmentation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Quickly Tuning Foundation Models for Image Segmentation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators