Diversified In-domain Synthesis with Efficient Fine-Tuning for Few-Shot Classification

Dall' Asen, Nicola; Turrisi Da Costa, Victor G.; Wang, Yiming; Sebe, Nicu; Ricci, Elisa

doi:10.1007/978-3-032-10192-1_36

Few-shot image classification aims to learn an image classifier using only a small set of labeled examples per class. A recent research direction for improving few-shot classifiers involves augmenting the labelled samples with synthetic images created by state-of-the-art text-to-image generation models. Following this trend, we propose Diversified In-domain Synthesis with Efficient Fine-tuning (DISEF), a novel approach which addresses the generalization challenge in few-shot learning using synthetic data. DISEF consists of two main components. First, we propose a novel text-to-image augmentation pipeline that, by leveraging the real samples and their rich semantics coming from a captioning model, promotes in-domain sample diversity for better generalization. Second, we emphasize the importance of effective model fine-tuning in few-shot recognition, proposing to use Low-Rank Adaptation (LoRA) for joint adaptation of the text and image encoders in a Vision Language Model. We validate our method in ten different benchmarks, consistently outperforming baselines and establishing a new state-of-the-art for few-shot classification. Code is available at https://github.com/vturrisi/disef.

Diversified In-domain Synthesis with Efficient Fine-Tuning for Few-Shot Classification / Dall' Asen, N., Turrisi Da Costa, V.G., Wang, Y., Sebe, N., Ricci, E.. - 16168:(2026), pp. 436-448. (23rd International Conference on Image Analysis and Processing, ICIAP 2025 Rome Sept. 2025) [10.1007/978-3-032-10192-1_36].

Diversified In-domain Synthesis with Efficient Fine-Tuning for Few-Shot Classification

Dall' Asen, Nicola;Turrisi da Costa, Victor G.;Wang, Yiming;Sebe, Nicu;Ricci, Elisa

2026-01-01

Abstract

Few-shot image classification aims to learn an image classifier using only a small set of labeled examples per class. A recent research direction for improving few-shot classifiers involves augmenting the labelled samples with synthetic images created by state-of-the-art text-to-image generation models. Following this trend, we propose Diversified In-domain Synthesis with Efficient Fine-tuning (DISEF), a novel approach which addresses the generalization challenge in few-shot learning using synthetic data. DISEF consists of two main components. First, we propose a novel text-to-image augmentation pipeline that, by leveraging the real samples and their rich semantics coming from a captioning model, promotes in-domain sample diversity for better generalization. Second, we emphasize the importance of effective model fine-tuning in few-shot recognition, proposing to use Low-Rank Adaptation (LoRA) for joint adaptation of the text and image encoders in a Vision Language Model. We validate our method in ten different benchmarks, consistently outperforming baselines and establishing a new state-of-the-art for few-shot classification. Code is available at https://github.com/vturrisi/disef.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2026
			
	Titolo del volume (Proceedings title)
	
				Image Analysis and Processing – ICIAP 2025
			
	Luogo di edizione (Place of publication)
	
				Heidelberg
			
	Casa editrice (Publisher)
	
				Springer
			
	ISBN
	
				9783032101914
9783032101921
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-105028360650
			
	Tutti gli autori
	
						Dall' Asen, Nicola; Turrisi Da Costa, Victor G.; Wang, Yiming; Sebe, Nicu; Ricci, Elisa
					
	Citazione
	
				Diversified In-domain Synthesis with Efficient Fine-Tuning for Few-Shot Classification / Dall' Asen, N., Turrisi Da Costa, V.G., Wang, Y., Sebe, N., Ricci, E.. - 16168:(2026), pp. 436-448. (23rd International Conference on Image Analysis and Processing, ICIAP 2025 Rome Sept. 2025) [10.1007/978-3-032-10192-1_36].

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/471398

Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni

ND

0

ND

0

Diversified In-domain Synthesis with Efficient Fine-Tuning for Few-Shot Classification

Dall' Asen, Nicola;Turrisi da Costa, Victor G.;Wang, Yiming;Sebe, Nicu;Ricci, Elisa

2026-01-01

Abstract

Scheda breve Scheda completa Scheda completa (DC)

Informazioni

Attenzione

Citazioni

social impact

Conferma cancellazione

Scheda breve

Scheda completa

Scheda completa (DC)