On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers

IRIS

State-of-the-art rehearsal-free continual learning methods exploit the peculiarities of Vision Transformers to learn task-specific prompts, drastically reducing catastrophic forgetting. However, there is a tradeoff between the number of learned parameters and the performance, making such models computationally expensive. In this work, we aim to reduce this cost while maintaining competitive performance. We achieve this by revisiting and extending a simple transfer learning idea: learning task-specific normalization layers. Specifically, we tune the scale and bias parameters of LayerNorm for each continual learning task, selecting them at inference time based on the similarity between task-specific keys and the output of the pre-trained model. To make the classifier robust to incorrect selection of parameters during inference, we introduce a two-stage training procedure, where we first optimize the task-specific parameters and then train the classifier with the same selection procedure of the inference time. Experiments on ImageNet-R and CIFAR-100 show that our method achieves results that are either superior or on par with the state of the art while being computationally cheaper.

On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers / De Min, Thomas.; Mancini, Massimiliano; Alahari, Karteek; Alameda-Pineda, Xavier.; Ricci, Elisa. - (2023), pp. 3577-3586. (Intervento presentato al convegno 2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 tenutosi a Parigi, Francia nel 2nd - 6th October 2023) [10.1109/ICCVW60793.2023.00385].

On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers

De Min, Thomas.;Mancini, Massimiliano;Alahari, Karteek;Alameda-Pineda, Xavier.;Ricci, Elisa

2023-01-01

Abstract

State-of-the-art rehearsal-free continual learning methods exploit the peculiarities of Vision Transformers to learn task-specific prompts, drastically reducing catastrophic forgetting. However, there is a tradeoff between the number of learned parameters and the performance, making such models computationally expensive. In this work, we aim to reduce this cost while maintaining competitive performance. We achieve this by revisiting and extending a simple transfer learning idea: learning task-specific normalization layers. Specifically, we tune the scale and bias parameters of LayerNorm for each continual learning task, selecting them at inference time based on the similarity between task-specific keys and the output of the pre-trained model. To make the classifier robust to incorrect selection of parameters during inference, we introduce a two-stage training procedure, where we first optimize the task-specific parameters and then train the classifier with the same selection procedure of the inference time. Experiments on ImageNet-R and CIFAR-100 show that our method achieves results that are either superior or on par with the state of the art while being computationally cheaper.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2023
			
	Titolo del volume (Proceedings title)
	
				2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)
			
	Luogo di edizione (Place of publication)
	
				Piscataway, NJ USA
			
	Casa editrice (Publisher)
	
				IEEE Computer Society
			
	ISBN
	
				979-8-3503-0744-3
979-8-3503-0745-0
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85182930876
			
	Codice WOS (WOS identifier)
	
				WOS:001156680303072
			
	Tutti gli autori
	
						De Min, Thomas.; Mancini, Massimiliano; Alahari, Karteek; Alameda-Pineda, Xavier.; Ricci, Elisa
					
	Citazione
	
				On the Effectiveness of LayerNorm Tuning for Continual Learning in Vision Transformers / De Min, Thomas.; Mancini, Massimiliano; Alahari, Karteek; Alameda-Pineda, Xavier.; Ricci, Elisa. - (2023), pp. 3577-3586. (Intervento presentato al  convegno 2023 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2023 tenutosi a Parigi, Francia nel 2nd - 6th October 2023) [10.1109/ICCVW60793.2023.00385].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
2308.09610.pdf accesso aperto Tipologia: Pre-print non referato (Non-refereed preprint) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 545.71 kB Formato Adobe PDF Visualizza/Apri	545.71 kB	Adobe PDF	Visualizza/Apri
On_the_Effectiveness_of_LayerNorm_Tuning_for_Continual_Learning_in_Vision_Transformers.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.09 MB Formato Adobe PDF Visualizza/Apri	1.09 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/400789

Citazioni

ND

5

0

ND

social impact