Whitening for Self-Supervised Representation Learning

Ermolov, A.; Siarohin, A.; Sangineto, E.; Sebe, N.

Most of the current self-supervised representation learning (SSL) methods are based on the contrastive loss and the instance-discrimination task, where augmented versions of the same image instance (“positives”) are contrasted with instances extracted from other images (“negatives”). For the learning to be effective, many negatives should be compared with a positive pair, which is computationally demanding. In this paper, we propose a different direction and a new loss function for SSL, which is based on the whitening of the latentspace features. The whitening operation has a “scattering” effect on the batch samples, avoiding degenerate solutions where all the sample representations collapse to a single point. Our solution does not require asymmetric networks and it is conceptually simple. Moreover, since negatives are not needed, we can extract multiple positive pairs from the same image instance. The source code of the method and of all the experiments is available at: https://github.com/htdt/ self-supervised

Whitening for Self-Supervised Representation Learning / Ermolov, A.; Siarohin, A.; Sangineto, E.; Sebe, N.. - 139:(2021), pp. 3015-3024. (Intervento presentato al convegno 38th International Conference on Machine Learning, ICML 2021 tenutosi a online nel 18th-24th July 2021).

Whitening for Self-Supervised Representation Learning

A. Ermolov;A. Siarohin;E. Sangineto;N. Sebe

2021-01-01

Abstract

Most of the current self-supervised representation learning (SSL) methods are based on the contrastive loss and the instance-discrimination task, where augmented versions of the same image instance (“positives”) are contrasted with instances extracted from other images (“negatives”). For the learning to be effective, many negatives should be compared with a positive pair, which is computationally demanding. In this paper, we propose a different direction and a new loss function for SSL, which is based on the whitening of the latentspace features. The whitening operation has a “scattering” effect on the batch samples, avoiding degenerate solutions where all the sample representations collapse to a single point. Our solution does not require asymmetric networks and it is conceptually simple. Moreover, since negatives are not needed, we can extract multiple positive pairs from the same image instance. The source code of the method and of all the experiments is available at: https://github.com/htdt/ self-supervised

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2021
			
	Titolo del volume (Proceedings title)
	
				International Conference on Machine Learning (ICML’21)
			
	Luogo di edizione (Place of publication)
	
				Red Hook, NY, USA
			
	Casa editrice (Publisher)
	
				Curran Associates - ML Research Press
			
	ISBN
	
				9781713845065
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85161262436
			
	Codice WOS (WOS identifier)
	
				WOS:000683104603003
			
	Tutti gli autori
	
						Ermolov, A.; Siarohin, A.; Sangineto, E.; Sebe, N.
					
	Citazione
	
				Whitening for Self-Supervised Representation Learning / Ermolov, A.; Siarohin, A.; Sangineto, E.; Sebe, N.. - 139:(2021), pp. 3015-3024. (Intervento presentato al  convegno 38th International Conference on Machine Learning, ICML 2021 tenutosi a online nel 18th-24th July 2021).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
ermolov21a.pdf accesso aperto Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 1.5 MB Formato Adobe PDF Visualizza/Apri	1.5 MB	Adobe PDF	Visualizza/Apri