Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality / Song, Yue; Sebe, Nicu; Wang, Wei. - 13684:(2022), pp. 356-372. (Paper presented at the 17th European Conference on Computer Vision, ECCV 2022, held in Tel Aviv, Israel, 23–27 October 2022) [10.1007/978-3-031-20053-3_21].
Improving Covariance Conditioning of the SVD Meta-layer by Orthogonality
Song, Yue; Sebe, Nicu; Wang, Wei
2022-01-01
Abstract
Inserting an SVD meta-layer into neural networks tends to make the covariance ill-conditioned, which can harm the model's training stability and generalization ability. In this paper, we systematically study how to improve covariance conditioning by enforcing orthogonality on the Pre-SVD layer. We first investigate existing orthogonal treatments of the weights; these techniques improve the conditioning but hurt performance. To avoid this side effect, we propose the Nearest Orthogonal Gradient (NOG) and the Optimal Learning Rate (OLR). The effectiveness of our methods is validated in two applications: decorrelated Batch Normalization (BN) and Global Covariance Pooling (GCP). Extensive experiments on visual recognition demonstrate that our methods simultaneously improve covariance conditioning and generalization. Moreover, combining them with orthogonal weight treatments can further boost performance.
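
To make the two quantities in the abstract concrete, the snippet below is a minimal, illustrative PyTorch sketch (not the authors' implementation): `covariance_condition_number` measures how ill-conditioned the Pre-SVD covariance is, and `nearest_orthogonal` shows the standard polar-decomposition projection (via an SVD of the gradient itself) that a Nearest Orthogonal Gradient step could rely on. The tensor shapes, function names, and how the projected gradient would be rescaled or applied are assumptions for illustration only.

```python
# Illustrative sketch only: not the paper's code; shapes and usage are assumptions.
import torch


def covariance_condition_number(features: torch.Tensor) -> torch.Tensor:
    """Condition number of the sample covariance of Pre-SVD activations.

    features: assumed layout (N, d), one row per sample.
    A large value indicates the ill-conditioning the paper aims to reduce.
    """
    X = features - features.mean(dim=0, keepdim=True)   # center the batch
    cov = X.T @ X / (X.shape[0] - 1)                     # sample covariance (d, d)
    return torch.linalg.cond(cov)                        # sigma_max / sigma_min


def nearest_orthogonal(grad: torch.Tensor) -> torch.Tensor:
    """Nearest (semi-)orthogonal matrix to `grad` in the Frobenius norm.

    Standard orthogonal-Procrustes / polar-decomposition solution:
    grad = U S V^T  ->  nearest orthogonal matrix is U V^T.
    How (or whether) the result replaces the actual weight gradient is an
    assumption here; only the projection itself is shown.
    """
    U, _, Vh = torch.linalg.svd(grad, full_matrices=False)
    return U @ Vh


if __name__ == "__main__":
    feats = torch.randn(64, 256)                 # hypothetical Pre-SVD activations (N, d)
    print(covariance_condition_number(feats))    # conditioning of their covariance

    g = torch.randn(256, 256)                    # hypothetical Pre-SVD weight gradient
    r = nearest_orthogonal(g)
    print(torch.dist(r.T @ r, torch.eye(256)))   # ~0: the projected gradient is orthogonal
```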
| File | Type | License | Size | Format |
|---|---|---|---|---|
| 136840352.pdf (open access) | Refereed author's manuscript (post-print) | All rights reserved | 6.68 MB | Adobe PDF |
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.