Orthogonal SVD Covariance Conditioning and Latent Disentanglement

Song, Y.; Sebe, N.; Wang, W.

doi:10.1109/TPAMI.2022.3228979

Inserting an SVD meta-layer into neural networks is prone to make the covariance ill-conditioned, which could harm the model in the training stability and generalization abilities. In this article, we systematically study how to improve the covariance conditioning by enforcing orthogonality to the Pre-SVD layer. Existing orthogonal treatments on the weights are first investigated. However, these techniques can improve the conditioning but would hurt the performance. To avoid such a side effect, we propose the Nearest Orthogonal Gradient (NOG) and Optimal Learning Rate (OLR). The effectiveness of our methods is validated in two applications: decorrelated Batch Normalization (BN) and Global Covariance Pooling (GCP). Extensive experiments on visual recognition demonstrate that our methods can simultaneously improve covariance conditioning and generalization. The combinations with orthogonal weight can further boost the performance. Moreover, we show that our orthogonality techniques can benefit generative models for better latent disentanglement through a series of experiments on various benchmarks. Code is available at: https://github.com/KingJamesSong/OrthoImproveCond.

Orthogonal SVD Covariance Conditioning and Latent Disentanglement / Song, Y.; Sebe, N.; Wang, W.. - In: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. - ISSN 0162-8828. - 45:7(2023), pp. 8773-8786. [10.1109/TPAMI.2022.3228979]

Orthogonal SVD Covariance Conditioning and Latent Disentanglement

Song Y.;Sebe N.;Wang W.

2023-01-01

Abstract

Inserting an SVD meta-layer into neural networks is prone to make the covariance ill-conditioned, which could harm the model in the training stability and generalization abilities. In this article, we systematically study how to improve the covariance conditioning by enforcing orthogonality to the Pre-SVD layer. Existing orthogonal treatments on the weights are first investigated. However, these techniques can improve the conditioning but would hurt the performance. To avoid such a side effect, we propose the Nearest Orthogonal Gradient (NOG) and Optimal Learning Rate (OLR). The effectiveness of our methods is validated in two applications: decorrelated Batch Normalization (BN) and Global Covariance Pooling (GCP). Extensive experiments on visual recognition demonstrate that our methods can simultaneously improve covariance conditioning and generalization. The combinations with orthogonal weight can further boost the performance. Moreover, we show that our orthogonality techniques can benefit generative models for better latent disentanglement through a series of experiments on various benchmarks. Code is available at: https://github.com/KingJamesSong/OrthoImproveCond.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2023
			
	Titolo del periodico (Journal title)
	
				IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
			
	Numero e parte del fascicolo (Issue number and part)
	
				7
			
	DOI
	
				https://dx.doi.org/10.1109/TPAMI.2022.3228979
			
	Codice PubMed (PubMed Identifier)
	
				37015375
			
	Codice Scopus (Scopus identifier)
	
				2-s2.0-85144788675
			
	Codice WOS (WOS identifier)
	
				WOS:001004665900055
			
	Tutti gli autori
	
						Song, Y.; Sebe, N.; Wang, W.
					
	Citazione
	
				Orthogonal SVD Covariance Conditioning and Latent Disentanglement / Song, Y.; Sebe, N.; Wang, W.. - In: IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE. - ISSN 0162-8828. - 45:7(2023), pp. 8773-8786. [10.1109/TPAMI.2022.3228979]
			
	Appare nelle tipologie:
	
				03.1 Articolo su rivista (Journal article)

File in questo prodotto:

File	Dimensione	Formato
Orthogonal_SVD_PAMI22-compressed.pdf Solo gestori archivio Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 722.45 kB Formato Adobe PDF Visualizza/Apri	722.45 kB	Adobe PDF	Visualizza/Apri
Orthogonal_SVD_Covariance_Conditioning_and_Latent_Disentanglement.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 5.9 MB Formato Adobe PDF Visualizza/Apri	5.9 MB	Adobe PDF	Visualizza/Apri