Latent Traversals in Generative Models as Potential Flows

Song, Y.; Keller, A.; Sebe, N.; Welling, M.

Despite the significant recent progress in deep generative models, the underlying structure of their latent spaces is still poorly understood, thereby making the task of performing semantically meaningful latent traversals an open research challenge. Most prior work has aimed to solve this challenge by modeling latent structures linearly, and finding corresponding linear directions which result in 'disentangled' generations. In this work, we instead propose to model latent structures with a learned dynamic potential landscape, thereby performing latent traversals as the flow of samples down the landscape's gradient. Inspired by physics, optimal transport, and neuroscience, these potential landscapes are learned as physically realistic partial differential equations, thereby allowing them to flexibly vary over both space and time. To achieve disentanglement, multiple potentials are learned simultaneously, and are constrained by a classifier to be distinct and semantically self-consistent. Experimentally, we demonstrate that our method achieves both more qualitatively and quantitatively disentangled trajectories than state-of-the-art baselines. Further, we demonstrate that our method can be integrated as a regularization term during training, thereby acting as an inductive bias towards the learning of structured representations, ultimately improving model likelihood on similarly structured data. Code is available at https://github.com/KingJamesSong/PDETraversal.

Latent Traversals in Generative Models as Potential Flows / Song, Y.; Keller, A.; Sebe, N.; Welling, M.. - 202:(2023), pp. 32288-32303. ( 40th International Conference on Machine Learning, ICML 2023 Honolulu, Hawaii, USA 23-29 July 2023).

Latent Traversals in Generative Models as Potential Flows

Song, Y.;Keller, A.;Sebe, N.;Welling, M.

2023-01-01

Abstract

Despite the significant recent progress in deep generative models, the underlying structure of their latent spaces is still poorly understood, thereby making the task of performing semantically meaningful latent traversals an open research challenge. Most prior work has aimed to solve this challenge by modeling latent structures linearly, and finding corresponding linear directions which result in 'disentangled' generations. In this work, we instead propose to model latent structures with a learned dynamic potential landscape, thereby performing latent traversals as the flow of samples down the landscape's gradient. Inspired by physics, optimal transport, and neuroscience, these potential landscapes are learned as physically realistic partial differential equations, thereby allowing them to flexibly vary over both space and time. To achieve disentanglement, multiple potentials are learned simultaneously, and are constrained by a classifier to be distinct and semantically self-consistent. Experimentally, we demonstrate that our method achieves both more qualitatively and quantitatively disentangled trajectories than state-of-the-art baselines. Further, we demonstrate that our method can be integrated as a regularization term during training, thereby acting as an inductive bias towards the learning of structured representations, ultimately improving model likelihood on similarly structured data. Code is available at https://github.com/KingJamesSong/PDETraversal.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2023
			
	Titolo del volume (Proceedings title)
	
				Proceedings of Machine Learning Research
			
	Luogo di edizione (Place of publication)
	
				New York
			
	Casa editrice (Publisher)
	
				ML Research Press
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85174394456
			
	Codice WOS (WOS identifier)
	
				WOS:001372498800011
			
	Tutti gli autori
	
						Song, Y.; Keller, A.; Sebe, N.; Welling, M.
					
	Citazione
	
				Latent Traversals in Generative Models as Potential Flows / Song, Y.; Keller, A.; Sebe, N.; Welling, M.. - 202:(2023), pp. 32288-32303. ( 40th International Conference on Machine Learning, ICML 2023 Honolulu, Hawaii, USA 23-29 July 2023).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
song23d (1).pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 8.15 MB Formato Adobe PDF Visualizza/Apri	8.15 MB	Adobe PDF	Visualizza/Apri