PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image Translation

Ren, Bin; Tang, Hao; Wang, Yiming; Xia, Li; Wang, Wei; Sebe, Nicu

doi:10.1109/ICASSP49357.2023.10095449

For semantic-guided cross-view image translation, it is crucial to learn where to sample pixels from the source view image and where to reallocate them guided by the target view semantic map, especially when there is little overlap or drastic view difference between the source and target images. Hence, one not only needs to encode the long- range dependencies among pixels in both the source view image and target view semantic map but also needs to translate these learned dependencies. To this end, we propose a novel generative adversarial network, PI-Trans, which mainly consists of a novel Parallel-ConvMLP module and an Implicit Transformation module at multiple semantic levels. Extensive experimental results show that PI-Trans achieves the best qualitative and quantitative performance by a large margin compared to the state-of-the-art methods on two challenging datasets. The source code is available at https://github.com/Amazingren/PI-Trans.

PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image Translation / Ren, Bin; Tang, Hao; Wang, Yiming; Li, Xia; Wang, Wei; Sebe, Nicu. - 2023-:(2023), pp. 1-5. ( 48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023 Rhodes Island, Greece 04-10 June 2023) [10.1109/ICASSP49357.2023.10095449].

PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image Translation

Ren, Bin;Tang, Hao;Wang, Yiming;Li, Xia;Wang, Wei;Sebe, Nicu

2023-01-01

Abstract

For semantic-guided cross-view image translation, it is crucial to learn where to sample pixels from the source view image and where to reallocate them guided by the target view semantic map, especially when there is little overlap or drastic view difference between the source and target images. Hence, one not only needs to encode the long- range dependencies among pixels in both the source view image and target view semantic map but also needs to translate these learned dependencies. To this end, we propose a novel generative adversarial network, PI-Trans, which mainly consists of a novel Parallel-ConvMLP module and an Implicit Transformation module at multiple semantic levels. Extensive experimental results show that PI-Trans achieves the best qualitative and quantitative performance by a large margin compared to the state-of-the-art methods on two challenging datasets. The source code is available at https://github.com/Amazingren/PI-Trans.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2023
			
	Titolo del volume (Proceedings title)
	
				IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
			
	Luogo di edizione (Place of publication)
	
				Piscataway, NJ USA
			
	Casa editrice (Publisher)
	
				IEEE
			
	ISBN
	
				978-1-7281-6327-7
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85177569282
			
	Tutti gli autori
	
						Ren, Bin; Tang, Hao; Wang, Yiming; Li, Xia; Wang, Wei; Sebe, Nicu
					
	Citazione
	
				PI-Trans: Parallel-Convmlp and Implicit-Transformation Based Gan for Cross-View Image Translation / Ren, Bin; Tang, Hao; Wang, Yiming; Li, Xia; Wang, Wei; Sebe, Nicu. - 2023-:(2023), pp. 1-5. ( 48th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2023 Rhodes Island, Greece 04-10 June 2023) [10.1109/ICASSP49357.2023.10095449].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
PI-Trans_Parallel-Convmlp_and_Implicit-Transformation_Based_Gan_for_Cross-View_Image_Translation.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 6.05 MB Formato Adobe PDF Visualizza/Apri	6.05 MB	Adobe PDF	Visualizza/Apri