SinGAN-3D: towards unconditioned 3D shapes generation

Sambugaro, Zeno; Merlin, Marco; Conci, Nicola

doi:10.1109/IVMSP54334.2022.9816357

We present SinGAN-3D, a variation of the deep neural network architecture presented originally by SinGAN, for the generation of 3D contents, starting from a single three-dimensional voxelized model. Our network uses a pyramid of 3D convolutional networks to model the third dimension and exploits periodic activation functions to capture the latent structure of the input model. The approach can synthesize contents at different resolutions and aspect ratios, and can be extended to implement super resolution. To evaluate the performances of the proposed model we use the Single Image Freche't distance, and the multiscale structural similarity index. The metrics highlight the similarities between the synthesised-three dimensional assets and their corresponding original template. An additional user study has also been conducted to assess the quality of the generated shapes. The code can be found at: https://github.com/zenos4mbu/SinGAN3D

SinGAN-3D: towards unconditioned 3D shapes generation / Sambugaro, Zeno; Merlin, Marco; Conci, Nicola. - (2022), pp. 1-5. (Intervento presentato al convegno 14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022 tenutosi a grc nel 2022) [10.1109/IVMSP54334.2022.9816357].

SinGAN-3D: towards unconditioned 3D shapes generation

Sambugaro, Zeno;Merlin, Marco;Conci, Nicola

2022-01-01

Abstract

We present SinGAN-3D, a variation of the deep neural network architecture presented originally by SinGAN, for the generation of 3D contents, starting from a single three-dimensional voxelized model. Our network uses a pyramid of 3D convolutional networks to model the third dimension and exploits periodic activation functions to capture the latent structure of the input model. The approach can synthesize contents at different resolutions and aspect ratios, and can be extended to implement super resolution. To evaluate the performances of the proposed model we use the Single Image Freche't distance, and the multiscale structural similarity index. The metrics highlight the similarities between the synthesised-three dimensional assets and their corresponding original template. An additional user study has also been conducted to assess the quality of the generated shapes. The code can be found at: https://github.com/zenos4mbu/SinGAN3D

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2022
			
	Titolo del volume (Proceedings title)
	
				IVMSP 2022 - 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop
			
	Luogo di edizione (Place of publication)
	
				piscataway NJ
			
	Casa editrice (Publisher)
	
				Institute of Electrical and Electronics Engineers Inc.
			
	ISBN
	
				978-1-6654-7822-9
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85135175959
			
	Codice WOS (WOS identifier)
	
				WOS:000853856800070
			
	Tutti gli autori
	
						Sambugaro, Zeno; Merlin, Marco; Conci, Nicola
					
	Citazione
	
				SinGAN-3D: towards unconditioned 3D shapes generation / Sambugaro, Zeno; Merlin, Marco; Conci, Nicola. - (2022), pp. 1-5. (Intervento presentato al  convegno 14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022 tenutosi a grc nel 2022) [10.1109/IVMSP54334.2022.9816357].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
SinganModel4.pdf Solo gestori archivio Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 2.24 MB Formato Adobe PDF Visualizza/Apri	2.24 MB	Adobe PDF	Visualizza/Apri