We present SinGAN-3D, a variation of the deep neural network architecture presented originally by SinGAN, for the generation of 3D contents, starting from a single three-dimensional voxelized model. Our network uses a pyramid of 3D convolutional networks to model the third dimension and exploits periodic activation functions to capture the latent structure of the input model. The approach can synthesize contents at different resolutions and aspect ratios, and can be extended to implement super resolution. To evaluate the performances of the proposed model we use the Single Image Freche't distance, and the multiscale structural similarity index. The metrics highlight the similarities between the synthesised-three dimensional assets and their corresponding original template. An additional user study has also been conducted to assess the quality of the generated shapes. The code can be found at: https://github.com/zenos4mbu/SinGAN3D

SinGAN-3D: towards unconditioned 3D shapes generation / Sambugaro, Zeno; Merlin, Marco; Conci, Nicola. - (2022), pp. 1-5. (Intervento presentato al convegno 14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022 tenutosi a grc nel 2022) [10.1109/IVMSP54334.2022.9816357].

SinGAN-3D: towards unconditioned 3D shapes generation

Sambugaro, Zeno;Conci, Nicola
2022-01-01

Abstract

We present SinGAN-3D, a variation of the deep neural network architecture presented originally by SinGAN, for the generation of 3D contents, starting from a single three-dimensional voxelized model. Our network uses a pyramid of 3D convolutional networks to model the third dimension and exploits periodic activation functions to capture the latent structure of the input model. The approach can synthesize contents at different resolutions and aspect ratios, and can be extended to implement super resolution. To evaluate the performances of the proposed model we use the Single Image Freche't distance, and the multiscale structural similarity index. The metrics highlight the similarities between the synthesised-three dimensional assets and their corresponding original template. An additional user study has also been conducted to assess the quality of the generated shapes. The code can be found at: https://github.com/zenos4mbu/SinGAN3D
2022
IVMSP 2022 - 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop
piscataway NJ
Institute of Electrical and Electronics Engineers Inc.
978-1-6654-7822-9
Sambugaro, Zeno; Merlin, Marco; Conci, Nicola
SinGAN-3D: towards unconditioned 3D shapes generation / Sambugaro, Zeno; Merlin, Marco; Conci, Nicola. - (2022), pp. 1-5. (Intervento presentato al convegno 14th IEEE Image, Video, and Multidimensional Signal Processing Workshop, IVMSP 2022 tenutosi a grc nel 2022) [10.1109/IVMSP54334.2022.9816357].
File in questo prodotto:
File Dimensione Formato  
SinganModel4.pdf

Solo gestori archivio

Tipologia: Post-print referato (Refereed author’s manuscript)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 2.24 MB
Formato Adobe PDF
2.24 MB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/354563
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact