Recently, diffusion models have made significant strides in synthesizing realistic 2D human images based on provided text prompts. Building upon this, researchers have extended 2D text-to-image diffusion models into the 3D domain for generating human textures (UV Maps). However, some important problems about UV Map Generative models are still not solved, i.e., how to generate personalized texture maps for any given face image, and how to define and evaluate the quality of these generated texture maps. To solve the above problems, we introduce a novel method, UVMap-ID, which is a controllable and personalized UV Map generative model. Unlike traditional large-scale training methods in 2D, we propose to fine-tune a pre-trained text-to-image diffusion model which is integrated with a face fusion module for achieving ID-driven customized generation. To support the finetuning strategy, we introduce a small-scale attribute-balanced training dataset, including high-quality textures with labeled text and Face ID. Additionally, we introduce some metrics to evaluate the multiple aspects of the textures. Finally, both quantitative and qualitative analyses demonstrate the effectiveness of our method in controllable and personalized UV Map generation.

UVMap-ID: A Controllable and Personalized UV Map Generative Model / Weijie Wang, Authors:; Zhang, Jichao; Liu, Chang; Li, Xia; Xu, Xingqian; Shi, Humphrey; Sebe, Nicu; Lepri, Bruno. - (2024), pp. 10725-10734. ( 32nd ACM International Conference on Multimedia, MM 2024 aus 2024) [10.1145/3664647.3680861].

UVMap-ID: A Controllable and Personalized UV Map Generative Model

Jichao Zhang;Chang Liu;Nicu Sebe;Bruno Lepri
2024-01-01

Abstract

Recently, diffusion models have made significant strides in synthesizing realistic 2D human images based on provided text prompts. Building upon this, researchers have extended 2D text-to-image diffusion models into the 3D domain for generating human textures (UV Maps). However, some important problems about UV Map Generative models are still not solved, i.e., how to generate personalized texture maps for any given face image, and how to define and evaluate the quality of these generated texture maps. To solve the above problems, we introduce a novel method, UVMap-ID, which is a controllable and personalized UV Map generative model. Unlike traditional large-scale training methods in 2D, we propose to fine-tune a pre-trained text-to-image diffusion model which is integrated with a face fusion module for achieving ID-driven customized generation. To support the finetuning strategy, we introduce a small-scale attribute-balanced training dataset, including high-quality textures with labeled text and Face ID. Additionally, we introduce some metrics to evaluate the multiple aspects of the textures. Finally, both quantitative and qualitative analyses demonstrate the effectiveness of our method in controllable and personalized UV Map generation.
2024
MM '24: Proceedings of the 32nd ACM International Conference on Multimedia
New York
Association for Computing Machinery, Inc
9798400706868
Weijie Wang, Authors:; Zhang, Jichao; Liu, Chang; Li, Xia; Xu, Xingqian; Shi, Humphrey; Sebe, Nicu; Lepri, Bruno
UVMap-ID: A Controllable and Personalized UV Map Generative Model / Weijie Wang, Authors:; Zhang, Jichao; Liu, Chang; Li, Xia; Xu, Xingqian; Shi, Humphrey; Sebe, Nicu; Lepri, Bruno. - (2024), pp. 10725-10734. ( 32nd ACM International Conference on Multimedia, MM 2024 aus 2024) [10.1145/3664647.3680861].
File in questo prodotto:
File Dimensione Formato  
3664647.3680861 (1)-compressed.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 1.45 MB
Formato Adobe PDF
1.45 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/439532
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
  • OpenAlex ND
social impact