Visible thermal person re-identification (VT-ReID) suffers from inter-modality discrepancy and intra-identity variations. Distribution alignment is a popular solution for VT-ReID, however, it is usually restricted to the influence of the intra-identity variations. In this paper, we propose the Cross-Modality Earth Mover’s Distance (CM-EMD) that can alleviate the impact of the intra-identity variations during modality alignment. CM-EMD selects an optimal transport strategy and assigns high weights to pairs that have a smaller intra-identity variation. In this manner, the model will focus on reducing the inter-modality discrepancy while paying less attention to intra-identity variations, leading to a more effective modality alignment. Moreover, we introduce two techniques to improve the advantage of CM-EMD. First, Cross-Modality Discrimination Learning (CM-DL) is designed to overcome the discrimination degradation problem caused by modality alignment. By reducing the ratio between intra-identity and inter-identity variances, CM-DL leads the model to learn more discriminative representations. Second, we construct the Multi-Granularity Structure (MGS), enabling us to align modalities from both coarse- and fine-grained levels with the proposed CM-EMD. Extensive experiments show the benefits of the proposed CM-EMD and its auxiliary techniques (CM-DL and MGS). Our method achieves state-of-the-art performance on two VT-ReID benchmarks.

Cross-Modality Earth Mover’s Distance for Visible Thermal Person Re-identification / Ling, Y.; Zhong, Z.; Luo, Z.; Yang, F.; Cao, D.; Lin, Y.; Li, S.; Sebe, N.. - 37:(2023), pp. 1631-1639. (Intervento presentato al convegno 37th AAAI Conference on Artificial Intelligence, AAAI 2023 tenutosi a usa nel 7 - 14 February 2023) [10.1609/aaai.v37i2.25250].

Cross-Modality Earth Mover’s Distance for Visible Thermal Person Re-identification

Zhong Z.;Yang F.;Sebe N.
2023-01-01

Abstract

Visible thermal person re-identification (VT-ReID) suffers from inter-modality discrepancy and intra-identity variations. Distribution alignment is a popular solution for VT-ReID, however, it is usually restricted to the influence of the intra-identity variations. In this paper, we propose the Cross-Modality Earth Mover’s Distance (CM-EMD) that can alleviate the impact of the intra-identity variations during modality alignment. CM-EMD selects an optimal transport strategy and assigns high weights to pairs that have a smaller intra-identity variation. In this manner, the model will focus on reducing the inter-modality discrepancy while paying less attention to intra-identity variations, leading to a more effective modality alignment. Moreover, we introduce two techniques to improve the advantage of CM-EMD. First, Cross-Modality Discrimination Learning (CM-DL) is designed to overcome the discrimination degradation problem caused by modality alignment. By reducing the ratio between intra-identity and inter-identity variances, CM-DL leads the model to learn more discriminative representations. Second, we construct the Multi-Granularity Structure (MGS), enabling us to align modalities from both coarse- and fine-grained levels with the proposed CM-EMD. Extensive experiments show the benefits of the proposed CM-EMD and its auxiliary techniques (CM-DL and MGS). Our method achieves state-of-the-art performance on two VT-ReID benchmarks.
2023
Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
New York
AAAI Press
978-1-57735-880-0
Ling, Y.; Zhong, Z.; Luo, Z.; Yang, F.; Cao, D.; Lin, Y.; Li, S.; Sebe, N.
Cross-Modality Earth Mover’s Distance for Visible Thermal Person Re-identification / Ling, Y.; Zhong, Z.; Luo, Z.; Yang, F.; Cao, D.; Lin, Y.; Li, S.; Sebe, N.. - 37:(2023), pp. 1631-1639. (Intervento presentato al convegno 37th AAAI Conference on Artificial Intelligence, AAAI 2023 tenutosi a usa nel 7 - 14 February 2023) [10.1609/aaai.v37i2.25250].
File in questo prodotto:
File Dimensione Formato  
25250-Article Text-29313-1-2-20230626.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 385.41 kB
Formato Adobe PDF
385.41 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/388609
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 11
  • ???jsp.display-item.citation.isi??? 8
  • OpenAlex ND
social impact