PDiscoNet: Semantically consistent part discovery for fine-grained recognition

IRIS

Fine-grained classification often requires recognizing specific object parts, such as beak shape and wing patterns for birds. Encouraging a fine-grained classification model to first detect such parts and then using them to infer the class could help us gauge whether the model is indeed looking at the right details better than with interpretability methods that provide a single attribution map. We propose PDiscoNet to discover object parts by using only image-level class labels along with priors encouraging the parts to be: discriminative, compact, distinct from each other, equivariant to rigid transforms, and active in at least some of the images. In addition to using the appropriate losses to encode these priors, we propose to use part-dropout, where full part feature vectors are dropped at once to prevent a single part from dominating in the classification, and part feature vector modulation, which makes the information coming from each part distinct from the perspective of the classifier. Our results on CUB, CelebA, and PartImageNet show that the proposed method provides substantially better part discovery performance than previous methods while not requiring any additional hyper-parameter tuning and without penalizing the classification performance. The code is available at https://github.com/robertdvdk/part_detection

PDiscoNet: Semantically consistent part discovery for fine-grained recognition / van der Klis, Robert; Alaniz, Stephan; Mancini, Massimiliano; Dantas, Cassio F.; Ienco, Dino; Akata, Zeynep; Marcos, Diego. - (2023), pp. 1866-1876. (Intervento presentato al convegno 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 tenutosi a Paris, France nel 2nd -6th October 2023) [10.1109/ICCV51070.2023.00179].

PDiscoNet: Semantically consistent part discovery for fine-grained recognition

van der Klis, Robert;Alaniz, Stephan;Mancini, Massimiliano;Dantas, Cassio F.;Ienco, Dino;Akata, Zeynep;Marcos, Diego

2023-01-01

Abstract

Fine-grained classification often requires recognizing specific object parts, such as beak shape and wing patterns for birds. Encouraging a fine-grained classification model to first detect such parts and then using them to infer the class could help us gauge whether the model is indeed looking at the right details better than with interpretability methods that provide a single attribution map. We propose PDiscoNet to discover object parts by using only image-level class labels along with priors encouraging the parts to be: discriminative, compact, distinct from each other, equivariant to rigid transforms, and active in at least some of the images. In addition to using the appropriate losses to encode these priors, we propose to use part-dropout, where full part feature vectors are dropped at once to prevent a single part from dominating in the classification, and part feature vector modulation, which makes the information coming from each part distinct from the perspective of the classifier. Our results on CUB, CelebA, and PartImageNet show that the proposed method provides substantially better part discovery performance than previous methods while not requiring any additional hyper-parameter tuning and without penalizing the classification performance. The code is available at https://github.com/robertdvdk/part_detection

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2023
			
	Titolo del volume (Proceedings title)
	
				2023 IEEE/CVF International Conference on Computer Vision (ICCV)
			
	Luogo di edizione (Place of publication)
	
				Piscataway, NJ USA
			
	Casa editrice (Publisher)
	
				IEEE Computer Society
			
	ISBN
	
				979-8-3503-0718-4
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85182560302
			
	Codice WOS (WOS identifier)
	
				WOS:001159644302013
			
	Tutti gli autori
	
						van der Klis, Robert; Alaniz, Stephan; Mancini, Massimiliano; Dantas, Cassio F.; Ienco, Dino; Akata, Zeynep; Marcos, Diego
					
	Citazione
	
				PDiscoNet: Semantically consistent part discovery for fine-grained recognition / van der Klis, Robert; Alaniz, Stephan; Mancini, Massimiliano; Dantas, Cassio F.; Ienco, Dino; Akata, Zeynep; Marcos, Diego. - (2023), pp. 1866-1876. (Intervento presentato al  convegno 2023 IEEE/CVF International Conference on Computer Vision, ICCV 2023 tenutosi a Paris, France nel 2nd -6th October 2023) [10.1109/ICCV51070.2023.00179].
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
van_der_Klis_PDiscoNet_Semantically_consistent_part_discovery_for_fine-grained_recognition_ICCV_2023_paper.pdf accesso aperto Descrizione: ICCV paper Open Access version Tipologia: Post-print referato (Refereed author’s manuscript) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 7.53 MB Formato Adobe PDF Visualizza/Apri	7.53 MB	Adobe PDF	Visualizza/Apri
PDiscoNet_Semantically_consistent_part_discovery_for_fine-grained_recognition.pdf Solo gestori archivio Tipologia: Versione editoriale (Publisher’s layout) Licenza: Tutti i diritti riservati (All rights reserved) Dimensione 8.07 MB Formato Adobe PDF Visualizza/Apri	8.07 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/400790

Citazioni

ND

8

6

ND

social impact