In crowd counting datasets, the location labels are costly, yet, they are not taken into the evaluation metrics. Besides, existing multi-task approaches employ high-level tasks to improve counting accuracy. This research tendency increases the demand for more annotations. In this paper, we propose a weakly-supervised counting network, which directly regresses the crowd numbers without the location supervision. Moreover, we train the network to count by exploiting the relationship among the images. We propose a soft-label sorting network along with the counting network, which sorts the given images by their crowd numbers. The sorting network drives the shared backbone CNN model to obtain density-sensitive ability explicitly. Therefore, the proposed method improves the counting accuracy by utilizing the information hidden in crowd numbers, rather than learning from extra labels, such as locations and perspectives. We evaluate our proposed method on three crowd counting datasets, and the performance of our method plays favorably against the fully supervised state-of-the-art approaches.

Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations / Yang, Yifan; Li, Guorong; Wu, Zhe; Su, Li; Huang, Qingming; Sebe, Nicu. - 12353:(2020), pp. 1-17. (Intervento presentato al convegno ECCV 2020 tenutosi a online (Glasgow, UK) nel 23rd-28th August 2020) [10.1007/978-3-030-58598-3_1].

Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations

Sebe, Nicu
2020-01-01

Abstract

In crowd counting datasets, the location labels are costly, yet, they are not taken into the evaluation metrics. Besides, existing multi-task approaches employ high-level tasks to improve counting accuracy. This research tendency increases the demand for more annotations. In this paper, we propose a weakly-supervised counting network, which directly regresses the crowd numbers without the location supervision. Moreover, we train the network to count by exploiting the relationship among the images. We propose a soft-label sorting network along with the counting network, which sorts the given images by their crowd numbers. The sorting network drives the shared backbone CNN model to obtain density-sensitive ability explicitly. Therefore, the proposed method improves the counting accuracy by utilizing the information hidden in crowd numbers, rather than learning from extra labels, such as locations and perspectives. We evaluate our proposed method on three crowd counting datasets, and the performance of our method plays favorably against the fully supervised state-of-the-art approaches.
2020
Computer Vision: 16th European Conference Proceedings, Part 8.
Cham, CH
Springer
978-3-030-58597-6
978-3-030-58598-3
Yang, Yifan; Li, Guorong; Wu, Zhe; Su, Li; Huang, Qingming; Sebe, Nicu
Weakly-Supervised Crowd Counting Learns from Sorting Rather Than Locations / Yang, Yifan; Li, Guorong; Wu, Zhe; Su, Li; Huang, Qingming; Sebe, Nicu. - 12353:(2020), pp. 1-17. (Intervento presentato al convegno ECCV 2020 tenutosi a online (Glasgow, UK) nel 23rd-28th August 2020) [10.1007/978-3-030-58598-3_1].
File in questo prodotto:
File Dimensione Formato  
Guorong123530001.pdf

Open Access dal 01/01/2023

Tipologia: Post-print referato (Refereed author’s manuscript)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.42 MB
Formato Adobe PDF
1.42 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/284572
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 35
  • ???jsp.display-item.citation.isi??? ND
social impact