Since high-level events in images (e.g. "dinner", "motorcycle stunt", etc.) may not be directly correlated with their visual appearance, low-level visual features do not carry enough semantics to classify such events satisfactorily. This paper explores a fully compositional approach for event based image retrieval which is able to overcome this shortcoming. Furthermore, the approach is fully scalable in both adding new events and new primitives. Using the Pascal VOC 2007 dataset, our contributions are the following: (i) We apply the Faceted Analysis-Synthesis Theory (FAST) to build a hierarchy of 228 high-level events. (ii) We show that rule-based classifiers are better suited for compositional recognition of events than SVMs. In addition, rule-based classifiers provide semantically meaningful event descriptions which help bridging the semantic gap. (iii) We demonstrate that compositionality enables unseen event recognition: we can use rules learned from non-visual cues, together with object detectors to get reasonable performance on unseen event categories.

(Unseen) Event Recognition via Semantic Compositionality / Stottinger, Julian; Giunchiglia, Fausto; Sebe, Nicu; Pandey, Anand K.; Uijlings, Jasper R. R.. - ELETTRONICO. - (2012).

(Unseen) Event Recognition via Semantic Compositionality

Stottinger, Julian
Primo
;
Giunchiglia, Fausto
Ultimo
;
Sebe, Nicu
Penultimo
;
Pandey, Anand K.;
2012-01-01

Abstract

Since high-level events in images (e.g. "dinner", "motorcycle stunt", etc.) may not be directly correlated with their visual appearance, low-level visual features do not carry enough semantics to classify such events satisfactorily. This paper explores a fully compositional approach for event based image retrieval which is able to overcome this shortcoming. Furthermore, the approach is fully scalable in both adding new events and new primitives. Using the Pascal VOC 2007 dataset, our contributions are the following: (i) We apply the Faceted Analysis-Synthesis Theory (FAST) to build a hierarchy of 228 high-level events. (ii) We show that rule-based classifiers are better suited for compositional recognition of events than SVMs. In addition, rule-based classifiers provide semantically meaningful event descriptions which help bridging the semantic gap. (iii) We demonstrate that compositionality enables unseen event recognition: we can use rules learned from non-visual cues, together with object detectors to get reasonable performance on unseen event categories.
2012
Trento
Università degli Studi di Trento, Dipartimento di Ingegneria e Scienza dell'Informazione
(Unseen) Event Recognition via Semantic Compositionality / Stottinger, Julian; Giunchiglia, Fausto; Sebe, Nicu; Pandey, Anand K.; Uijlings, Jasper R. R.. - ELETTRONICO. - (2012).
Stottinger, Julian; Giunchiglia, Fausto; Sebe, Nicu; Pandey, Anand K.; Uijlings, Jasper R. R.
File in questo prodotto:
File Dimensione Formato  
techRep020.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 1.09 MB
Formato Adobe PDF
1.09 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/391891
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact