This work explores the linguistic distinction between count and mass nouns in the visual modality. Since the former class typically refers to well-defined, countable objects, with the latter prototypically including less countable substances, we explore to which extent the linguistic distinction is grounded in the visual representations of the entities denoted by count/mass nouns. Using visual features extracted from a state-of-the-art Convolutional Neural Network (CNN), we show that the entities referred to as mass exhibit a lower variance both internally (i.e. intra-image) and externally (i.e. inter-image) compared to count. That is, various instances of substances are internally more homogeneous and externally more consistent to each other than are count. We compare variance across various CNN layers and show that it is indicative of the categorization when low-level features of the images are used, whereas any effect disappears when experimenting with higher-level, more abstract representations.
Can you see the (linguistic) difference? Exploring mass/count distinction in Vision / Addison Smith, David; Pezzelle, Sandro; Franzon, Francesca; Zanini, Chiara; Bernardi, Raffaella. - ELETTRONICO. - (2017), pp. 1-8. (Intervento presentato al convegno IWCS 2017 tenutosi a Montpellier, France nel 19th-22th September 2017).
Can you see the (linguistic) difference? Exploring mass/count distinction in Vision
Sandro Pezzelle;Raffaella Bernardi
2017-01-01
Abstract
This work explores the linguistic distinction between count and mass nouns in the visual modality. Since the former class typically refers to well-defined, countable objects, with the latter prototypically including less countable substances, we explore to which extent the linguistic distinction is grounded in the visual representations of the entities denoted by count/mass nouns. Using visual features extracted from a state-of-the-art Convolutional Neural Network (CNN), we show that the entities referred to as mass exhibit a lower variance both internally (i.e. intra-image) and externally (i.e. inter-image) compared to count. That is, various instances of substances are internally more homogeneous and externally more consistent to each other than are count. We compare variance across various CNN layers and show that it is indicative of the categorization when low-level features of the images are used, whereas any effect disappears when experimenting with higher-level, more abstract representations.File | Dimensione | Formato | |
---|---|---|---|
W17-6939-mass.pdf
accesso aperto
Descrizione: articolo principale
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
1.35 MB
Formato
Adobe PDF
|
1.35 MB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione