Distributional semantics models (DSMs) are known to produce excellent representations of word meaning, which correlate with a range of behavioural data. As lexical representations, they have been said to be fundamentally different from truth-theoretic models of semantics, where meaning is defined as a correspondence relation to the world. There are two main aspects to this difference: a) DSMs are built over corpus data which may or may not reflect `what is in the world'; b) they are built from word co-occurrences, that is, from lexical types rather than entities and sets. In this paper, we inspect the properties of a distributional model built over a set-theoretic approximation of `the real world'. To achieve this, we take the annotation a large database of images marked with objects, attributes and relations, convert the data into a representation akin to first-order logic and build several distributional models using various combinations of features. We evaluate those models over both relatedness and similarity datasets, demonstrating their effectiveness in standard evaluations. This allows us to conclude that, despite prior claims, truth-theoretic models are good candidates for building graded lexical representations of meaning.

Distributional Semantics in the Real World: Building Word Vector Representations from a Truth-Theoretic Model / Kuzmenko, Elizaveta; Herbelot, Aurelie. - (2019), pp. 16-23. (Intervento presentato al convegno International Conference on Computational Semantics tenutosi a Gothenburg, Sweden nel 23th May-27th May 2019).

Distributional Semantics in the Real World: Building Word Vector Representations from a Truth-Theoretic Model

Herbelot, Aurelie
2019-01-01

Abstract

Distributional semantics models (DSMs) are known to produce excellent representations of word meaning, which correlate with a range of behavioural data. As lexical representations, they have been said to be fundamentally different from truth-theoretic models of semantics, where meaning is defined as a correspondence relation to the world. There are two main aspects to this difference: a) DSMs are built over corpus data which may or may not reflect `what is in the world'; b) they are built from word co-occurrences, that is, from lexical types rather than entities and sets. In this paper, we inspect the properties of a distributional model built over a set-theoretic approximation of `the real world'. To achieve this, we take the annotation a large database of images marked with objects, attributes and relations, convert the data into a representation akin to first-order logic and build several distributional models using various combinations of features. We evaluate those models over both relatedness and similarity datasets, demonstrating their effectiveness in standard evaluations. This allows us to conclude that, despite prior claims, truth-theoretic models are good candidates for building graded lexical representations of meaning.
2019
Proceedings of the 13th International Conference on Computational Semantics - Short Papers
EastStroudsburg
Association for Computational Linguistics
Kuzmenko, Elizaveta; Herbelot, Aurelie
Distributional Semantics in the Real World: Building Word Vector Representations from a Truth-Theoretic Model / Kuzmenko, Elizaveta; Herbelot, Aurelie. - (2019), pp. 16-23. (Intervento presentato al convegno International Conference on Computational Semantics tenutosi a Gothenburg, Sweden nel 23th May-27th May 2019).
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/242680
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact