Computational models of meaning trained on naturally occurring text successfully model human performance on tasks involving simple similarity measures. But they characterize meaning in terms of undifferentiated bags of words or topical dimensions. This has led some to question their psychological plausibility. Smaller-scale highly structured models also simulate participants' experimental behaviour, but they rely on unrealistically clean input that is typically prepared by the experimenter. We present a fully automatic method for extracting a structured and comprehensive set of concept descriptions directly from an English part-of-speech-tagged corpus. Concepts are characterized by weighted properties, enriched with concept-property types that approximate classical relations such as hypernymy and function. Our model outperforms comparable algorithms in cognitive tasks pertaining not only to concept-internal structures (discovering properties of concepts, grouping properties by property type) but also to inter-concept relations (clustering into superordinates), suggesting the empirical validity of the property-based approach.

Strudel: a corpus-based semantic model based on properties and types

Baroni, Marco;Murphy, Brian Edmond;Barbu, Eduard;Poesio, Massimo
2010-01-01

Abstract

Computational models of meaning trained on naturally occurring text successfully model human performance on tasks involving simple similarity measures. But they characterize meaning in terms of undifferentiated bags of words or topical dimensions. This has led some to question their psychological plausibility. Smaller-scale highly structured models also simulate participants' experimental behaviour, but they rely on unrealistically clean input that is typically prepared by the experimenter. We present a fully automatic method for extracting a structured and comprehensive set of concept descriptions directly from an English part-of-speech-tagged corpus. Concepts are characterized by weighted properties, enriched with concept-property types that approximate classical relations such as hypernymy and function. Our model outperforms comparable algorithms in cognitive tasks pertaining not only to concept-internal structures (discovering properties of concepts, grouping properties by property type) but also to inter-concept relations (clustering into superordinates), suggesting the empirical validity of the property-based approach.
2010
no. 2
Baroni, Marco; Murphy, Brian Edmond; Barbu, Eduard; Poesio, Massimo
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/77360
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact