Compositional distributional semantic models (CDSMs) have successfully been applied to the task of predicting the meaning of a range of linguistic constructions. Their performance on semicompositional word formation process of (morphological) derivation, however, has been extremely variable, with no large-scale empirical investigation to date. This paper fills that gap, performing an analysis of CDSM predictions on a large dataset (over 30,000 German derivationally related word pairs). We use linear regression models to analyze CDSM performance and obtain insights into the linguistic factors that influence how predictable the distributional context of a derived word is going to be. We identify various such factors, notably part of speech, argument structure, and semantic regularity.

Predictability of distributional semantics in derivational word formation / Pad'O, Sebastian; Herbelot, Aurelie Georgette Geraldine; Kisselew, Max; Snajder, Jan. - (2016), pp. 1285-1296. (Intervento presentato al convegno COLING tenutosi a Osaka, Japan nel 13-16 December 2016).

Predictability of distributional semantics in derivational word formation

Herbelot, Aurelie Georgette Geraldine;
2016-01-01

Abstract

Compositional distributional semantic models (CDSMs) have successfully been applied to the task of predicting the meaning of a range of linguistic constructions. Their performance on semicompositional word formation process of (morphological) derivation, however, has been extremely variable, with no large-scale empirical investigation to date. This paper fills that gap, performing an analysis of CDSM predictions on a large dataset (over 30,000 German derivationally related word pairs). We use linear regression models to analyze CDSM performance and obtain insights into the linguistic factors that influence how predictable the distributional context of a derived word is going to be. We identify various such factors, notably part of speech, argument structure, and semantic regularity.
2016
Proceedings of the 26th International Conference on Computational Linguistics (COLING 2016)
Sheffield, UK
International Committee on Computational Linguistics
978-487974702-0
Pad'O, Sebastian; Herbelot, Aurelie Georgette Geraldine; Kisselew, Max; Snajder, Jan
Predictability of distributional semantics in derivational word formation / Pad'O, Sebastian; Herbelot, Aurelie Georgette Geraldine; Kisselew, Max; Snajder, Jan. - (2016), pp. 1285-1296. (Intervento presentato al convegno COLING tenutosi a Osaka, Japan nel 13-16 December 2016).
File in questo prodotto:
File Dimensione Formato  
coling2016_polysemy_final.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 240.46 kB
Formato Adobe PDF
240.46 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/212062
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 16
  • ???jsp.display-item.citation.isi??? ND
social impact