In this paper we study if semantic complexity can influence the distribution of generalized quantifiers in a large English corpus derived from Wikipedia. We consider the minimal computational device recognizing a generalized quantifier as the core measure of its semantic complexity. We regard quantifiers that belong to three increasingly more com-plex classes: Aristotelian (recognizable by 2-state acyclic finite automata), counting (k þ 2-state finite automata), and proportional quantifiers (pushdown automata). Using regression analysis we show that semantic complexity is a statistically significant factor explaining 27.29% of frequency variation. We compare this impact to that of other known sources of complexity, both semantic (quantifier monotonicity and the comparative/su- perlative distinction) and superficial (e.g., the length of quantifier surface forms). In gen-eral, we observe that the more complex a quantifier, the less frequent it is.

Exploring the relation between semantic complexity and quantifier distribution in large corpora / Szymanik, Jakub Krzysztof; Thorne, Camilo. - In: LANGUAGE SCIENCES. - ISSN 0388-0001. - 60:(2017), pp. 80-93. [10.1016/j.langsci.2017.01.006]

Exploring the relation between semantic complexity and quantifier distribution in large corpora

Jakub Szymanik;
2017-01-01

Abstract

In this paper we study if semantic complexity can influence the distribution of generalized quantifiers in a large English corpus derived from Wikipedia. We consider the minimal computational device recognizing a generalized quantifier as the core measure of its semantic complexity. We regard quantifiers that belong to three increasingly more com-plex classes: Aristotelian (recognizable by 2-state acyclic finite automata), counting (k þ 2-state finite automata), and proportional quantifiers (pushdown automata). Using regression analysis we show that semantic complexity is a statistically significant factor explaining 27.29% of frequency variation. We compare this impact to that of other known sources of complexity, both semantic (quantifier monotonicity and the comparative/su- perlative distinction) and superficial (e.g., the length of quantifier surface forms). In gen-eral, we observe that the more complex a quantifier, the less frequent it is.
2017
Szymanik, Jakub Krzysztof; Thorne, Camilo
Exploring the relation between semantic complexity and quantifier distribution in large corpora / Szymanik, Jakub Krzysztof; Thorne, Camilo. - In: LANGUAGE SCIENCES. - ISSN 0388-0001. - 60:(2017), pp. 80-93. [10.1016/j.langsci.2017.01.006]
File in questo prodotto:
File Dimensione Formato  
1-s2.0-S0388000117300323-main.pdf

Solo gestori archivio

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Tutti i diritti riservati (All rights reserved)
Dimensione 758.93 kB
Formato Adobe PDF
758.93 kB Adobe PDF   Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/364286
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 6
  • OpenAlex ND
social impact