In this paper we study if semantic complexity can influence the distribution of generalized quantifiers in a large English corpus derived from Wikipedia. We consider the minimal computational device recognizing a generalized quantifier as the core measure of its semantic complexity. We regard quantifiers that belong to three increasingly more com-plex classes: Aristotelian (recognizable by 2-state acyclic finite automata), counting (k þ 2-state finite automata), and proportional quantifiers (pushdown automata). Using regression analysis we show that semantic complexity is a statistically significant factor explaining 27.29% of frequency variation. We compare this impact to that of other known sources of complexity, both semantic (quantifier monotonicity and the comparative/su- perlative distinction) and superficial (e.g., the length of quantifier surface forms). In gen-eral, we observe that the more complex a quantifier, the less frequent it is.
Exploring the relation between semantic complexity and quantifier distribution in large corpora / Szymanik, Jakub Krzysztof; Thorne, Camilo. - In: LANGUAGE SCIENCES. - ISSN 0388-0001. - 60:(2017), pp. 80-93. [10.1016/j.langsci.2017.01.006]
Exploring the relation between semantic complexity and quantifier distribution in large corpora
Jakub Szymanik;
2017-01-01
Abstract
In this paper we study if semantic complexity can influence the distribution of generalized quantifiers in a large English corpus derived from Wikipedia. We consider the minimal computational device recognizing a generalized quantifier as the core measure of its semantic complexity. We regard quantifiers that belong to three increasingly more com-plex classes: Aristotelian (recognizable by 2-state acyclic finite automata), counting (k þ 2-state finite automata), and proportional quantifiers (pushdown automata). Using regression analysis we show that semantic complexity is a statistically significant factor explaining 27.29% of frequency variation. We compare this impact to that of other known sources of complexity, both semantic (quantifier monotonicity and the comparative/su- perlative distinction) and superficial (e.g., the length of quantifier surface forms). In gen-eral, we observe that the more complex a quantifier, the less frequent it is.File | Dimensione | Formato | |
---|---|---|---|
1-s2.0-S0388000117300323-main.pdf
Solo gestori archivio
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Tutti i diritti riservati (All rights reserved)
Dimensione
758.93 kB
Formato
Adobe PDF
|
758.93 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione