Applying tools from network science and statistical mechanics, this paper represents an interdisciplinary analysis of the phonetic organisation of the English language. By using open datasets, we build phonological networks, where nodes are the phonetic pronunciations of words and edges connect words differing by the addition, deletion, or substitution of exactly one phoneme. We present an investigation of whether the topological features of this phonological network reflect only lower or also higher order correlations in phoneme organisation. We address this question by exploring artificially constructed repertoires of words, constructing phonological networks for these repertoires, and comparing them to the network constructed from the real data. Artificial repertoires of words are built to reflect increasingly higher order statistics of the English corpus. Hence, we start with percolation-type experiments in which phonemes are sampled uniformly at random to construct words, then sample from the real phoneme frequency distribution, and finally we consider repertoires resulting from Markov processes of first, second, and third order. As expected, we find that percolation-type experiments constitute a poor null model for the real data. However, some network features, such as the relatively high assortative mixing by degree and the clustering coefficient of the English PN, can be retrieved by Markov models for word construction. Nevertheless, even Markov processes up to third order cannot fully reproduce other patterns of the empirical network, such as link densities and component sizes. We conjecture that this difference is related to the combinatorial space the real and the artificial phonological networks are embedded into and that the connectivity properties of phonological networks reflect additional patterns in word organisation in the English language which cannot be captured by lower order phoneme correlations.

Investigating the phonetic organisation of the English language via phonological networks, percolation and Markov models / Stella, M; Brede, M. - (2016), pp. 219-229. (Intervento presentato al convegno European Conference on Complex Systems tenutosi a Oxford, U.K nel 25 – 29 SEPTEMBER 2006).

Investigating the phonetic organisation of the English language via phonological networks, percolation and Markov models

Stella M
Primo
;
2016-01-01

Abstract

Applying tools from network science and statistical mechanics, this paper represents an interdisciplinary analysis of the phonetic organisation of the English language. By using open datasets, we build phonological networks, where nodes are the phonetic pronunciations of words and edges connect words differing by the addition, deletion, or substitution of exactly one phoneme. We present an investigation of whether the topological features of this phonological network reflect only lower or also higher order correlations in phoneme organisation. We address this question by exploring artificially constructed repertoires of words, constructing phonological networks for these repertoires, and comparing them to the network constructed from the real data. Artificial repertoires of words are built to reflect increasingly higher order statistics of the English corpus. Hence, we start with percolation-type experiments in which phonemes are sampled uniformly at random to construct words, then sample from the real phoneme frequency distribution, and finally we consider repertoires resulting from Markov processes of first, second, and third order. As expected, we find that percolation-type experiments constitute a poor null model for the real data. However, some network features, such as the relatively high assortative mixing by degree and the clustering coefficient of the English PN, can be retrieved by Markov models for word construction. Nevertheless, even Markov processes up to third order cannot fully reproduce other patterns of the empirical network, such as link densities and component sizes. We conjecture that this difference is related to the combinatorial space the real and the artificial phonological networks are embedded into and that the connectivity properties of phonological networks reflect additional patterns in word organisation in the English language which cannot be captured by lower order phoneme correlations.
2016
Proceedings of ECCS 2014
ITALY
ELSEVIER
Stella, M; Brede, M
Investigating the phonetic organisation of the English language via phonological networks, percolation and Markov models / Stella, M; Brede, M. - (2016), pp. 219-229. (Intervento presentato al convegno European Conference on Complex Systems tenutosi a Oxford, U.K nel 25 – 29 SEPTEMBER 2006).
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/365055
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 6
  • ???jsp.display-item.citation.isi??? 5
  • OpenAlex ND
social impact