Code-mixing is the alternation between two or more languages in the same text. This phenomenon is very relevant in the travel domain, since it can provide new insight in the way foreign cultures are perceived and described to the readers. In this paper, we analyse EnglishItalian code-mixing in historical English travel writings about Italy. We retrain and compare two existing systems for the automatic detection of code-mixing, and analyse the semantic categories mostly connected to Italian. Besides, we release the domain corpus used in our experiments and the output of the extraction.

A little bit of bella pianura: Detecting Code-Mixing in Historical English Travel Writing / Sprugnoli, Rachele; Tonelli, Sara; Moretti, Giovanni; Menini, Stefano. - ELETTRONICO. - (2017), pp. 304-309. (Intervento presentato al convegno CLiC-it 2017 tenutosi a Roma nel 11-12-13/12/2017).

A little bit of bella pianura: Detecting Code-Mixing in Historical English Travel Writing

Rachele Sprugnoli;Sara Tonelli;Stefano Menini
2017-01-01

Abstract

Code-mixing is the alternation between two or more languages in the same text. This phenomenon is very relevant in the travel domain, since it can provide new insight in the way foreign cultures are perceived and described to the readers. In this paper, we analyse EnglishItalian code-mixing in historical English travel writings about Italy. We retrain and compare two existing systems for the automatic detection of code-mixing, and analyse the semantic categories mostly connected to Italian. Besides, we release the domain corpus used in our experiments and the output of the extraction.
2017
Proceedings of the Fourth Italian Conference on Computational Linguistics (CLiC-it 2017)
Torino
Accademia University Press
Sprugnoli, Rachele; Tonelli, Sara; Moretti, Giovanni; Menini, Stefano
A little bit of bella pianura: Detecting Code-Mixing in Historical English Travel Writing / Sprugnoli, Rachele; Tonelli, Sara; Moretti, Giovanni; Menini, Stefano. - ELETTRONICO. - (2017), pp. 304-309. (Intervento presentato al convegno CLiC-it 2017 tenutosi a Roma nel 11-12-13/12/2017).
File in questo prodotto:
File Dimensione Formato  
CLICIT2017-53_Sprugnoli_Tonelli_Moretti_etal.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 166.45 kB
Formato Adobe PDF
166.45 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/190755
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact