We present the CIC-FBK system, which took part in the Native Language Identification (NLI) Shared Task 2017. Our approach combines features commonly used in previous NLI research (word n-grams, lemma n-grams, part-of-speech n-grams, and function words) with recently introduced character n-grams from misspelled words and with features that are novel in this task, such as typed character n-grams and syntactic n-grams of words and of syntactic relation tags. We use the log-entropy weighting scheme and perform classification using the Support Vector Machines (SVM) algorithm. Our system achieved a macro-averaged F1-score of 0.8808 and shared first rank in the NLI Shared Task 2017 scoring.
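To illustrate the log-entropy weighting named in the abstract, here is a minimal sketch of the commonly cited textbook definition: a local weight log(1 + tf) multiplied by a global weight 1 + Σ p·log(p) / log(N), where p is a term's share of its total frequency in each document and N is the number of documents. The abstract does not spell out the formula, so this variant, the function name `log_entropy_weights`, and the toy input are assumptions, not the authors' implementation.

```python
import math
from collections import Counter


def log_entropy_weights(docs):
    """Hypothetical helper: docs is a list of token lists.

    Returns one {term: weight} dict per document, using the textbook
    log-entropy scheme (assumed here; the source does not give a formula).
    """
    n_docs = len(docs)
    counts = [Counter(doc) for doc in docs]

    # Total frequency of each term across the whole collection.
    global_freq = Counter()
    for c in counts:
        global_freq.update(c)

    # Normaliser log(N); fall back to 1.0 for a single document to avoid log(1) = 0.
    norm = math.log(n_docs) if n_docs > 1 else 1.0

    # Global weight: 1 + sum_j p_ij * log(p_ij) / log(N).
    global_weight = {}
    for term, gf in global_freq.items():
        entropy_sum = 0.0
        for c in counts:
            tf = c.get(term, 0)
            if tf:
                p = tf / gf
                entropy_sum += p * math.log(p)
        global_weight[term] = 1.0 + entropy_sum / norm

    # Final weight: global weight times the local weight log(1 + tf).
    return [
        {term: global_weight[term] * math.log(1 + tf) for term, tf in c.items()}
        for c in counts
    ]


if __name__ == "__main__":
    toy_docs = [["a", "b"], ["a", "c"]]  # illustrative tokens only
    for weights in log_entropy_weights(toy_docs):
        print(weights)
```

Under this definition, a feature spread evenly over all documents receives a global weight of 0, while a feature confined to a single document keeps its full local weight, which is why the scheme downweights features that do not discriminate between texts.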
CIC-FBK Approach to Native Language Identification / Markov, Ilia; Chen, Lingzhen; Strapparava, Carlo; Sidorov, Grigori. - (2017), pp. 374-381. (Paper presented at the 12th Workshop on Innovative Use of NLP for Building Educational Applications, held in Copenhagen, Denmark, in September) [10.18653/v1/W17-5042].
Unvalidated product record
The data shown have not yet undergone formal validation by the IRIS staff, but have nevertheless been transmitted to the Cineca Sito Docente (Loginmiur).
Title: | CIC-FBK Approach to Native Language Identification | |
Authors: | Markov, Ilia; Chen, Lingzhen; Strapparava, Carlo; Sidorov, Grigori | |
Unitn authors: | ||
Title of the volume containing the paper: | Proceedings of the 12th Workshop on Innovative Use of NLP for Building Educational Applications | |
Place of publication: | USA | |
Publisher: | Association for Computational Linguistics | |
Year of publication: | 2017 | |
Scopus identifier: | 2-s2.0-85096916226 | |
ISBN: | 978-1-945626-85-2 | |
Handle: | http://hdl.handle.net/11572/343173 | |
Appears in types: | 04.1 Paper in proceedings |