We present a new wordnet resource for Scottish Gaelic, a Celtic minority language spoken by about 60,000 speakers, most of whom live in Northwestern Scotland. The wordnet contains over 15 thousand word senses and was constructed by merging ten thousand new, high-quality translations, provided and validated by language experts, with an existing wordnet derived from Wiktionary. This new, considerably extended wordnet---currently among the 30 largest in the world---targets multiple communities: language speakers and learners; linguists; computer scientists solving problems related to natural language processing. By publishing it as a freely downloadable resource, we hope to contribute to the long-term preservation of Scottish Gaelic as a living language, both offline and on the Web.
A Major Wordnet for a Minority Language: Scottish Gaelic / Bella, Gábor; Mcneill, Fiona; Gorman, Rody; Ó Donnaíle, Caoimhin; Macdonald, Kirsty; Chandrashekar, Yamini; Freihat, Abed Alhakim; Giunchiglia, Fausto. - (2020), pp. 2812-2818. (Intervento presentato al convegno 12th International Conference on Language Resources and Evaluation, LREC 2020 tenutosi a Marseille, France nel 11th-16th May 2020).
A Major Wordnet for a Minority Language: Scottish Gaelic
Bella, Gábor;Chandrashekar, Yamini;Freihat, Abed Alhakim;Giunchiglia, Fausto
2020-01-01
Abstract
We present a new wordnet resource for Scottish Gaelic, a Celtic minority language spoken by about 60,000 speakers, most of whom live in Northwestern Scotland. The wordnet contains over 15 thousand word senses and was constructed by merging ten thousand new, high-quality translations, provided and validated by language experts, with an existing wordnet derived from Wiktionary. This new, considerably extended wordnet---currently among the 30 largest in the world---targets multiple communities: language speakers and learners; linguists; computer scientists solving problems related to natural language processing. By publishing it as a freely downloadable resource, we hope to contribute to the long-term preservation of Scottish Gaelic as a living language, both offline and on the Web.File | Dimensione | Formato | |
---|---|---|---|
2020.lrec-1.342.pdf
accesso aperto
Tipologia:
Versione editoriale (Publisher’s layout)
Licenza:
Creative commons
Dimensione
205.44 kB
Formato
Adobe PDF
|
205.44 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione