Hunting for Direct Translations across Wikipedia Articles / Duc Manh Hoang; Ronchetti, Marco. - Print. - 1 (2017), pp. 31-36. (Paper presented at the SEMAPRO 2017 conference, held in Barcelona, Spain, 12-16 November 2017).
Hunting for Direct Translations across Wikipedia Articles
Ronchetti
2017-01-01
Abstract
This paper addresses a problem in Cross-Language Plagiarism Detection. It presents a system that detects portions of a Wikipedia page obtained by translating a Wikipedia page on the same topic written in a different language. The problem is relevant to the maintenance of Wikipedia pages and may be of interest in other areas, such as comparing news across languages. We discuss the problem, the system and its implementation, and briefly present its evaluation.

File | Description | Type | License | Access | Size | Format
---|---|---|---|---|---|---
semapro_2017_2_20_30020.pdf | Article | Publisher's layout | All rights reserved | Archive managers only | 315.81 kB | Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.