This paper presents a method for measuring the semantic similarity of texts, using corpus-based and knowledge-based measures of similarity. Previous work on this problem has focused mainly on either large documents (e.g. text classification, information retrieval) or individual words (e.g. synonymy tests). Given that a large fraction of the information available today, on the Web and elsewhere, consists of short text snippets (e.g. abstracts of scientific documents, imagine captions, product descriptions), in this paper we focus on measuring the semantic similarity of short texts. Through experiments performed on a paraphrase data set, we show that the semantic similarity method outperforms methods based on simple lexical matching, resulting in up to 13% error rate reduction with respect to the traditional vector-based similarity metric.
Corpus-based and Knowledge-based Measures of Text Semantic Similarity / Mihalcea, R.; Corley, C.; Strapparava, C.. - (2006), pp. 775-780. ((Intervento presentato al convegno 21st conference of American Association for Artificial Intelligence (AAAI-06) tenutosi a Boston, Massachusetts, USA nel 16/07/2006 - 20/07/2006.
Scheda prodotto non validato
I dati visualizzati non sono stati ancora sottoposti a validazione formale da parte dello Staff di IRIS, ma sono stati ugualmente trasmessi al Sito Docente Cineca (Loginmiur).
Titolo: | Corpus-based and Knowledge-based Measures of Text Semantic Similarity | |
Autori: | Mihalcea, R.; Corley, C.; Strapparava, C. | |
Autori Unitn: | ||
Autore/i del libro: | - | |
Titolo del volume contenente il saggio: | 21st conference of American Association for Artificial Intelligence (AAAI-06) | |
Luogo di edizione: | USA | |
Casa editrice: | AAAI | |
Anno di pubblicazione: | 2006 | |
Codice identificativo Scopus: | 2-s2.0-33750693384 | |
Handle: | http://hdl.handle.net/11572/343698 | |
Citazione: | Corpus-based and Knowledge-based Measures of Text Semantic Similarity / Mihalcea, R.; Corley, C.; Strapparava, C.. - (2006), pp. 775-780. ((Intervento presentato al convegno 21st conference of American Association for Artificial Intelligence (AAAI-06) tenutosi a Boston, Massachusetts, USA nel 16/07/2006 - 20/07/2006. | |
Appare nelle tipologie: | 04.1 Saggio in atti di convegno (Paper in proceedings) |