A SICK cure for the evaluation of compositional distributional semantic models

IRIS

Shared and internationally recognized benchmarks are fundamental for the development of any computational system. We aim to help the research community working on compositional distributional semantic models (CDSMs) by providing SICK (Sentences Involving Compositional Knowledge), a large-size English benchmark tailored for them. SICK consists of about 10,000 English sentence pairs that include many examples of the lexical, syntactic and semantic phenomena that CDSMs are expected to account for, but do not require dealing with other aspects of existing sentential data sets (idiomatic multiword expressions, named entities, telegraphic language) that are not within the scope of CDSMs. By means of crowdsourcing techniques, each pair was annotated for two crucial semantic tasks: relatedness in meaning (with a 5-point rating scale as gold score) and entailment relation between the two elements (with three possible gold labels: entailment, contradiction, and neutral). The SICK data set was used in SemEval-2014 Task 1, and it is freely available for research purposes.

A SICK cure for the evaluation of compositional distributional semantic models / Marelli, M., Menini, S., Baroni, M., Bentivogli, L., Bernardi, R., Zamparelli, R. - (2014), pp. 216-223. (9th International Conference on Language Resources and Evaluation, LREC 2014, Reykjavik (Iceland), 26-31 May).

Marelli, Marco; Menini, Stefano; Baroni, Marco; Bentivogli, L.; Bernardi, Raffaella; Zamparelli, Roberto

2014-01-01

Date of publication: 2014
Proceedings title: Proceedings of LREC 2014
Place of publication: 55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE
Publisher: European Language Resources Association (ELRA)
ISBN: 9782951740884
Scopus identifier: 2-s2.0-84974839320
WOS identifier: WOS:000355611001141
All authors: Marelli, Marco; Menini, Stefano; Baroni, Marco; Bentivogli, L.; Bernardi, Raffaella; Zamparelli, Roberto
Appears in categories: 04.1 Paper in Proceedings

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/98428

Warning: the data displayed have not been validated by the university.
