This article presents SICK (Sentences Involving Compositional Knowledge), a large size English benchmark created to evaluate compositional distributional semantic models. SICK consists of about 10,000 English sentence pairs that include examples of the lexical, syntactic and semantic phenomena that distributional models are expected to account for, but do not require dealing with other aspects of existing sentential datasets (e.g. idiomatic multiword expressions, named entities, telegraphic language). Each sentence pair was annotated for two crucial semantic tasks: relatedness in meaning and entailment relation between the two sentences composing the pair. SICK was used in the SemEval-2014 Shared Task, and is freely available for research purposes.

The SICK Dataset / Bentivogli, L., Menini, S., Zamparelli, R.. - STAMPA. - (2026), pp. 657-661. [10.1016/B978-0-323-95504-1.00595-0]

The SICK Dataset

Menini Stefano;Zamparelli Roberto
2026-01-01

Abstract

This article presents SICK (Sentences Involving Compositional Knowledge), a large size English benchmark created to evaluate compositional distributional semantic models. SICK consists of about 10,000 English sentence pairs that include examples of the lexical, syntactic and semantic phenomena that distributional models are expected to account for, but do not require dealing with other aspects of existing sentential datasets (e.g. idiomatic multiword expressions, named entities, telegraphic language). Each sentence pair was annotated for two crucial semantic tasks: relatedness in meaning and entailment relation between the two sentences composing the pair. SICK was used in the SemEval-2014 Shared Task, and is freely available for research purposes.
2026
International Encyclopedia of Language and Linguistics, 3e (LAL3)
International Encyclopedia of Language and Linguistics, 3e.
Elzevier
9780323955041
Settore L-LIN/01 - Glottologia e Linguistica
Settore GLOT-01/A - Glottologia e linguistica
Bentivogli, Luisa; Menini, Stefano; Zamparelli, Roberto
The SICK Dataset / Bentivogli, L., Menini, S., Zamparelli, R.. - STAMPA. - (2026), pp. 657-661. [10.1016/B978-0-323-95504-1.00595-0]
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/492972
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
  • OpenAlex ND
social impact