This paper presents a second release of the ARRAU dataset: a multi-domain corpus with thorough linguistically motivated annotation of anaphora and related phenomena. Building upon the first release almost a decade ago, a considerable effort had been invested in improving the data both quantitatively and qualitatively. Thus, we have doubled the corpus size, expanded the selection of covered phenomena to include referentiality and genericity and designed and implemented a methodology for enforcing the consistency of the manual annotation. We believe that the new release of ARRAU provides a valuable material for ongoing research in complex cases of coreference as well as for a variety of related tasks. The corpus is publicly available through LDC.

ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions / Uryupina, O; Artstein, R; Bristot, A; Cavicchio, F; Rodriguez, Kj; Poesio, M. - STAMPA. - (2016), pp. 2058-2062. ((Intervento presentato al convegno LREC tenutosi a Portorozh nel 23-28 May 2016.

ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions

Uryupina, O;Bristot, A;Cavicchio, F;Rodriguez, KJ;Poesio, M
2016

Abstract

This paper presents a second release of the ARRAU dataset: a multi-domain corpus with thorough linguistically motivated annotation of anaphora and related phenomena. Building upon the first release almost a decade ago, a considerable effort had been invested in improving the data both quantitatively and qualitatively. Thus, we have doubled the corpus size, expanded the selection of covered phenomena to include referentiality and genericity and designed and implemented a methodology for enforcing the consistency of the manual annotation. We believe that the new release of ARRAU provides a valuable material for ongoing research in complex cases of coreference as well as for a variety of related tasks. The corpus is publicly available through LDC.
Proceedings of the Tenth International Conference on Language Resources and Evaluation
55-57, RUE BRILLAT-SAVARIN, PARIS, 75013, FRANCE
EUROPEAN LANGUAGE RESOURCES ASSOC-ELRA
Uryupina, O; Artstein, R; Bristot, A; Cavicchio, F; Rodriguez, Kj; Poesio, M
ARRAU: Linguistically-Motivated Annotation of Anaphoric Descriptions / Uryupina, O; Artstein, R; Bristot, A; Cavicchio, F; Rodriguez, Kj; Poesio, M. - STAMPA. - (2016), pp. 2058-2062. ((Intervento presentato al convegno LREC tenutosi a Portorozh nel 23-28 May 2016.
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: http://hdl.handle.net/11572/295813
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 7
  • ???jsp.display-item.citation.isi??? 2
social impact