On the Use of an Intermediate Class in Boolean Crowdsourced Relevance Annotations for Learning to Rank Comments

Barrón-Cedeño, Alberto; Da San Martino, Giovanni; Filice, Simone; Moschitti, Alessandro

doi:10.1145/3077136.3080763

In many Information Retrieval tasks, the boundary between classes is not well defined, and assigning a document to a specific class may be difficult, even for humans. For instance, a document that is not directly related to the user's query may still contain relevant information. In this scenario, one option is to define an intermediate class that collects ambiguous instances. Yet some natural questions arise: Is this annotation strategy convenient? How should the intermediate class be treated? To answer these questions, we explored two community question answering datasets whose comments were originally annotated with three classes. We re-annotated a subset of instances in a binary good vs. bad setting. Our main contribution is to show empirically that including an intermediate class to assess Boolean relevance is not useful. Moreover, if the data is already annotated with a 3-class strategy, the instances from the intermediate class can be safely removed at training time.
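The last point of the abstract, removing intermediate-class instances before training, could be sketched roughly as follows. This is only an illustration, not the authors' code: the label strings `good`, `intermediate`, and `bad` and the helper `drop_intermediate` are hypothetical names, since the abstract does not say how the three classes are encoded.

```python
from typing import List, Tuple

# Hypothetical label names: the abstract does not specify the class encoding.
GOOD, INTERMEDIATE, BAD = "good", "intermediate", "bad"


def drop_intermediate(examples: List[Tuple[str, str]]) -> List[Tuple[str, int]]:
    """Discard intermediate-class comments and map the rest to 1 (good) / 0 (bad)."""
    binary = []
    for text, label in examples:
        if label == INTERMEDIATE:
            continue  # ambiguous instance: left out of the training set
        if label not in (GOOD, BAD):
            raise ValueError(f"unexpected label: {label!r}")
        binary.append((text, 1 if label == GOOD else 0))
    return binary
```

The resulting list of (comment, binary label) pairs could then be passed to whatever ranking model is being trained; the filtering step itself makes no assumption about the learner.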

On the Use of an Intermediate Class in Boolean Crowdsourced Relevance Annotations for Learning to Rank Comments / Barrón-Cedeño, Alberto; Da San Martino, Giovanni; Filice, Simone; Moschitti, Alessandro. - Electronic. - (2017), pp. 1209-1212. (SIGIR '17, Shinjuku, Tokyo, Japan, 7-11 August 2017) [10.1145/3077136.3080763].


2017-01-01



Date of publication	2017
Proceedings title	Proceedings of the 40th International ACM SIGIR Conference on Research and Development in Information Retrieval, Shinjuku, Tokyo, Japan, August 7-11, 2017
Book author/s	Alberto Barrón-Cedeño, Giovanni Da San Martino, Simone Filice and Alessandro Moschitti
Place of publication	New York, NY, United States
Publisher	ACM
ISBN	978-1-4503-5022-8
Scopus identifier	2-s2.0-85029364749
WOS identifier	WOS:000454711900185
All authors	Barrón-Cedeño, Alberto; Da San Martino, Giovanni; Filice, Simone; Moschitti, Alessandro
Appears in types:	04.1 Paper in Proceedings

Files in this record:

File	Access	Type	License	Size	Format
2017_SIGIR_Annotations.pdf	Open access	Refereed author's manuscript (post-print)	All rights reserved	511.24 kB	Adobe PDF
3077136.3080763.pdf	Archive managers only	Publisher's layout	All rights reserved	907.96 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/11572/195424

