You and me... in a vector space: modelling individual speakers with distributional semantics

Herbelot, Aurelie; Qasemizadeh, Behrang

The linguistic experiences of a person are an important part of their individuality. In this paper, we show that people can be modelled as vectors in a semantic space, using their personal interaction with specific language data. We also demonstrate that these vectors can be taken as representative of ‘the kind of person’ they are. We build over 4000 speaker-dependent subcorpora using logs of Wikipedia edits, which are then used to build distributional vectors that represent individual speakers. We show that such ‘person vectors’ are informative to others, and they influence basic patterns of communication like the choice of one’s interlocutor in conversation. Tested on an informationseeking scenario, where natural language questions must be answered by addressing the most relevant individuals in a community, our system outperforms a standard information retrieval algorithm by a considerable margin.

You and me... in a vector space: modelling individual speakers with distributional semantics / Herbelot, A., Qasemizadeh, B.. - (2016), pp. 179-188. (*SEM 2016 Berlin, Germany 11-12 August 2016).

You and me... in a vector space: modelling individual speakers with distributional semantics

Herbelot, Aurelie;QasemiZadeh, Behrang

2016-01-01

Abstract

The linguistic experiences of a person are an important part of their individuality. In this paper, we show that people can be modelled as vectors in a semantic space, using their personal interaction with specific language data. We also demonstrate that these vectors can be taken as representative of ‘the kind of person’ they are. We build over 4000 speaker-dependent subcorpora using logs of Wikipedia edits, which are then used to build distributional vectors that represent individual speakers. We show that such ‘person vectors’ are informative to others, and they influence basic patterns of communication like the choice of one’s interlocutor in conversation. Tested on an informationseeking scenario, where natural language questions must be answered by addressing the most relevant individuals in a community, our system outperforms a standard information retrieval algorithm by a considerable margin.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno di pubblicazione (Date of publication)
	
				2016
			
	Titolo del volume (Proceedings title)
	
				Proceedings of the Fifth Joint Conference on Lexical and Computational Semantics (*SEM2016)
			
	Luogo di edizione (Place of publication)
	
				Stroudsburg, PA, USA
			
	Casa editrice (Publisher)
	
				Association for Computational Linguistics
			
	ISBN
	
				978-1-941643-92-1
			
	Codice Scopus (Scopus Identifier)
	
				2-s2.0-85036477069
			
	Tutti gli autori
	
						Herbelot, Aurelie; Qasemizadeh, Behrang
					
	Citazione
	
				You and me... in a vector space: modelling individual speakers with distributional semantics / Herbelot, A., Qasemizadeh, B.. - (2016), pp. 179-188. (*SEM 2016 Berlin, Germany 11-12 August 2016).
			
	Appare nelle tipologie:
	
				04.1 Saggio in atti di convegno (Paper in Proceedings)

File in questo prodotto:

File	Dimensione	Formato
starsem2016_speakers_in_space.pdf accesso aperto Tipologia: Versione editoriale (Publisher’s layout) Licenza: Creative commons Dimensione 251.1 kB Formato Adobe PDF Visualizza/Apri	251.1 kB	Adobe PDF	Visualizza/Apri