The linguistic experiences of a person are an important part of their individuality. In this paper, we show that people can be modelled as vectors in a semantic space, using their personal interaction with specific language data. We also demonstrate that these vectors can be taken as representative of ‘the kind of person’ they are. We build over 4000 speaker-dependent subcorpora using logs of Wikipedia edits, which are then used to build distributional vectors that represent individual speakers. We show that such ‘person vectors’ are informative to others, and they influence basic patterns of communication like the choice of one’s interlocutor in conversation. Tested on an informationseeking scenario, where natural language questions must be answered by addressing the most relevant individuals in a community, our system outperforms a standard information retrieval algorithm by a considerable margin.

You and me... in a vector space: modelling individual speakers with distributional semantics / Herbelot, Aurelie; Qasemizadeh, Behrang. - (2016), pp. 179-188. (Intervento presentato al convegno *SEM 2016 tenutosi a Berlin, Germany nel 11-12 August 2016).

You and me... in a vector space: modelling individual speakers with distributional semantics

Herbelot, Aurelie;
2016-01-01

Abstract

The linguistic experiences of a person are an important part of their individuality. In this paper, we show that people can be modelled as vectors in a semantic space, using their personal interaction with specific language data. We also demonstrate that these vectors can be taken as representative of ‘the kind of person’ they are. We build over 4000 speaker-dependent subcorpora using logs of Wikipedia edits, which are then used to build distributional vectors that represent individual speakers. We show that such ‘person vectors’ are informative to others, and they influence basic patterns of communication like the choice of one’s interlocutor in conversation. Tested on an informationseeking scenario, where natural language questions must be answered by addressing the most relevant individuals in a community, our system outperforms a standard information retrieval algorithm by a considerable margin.
2016
Proceedings of the Fifth Joint Conference on Lexical and Computational Semantics (*SEM2016)
Stroudsburg, PA, USA
Association for Computational Linguistics
978-1-941643-92-1
Herbelot, Aurelie; Qasemizadeh, Behrang
You and me... in a vector space: modelling individual speakers with distributional semantics / Herbelot, Aurelie; Qasemizadeh, Behrang. - (2016), pp. 179-188. (Intervento presentato al convegno *SEM 2016 tenutosi a Berlin, Germany nel 11-12 August 2016).
File in questo prodotto:
File Dimensione Formato  
starsem2016_speakers_in_space.pdf

accesso aperto

Tipologia: Versione editoriale (Publisher’s layout)
Licenza: Creative commons
Dimensione 251.1 kB
Formato Adobe PDF
251.1 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11572/212245
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact