In this paper we investigate the role of user emotions in human-machine goal-oriented conversations. There has been a growing interest in predicting emotions from acted and non-acted spontaneous speech. Much of the research work has gone in determining what are the correct labels and improving emotion prediction accuracy. In this paper we evaluate the value of user emotional state towards a computational model of emotion processing. We consider a binary representation of emotions (positive vs. negative) in the context of a goal-driven conversational system. For each human-machine interaction we acquire the temporal emotion sequence going from the initial to the final conversational state. These traces are used as features to characterize the user state dynamics. We ground the emotion traces by associating its patterns to dialog strategies and their effectiveness. In order to quantify the value of emotion indicators, we evaluate their predictions in terms of speech recognition and spoken language understanding errors as well as task success or failure. We report results on the 11.5K dialog corpus samples from the How may I Help You? corpus.
Scheda prodotto non validato
I dati visualizzati non sono stati ancora sottoposti a validazione formale da parte dello Staff di IRIS, ma sono stati ugualmente trasmessi al Sito Docente Cineca (Loginmiur).
|Titolo:||Global Features for Shallow Discourse Parsing|
|Autori:||Ghosh, Sucheta; Riccardi, Giuseppe; R., Johansson|
|Titolo del volume contenente il saggio:||Proc. of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)|
|Luogo di edizione:||Seoul, Republic of Korea|
|Casa editrice:||Association for Computational Linguistics Stroudsburg, PA, USA ©2012|
|Anno di pubblicazione:||2012|
|Appare nelle tipologie:||04.1 Saggio in atti di convegno (Paper in proceedings)|