Modern data-driven spoken language systems (SLS) require manual semantic annotation for training spoken language understanding parsers. Multilingual porting of SLS demands significant manual effort and language resources, as this manual annotation has to be replicated. Crowdsourcing is an accessible and cost-effective alternative to traditional methods of collecting and annotating data. The application of crowdsourcing to simple tasks has been well investigated. However, complex tasks, like cross-language semantic annotation transfer, may generate low judgment agreement and/or poor performance. The most serious issue in cross-language porting is the absence of reference annotations in the target language; thus, crowd quality control and the evaluation of the collected annotations is difficult. In this paper we investigate targeted crowdsourcing for semantic annotation transfer that delegates to crowds a complex task such as segmenting and labeling of concepts taken from a domain ontolo...
Cross-language transfer of semantic annotation via targeted crowdsourcing: task design and evaluation / Stepanov, Evgeny A.; Chowdhury, Shammur Absar; Bayer, Ali Orkan; Ghosh, Arindam; Klasinas, Ioannis; Calvo Lance, Marcos; Sanchis, Emilio; Riccardi, Giuseppe. - In: LANGUAGE RESOURCES AND EVALUATION. - ISSN 1574-020X. - 52:1(2018), pp. 341-364. [10.1007/s10579-017-9396-5]
Cross-language transfer of semantic annotation via targeted crowdsourcing: task design and evaluation
Stepanov, Evgeny A.;Chowdhury, Shammur Absar;Bayer, Ali Orkan;Ghosh, Arindam;Calvo Lance, Marcos;Riccardi, Giuseppe
2018-01-01
Abstract
Modern data-driven spoken language systems (SLS) require manual semantic annotation for training spoken language understanding parsers. Multilingual porting of SLS demands significant manual effort and language resources, as this manual annotation has to be replicated. Crowdsourcing is an accessible and cost-effective alternative to traditional methods of collecting and annotating data. The application of crowdsourcing to simple tasks has been well investigated. However, complex tasks, like cross-language semantic annotation transfer, may generate low judgment agreement and/or poor performance. The most serious issue in cross-language porting is the absence of reference annotations in the target language; thus, crowd quality control and the evaluation of the collected annotations is difficult. In this paper we investigate targeted crowdsourcing for semantic annotation transfer that delegates to crowds a complex task such as segmenting and labeling of concepts taken from a domain ontolo...I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione



