This paper explores and offers guidance on a specific and relevant problem in task design for crowdsourcing: how to formulate a complex question used to classify a set of items. In micro-task markets, classification is still among the most popular tasks. We situate our work in the context of information retrieval and multi-predicate classification, i.e., classifying a set of items based on a set of conditions. Our experiments cover a wide range of tasks and domains, and consider crowd workers both alone and in tandem with machine learning classifiers. We provide empirical evidence of how different predicate formulation strategies affect the resulting classification performance, emphasizing the importance of predicate formulation as a task design dimension in crowdsourcing.
On the Impact of Predicate Complexity in Crowdsourced Classification Tasks / Ramírez, Jorge; Baez, Marcos; Casati, Fabio; Cernuzzi, Luca; Benatallah, Boualem; Taran, Ekaterina A.; Malanina, Veronika A. - (2021), pp. 67-75. (Paper presented at the International Conference on Web Search and Data Mining, held as a Virtual Event, Israel, in March 2021) [10.1145/3437963.3441831].
On the Impact of Predicate Complexity in Crowdsourced Classification Tasks
Ramírez, Jorge; Casati, Fabio
2021-01-01