An analysis of crowd workers mistakes for specific and complex relevance assessment task
J. Anderton, Maryam Bashir, Virgil Pavlu, J. Aslam
Proceedings of the 22nd ACM International Conference on Information & Knowledge Management (CIKM 2013)
Published: October 27, 2013 · DOI: 10.1145/2505515.2507884 · Citations: 8
Abstract
The TREC 2012 Crowdsourcing track asked participants to crowdsource relevance assessments with the goal of replicating costly expert judgments with relatively fast, inexpensive, but less reliable judgments from anonymous online workers. The track used 10 "ad-hoc" queries that are highly specific and complex (as compared to web search). The crowdsourced assessments were evaluated against expert judgments made by highly trained and capable human analysts in 1999 as part of the ad hoc track collection construction. Since most crowdsourcing approaches submitted to the TREC 2012 track produced assessment sets that were nowhere close to the expert judgments, we decided to analyze the crowdsourcing mistakes made on this task using data we collected via Amazon's Mechanical Turk service. We investigate two types of crowdsourcing approaches: one that asks for a nominal relevance grade for each document, and another that asks for preferences on many (but not all) pairs of documents.
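As background for the two elicitation styles mentioned in the abstract, the sketch below illustrates one plausible way to aggregate each kind of raw worker input: a majority vote over per-document nominal grades, and a simple win count over pairwise preferences. The example data, function names, and aggregation rules are illustrative assumptions only, not the methods used in the paper or in the TREC 2012 track.

```python
from collections import Counter, defaultdict

# Hypothetical worker input for one query; document IDs and grades are made up.
# Approach 1: each worker assigns a nominal relevance grade to each document.
nominal_judgments = {
    "doc1": ["relevant", "relevant", "non-relevant"],
    "doc2": ["non-relevant", "non-relevant", "non-relevant"],
}

# Approach 2: each worker states a preference on a pair of documents,
# recorded here as (preferred, other); not every pair is judged.
pairwise_preferences = [
    ("doc1", "doc2"),
    ("doc1", "doc2"),
    ("doc2", "doc1"),
]

def majority_grade(grades):
    """Return the most common grade assigned by workers to one document."""
    return Counter(grades).most_common(1)[0][0]

def rank_by_wins(preferences):
    """Order documents by how often workers preferred them (win count)."""
    wins = defaultdict(int)
    for preferred, other in preferences:
        wins[preferred] += 1
        wins.setdefault(other, 0)  # make sure non-preferred documents still appear
    return sorted(wins, key=wins.get, reverse=True)

if __name__ == "__main__":
    for doc, grades in nominal_judgments.items():
        print(doc, "->", majority_grade(grades))
    print("preference ranking:", rank_by_wins(pairwise_preferences))
```

With the toy input above, the first approach yields a consensus grade per document, while the second yields only a relative ordering; the paper's analysis concerns the kinds of mistakes workers make under each style of question.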