Reliability of User-Generated Data: the Case of Biographical Data in Wikipedia

R. Viseur
{"title":"Reliability of User-Generated Data: the Case of Biographical Data in Wikipedia","authors":"R. Viseur","doi":"10.1145/2641580.2641618","DOIUrl":null,"url":null,"abstract":"Wikipedia is a collaborative multilingual encyclopedia launched in 2001. We already conducted a first research on the extraction of biographical data about personalities from Belgium in order to build a large database with biographical data. However, the question of the reliability of the data arises. In particular, in the case of Wikipedia, the data are generated by users and could be subject to errors. In consequence, we wanted to answer to the following question: are the data introduced in Wikipedia articles reliable? Our research is organized in three sections. The first section provides a brief state of the art about the reliability of the user-generated data. A second section presents the methodology of our research. A third section will present the results. The error rates that were measured for the birthdate is low (0.75%), although it is higher than the 0.21% score that we observed for the baseline (reference sources). In a fourth section, the results are discussed.","PeriodicalId":447989,"journal":{"name":"Proceedings of The International Symposium on Open Collaboration","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-08-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of The International Symposium on Open Collaboration","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2641580.2641618","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

Wikipedia is a collaborative multilingual encyclopedia launched in 2001. We already conducted a first research on the extraction of biographical data about personalities from Belgium in order to build a large database with biographical data. However, the question of the reliability of the data arises. In particular, in the case of Wikipedia, the data are generated by users and could be subject to errors. In consequence, we wanted to answer to the following question: are the data introduced in Wikipedia articles reliable? Our research is organized in three sections. The first section provides a brief state of the art about the reliability of the user-generated data. A second section presents the methodology of our research. A third section will present the results. The error rates that were measured for the birthdate is low (0.75%), although it is higher than the 0.21% score that we observed for the baseline (reference sources). In a fourth section, the results are discussed.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用户生成数据的可靠性:维基百科传记数据的案例
维基百科是2001年推出的多语种协作百科全书。我们已经进行了第一次研究,从比利时提取人物传记数据,以便建立一个大型的传记数据数据库。然而,数据的可靠性问题出现了。特别是在维基百科的情况下,数据是由用户生成的,可能会有错误。因此,我们想回答以下问题:维基百科文章中引入的数据可靠吗?我们的研究分为三个部分。第一部分简要介绍了用户生成数据的可靠性。第二部分介绍了我们的研究方法。第三部分将展示结果。出生日期测量的错误率很低(0.75%),尽管它高于我们在基线(参考资料来源)观察到的0.21%的分数。第四部分对结果进行了讨论。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Contropedia - the analysis and visualization of controversies in Wikipedia articles Supporting awareness of content-related controversies in a Wiki-based learning environment What do Chinese-language microblog users do with Baidu Baike and Chinese Wikipedia? A case study of information engagement Chinese-language literature about Wikipedia: a meta-analysis of academic search engine result pages Volunteer Attraction and Retention in Open Source Communities
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1