利用响应模式来识别StackOverflow中的专题专家

M. Bhanu, Joydeep Chandra
{"title":"利用响应模式来识别StackOverflow中的专题专家","authors":"M. Bhanu, Joydeep Chandra","doi":"10.1109/ICDIM.2016.7829790","DOIUrl":null,"url":null,"abstract":"The popularity of community question answer (CQA) forums like Stack Overflow, Yahoo Answers and Quora is increasing tremendously with thousands of questions being posted each day and about thrice the number of responses being provided. With such query explosion, users participating in these forums receive a huge number of postings that adversely affects their responsiveness and also the quality of the responses. Hence, identifying topical experts is necessary to improve the efficacy of these systems in terms of both response time and quality. Although expert detection in CQA forums has traditionally been a topic of wide interest, however, many of the proposed techniques use features set that reflect the popularity of the responses of the responder rather than the difficulty level of the questions being responded. In this paper we provide measures of labeling difficult questions and use the number of difficult questions responded by a user combined with other user interaction parameters in identifying potential topical experts. Using a random forest classifier with the proposed feature set on Stack Overflow data, we obtain an improvement in accuracy of 5–16% over existing techniques, in detecting topical experts.","PeriodicalId":146662,"journal":{"name":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Exploiting response patterns for identifying topical experts in StackOverflow\",\"authors\":\"M. Bhanu, Joydeep Chandra\",\"doi\":\"10.1109/ICDIM.2016.7829790\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The popularity of community question answer (CQA) forums like Stack Overflow, Yahoo Answers and Quora is increasing tremendously with thousands of questions being posted each day and about thrice the number of responses being provided. With such query explosion, users participating in these forums receive a huge number of postings that adversely affects their responsiveness and also the quality of the responses. Hence, identifying topical experts is necessary to improve the efficacy of these systems in terms of both response time and quality. Although expert detection in CQA forums has traditionally been a topic of wide interest, however, many of the proposed techniques use features set that reflect the popularity of the responses of the responder rather than the difficulty level of the questions being responded. In this paper we provide measures of labeling difficult questions and use the number of difficult questions responded by a user combined with other user interaction parameters in identifying potential topical experts. Using a random forest classifier with the proposed feature set on Stack Overflow data, we obtain an improvement in accuracy of 5–16% over existing techniques, in detecting topical experts.\",\"PeriodicalId\":146662,\"journal\":{\"name\":\"2016 Eleventh International Conference on Digital Information Management (ICDIM)\",\"volume\":\"10 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 Eleventh International Conference on Digital Information Management (ICDIM)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICDIM.2016.7829790\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Eleventh International Conference on Digital Information Management (ICDIM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDIM.2016.7829790","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

像Stack Overflow, Yahoo Answers和Quora这样的社区问答(CQA)论坛的受欢迎程度正在急剧增加,每天有数千个问题被发布,并且提供了大约三倍的回答。随着查询的爆炸式增长,参与这些论坛的用户会收到大量的帖子,这对他们的响应能力和回复质量产生了不利影响。因此,确定专题专家对于提高这些系统在响应时间和质量方面的有效性是必要的。尽管CQA论坛中的专家检测传统上是一个广受关注的话题,但是,许多提议的技术使用的特征集反映了应答者回答的受欢迎程度,而不是被应答问题的难度级别。在本文中,我们提供了标记难题的措施,并使用用户回答的难题数量与其他用户交互参数相结合来识别潜在的专题专家。在Stack Overflow数据上使用具有所提出特征集的随机森林分类器,在检测主题专家方面,我们比现有技术的准确率提高了5-16%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Exploiting response patterns for identifying topical experts in StackOverflow
The popularity of community question answer (CQA) forums like Stack Overflow, Yahoo Answers and Quora is increasing tremendously with thousands of questions being posted each day and about thrice the number of responses being provided. With such query explosion, users participating in these forums receive a huge number of postings that adversely affects their responsiveness and also the quality of the responses. Hence, identifying topical experts is necessary to improve the efficacy of these systems in terms of both response time and quality. Although expert detection in CQA forums has traditionally been a topic of wide interest, however, many of the proposed techniques use features set that reflect the popularity of the responses of the responder rather than the difficulty level of the questions being responded. In this paper we provide measures of labeling difficult questions and use the number of difficult questions responded by a user combined with other user interaction parameters in identifying potential topical experts. Using a random forest classifier with the proposed feature set on Stack Overflow data, we obtain an improvement in accuracy of 5–16% over existing techniques, in detecting topical experts.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
The DEWI high-level architecture: Wireless sensor networks in industrial applications Wireless avionics intra-communications: Current trends and design issues Enabling OLAP analyses on the web of data Adding quality in the user requirements specification: A first approach Using rate equation for modeling triad dynamics on Instagram
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1