基于堆栈溢出的群体知识增强软件工程研究的系统映射研究

IF 4.1 2区 计算机科学 Q1 COMPUTER SCIENCE, SOFTWARE ENGINEERING Journal of Systems and Software Pub Date : 2025-08-01 Epub Date: 2025-03-03 DOI:10.1016/j.jss.2025.112405
Minaoar Hossain Tanzil , Shaiful Chowdhury , Somayeh Modaberi , Gias Uddin , Hadi Hemmati
{"title":"基于堆栈溢出的群体知识增强软件工程研究的系统映射研究","authors":"Minaoar Hossain Tanzil ,&nbsp;Shaiful Chowdhury ,&nbsp;Somayeh Modaberi ,&nbsp;Gias Uddin ,&nbsp;Hadi Hemmati","doi":"10.1016/j.jss.2025.112405","DOIUrl":null,"url":null,"abstract":"<div><div>Developers continuously interact in crowd-sourced community-based question-answer (Q&amp;A) sites. Reportedly, <span><math><mo>∼</mo></math></span>30% of all software professionals visit the most popular Q&amp;A site StackOverflow (SO) every day. Software engineering (SE) research studies are also increasingly using SO data. To find out the trend, implication, impact, and future research potential utilizing SO data, a systematic mapping study needs to be conducted. Following a rigorous reproducible mapping study approach, from 18 reputed SE journals and conferences, we collected 384 SO-based research articles and categorized them into 10 facets (i.e., themes). We found that SO contributes to 85% of SE research compared with popular Q&amp;A sites such as Quora, and Reddit. We found that 18 SE domains directly benefited from SO data whereas <em>Recommender Systems</em>, and <em>API Design and Evolution</em> domains use SO data the most (15% and 16% of all SO-based research studies, respectively). <em>API Design and Evolution</em>, and <em>Machine Learning with/for SE</em> domains have consistent upward publication. <em>Deep Learning Bug Analysis</em> and <em>Code Cloning</em> research areas have the highest potential research impact recently. With the insights, recommendations, and facet-based categorized paper list from this mapping study, SE researchers can find out potential research areas according to their interest to utilize large-scale SO data.</div></div>","PeriodicalId":51099,"journal":{"name":"Journal of Systems and Software","volume":"226 ","pages":"Article 112405"},"PeriodicalIF":4.1000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A systematic mapping study of crowd knowledge enhanced software engineering research using Stack Overflow\",\"authors\":\"Minaoar Hossain Tanzil ,&nbsp;Shaiful Chowdhury ,&nbsp;Somayeh Modaberi ,&nbsp;Gias Uddin ,&nbsp;Hadi Hemmati\",\"doi\":\"10.1016/j.jss.2025.112405\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Developers continuously interact in crowd-sourced community-based question-answer (Q&amp;A) sites. Reportedly, <span><math><mo>∼</mo></math></span>30% of all software professionals visit the most popular Q&amp;A site StackOverflow (SO) every day. Software engineering (SE) research studies are also increasingly using SO data. To find out the trend, implication, impact, and future research potential utilizing SO data, a systematic mapping study needs to be conducted. Following a rigorous reproducible mapping study approach, from 18 reputed SE journals and conferences, we collected 384 SO-based research articles and categorized them into 10 facets (i.e., themes). We found that SO contributes to 85% of SE research compared with popular Q&amp;A sites such as Quora, and Reddit. We found that 18 SE domains directly benefited from SO data whereas <em>Recommender Systems</em>, and <em>API Design and Evolution</em> domains use SO data the most (15% and 16% of all SO-based research studies, respectively). <em>API Design and Evolution</em>, and <em>Machine Learning with/for SE</em> domains have consistent upward publication. <em>Deep Learning Bug Analysis</em> and <em>Code Cloning</em> research areas have the highest potential research impact recently. With the insights, recommendations, and facet-based categorized paper list from this mapping study, SE researchers can find out potential research areas according to their interest to utilize large-scale SO data.</div></div>\",\"PeriodicalId\":51099,\"journal\":{\"name\":\"Journal of Systems and Software\",\"volume\":\"226 \",\"pages\":\"Article 112405\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Systems and Software\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0164121225000731\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/3 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, SOFTWARE ENGINEERING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Systems and Software","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0164121225000731","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/3 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}
引用次数: 0

摘要

开发人员在基于社区的众包问答网站上持续互动。据报道,大约30%的软件专业人士每天都会访问最受欢迎的问答网站StackOverflow (SO)。软件工程(SE)研究也越来越多地使用SO数据。为了利用SO数据找出趋势、含义、影响和未来的研究潜力,需要进行系统的制图研究。遵循严格的可重复映射研究方法,我们从18个知名SE期刊和会议中收集了384篇基于so的研究文章,并将其分为10个方面(即主题)。我们发现,与Quora和Reddit等热门问答网站相比,SO占SE研究的85%。我们发现18个SE领域直接受益于SO数据,而推荐系统、API设计和进化领域使用SO数据最多(分别占所有基于SO的研究的15%和16%)。API设计与进化和机器学习与/为SE领域有一致的向上发表。深度学习漏洞分析和代码克隆是目前最具潜力的研究领域。通过本次测绘研究的见解、建议和基于facet的分类论文列表,SE研究人员可以根据自己的兴趣找到潜在的研究领域,利用大规模SO数据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A systematic mapping study of crowd knowledge enhanced software engineering research using Stack Overflow
Developers continuously interact in crowd-sourced community-based question-answer (Q&A) sites. Reportedly, 30% of all software professionals visit the most popular Q&A site StackOverflow (SO) every day. Software engineering (SE) research studies are also increasingly using SO data. To find out the trend, implication, impact, and future research potential utilizing SO data, a systematic mapping study needs to be conducted. Following a rigorous reproducible mapping study approach, from 18 reputed SE journals and conferences, we collected 384 SO-based research articles and categorized them into 10 facets (i.e., themes). We found that SO contributes to 85% of SE research compared with popular Q&A sites such as Quora, and Reddit. We found that 18 SE domains directly benefited from SO data whereas Recommender Systems, and API Design and Evolution domains use SO data the most (15% and 16% of all SO-based research studies, respectively). API Design and Evolution, and Machine Learning with/for SE domains have consistent upward publication. Deep Learning Bug Analysis and Code Cloning research areas have the highest potential research impact recently. With the insights, recommendations, and facet-based categorized paper list from this mapping study, SE researchers can find out potential research areas according to their interest to utilize large-scale SO data.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Systems and Software
Journal of Systems and Software 工程技术-计算机:理论方法
CiteScore
8.60
自引率
5.70%
发文量
193
审稿时长
16 weeks
期刊介绍: The Journal of Systems and Software publishes papers covering all aspects of software engineering and related hardware-software-systems issues. All articles should include a validation of the idea presented, e.g. through case studies, experiments, or systematic comparisons with other approaches already in practice. Topics of interest include, but are not limited to: •Methods and tools for, and empirical studies on, software requirements, design, architecture, verification and validation, maintenance and evolution •Agile, model-driven, service-oriented, open source and global software development •Approaches for mobile, multiprocessing, real-time, distributed, cloud-based, dependable and virtualized systems •Human factors and management concerns of software development •Data management and big data issues of software systems •Metrics and evaluation, data mining of software development resources •Business and economic aspects of software development processes The journal welcomes state-of-the-art surveys and reports of practical experience for all of these topics.
期刊最新文献
LogNER: Enhancing log semantics with LLM-driven entity recognition Who “controls” where work shall be done? State-of-practice in post-pandemic remote work regulation ScenGDL: Smart contract vulnerability detection and location based on temporal Scenarios and Graph convolution networks On-the-fly repair of multi-variable atomicity violations in airborne software SCENE: Guidelines for Security Chaos Engineering based on a systematic literature review
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1