Facilitating feasibility analysis: the pilot defects prediction dataset maker

D. Falessi, Max Jason Moede
{"title":"Facilitating feasibility analysis: the pilot defects prediction dataset maker","authors":"D. Falessi, Max Jason Moede","doi":"10.1145/3278142.3278147","DOIUrl":null,"url":null,"abstract":"Our industrial experience in institutionalizing defect prediction models in the software industry shows that the first step is to measure prediction metrics and defects to assess the feasibility of the tool, i.e., if the accuracy of the defect prediction tool is higher than of a random predictor. However, computing prediction metrics is time consuming and error prone. Thus, the feasibility analysis has a cost which needs some initial investment by the potential clients. This initial investment acts as a barrier for convincing potential clients of the benefits of institutionalizing a software prediction model. To reduce this barrier, in this paper we present the Pilot Defects Prediction Dataset Maker (PDPDM), a desktop application for measuring metrics to use for defect prediction. PDPDM receives as input the repository’s information of a software project, and it provides as output, in an easy and replicable way, a dataset containing a set of 17 well-defined product and process metrics, that have been shown to be useful for defect prediction, such as size and smells. PDPDM avoids the use of outdated datasets and it allows researchers and practitioners to create defect datasets without the need to write any lines of code.","PeriodicalId":108238,"journal":{"name":"Proceedings of the 4th ACM SIGSOFT International Workshop on Software Analytics","volume":"9 3 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th ACM SIGSOFT International Workshop on Software Analytics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3278142.3278147","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Our industrial experience in institutionalizing defect prediction models in the software industry shows that the first step is to measure prediction metrics and defects to assess the feasibility of the tool, i.e., if the accuracy of the defect prediction tool is higher than of a random predictor. However, computing prediction metrics is time consuming and error prone. Thus, the feasibility analysis has a cost which needs some initial investment by the potential clients. This initial investment acts as a barrier for convincing potential clients of the benefits of institutionalizing a software prediction model. To reduce this barrier, in this paper we present the Pilot Defects Prediction Dataset Maker (PDPDM), a desktop application for measuring metrics to use for defect prediction. PDPDM receives as input the repository’s information of a software project, and it provides as output, in an easy and replicable way, a dataset containing a set of 17 well-defined product and process metrics, that have been shown to be useful for defect prediction, such as size and smells. PDPDM avoids the use of outdated datasets and it allows researchers and practitioners to create defect datasets without the need to write any lines of code.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
便于可行性分析:飞行员缺陷预测数据集制作者
我们在软件行业中将缺陷预测模型制度化的行业经验表明,第一步是度量预测度量和缺陷,以评估工具的可行性,也就是说,如果缺陷预测工具的准确性高于随机预测器。然而,计算预测指标非常耗时且容易出错。因此,可行性分析是有成本的,需要潜在客户的一些初始投资。这种最初的投资成为说服潜在客户相信制度化软件预测模型的好处的障碍。为了减少这种障碍,在本文中,我们提出了试点缺陷预测数据集生成器(PDPDM),一个用于测量用于缺陷预测的度量的桌面应用程序。PDPDM接收软件项目的存储库信息作为输入,并以一种简单且可复制的方式提供一个包含17个定义良好的产品和过程度量的数据集作为输出,这些数据集已被证明对缺陷预测有用,例如大小和气味。PDPDM避免使用过时的数据集,它允许研究人员和从业者创建缺陷数据集,而不需要编写任何代码行。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Towards a framework for generating program dependence graphs from source code Differentially-private software analytics for mobile apps: opportunities and challenges Proceedings of the 4th ACM SIGSOFT International Workshop on Software Analytics Facilitating feasibility analysis: the pilot defects prediction dataset maker Is one hyperparameter optimizer enough?
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1