Predicting Defectiveness of Software Patches

Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement Pub Date : 2016-09-08 DOI:10.1145/2961111.2962601

Behjat Soltanifar, Atakan Erdem, A. Bener

{"title":"Predicting Defectiveness of Software Patches","authors":"Behjat Soltanifar, Atakan Erdem, A. Bener","doi":"10.1145/2961111.2962601","DOIUrl":null,"url":null,"abstract":"Context: Software code review, as an engineering best practice, refers to the inspection of the code change in order to find possible defects and ensure change quality. Code reviews, however, may not guarantee finding the defects. Thus, there is a risk for a defective code change in a given patch, to pass the review process and be submitted. Goal: In this research, we aim to apply different machine learning algorithms in order to predict the defectiveness of a patch after being reviewed, at the time of its submission. Method: We built three models using three different machine learning algorithms: Logistic Regression, NaÃŕve Bayes, and Bayesian Network model. To build the models, we consider different factors involved in review process in terms of Product, Process and People (3P). Results: Our empirical results show that, Bayesian Networks is able to better predict the defectiveness of the changed code with 76% accuracy. Conclusions: Predicting defectiveness of change code is beneficial in making patch release decisions. The Bayesian Network model outperforms the others since it capturs the relationship among the factors in the review process.","PeriodicalId":208212,"journal":{"name":"Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2961111.2962601","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 6

Abstract

Context: Software code review, as an engineering best practice, refers to the inspection of the code change in order to find possible defects and ensure change quality. Code reviews, however, may not guarantee finding the defects. Thus, there is a risk for a defective code change in a given patch, to pass the review process and be submitted. Goal: In this research, we aim to apply different machine learning algorithms in order to predict the defectiveness of a patch after being reviewed, at the time of its submission. Method: We built three models using three different machine learning algorithms: Logistic Regression, NaÃŕve Bayes, and Bayesian Network model. To build the models, we consider different factors involved in review process in terms of Product, Process and People (3P). Results: Our empirical results show that, Bayesian Networks is able to better predict the defectiveness of the changed code with 76% accuracy. Conclusions: Predicting defectiveness of change code is beneficial in making patch release decisions. The Bayesian Network model outperforms the others since it capturs the relationship among the factors in the review process.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

预测软件补丁的缺陷

背景:软件代码审查，作为一项工程最佳实践，指的是对代码变更进行检查，以发现可能的缺陷并确保变更的质量。然而，代码审查可能不能保证找到缺陷。因此，在给定的补丁中存在有缺陷的代码更改的风险，通过审查过程并提交。目标:在这项研究中，我们的目标是应用不同的机器学习算法，以便在提交补丁时，在审查后预测补丁的缺陷。方法:我们使用三种不同的机器学习算法:Logistic回归、NaÃŕve贝叶斯和贝叶斯网络模型建立了三个模型。为了构建模型，我们从产品、过程和人员(3P)的角度考虑评审过程中涉及的不同因素。结果:我们的实证结果表明，贝叶斯网络能够更好地预测更改代码的缺陷，准确率为76%。结论:预测变更代码的缺陷有助于制定补丁发布决策。贝叶斯网络模型优于其他模型，因为它捕获了审查过程中因素之间的关系。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Proceedings of the 10th ACM/IEEE International Symposium on Empirical Software Engineering and Measurement

自引率

0.00%

发文量