Evaluating the Impact of Data Transformation Techniques on the Performance and Interpretability of Software Defect Prediction Models

IF 1.5 4区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING IET Software Pub Date : 2023-11-14 DOI:10.1049/2023/6293074

Yu Zhao, Zhiqiu Huang, Lina Gong, Yi Zhu, Qiao Yu, Yuxiang Gao

{"title":"Evaluating the Impact of Data Transformation Techniques on the Performance and Interpretability of Software Defect Prediction Models","authors":"Yu Zhao, Zhiqiu Huang, Lina Gong, Yi Zhu, Qiao Yu, Yuxiang Gao","doi":"10.1049/2023/6293074","DOIUrl":null,"url":null,"abstract":"<div>\n <p>The performance of software defect prediction (SDP) models determines the priority of test resource allocation. Researchers also use interpretability techniques to gain empirical knowledge about software quality from SDP models. However, SDP methods designed in the past research rarely consider the impact of data transformation methods, simple but commonly used preprocessing techniques, on the performance and interpretability of SDP models. Therefore, in this paper, we investigate the impact of three data transformation methods (Log, Minmax, and <i>Z</i>-score) on the performance and interpretability of SDP models. Through empirical research on (i) six classification techniques (random forest, decision tree, logistic regression, Naive Bayes, K-nearest neighbors, and multilayer perceptron), (ii) six performance evaluation indicators (Accuracy, Precision, Recall, F1, MCC, and AUC), (iii) two interpretable methods (permutation and SHAP), (iv) two feature importance measures (Top-<i>k</i> feature rank overlap and difference), and (v) three datasets (Promise, Relink, and AEEEM), our results show that the data transformation methods can significantly improve the performance of the SDP models and greatly affect the variation of the most important features. Specifically, the impact of data transformation methods on the performance and interpretability of SDP models depends on the classification techniques and evaluation indicators. We observe that log transformation improves NB model performance by 7%–61% on the other five indicators with a 5% drop in Precision. Minmax and <i>Z</i>-score transformation improves NB model performance by 2%–9% across all indicators. However, all three transformation methods lead to substantial changes in the Top-5 important feature ranks, with differences exceeding 2 in 40%–80% of cases (detailed results available in the main content). Based on our findings, we recommend that (1) considering the impact of data transformation methods on model performance and interpretability when designing SDP approaches as transformations can improve model accuracy, and potentially obscure important features, which lead to challenges in interpretation, (2) conducting comparative experiments with and without the transformations to validate the effectiveness of proposed methods which are designed to improve the prediction performance, and (3) tracking changes in the most important features before and after applying data transformation methods to ensure precise and traceable interpretability conclusions to gain insights. Our study reminds researchers and practitioners of the need for comprehensive considerations even when using other similar simple data processing methods.</p>\n </div>","PeriodicalId":50378,"journal":{"name":"IET Software","volume":"2023 1","pages":""},"PeriodicalIF":1.5000,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2023/6293074","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IET Software","FirstCategoryId":"94","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/2023/6293074","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

The performance of software defect prediction (SDP) models determines the priority of test resource allocation. Researchers also use interpretability techniques to gain empirical knowledge about software quality from SDP models. However, SDP methods designed in the past research rarely consider the impact of data transformation methods, simple but commonly used preprocessing techniques, on the performance and interpretability of SDP models. Therefore, in this paper, we investigate the impact of three data transformation methods (Log, Minmax, and Z-score) on the performance and interpretability of SDP models. Through empirical research on (i) six classification techniques (random forest, decision tree, logistic regression, Naive Bayes, K-nearest neighbors, and multilayer perceptron), (ii) six performance evaluation indicators (Accuracy, Precision, Recall, F1, MCC, and AUC), (iii) two interpretable methods (permutation and SHAP), (iv) two feature importance measures (Top-k feature rank overlap and difference), and (v) three datasets (Promise, Relink, and AEEEM), our results show that the data transformation methods can significantly improve the performance of the SDP models and greatly affect the variation of the most important features. Specifically, the impact of data transformation methods on the performance and interpretability of SDP models depends on the classification techniques and evaluation indicators. We observe that log transformation improves NB model performance by 7%–61% on the other five indicators with a 5% drop in Precision. Minmax and Z-score transformation improves NB model performance by 2%–9% across all indicators. However, all three transformation methods lead to substantial changes in the Top-5 important feature ranks, with differences exceeding 2 in 40%–80% of cases (detailed results available in the main content). Based on our findings, we recommend that (1) considering the impact of data transformation methods on model performance and interpretability when designing SDP approaches as transformations can improve model accuracy, and potentially obscure important features, which lead to challenges in interpretation, (2) conducting comparative experiments with and without the transformations to validate the effectiveness of proposed methods which are designed to improve the prediction performance, and (3) tracking changes in the most important features before and after applying data transformation methods to ensure precise and traceable interpretability conclusions to gain insights. Our study reminds researchers and practitioners of the need for comprehensive considerations even when using other similar simple data processing methods.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

评估数据转换技术对软件缺陷预测模型的性能和可解释性的影响

软件缺陷预测模型的性能决定了测试资源分配的优先级。研究人员还使用可解释性技术从SDP模型中获得关于软件质量的经验知识。然而，以往研究设计的SDP方法很少考虑数据转换方法(简单但常用的预处理技术)对SDP模型性能和可解释性的影响。因此，在本文中，我们研究了三种数据转换方法(Log, Minmax和Z-score)对SDP模型的性能和可解释性的影响。通过对(i)六种分类技术(随机森林、决策树、逻辑回归、朴素贝叶斯、k近邻和多层感知器)、(ii)六种性能评价指标(准确率、精密度、召回率、F1、MCC和AUC)、(iii)两种可解释方法(置换和SHAP)、(iv)两种特征重要性度量(Top-k特征秩重叠和差异)以及(v)三个数据集(Promise、Relink和AEEEM)的实证研究，结果表明，数据转换方法可以显著提高SDP模型的性能，并对最重要特征的变化有很大影响。具体而言，数据转换方法对SDP模型性能和可解释性的影响取决于分类技术和评价指标。我们观察到，对数变换在其他五个指标上使NB模型性能提高了7%-61%，而精度下降了5%。Minmax和Z-score转换在所有指标上提高了NB模型的性能2%-9%。然而，这三种转换方法都导致了Top-5重要特征排名的实质性变化，在40%-80%的情况下差异超过2(详细结果见主要内容)。基于我们的研究结果，我们建议(1)在设计SDP方法时考虑数据转换方法对模型性能和可解释性的影响，因为转换可以提高模型精度，但可能会模糊重要特征，从而导致解释挑战;(2)进行有转换和没有转换的对比实验，以验证所提出的旨在提高预测性能的方法的有效性。(3)跟踪应用数据转换方法前后最重要特征的变化，确保结论的精确性和可追溯性，从而获得洞见。我们的研究提醒研究人员和从业者，即使使用其他类似的简单数据处理方法，也需要全面考虑。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IET Software 工程技术-计算机：软件工程

CiteScore

4.20

自引率

0.00%

发文量

审稿时长

9 months

期刊介绍： IET Software publishes papers on all aspects of the software lifecycle, including design, development, implementation and maintenance. The focus of the journal is on the methods used to develop and maintain software, and their practical application. Authors are especially encouraged to submit papers on the following topics, although papers on all aspects of software engineering are welcome: Software and systems requirements engineering Formal methods, design methods, practice and experience Software architecture, aspect and object orientation, reuse and re-engineering Testing, verification and validation techniques Software dependability and measurement Human systems engineering and human-computer interaction Knowledge engineering; expert and knowledge-based systems, intelligent agents Information systems engineering Application of software engineering in industry and commerce Software engineering technology transfer Management of software development Theoretical aspects of software development Machine learning Big data and big code Cloud computing Current Special Issue. Call for papers: Knowledge Discovery for Software Development - https://digital-library.theiet.org/files/IET_SEN_CFP_KDSD.pdf Big Data Analytics for Sustainable Software Development - https://digital-library.theiet.org/files/IET_SEN_CFP_BDASSD.pdf