IMDAC: A robust intelligent software defect prediction model via multi‐objective optimization and end‐to‐end hybrid deep learning networks

Software: Practice and Experience Pub Date : 2023-09-28 DOI:10.1002/spe.3274

Kun Zhu, Nana Zhang, Changjun Jiang, Dandan Zhu

{"title":"IMDAC: A robust intelligent software defect prediction model via multi‐objective optimization and end‐to‐end hybrid deep learning networks","authors":"Kun Zhu, Nana Zhang, Changjun Jiang, Dandan Zhu","doi":"10.1002/spe.3274","DOIUrl":null,"url":null,"abstract":"Abstract Software defect prediction (SDP) aims to build an effective prediction model for historical defect data from software repositories by some specialized techniques or algorithms, and predict the defect proneness of new software modules. Nevertheless, the complex internal intrinsic structure hidden behind the defect data makes it challenging for the built prediction model to capture the most expressive defect feature representations, and largely limits the SDP performance. Fortunately, artificial intelligence is interacting closely with humans and provides powerful intelligent technical support for addressing these SDP issues. In this article, we propose a robust intelligent SDP model called IMDAC based on deep learning and soft computing techniques. This model has three main advantages: (1) an effective deep generative network—InfoGAN (information maximizing GANs) is employed to conduct data augmentation, namely generating sufficient defect instances and achieving defect class balance simultaneously. (2) Select the fewest representative feature subset for the minimum error via an advanced multi‐objective optimization approach—MSEA (multi‐stage evolutionary algorithm). (3) Build a powerful end‐to‐end deep defect predictor by hybrid deep learning techniques—DAE (Denoising AutoEncoder) and CNN (convolutional neural network), which can not only reconstruct a clean “repaired” input with strong robustness and generalization capabilities via DAE, but also learn the abstract deep semantic features with strong discriminating capability via CNN. Experimental results verify the superiority and robustness of the IMDAC model across 15 software projects.","PeriodicalId":21899,"journal":{"name":"Software: Practice and Experience","volume":"65 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-09-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software: Practice and Experience","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/spe.3274","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Abstract Software defect prediction (SDP) aims to build an effective prediction model for historical defect data from software repositories by some specialized techniques or algorithms, and predict the defect proneness of new software modules. Nevertheless, the complex internal intrinsic structure hidden behind the defect data makes it challenging for the built prediction model to capture the most expressive defect feature representations, and largely limits the SDP performance. Fortunately, artificial intelligence is interacting closely with humans and provides powerful intelligent technical support for addressing these SDP issues. In this article, we propose a robust intelligent SDP model called IMDAC based on deep learning and soft computing techniques. This model has three main advantages: (1) an effective deep generative network—InfoGAN (information maximizing GANs) is employed to conduct data augmentation, namely generating sufficient defect instances and achieving defect class balance simultaneously. (2) Select the fewest representative feature subset for the minimum error via an advanced multi‐objective optimization approach—MSEA (multi‐stage evolutionary algorithm). (3) Build a powerful end‐to‐end deep defect predictor by hybrid deep learning techniques—DAE (Denoising AutoEncoder) and CNN (convolutional neural network), which can not only reconstruct a clean “repaired” input with strong robustness and generalization capabilities via DAE, but also learn the abstract deep semantic features with strong discriminating capability via CNN. Experimental results verify the superiority and robustness of the IMDAC model across 15 software projects.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

IMDAC:基于多目标优化和端到端混合深度学习网络的鲁棒智能软件缺陷预测模型

摘要软件缺陷预测(Software defect prediction, SDP)旨在通过一些专门的技术或算法，对软件库中的历史缺陷数据建立有效的预测模型，预测新软件模块的缺陷倾向。然而，隐藏在缺陷数据背后的复杂的内部固有结构使得所构建的预测模型很难捕捉到最具表现力的缺陷特征表示，这在很大程度上限制了SDP的性能。幸运的是，人工智能正在与人类密切互动，为解决这些SDP问题提供强大的智能技术支持。在本文中，我们提出了一个基于深度学习和软计算技术的鲁棒智能SDP模型IMDAC。该模型有三个主要优点:(1)利用有效的深度生成网络——信息最大化gan (information maximize GANs)进行数据扩充，即生成足够的缺陷实例，同时实现缺陷类平衡。(2)采用一种先进的多目标优化方法-多阶段进化算法(msea)，选择具有最小误差的最小代表性特征子集。(3)采用去噪自动编码器(Denoising AutoEncoder)和卷积神经网络(CNN)的混合深度学习技术构建强大的端到端深度缺陷预测器，该预测器不仅可以通过DAE重建具有较强鲁棒性和泛化能力的干净“修复”输入，还可以通过CNN学习具有较强判别能力的抽象深度语义特征。实验结果验证了IMDAC模型在15个软件项目中的优越性和鲁棒性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Software: Practice and Experience

自引率

0.00%

发文量