Cross-Project setting using Deep learning Architectures in Just-In-Time Software Fault Prediction: An Investigation

Sushant Kumar Pandey, A. Tripathi
{"title":"Cross-Project setting using Deep learning Architectures in Just-In-Time Software Fault Prediction: An Investigation","authors":"Sushant Kumar Pandey, A. Tripathi","doi":"10.1109/AST58925.2023.00007","DOIUrl":null,"url":null,"abstract":"The prediction of whether a software change is fault-inducing or not in the software system using various learning methods, the study concerned in Just-In-Time Software Fault Prediction (JIT-SFP). Building such predicting model requires adequate training data. However, there needs to be more training data at the beginning of the software system. Cross-Project (CP) setting can subjugate this challenge by employing data from different software projects. It can achieve similar predictive performance to Within-Project (WP) fault prediction. It is still being determined to what level the CP training data can be useful in such a situation. Furthermore, it also needs to be discovered whether CP data are helpful in the initial phase of fault detection, and when there is an inadequate WP train set, CP could be beneficial to extend. This article deals with such investigations in real software projects. We proposed a new method by levering a deep belief network and long short-term memory called JITCP-Predictor. Out of ten, the proposed model significantly outperforms every ten project benchmark methods, and it is superior from 10.63% to 136.36% and 7.04% to 35.71% in terms of MCC and F-Measure, respectively. The mean values of MCC and F-Measure produced by JITCP-Predictor are 0.52 ± 0.021 and 0.76 ± 0.76, respectively. We also found that the proposed model is more suitable for large and moderate-size projects. The proposed model avoids class imbalance and overfitting problems and takes reasonable training costs.","PeriodicalId":252417,"journal":{"name":"2023 IEEE/ACM International Conference on Automation of Software Test (AST)","volume":"1246 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/ACM International Conference on Automation of Software Test (AST)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AST58925.2023.00007","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The prediction of whether a software change is fault-inducing or not in the software system using various learning methods, the study concerned in Just-In-Time Software Fault Prediction (JIT-SFP). Building such predicting model requires adequate training data. However, there needs to be more training data at the beginning of the software system. Cross-Project (CP) setting can subjugate this challenge by employing data from different software projects. It can achieve similar predictive performance to Within-Project (WP) fault prediction. It is still being determined to what level the CP training data can be useful in such a situation. Furthermore, it also needs to be discovered whether CP data are helpful in the initial phase of fault detection, and when there is an inadequate WP train set, CP could be beneficial to extend. This article deals with such investigations in real software projects. We proposed a new method by levering a deep belief network and long short-term memory called JITCP-Predictor. Out of ten, the proposed model significantly outperforms every ten project benchmark methods, and it is superior from 10.63% to 136.36% and 7.04% to 35.71% in terms of MCC and F-Measure, respectively. The mean values of MCC and F-Measure produced by JITCP-Predictor are 0.52 ± 0.021 and 0.76 ± 0.76, respectively. We also found that the proposed model is more suitable for large and moderate-size projects. The proposed model avoids class imbalance and overfitting problems and takes reasonable training costs.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
在实时软件故障预测中使用深度学习架构的跨项目设置研究
JIT-SFP (Just-In-Time software Fault prediction)是利用各种学习方法来预测软件系统中的软件变更是否会导致故障的研究。建立这样的预测模型需要足够的训练数据。然而,在软件系统的初始阶段,需要有更多的训练数据。跨项目(CP)设置可以通过使用来自不同软件项目的数据来克服这一挑战。它可以达到与项目内(WP)故障预测相似的预测性能。在这种情况下,CP训练数据的有用程度仍有待确定。此外,还需要发现CP数据在故障检测的初始阶段是否有帮助,当WP训练集不足时,CP是否有利于扩展。本文讨论了在实际软件项目中的此类调查。我们提出了一种利用深度信念网络和长短期记忆的新方法,称为JITCP-Predictor。其中,提出的模型显著优于每10种项目基准方法,在MCC和F-Measure方面分别优于10.63% ~ 136.36%和7.04% ~ 35.71%。JITCP-Predictor的MCC和F-Measure的平均值分别为0.52±0.021和0.76±0.76。我们还发现,所提出的模型更适合大中型项目。该模型避免了类不平衡和过拟合问题,训练成本合理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
FlakyCat: Predicting Flaky Tests Categories using Few-Shot Learning Evaluating the Trade-offs of Text-based Diversity in Test Prioritisation Cross-Project setting using Deep learning Architectures in Just-In-Time Software Fault Prediction: An Investigation AutoMetric: Towards Measuring Open-Source Software Quality Metrics Automatically Detecting Potential User-data Save & Export Losses due to Android App Termination
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1