Long short-term memory (LSTM) networks based software fault prediction using data transformation methods

Md. Rashedul Islam, M. Akhtar, M. Begum
{"title":"Long short-term memory (LSTM) networks based software fault prediction using data transformation methods","authors":"Md. Rashedul Islam, M. Akhtar, M. Begum","doi":"10.1109/icaeee54957.2022.9836388","DOIUrl":null,"url":null,"abstract":"The upcoming digital transformation of the modern industry will principally build upon the software systems. Certainly, any software system should commit to being fully reliable and free from any deficiency such as software faults. Maintaining the aforementioned consistency is the main objective of software reliability. The long short-term memory (LSTM) networks are employed for the first time in this kind of research to forecast software faults. The one-step walk-forward validation method is used to predict the software faults. Due to the exponential nature of data, we normalized our cumulative software fault count data using Min-Max Scalar and Box-Cox Transformation methods. Each type of normalized data is fed into the LSTM networks. With the same batch size, the number of neurons and epoch parameters were regulated with different tiers of combinations. The time-series-based software fault data were trained and tested after applying Min-Max and Box-Cox data transformation methods to obtain the root means square error (RMSE) values, and then both models were compared with each other. The RMSE values of the model with the Min-Max Scaler transforming method outperform the second model built with the Box-Cox Transformation method. From our very best knowledge, the obtained RMSE value from the software fault count data using LSTM is the first of its kind. Our models clearly show that the LSTM can be used to predict software faults. We also calculated the data dispersion from the observed independent RMSE data points of each model. The quantified data dispersion value of the second model was found to be less minimal than the first one.","PeriodicalId":383872,"journal":{"name":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","volume":"161 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaeee54957.2022.9836388","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

The upcoming digital transformation of the modern industry will principally build upon the software systems. Certainly, any software system should commit to being fully reliable and free from any deficiency such as software faults. Maintaining the aforementioned consistency is the main objective of software reliability. The long short-term memory (LSTM) networks are employed for the first time in this kind of research to forecast software faults. The one-step walk-forward validation method is used to predict the software faults. Due to the exponential nature of data, we normalized our cumulative software fault count data using Min-Max Scalar and Box-Cox Transformation methods. Each type of normalized data is fed into the LSTM networks. With the same batch size, the number of neurons and epoch parameters were regulated with different tiers of combinations. The time-series-based software fault data were trained and tested after applying Min-Max and Box-Cox data transformation methods to obtain the root means square error (RMSE) values, and then both models were compared with each other. The RMSE values of the model with the Min-Max Scaler transforming method outperform the second model built with the Box-Cox Transformation method. From our very best knowledge, the obtained RMSE value from the software fault count data using LSTM is the first of its kind. Our models clearly show that the LSTM can be used to predict software faults. We also calculated the data dispersion from the observed independent RMSE data points of each model. The quantified data dispersion value of the second model was found to be less minimal than the first one.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于数据转换方法的LSTM网络软件故障预测
即将到来的现代工业数字化转型将主要建立在软件系统的基础上。当然,任何软件系统都应该保证完全可靠,没有任何缺陷,比如软件故障。保持上述一致性是软件可靠性的主要目标。在这类研究中,首次将长短期记忆(LSTM)网络用于软件故障预测。采用一步前向验证法对软件故障进行预测。由于数据的指数性质,我们使用最小-最大标量和Box-Cox变换方法对累积软件故障计数数据进行了归一化。每种类型的规范化数据都被输入到LSTM网络中。在相同的批处理规模下,采用不同层次的组合来调节神经元数量和epoch参数。采用Min-Max和Box-Cox两种数据变换方法对基于时间序列的软件故障数据进行训练和检验,得到误差均方根(RMSE)值,并对两种模型进行比较。用最小-最大标量变换方法建立的模型的RMSE值优于用Box-Cox变换方法建立的第二个模型。据我们所知,使用LSTM从软件故障计数数据中获得的RMSE值是同类中的第一个。我们的模型清楚地表明LSTM可以用来预测软件故障。我们还计算了每个模型观测到的独立RMSE数据点的数据离散度。发现第二种模型的量化数据离散值比第一种模型要小。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Design of a Multi-band Sierpinski Carpet Fractal Antenna With Modified Ground Plane Effect of Number of Modes of EMD in Respiratory Rate Estimation from PPG Signal An User Interest and Payment-aware Automated Car Parking System for the Bangladeshi People Using Android Application An Improved Load Frequency Control Strategy for Single & Multi-Area Power System Wall Shear Stress Assessment of Aorta with Varying Low-density Lipoprotein Concentration
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1