TTAFPred: Prediction of time to aging failure for software systems based on a two-stream multi-scale features fusion network

IF 1.7 3区计算机科学 Q3 COMPUTER SCIENCE, SOFTWARE ENGINEERING Software Quality Journal Pub Date : 2024-07-22 DOI:10.1007/s11219-024-09692-2

Kai Jia, Xiao Yu, Chen Zhang, Wenzhi Xie, Dongdong Zhao, Jianwen Xiang

{"title":"TTAFPred: Prediction of time to aging failure for software systems based on a two-stream multi-scale features fusion network","authors":"Kai Jia, Xiao Yu, Chen Zhang, Wenzhi Xie, Dongdong Zhao, Jianwen Xiang","doi":"10.1007/s11219-024-09692-2","DOIUrl":null,"url":null,"abstract":"<p>Software aging refers to the performance degradation and failure crash phenomena in long-running systems. As a proactive remedy, software rejuvenation can be scheduled timely to mitigate aging effects. Inescapably, how to accurately predict the time to aging failure (TTAF) of software is a prerequisite for implementing effective rejuvenation. However, the characterization of software aging is relatively complicated, leading to the selection of aging indicators case by case, which means that only fitting the variation trend of a single indicator for prediction models to formulate a rejuvenation schedule may be limited. To fill this gap, this paper proposes a novel framework called TTAFPred, which directly constructs the direct mapping relationships between the software aging process considering multiple system indicators and TTAF. Specifically, this framework includes three parts, i.e., data preprocessing, software degradation feature extraction, and TTAF prediction modules. First, the raw data is processed into the input form required by the network. Secondly, a temporal relationship extraction stream integrating bidirectional gated recurrent unit (BiGRU) with attention mechanism is used to extract temporal features from raw inputs. Synchronously, a spatial relationships extraction stream is adopted to extract the spatial features for enhancing the representation ability of degraded features by using the multi-scale one-dimensional convolutional neural network (1DCNN) with the residual connection. Then, extracted temporal-spatial features from the two streams are further fused. Finally, two fully-connected layers are constructed to estimate the TTAF. The experiments are performed on two mainstream software systems (OpenStack and Android), and four sets of real run-to-failure data for each software system are collected. The effectiveness of the proposed TTAFPred is verified through extensive experiments with its seven competing models, and the prediction performance can be improved by 9.1%, 8.0%, and 8.0%, respectively, in terms of three evaluation metrics, compared to the best baseline model.</p>","PeriodicalId":21827,"journal":{"name":"Software Quality Journal","volume":"28 1","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2024-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Software Quality Journal","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11219-024-09692-2","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, SOFTWARE ENGINEERING","Score":null,"Total":0}

引用次数: 0

Abstract

Software aging refers to the performance degradation and failure crash phenomena in long-running systems. As a proactive remedy, software rejuvenation can be scheduled timely to mitigate aging effects. Inescapably, how to accurately predict the time to aging failure (TTAF) of software is a prerequisite for implementing effective rejuvenation. However, the characterization of software aging is relatively complicated, leading to the selection of aging indicators case by case, which means that only fitting the variation trend of a single indicator for prediction models to formulate a rejuvenation schedule may be limited. To fill this gap, this paper proposes a novel framework called TTAFPred, which directly constructs the direct mapping relationships between the software aging process considering multiple system indicators and TTAF. Specifically, this framework includes three parts, i.e., data preprocessing, software degradation feature extraction, and TTAF prediction modules. First, the raw data is processed into the input form required by the network. Secondly, a temporal relationship extraction stream integrating bidirectional gated recurrent unit (BiGRU) with attention mechanism is used to extract temporal features from raw inputs. Synchronously, a spatial relationships extraction stream is adopted to extract the spatial features for enhancing the representation ability of degraded features by using the multi-scale one-dimensional convolutional neural network (1DCNN) with the residual connection. Then, extracted temporal-spatial features from the two streams are further fused. Finally, two fully-connected layers are constructed to estimate the TTAF. The experiments are performed on two mainstream software systems (OpenStack and Android), and four sets of real run-to-failure data for each software system are collected. The effectiveness of the proposed TTAFPred is verified through extensive experiments with its seven competing models, and the prediction performance can be improved by 9.1%, 8.0%, and 8.0%, respectively, in terms of three evaluation metrics, compared to the best baseline model.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

TTAFPred：基于双流多尺度特征融合网络的软件系统老化故障时间预测

软件老化是指长期运行系统中出现的性能下降和故障崩溃现象。作为一种积极的补救措施，可以及时安排软件返老还童以减轻老化影响。不可回避的是，如何准确预测软件的老化失效时间（TTAF）是实施有效年轻化的前提。然而，软件老化的特征描述相对复杂，导致老化指标的选择需要逐一进行，这意味着仅拟合单一指标的变化趋势建立预测模型来制定年轻化计划可能会受到限制。为了填补这一空白，本文提出了一个名为 TTAFPred 的新框架，该框架直接构建了考虑多个系统指标的软件老化过程与 TTAF 之间的直接映射关系。具体来说，该框架包括三个部分，即数据预处理、软件老化特征提取和 TTAF 预测模块。首先，原始数据被处理成网络所需的输入形式。其次，使用集成了双向门控递归单元（BiGRU）和注意机制的时间关系提取流，从原始输入中提取时间特征。与此同时，利用多尺度一维卷积神经网络（1DCNN）的残差连接，采用空间关系提取流来提取空间特征，以增强降级特征的表示能力。然后，进一步融合从两个流中提取的时空特征。最后，构建两个全连接层来估计 TTAF。实验在两个主流软件系统（OpenStack 和 Android）上进行，每个软件系统收集了四组真实的运行失败数据。通过与七个竞争模型的大量实验，验证了所提出的 TTAFPred 的有效性，与最佳基线模型相比，其预测性能在三个评价指标上分别提高了 9.1%、8.0% 和 8.0%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Software Quality Journal 工程技术-计算机：软件工程

CiteScore

4.90

自引率

5.30%

发文量

审稿时长

>12 weeks

期刊介绍： The aims of the Software Quality Journal are: (1) To promote awareness of the crucial role of quality management in the effective construction of the software systems developed, used, and/or maintained by organizations in pursuit of their business objectives. (2) To provide a forum of the exchange of experiences and information on software quality management and the methods, tools and products used to measure and achieve it. (3) To provide a vehicle for the publication of academic papers related to all aspects of software quality. The Journal addresses all aspects of software quality from both a practical and an academic viewpoint. It invites contributions from practitioners and academics, as well as national and international policy and standard making bodies, and sets out to be the definitive international reference source for such information. The Journal will accept research, technique, case study, survey and tutorial submissions that address quality-related issues including, but not limited to: internal and external quality standards, management of quality within organizations, technical aspects of quality, quality aspects for product vendors, software measurement and metrics, software testing and other quality assurance techniques, total quality management and cultural aspects. Other technical issues with regard to software quality, including: data management, formal methods, safety critical applications, and CASE.