Yancai Zhou, Chen Zhang, Kai Jia, Dongdong Zhao, Jianwen Xiang
{"title":"基于深度学习和弱监督过采样的软件老化Bug预测框架","authors":"Yancai Zhou, Chen Zhang, Kai Jia, Dongdong Zhao, Jianwen Xiang","doi":"10.1109/ISSREW55968.2022.00066","DOIUrl":null,"url":null,"abstract":"Software aging refers to the phenomenon of sys-tem performance degradation and eventual failure caused by Aging-Related Bugs (ARBs). Software aging seriously affects the reliability and availability of software systems. To discover and remove ARBs, ARBs prediction is presented, and most of them only employed static code metrics to predict those buggy codes. However, static code metrics do not capture the syntactic and semantic features of the code, which are important to building accurate prediction models. To address this problem, we design a deep neural network by combining the bidirectional long short-term memory (BLSTM) and the attention mechanism to extract context-sensitive semantic features of the code. In addition, we apply a weakly supervised oversampling (WSO) method to alleviate class imbalance problems in datasets. We named our framework ABLSTM-WSO. We conduct experiments with five classifiers on two widely used open-source projects(MySQL and Linux) and use AUC, Balance, and F1-score as the evaluation metrics. Experimental results show that ABLSTM-WSO can significantly improve the ARBs prediction performance.","PeriodicalId":178302,"journal":{"name":"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","volume":"22 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Software Aging-Related Bug Prediction Framework Based on Deep Learning and Weakly Supervised Oversampling\",\"authors\":\"Yancai Zhou, Chen Zhang, Kai Jia, Dongdong Zhao, Jianwen Xiang\",\"doi\":\"10.1109/ISSREW55968.2022.00066\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Software aging refers to the phenomenon of sys-tem performance degradation and eventual failure caused by Aging-Related Bugs (ARBs). Software aging seriously affects the reliability and availability of software systems. To discover and remove ARBs, ARBs prediction is presented, and most of them only employed static code metrics to predict those buggy codes. However, static code metrics do not capture the syntactic and semantic features of the code, which are important to building accurate prediction models. To address this problem, we design a deep neural network by combining the bidirectional long short-term memory (BLSTM) and the attention mechanism to extract context-sensitive semantic features of the code. In addition, we apply a weakly supervised oversampling (WSO) method to alleviate class imbalance problems in datasets. We named our framework ABLSTM-WSO. We conduct experiments with five classifiers on two widely used open-source projects(MySQL and Linux) and use AUC, Balance, and F1-score as the evaluation metrics. Experimental results show that ABLSTM-WSO can significantly improve the ARBs prediction performance.\",\"PeriodicalId\":178302,\"journal\":{\"name\":\"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)\",\"volume\":\"22 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ISSREW55968.2022.00066\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ISSREW55968.2022.00066","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
摘要
软件老化是指由老化相关bug (aging - related Bugs, arb)引起的系统性能下降和最终失效的现象。软件老化严重影响软件系统的可靠性和可用性。为了发现和删除arb,提出了arb预测方法,大多数方法仅使用静态代码度量来预测这些有bug的代码。然而,静态代码度量不能捕获代码的语法和语义特征,而这些特征对于构建准确的预测模型是很重要的。为了解决这一问题,我们设计了一个深度神经网络,结合双向长短期记忆(BLSTM)和注意机制来提取代码的上下文敏感语义特征。此外,我们应用弱监督过采样(WSO)方法来缓解数据集中的类不平衡问题。我们将我们的框架命名为ABLSTM-WSO。我们在两个广泛使用的开源项目(MySQL和Linux)上使用五个分类器进行实验,并使用AUC, Balance和F1-score作为评估指标。实验结果表明,ABLSTM-WSO能显著提高arb的预测性能。
A Software Aging-Related Bug Prediction Framework Based on Deep Learning and Weakly Supervised Oversampling
Software aging refers to the phenomenon of sys-tem performance degradation and eventual failure caused by Aging-Related Bugs (ARBs). Software aging seriously affects the reliability and availability of software systems. To discover and remove ARBs, ARBs prediction is presented, and most of them only employed static code metrics to predict those buggy codes. However, static code metrics do not capture the syntactic and semantic features of the code, which are important to building accurate prediction models. To address this problem, we design a deep neural network by combining the bidirectional long short-term memory (BLSTM) and the attention mechanism to extract context-sensitive semantic features of the code. In addition, we apply a weakly supervised oversampling (WSO) method to alleviate class imbalance problems in datasets. We named our framework ABLSTM-WSO. We conduct experiments with five classifiers on two widely used open-source projects(MySQL and Linux) and use AUC, Balance, and F1-score as the evaluation metrics. Experimental results show that ABLSTM-WSO can significantly improve the ARBs prediction performance.