SHAP-assisted EE-LightGBM model for explainable fault diagnosis in practical optical networks

IF 4 2区计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Journal of Optical Communications and Networking Pub Date : 2025-01-17 DOI:10.1364/JOCN.527872

Chunyu Zhang;Yu Chen;Min Zhang;Zhuo Liu;Danshi Wang

{"title":"SHAP-assisted EE-LightGBM model for explainable fault diagnosis in practical optical networks","authors":"Chunyu Zhang;Yu Chen;Min Zhang;Zhuo Liu;Danshi Wang","doi":"10.1364/JOCN.527872","DOIUrl":null,"url":null,"abstract":"Reliable fault diagnosis is crucial for ensuring the stable operation of optical networks. Recently, data-driven techniques have demonstrated significant advantages in fault diagnosis due to their outstanding data-processing capabilities and adaptive learning abilities. However, as equipment faults in practical optical networks are rare events, the data collected often faces severe data imbalance issues, greatly limiting the accuracy of traditional data-driven models. To address this challenge, a SHAP-assisted EE-LightGBM scheme is proposed for explainable fault diagnosis in practical optical networks. The EE-LightGBM model integrates undersampling strategies at the data level and hybrid ensemble strategies at the model level, enabling the full utilization of fewer fault samples and effectively alleviating the impact of data imbalance on model training. Furthermore, the SHAP method is used to explain the EE-LightGBM model. This method quantifies the contributions of input features to the model’s decision outputs, facilitating a deeper understanding of the mechanisms underlying faults in the equipment and improving the model’s explainability. Through SHAP analysis, we can determine key features highly correlated with equipment faults, thereby inferring the causes of equipment faults. Evaluation using data from backbone network equipment managed by operators shows excellent detection performance of the EE-LightGBM model at a data imbalance rate of 5.61%, with accuracy and F1 scores of 0.9968 and 0.9711, and false negative and false positive rates of 0.0033 and 0.0032, respectively. Moreover, the cause identification results are consistent with diagnostic expertise. We also explore the impact of data imbalance rates on the detection performance of the EE-LightGBM model. The model’s low false negative rate under data imbalance further demonstrates its effectiveness in practical optical network fault diagnosis.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"17 2","pages":"81-94"},"PeriodicalIF":4.0000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10844950/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}

引用次数: 0

Abstract

Reliable fault diagnosis is crucial for ensuring the stable operation of optical networks. Recently, data-driven techniques have demonstrated significant advantages in fault diagnosis due to their outstanding data-processing capabilities and adaptive learning abilities. However, as equipment faults in practical optical networks are rare events, the data collected often faces severe data imbalance issues, greatly limiting the accuracy of traditional data-driven models. To address this challenge, a SHAP-assisted EE-LightGBM scheme is proposed for explainable fault diagnosis in practical optical networks. The EE-LightGBM model integrates undersampling strategies at the data level and hybrid ensemble strategies at the model level, enabling the full utilization of fewer fault samples and effectively alleviating the impact of data imbalance on model training. Furthermore, the SHAP method is used to explain the EE-LightGBM model. This method quantifies the contributions of input features to the model’s decision outputs, facilitating a deeper understanding of the mechanisms underlying faults in the equipment and improving the model’s explainability. Through SHAP analysis, we can determine key features highly correlated with equipment faults, thereby inferring the causes of equipment faults. Evaluation using data from backbone network equipment managed by operators shows excellent detection performance of the EE-LightGBM model at a data imbalance rate of 5.61%, with accuracy and F1 scores of 0.9968 and 0.9711, and false negative and false positive rates of 0.0033 and 0.0032, respectively. Moreover, the cause identification results are consistent with diagnostic expertise. We also explore the impact of data imbalance rates on the detection performance of the EE-LightGBM model. The model’s low false negative rate under data imbalance further demonstrates its effectiveness in practical optical network fault diagnosis.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

实用光网络可解释故障诊断的shap辅助EE-LightGBM模型

可靠的故障诊断是保证光网络稳定运行的关键。近年来，数据驱动技术由于其出色的数据处理能力和自适应学习能力，在故障诊断中显示出显著的优势。然而，由于实际光网络中设备故障很少发生，采集到的数据往往面临着严重的数据不平衡问题，极大地限制了传统数据驱动模型的准确性。为了解决这一挑战，提出了一种shap辅助的EE-LightGBM方案，用于实际光网络中的可解释故障诊断。EE-LightGBM模型集成了数据层的欠采样策略和模型层的混合集成策略，能够充分利用较少的故障样本，有效缓解数据不平衡对模型训练的影响。此外，采用SHAP方法对EE-LightGBM模型进行了解释。该方法量化了输入特征对模型决策输出的贡献，有助于更深入地理解设备故障的机制，并提高模型的可解释性。通过SHAP分析，我们可以确定与设备故障高度相关的关键特征，从而推断设备故障的原因。利用运营商管理的骨干网设备数据进行评估，结果表明，EE-LightGBM模型检测性能优异，数据不平衡率为5.61%，准确率和F1得分分别为0.9968和0.9711，假阴性和假阳性率分别为0.0033和0.0032。此外，原因识别结果与诊断专家一致。我们还探讨了数据不平衡率对EE-LightGBM模型检测性能的影响。该模型在数据不平衡情况下的低假阴性率进一步证明了其在实际光网络故障诊断中的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Journal of Optical Communications and Networking 工程技术-电信学

CiteScore

9.40

自引率

16.00%

发文量

104

审稿时长

4 months

期刊介绍： The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.