SHAP-assisted EE-LightGBM model for explainable fault diagnosis in practical optical networks

IF 4 2区 计算机科学 Q1 COMPUTER SCIENCE, HARDWARE & ARCHITECTURE Journal of Optical Communications and Networking Pub Date : 2025-01-17 DOI:10.1364/JOCN.527872
Chunyu Zhang;Yu Chen;Min Zhang;Zhuo Liu;Danshi Wang
{"title":"SHAP-assisted EE-LightGBM model for explainable fault diagnosis in practical optical networks","authors":"Chunyu Zhang;Yu Chen;Min Zhang;Zhuo Liu;Danshi Wang","doi":"10.1364/JOCN.527872","DOIUrl":null,"url":null,"abstract":"Reliable fault diagnosis is crucial for ensuring the stable operation of optical networks. Recently, data-driven techniques have demonstrated significant advantages in fault diagnosis due to their outstanding data-processing capabilities and adaptive learning abilities. However, as equipment faults in practical optical networks are rare events, the data collected often faces severe data imbalance issues, greatly limiting the accuracy of traditional data-driven models. To address this challenge, a SHAP-assisted EE-LightGBM scheme is proposed for explainable fault diagnosis in practical optical networks. The EE-LightGBM model integrates undersampling strategies at the data level and hybrid ensemble strategies at the model level, enabling the full utilization of fewer fault samples and effectively alleviating the impact of data imbalance on model training. Furthermore, the SHAP method is used to explain the EE-LightGBM model. This method quantifies the contributions of input features to the model’s decision outputs, facilitating a deeper understanding of the mechanisms underlying faults in the equipment and improving the model’s explainability. Through SHAP analysis, we can determine key features highly correlated with equipment faults, thereby inferring the causes of equipment faults. Evaluation using data from backbone network equipment managed by operators shows excellent detection performance of the EE-LightGBM model at a data imbalance rate of 5.61%, with accuracy and F1 scores of 0.9968 and 0.9711, and false negative and false positive rates of 0.0033 and 0.0032, respectively. Moreover, the cause identification results are consistent with diagnostic expertise. We also explore the impact of data imbalance rates on the detection performance of the EE-LightGBM model. The model’s low false negative rate under data imbalance further demonstrates its effectiveness in practical optical network fault diagnosis.","PeriodicalId":50103,"journal":{"name":"Journal of Optical Communications and Networking","volume":"17 2","pages":"81-94"},"PeriodicalIF":4.0000,"publicationDate":"2025-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Optical Communications and Networking","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10844950/","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, HARDWARE & ARCHITECTURE","Score":null,"Total":0}
引用次数: 0

Abstract

Reliable fault diagnosis is crucial for ensuring the stable operation of optical networks. Recently, data-driven techniques have demonstrated significant advantages in fault diagnosis due to their outstanding data-processing capabilities and adaptive learning abilities. However, as equipment faults in practical optical networks are rare events, the data collected often faces severe data imbalance issues, greatly limiting the accuracy of traditional data-driven models. To address this challenge, a SHAP-assisted EE-LightGBM scheme is proposed for explainable fault diagnosis in practical optical networks. The EE-LightGBM model integrates undersampling strategies at the data level and hybrid ensemble strategies at the model level, enabling the full utilization of fewer fault samples and effectively alleviating the impact of data imbalance on model training. Furthermore, the SHAP method is used to explain the EE-LightGBM model. This method quantifies the contributions of input features to the model’s decision outputs, facilitating a deeper understanding of the mechanisms underlying faults in the equipment and improving the model’s explainability. Through SHAP analysis, we can determine key features highly correlated with equipment faults, thereby inferring the causes of equipment faults. Evaluation using data from backbone network equipment managed by operators shows excellent detection performance of the EE-LightGBM model at a data imbalance rate of 5.61%, with accuracy and F1 scores of 0.9968 and 0.9711, and false negative and false positive rates of 0.0033 and 0.0032, respectively. Moreover, the cause identification results are consistent with diagnostic expertise. We also explore the impact of data imbalance rates on the detection performance of the EE-LightGBM model. The model’s low false negative rate under data imbalance further demonstrates its effectiveness in practical optical network fault diagnosis.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
实用光网络可解释故障诊断的shap辅助EE-LightGBM模型
可靠的故障诊断是保证光网络稳定运行的关键。近年来,数据驱动技术由于其出色的数据处理能力和自适应学习能力,在故障诊断中显示出显著的优势。然而,由于实际光网络中设备故障很少发生,采集到的数据往往面临着严重的数据不平衡问题,极大地限制了传统数据驱动模型的准确性。为了解决这一挑战,提出了一种shap辅助的EE-LightGBM方案,用于实际光网络中的可解释故障诊断。EE-LightGBM模型集成了数据层的欠采样策略和模型层的混合集成策略,能够充分利用较少的故障样本,有效缓解数据不平衡对模型训练的影响。此外,采用SHAP方法对EE-LightGBM模型进行了解释。该方法量化了输入特征对模型决策输出的贡献,有助于更深入地理解设备故障的机制,并提高模型的可解释性。通过SHAP分析,我们可以确定与设备故障高度相关的关键特征,从而推断设备故障的原因。利用运营商管理的骨干网设备数据进行评估,结果表明,EE-LightGBM模型检测性能优异,数据不平衡率为5.61%,准确率和F1得分分别为0.9968和0.9711,假阴性和假阳性率分别为0.0033和0.0032。此外,原因识别结果与诊断专家一致。我们还探讨了数据不平衡率对EE-LightGBM模型检测性能的影响。该模型在数据不平衡情况下的低假阴性率进一步证明了其在实际光网络故障诊断中的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
9.40
自引率
16.00%
发文量
104
审稿时长
4 months
期刊介绍: The scope of the Journal includes advances in the state-of-the-art of optical networking science, technology, and engineering. Both theoretical contributions (including new techniques, concepts, analyses, and economic studies) and practical contributions (including optical networking experiments, prototypes, and new applications) are encouraged. Subareas of interest include the architecture and design of optical networks, optical network survivability and security, software-defined optical networking, elastic optical networks, data and control plane advances, network management related innovation, and optical access networks. Enabling technologies and their applications are suitable topics only if the results are shown to directly impact optical networking beyond simple point-to-point networks.
期刊最新文献
On the cross-layer restoration to address packet layer failures in P2MP-TRX-based WSONs SHAP-assisted EE-LightGBM model for explainable fault diagnosis in practical optical networks Demonstration of a three-node wavelength division multiplexed hybrid quantum-classical network through multicore fiber Programmable packet-optical network security and monitoring using DPUs with embedded GPUs [Invited] Overview of SDN control of multiband over SDM optical networks with physical layer impairments [Invited Tutorial]
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1