Extended Invariant Risk Minimization for Machine Fault Diagnosis With Label Noise and Data Shift

IF 8.9 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE transactions on neural networks and learning systems Pub Date : 2025-02-05 DOI:10.1109/TNNLS.2025.3531214

Zhenling Mo;Zijun Zhang;Qiang Miao;Kwok-Leung Tsui

{"title":"Extended Invariant Risk Minimization for Machine Fault Diagnosis With Label Noise and Data Shift","authors":"Zhenling Mo;Zijun Zhang;Qiang Miao;Kwok-Leung Tsui","doi":"10.1109/TNNLS.2025.3531214","DOIUrl":null,"url":null,"abstract":"Incorrect labels as well as the discrepancy between training and test domain data distributions can significantly affect the effectiveness of supervised data-driven models in machine fault diagnosis applications. Such a challenge can be characterized as the noisy label-domain generalization (NL-DG) problem. In this article, the extended invariant risk minimization (EIRM) is developed, which incorporates flat minima seeking to address the NL-DG challenge. The ability of handling NL-DG is realized by shifting the gradient penalty base from the dummy classifier to the entire model. EIRM is shown to be closely related to locating a flat minimum, which is crucial for label noise (LN) robustness and model generalization. Explorations on function smoothness and algorithm convergence are offered to understand EIRM from the theoretical aspect. An efficient implementation of EIRM is also developed to construct the fault diagnosis model. The EIRM-based fault diagnosis method is compared with strong benchmarks on multiple NL-DG tasks using actuator and gearbox fault datasets. Results indicate that the EIRM-based method on average is more effective than the benchmarks. The code is available at <uri>https://github.com/mozhenling/doge-eirm</uri>.","PeriodicalId":13303,"journal":{"name":"IEEE transactions on neural networks and learning systems","volume":"36 8","pages":"15476-15489"},"PeriodicalIF":8.9000,"publicationDate":"2025-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on neural networks and learning systems","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10873284/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Incorrect labels as well as the discrepancy between training and test domain data distributions can significantly affect the effectiveness of supervised data-driven models in machine fault diagnosis applications. Such a challenge can be characterized as the noisy label-domain generalization (NL-DG) problem. In this article, the extended invariant risk minimization (EIRM) is developed, which incorporates flat minima seeking to address the NL-DG challenge. The ability of handling NL-DG is realized by shifting the gradient penalty base from the dummy classifier to the entire model. EIRM is shown to be closely related to locating a flat minimum, which is crucial for label noise (LN) robustness and model generalization. Explorations on function smoothness and algorithm convergence are offered to understand EIRM from the theoretical aspect. An efficient implementation of EIRM is also developed to construct the fault diagnosis model. The EIRM-based fault diagnosis method is compared with strong benchmarks on multiple NL-DG tasks using actuator and gearbox fault datasets. Results indicate that the EIRM-based method on average is more effective than the benchmarks. The code is available at https://github.com/mozhenling/doge-eirm.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

带有标签噪声和数据移位的机器故障诊断的扩展不变风险最小化

不正确的标签以及训练域和测试域数据分布之间的差异会严重影响监督数据驱动模型在机器故障诊断中的有效性。这种挑战可以被描述为噪声标签域泛化（NL-DG）问题。在本文中，开发了扩展不变风险最小化（EIRM），它包含了寻求平面最小值来解决NL-DG挑战。处理NL-DG的能力是通过将梯度惩罚基从虚拟分类器转移到整个模型来实现的。EIRM与定位平坦最小值密切相关，这对标签噪声（LN）的鲁棒性和模型泛化至关重要。对函数的平滑性和算法的收敛性进行了探索，从理论层面理解EIRM。提出了一种有效的EIRM实现方法来构建故障诊断模型。利用致动器和齿轮箱故障数据集，将基于eirm的故障诊断方法与多个NL-DG任务的强基准测试进行了比较。结果表明，平均而言，基于eirm的方法比基准方法更有效。代码可在https://github.com/mozhenling/doge-eirm上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on neural networks and learning systems COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-COMPUTER SCIENCE, HARDWARE & ARCHITECTURE

CiteScore

23.80

自引率

9.60%

发文量

2102

审稿时长

3-8 weeks

期刊介绍： The focus of IEEE Transactions on Neural Networks and Learning Systems is to present scholarly articles discussing the theory, design, and applications of neural networks as well as other learning systems. The journal primarily highlights technical and scientific research in this domain.

期刊最新文献

Modality-Mix Learning: Promoting Multimodal Learning Through Multilabel Objective. Evaluating Large Language Models on Named Entity Recognition. Rethinking Topic Modeling With Information Bottleneck Principle. Quantum Convolutional Neural Networks: A Survey on Architectures, Applications, and Future Directions A Joint Learning Framework for Document-Level Event Extraction