Yue Yu , Hamid Reza Karimi , Len Gelman , Ahmet Enis Cetin
{"title":"MSIFT: A novel end-to-end mechanical fault diagnosis framework under limited & imbalanced data using multi-source information fusion","authors":"Yue Yu , Hamid Reza Karimi , Len Gelman , Ahmet Enis Cetin","doi":"10.1016/j.eswa.2025.126947","DOIUrl":null,"url":null,"abstract":"<div><div>Data-driven intelligent fault diagnosis methods have emerged as powerful tools for monitoring and maintaining the operating conditions of mechanical equipment. However, in real-world engineering scenarios, mechanical equipment typically operates under normal conditions, resulting in limited and imbalanced (L&I) data. This situation gives rise to label bias and biased training. Meanwhile, the current multi-source information fault diagnosis research to date has tended to focus on fault identification rather than effective feature fusion strategies. To solve these issues, a novel end-to-end mechanical fault diagnosis framework under limited & imbalanced data using multi-source information fusion is proposed to model data-level and algorithm-level ideas in a unified deep network for achieving effective multi-source information fusion under the L&I working conditions. From a data-level perspective, a data preprocessing operation is first employed to capture time–frequency information simultaneously. Subsequently, multi-source time–frequency information is fed into feature extractors with information discriminators to construct local and information-invariant feature maps with different scales to eliminate multi-source information domain shift. Then, the multi-source feature vectors are modeled by a multi-source information transformer-based neural network to achieve effective multi-source information fusion through cross-attention mechanism. Next, the global max pooling and global average pooling layers are leveraged to obtain the more representative features. Finally, from an algorithm-level perspective, a dual-stream diagnosis predictor with a binary diagnosis predictor and a multi-class diagnosis predictor is designed to synthesize the diagnostic results through a reweighing activation mechanism for addressing the L&I problems. Extensive experiments on four different multi-source information datasets show the superiority and promising performance of our method compared to the state-of-the-art methods, as evidenced by indicators from various aspects.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"274 ","pages":"Article 126947"},"PeriodicalIF":7.5000,"publicationDate":"2025-02-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095741742500569X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Data-driven intelligent fault diagnosis methods have emerged as powerful tools for monitoring and maintaining the operating conditions of mechanical equipment. However, in real-world engineering scenarios, mechanical equipment typically operates under normal conditions, resulting in limited and imbalanced (L&I) data. This situation gives rise to label bias and biased training. Meanwhile, the current multi-source information fault diagnosis research to date has tended to focus on fault identification rather than effective feature fusion strategies. To solve these issues, a novel end-to-end mechanical fault diagnosis framework under limited & imbalanced data using multi-source information fusion is proposed to model data-level and algorithm-level ideas in a unified deep network for achieving effective multi-source information fusion under the L&I working conditions. From a data-level perspective, a data preprocessing operation is first employed to capture time–frequency information simultaneously. Subsequently, multi-source time–frequency information is fed into feature extractors with information discriminators to construct local and information-invariant feature maps with different scales to eliminate multi-source information domain shift. Then, the multi-source feature vectors are modeled by a multi-source information transformer-based neural network to achieve effective multi-source information fusion through cross-attention mechanism. Next, the global max pooling and global average pooling layers are leveraged to obtain the more representative features. Finally, from an algorithm-level perspective, a dual-stream diagnosis predictor with a binary diagnosis predictor and a multi-class diagnosis predictor is designed to synthesize the diagnostic results through a reweighing activation mechanism for addressing the L&I problems. Extensive experiments on four different multi-source information datasets show the superiority and promising performance of our method compared to the state-of-the-art methods, as evidenced by indicators from various aspects.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.