Dynamic ensemble fault diagnosis framework with adaptive hierarchical sampling strategy for industrial imbalanced and overlapping data

IF 11 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Reliability Engineering & System Safety Pub Date : 2025-08-01 Epub Date: 2025-03-06 DOI:10.1016/j.ress.2025.110979
Haoyan Dong , Chuang Peng , Lei Chen , Kuangrong Hao
{"title":"Dynamic ensemble fault diagnosis framework with adaptive hierarchical sampling strategy for industrial imbalanced and overlapping data","authors":"Haoyan Dong ,&nbsp;Chuang Peng ,&nbsp;Lei Chen ,&nbsp;Kuangrong Hao","doi":"10.1016/j.ress.2025.110979","DOIUrl":null,"url":null,"abstract":"<div><div>The coexistence of class imbalance and class overlap significantly challenges fault diagnosis in modern industrial processes. Class imbalance, characterized by the scarcity of fault data, and class overlap, arising from similarities between normal and fault data as well as correlations among fault types, are intertwined issues that jointly degrade fault diagnosis performance. To address these coupled issues, this paper proposes a dynamic ensemble fault diagnosis framework with adaptive hierarchical sampling strategy (DEAHS). The framework employs a boosting ensemble structure, effectively mitigating class imbalance through dynamic majority class undersampling and reducing class overlap by focusing on minority classes in high-overlap regions. In the outer layer, a Markov decision process guides the adaptive undersampling of majority class, achieving relatively balanced subsets. In the inner layer, a membership entropy-based method identifies overlap regions, and a weighted oversampling strategy improves minority classes’ representation in these regions. The proposed framework is validated on the Tennessee Eastman process and a real-world polyester esterification process, where its performance is evaluated using four metrics commonly employed for imbalanced datasets. The results demonstrate that the proposed method achieves superior performance across a majority of metrics, highlighting its effectiveness in handling imbalanced and overlapping industrial fault data.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":"260 ","pages":"Article 110979"},"PeriodicalIF":11.0000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832025001826","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

Abstract

The coexistence of class imbalance and class overlap significantly challenges fault diagnosis in modern industrial processes. Class imbalance, characterized by the scarcity of fault data, and class overlap, arising from similarities between normal and fault data as well as correlations among fault types, are intertwined issues that jointly degrade fault diagnosis performance. To address these coupled issues, this paper proposes a dynamic ensemble fault diagnosis framework with adaptive hierarchical sampling strategy (DEAHS). The framework employs a boosting ensemble structure, effectively mitigating class imbalance through dynamic majority class undersampling and reducing class overlap by focusing on minority classes in high-overlap regions. In the outer layer, a Markov decision process guides the adaptive undersampling of majority class, achieving relatively balanced subsets. In the inner layer, a membership entropy-based method identifies overlap regions, and a weighted oversampling strategy improves minority classes’ representation in these regions. The proposed framework is validated on the Tennessee Eastman process and a real-world polyester esterification process, where its performance is evaluated using four metrics commonly employed for imbalanced datasets. The results demonstrate that the proposed method achieves superior performance across a majority of metrics, highlighting its effectiveness in handling imbalanced and overlapping industrial fault data.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于自适应分层采样策略的工业不平衡和重叠数据动态集成故障诊断框架
类不平衡和类重叠共存对现代工业过程的故障诊断提出了重大挑战。以故障数据的稀缺性为特征的类不平衡与正常数据与故障数据的相似性以及故障类型之间的相关性引起的类重叠是相互交织的问题,共同降低了故障诊断的性能。为了解决这些耦合问题,本文提出了一种具有自适应分层采样策略的动态集成故障诊断框架。该框架采用boost ensemble结构,通过动态多数类欠采样有效缓解类失衡,通过关注高重叠区域的少数类减少类重叠。在外层,马尔可夫决策过程引导多数类的自适应欠采样,实现相对平衡的子集。在内层,基于隶属度熵的方法识别重叠区域,加权过采样策略提高了这些区域中少数类的代表性。提出的框架在田纳西伊士曼过程和真实的聚酯酯化过程中进行了验证,其中使用通常用于不平衡数据集的四个指标来评估其性能。结果表明,该方法在大多数指标上都取得了优异的性能,突出了其在处理不平衡和重叠工业故障数据方面的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Reliability Engineering & System Safety
Reliability Engineering & System Safety 管理科学-工程:工业
CiteScore
15.20
自引率
39.50%
发文量
621
审稿时长
67 days
期刊介绍: Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.
期刊最新文献
Quantifying potential cyber-attack risks in CNC systems under zero-subjectivity closed-loop Dempster–Shafer theory FMECA and rule-based Bayesian network modelling Inactivity times of components upon system failure with application to missing data problems Domain knowledge-enhanced dual-stream graph joint learning network for aeroengine remaining useful life prediction Revealing the dynamics and multidimensional resilience of rainstorm-flood cascade disasters in mountain valley cities: An interpretable machine learning case study from Southwestern China Robustness of spatial interdependent networks under extreme geographically localized attacks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1