Multi-label domain adversarial reinforcement learning for unsupervised compound fault recognition

IF 9.4 1区 工程技术 Q1 ENGINEERING, INDUSTRIAL Reliability Engineering & System Safety Pub Date : 2024-11-15 DOI:10.1016/j.ress.2024.110638
Zisheng Wang , Jianping Xuan , Tielin Shi , Yan-Fu Li
{"title":"Multi-label domain adversarial reinforcement learning for unsupervised compound fault recognition","authors":"Zisheng Wang ,&nbsp;Jianping Xuan ,&nbsp;Tielin Shi ,&nbsp;Yan-Fu Li","doi":"10.1016/j.ress.2024.110638","DOIUrl":null,"url":null,"abstract":"<div><div>Compound fault composed of coinstantaneous multiple faults frequently causes the failure of a manufacturing system, which greatly reduces the reliability. When measuring the compound fault, two difficulties generally exist: (1) the complex correlation between different single faults, and (2) collected target samples without labels. To accomplish the cross-domain unsupervised compound fault recognition, this study proposes a multi-label domain adversarial reinforcement learning (ML-DARL) framework that implements two multi-label deep reinforcement learning (ML-DRL) models with adversarial domain adaptation. First, a source ML-DRL model is adopted to train a source feature network (SFN) and a policy network by using a dataset with labels (source domain). Then, a discriminator and a target ML-DRL model that includes a target feature network (TFN) are jointly trained with adversarial adaptation by simultaneously using the dataset without labels (target domain) and the source domain. Specifically, two outputs of TFN and SFN are regarded as fake and real components, respectively. Notably, the reward function in the target ML-DRL model is related inversely to the output of the discriminator for the fake component. Finally, a cross-speed case and a cross-location case are executed to verify the adaptation ability of the proposed method on cross-domain unsupervised compound fault recognition.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":"254 ","pages":"Article 110638"},"PeriodicalIF":9.4000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832024007099","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0

Abstract

Compound fault composed of coinstantaneous multiple faults frequently causes the failure of a manufacturing system, which greatly reduces the reliability. When measuring the compound fault, two difficulties generally exist: (1) the complex correlation between different single faults, and (2) collected target samples without labels. To accomplish the cross-domain unsupervised compound fault recognition, this study proposes a multi-label domain adversarial reinforcement learning (ML-DARL) framework that implements two multi-label deep reinforcement learning (ML-DRL) models with adversarial domain adaptation. First, a source ML-DRL model is adopted to train a source feature network (SFN) and a policy network by using a dataset with labels (source domain). Then, a discriminator and a target ML-DRL model that includes a target feature network (TFN) are jointly trained with adversarial adaptation by simultaneously using the dataset without labels (target domain) and the source domain. Specifically, two outputs of TFN and SFN are regarded as fake and real components, respectively. Notably, the reward function in the target ML-DRL model is related inversely to the output of the discriminator for the fake component. Finally, a cross-speed case and a cross-location case are executed to verify the adaptation ability of the proposed method on cross-domain unsupervised compound fault recognition.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于无监督复合故障识别的多标签域对抗强化学习
由同时发生的多个故障组成的复合故障经常导致制造系统失灵,从而大大降低了可靠性。在测量复合故障时,一般存在两个困难:(1)不同单一故障之间存在复杂的相关性;(2)采集的目标样本没有标签。为了实现跨域无监督复合故障识别,本研究提出了一种多标签域对抗强化学习(ML-DARL)框架,该框架实现了两个具有对抗域适应性的多标签深度强化学习(ML-DRL)模型。首先,采用源 ML-DRL 模型,利用带标签的数据集(源域)训练源特征网络(SFN)和策略网络。然后,通过同时使用无标签数据集(目标域)和源域,联合训练包含目标特征网络(TFN)的判别器和目标 ML-DRL 模型。具体来说,TFN 和 SFN 的两个输出分别被视为假成分和真成分。值得注意的是,目标 ML-DRL 模型中的奖励函数与假分量判别器的输出成反比关系。最后,通过跨速度案例和跨位置案例验证了所提方法对跨域无监督复合故障识别的适应能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Reliability Engineering & System Safety
Reliability Engineering & System Safety 管理科学-工程:工业
CiteScore
15.20
自引率
39.50%
发文量
621
审稿时长
67 days
期刊介绍: Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.
期刊最新文献
Lifting wavelet-informed hierarchical domain adaptation network: An interpretable digital twin-driven gearbox fault diagnosis method A novel sim2real reinforcement learning algorithm for process control Multi-label domain adversarial reinforcement learning for unsupervised compound fault recognition High-efficient non-iterative reliability-based design optimization based on the design space virtually conditionalized reliability evaluation method Pseudo-label assisted contrastive learning model for unsupervised open-set domain adaptation in fault diagnosis
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1