Zisheng Wang , Jianping Xuan , Tielin Shi , Yan-Fu Li
{"title":"Multi-label domain adversarial reinforcement learning for unsupervised compound fault recognition","authors":"Zisheng Wang , Jianping Xuan , Tielin Shi , Yan-Fu Li","doi":"10.1016/j.ress.2024.110638","DOIUrl":null,"url":null,"abstract":"<div><div>Compound fault composed of coinstantaneous multiple faults frequently causes the failure of a manufacturing system, which greatly reduces the reliability. When measuring the compound fault, two difficulties generally exist: (1) the complex correlation between different single faults, and (2) collected target samples without labels. To accomplish the cross-domain unsupervised compound fault recognition, this study proposes a multi-label domain adversarial reinforcement learning (ML-DARL) framework that implements two multi-label deep reinforcement learning (ML-DRL) models with adversarial domain adaptation. First, a source ML-DRL model is adopted to train a source feature network (SFN) and a policy network by using a dataset with labels (source domain). Then, a discriminator and a target ML-DRL model that includes a target feature network (TFN) are jointly trained with adversarial adaptation by simultaneously using the dataset without labels (target domain) and the source domain. Specifically, two outputs of TFN and SFN are regarded as fake and real components, respectively. Notably, the reward function in the target ML-DRL model is related inversely to the output of the discriminator for the fake component. Finally, a cross-speed case and a cross-location case are executed to verify the adaptation ability of the proposed method on cross-domain unsupervised compound fault recognition.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":"254 ","pages":"Article 110638"},"PeriodicalIF":9.4000,"publicationDate":"2024-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832024007099","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
Compound fault composed of coinstantaneous multiple faults frequently causes the failure of a manufacturing system, which greatly reduces the reliability. When measuring the compound fault, two difficulties generally exist: (1) the complex correlation between different single faults, and (2) collected target samples without labels. To accomplish the cross-domain unsupervised compound fault recognition, this study proposes a multi-label domain adversarial reinforcement learning (ML-DARL) framework that implements two multi-label deep reinforcement learning (ML-DRL) models with adversarial domain adaptation. First, a source ML-DRL model is adopted to train a source feature network (SFN) and a policy network by using a dataset with labels (source domain). Then, a discriminator and a target ML-DRL model that includes a target feature network (TFN) are jointly trained with adversarial adaptation by simultaneously using the dataset without labels (target domain) and the source domain. Specifically, two outputs of TFN and SFN are regarded as fake and real components, respectively. Notably, the reward function in the target ML-DRL model is related inversely to the output of the discriminator for the fake component. Finally, a cross-speed case and a cross-location case are executed to verify the adaptation ability of the proposed method on cross-domain unsupervised compound fault recognition.
期刊介绍:
Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.