在未知分布数据集上对故障诊断模型的性能进行无标签评估

IF 8 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Advanced Engineering Informatics Pub Date : 2024-10-01 DOI:10.1016/j.aei.2024.102912

Zhenyu Liu , Haowen Zheng , Hui Liu , Weiqiang Jia , Jianrong Tan

{"title":"在未知分布数据集上对故障诊断模型的性能进行无标签评估","authors":"Zhenyu Liu , Haowen Zheng , Hui Liu , Weiqiang Jia , Jianrong Tan","doi":"10.1016/j.aei.2024.102912","DOIUrl":null,"url":null,"abstract":"<div><div>Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102912"},"PeriodicalIF":8.0000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Label-free evaluation for performance of fault diagnosis model on unknown distribution dataset\",\"authors\":\"Zhenyu Liu , Haowen Zheng , Hui Liu , Weiqiang Jia , Jianrong Tan\",\"doi\":\"10.1016/j.aei.2024.102912\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.</div></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"62 \",\"pages\":\"Article 102912\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034624005639\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624005639","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

实时数据可能会因运行条件变化和其他因素而发生分布漂移，从而影响在线故障诊断模型的分类准确性，并可能导致严重后果。因此，了解模型在实时数据上的分类准确性具有重要意义。然而，实时数据中没有标签，这给评估分类准确性带来了挑战。此外，故障诊断的实时性要求采用快速、直接的评估方法。基于上述原因，本文提出了一种在实时数据中评估模型分类准确性的方法，这种方法是在实时数据没有标签的情况下完成的。本文提出的无标签评估方法将模型的输出转化为一个标量，用来衡量分类概率之间的匹配程度，即平均自由能。然后，利用辅助数据集建立平均自由能与分类准确率之间的映射关系。最后，通过该映射和实时数据的平均自由能，预测模型在实时数据上的分类准确率。所提出的方法在公共数据集上进行了实验评估，证明了其在各方面的优越性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Label-free evaluation for performance of fault diagnosis model on unknown distribution dataset

Real-time data may undergo distribution drift due to changes in operating conditions and other factors, which can affect the classification accuracy of online fault diagnosis models and potentially lead to serious consequences. Therefore, understanding the classification accuracy of the model on real-time data holds substantial significance. However, the absence of labels in real-time data presents a challenge for evaluating classification accuracy. Furthermore, the real-time nature of fault diagnosis necessitates a swift and straightforward evaluation method. For the above reasons, this paper proposes a method for evaluating the classification accuracy of a model on real-time data, which is done in the absence of labels for the real-time data. The proposed label-free evaluation method transforms the model’s output into a scalar that measures the degree of matching between the classification probabilities, termed the average free energy. It then establishes a mapping between the average free energy and the classification accuracy using an auxiliary dataset. Finally, it predicts the model’s classification accuracy on the real-time data through this mapping and the average free energy of the real-time data. The proposed method is experimentally evaluated on public datasets, demonstrating its superiority in various aspects.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advanced Engineering Informatics 工程技术-工程：综合

CiteScore

12.40

自引率

18.20%

发文量

292

审稿时长

45 days

期刊介绍： Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.