{"title":"Random convolution layer: an auxiliary method to improve fault diagnosis performance","authors":"Zhiqian Zhao, Runchao Zhao, Yinghou Jiao","doi":"10.1007/s10845-024-02458-4","DOIUrl":null,"url":null,"abstract":"<p>In real industry, it is often difficult to obtain large-scale labeled data. Existing Convolutional Neural Network (CNN)-based fault diagnosis methods often struggle to achieve accurate diagnoses of machine conditions due to the scarcity of labeled data, hindering the ability of models to develop strong inductive biases. We propose a plug-and-play auxiliary method, random convolution layer (RCL), to improve the generalization performance of the fault diagnosis models. This method delves into the fundamental commonalities across diverse tasks and varying network structures, thereby enhancing the diversity of samples to establish a more robust source domain environment. The RCL preserves the dimensional nature of the data in the time domain while randomly altering the kernel sizes during convolution operations, thus generating new data without compromising global information. During the training process, the newly generated data is mixed with the original data and fed into the fault diagnosis model. RCL is incorporated as a module into the inputs of different fault diagnosis models, and its effectiveness is validated on three public datasets as well as a self-built testbed. The results show that the present auxiliary method improves the domain generalization performance of the baselines, and can improve the accuracy of the corresponding fault diagnosis models. Our code is available at https://github.com/zhiqan/Random-convolution-layer.</p>","PeriodicalId":16193,"journal":{"name":"Journal of Intelligent Manufacturing","volume":"5 1","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2024-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent Manufacturing","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s10845-024-02458-4","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
In real industry, it is often difficult to obtain large-scale labeled data. Existing Convolutional Neural Network (CNN)-based fault diagnosis methods often struggle to achieve accurate diagnoses of machine conditions due to the scarcity of labeled data, hindering the ability of models to develop strong inductive biases. We propose a plug-and-play auxiliary method, random convolution layer (RCL), to improve the generalization performance of the fault diagnosis models. This method delves into the fundamental commonalities across diverse tasks and varying network structures, thereby enhancing the diversity of samples to establish a more robust source domain environment. The RCL preserves the dimensional nature of the data in the time domain while randomly altering the kernel sizes during convolution operations, thus generating new data without compromising global information. During the training process, the newly generated data is mixed with the original data and fed into the fault diagnosis model. RCL is incorporated as a module into the inputs of different fault diagnosis models, and its effectiveness is validated on three public datasets as well as a self-built testbed. The results show that the present auxiliary method improves the domain generalization performance of the baselines, and can improve the accuracy of the corresponding fault diagnosis models. Our code is available at https://github.com/zhiqan/Random-convolution-layer.
期刊介绍:
The Journal of Nonlinear Engineering aims to be a platform for sharing original research results in theoretical, experimental, practical, and applied nonlinear phenomena within engineering. It serves as a forum to exchange ideas and applications of nonlinear problems across various engineering disciplines. Articles are considered for publication if they explore nonlinearities in engineering systems, offering realistic mathematical modeling, utilizing nonlinearity for new designs, stabilizing systems, understanding system behavior through nonlinearity, optimizing systems based on nonlinear interactions, and developing algorithms to harness and leverage nonlinear elements.