基于二维卷积神经网络和自关注一维LSTM的集成深度学习网络用于轴承故障诊断

IF 6.6 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Applied Soft Computing Pub Date : 2025-03-01 Epub Date: 2025-02-18 DOI:10.1016/j.asoc.2025.112889

Liying Wang, Weiguo Zhao

{"title":"基于二维卷积神经网络和自关注一维LSTM的集成深度学习网络用于轴承故障诊断","authors":"Liying Wang, Weiguo Zhao","doi":"10.1016/j.asoc.2025.112889","DOIUrl":null,"url":null,"abstract":"<div><div>Intelligent classification methods based on deep learning (DL) have become widely adopted for bearing fault diagnosis (BFD). However, it is acknowledged that relying on single feature extraction methods may not yield comprehensive representations of the information features. Additionally, DL-based approaches for extracting features from vibration signals typically utilize either one-dimensional (1D) or two-dimensional (2D) networks, which can restrict the network's ability to extract features effectively. In this paper, a time series data representation method called the relative angle matrix (RAM) method is firstly proposed. This method converts 1D time series into 2D images by calculating the angle differences between multiple vectors and a central vector, thereby extracting the hidden spatial features present in the original data. Then, this paper introduces an ensemble deep learning network called 1D2D-EDL, which integrates 1D-based and 2D-based DL mechanisms for feature extraction and classification, leveraging the strengths of each approach. The 1D2D-EDL comprises two channels: the 1D channel combines long short-term memory (LSTM) and multi-head self-attention (MSA) to process raw 1D time series data, facilitating feature extraction in both the time and frequency domains. Meanwhile, the 2D channel employs convolutional neural network (CNN) components to process 2D images for spatial feature extraction, which are derived from the original time series data using the RAM method. Finally, the feature information from these two channels is fused using a feature fusion method. To preliminarily validate the effectiveness of the RAM method, three competitive 2D conversion methods are employed, including Gramian angular difference field (GADF), Gramian angular sum field (GASF), and Markov transition field (MTF). These methods are applied alongside the proposed RAM method within the same CNN network for fault diagnosis testing. The results indicate that the RAM method significantly enhances the diagnostic accuracy of the CNN compared to the other 2D conversion methods. Furthermore, the bearing fault dataset from the University of Ottawa is utilized to validate the performance of the 1D2D-EDL. A comparative analysis with other DL methods using multiple statistical metrics demonstrates the superiority of the 1D2D-EDL. Specifically, when diagnosing faults under four different speed conditions, the 1D2D-EDL attains accuracy rates of 100 %, 99.33 %, 100 %, and 100 %, respectively. This study proposes the incorporation of a novel perspective classifier to enhance DL models for bearing fault diagnosis. The source code of RAM is available at <span><span>https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"172 ","pages":"Article 112889"},"PeriodicalIF":6.6000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An ensemble deep learning network based on 2D convolutional neural network and 1D LSTM with self-attention for bearing fault diagnosis\",\"authors\":\"Liying Wang, Weiguo Zhao\",\"doi\":\"10.1016/j.asoc.2025.112889\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Intelligent classification methods based on deep learning (DL) have become widely adopted for bearing fault diagnosis (BFD). However, it is acknowledged that relying on single feature extraction methods may not yield comprehensive representations of the information features. Additionally, DL-based approaches for extracting features from vibration signals typically utilize either one-dimensional (1D) or two-dimensional (2D) networks, which can restrict the network's ability to extract features effectively. In this paper, a time series data representation method called the relative angle matrix (RAM) method is firstly proposed. This method converts 1D time series into 2D images by calculating the angle differences between multiple vectors and a central vector, thereby extracting the hidden spatial features present in the original data. Then, this paper introduces an ensemble deep learning network called 1D2D-EDL, which integrates 1D-based and 2D-based DL mechanisms for feature extraction and classification, leveraging the strengths of each approach. The 1D2D-EDL comprises two channels: the 1D channel combines long short-term memory (LSTM) and multi-head self-attention (MSA) to process raw 1D time series data, facilitating feature extraction in both the time and frequency domains. Meanwhile, the 2D channel employs convolutional neural network (CNN) components to process 2D images for spatial feature extraction, which are derived from the original time series data using the RAM method. Finally, the feature information from these two channels is fused using a feature fusion method. To preliminarily validate the effectiveness of the RAM method, three competitive 2D conversion methods are employed, including Gramian angular difference field (GADF), Gramian angular sum field (GASF), and Markov transition field (MTF). These methods are applied alongside the proposed RAM method within the same CNN network for fault diagnosis testing. The results indicate that the RAM method significantly enhances the diagnostic accuracy of the CNN compared to the other 2D conversion methods. Furthermore, the bearing fault dataset from the University of Ottawa is utilized to validate the performance of the 1D2D-EDL. A comparative analysis with other DL methods using multiple statistical metrics demonstrates the superiority of the 1D2D-EDL. Specifically, when diagnosing faults under four different speed conditions, the 1D2D-EDL attains accuracy rates of 100 %, 99.33 %, 100 %, and 100 %, respectively. This study proposes the incorporation of a novel perspective classifier to enhance DL models for bearing fault diagnosis. The source code of RAM is available at <span><span>https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"172 \",\"pages\":\"Article 112889\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625002005\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/18 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625002005","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

基于深度学习（DL）的智能分类方法已被广泛应用于轴承故障诊断（BFD）中。然而，人们承认，依赖于单一的特征提取方法可能无法产生信息特征的全面表示。此外，从振动信号中提取特征的基于dl的方法通常使用一维（1D）或二维（2D）网络，这可能会限制网络有效提取特征的能力。本文首次提出了一种时间序列数据表示方法——相对角度矩阵（RAM）法。该方法通过计算多个矢量与中心矢量之间的角度差，将一维时间序列转换为二维图像，从而提取原始数据中存在的隐藏空间特征。然后，本文介绍了一个称为1D2D-EDL的集成深度学习网络，该网络集成了基于1d和基于2d的深度学习机制来进行特征提取和分类，利用了每种方法的优势。1D2D-EDL包括两个通道：一维通道结合了长短期记忆（LSTM）和多头自注意（MSA）来处理原始一维时间序列数据，便于在时域和频域提取特征。同时，二维通道采用卷积神经网络（CNN）组件对二维图像进行空间特征提取，这些图像是使用RAM方法从原始时间序列数据中提取出来的。最后，利用特征融合方法对两个通道的特征信息进行融合。为了初步验证RAM方法的有效性，采用了葛兰曼角差场（GADF）、葛兰曼角和场（GASF）和马尔可夫跃迁场（MTF）三种相互竞争的二维转换方法。这些方法与所提出的RAM方法一起应用于同一CNN网络中进行故障诊断测试。结果表明，与其他二维转换方法相比，RAM方法显著提高了CNN的诊断准确率。此外，利用渥太华大学的轴承故障数据集来验证1D2D-EDL的性能。使用多个统计指标与其他深度学习方法进行比较分析，证明了1D2D-EDL的优越性。具体来说，在4种不同转速条件下，1D2D-EDL的故障诊断准确率分别达到了100% %、99.33 %、100% %和100% %。本文提出了一种新的视角分类器来增强深度学习模型在轴承故障诊断中的应用。RAM的源代码可在https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram上获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

An ensemble deep learning network based on 2D convolutional neural network and 1D LSTM with self-attention for bearing fault diagnosis

Intelligent classification methods based on deep learning (DL) have become widely adopted for bearing fault diagnosis (BFD). However, it is acknowledged that relying on single feature extraction methods may not yield comprehensive representations of the information features. Additionally, DL-based approaches for extracting features from vibration signals typically utilize either one-dimensional (1D) or two-dimensional (2D) networks, which can restrict the network's ability to extract features effectively. In this paper, a time series data representation method called the relative angle matrix (RAM) method is firstly proposed. This method converts 1D time series into 2D images by calculating the angle differences between multiple vectors and a central vector, thereby extracting the hidden spatial features present in the original data. Then, this paper introduces an ensemble deep learning network called 1D2D-EDL, which integrates 1D-based and 2D-based DL mechanisms for feature extraction and classification, leveraging the strengths of each approach. The 1D2D-EDL comprises two channels: the 1D channel combines long short-term memory (LSTM) and multi-head self-attention (MSA) to process raw 1D time series data, facilitating feature extraction in both the time and frequency domains. Meanwhile, the 2D channel employs convolutional neural network (CNN) components to process 2D images for spatial feature extraction, which are derived from the original time series data using the RAM method. Finally, the feature information from these two channels is fused using a feature fusion method. To preliminarily validate the effectiveness of the RAM method, three competitive 2D conversion methods are employed, including Gramian angular difference field (GADF), Gramian angular sum field (GASF), and Markov transition field (MTF). These methods are applied alongside the proposed RAM method within the same CNN network for fault diagnosis testing. The results indicate that the RAM method significantly enhances the diagnostic accuracy of the CNN compared to the other 2D conversion methods. Furthermore, the bearing fault dataset from the University of Ottawa is utilized to validate the performance of the 1D2D-EDL. A comparative analysis with other DL methods using multiple statistical metrics demonstrates the superiority of the 1D2D-EDL. Specifically, when diagnosing faults under four different speed conditions, the 1D2D-EDL attains accuracy rates of 100 %, 99.33 %, 100 %, and 100 %, respectively. This study proposes the incorporation of a novel perspective classifier to enhance DL models for bearing fault diagnosis. The source code of RAM is available at https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Applied Soft Computing 工程技术-计算机：跨学科应用

CiteScore

15.80

自引率

6.90%

发文量

874

审稿时长

10.9 months

期刊介绍： Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.