基于二维卷积神经网络和自关注一维LSTM的集成深度学习网络用于轴承故障诊断

IF 6.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Applied Soft Computing Pub Date : 2025-03-01 Epub Date: 2025-02-18 DOI:10.1016/j.asoc.2025.112889
Liying Wang, Weiguo Zhao
{"title":"基于二维卷积神经网络和自关注一维LSTM的集成深度学习网络用于轴承故障诊断","authors":"Liying Wang,&nbsp;Weiguo Zhao","doi":"10.1016/j.asoc.2025.112889","DOIUrl":null,"url":null,"abstract":"<div><div>Intelligent classification methods based on deep learning (DL) have become widely adopted for bearing fault diagnosis (BFD). However, it is acknowledged that relying on single feature extraction methods may not yield comprehensive representations of the information features. Additionally, DL-based approaches for extracting features from vibration signals typically utilize either one-dimensional (1D) or two-dimensional (2D) networks, which can restrict the network's ability to extract features effectively. In this paper, a time series data representation method called the relative angle matrix (RAM) method is firstly proposed. This method converts 1D time series into 2D images by calculating the angle differences between multiple vectors and a central vector, thereby extracting the hidden spatial features present in the original data. Then, this paper introduces an ensemble deep learning network called 1D2D-EDL, which integrates 1D-based and 2D-based DL mechanisms for feature extraction and classification, leveraging the strengths of each approach. The 1D2D-EDL comprises two channels: the 1D channel combines long short-term memory (LSTM) and multi-head self-attention (MSA) to process raw 1D time series data, facilitating feature extraction in both the time and frequency domains. Meanwhile, the 2D channel employs convolutional neural network (CNN) components to process 2D images for spatial feature extraction, which are derived from the original time series data using the RAM method. Finally, the feature information from these two channels is fused using a feature fusion method. To preliminarily validate the effectiveness of the RAM method, three competitive 2D conversion methods are employed, including Gramian angular difference field (GADF), Gramian angular sum field (GASF), and Markov transition field (MTF). These methods are applied alongside the proposed RAM method within the same CNN network for fault diagnosis testing. The results indicate that the RAM method significantly enhances the diagnostic accuracy of the CNN compared to the other 2D conversion methods. Furthermore, the bearing fault dataset from the University of Ottawa is utilized to validate the performance of the 1D2D-EDL. A comparative analysis with other DL methods using multiple statistical metrics demonstrates the superiority of the 1D2D-EDL. Specifically, when diagnosing faults under four different speed conditions, the 1D2D-EDL attains accuracy rates of 100 %, 99.33 %, 100 %, and 100 %, respectively. This study proposes the incorporation of a novel perspective classifier to enhance DL models for bearing fault diagnosis. The source code of RAM is available at <span><span>https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50737,"journal":{"name":"Applied Soft Computing","volume":"172 ","pages":"Article 112889"},"PeriodicalIF":6.6000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An ensemble deep learning network based on 2D convolutional neural network and 1D LSTM with self-attention for bearing fault diagnosis\",\"authors\":\"Liying Wang,&nbsp;Weiguo Zhao\",\"doi\":\"10.1016/j.asoc.2025.112889\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Intelligent classification methods based on deep learning (DL) have become widely adopted for bearing fault diagnosis (BFD). However, it is acknowledged that relying on single feature extraction methods may not yield comprehensive representations of the information features. Additionally, DL-based approaches for extracting features from vibration signals typically utilize either one-dimensional (1D) or two-dimensional (2D) networks, which can restrict the network's ability to extract features effectively. In this paper, a time series data representation method called the relative angle matrix (RAM) method is firstly proposed. This method converts 1D time series into 2D images by calculating the angle differences between multiple vectors and a central vector, thereby extracting the hidden spatial features present in the original data. Then, this paper introduces an ensemble deep learning network called 1D2D-EDL, which integrates 1D-based and 2D-based DL mechanisms for feature extraction and classification, leveraging the strengths of each approach. The 1D2D-EDL comprises two channels: the 1D channel combines long short-term memory (LSTM) and multi-head self-attention (MSA) to process raw 1D time series data, facilitating feature extraction in both the time and frequency domains. Meanwhile, the 2D channel employs convolutional neural network (CNN) components to process 2D images for spatial feature extraction, which are derived from the original time series data using the RAM method. Finally, the feature information from these two channels is fused using a feature fusion method. To preliminarily validate the effectiveness of the RAM method, three competitive 2D conversion methods are employed, including Gramian angular difference field (GADF), Gramian angular sum field (GASF), and Markov transition field (MTF). These methods are applied alongside the proposed RAM method within the same CNN network for fault diagnosis testing. The results indicate that the RAM method significantly enhances the diagnostic accuracy of the CNN compared to the other 2D conversion methods. Furthermore, the bearing fault dataset from the University of Ottawa is utilized to validate the performance of the 1D2D-EDL. A comparative analysis with other DL methods using multiple statistical metrics demonstrates the superiority of the 1D2D-EDL. Specifically, when diagnosing faults under four different speed conditions, the 1D2D-EDL attains accuracy rates of 100 %, 99.33 %, 100 %, and 100 %, respectively. This study proposes the incorporation of a novel perspective classifier to enhance DL models for bearing fault diagnosis. The source code of RAM is available at <span><span>https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50737,\"journal\":{\"name\":\"Applied Soft Computing\",\"volume\":\"172 \",\"pages\":\"Article 112889\"},\"PeriodicalIF\":6.6000,\"publicationDate\":\"2025-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Soft Computing\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1568494625002005\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/18 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Soft Computing","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1568494625002005","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/18 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

基于深度学习(DL)的智能分类方法已被广泛应用于轴承故障诊断(BFD)中。然而,人们承认,依赖于单一的特征提取方法可能无法产生信息特征的全面表示。此外,从振动信号中提取特征的基于dl的方法通常使用一维(1D)或二维(2D)网络,这可能会限制网络有效提取特征的能力。本文首次提出了一种时间序列数据表示方法——相对角度矩阵(RAM)法。该方法通过计算多个矢量与中心矢量之间的角度差,将一维时间序列转换为二维图像,从而提取原始数据中存在的隐藏空间特征。然后,本文介绍了一个称为1D2D-EDL的集成深度学习网络,该网络集成了基于1d和基于2d的深度学习机制来进行特征提取和分类,利用了每种方法的优势。1D2D-EDL包括两个通道:一维通道结合了长短期记忆(LSTM)和多头自注意(MSA)来处理原始一维时间序列数据,便于在时域和频域提取特征。同时,二维通道采用卷积神经网络(CNN)组件对二维图像进行空间特征提取,这些图像是使用RAM方法从原始时间序列数据中提取出来的。最后,利用特征融合方法对两个通道的特征信息进行融合。为了初步验证RAM方法的有效性,采用了葛兰曼角差场(GADF)、葛兰曼角和场(GASF)和马尔可夫跃迁场(MTF)三种相互竞争的二维转换方法。这些方法与所提出的RAM方法一起应用于同一CNN网络中进行故障诊断测试。结果表明,与其他二维转换方法相比,RAM方法显著提高了CNN的诊断准确率。此外,利用渥太华大学的轴承故障数据集来验证1D2D-EDL的性能。使用多个统计指标与其他深度学习方法进行比较分析,证明了1D2D-EDL的优越性。具体来说,在4种不同转速条件下,1D2D-EDL的故障诊断准确率分别达到了100% %、99.33 %、100% %和100% %。本文提出了一种新的视角分类器来增强深度学习模型在轴承故障诊断中的应用。RAM的源代码可在https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
An ensemble deep learning network based on 2D convolutional neural network and 1D LSTM with self-attention for bearing fault diagnosis
Intelligent classification methods based on deep learning (DL) have become widely adopted for bearing fault diagnosis (BFD). However, it is acknowledged that relying on single feature extraction methods may not yield comprehensive representations of the information features. Additionally, DL-based approaches for extracting features from vibration signals typically utilize either one-dimensional (1D) or two-dimensional (2D) networks, which can restrict the network's ability to extract features effectively. In this paper, a time series data representation method called the relative angle matrix (RAM) method is firstly proposed. This method converts 1D time series into 2D images by calculating the angle differences between multiple vectors and a central vector, thereby extracting the hidden spatial features present in the original data. Then, this paper introduces an ensemble deep learning network called 1D2D-EDL, which integrates 1D-based and 2D-based DL mechanisms for feature extraction and classification, leveraging the strengths of each approach. The 1D2D-EDL comprises two channels: the 1D channel combines long short-term memory (LSTM) and multi-head self-attention (MSA) to process raw 1D time series data, facilitating feature extraction in both the time and frequency domains. Meanwhile, the 2D channel employs convolutional neural network (CNN) components to process 2D images for spatial feature extraction, which are derived from the original time series data using the RAM method. Finally, the feature information from these two channels is fused using a feature fusion method. To preliminarily validate the effectiveness of the RAM method, three competitive 2D conversion methods are employed, including Gramian angular difference field (GADF), Gramian angular sum field (GASF), and Markov transition field (MTF). These methods are applied alongside the proposed RAM method within the same CNN network for fault diagnosis testing. The results indicate that the RAM method significantly enhances the diagnostic accuracy of the CNN compared to the other 2D conversion methods. Furthermore, the bearing fault dataset from the University of Ottawa is utilized to validate the performance of the 1D2D-EDL. A comparative analysis with other DL methods using multiple statistical metrics demonstrates the superiority of the 1D2D-EDL. Specifically, when diagnosing faults under four different speed conditions, the 1D2D-EDL attains accuracy rates of 100 %, 99.33 %, 100 %, and 100 %, respectively. This study proposes the incorporation of a novel perspective classifier to enhance DL models for bearing fault diagnosis. The source code of RAM is available at https://ww2.mathworks.cn/matlabcentral/fileexchange/180197-relative-angle-matrix-ram.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Soft Computing
Applied Soft Computing 工程技术-计算机:跨学科应用
CiteScore
15.80
自引率
6.90%
发文量
874
审稿时长
10.9 months
期刊介绍: Applied Soft Computing is an international journal promoting an integrated view of soft computing to solve real life problems.The focus is to publish the highest quality research in application and convergence of the areas of Fuzzy Logic, Neural Networks, Evolutionary Computing, Rough Sets and other similar techniques to address real world complexities. Applied Soft Computing is a rolling publication: articles are published as soon as the editor-in-chief has accepted them. Therefore, the web site will continuously be updated with new articles and the publication time will be short.
期刊最新文献
ISTCLR: Improved sparse temporal contrastive learning for video representation ModuleCNN: A modular framework for unified CNN architecture and hyperparameter optimization via metaheuristics in image classification Modeling and treatment of social media addiction using a fractional-order and deep neural network approach Embedding contrastive memory into variational autoencoder for anomaly detection in IIoT systems A global and local encoder fusion network for pavement crack segmentation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1