首页 > 最新文献

IET Signal Processing最新文献

英文 中文
Small Sample Fiber Full State Diagnosis Based on Fuzzy Clustering and Improved ResNet Network 基于模糊聚类和改进的 ResNet 网络的小样本光纤全状态诊断
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-02-08 DOI: 10.1049/2024/5512014
Xiangqun Li, Jiawen Liang, Jinyu Zhu, Shengping Shi, Fangyu Ding, Jianpeng Sun, Bo Liu

The optical time domain reflectometer (OTDR) curve features of communication fibers exhibit subtle differences among their normal, subhealthy, and faulty operating states, making it challenging for existing machine learning-based fault diagnosis algorithms to extract these minute features. In addition, the OTDR curve field fault data are scarce, and data-driven deep neural network that needs a lot of data training cannot meet the requirements. In response to this issue, this paper proposes a communication fiber state diagnosis model based on fuzzy clustering and an improved ResNet. First, the pretrained residual network (ResNet) is modified by removing the classification layer and retaining the feature extraction layers. A global average pooling (GAP) layer is designed as a replacement for the fully connected layer. Second, fuzzy clustering, instead of the softmax classification layer, is employed in ResNet for its characteristic of requiring no subsequent data training. The improved model requires only a small amount of sample training to optimize the parameters of the GAP layer, thereby accommodating state diagnosis in scenarios with limited data availability. During the diagnosis process, the OTDR curves are input into the network, resulting in 512 features outputted in the GAP layer. These features are used to construct a feature vector matrix, and a dynamic clustering graph is formed using fuzzy clustering to realize the fiber state diagnosis. Through on-site data detection and validation, it has been demonstrated that the improved ResNet can effectively identify the full cycle of fiber states.

通信光纤的光时域反射仪(OTDR)曲线特征在其正常、亚健康和故障运行状态之间表现出细微差别,这使得现有的基于机器学习的故障诊断算法在提取这些细微特征时面临挑战。此外,OTDR 曲线现场故障数据稀少,需要大量数据训练的数据驱动型深度神经网络无法满足要求。针对这一问题,本文提出了一种基于模糊聚类和改进的 ResNet 的通信光纤状态诊断模型。首先,对预训练的残差网络(ResNet)进行了改进,去掉了分类层,保留了特征提取层。设计了全局平均池化(GAP)层来替代全连接层。其次,ResNet 采用了模糊聚类,而不是软最大分类层,因为它具有无需后续数据训练的特点。改进后的模型只需要少量的样本训练就能优化 GAP 层的参数,从而适应数据有限的情况下的状态诊断。在诊断过程中,OTDR 曲线被输入网络,从而在 GAP 层输出 512 个特征。这些特征用于构建特征向量矩阵,并利用模糊聚类形成动态聚类图,从而实现光纤状态诊断。通过现场数据检测和验证,证明改进后的 ResNet 可以有效识别全周期的光纤状态。
{"title":"Small Sample Fiber Full State Diagnosis Based on Fuzzy Clustering and Improved ResNet Network","authors":"Xiangqun Li,&nbsp;Jiawen Liang,&nbsp;Jinyu Zhu,&nbsp;Shengping Shi,&nbsp;Fangyu Ding,&nbsp;Jianpeng Sun,&nbsp;Bo Liu","doi":"10.1049/2024/5512014","DOIUrl":"10.1049/2024/5512014","url":null,"abstract":"<div>\u0000 <p>The optical time domain reflectometer (OTDR) curve features of communication fibers exhibit subtle differences among their normal, subhealthy, and faulty operating states, making it challenging for existing machine learning-based fault diagnosis algorithms to extract these minute features. In addition, the OTDR curve field fault data are scarce, and data-driven deep neural network that needs a lot of data training cannot meet the requirements. In response to this issue, this paper proposes a communication fiber state diagnosis model based on fuzzy clustering and an improved ResNet. First, the pretrained residual network (ResNet) is modified by removing the classification layer and retaining the feature extraction layers. A global average pooling (GAP) layer is designed as a replacement for the fully connected layer. Second, fuzzy clustering, instead of the softmax classification layer, is employed in ResNet for its characteristic of requiring no subsequent data training. The improved model requires only a small amount of sample training to optimize the parameters of the GAP layer, thereby accommodating state diagnosis in scenarios with limited data availability. During the diagnosis process, the OTDR curves are input into the network, resulting in 512 features outputted in the GAP layer. These features are used to construct a feature vector matrix, and a dynamic clustering graph is formed using fuzzy clustering to realize the fiber state diagnosis. Through on-site data detection and validation, it has been demonstrated that the improved ResNet can effectively identify the full cycle of fiber states.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5512014","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139852307","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Unsupervised Deep Learning Framework for Retrospective Gating of Catheter-Based Cardiac Imaging 用于导管式心脏成像回溯选通的无监督深度学习框架
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-02-05 DOI: 10.1049/2024/5664618
Zheng Sun, Yue Yao, Ru Wang
Motion artifacts are a major challenge in the in vivo application of catheter-based cardiac imaging modalities. Gating is a critical tool for suppressing motion artifacts. Electrocardiogram (ECG) gating requires a trigger device or synchronous ECG recordings for retrospective analysis. Existing retrospective software gating methods extract gating signals through separate steps based on changes in vessel morphology or image features, which require a high computational cost and are prone to error accumulation. In this paper, we report on an end-to-end unsupervised learning framework for retrospective image-based gating (IBG) of catheter-based intracoronary images, named IBG Network. It establishes a direct mapping from a continuously acquired image sequence to a gated subsequence. The network was trained on clinical data sets in an unsupervised manner, addressing the difficulty of obtaining the gold standard in deep learning-based motion suppression techniques. Experimental results of in vivo intravascular ultrasound and optical coherence tomography sequences show that the proposed method has better performance in terms of motion artifact suppression and processing efficiency compared with the state-of-the-art nonlearning signal-based and IBG methods.
运动伪影是基于导管的心脏成像模式在体内应用的一大挑战。选通是抑制运动伪影的关键工具。心电图(ECG)选通需要触发设备或同步心电图记录,以便进行回顾性分析。现有的回顾性软件选通方法是根据血管形态或图像特征的变化,通过单独的步骤提取选通信号,这需要很高的计算成本,而且容易造成误差累积。在本文中,我们报告了一种端到端的无监督学习框架,用于基于导管的冠脉内图像的回顾性图像选通(IBG),该框架被命名为 IBG 网络。它建立了从连续采集的图像序列到门控子序列的直接映射。该网络以无监督方式在临床数据集上进行训练,解决了基于深度学习的运动抑制技术难以获得黄金标准的难题。体内血管内超声和光学相干断层扫描序列的实验结果表明,与最先进的基于非学习信号的方法和 IBG 方法相比,所提出的方法在运动伪影抑制和处理效率方面有更好的表现。
{"title":"An Unsupervised Deep Learning Framework for Retrospective Gating of Catheter-Based Cardiac Imaging","authors":"Zheng Sun, Yue Yao, Ru Wang","doi":"10.1049/2024/5664618","DOIUrl":"https://doi.org/10.1049/2024/5664618","url":null,"abstract":"Motion artifacts are a major challenge in the in vivo application of catheter-based cardiac imaging modalities. Gating is a critical tool for suppressing motion artifacts. Electrocardiogram (ECG) gating requires a trigger device or synchronous ECG recordings for retrospective analysis. Existing retrospective software gating methods extract gating signals through separate steps based on changes in vessel morphology or image features, which require a high computational cost and are prone to error accumulation. In this paper, we report on an end-to-end unsupervised learning framework for retrospective image-based gating (IBG) of catheter-based intracoronary images, named IBG Network. It establishes a direct mapping from a continuously acquired image sequence to a gated subsequence. The network was trained on clinical data sets in an unsupervised manner, addressing the difficulty of obtaining the gold standard in deep learning-based motion suppression techniques. Experimental results of in vivo intravascular ultrasound and optical coherence tomography sequences show that the proposed method has better performance in terms of motion artifact suppression and processing efficiency compared with the state-of-the-art nonlearning signal-based and IBG methods.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139862886","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Unsupervised Deep Learning Framework for Retrospective Gating of Catheter-Based Cardiac Imaging 用于导管式心脏成像回溯选通的无监督深度学习框架
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-02-05 DOI: 10.1049/2024/5664618
Zheng Sun, Yue Yao, Ru Wang

Motion artifacts are a major challenge in the in vivo application of catheter-based cardiac imaging modalities. Gating is a critical tool for suppressing motion artifacts. Electrocardiogram (ECG) gating requires a trigger device or synchronous ECG recordings for retrospective analysis. Existing retrospective software gating methods extract gating signals through separate steps based on changes in vessel morphology or image features, which require a high computational cost and are prone to error accumulation. In this paper, we report on an end-to-end unsupervised learning framework for retrospective image-based gating (IBG) of catheter-based intracoronary images, named IBG Network. It establishes a direct mapping from a continuously acquired image sequence to a gated subsequence. The network was trained on clinical data sets in an unsupervised manner, addressing the difficulty of obtaining the gold standard in deep learning-based motion suppression techniques. Experimental results of in vivo intravascular ultrasound and optical coherence tomography sequences show that the proposed method has better performance in terms of motion artifact suppression and processing efficiency compared with the state-of-the-art nonlearning signal-based and IBG methods.

运动伪影是基于导管的心脏成像模式在体内应用的一大挑战。选通是抑制运动伪影的关键工具。心电图(ECG)选通需要触发设备或同步心电图记录,以便进行回顾性分析。现有的回顾性软件选通方法是根据血管形态或图像特征的变化,通过单独的步骤提取选通信号,这需要很高的计算成本,而且容易造成误差累积。在本文中,我们报告了一种端到端的无监督学习框架,用于基于导管的冠脉内图像的回顾性图像选通(IBG),该框架被命名为 IBG 网络。它建立了从连续采集的图像序列到门控子序列的直接映射。该网络以无监督方式在临床数据集上进行训练,解决了基于深度学习的运动抑制技术难以获得黄金标准的难题。体内血管内超声和光学相干断层扫描序列的实验结果表明,与最先进的基于非学习信号的方法和 IBG 方法相比,所提出的方法在运动伪影抑制和处理效率方面有更好的表现。
{"title":"An Unsupervised Deep Learning Framework for Retrospective Gating of Catheter-Based Cardiac Imaging","authors":"Zheng Sun,&nbsp;Yue Yao,&nbsp;Ru Wang","doi":"10.1049/2024/5664618","DOIUrl":"10.1049/2024/5664618","url":null,"abstract":"<div>\u0000 <p>Motion artifacts are a major challenge in the <i>in vivo</i> application of catheter-based cardiac imaging modalities. Gating is a critical tool for suppressing motion artifacts. Electrocardiogram (ECG) gating requires a trigger device or synchronous ECG recordings for retrospective analysis. Existing retrospective software gating methods extract gating signals through separate steps based on changes in vessel morphology or image features, which require a high computational cost and are prone to error accumulation. In this paper, we report on an end-to-end unsupervised learning framework for retrospective image-based gating (IBG) of catheter-based intracoronary images, named IBG Network. It establishes a direct mapping from a continuously acquired image sequence to a gated subsequence. The network was trained on clinical data sets in an unsupervised manner, addressing the difficulty of obtaining the gold standard in deep learning-based motion suppression techniques. Experimental results of <i>in vivo</i> intravascular ultrasound and optical coherence tomography sequences show that the proposed method has better performance in terms of motion artifact suppression and processing efficiency compared with the state-of-the-art nonlearning signal-based and IBG methods.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5664618","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139803067","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power Resource Allocation Algorithm for Dual-Function Radar–Communication System 双功能雷达通信系统的功率资源分配算法
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-02-02 DOI: 10.1049/2024/5072597
Yue Xiao, Zhenkai Zhang, Xiaoke Shang

In this paper, a power allocation algorithm of dual-function radar–communication system with limited power is proposed to obtain better overall system performance measured by the weighted summation of radar signal to interference plus noise ratio (SINR) and communication channel capacity. First, a power allocation model is established to maximize the radar SINR and communication channel capacity with limited transmitted power. Then, the Karush–Kuhn–Tucker (KKT) conditions are used to solve the optimal objective function under the condition that only radar SINR or communication channel capacity is considered, respectively. Finally, the optimal value is combined with the original model and transformed into a single objective optimization model, and the optimal power is obtained by solving the model through the iterative optimization algorithm. Simulation results show that, compared with other power allocation algorithms, the proposed algorithm can achieve better radar-communication integration performance under the same transmit power.

本文提出了一种功率有限的双功能雷达-通信系统功率分配算法,以获得更好的系统整体性能(雷达信号干扰加噪声比(SINR)和通信信道容量的加权和)。首先,建立了一个功率分配模型,以在有限发射功率下实现雷达信噪比和通信信道容量的最大化。然后,分别在只考虑雷达信噪比或通信信道容量的条件下,利用卡鲁什-库恩-塔克(KKT)条件求解最优目标函数。最后,将最优值与原始模型相结合,转化为单目标优化模型,通过迭代优化算法求解模型,得到最优功率。仿真结果表明,与其他功率分配算法相比,所提出的算法能在相同发射功率下实现更好的雷达-通信一体化性能。
{"title":"Power Resource Allocation Algorithm for Dual-Function Radar–Communication System","authors":"Yue Xiao,&nbsp;Zhenkai Zhang,&nbsp;Xiaoke Shang","doi":"10.1049/2024/5072597","DOIUrl":"10.1049/2024/5072597","url":null,"abstract":"<div>\u0000 <p>In this paper, a power allocation algorithm of dual-function radar–communication system with limited power is proposed to obtain better overall system performance measured by the weighted summation of radar signal to interference plus noise ratio (SINR) and communication channel capacity. First, a power allocation model is established to maximize the radar SINR and communication channel capacity with limited transmitted power. Then, the Karush–Kuhn–Tucker (KKT) conditions are used to solve the optimal objective function under the condition that only radar SINR or communication channel capacity is considered, respectively. Finally, the optimal value is combined with the original model and transformed into a single objective optimization model, and the optimal power is obtained by solving the model through the iterative optimization algorithm. Simulation results show that, compared with other power allocation algorithms, the proposed algorithm can achieve better radar-communication integration performance under the same transmit power.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5072597","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139870587","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Power Resource Allocation Algorithm for Dual-Function Radar–Communication System 双功能雷达通信系统的功率资源分配算法
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-02-02 DOI: 10.1049/2024/5072597
Yue Xiao, Zhenkai Zhang, Xiaoke Shang
In this paper, a power allocation algorithm of dual-function radar–communication system with limited power is proposed to obtain better overall system performance measured by the weighted summation of radar signal to interference plus noise ratio (SINR) and communication channel capacity. First, a power allocation model is established to maximize the radar SINR and communication channel capacity with limited transmitted power. Then, the Karush–Kuhn–Tucker (KKT) conditions are used to solve the optimal objective function under the condition that only radar SINR or communication channel capacity is considered, respectively. Finally, the optimal value is combined with the original model and transformed into a single objective optimization model, and the optimal power is obtained by solving the model through the iterative optimization algorithm. Simulation results show that, compared with other power allocation algorithms, the proposed algorithm can achieve better radar-communication integration performance under the same transmit power.
本文提出了一种功率有限的双功能雷达-通信系统功率分配算法,以获得更好的系统整体性能(雷达信号干扰加噪声比(SINR)和通信信道容量的加权和)。首先,建立了一个功率分配模型,以在有限发射功率下实现雷达信噪比和通信信道容量的最大化。然后,分别在只考虑雷达信噪比或通信信道容量的条件下,利用卡鲁什-库恩-塔克(KKT)条件求解最优目标函数。最后,将最优值与原始模型相结合,转化为单目标优化模型,通过迭代优化算法求解模型,得到最优功率。仿真结果表明,与其他功率分配算法相比,所提出的算法能在相同发射功率下实现更好的雷达-通信一体化性能。
{"title":"Power Resource Allocation Algorithm for Dual-Function Radar–Communication System","authors":"Yue Xiao, Zhenkai Zhang, Xiaoke Shang","doi":"10.1049/2024/5072597","DOIUrl":"https://doi.org/10.1049/2024/5072597","url":null,"abstract":"In this paper, a power allocation algorithm of dual-function radar–communication system with limited power is proposed to obtain better overall system performance measured by the weighted summation of radar signal to interference plus noise ratio (SINR) and communication channel capacity. First, a power allocation model is established to maximize the radar SINR and communication channel capacity with limited transmitted power. Then, the Karush–Kuhn–Tucker (KKT) conditions are used to solve the optimal objective function under the condition that only radar SINR or communication channel capacity is considered, respectively. Finally, the optimal value is combined with the original model and transformed into a single objective optimization model, and the optimal power is obtained by solving the model through the iterative optimization algorithm. Simulation results show that, compared with other power allocation algorithms, the proposed algorithm can achieve better radar-communication integration performance under the same transmit power.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139810955","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Speech Enhancement Using Joint DNN-NMF Model Learned with Multi-Objective Frequency Differential Spectrum Loss Function 使用利用多目标频谱差分损失函数学习的 DNN-NMF 联合模型进行语音增强
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-01-24 DOI: 10.1049/2024/8881007
Matin Pashaian, Sanaz Seyedin

We propose a multi-objective joint model of non-negative matrix factorization (NMF) and deep neural network (DNN) with a new loss function for speech enhancement. The proposed loss function (LMOFD) is a weighted combination of a frequency differential spectrum mean squared error (MSE)-based loss function (LFD) and a multi-objective MSE loss function (LMO). The conventional MSE loss function computes the discrepancy between the estimated speech and clean speech across all frequencies, disregarding the process of changing amplitude in the frequency domain which contains valuable information. The differential spectrum representation retains spectral peaks that carry important information. Using this representation helps to ensure that this information in the speech signal is reserved. Also, on the other hand, noise spectra typically have a flat shape and as the differential operation makes the flat spectral partly close to zero, the differential spectrum is resistant to noises with smooth structures. Thus, we propose using a frequency-differentiated loss function that considers the magnitude spectrum differentiations between the neighboring frequency bins in each time frame. This approach maintains the spectrum variations of the objective signal in the frequency domain, which can effectively reduce the noise deterioration effects. The multi-objective MSE term (LMO) is a combined two-loss function related to the NMF coefficients which are the intermediate output targets, and the original spectral signals as the actual output targets. The use of encoded NMF coefficients as low-dimensional structural features for DNN serves as prior knowledge and helps the learning process. LMO is used beside LFD to take advantage of both the properties of the original and the differential spectrum in the training loss function. Moreover, a DNN-based noise classification and fusion strategy (NCF) is proposed to exploit a discriminative model for noise reduction. The experiments reveal the improvements of the proposed approach compared to the previous methods.

我们提出了一种非负矩阵因式分解(NMF)和深度神经网络(DNN)的多目标联合模型,并为语音增强提供了一种新的损失函数。所提出的损失函数(LMOFD)是基于频谱均方误差(MSE)的损失函数(LFD)和多目标 MSE 损失函数 LMO 的加权组合。传统的 MSE 损失函数计算的是估计语音与干净语音在所有频率上的差异,忽略了频域中包含有价值信息的振幅变化过程。微分频谱表示法保留了包含重要信息的频谱峰值。使用这种表示法有助于确保保留语音信号中的这些信息。此外,另一方面,噪声频谱通常具有平坦的形状,由于差分操作会使平坦频谱部分接近零,因此差分频谱对具有平滑结构的噪声具有抵抗力。因此,我们建议使用频率差分损失函数,该函数考虑了每个时间帧中相邻频带之间的幅度频谱差分。这种方法保持了目标信号在频域中的频谱变化,可以有效降低噪声劣化效应。多目标 MSE 项 LMO 是与作为中间输出目标的 NMF 系数和作为实际输出目标的原始频谱信号相关的两个损失函数的组合。将编码的 NMF 系数作为 DNN 的低维结构特征,可作为先验知识,有助于学习过程。LMO 与 LFD 并用,在训练损失函数中利用了原始频谱和差分频谱的特性。此外,还提出了一种基于 DNN 的噪声分类和融合策略(NCF),以利用判别模型来降低噪声。实验表明,与之前的方法相比,所提出的方法有了很大的改进。
{"title":"Speech Enhancement Using Joint DNN-NMF Model Learned with Multi-Objective Frequency Differential Spectrum Loss Function","authors":"Matin Pashaian,&nbsp;Sanaz Seyedin","doi":"10.1049/2024/8881007","DOIUrl":"10.1049/2024/8881007","url":null,"abstract":"<div>\u0000 <p>We propose a multi-objective joint model of non-negative matrix factorization (NMF) and deep neural network (DNN) with a new loss function for speech enhancement. The proposed loss function (<i>L</i><sub>MOFD</sub>) is a weighted combination of a frequency differential spectrum mean squared error (MSE)-based loss function (<i>L</i><sub>FD</sub>) and a multi-objective MSE loss function (<i>L</i><sub>MO</sub>). The conventional MSE loss function computes the discrepancy between the estimated speech and clean speech across all frequencies, disregarding the process of changing amplitude in the frequency domain which contains valuable information. The differential spectrum representation retains spectral peaks that carry important information. Using this representation helps to ensure that this information in the speech signal is reserved. Also, on the other hand, noise spectra typically have a flat shape and as the differential operation makes the flat spectral partly close to zero, the differential spectrum is resistant to noises with smooth structures. Thus, we propose using a frequency-differentiated loss function that considers the magnitude spectrum differentiations between the neighboring frequency bins in each time frame. This approach maintains the spectrum variations of the objective signal in the frequency domain, which can effectively reduce the noise deterioration effects. The multi-objective MSE term (<i>L</i><sub>MO</sub>) is a combined two-loss function related to the NMF coefficients which are the intermediate output targets, and the original spectral signals as the actual output targets. The use of encoded NMF coefficients as low-dimensional structural features for DNN serves as prior knowledge and helps the learning process. <i>L</i><sub>MO</sub> is used beside <i>L</i><sub>FD</sub> to take advantage of both the properties of the original and the differential spectrum in the training loss function. Moreover, a DNN-based noise classification and fusion strategy (NCF) is proposed to exploit a discriminative model for noise reduction. The experiments reveal the improvements of the proposed approach compared to the previous methods.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/8881007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139601400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multiscale Dilated Convolution for Image Compressive Sensing (CS) MsDC-DEQ-Net:用于图像压缩传感(CS)的多尺度稀释卷积深度平衡模型(DEQ)
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-01-18 DOI: 10.1049/2024/6666549
Youhao Yu, Richard M. Dansereau

Compressive sensing (CS) is a technique that enables the recovery of sparse signals using fewer measurements than traditional sampling methods. To address the computational challenges of CS reconstruction, our objective is to develop an interpretable and concise neural network model for reconstructing natural images using CS. We achieve this by mapping one step of the iterative shrinkage thresholding algorithm (ISTA) to a deep network block, representing one iteration of ISTA. To enhance learning ability and incorporate structural diversity, we integrate aggregated residual transformations (ResNeXt) and squeeze-and-excitation mechanisms into the ISTA block. This block serves as a deep equilibrium layer connected to a semi-tensor product network for convenient sampling and providing an initial reconstruction. The resulting model, called MsDC-DEQ-Net, exhibits competitive performance compared to state-of-the-art network-based methods. It significantly reduces storage requirements compared to deep unrolling methods, using only one iteration block instead of multiple iterations. Unlike deep unrolling models, MsDC-DEQ-Net can be iteratively used, gradually improving reconstruction accuracy while considering computation tradeoffs. Additionally, the model benefits from multiscale dilated convolutions, further enhancing performance.

压缩传感(CS)是一种能利用比传统采样方法更少的测量值恢复稀疏信号的技术。为了应对 CS 重建的计算挑战,我们的目标是开发一种可解释的简洁神经网络模型,用于使用 CS 重建自然图像。为此,我们将迭代收缩阈值算法(ISTA)的一个步骤映射到代表 ISTA 一次迭代的深度网络块。为了增强学习能力并纳入结构多样性,我们将聚合残差变换(ResNeXt)和挤压-激发机制整合到 ISTA 块中。该区块作为深度平衡层,与半张量乘积网络相连,方便采样并提供初始重建。由此产生的模型被称为 MsDC-DEQ-Net,与最先进的基于网络的方法相比,其性能极具竞争力。与深度解卷方法相比,它只使用一个迭代块而不是多个迭代块,从而大大降低了存储需求。与深度解卷模型不同,MsDC-DEQ-Net 可以迭代使用,在考虑计算折衷的同时逐步提高重建精度。此外,该模型还受益于多尺度扩张卷积,进一步提高了性能。
{"title":"MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multiscale Dilated Convolution for Image Compressive Sensing (CS)","authors":"Youhao Yu,&nbsp;Richard M. Dansereau","doi":"10.1049/2024/6666549","DOIUrl":"10.1049/2024/6666549","url":null,"abstract":"<div>\u0000 <p>Compressive sensing (CS) is a technique that enables the recovery of sparse signals using fewer measurements than traditional sampling methods. To address the computational challenges of CS reconstruction, our objective is to develop an interpretable and concise neural network model for reconstructing natural images using CS. We achieve this by mapping one step of the iterative shrinkage thresholding algorithm (ISTA) to a deep network block, representing one iteration of ISTA. To enhance learning ability and incorporate structural diversity, we integrate aggregated residual transformations (ResNeXt) and squeeze-and-excitation mechanisms into the ISTA block. This block serves as a deep equilibrium layer connected to a semi-tensor product network for convenient sampling and providing an initial reconstruction. The resulting model, called MsDC-DEQ-Net, exhibits competitive performance compared to state-of-the-art network-based methods. It significantly reduces storage requirements compared to deep unrolling methods, using only one iteration block instead of multiple iterations. Unlike deep unrolling models, MsDC-DEQ-Net can be iteratively used, gradually improving reconstruction accuracy while considering computation tradeoffs. Additionally, the model benefits from multiscale dilated convolutions, further enhancing performance.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/6666549","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139630022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Anomaly Detection with Attention (DADA): A Novel Approach for Identifying Multipath Interference in Radar Signals 注意力深度异常检测 (DADA):识别雷达信号中多径干扰的新方法
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2024-01-13 DOI: 10.1049/2024/5026821
Kang Yan, Weidong Jin, Yingkun Huang, Zhenhua Li, Pucha Song, Ligang Huang

Multipath interference in radar signals caused by sea, ground, and other environments poses significant challenges to the target detection, tracking, and classification capabilities of radar systems. Existing methods for radar signal identification require labeled samples and focus mainly on the classification of normal signals. However, in practice, anomalous samples (multipath interference signals) may be scarce and highly imbalanced (i.e., mostly normal samples). To address this problem, we propose a deep anomaly detection with attention (DADA) for semisupervised detection of multipath radar signals. The method transforms radar signals into time–frequency images and is trained exclusively on normal samples. The autoencoder architecture is extended with a feature extractor network to capture latent sample features. CBAM attention is introduced to improve feature extraction. By learning the distribution of normal samples in high-dimensional image space and low-dimensional feature space, a two-dimensional feature space representing normal samples is constructed. A one-class SVM then learns the boundary of normal samples for anomaly detection. Extensive experiments on radar signal datasets validate the effectiveness of the proposed approach.

由海洋、地面和其他环境造成的雷达信号多径干扰对雷达系统的目标探测、跟踪和分类能力提出了巨大挑战。现有的雷达信号识别方法需要标注样本,主要侧重于正常信号的分类。然而,在实际应用中,异常样本(多径干扰信号)可能非常稀少且高度不平衡(即大部分为正常样本)。为解决这一问题,我们提出了一种针对多径雷达信号半监督检测的深度异常检测方法(DADA)。该方法将雷达信号转换为时频图像,并完全在正常样本上进行训练。自动编码器架构通过特征提取器网络进行扩展,以捕捉潜在的样本特征。为改进特征提取,引入了 CBAM 注意。通过学习正常样本在高维图像空间和低维特征空间中的分布,构建了代表正常样本的二维特征空间。然后,通过单类 SVM 学习正常样本的边界,从而进行异常检测。在雷达信号数据集上进行的大量实验验证了所提方法的有效性。
{"title":"Deep Anomaly Detection with Attention (DADA): A Novel Approach for Identifying Multipath Interference in Radar Signals","authors":"Kang Yan,&nbsp;Weidong Jin,&nbsp;Yingkun Huang,&nbsp;Zhenhua Li,&nbsp;Pucha Song,&nbsp;Ligang Huang","doi":"10.1049/2024/5026821","DOIUrl":"10.1049/2024/5026821","url":null,"abstract":"<div>\u0000 <p>Multipath interference in radar signals caused by sea, ground, and other environments poses significant challenges to the target detection, tracking, and classification capabilities of radar systems. Existing methods for radar signal identification require labeled samples and focus mainly on the classification of normal signals. However, in practice, anomalous samples (multipath interference signals) may be scarce and highly imbalanced (i.e., mostly normal samples). To address this problem, we propose a deep anomaly detection with attention (DADA) for semisupervised detection of multipath radar signals. The method transforms radar signals into time–frequency images and is trained exclusively on normal samples. The autoencoder architecture is extended with a feature extractor network to capture latent sample features. CBAM attention is introduced to improve feature extraction. By learning the distribution of normal samples in high-dimensional image space and low-dimensional feature space, a two-dimensional feature space representing normal samples is constructed. A one-class SVM then learns the boundary of normal samples for anomaly detection. Extensive experiments on radar signal datasets validate the effectiveness of the proposed approach.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5026821","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139531325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IRR-Net: A Joint Learning Framework for Image Reconstruction and Recognition of Photoacoustic Tomography IRR-Net:光声断层扫描图像重建与识别的联合学习框架
IF 1.7 4区 工程技术 Q2 Engineering Pub Date : 2023-12-22 DOI: 10.1049/2023/6615953
Zheng Sun, Bing Ai, Meichen Sun, Yingsa Hou
In photoacoustic tomography (PAT), object identification and classification are usually performed as postprocessing processes after image reconstruction. Since useful information about the target implied in the raw signal can be lost during image reconstruction, this two-step scheme can reduce the accuracy of tissue characterization. For learning-based methods, it is time consuming to train the network of each subtask separately. In this paper, we report on an end-to-end joint learning framework for simultaneous image reconstruction and object recognition, named IRR-Net. It establishes direct mapping of raw photoacoustic signals to high-quality images with recognized targets. The network consists of an image reconstruction module, an optimization module, and a recognition module, which achieved signal-to-image, image-to-image, and image-to-class conversion, respectively. We built simulation, phantom and in vivo data sets to train and test IRR-Net. The results show that the proposed method successfully yields concurrent improvements in both the quality of the reconstructed images and the accuracy of target recognition at a lower time cost compared to the separately trained networks.
在光声断层扫描(PAT)中,物体识别和分类通常是在图像重建后进行的后处理过程。由于原始信号中隐含的目标有用信息可能会在图像重建过程中丢失,这种两步法会降低组织特征描述的准确性。对于基于学习的方法来说,分别训练每个子任务的网络非常耗时。在本文中,我们报告了一种端到端的联合学习框架,用于同时进行图像重建和物体识别,命名为 IRR-Net。它能将原始光声信号直接映射到带有识别目标的高质量图像上。该网络由图像重建模块、优化模块和识别模块组成,分别实现信号到图像、图像到图像和图像到类别的转换。我们建立了模拟、模型和体内数据集来训练和测试 IRR-Net。结果表明,与单独训练的网络相比,所提出的方法成功地同时提高了重建图像的质量和目标识别的准确性,而且时间成本更低。
{"title":"IRR-Net: A Joint Learning Framework for Image Reconstruction and Recognition of Photoacoustic Tomography","authors":"Zheng Sun, Bing Ai, Meichen Sun, Yingsa Hou","doi":"10.1049/2023/6615953","DOIUrl":"https://doi.org/10.1049/2023/6615953","url":null,"abstract":"In photoacoustic tomography (PAT), object identification and classification are usually performed as postprocessing processes after image reconstruction. Since useful information about the target implied in the raw signal can be lost during image reconstruction, this two-step scheme can reduce the accuracy of tissue characterization. For learning-based methods, it is time consuming to train the network of each subtask separately. In this paper, we report on an end-to-end joint learning framework for simultaneous image reconstruction and object recognition, named IRR-Net. It establishes direct mapping of raw photoacoustic signals to high-quality images with recognized targets. The network consists of an image reconstruction module, an optimization module, and a recognition module, which achieved signal-to-image, image-to-image, and image-to-class conversion, respectively. We built simulation, phantom and in vivo data sets to train and test IRR-Net. The results show that the proposed method successfully yields concurrent improvements in both the quality of the reconstructed images and the accuracy of target recognition at a lower time cost compared to the separately trained networks.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":1.7,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139164286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Complexity BFGS-Based Soft-Output MMSE Detector for Massive MIMO Uplink 基于低复杂度bfgs的海量MIMO上行软输出MMSE检测器
4区 工程技术 Q2 Engineering Pub Date : 2023-11-14 DOI: 10.1049/2023/8887060
Lin Li, Jianhao Hu
For the massive multiple-input multiple-output (MIMO) uplink, the linear minimum mean square error (MMSE) detector is near-optimal but involves undesirable matrix inversion. In this paper, we propose a low-complexity soft-output detector based on the simplified Broyden–Fletcher–Goldfarb–Shanno method to realize the matrix-inversion-free MMSE detection iteratively. To accelerate convergence with minimal computational overhead, an appropriate initial solution is presented leveraging the channel-hardening property of massive MIMO. Moreover, we employ a low-complexity approximated approach to calculating the log-likelihood ratios with negligible performance losses. Simulation results finally verify that the proposed detector can achieve the near-MMSE performance with a few iterations and outperforms the recently reported linear detectors in terms of lower complexity and faster convergence for the realistic massive MIMO systems.
对于大规模多输入多输出(MIMO)上行链路,线性最小均方误差(MMSE)检测器接近最优,但涉及不良的矩阵反演。本文提出了一种基于简化Broyden-Fletcher-Goldfarb-Shanno方法的低复杂度软输出检测器,迭代实现无矩阵逆的MMSE检测。为了以最小的计算开销加速收敛,利用大规模MIMO的信道硬化特性,提出了一个合适的初始解。此外,我们采用了一种低复杂度的近似方法来计算具有可忽略性能损失的对数似然比。仿真结果表明,该检测器只需几次迭代即可达到接近mmse的性能,并且在较低的复杂度和更快的收敛速度方面优于最近报道的线性检测器,适用于实际的大规模MIMO系统。
{"title":"Low-Complexity BFGS-Based Soft-Output MMSE Detector for Massive MIMO Uplink","authors":"Lin Li, Jianhao Hu","doi":"10.1049/2023/8887060","DOIUrl":"https://doi.org/10.1049/2023/8887060","url":null,"abstract":"For the massive multiple-input multiple-output (MIMO) uplink, the linear minimum mean square error (MMSE) detector is near-optimal but involves undesirable matrix inversion. In this paper, we propose a low-complexity soft-output detector based on the simplified Broyden–Fletcher–Goldfarb–Shanno method to realize the matrix-inversion-free MMSE detection iteratively. To accelerate convergence with minimal computational overhead, an appropriate initial solution is presented leveraging the channel-hardening property of massive MIMO. Moreover, we employ a low-complexity approximated approach to calculating the log-likelihood ratios with negligible performance losses. Simulation results finally verify that the proposed detector can achieve the near-MMSE performance with a few iterations and outperforms the recently reported linear detectors in terms of lower complexity and faster convergence for the realistic massive MIMO systems.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134957184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IET Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1