首页 > 最新文献

IET Signal Processing最新文献

英文 中文
Speech Enhancement Using Joint DNN-NMF Model Learned with Multi-Objective Frequency Differential Spectrum Loss Function 使用利用多目标频谱差分损失函数学习的 DNN-NMF 联合模型进行语音增强
IF 1.7 4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-01-24 DOI: 10.1049/2024/8881007
Matin Pashaian, Sanaz Seyedin

We propose a multi-objective joint model of non-negative matrix factorization (NMF) and deep neural network (DNN) with a new loss function for speech enhancement. The proposed loss function (LMOFD) is a weighted combination of a frequency differential spectrum mean squared error (MSE)-based loss function (LFD) and a multi-objective MSE loss function (LMO). The conventional MSE loss function computes the discrepancy between the estimated speech and clean speech across all frequencies, disregarding the process of changing amplitude in the frequency domain which contains valuable information. The differential spectrum representation retains spectral peaks that carry important information. Using this representation helps to ensure that this information in the speech signal is reserved. Also, on the other hand, noise spectra typically have a flat shape and as the differential operation makes the flat spectral partly close to zero, the differential spectrum is resistant to noises with smooth structures. Thus, we propose using a frequency-differentiated loss function that considers the magnitude spectrum differentiations between the neighboring frequency bins in each time frame. This approach maintains the spectrum variations of the objective signal in the frequency domain, which can effectively reduce the noise deterioration effects. The multi-objective MSE term (LMO) is a combined two-loss function related to the NMF coefficients which are the intermediate output targets, and the original spectral signals as the actual output targets. The use of encoded NMF coefficients as low-dimensional structural features for DNN serves as prior knowledge and helps the learning process. LMO is used beside LFD to take advantage of both the properties of the original and the differential spectrum in the training loss function. Moreover, a DNN-based noise classification and fusion strategy (NCF) is proposed to exploit a discriminative model for noise reduction. The experiments reveal the improvements of the proposed approach compared to the previous methods.

我们提出了一种非负矩阵因式分解(NMF)和深度神经网络(DNN)的多目标联合模型,并为语音增强提供了一种新的损失函数。所提出的损失函数(LMOFD)是基于频谱均方误差(MSE)的损失函数(LFD)和多目标 MSE 损失函数 LMO 的加权组合。传统的 MSE 损失函数计算的是估计语音与干净语音在所有频率上的差异,忽略了频域中包含有价值信息的振幅变化过程。微分频谱表示法保留了包含重要信息的频谱峰值。使用这种表示法有助于确保保留语音信号中的这些信息。此外,另一方面,噪声频谱通常具有平坦的形状,由于差分操作会使平坦频谱部分接近零,因此差分频谱对具有平滑结构的噪声具有抵抗力。因此,我们建议使用频率差分损失函数,该函数考虑了每个时间帧中相邻频带之间的幅度频谱差分。这种方法保持了目标信号在频域中的频谱变化,可以有效降低噪声劣化效应。多目标 MSE 项 LMO 是与作为中间输出目标的 NMF 系数和作为实际输出目标的原始频谱信号相关的两个损失函数的组合。将编码的 NMF 系数作为 DNN 的低维结构特征,可作为先验知识,有助于学习过程。LMO 与 LFD 并用,在训练损失函数中利用了原始频谱和差分频谱的特性。此外,还提出了一种基于 DNN 的噪声分类和融合策略(NCF),以利用判别模型来降低噪声。实验表明,与之前的方法相比,所提出的方法有了很大的改进。
{"title":"Speech Enhancement Using Joint DNN-NMF Model Learned with Multi-Objective Frequency Differential Spectrum Loss Function","authors":"Matin Pashaian,&nbsp;Sanaz Seyedin","doi":"10.1049/2024/8881007","DOIUrl":"10.1049/2024/8881007","url":null,"abstract":"<div>\u0000 <p>We propose a multi-objective joint model of non-negative matrix factorization (NMF) and deep neural network (DNN) with a new loss function for speech enhancement. The proposed loss function (<i>L</i><sub>MOFD</sub>) is a weighted combination of a frequency differential spectrum mean squared error (MSE)-based loss function (<i>L</i><sub>FD</sub>) and a multi-objective MSE loss function (<i>L</i><sub>MO</sub>). The conventional MSE loss function computes the discrepancy between the estimated speech and clean speech across all frequencies, disregarding the process of changing amplitude in the frequency domain which contains valuable information. The differential spectrum representation retains spectral peaks that carry important information. Using this representation helps to ensure that this information in the speech signal is reserved. Also, on the other hand, noise spectra typically have a flat shape and as the differential operation makes the flat spectral partly close to zero, the differential spectrum is resistant to noises with smooth structures. Thus, we propose using a frequency-differentiated loss function that considers the magnitude spectrum differentiations between the neighboring frequency bins in each time frame. This approach maintains the spectrum variations of the objective signal in the frequency domain, which can effectively reduce the noise deterioration effects. The multi-objective MSE term (<i>L</i><sub>MO</sub>) is a combined two-loss function related to the NMF coefficients which are the intermediate output targets, and the original spectral signals as the actual output targets. The use of encoded NMF coefficients as low-dimensional structural features for DNN serves as prior knowledge and helps the learning process. <i>L</i><sub>MO</sub> is used beside <i>L</i><sub>FD</sub> to take advantage of both the properties of the original and the differential spectrum in the training loss function. Moreover, a DNN-based noise classification and fusion strategy (NCF) is proposed to exploit a discriminative model for noise reduction. The experiments reveal the improvements of the proposed approach compared to the previous methods.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"2024 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/8881007","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139601400","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multiscale Dilated Convolution for Image Compressive Sensing (CS) MsDC-DEQ-Net:用于图像压缩传感(CS)的多尺度稀释卷积深度平衡模型(DEQ)
IF 1.7 4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-01-18 DOI: 10.1049/2024/6666549
Youhao Yu, Richard M. Dansereau

Compressive sensing (CS) is a technique that enables the recovery of sparse signals using fewer measurements than traditional sampling methods. To address the computational challenges of CS reconstruction, our objective is to develop an interpretable and concise neural network model for reconstructing natural images using CS. We achieve this by mapping one step of the iterative shrinkage thresholding algorithm (ISTA) to a deep network block, representing one iteration of ISTA. To enhance learning ability and incorporate structural diversity, we integrate aggregated residual transformations (ResNeXt) and squeeze-and-excitation mechanisms into the ISTA block. This block serves as a deep equilibrium layer connected to a semi-tensor product network for convenient sampling and providing an initial reconstruction. The resulting model, called MsDC-DEQ-Net, exhibits competitive performance compared to state-of-the-art network-based methods. It significantly reduces storage requirements compared to deep unrolling methods, using only one iteration block instead of multiple iterations. Unlike deep unrolling models, MsDC-DEQ-Net can be iteratively used, gradually improving reconstruction accuracy while considering computation tradeoffs. Additionally, the model benefits from multiscale dilated convolutions, further enhancing performance.

压缩传感(CS)是一种能利用比传统采样方法更少的测量值恢复稀疏信号的技术。为了应对 CS 重建的计算挑战,我们的目标是开发一种可解释的简洁神经网络模型,用于使用 CS 重建自然图像。为此,我们将迭代收缩阈值算法(ISTA)的一个步骤映射到代表 ISTA 一次迭代的深度网络块。为了增强学习能力并纳入结构多样性,我们将聚合残差变换(ResNeXt)和挤压-激发机制整合到 ISTA 块中。该区块作为深度平衡层,与半张量乘积网络相连,方便采样并提供初始重建。由此产生的模型被称为 MsDC-DEQ-Net,与最先进的基于网络的方法相比,其性能极具竞争力。与深度解卷方法相比,它只使用一个迭代块而不是多个迭代块,从而大大降低了存储需求。与深度解卷模型不同,MsDC-DEQ-Net 可以迭代使用,在考虑计算折衷的同时逐步提高重建精度。此外,该模型还受益于多尺度扩张卷积,进一步提高了性能。
{"title":"MsDC-DEQ-Net: Deep Equilibrium Model (DEQ) with Multiscale Dilated Convolution for Image Compressive Sensing (CS)","authors":"Youhao Yu,&nbsp;Richard M. Dansereau","doi":"10.1049/2024/6666549","DOIUrl":"10.1049/2024/6666549","url":null,"abstract":"<div>\u0000 <p>Compressive sensing (CS) is a technique that enables the recovery of sparse signals using fewer measurements than traditional sampling methods. To address the computational challenges of CS reconstruction, our objective is to develop an interpretable and concise neural network model for reconstructing natural images using CS. We achieve this by mapping one step of the iterative shrinkage thresholding algorithm (ISTA) to a deep network block, representing one iteration of ISTA. To enhance learning ability and incorporate structural diversity, we integrate aggregated residual transformations (ResNeXt) and squeeze-and-excitation mechanisms into the ISTA block. This block serves as a deep equilibrium layer connected to a semi-tensor product network for convenient sampling and providing an initial reconstruction. The resulting model, called MsDC-DEQ-Net, exhibits competitive performance compared to state-of-the-art network-based methods. It significantly reduces storage requirements compared to deep unrolling methods, using only one iteration block instead of multiple iterations. Unlike deep unrolling models, MsDC-DEQ-Net can be iteratively used, gradually improving reconstruction accuracy while considering computation tradeoffs. Additionally, the model benefits from multiscale dilated convolutions, further enhancing performance.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"2024 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/6666549","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139630022","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Anomaly Detection with Attention (DADA): A Novel Approach for Identifying Multipath Interference in Radar Signals 注意力深度异常检测 (DADA):识别雷达信号中多径干扰的新方法
IF 1.7 4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-01-13 DOI: 10.1049/2024/5026821
Kang Yan, Weidong Jin, Yingkun Huang, Zhenhua Li, Pucha Song, Ligang Huang

Multipath interference in radar signals caused by sea, ground, and other environments poses significant challenges to the target detection, tracking, and classification capabilities of radar systems. Existing methods for radar signal identification require labeled samples and focus mainly on the classification of normal signals. However, in practice, anomalous samples (multipath interference signals) may be scarce and highly imbalanced (i.e., mostly normal samples). To address this problem, we propose a deep anomaly detection with attention (DADA) for semisupervised detection of multipath radar signals. The method transforms radar signals into time–frequency images and is trained exclusively on normal samples. The autoencoder architecture is extended with a feature extractor network to capture latent sample features. CBAM attention is introduced to improve feature extraction. By learning the distribution of normal samples in high-dimensional image space and low-dimensional feature space, a two-dimensional feature space representing normal samples is constructed. A one-class SVM then learns the boundary of normal samples for anomaly detection. Extensive experiments on radar signal datasets validate the effectiveness of the proposed approach.

由海洋、地面和其他环境造成的雷达信号多径干扰对雷达系统的目标探测、跟踪和分类能力提出了巨大挑战。现有的雷达信号识别方法需要标注样本,主要侧重于正常信号的分类。然而,在实际应用中,异常样本(多径干扰信号)可能非常稀少且高度不平衡(即大部分为正常样本)。为解决这一问题,我们提出了一种针对多径雷达信号半监督检测的深度异常检测方法(DADA)。该方法将雷达信号转换为时频图像,并完全在正常样本上进行训练。自动编码器架构通过特征提取器网络进行扩展,以捕捉潜在的样本特征。为改进特征提取,引入了 CBAM 注意。通过学习正常样本在高维图像空间和低维特征空间中的分布,构建了代表正常样本的二维特征空间。然后,通过单类 SVM 学习正常样本的边界,从而进行异常检测。在雷达信号数据集上进行的大量实验验证了所提方法的有效性。
{"title":"Deep Anomaly Detection with Attention (DADA): A Novel Approach for Identifying Multipath Interference in Radar Signals","authors":"Kang Yan,&nbsp;Weidong Jin,&nbsp;Yingkun Huang,&nbsp;Zhenhua Li,&nbsp;Pucha Song,&nbsp;Ligang Huang","doi":"10.1049/2024/5026821","DOIUrl":"10.1049/2024/5026821","url":null,"abstract":"<div>\u0000 <p>Multipath interference in radar signals caused by sea, ground, and other environments poses significant challenges to the target detection, tracking, and classification capabilities of radar systems. Existing methods for radar signal identification require labeled samples and focus mainly on the classification of normal signals. However, in practice, anomalous samples (multipath interference signals) may be scarce and highly imbalanced (i.e., mostly normal samples). To address this problem, we propose a deep anomaly detection with attention (DADA) for semisupervised detection of multipath radar signals. The method transforms radar signals into time–frequency images and is trained exclusively on normal samples. The autoencoder architecture is extended with a feature extractor network to capture latent sample features. CBAM attention is introduced to improve feature extraction. By learning the distribution of normal samples in high-dimensional image space and low-dimensional feature space, a two-dimensional feature space representing normal samples is constructed. A one-class SVM then learns the boundary of normal samples for anomaly detection. Extensive experiments on radar signal datasets validate the effectiveness of the proposed approach.</p>\u0000 </div>","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"2024 1","pages":""},"PeriodicalIF":1.7,"publicationDate":"2024-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/2024/5026821","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139531325","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
IRR-Net: A Joint Learning Framework for Image Reconstruction and Recognition of Photoacoustic Tomography IRR-Net:光声断层扫描图像重建与识别的联合学习框架
IF 1.7 4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-12-22 DOI: 10.1049/2023/6615953
Zheng Sun, Bing Ai, Meichen Sun, Yingsa Hou
In photoacoustic tomography (PAT), object identification and classification are usually performed as postprocessing processes after image reconstruction. Since useful information about the target implied in the raw signal can be lost during image reconstruction, this two-step scheme can reduce the accuracy of tissue characterization. For learning-based methods, it is time consuming to train the network of each subtask separately. In this paper, we report on an end-to-end joint learning framework for simultaneous image reconstruction and object recognition, named IRR-Net. It establishes direct mapping of raw photoacoustic signals to high-quality images with recognized targets. The network consists of an image reconstruction module, an optimization module, and a recognition module, which achieved signal-to-image, image-to-image, and image-to-class conversion, respectively. We built simulation, phantom and in vivo data sets to train and test IRR-Net. The results show that the proposed method successfully yields concurrent improvements in both the quality of the reconstructed images and the accuracy of target recognition at a lower time cost compared to the separately trained networks.
在光声断层扫描(PAT)中,物体识别和分类通常是在图像重建后进行的后处理过程。由于原始信号中隐含的目标有用信息可能会在图像重建过程中丢失,这种两步法会降低组织特征描述的准确性。对于基于学习的方法来说,分别训练每个子任务的网络非常耗时。在本文中,我们报告了一种端到端的联合学习框架,用于同时进行图像重建和物体识别,命名为 IRR-Net。它能将原始光声信号直接映射到带有识别目标的高质量图像上。该网络由图像重建模块、优化模块和识别模块组成,分别实现信号到图像、图像到图像和图像到类别的转换。我们建立了模拟、模型和体内数据集来训练和测试 IRR-Net。结果表明,与单独训练的网络相比,所提出的方法成功地同时提高了重建图像的质量和目标识别的准确性,而且时间成本更低。
{"title":"IRR-Net: A Joint Learning Framework for Image Reconstruction and Recognition of Photoacoustic Tomography","authors":"Zheng Sun, Bing Ai, Meichen Sun, Yingsa Hou","doi":"10.1049/2023/6615953","DOIUrl":"https://doi.org/10.1049/2023/6615953","url":null,"abstract":"In photoacoustic tomography (PAT), object identification and classification are usually performed as postprocessing processes after image reconstruction. Since useful information about the target implied in the raw signal can be lost during image reconstruction, this two-step scheme can reduce the accuracy of tissue characterization. For learning-based methods, it is time consuming to train the network of each subtask separately. In this paper, we report on an end-to-end joint learning framework for simultaneous image reconstruction and object recognition, named IRR-Net. It establishes direct mapping of raw photoacoustic signals to high-quality images with recognized targets. The network consists of an image reconstruction module, an optimization module, and a recognition module, which achieved signal-to-image, image-to-image, and image-to-class conversion, respectively. We built simulation, phantom and in vivo data sets to train and test IRR-Net. The results show that the proposed method successfully yields concurrent improvements in both the quality of the reconstructed images and the accuracy of target recognition at a lower time cost compared to the separately trained networks.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"82 12","pages":""},"PeriodicalIF":1.7,"publicationDate":"2023-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139164286","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Low-Complexity BFGS-Based Soft-Output MMSE Detector for Massive MIMO Uplink 基于低复杂度bfgs的海量MIMO上行软输出MMSE检测器
4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-11-14 DOI: 10.1049/2023/8887060
Lin Li, Jianhao Hu
For the massive multiple-input multiple-output (MIMO) uplink, the linear minimum mean square error (MMSE) detector is near-optimal but involves undesirable matrix inversion. In this paper, we propose a low-complexity soft-output detector based on the simplified Broyden–Fletcher–Goldfarb–Shanno method to realize the matrix-inversion-free MMSE detection iteratively. To accelerate convergence with minimal computational overhead, an appropriate initial solution is presented leveraging the channel-hardening property of massive MIMO. Moreover, we employ a low-complexity approximated approach to calculating the log-likelihood ratios with negligible performance losses. Simulation results finally verify that the proposed detector can achieve the near-MMSE performance with a few iterations and outperforms the recently reported linear detectors in terms of lower complexity and faster convergence for the realistic massive MIMO systems.
对于大规模多输入多输出(MIMO)上行链路,线性最小均方误差(MMSE)检测器接近最优,但涉及不良的矩阵反演。本文提出了一种基于简化Broyden-Fletcher-Goldfarb-Shanno方法的低复杂度软输出检测器,迭代实现无矩阵逆的MMSE检测。为了以最小的计算开销加速收敛,利用大规模MIMO的信道硬化特性,提出了一个合适的初始解。此外,我们采用了一种低复杂度的近似方法来计算具有可忽略性能损失的对数似然比。仿真结果表明,该检测器只需几次迭代即可达到接近mmse的性能,并且在较低的复杂度和更快的收敛速度方面优于最近报道的线性检测器,适用于实际的大规模MIMO系统。
{"title":"Low-Complexity BFGS-Based Soft-Output MMSE Detector for Massive MIMO Uplink","authors":"Lin Li, Jianhao Hu","doi":"10.1049/2023/8887060","DOIUrl":"https://doi.org/10.1049/2023/8887060","url":null,"abstract":"For the massive multiple-input multiple-output (MIMO) uplink, the linear minimum mean square error (MMSE) detector is near-optimal but involves undesirable matrix inversion. In this paper, we propose a low-complexity soft-output detector based on the simplified Broyden–Fletcher–Goldfarb–Shanno method to realize the matrix-inversion-free MMSE detection iteratively. To accelerate convergence with minimal computational overhead, an appropriate initial solution is presented leveraging the channel-hardening property of massive MIMO. Moreover, we employ a low-complexity approximated approach to calculating the log-likelihood ratios with negligible performance losses. Simulation results finally verify that the proposed detector can achieve the near-MMSE performance with a few iterations and outperforms the recently reported linear detectors in terms of lower complexity and faster convergence for the realistic massive MIMO systems.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"5 3","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134957184","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Preset Conditional Generative Adversarial Network for Massive MIMO Detection 大规模MIMO检测的预置条件生成对抗网络
4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-11-14 DOI: 10.1049/2023/6610762
Yongzhi Yu, Shiqi Zhang, Jiadong Shang, Ping Wang
In recent years, extensive research has been conducted to obtain better detection performance by combining massive multiple-input multiple-output (MIMO) signal detection with deep neural network (DNN). However, spatial correlation and channel estimation errors significantly affect the performance of DNN-based detection methods. In this study, we consider applying conditional generation adversarial network (CGAN) model to massive MIMO signal detection. First, we propose a preset conditional generative adversarial network (PC-GAN). We construct the dataset with the channel state information (CSI) as a condition preset in the received signal, and train the detector without direct involvement of CSI, which effectively resists the impact of imperfect CSI on the detection performance. Then, we propose a noise removal and preset conditional generative adversarial network (NR-PC-GAN) suitable for low-signal-to-noise ratio (SNR) communication scenarios. The noise in the received signal is removed to improve the detection performance of the detector. The numerical results show that PC-GAN performs well in spatially correlated and imperfect channels. The detection performance of NR-PC-GAN is far superior to the other algorithms in low-SNR scenarios.
近年来,将海量多输入多输出(MIMO)信号检测与深度神经网络(DNN)相结合以获得更好的检测性能得到了广泛的研究。然而,空间相关性和信道估计误差会显著影响基于深度神经网络的检测方法的性能。在本研究中,我们考虑将条件生成对抗网络(CGAN)模型应用于大规模MIMO信号检测。首先,我们提出了一种预置条件生成对抗网络(PC-GAN)。我们将信道状态信息(CSI)作为接收信号中预设的条件来构建数据集,并在没有CSI直接参与的情况下训练检测器,有效地抵抗了不完善的CSI对检测性能的影响。然后,我们提出了一种适合于低信噪比(SNR)通信场景的降噪和预置条件生成对抗网络(NR-PC-GAN)。去除接收信号中的噪声,提高检测器的检测性能。数值结果表明,PC-GAN在空间相关和不完全通道中表现良好。在低信噪比情况下,NR-PC-GAN的检测性能远远优于其他算法。
{"title":"Preset Conditional Generative Adversarial Network for Massive MIMO Detection","authors":"Yongzhi Yu, Shiqi Zhang, Jiadong Shang, Ping Wang","doi":"10.1049/2023/6610762","DOIUrl":"https://doi.org/10.1049/2023/6610762","url":null,"abstract":"In recent years, extensive research has been conducted to obtain better detection performance by combining massive multiple-input multiple-output (MIMO) signal detection with deep neural network (DNN). However, spatial correlation and channel estimation errors significantly affect the performance of DNN-based detection methods. In this study, we consider applying conditional generation adversarial network (CGAN) model to massive MIMO signal detection. First, we propose a preset conditional generative adversarial network (PC-GAN). We construct the dataset with the channel state information (CSI) as a condition preset in the received signal, and train the detector without direct involvement of CSI, which effectively resists the impact of imperfect CSI on the detection performance. Then, we propose a noise removal and preset conditional generative adversarial network (NR-PC-GAN) suitable for low-signal-to-noise ratio (SNR) communication scenarios. The noise in the received signal is removed to improve the detection performance of the detector. The numerical results show that PC-GAN performs well in spatially correlated and imperfect channels. The detection performance of NR-PC-GAN is far superior to the other algorithms in low-SNR scenarios.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"31 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134957570","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
GLAD: Global–Local Approach; Disentanglement Learning for Financial Market Prediction GLAD:全球-地方方法;金融市场预测的解纠缠学习
4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-11-10 DOI: 10.1049/2023/6623718
Humam M. Abdulsahib, Foad Ghaderi
Accurate prediction of financial market trends can have a great impact on maximizing profits and avoiding risks. Conventional methods, e.g., regression or SVR, or end-to-end training approaches, coined as deep learning algorithms, have restraints as a consequence of capturing noisy and unnecessary data. Financial market’s data are composed of stock’s price time series that are correlated, and each time series has both global and local dynamics. Inspired by recent advancements in disentanglement representation learning, in this paper, we present a promising model for predicting financial markets that learn disentangled representations of features and eliminate those features that cause interference. Our model uses the informer encoder to extract features, capturing global–local patterns by using the time and frequency domains, augmenting the clean features with time and frequency-based features, and using the decoder to predict. To be more specific, we adopt contrastive learning in the time and frequency domains to learn both global and local patterns. We argue that our methodology, disentangling and learning the influential factors, holds the potential for more accurate predictions and a better understanding of how time series move and behave. We conducted our experiments using the S&P 500, CSI 300, Hang Seng, and Nikkei 225 stock market datasets to predict their next-day closing prices. The results showed that our model outperformed existing methods in terms of prediction error (mean squared error and mean absolute error), financial risk measurement (volatility and max drawdown), and prediction net curves, which means that it may enhance traders’ profits.
准确预测金融市场走势对企业实现利润最大化、规避风险具有重要意义。传统方法,如回归或SVR,或端到端训练方法,被称为深度学习算法,由于捕获噪声和不必要的数据而受到限制。金融市场的数据是由相互关联的股票价格时间序列组成的,每个时间序列都具有全局和局部动态。受解纠缠表示学习的最新进展的启发,在本文中,我们提出了一个有前途的模型,用于预测金融市场,该模型可以学习特征的解纠缠表示并消除那些引起干扰的特征。我们的模型使用信息编码器提取特征,通过使用时间和频率域捕获全局-局部模式,使用基于时间和频率的特征增强干净特征,并使用解码器进行预测。更具体地说,我们采用时域和频域的对比学习来学习全局和局部模式。我们认为,我们的方法,解开和学习影响因素,具有更准确的预测和更好地理解时间序列如何移动和表现的潜力。我们使用标准普尔500指数、沪深300指数、恒生指数和日经225指数的股票市场数据集进行了实验,以预测它们第二天的收盘价。结果表明,我们的模型在预测误差(均方误差和平均绝对误差)、金融风险度量(波动率和最大回撤率)和预测净曲线方面优于现有方法,这意味着它可以提高交易者的利润。
{"title":"GLAD: Global–Local Approach; Disentanglement Learning for Financial Market Prediction","authors":"Humam M. Abdulsahib, Foad Ghaderi","doi":"10.1049/2023/6623718","DOIUrl":"https://doi.org/10.1049/2023/6623718","url":null,"abstract":"Accurate prediction of financial market trends can have a great impact on maximizing profits and avoiding risks. Conventional methods, e.g., regression or SVR, or end-to-end training approaches, coined as deep learning algorithms, have restraints as a consequence of capturing noisy and unnecessary data. Financial market’s data are composed of stock’s price time series that are correlated, and each time series has both global and local dynamics. Inspired by recent advancements in disentanglement representation learning, in this paper, we present a promising model for predicting financial markets that learn disentangled representations of features and eliminate those features that cause interference. Our model uses the informer encoder to extract features, capturing global–local patterns by using the time and frequency domains, augmenting the clean features with time and frequency-based features, and using the decoder to predict. To be more specific, we adopt contrastive learning in the time and frequency domains to learn both global and local patterns. We argue that our methodology, disentangling and learning the influential factors, holds the potential for more accurate predictions and a better understanding of how time series move and behave. We conducted our experiments using the S&amp;P 500, CSI 300, Hang Seng, and Nikkei 225 stock market datasets to predict their next-day closing prices. The results showed that our model outperformed existing methods in terms of prediction error (mean squared error and mean absolute error), financial risk measurement (volatility and max drawdown), and prediction net curves, which means that it may enhance traders’ profits.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":" 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135141493","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Recovery of Sparse Signals via Modified Hard Thresholding Pursuit Algorithms 基于改进硬阈值追踪算法的稀疏信号恢复
4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-11-03 DOI: 10.1049/2023/9937696
Li-Ping Geng, Jin-Chuan Zhou, Zhong-Feng Sun, Jing-Yong Tang
In this paper, we propose a modified version of the hard thresholding pursuit algorithm, called modified hard thresholding pursuit (MHTP), using a convex combination of the current and previous points. The convergence analysis, finite termination properties, and stability of the MHTP are established under the restricted isometry property of the measurement matrix. Simulations are performed in noiseless and noisy environments using synthetic data, in which the successful frequencies, average runtime, and phase transition of the MHTP are considered. Standard test images are also used to test the reconstruction capability of the MHTP in terms of the peak signal-to-noise ratio. Numerical results indicate that the MHTP is competitive with several mainstream thresholding and greedy algorithms, such as hard thresholding pursuit, compressive sampling matching pursuit, subspace pursuit, generalized orthogonal matching pursuit, and Newton-step-based hard thresholding pursuit, in terms of recovery capability and runtime.
在本文中,我们提出了一种改进版本的硬阈值追踪算法,称为改进硬阈值追踪(MHTP),使用当前点和先前点的凸组合。在测量矩阵的受限等距性质下,建立了MHTP的收敛性分析、有限终止性和稳定性。利用合成数据在无噪声和有噪声环境下进行了仿真,其中考虑了MHTP的成功频率、平均运行时间和相变。还使用标准测试图像来测试MHTP在峰值信噪比方面的重建能力。数值结果表明,MHTP在恢复能力和运行时间方面与硬阈值追踪、压缩采样匹配追踪、子空间追踪、广义正交匹配追踪和基于牛顿步长的硬阈值追踪等几种主流阈值算法和贪婪算法具有一定的竞争力。
{"title":"Recovery of Sparse Signals via Modified Hard Thresholding Pursuit Algorithms","authors":"Li-Ping Geng, Jin-Chuan Zhou, Zhong-Feng Sun, Jing-Yong Tang","doi":"10.1049/2023/9937696","DOIUrl":"https://doi.org/10.1049/2023/9937696","url":null,"abstract":"In this paper, we propose a modified version of the hard thresholding pursuit algorithm, called modified hard thresholding pursuit (MHTP), using a convex combination of the current and previous points. The convergence analysis, finite termination properties, and stability of the MHTP are established under the restricted isometry property of the measurement matrix. Simulations are performed in noiseless and noisy environments using synthetic data, in which the successful frequencies, average runtime, and phase transition of the MHTP are considered. Standard test images are also used to test the reconstruction capability of the MHTP in terms of the peak signal-to-noise ratio. Numerical results indicate that the MHTP is competitive with several mainstream thresholding and greedy algorithms, such as hard thresholding pursuit, compressive sampling matching pursuit, subspace pursuit, generalized orthogonal matching pursuit, and Newton-step-based hard thresholding pursuit, in terms of recovery capability and runtime.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"40 2","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135818725","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RF Signal Feature Extraction in Integrated Sensing and Communication 集成传感与通信中的射频信号特征提取
4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-10-28 DOI: 10.1049/2023/4251265
Xiaoya Wang, Songlin Sun, Haiying Zhang, Qiang Liu
Because of the open property of information sharing in integrated sensing and communication, it is inevitable to face security problems such as user information being tampered, eavesdropped, and copied. Radio frequency (RF) individual identification technology is an important means to solve its security problems at present. Whether using machine learning methods or current deep learning-based target fingerprint identification, its performance is based on how well the radio frequency features (RFF) are extracted. Since the received signal is affected by various factors, we believe that we should first find the intrinsic features that can describe the properties of the target, which is the key to enhance the RF fingerprint recognition. In this paper, we try to analyze the intrinsic characteristics of the components that influenced the signal by the transmitting source and derive a mathematical formula to describe the RF characteristics. We propose a method using dynamic wavelet transform and wavelet spectrum (DWTWS) to enhance RFF features. The performance of the proposed method was evaluated by experimental data. Using a support vector machine classifier, the recognition accuracy is 99.6% for 10 individuals at a signal-to-noise ratio (SNR) of 10 dB. In comparison with the dual-tree complex wavelet transform (DT-CWT) feature extraction method and the wavelet scattering transform method, the DWTWS method has increased the interclass distance of different individuals and enhanced the recognition accuracy. The DWTWS method is superior at low SNR, with performance improvements of 53.1% and 10.7% at 0 dB.
由于集成传感与通信中信息共享的开放性,不可避免地面临用户信息被篡改、窃听、复制等安全问题。射频(RF)个人识别技术是目前解决其安全问题的重要手段。无论是使用机器学习方法还是当前基于深度学习的目标指纹识别,其性能都取决于射频特征(RFF)的提取程度。由于接收到的信号受到各种因素的影响,我们认为首先要找到能够描述目标属性的内在特征,这是增强射频指纹识别的关键。在本文中,我们试图分析受发射源影响信号的元件的固有特性,并推导出描述射频特性的数学公式。提出了一种利用动态小波变换和小波谱(DWTWS)增强RFF特征的方法。实验数据验证了该方法的性能。使用支持向量机分类器,在信噪比为10 dB的情况下,对10个个体的识别准确率达到99.6%。与双树复小波变换(DT-CWT)特征提取方法和小波散射变换方法相比,DWTWS方法增加了不同个体的类间距离,提高了识别精度。DWTWS方法在低信噪比条件下性能较好,在0 dB条件下性能分别提高53.1%和10.7%。
{"title":"RF Signal Feature Extraction in Integrated Sensing and Communication","authors":"Xiaoya Wang, Songlin Sun, Haiying Zhang, Qiang Liu","doi":"10.1049/2023/4251265","DOIUrl":"https://doi.org/10.1049/2023/4251265","url":null,"abstract":"Because of the open property of information sharing in integrated sensing and communication, it is inevitable to face security problems such as user information being tampered, eavesdropped, and copied. Radio frequency (RF) individual identification technology is an important means to solve its security problems at present. Whether using machine learning methods or current deep learning-based target fingerprint identification, its performance is based on how well the radio frequency features (RFF) are extracted. Since the received signal is affected by various factors, we believe that we should first find the intrinsic features that can describe the properties of the target, which is the key to enhance the RF fingerprint recognition. In this paper, we try to analyze the intrinsic characteristics of the components that influenced the signal by the transmitting source and derive a mathematical formula to describe the RF characteristics. We propose a method using dynamic wavelet transform and wavelet spectrum (DWTWS) to enhance RFF features. The performance of the proposed method was evaluated by experimental data. Using a support vector machine classifier, the recognition accuracy is 99.6% for 10 individuals at a signal-to-noise ratio (SNR) of 10 dB. In comparison with the dual-tree complex wavelet transform (DT-CWT) feature extraction method and the wavelet scattering transform method, the DWTWS method has increased the interclass distance of different individuals and enhanced the recognition accuracy. The DWTWS method is superior at low SNR, with performance improvements of 53.1% and 10.7% at 0 dB.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"30 7","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"136233733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Li-Ion Battery State of Health Estimation Based on Short Random Charging Segment and Improved Long Short-Term Memory 基于短时随机充电段和改进长短期记忆的锂离子电池健康状态估计
4区 工程技术 Q4 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2023-10-23 DOI: 10.1049/2023/8839034
Aina Tian, Zhe Chen, Zhuangzhuang Pan, Chen Yang, Yuqin Wang, Kailang Dong, Yang Gao, Jiuchun Jiang
Lithium-ion batteries have been used in a wide range of applications, including electrochemical energy storage and electrical transportation. In order to ensure safe and stable battery operation, the State of Health (SOH) needs to be accurately estimated. In recent years, model-based and data-driven methods have been widely used for SOH estimation, but due to the uncertainty of battery charging conditions in practice, it is difficult to obtain a fixed local segment. In this paper, the charging curve is first divided into several equal voltage difference segments based on charging segment voltage difference ΔV in order to solve the random charging segment problem. Time interval of equal charge voltage difference of the voltage curve, coefficient of variation and euclidean distance of the charging capacity difference curve are extracted as health features. The improved flow direction algorithmlong short term memory-based SOH assessment method is proposed and verified by the Oxford battery degradation dataset and experimental battery degradation dataset with a maximum error of 0.6%.
锂离子电池在电化学储能和电力运输等领域有着广泛的应用。为了保证电池安全稳定的运行,需要对电池的健康状态(SOH)进行准确的估算。近年来,基于模型和数据驱动的SOH估计方法得到了广泛的应用,但由于实践中电池充电条件的不确定性,难以获得固定的局部分段。本文首先根据充电段电压差ΔV将充电曲线划分为若干等电压差段,以解决随机充电段问题。提取电压曲线等充电电压差的时间间隔、充电容量差曲线的变异系数和欧氏距离作为健康特征。提出了改进的流动方向算法-基于长短期记忆的SOH评估方法,并通过牛津电池退化数据集和实验电池退化数据集进行了验证,最大误差为0.6%。
{"title":"Li-Ion Battery State of Health Estimation Based on Short Random Charging Segment and Improved Long Short-Term Memory","authors":"Aina Tian, Zhe Chen, Zhuangzhuang Pan, Chen Yang, Yuqin Wang, Kailang Dong, Yang Gao, Jiuchun Jiang","doi":"10.1049/2023/8839034","DOIUrl":"https://doi.org/10.1049/2023/8839034","url":null,"abstract":"Lithium-ion batteries have been used in a wide range of applications, including electrochemical energy storage and electrical transportation. In order to ensure safe and stable battery operation, the State of Health (SOH) needs to be accurately estimated. In recent years, model-based and data-driven methods have been widely used for SOH estimation, but due to the uncertainty of battery charging conditions in practice, it is difficult to obtain a fixed local segment. In this paper, the charging curve is first divided into several equal voltage difference segments based on charging segment voltage difference ΔV in order to solve the random charging segment problem. Time interval of equal charge voltage difference of the voltage curve, coefficient of variation and euclidean distance of the charging capacity difference curve are extracted as health features. The improved flow direction algorithmlong short term memory-based SOH assessment method is proposed and verified by the Oxford battery degradation dataset and experimental battery degradation dataset with a maximum error of 0.6%.","PeriodicalId":56301,"journal":{"name":"IET Signal Processing","volume":"54 10","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2023-10-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"135365414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
IET Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1