首页 > 最新文献

Digital Signal Processing最新文献

英文 中文
Research on ZYNQ neural network acceleration method for aluminum surface microdefects 铝表面微缺陷的ZYNQ神经网络加速方法研究
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-28 DOI: 10.1016/j.dsp.2024.104900
Dongxue Zhao, Shenbo Liu, Zhigang Zhang, Zhao Zhang, Lijun Tang
Convolutional Neural Networks (CNN) are an important means of detection of microdefects on the aluminum surface, and the high complexity and computing power requirements of the CNN model lead to difficulties in deploying them on edge computing platforms as the detection accuracy continues to improve. We have studied a lightweight acceleration method for detecting microdefects on aluminum surfaces on the Zynq-7000 All Programmable SoC (ZYNQ) platform. A lightweight aluminum surface defect detection network (LADFastDet) and high-performance accelerators based on ZYNQ are designed to meet the requirements of precision and speed under limited resources. In the LADFastDet structure, a lightweight inverted residual block is designed by combining depthwise convolution, inverted residual block, and inverted bottleneck. A multiscale feature fusion structure is designed to effectively improve the detection accuracy of LADFastDet, especially small target defects. We design accelerators on ZYNQ through optimization methods such as loop optimization strategy, ping-pong buffering, and multichannel and multiple interfaces data reading and writing to reduce data access latency and thus improve the computing speed. The experimental results show that the LADFastDet model has a mAP of 97.51%, the inference time of the accelerators for a single image is 42.57 ms, and a power consumption of 2.15 W, which achieves a throughput of 24.9 GOPS and an energy efficiency of 11.58 GOPS/W.
卷积神经网络(CNN)是铝表面微缺陷检测的重要手段,随着检测精度的不断提高,CNN模型的高复杂性和对计算能力的要求导致其在边缘计算平台上部署困难。我们在ZYNQ -7000全可编程SoC (All Programmable SoC, ZYNQ)平台上研究了一种用于检测铝表面微缺陷的轻量级加速方法。基于ZYNQ的轻量化铝表面缺陷检测网络(LADFastDet)和高性能加速器设计,满足有限资源下精度和速度的要求。在LADFastDet结构中,将深度卷积、反向残差块和反向瓶颈相结合,设计了轻量级的反向残差块。设计了一种多尺度特征融合结构,有效提高了LADFastDet的检测精度,特别是小目标缺陷的检测精度。我们通过循环优化策略、乒乓缓冲、多通道多接口数据读写等优化方法在ZYNQ上设计加速器,以减少数据访问延迟,从而提高计算速度。实验结果表明,LADFastDet模型的mAP率为97.51%,加速器对单幅图像的推理时间为42.57 ms,功耗为2.15 W,吞吐量为24.9 GOPS,能效为11.58 GOPS/W。
{"title":"Research on ZYNQ neural network acceleration method for aluminum surface microdefects","authors":"Dongxue Zhao,&nbsp;Shenbo Liu,&nbsp;Zhigang Zhang,&nbsp;Zhao Zhang,&nbsp;Lijun Tang","doi":"10.1016/j.dsp.2024.104900","DOIUrl":"10.1016/j.dsp.2024.104900","url":null,"abstract":"<div><div>Convolutional Neural Networks (CNN) are an important means of detection of microdefects on the aluminum surface, and the high complexity and computing power requirements of the CNN model lead to difficulties in deploying them on edge computing platforms as the detection accuracy continues to improve. We have studied a lightweight acceleration method for detecting microdefects on aluminum surfaces on the Zynq-7000 All Programmable SoC (ZYNQ) platform. A lightweight aluminum surface defect detection network (LADFastDet) and high-performance accelerators based on ZYNQ are designed to meet the requirements of precision and speed under limited resources. In the LADFastDet structure, a lightweight inverted residual block is designed by combining depthwise convolution, inverted residual block, and inverted bottleneck. A multiscale feature fusion structure is designed to effectively improve the detection accuracy of LADFastDet, especially small target defects. We design accelerators on ZYNQ through optimization methods such as loop optimization strategy, ping-pong buffering, and multichannel and multiple interfaces data reading and writing to reduce data access latency and thus improve the computing speed. The experimental results show that the LADFastDet model has a mAP of 97.51%, the inference time of the accelerators for a single image is 42.57 ms, and a power consumption of 2.15 W, which achieves a throughput of 24.9 GOPS and an energy efficiency of 11.58 GOPS/W.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104900"},"PeriodicalIF":2.9,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142744712","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-scale informative priors network for medical image segmentation 医学图像分割的跨尺度信息先验网络
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-28 DOI: 10.1016/j.dsp.2024.104883
Fuxian Sui , Hua Wang , Fan Zhang
Accurate segmentation of medical images is of great significance for computer-aided diagnosis. Transformers show great promise in medical image segmentation, where they can complement local convolutions by capturing long-range dependencies via self-attention. Recent methods have shown good performance in dealing with variations in global context modeling. However, they do not deal well with problems such as boundary blurring because they ignore the edge prior and the complementarity of the global context. To address this challenge, we propose a segmentation network based on informative priors across scales. The encoder in our network utilizes the self-attention mechanism to capture long-range dependencies, while the proposed cross-scale prior decoder makes full use of the multi-scale features in the hierarchical vision transformer to capture boundary information by using a prior perceptron, and enhances both remote and local context information by suppressing background information using a pattern perceptron. Through the internal organic combination, the edge prior and the global background are fully used to complement each other, and the problem of inaccurate boundary segmentation is better solved. Extensive experiments have been conducted on multiple segmented datasets to validate the advanced performance of the model.
医学图像的准确分割对计算机辅助诊断具有重要意义。变形金刚在医学图像分割中显示出巨大的前景,它们可以通过自关注捕获远程依赖关系来补充局部卷积。最近的方法在处理全局上下文建模中的变化方面表现出良好的性能。然而,由于忽略了边缘先验和全局背景的互补性,它们不能很好地处理边界模糊等问题。为了解决这一挑战,我们提出了一种基于信息先验的跨尺度分割网络。该网络中的编码器利用自注意机制捕获远程依赖关系,而跨尺度先验解码器则充分利用分层视觉转换器中的多尺度特征,利用先验感知器捕获边界信息,并利用模式感知器抑制背景信息,增强远程和本地上下文信息。通过内部有机结合,充分利用边缘先验和全局背景相互补充,较好地解决了边界分割不准确的问题。在多个分割数据集上进行了大量的实验,以验证该模型的先进性能。
{"title":"Cross-scale informative priors network for medical image segmentation","authors":"Fuxian Sui ,&nbsp;Hua Wang ,&nbsp;Fan Zhang","doi":"10.1016/j.dsp.2024.104883","DOIUrl":"10.1016/j.dsp.2024.104883","url":null,"abstract":"<div><div>Accurate segmentation of medical images is of great significance for computer-aided diagnosis. Transformers show great promise in medical image segmentation, where they can complement local convolutions by capturing long-range dependencies via self-attention. Recent methods have shown good performance in dealing with variations in global context modeling. However, they do not deal well with problems such as boundary blurring because they ignore the edge prior and the complementarity of the global context. To address this challenge, we propose a segmentation network based on informative priors across scales. The encoder in our network utilizes the self-attention mechanism to capture long-range dependencies, while the proposed cross-scale prior decoder makes full use of the multi-scale features in the hierarchical vision transformer to capture boundary information by using a prior perceptron, and enhances both remote and local context information by suppressing background information using a pattern perceptron. Through the internal organic combination, the edge prior and the global background are fully used to complement each other, and the problem of inaccurate boundary segmentation is better solved. Extensive experiments have been conducted on multiple segmented datasets to validate the advanced performance of the model.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104883"},"PeriodicalIF":2.9,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142744713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An improved digital predistortion scheme for nonlinear transmitters with limited bandwidth 一种改进的有限带宽非线性发射机数字预失真方案
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-28 DOI: 10.1016/j.dsp.2024.104874
Linshan Zhao , Kai Ying , Disheng Xiao , Jian Pang , Kai Kang
In modern wireless communication systems, wide signal bandwidth is the most straightforward approach to accommodate high data rates. Wide signal bandwidth, on the other hand, introduces severe challenges to the power amplifier (PA) and digital predistortion (DPD) design in both performance and cost. Conventional DPD systems usually ignore the impact of the transmit low-pass filter (Tx LPF) bandwidth and assume the transmit bandwidth is sufficiently large. In wideband signal transmissions, the bandwidth of Tx LPF can become the system bottleneck, limiting DPDs compensation effects. Existing DPD studies mostly investigate the DPD with reduced feedback bandwidth. In this paper, we study the impact of Tx LPF bandwidth on the DPD performance. A full-band error minimization DPD based on direct learning structure is proposed. The DPD coefficients are estimated by minimizing the full-band error between the input signal and PA output signal in the frequency domain. Furthermore, we propose a weighted DPD with improved performance by introducing a weighting diagonal matrix to the error function. Compared to existing solutions, the weighted DPD achieves a good trade-off between the in-band distortion compensation and out-of-band spectral regrowth suppression. Simulations and experiments validate the effectiveness of the proposed DPD schemes.
在现代无线通信系统中,宽信号带宽是适应高数据速率的最直接方法。另一方面,宽信号带宽给功率放大器(PA)和数字预失真(DPD)设计带来了性能和成本方面的严峻挑战。传统的DPD系统通常忽略发射低通滤波器(Tx LPF)带宽的影响,并假设发射带宽足够大。在宽带信号传输中,Tx LPF的带宽会成为系统的瓶颈,限制DPDs的补偿效果。现有的DPD研究主要针对反馈带宽减小的DPD。在本文中,我们研究了Tx LPF带宽对DPD性能的影响。提出了一种基于直接学习结构的全频带误差最小化DPD。通过在频域内最小化输入信号和PA输出信号之间的全频带误差来估计DPD系数。此外,我们通过在误差函数中引入加权对角矩阵,提出了一种具有改进性能的加权DPD。与现有解决方案相比,加权DPD在带内失真补偿和带外频谱再生抑制之间取得了很好的平衡。仿真和实验验证了所提DPD方案的有效性。
{"title":"An improved digital predistortion scheme for nonlinear transmitters with limited bandwidth","authors":"Linshan Zhao ,&nbsp;Kai Ying ,&nbsp;Disheng Xiao ,&nbsp;Jian Pang ,&nbsp;Kai Kang","doi":"10.1016/j.dsp.2024.104874","DOIUrl":"10.1016/j.dsp.2024.104874","url":null,"abstract":"<div><div>In modern wireless communication systems, wide signal bandwidth is the most straightforward approach to accommodate high data rates. Wide signal bandwidth, on the other hand, introduces severe challenges to the power amplifier (PA) and digital predistortion (DPD) design in both performance and cost. Conventional DPD systems usually ignore the impact of the transmit low-pass filter (Tx LPF) bandwidth and assume the transmit bandwidth is sufficiently large. In wideband signal transmissions, the bandwidth of Tx LPF can become the system bottleneck, limiting DPDs compensation effects. Existing DPD studies mostly investigate the DPD with reduced feedback bandwidth. In this paper, we study the impact of Tx LPF bandwidth on the DPD performance. A full-band error minimization DPD based on direct learning structure is proposed. The DPD coefficients are estimated by minimizing the full-band error between the input signal and PA output signal in the frequency domain. Furthermore, we propose a weighted DPD with improved performance by introducing a weighting diagonal matrix to the error function. Compared to existing solutions, the weighted DPD achieves a good trade-off between the in-band distortion compensation and out-of-band spectral regrowth suppression. Simulations and experiments validate the effectiveness of the proposed DPD schemes.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104874"},"PeriodicalIF":2.9,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142744714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Weight consistency and cluster diversity based concept factorization for multi-view clustering 基于权重一致性和聚类多样性的多视图聚类概念分解
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-26 DOI: 10.1016/j.dsp.2024.104879
Youyang Tao , Hangjun Che , Chenglu Li , Baicheng Pan , Man-Fai Leung
In the era of information explosion, clustering analysis of multi-view data plays a crucial role in revealing the intrinsic structures of data. Despite the advancements in existing multi-view clustering methods for processing complex data, they often overlook the weight differences among various views and the diversity between clusters. To address the issues, the paper introduces a novel multi-view clustering approach termed weight consistency and cluster diversity based concept factorization for multi-view clustering (MVCF-WD). Specifically, the proposed method automatically learns the weights of the views, and incorporates a cluster diversity term to enhance the discriminability of clusters. Furthermore, to solve the formulated optimization model, an iterative optimization algorithm based on multiplication rules is developed and the convergence is analyzed. Extensive experiments conducted across seven datasets compared with ten state-of-the-art clustering algorithms demonstrate the superior clustering performance of the proposed method.
在信息爆炸时代,多视图数据的聚类分析对于揭示数据的内在结构起着至关重要的作用。尽管现有的多视图聚类方法在处理复杂数据方面取得了进步,但它们往往忽略了不同视图之间的权重差异和聚类之间的多样性。为了解决这些问题,本文引入了一种新的多视图聚类方法,即基于权重一致性和聚类多样性的多视图聚类概念分解(MVCF-WD)。具体而言,该方法自动学习视图的权重,并引入聚类多样性项来增强聚类的可分辨性。在此基础上,提出了一种基于乘法规则的迭代优化算法,并对其收敛性进行了分析。在七个数据集上进行的大量实验与十种最先进的聚类算法进行了比较,证明了所提出方法的优越聚类性能。
{"title":"Weight consistency and cluster diversity based concept factorization for multi-view clustering","authors":"Youyang Tao ,&nbsp;Hangjun Che ,&nbsp;Chenglu Li ,&nbsp;Baicheng Pan ,&nbsp;Man-Fai Leung","doi":"10.1016/j.dsp.2024.104879","DOIUrl":"10.1016/j.dsp.2024.104879","url":null,"abstract":"<div><div>In the era of information explosion, clustering analysis of multi-view data plays a crucial role in revealing the intrinsic structures of data. Despite the advancements in existing multi-view clustering methods for processing complex data, they often overlook the weight differences among various views and the diversity between clusters. To address the issues, the paper introduces a novel multi-view clustering approach termed weight consistency and cluster diversity based concept factorization for multi-view clustering (MVCF-WD). Specifically, the proposed method automatically learns the weights of the views, and incorporates a cluster diversity term to enhance the discriminability of clusters. Furthermore, to solve the formulated optimization model, an iterative optimization algorithm based on multiplication rules is developed and the convergence is analyzed. Extensive experiments conducted across seven datasets compared with ten state-of-the-art clustering algorithms demonstrate the superior clustering performance of the proposed method.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104879"},"PeriodicalIF":2.9,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142744711","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient gridless wideband sparse array synthesis with tapped delay-lines 带抽头延迟线的高效无网格宽带稀疏阵列合成
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-26 DOI: 10.1016/j.dsp.2024.104893
Wenjing Zhou , Mingwei Shen , Di Wu , Daiyin Zhu , Guodong Han
In this paper, we propose a new wideband sparse array synthesis method based on gridless compressed sensing to solve the basis mismatch problem for discrete grids. Considering the tapped delay-lines (TDL) structure for space-time domain processing, and using successive frequency-varying atoms for sparse representation of wideband signals, an arbitrary sampling-atomic norm minimization is introduced to model the group sparsity-constrained wideband arrays in which the positions and the excitation values of array element are obtained with a high freedom. The above nonconvex problem is then transformed into a convex relaxation, which is solved using the Prolate Spheroidal Wave Functions (PSWFs). The experimental results show that the proposed sparse array design has higher matching accuracy and sparsity, compared with the discretized wideband sparse array design, which verifies the effectiveness and efficiency of this method.
针对离散网格的基错配问题,提出了一种基于无网格压缩感知的宽带稀疏阵列综合方法。利用抽头延迟线(TDL)结构进行空时域处理,利用连续变频原子对宽带信号进行稀疏表示,采用任意采样原子范数最小化方法对群稀疏约束的宽带阵列进行建模,使阵元的位置和激励值具有较高的自由度。然后将上述非凸问题转化为凸松弛问题,并利用长球面波函数(PSWFs)求解。实验结果表明,与离散化宽带稀疏阵列设计相比,本文提出的稀疏阵列设计具有更高的匹配精度和稀疏度,验证了该方法的有效性和高效性。
{"title":"Efficient gridless wideband sparse array synthesis with tapped delay-lines","authors":"Wenjing Zhou ,&nbsp;Mingwei Shen ,&nbsp;Di Wu ,&nbsp;Daiyin Zhu ,&nbsp;Guodong Han","doi":"10.1016/j.dsp.2024.104893","DOIUrl":"10.1016/j.dsp.2024.104893","url":null,"abstract":"<div><div>In this paper, we propose a new wideband sparse array synthesis method based on gridless compressed sensing to solve the basis mismatch problem for discrete grids. Considering the tapped delay-lines (TDL) structure for space-time domain processing, and using successive frequency-varying atoms for sparse representation of wideband signals, an arbitrary sampling-atomic norm minimization is introduced to model the group sparsity-constrained wideband arrays in which the positions and the excitation values of array element are obtained with a high freedom. The above nonconvex problem is then transformed into a convex relaxation, which is solved using the Prolate Spheroidal Wave Functions (PSWFs). The experimental results show that the proposed sparse array design has higher matching accuracy and sparsity, compared with the discretized wideband sparse array design, which verifies the effectiveness and efficiency of this method.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104893"},"PeriodicalIF":2.9,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142744710","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Noise-aware network with shared channel-attention encoder and joint constraint for noisy speech separation 基于共享信道注意编码器和联合约束的噪声感知网络用于噪声语音分离
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-26 DOI: 10.1016/j.dsp.2024.104891
Linhui Sun , Xiaolong Zhou , Aifei Gong , Lei Ye , Pingan Li , Eng Siong Chng
Recently, significant progress has been made in the end-to-end single-channel speech separation in clean environments. For noisy speech separation, existing research mainly uses deep neural networks to implicitly process the noise in speech signals, which does not fully utilize the impact of noise reconstruction errors on network training. We propose a lightweight noise-aware network with shared channel-attention encoder and joint constraint, named NSCJnet, which aims to improve the speech separation system performance in noisy environments. Firstly, to reduce network parameters, the model uses a parameter sharing channel attention encoder to convert noisy speech signals into a feature space. In addition, the channel attention layer (CAlayer) in encoder enhances the network's representational capacity and separation performance in noisy environments by calculating different weights of the filters in the convolution. Secondly, to make the network converge quickly, we regard noise as an estimation target of equal significance to speech, which compel the network to separate residual noise from the estimated speech, effectively suppressing lingering noise within the speech signal. Furthermore, by integrating a multi-resolution frequency constraint into the time domain loss, we introduce a weighted time-frequency joint loss constraint, empowering the network to acquire information across both dimensions to conducive to separating mixed speech with noise. It automatically strengthens important features for separation and suppresses unimportant ones during the learning process. The results on the noisy WHAM! dataset and the noisy Libri2Mix dataset show that our method has less computational complexity, and outperforms some advanced methods in various speech quality and intelligibility metrics.
近年来,清洁环境下的端到端单通道语音分离技术取得了重大进展。对于带噪语音分离,现有研究主要是利用深度神经网络对语音信号中的噪声进行隐式处理,没有充分利用噪声重构误差对网络训练的影响。为了提高语音分离系统在噪声环境下的性能,提出了一种具有共享信道注意编码器和联合约束的轻量级噪声感知网络NSCJnet。首先,为了减小网络参数,该模型使用参数共享通道关注编码器将带噪声的语音信号转换为特征空间;此外,编码器中的信道注意层(callayer)通过计算卷积中滤波器的不同权重,增强了网络在噪声环境下的表示能力和分离性能。其次,为了使网络快速收敛,我们将噪声作为与语音同等重要的估计目标,迫使网络将残差噪声从估计的语音中分离出来,有效地抑制语音信号中的残留噪声。此外,通过将多分辨率频率约束集成到时域损失中,我们引入了加权时频联合损失约束,使网络能够跨两个维度获取信息,从而有利于分离混合语音和噪声。在学习过程中,它会自动强化重要的特征进行分离,并抑制不重要的特征。结果在嘈杂的WHAM!数据集和带噪声的Libri2Mix数据集表明,我们的方法具有较低的计算复杂度,并且在各种语音质量和可理解性指标上优于一些先进的方法。
{"title":"Noise-aware network with shared channel-attention encoder and joint constraint for noisy speech separation","authors":"Linhui Sun ,&nbsp;Xiaolong Zhou ,&nbsp;Aifei Gong ,&nbsp;Lei Ye ,&nbsp;Pingan Li ,&nbsp;Eng Siong Chng","doi":"10.1016/j.dsp.2024.104891","DOIUrl":"10.1016/j.dsp.2024.104891","url":null,"abstract":"<div><div>Recently, significant progress has been made in the end-to-end single-channel speech separation in clean environments. For noisy speech separation, existing research mainly uses deep neural networks to implicitly process the noise in speech signals, which does not fully utilize the impact of noise reconstruction errors on network training. We propose a lightweight noise-aware network with shared channel-attention encoder and joint constraint, named NSCJnet, which aims to improve the speech separation system performance in noisy environments. Firstly, to reduce network parameters, the model uses a parameter sharing channel attention encoder to convert noisy speech signals into a feature space. In addition, the channel attention layer (CAlayer) in encoder enhances the network's representational capacity and separation performance in noisy environments by calculating different weights of the filters in the convolution. Secondly, to make the network converge quickly, we regard noise as an estimation target of equal significance to speech, which compel the network to separate residual noise from the estimated speech, effectively suppressing lingering noise within the speech signal. Furthermore, by integrating a multi-resolution frequency constraint into the time domain loss, we introduce a weighted time-frequency joint loss constraint, empowering the network to acquire information across both dimensions to conducive to separating mixed speech with noise. It automatically strengthens important features for separation and suppresses unimportant ones during the learning process. The results on the noisy WHAM! dataset and the noisy Libri2Mix dataset show that our method has less computational complexity, and outperforms some advanced methods in various speech quality and intelligibility metrics.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104891"},"PeriodicalIF":2.9,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142745363","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Adaptive detection of radar range-Doppler dual-spread targets in lognormal-texture clutter 对数正态纹理杂波中雷达距离-多普勒双展目标的自适应检测
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-26 DOI: 10.1016/j.dsp.2024.104882
Jian Xue , Zhen Fan , Shuwen Xu , Meiyan Pan
This paper investigates the problem of adaptive detection of radar targets in non-Gaussian clutter, where the target to be detected is considered to behave the dual-spread in the Doppler frequency dimension and the range dimension. The clutter is assumed to follow the compound Gaussian model with lognormal texture and unknown covariance matrix structure. The multi-rank linear subspace model and the range-spread model are employed to depict the Doppler and range spread characteristics of target echoes. Then, the range-Doppler dual-spread adaptive radar target detector with lognormal-texture is proposed using the two-step generalized likelihood ratio criteria, which replaces the true values of the unknown parameters with their maximum likelihood and maximum a posteriori estimates. Experimental results on simulated and measured data demonstrate that the proposed detector shows superior performance in different clutter and target parameters compared to the competitors.
本文研究了非高斯杂波条件下雷达目标的自适应检测问题,考虑被检测目标在多普勒频率维和距离维上具有双扩频特性。假设杂波遵循对数正态纹理和未知协方差矩阵结构的复合高斯模型。采用多阶线性子空间模型和距离扩展模型来描述目标回波的多普勒和距离扩展特性。然后,利用两步广义似然比准则提出了对数正态纹理的距离-多普勒双扩频自适应雷达目标探测器,用未知参数的最大似然估计和最大后验估计代替未知参数的真值。仿真和实测数据的实验结果表明,该探测器在不同杂波和目标参数下的性能优于同类探测器。
{"title":"Adaptive detection of radar range-Doppler dual-spread targets in lognormal-texture clutter","authors":"Jian Xue ,&nbsp;Zhen Fan ,&nbsp;Shuwen Xu ,&nbsp;Meiyan Pan","doi":"10.1016/j.dsp.2024.104882","DOIUrl":"10.1016/j.dsp.2024.104882","url":null,"abstract":"<div><div>This paper investigates the problem of adaptive detection of radar targets in non-Gaussian clutter, where the target to be detected is considered to behave the dual-spread in the Doppler frequency dimension and the range dimension. The clutter is assumed to follow the compound Gaussian model with lognormal texture and unknown covariance matrix structure. The multi-rank linear subspace model and the range-spread model are employed to depict the Doppler and range spread characteristics of target echoes. Then, the range-Doppler dual-spread adaptive radar target detector with lognormal-texture is proposed using the two-step generalized likelihood ratio criteria, which replaces the true values of the unknown parameters with their maximum likelihood and maximum a posteriori estimates. Experimental results on simulated and measured data demonstrate that the proposed detector shows superior performance in different clutter and target parameters compared to the competitors.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104882"},"PeriodicalIF":2.9,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142745362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unified framework for linear scale invariant signals, systems, and transforms: A tutorial 线性尺度不变信号、系统和变换的统一框架:教程
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-26 DOI: 10.1016/j.dsp.2024.104880
Anubha Gupta , Pushpendra Singh , Priya Aggarwal , Shiv Dutt Joshi
This paper presents a unified framework for linear scale invariant signals, systems, and transforms from a system theoretic perspective. The work is the scale counterpart of the theory related to linear shift invariant systems and transforms. Similar to Fourier and Laplace transforms that are used to study linear shift or time invariant systems, Mellin transform is used to study scale invariant systems. However, unlike the shift invariant theory, the theory related to scale invariant systems and transforms has so far not been presented with a unified approach. In this work, we present this theory from signal processing viewpoint, where we present the development of scale invariant transform as a systematic progression from scale series for scale periodic signals to scale invariant transform for scale aperiodic signals. We also present a few examples to illustrate the utility of the presented theory.
本文从系统理论的角度提出了线性尺度不变信号、系统和变换的统一框架。这项工作是与线性平移不变系统和变换相关的理论的尺度对应。与用于研究线性移位或时不变系统的傅里叶变换和拉普拉斯变换类似,Mellin变换用于研究尺度不变系统。然而,与平移不变量理论不同的是,尺度不变量系统和变换的相关理论到目前为止还没有一个统一的方法。在这项工作中,我们从信号处理的角度提出了这一理论,其中我们将尺度不变变换的发展作为一个系统的进展,从尺度周期信号的尺度级数到尺度非周期信号的尺度不变变换。我们还提出了几个例子来说明所提出的理论的效用。
{"title":"Unified framework for linear scale invariant signals, systems, and transforms: A tutorial","authors":"Anubha Gupta ,&nbsp;Pushpendra Singh ,&nbsp;Priya Aggarwal ,&nbsp;Shiv Dutt Joshi","doi":"10.1016/j.dsp.2024.104880","DOIUrl":"10.1016/j.dsp.2024.104880","url":null,"abstract":"<div><div>This paper presents a unified framework for linear scale invariant signals, systems, and transforms from a system theoretic perspective. The work is the scale counterpart of the theory related to linear shift invariant systems and transforms. Similar to Fourier and Laplace transforms that are used to study linear shift or time invariant systems, Mellin transform is used to study scale invariant systems. However, unlike the shift invariant theory, the theory related to scale invariant systems and transforms has so far not been presented with a unified approach. In this work, we present this theory from signal processing viewpoint, where we present the development of scale invariant transform as a systematic progression from scale series for scale periodic signals to scale invariant transform for scale aperiodic signals. We also present a few examples to illustrate the utility of the presented theory.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104880"},"PeriodicalIF":2.9,"publicationDate":"2024-11-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142745364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MBF-Net: Multi-scale boundary-aware aggregation for bi-directional information exchange and feature reshaping for medical image segmentation MBF-Net:用于医学图像分割的双向信息交换和特征重构的多尺度边界感知聚合
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-24 DOI: 10.1016/j.dsp.2024.104885
Junran Qian , Xudong Xiang , Haiyan Li , Shuhua Ye , Hongsong Li
As a critical component of computer-aided diagnosis systems, medical image segmentation plays a vital role in assisting clinicians in making rapid and accurate decisions and formulating treatment plans. Nevertheless, precise medical image segmentation still presents a number of challenges, including insufficient feature extraction capabilities in the presence of limited sample sizes, blurred segmentation boundaries, and information loss between the encoder and decoder. In order to address these issues, we propose a Multi-Scale Boundary-Aware Aggregation Network with Bidirectional Information Exchange and Feature Refinement (MBF-Net) for medical image segmentation. Initially, we design a Multi-Scale Boundary-Aware Aggregation Encoder (MBAE) that aggregates features from different scales and pixel levels within the input images, capturing fine-grained boundary information in deep features and establishing comprehensive global and local multi-scale contextual dependencies. This design significantly enhances the model's understanding of the overall image structure and its ability to discern subtle differences between lesions and background. Subsequently, a Multi-Scale Bidirectional Information Transmission (MBIT) module is introduced, which integrates bidirectional information flow between low-level and high-level features, enabling multi-scale features to flow bidirectionally across different layers. The MBIT module effectively preserves crucial boundary details during cross-layer information transmission, thereby bridging the semantic gap between the encoder and decoder, and thereby improving the clarity of the segmentation boundaries. Finally, we develop a Feature Refinement and Aggregation Fusion (FRAF) module, designed to integrate feature information from various semantic levels, which alleviates discrepancies between features at varying scales, thus enhancing the segmentation accuracy of the network. The generalisation and effectiveness of MBF-Net are validated through comprehensive experiments on a range of tasks, including nuclear segmentation, breast cancer segmentation, polyp segmentation and skin lesion segmentation. Both subjective and objective evaluations demonstrate that MBF-Net significantly outperforms current state-of-the-art methods, achieving average Dice Similarity Coefficient (DSC) and Intersection over Union (IoU) scores of 86.34 % and 78.37 %, respectively. The superior performance of MBF-Net in terms of segmentation accuracy and quality is demonstrated across five public datasets.
医学图像分割作为计算机辅助诊断系统的重要组成部分,在帮助临床医生做出快速准确的决策和制定治疗方案方面起着至关重要的作用。然而,精确的医学图像分割仍然面临许多挑战,包括在有限的样本量下特征提取能力不足,分割边界模糊以及编码器和解码器之间的信息丢失。为了解决这些问题,我们提出了一种具有双向信息交换和特征细化的多尺度边界感知聚合网络(MBF-Net)用于医学图像分割。首先,我们设计了一个多尺度边界感知聚合编码器(MBAE),该编码器在输入图像中聚合来自不同尺度和像素级别的特征,捕获深度特征中的细粒度边界信息,并建立全面的全局和局部多尺度上下文依赖关系。这种设计显著提高了模型对整体图像结构的理解,以及对病灶和背景之间细微差别的识别能力。随后,引入了多尺度双向信息传输(Multi-Scale Bidirectional Information Transmission, MBIT)模块,该模块集成了低级特征和高级特征之间的双向信息流,使多尺度特征能够在不同的层之间双向流动。MBIT模块在跨层信息传输过程中有效地保留了关键的边界细节,从而弥合了编码器和解码器之间的语义鸿沟,从而提高了分割边界的清晰度。最后,我们开发了一个特征细化和聚合融合(FRAF)模块,旨在整合来自不同语义层次的特征信息,从而缓解不同尺度下特征之间的差异,从而提高网络的分割精度。通过核分割、乳腺癌分割、息肉分割和皮肤病变分割等一系列任务的综合实验,验证了MBF-Net的泛化和有效性。主观和客观的评估都表明MBF-Net显著优于当前最先进的方法,平均骰子相似系数(DSC)和交集超过联盟(IoU)得分分别为86.34%和78.37%。在五个公共数据集上证明了MBF-Net在分割精度和质量方面的优越性能。
{"title":"MBF-Net: Multi-scale boundary-aware aggregation for bi-directional information exchange and feature reshaping for medical image segmentation","authors":"Junran Qian ,&nbsp;Xudong Xiang ,&nbsp;Haiyan Li ,&nbsp;Shuhua Ye ,&nbsp;Hongsong Li","doi":"10.1016/j.dsp.2024.104885","DOIUrl":"10.1016/j.dsp.2024.104885","url":null,"abstract":"<div><div>As a critical component of computer-aided diagnosis systems, medical image segmentation plays a vital role in assisting clinicians in making rapid and accurate decisions and formulating treatment plans. Nevertheless, precise medical image segmentation still presents a number of challenges, including insufficient feature extraction capabilities in the presence of limited sample sizes, blurred segmentation boundaries, and information loss between the encoder and decoder. In order to address these issues, we propose a Multi-Scale Boundary-Aware Aggregation Network with Bidirectional Information Exchange and Feature Refinement (MBF-Net) for medical image segmentation. Initially, we design a Multi-Scale Boundary-Aware Aggregation Encoder (MBAE) that aggregates features from different scales and pixel levels within the input images, capturing fine-grained boundary information in deep features and establishing comprehensive global and local multi-scale contextual dependencies. This design significantly enhances the model's understanding of the overall image structure and its ability to discern subtle differences between lesions and background. Subsequently, a Multi-Scale Bidirectional Information Transmission (MBIT) module is introduced, which integrates bidirectional information flow between low-level and high-level features, enabling multi-scale features to flow bidirectionally across different layers. The MBIT module effectively preserves crucial boundary details during cross-layer information transmission, thereby bridging the semantic gap between the encoder and decoder, and thereby improving the clarity of the segmentation boundaries. Finally, we develop a Feature Refinement and Aggregation Fusion (FRAF) module, designed to integrate feature information from various semantic levels, which alleviates discrepancies between features at varying scales, thus enhancing the segmentation accuracy of the network. The generalisation and effectiveness of MBF-Net are validated through comprehensive experiments on a range of tasks, including nuclear segmentation, breast cancer segmentation, polyp segmentation and skin lesion segmentation. Both subjective and objective evaluations demonstrate that MBF-Net significantly outperforms current state-of-the-art methods, achieving average Dice Similarity Coefficient (DSC) and Intersection over Union (IoU) scores of 86.34 % and 78.37 %, respectively. The superior performance of MBF-Net in terms of segmentation accuracy and quality is demonstrated across five public datasets.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104885"},"PeriodicalIF":2.9,"publicationDate":"2024-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142744715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Controllable artificial watermark injected parallel compressive sensing for simultaneous compression-encryption applications 可控人工水印注入并行压缩传感,用于同时压缩加密应用
IF 2.9 3区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2024-11-22 DOI: 10.1016/j.dsp.2024.104859
Xiuling Li , Bo Zhang , Haijian Wei , Qiang Wang , Zhengdong Li
The emerging compressed sensing (CS) enables compression and encryption simultaneously, which is very suitable for the resource-constraint Internet of things (IoT) applications. However, traditional CS-based cryptosystem can not provide efficient resistance to known-plaintext attack (KPA) under the multi-time-sampling (MTS) scenario. A novel CS-based privacy-preserving cryptosystem, called PCS-CAW (parallel CS injected with controllable artificial watermark), for simultaneous compression-encryption applications is proposed. Firstly, the original plaintext is scrambled by the global random permutation (GRP) operation. Then, the novel watermark injected parallel CS (PCS) is developed to re-encrypt and compress the intermediate ciphertext. Since a controllable artificial random watermark is injected into PCS sampling processing, the proposed PCS-CAW cryptosystem provides efficient resistance to KPA under the MTS scenario. In the decoding stage, a distinctive watermark removing strategy is developed. Experiments demonstrate that the proposed cryptosystem can achieve superior security and compression performance than previous CS-based ones.
新兴的压缩传感(CS)可同时实现压缩和加密,非常适合资源受限的物联网(IoT)应用。然而,在多时间采样(MTS)场景下,传统的基于CS的密码系统无法有效抵御已知明文攻击(KPA)。本文提出了一种基于 CS 的新型隐私保护密码系统,称为 PCS-CAW(注入可控人工水印的并行 CS),适用于压缩-加密同步应用。首先,通过全局随机置换(GRP)操作对原始明文进行扰码。然后,开发新型水印注入并行 CS(PCS)来重新加密和压缩中间密文。由于在 PCS 采样处理过程中注入了可控的人工随机水印,因此所提出的 PCS-CAW 密码系统能在 MTS 场景下有效抵抗 KPA。在解码阶段,开发了一种独特的水印去除策略。实验证明,与之前基于 CS 的密码系统相比,所提出的密码系统能实现更优越的安全性和压缩性能。
{"title":"Controllable artificial watermark injected parallel compressive sensing for simultaneous compression-encryption applications","authors":"Xiuling Li ,&nbsp;Bo Zhang ,&nbsp;Haijian Wei ,&nbsp;Qiang Wang ,&nbsp;Zhengdong Li","doi":"10.1016/j.dsp.2024.104859","DOIUrl":"10.1016/j.dsp.2024.104859","url":null,"abstract":"<div><div>The emerging compressed sensing (CS) enables compression and encryption simultaneously, which is very suitable for the resource-constraint Internet of things (IoT) applications. However, traditional CS-based cryptosystem can not provide efficient resistance to known-plaintext attack (KPA) under the multi-time-sampling (MTS) scenario. A novel CS-based privacy-preserving cryptosystem, called PCS-CAW (parallel CS injected with controllable artificial watermark), for simultaneous compression-encryption applications is proposed. Firstly, the original plaintext is scrambled by the global random permutation (GRP) operation. Then, the novel watermark injected parallel CS (PCS) is developed to re-encrypt and compress the intermediate ciphertext. Since a controllable artificial random watermark is injected into PCS sampling processing, the proposed PCS-CAW cryptosystem provides efficient resistance to KPA under the MTS scenario. In the decoding stage, a distinctive watermark removing strategy is developed. Experiments demonstrate that the proposed cryptosystem can achieve superior security and compression performance than previous CS-based ones.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"157 ","pages":"Article 104859"},"PeriodicalIF":2.9,"publicationDate":"2024-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142719617","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Digital Signal Processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1