Digital Signal Processing最新文献_第6页

Intelligent recovery of low-rank sparse tensor for noisy hydroacoustic with use of nonconvex regularization 基于非凸正则化的噪声水声低秩稀疏张量智能恢复

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-18 DOI: 10.1016/j.dsp.2026.105927

Yuhang Mei, Chengming Luo, Jinqing Cao, Zizhuo Liu, Yongshuai Fei, Fantong Kong, Biao Wang

Ocean information perception based on artificial intelligence is driving the innovative advancements in comprehensive sea observation. The underwater acoustic communication, as the neural link for ocean information interconnection, is susceptible to various interferences such as complex ocean environments and unstable communications. Considering the measurement errors caused by noisy hydroacoustic signals, this paper proposes a tensor low-rank sparse representation by nonconvex regularization (TLSRNR) model for hydroacoustic intelligent recovery. Firstly, the hydroacoustic original tensor mapped by multidimensional hydroacoustic data is decomposed into hydroacoustic sparse tensor, and hydroacoustic target tensor obtained by the t-product of hydroacoustic dictionary tensor and coefficient tensor. Secondly, the nonconvex penalty function is introduced to reduce the approximation error in the tubal rank of coefficient tensor, while the inherent deviation of hydroacoustic sparse tensor is solved by smoothly clipped absolute deviation. Thirdly, the alternating direction method of multipliers is employed to solve proposed TLSRNR model efficiently for recovering the hydroacoustic target tensor. Through simulation experiments and platform lake trials, the recovery performance of noisy hydroacoustic data is evaluated under different algorithms, demonstrating that the proposed model achieves superior accuracy and robustness.

基于人工智能的海洋信息感知正在推动海洋综合观测的创新发展。水声通信作为海洋信息互联的神经链路，容易受到复杂海洋环境和通信不稳定等各种干扰。考虑到噪声水声信号引起的测量误差，提出了一种基于非凸正则化（TLSRNR）的张量低秩稀疏表示水声智能恢复模型。首先，将多维水声数据映射的水声原始张量分解为水声稀疏张量和水声字典张量与系数张量的t积得到的水声目标张量；其次，引入非凸惩罚函数来减小系数张量管阶的近似误差，而水声稀疏张量的固有偏差则采用平滑裁剪的绝对偏差来解决；再次，采用乘子交替方向法对所提出的TLSRNR模型进行有效求解，恢复水声目标张量。通过仿真实验和平台湖泊试验，对不同算法下的噪声水声数据恢复性能进行了评价，结果表明该模型具有较好的精度和鲁棒性。

{"title":"Intelligent recovery of low-rank sparse tensor for noisy hydroacoustic with use of nonconvex regularization","authors":"Yuhang Mei, Chengming Luo, Jinqing Cao, Zizhuo Liu, Yongshuai Fei, Fantong Kong, Biao Wang","doi":"10.1016/j.dsp.2026.105927","DOIUrl":"10.1016/j.dsp.2026.105927","url":null,"abstract":"<div><div>Ocean information perception based on artificial intelligence is driving the innovative advancements in comprehensive sea observation. The underwater acoustic communication, as the neural link for ocean information interconnection, is susceptible to various interferences such as complex ocean environments and unstable communications. Considering the measurement errors caused by noisy hydroacoustic signals, this paper proposes a tensor low-rank sparse representation by nonconvex regularization (TLSRNR) model for hydroacoustic intelligent recovery. Firstly, the hydroacoustic original tensor mapped by multidimensional hydroacoustic data is decomposed into hydroacoustic sparse tensor, and hydroacoustic target tensor obtained by the <em>t</em>-product of hydroacoustic dictionary tensor and coefficient tensor. Secondly, the nonconvex penalty function is introduced to reduce the approximation error in the tubal rank of coefficient tensor, while the inherent deviation of hydroacoustic sparse tensor is solved by smoothly clipped absolute deviation. Thirdly, the alternating direction method of multipliers is employed to solve proposed TLSRNR model efficiently for recovering the hydroacoustic target tensor. Through simulation experiments and platform lake trials, the recovery performance of noisy hydroacoustic data is evaluated under different algorithms, demonstrating that the proposed model achieves superior accuracy and robustness.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105927"},"PeriodicalIF":3.0,"publicationDate":"2026-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039209","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Generalized low-rank matrix completion model with overlapping group error representation 具有重叠组误差表示的广义低秩矩阵补全模型

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-17 DOI: 10.1016/j.dsp.2026.105925

Wenjing Lu , Zhuang Fang , Liang Wu , Liming Tang , Hanxin Liu , Chuanjiang He

The low-rank matrix completion (LRMC) technology has achieved remarkable results in low-level visual tasks. There is an underlying assumption that the real-world matrix data is low-rank in LRMC. However, when matrix data do not strictly satisfy the low-rank property, this assumption creates serious challenges for existing matrix recovery methods. Fortunately, there exist feasible schemes that devise appropriate and effective priori representations for describing the intrinsic information of real data. In this paper, we first model the matrix data Y as the sum of a low-rank approximation component

X

and an approximation error component

E

. This finer-grained data decomposition framework allows each component of information to be portrayed more precisely. To effectively characterize the structured error, we design an overlapping group error representation (OGER) function, which captures structured sparsity by modeling locally correlated errors. Finally, we develop an efficient optimization algorithm based on the alternating direction method of multipliers (ADMM), which integrates the majorization-minimization (MM) technique to ensure efficient convergence. We also provide a rigorous theoretical analysis, including a detailed proof of the convexity of the OGER function and the convergence guarantees of our algorithm. In addition, numerical experiment results demonstrate that the proposed model consistently outperforms existing competing models.

低秩矩阵补全（LRMC）技术在低阶视觉任务中取得了显著的效果。有一个潜在的假设，即现实世界的矩阵数据在LRMC中是低秩的。然而，当矩阵数据不严格满足低秩性时，这一假设对现有的矩阵恢复方法提出了严峻的挑战。幸运的是，有可行的方案，设计适当和有效的先验表示来描述真实数据的内在信息。在本文中，我们首先将矩阵数据Y建模为低秩近似分量X和近似误差分量e的和，这种更细粒度的数据分解框架允许更精确地描绘信息的每个分量。为了有效地表征结构误差，我们设计了一个重叠组误差表示（OGER）函数，该函数通过对局部相关误差建模来捕获结构稀疏性。最后，我们提出了一种基于乘法器交替方向法（ADMM）的高效优化算法，该算法集成了极大化-极小化（MM）技术以保证算法的高效收敛。我们还提供了严格的理论分析，包括OGER函数的凸性的详细证明和我们的算法的收敛性保证。此外，数值实验结果表明，该模型优于现有的竞争模型。

{"title":"Generalized low-rank matrix completion model with overlapping group error representation","authors":"Wenjing Lu , Zhuang Fang , Liang Wu , Liming Tang , Hanxin Liu , Chuanjiang He","doi":"10.1016/j.dsp.2026.105925","DOIUrl":"10.1016/j.dsp.2026.105925","url":null,"abstract":"<div><div>The low-rank matrix completion (LRMC) technology has achieved remarkable results in low-level visual tasks. There is an underlying assumption that the real-world matrix data is low-rank in LRMC. However, when matrix data do not strictly satisfy the low-rank property, this assumption creates serious challenges for existing matrix recovery methods. Fortunately, there exist feasible schemes that devise appropriate and effective priori representations for describing the intrinsic information of real data. In this paper, we first model the matrix data <strong>Y</strong> as the sum of a low-rank approximation component <span><math><mi>X</mi></math></span> and an approximation error component <span><math><mi>E</mi></math></span>. This finer-grained data decomposition framework allows each component of information to be portrayed more precisely. To effectively characterize the structured error, we design an overlapping group error representation (OGER) function, which captures structured sparsity by modeling locally correlated errors. Finally, we develop an efficient optimization algorithm based on the alternating direction method of multipliers (ADMM), which integrates the majorization-minimization (MM) technique to ensure efficient convergence. We also provide a rigorous theoretical analysis, including a detailed proof of the convexity of the OGER function and the convergence guarantees of our algorithm. In addition, numerical experiment results demonstrate that the proposed model consistently outperforms existing competing models.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105925"},"PeriodicalIF":3.0,"publicationDate":"2026-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039118","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Unique word orthogonal signal division multiplexing with complex unitary neural network for underwater acoustic communication 基于复杂酉神经网络的水声通信独字正交信分复用

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-17 DOI: 10.1016/j.dsp.2026.105926

Zeyad A.H. Qasem , Xingbin Tu , Chunyi Song , Hamada Esmaiel , Waheb A. Jabbar , Fengzhong Qu

Although orthogonal signal division multiplexing (OSDM) offers improved performance for underwater acoustic communication (UWAC), it still faces two major challenges related to the high peak-to-average power ratio (PAPR) and increased sensitivity to inter-vector interference (IVI). This paper proposes a novel OSDM system, termed precoded unique word OSDM based on unitary neural network (UW-OSDM-UNN), to address these issues effectively. The proposed scheme embeds the guard interval within the fast Fourier transform duration to mitigate inter-symbol interference and employs a UNN-based precoder at the transmitter to reduce PAPR and significantly overcome the IVI sensitivity. The UNN-based transmitter is completely independent of the UWAC channel, eliminating the need for receiver-side training or additional testing-stage training. Furthermore, zero vectors and frequency-shifted Chu sequences are incorporated to enable robust Doppler shift estimation and multipath compensation, respectively. The Chu sequences are inserted in the frequency domain to generate deterministic sequences within the guard interval without introducing additional inter-symbol interference. The system is validated through both simulations and real-world sea trials over a 300-meter underwater connection. Results show that the proposed scheme achieves up to a 4 dB PAPR reduction, a 5 dB improvement in bit error rate (BER), and superior robustness against challenging UWAC channel conditions compared to state-of-the-art OSDM-based systems.

尽管正交信号分复用（OSDM）为水声通信（UWAC）提供了更好的性能，但它仍然面临着两个主要挑战，即高峰值平均功率比（PAPR）和对矢量间干扰（IVI）的灵敏度增加。本文提出了一种新的基于统一神经网络的预编码唯一字OSDM系统（UW-OSDM-UNN）来有效地解决这些问题。该方案在快速傅里叶变换持续时间内嵌入保护间隔以减轻符号间干扰，并在发射机处采用基于unn的预编码器来降低PAPR并显著克服IVI灵敏度。基于unn的发射机完全独立于UWAC信道，消除了接收机侧训练或额外测试阶段训练的需要。此外，零矢量和频移Chu序列分别用于鲁棒多普勒频移估计和多径补偿。在不引入额外符号间干扰的情况下，将Chu序列插入频域以在保护区间内生成确定性序列。该系统通过模拟和现实世界中300米水下连接的海上试验进行了验证。结果表明，与最先进的基于osdm的系统相比，该方案实现了高达4 dB的PAPR降低，5 dB的误码率（BER）提高，以及对具有挑战性的UWAC信道条件的卓越鲁棒性。

{"title":"Unique word orthogonal signal division multiplexing with complex unitary neural network for underwater acoustic communication","authors":"Zeyad A.H. Qasem , Xingbin Tu , Chunyi Song , Hamada Esmaiel , Waheb A. Jabbar , Fengzhong Qu","doi":"10.1016/j.dsp.2026.105926","DOIUrl":"10.1016/j.dsp.2026.105926","url":null,"abstract":"<div><div>Although orthogonal signal division multiplexing (OSDM) offers improved performance for underwater acoustic communication (UWAC), it still faces two major challenges related to the high peak-to-average power ratio (PAPR) and increased sensitivity to inter-vector interference (IVI). This paper proposes a novel OSDM system, termed precoded unique word OSDM based on unitary neural network (UW-OSDM-UNN), to address these issues effectively. The proposed scheme embeds the guard interval within the fast Fourier transform duration to mitigate inter-symbol interference and employs a UNN-based precoder at the transmitter to reduce PAPR and significantly overcome the IVI sensitivity. The UNN-based transmitter is completely independent of the UWAC channel, eliminating the need for receiver-side training or additional testing-stage training. Furthermore, zero vectors and frequency-shifted Chu sequences are incorporated to enable robust Doppler shift estimation and multipath compensation, respectively. The Chu sequences are inserted in the frequency domain to generate deterministic sequences within the guard interval without introducing additional inter-symbol interference. The system is validated through both simulations and real-world sea trials over a 300-meter underwater connection. Results show that the proposed scheme achieves up to a 4 dB PAPR reduction, a 5 dB improvement in bit error rate (BER), and superior robustness against challenging UWAC channel conditions compared to state-of-the-art OSDM-based systems.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105926"},"PeriodicalIF":3.0,"publicationDate":"2026-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079694","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robust time-frequency preamble detection for LoRa-modulated signals using optimized generalized likelihood ratio test 基于优化广义似然比检验的lora调制信号鲁棒时频前导检测

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-16 DOI: 10.1016/j.dsp.2026.105892

Nan Chen, Xia Liu, Huafeng Wu

This paper proposes a robust preamble detection algorithm for long range radio signals. The method integrates the short-time Fourier transform (STFT) with a generalized likelihood ratio test (GLRT), detecting signals via coherent integration along an estimated time-frequency trajectory. First, a binary hypothesis testing framework is established based on the time-frequency characteristics of the LoRa preamble to discriminate between signal and noise. Then, an STFT with optimized window parameters is adopted to extract time-frequency features. To address the uncertainty of the preamble’s starting position, a discrete time-frequency path model is introduced. By exploiting the known linear frequency modulation structure and optimized window parameters, a discretized grid path is constructed in the time-frequency domain to estimate the signal trajectory. Sliding coherent accumulation is then performed along these paths to form the GLRT statistic. Theoretical analysis shows that the STFT coefficients follow a chi-square distribution under noise-only conditions and a non-central chi-square distribution in the presence of a signal. Based on this, the probability distributions of the coherent accumulated value and the test statistic are derived. Finally, an adaptive threshold computation method is also proposed to optimally balance the detection probability and false alarm rate. Simulations are conducted under various spreading factors, preamble lengths, and carrier frequency offsets. Results indicate that the proposed GLRT detector improves detection probability by about 25% and synchronization accuracy by about 17% in low-SNR scenarios, compared with conventional methods.

提出了一种鲁棒的远程无线电信号前导检测算法。该方法将短时傅里叶变换（STFT）与广义似然比检验（GLRT）相结合，沿估计的时频轨迹通过相干积分检测信号。首先，基于LoRa前导的时频特性，建立二元假设检验框架，区分信号和噪声；然后，采用优化窗口参数的STFT提取时频特征；为了解决前导起始位置的不确定性，引入了离散时频路径模型。利用已知的线性调频结构和优化的窗口参数，在时频域构造离散网格路径来估计信号轨迹。然后沿着这些路径进行滑动相干累积，形成GLRT统计量。理论分析表明，STFT系数在无噪声条件下服从卡方分布，在有信号时服从非中心卡方分布。在此基础上，推导了相干累积值的概率分布和检验统计量。最后，提出了一种自适应阈值计算方法，以最优平衡检测概率和虚警率。在不同的扩频因子、前导长度和载波频偏下进行了仿真。结果表明，在低信噪比情况下，与传统方法相比，所提出的GLRT检测器的检测概率提高了约25%，同步精度提高了约17%。

{"title":"Robust time-frequency preamble detection for LoRa-modulated signals using optimized generalized likelihood ratio test","authors":"Nan Chen, Xia Liu, Huafeng Wu","doi":"10.1016/j.dsp.2026.105892","DOIUrl":"10.1016/j.dsp.2026.105892","url":null,"abstract":"<div><div>This paper proposes a robust preamble detection algorithm for long range radio signals. The method integrates the short-time Fourier transform (STFT) with a generalized likelihood ratio test (GLRT), detecting signals via coherent integration along an estimated time-frequency trajectory. First, a binary hypothesis testing framework is established based on the time-frequency characteristics of the LoRa preamble to discriminate between signal and noise. Then, an STFT with optimized window parameters is adopted to extract time-frequency features. To address the uncertainty of the preamble’s starting position, a discrete time-frequency path model is introduced. By exploiting the known linear frequency modulation structure and optimized window parameters, a discretized grid path is constructed in the time-frequency domain to estimate the signal trajectory. Sliding coherent accumulation is then performed along these paths to form the GLRT statistic. Theoretical analysis shows that the STFT coefficients follow a chi-square distribution under noise-only conditions and a non-central chi-square distribution in the presence of a signal. Based on this, the probability distributions of the coherent accumulated value and the test statistic are derived. Finally, an adaptive threshold computation method is also proposed to optimally balance the detection probability and false alarm rate. Simulations are conducted under various spreading factors, preamble lengths, and carrier frequency offsets. Results indicate that the proposed GLRT detector improves detection probability by about 25% and synchronization accuracy by about 17% in low-SNR scenarios, compared with conventional methods.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105892"},"PeriodicalIF":3.0,"publicationDate":"2026-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039111","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Wideband DOA estimation based on time-domain energy focusing 基于时域能量聚焦的宽带DOA估计

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-15 DOI: 10.1016/j.dsp.2026.105903

Yuxiang Jiang , Qing Shen , Kejiang Wu , Zexiang Zhang , Chenxi Liao , Shuyuan Xu

Wideband DOA estimation has become a significant concern in communication, navigation, and radar systems. Previous approaches employed the frequency-domain focusing method to alleviate the wideband impact, but it was constrained by its reliance on prior DOA knowledge. The time-domain wideband DOA estimation methods have also been explored, but often suffered from high-dimensional complexity. This work introduces a time-domain energy focusing (TDEF) scheme that leverages the known waveform and eliminates the reliance on prior DOA information and reduce the high-dimensional complexity. TDEF consists of multi-channel matched filtering and joint power-peak detection. The former concentrates signal energy in the time domain, while the latter mitigates peak migration induced by the wideband scenario. Through this process, the wideband scenario is transformed into an equivalent narrowband counterpart, enabling the application of narrowband DOA estimation techniques. Using matrix-perturbation analysis, we establish the theoretically asymptotic MSE equivalence between TDEF scheme and frequency-domain focusing. The numerical simulations show that the TDEF-based method achieves asymptotic performance approaching the CRLB without prior DOA information, improved resolution for closely spaced sources with different TOAs, and lower computational complexity, especially compared to time-domian sparsity-recovery methods.

宽带DOA估计已成为通信、导航和雷达系统中一个重要的问题。以往的方法采用频域聚焦方法来减轻宽带影响，但由于依赖于先验的DOA知识而受到限制。对时域宽带DOA估计方法也进行了研究，但往往存在高维复杂度的问题。本文引入了一种时域能量聚焦（TDEF）方案，该方案利用已知波形，消除了对先前DOA信息的依赖，降低了高维复杂度。TDEF由多通道匹配滤波和联合功率峰值检测组成。前者将信号能量集中在时域，而后者减轻了宽带场景引起的峰值迁移。通过这个过程，将宽带场景转换为等效的窄带场景，从而实现窄带DOA估计技术的应用。利用矩阵摄动分析，建立了TDEF格式与频域聚焦之间的理论渐近MSE等价。数值模拟结果表明，与时域稀疏恢复方法相比，基于tdefs的方法在没有先验DOA信息的情况下具有接近CRLB的渐近性能，提高了具有不同toa的紧密间隔源的分辨率，并且降低了计算复杂度。

{"title":"Wideband DOA estimation based on time-domain energy focusing","authors":"Yuxiang Jiang , Qing Shen , Kejiang Wu , Zexiang Zhang , Chenxi Liao , Shuyuan Xu","doi":"10.1016/j.dsp.2026.105903","DOIUrl":"10.1016/j.dsp.2026.105903","url":null,"abstract":"<div><div>Wideband DOA estimation has become a significant concern in communication, navigation, and radar systems. Previous approaches employed the frequency-domain focusing method to alleviate the wideband impact, but it was constrained by its reliance on prior DOA knowledge. The time-domain wideband DOA estimation methods have also been explored, but often suffered from high-dimensional complexity. This work introduces a time-domain energy focusing (TDEF) scheme that leverages the known waveform and eliminates the reliance on prior DOA information and reduce the high-dimensional complexity. TDEF consists of multi-channel matched filtering and joint power-peak detection. The former concentrates signal energy in the time domain, while the latter mitigates peak migration induced by the wideband scenario. Through this process, the wideband scenario is transformed into an equivalent narrowband counterpart, enabling the application of narrowband DOA estimation techniques. Using matrix-perturbation analysis, we establish the theoretically asymptotic MSE equivalence between TDEF scheme and frequency-domain focusing. The numerical simulations show that the TDEF-based method achieves asymptotic performance approaching the CRLB without prior DOA information, improved resolution for closely spaced sources with different TOAs, and lower computational complexity, especially compared to time-domian sparsity-recovery methods.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105903"},"PeriodicalIF":3.0,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039207","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

SWaRaA: A multi-modal deep learning framework for the diagnosis and classification of respiratory diseases using medical acoustic representations SWaRaA：一个多模态深度学习框架，用于使用医学声学表示诊断和分类呼吸系统疾病

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-15 DOI: 10.1016/j.dsp.2025.105861

Panigrahi Srikanth, Chandan Kumar Behera

Audio-based diagnostics are rapidly emerging as non-invasive and accessible tools for identifying respiratory diseases. Medical acoustic signals such as coughs, breaths, and lung sounds carry clinically relevant information with strong potential for disease detection and monitoring. In this context, we introduce SWaRaA, a novel multi-modal deep learning framework that leverages the complementary characteristics of two distinct types of respiratory sound representations. The framework integrates Mel-spectrogram-based image features and Wav2Vec 2.0 embeddings of medical acoustic signals to enhance classification accuracy by capturing both spectral and contextual information. SWaRaA consists of two parallel processing pathways. The first extracts spectral-temporal features using a proposed lightweight CNN-Transformer model comprising Depth-Wise Separable Convolution (DSC), Parallel Convolution Series (PCS), Serial Convolution Series (SCS), and Transformer blocks (TR). The second processes raw acoustic signals through the Wav2Vec 2.0 model to capture deep contextual and temporal features. These representations are fused through a dedicated integration module and passed to a classification head for final prediction. The proposed framework effectively captures both local and long-range dependencies, enabling robust respiratory disease classification. Through extensive experiments across three benchmark datasets and 15 medical acoustic tasks, we establish SWaRaA as a state-of-the-art multi-modal acoustic classification model, offering a scalable and high-performance solution for real-world healthcare applications.

基于音频的诊断正在迅速成为识别呼吸道疾病的非侵入性和可获得的工具。咳嗽、呼吸、肺音等医学声信号携带临床相关信息，具有很强的疾病检测和监测潜力。在此背景下，我们介绍了一种新的多模态深度学习框架SWaRaA，它利用了两种不同类型的呼吸声音表征的互补特征。该框架集成了基于mel光谱图的图像特征和医学声学信号的Wav2Vec 2.0嵌入，通过捕获光谱和上下文信息来提高分类精度。SWaRaA由两条并行处理路径组成。第一种方法是使用轻量级CNN-Transformer模型提取光谱-时间特征，该模型包括深度可分离卷积（DSC）、并行卷积系列（PCS）、串行卷积系列（SCS）和Transformer块（TR）。第二种方法是通过Wav2Vec 2.0模型处理原始声音信号，以捕获深层的上下文和时间特征。这些表示通过专用的集成模块进行融合，并传递给分类头进行最终预测。所提出的框架有效地捕获了本地和远程依赖关系，实现了稳健的呼吸系统疾病分类。通过对三个基准数据集和15个医学声学任务的广泛实验，我们建立了SWaRaA作为最先进的多模态声学分类模型，为现实世界的医疗保健应用提供了可扩展的高性能解决方案。

{"title":"SWaRaA: A multi-modal deep learning framework for the diagnosis and classification of respiratory diseases using medical acoustic representations","authors":"Panigrahi Srikanth, Chandan Kumar Behera","doi":"10.1016/j.dsp.2025.105861","DOIUrl":"10.1016/j.dsp.2025.105861","url":null,"abstract":"<div><div>Audio-based diagnostics are rapidly emerging as non-invasive and accessible tools for identifying respiratory diseases. Medical acoustic signals such as coughs, breaths, and lung sounds carry clinically relevant information with strong potential for disease detection and monitoring. In this context, we introduce SWaRaA, a novel multi-modal deep learning framework that leverages the complementary characteristics of two distinct types of respiratory sound representations. The framework integrates Mel-spectrogram-based image features and Wav2Vec 2.0 embeddings of medical acoustic signals to enhance classification accuracy by capturing both spectral and contextual information. SWaRaA consists of two parallel processing pathways. The first extracts spectral-temporal features using a proposed lightweight CNN-Transformer model comprising Depth-Wise Separable Convolution (DSC), Parallel Convolution Series (PCS), Serial Convolution Series (SCS), and Transformer blocks (TR). The second processes raw acoustic signals through the Wav2Vec 2.0 model to capture deep contextual and temporal features. These representations are fused through a dedicated integration module and passed to a classification head for final prediction. The proposed framework effectively captures both local and long-range dependencies, enabling robust respiratory disease classification. Through extensive experiments across three benchmark datasets and 15 medical acoustic tasks, we establish SWaRaA as a state-of-the-art multi-modal acoustic classification model, offering a scalable and high-performance solution for real-world healthcare applications.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105861"},"PeriodicalIF":3.0,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146079692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Performance of HEVC video coding for delivery over IP networks HEVC视频编码在IP网络上传输的性能

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-15 DOI: 10.1016/j.dsp.2026.105911

Khalid Abdullah M. Salih , Ismail Amin Ali , Ramadhan J. Mstafa

Efficient video streaming over IP networks faces significant challenges due to packet loss and network congestion, particularly when using User Datagram Protocol (UDP), which lacks inherent error correction mechanisms. This study provides a comprehensive framework for selecting HEVC encoding configurations based on motion content and network condition. The paper evaluates the packet loss resilience of various HEVC encoding configurations across video content with high-motion, intermediate-motion, and low-motion activity. Utilizing UDP streaming in conjunction with the MPEG Transport Stream (MPEG-TS) container, video quality was quantified using the Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM) under packet loss rates of up to 1.0%. Three HEVC encoding configurations IPPP, periodic I, and periodic IDR were assessed. The results indicate that periodic IDR, with its closed GOP structure, achieves the highest resilience to packet loss, rendering it ideal for unreliable networks. Specifically, for high-motion video content, periodic IDR limited PSNR degradation to 6.97 dB (from 28.78 dB to 21.87 dB) under a 0.5% packet loss rate. For intermediate-motion content (Mobcal), PSNR decreased by 9.26 dB (from 34.85 dB to 25.23 dB), and for low-motion content (FourPeople), PSNR degraded by 6.96 dB (from 40.87 dB to 33.91 dB), consistently outperforming the other configurations. In contrast, periodic I demonstrated moderate resilience, with PSNR degradation of 9.6 dB for high-motion content, up to 14.36 dB for intermediate-motion content, and approximately 11.46 dB for low-motion content. The IPPP configuration exhibited the greatest vulnerability, with PSNR degradations of 12.66 dB, 18.7 dB, and 11.95 dB for Crowd_run, Mobcal, and FourPeople, respectively, due to extensive error propagation inherent in its open GOP structure. The findings advance the understanding of error resilience in video compression and offer practical guidelines for maximizing video quality in real-world streaming scenarios over lossy IP networks.

由于数据包丢失和网络拥塞，IP网络上的高效视频流面临着巨大的挑战，特别是当使用用户数据报协议（UDP）时，它缺乏固有的纠错机制。本研究提供了一个基于运动内容和网络条件选择HEVC编码配置的综合框架。本文评估了各种HEVC编码配置在高运动、中运动和低运动视频内容中的丢包弹性。将UDP流与MPEG传输流（MPEG- ts）容器结合使用，在丢包率高达1.0%的情况下，使用峰值信噪比（PSNR）和结构相似性指数度量（SSIM）对视频质量进行量化。评估了三种HEVC编码配置IPPP、周期性I和周期性IDR。结果表明，周期IDR具有封闭的GOP结构，具有最高的丢包弹性，是不可靠网络的理想选择。具体来说，对于高运动视频内容，在丢包率为0.5%的情况下，周期性IDR将PSNR降至6.97 dB（从28.78 dB降至21.87 dB）。对于中运动内容（Mobcal）， PSNR下降了9.26 dB（从34.85 dB下降到25.23 dB），对于低运动内容（FourPeople）， PSNR下降了6.96 dB（从40.87 dB下降到33.91 dB），始终优于其他配置。相比之下，周期I表现出中等的弹性，高运动内容的PSNR下降为9.6 dB，中运动内容的PSNR下降为14.36 dB，低运动内容的PSNR下降约为11.46 dB。IPPP配置的漏洞最大，Crowd_run、Mobcal和FourPeople的PSNR分别下降了12.66 dB、18.7 dB和11.95 dB，这是由于其开放GOP结构固有的广泛错误传播。这些发现促进了对视频压缩中的错误恢复能力的理解，并为在有损IP网络上的真实流场景中最大限度地提高视频质量提供了实用指南。

{"title":"Performance of HEVC video coding for delivery over IP networks","authors":"Khalid Abdullah M. Salih , Ismail Amin Ali , Ramadhan J. Mstafa","doi":"10.1016/j.dsp.2026.105911","DOIUrl":"10.1016/j.dsp.2026.105911","url":null,"abstract":"<div><div>Efficient video streaming over IP networks faces significant challenges due to packet loss and network congestion, particularly when using User Datagram Protocol (UDP), which lacks inherent error correction mechanisms. This study provides a comprehensive framework for selecting HEVC encoding configurations based on motion content and network condition. The paper evaluates the packet loss resilience of various HEVC encoding configurations across video content with high-motion, intermediate-motion, and low-motion activity. Utilizing UDP streaming in conjunction with the MPEG Transport Stream (MPEG-TS) container, video quality was quantified using the Peak Signal-to-Noise Ratio (PSNR) and Structural Similarity Index Measure (SSIM) under packet loss rates of up to 1.0%. Three HEVC encoding configurations <em>IPPP, periodic I</em>, and <em>periodic IDR</em> were assessed. The results indicate that <em>periodic IDR</em>, with its closed GOP structure, achieves the highest resilience to packet loss, rendering it ideal for unreliable networks. Specifically, for high-motion video content, <em>periodic IDR</em> limited PSNR degradation to 6.97 dB (from 28.78 dB to 21.87 dB) under a 0.5% packet loss rate. For intermediate-motion content (Mobcal), PSNR decreased by 9.26 dB (from 34.85 dB to 25.23 dB), and for low-motion content (FourPeople), PSNR degraded by 6.96 dB (from 40.87 dB to 33.91 dB), consistently outperforming the other configurations. In contrast, <em>periodic I</em> demonstrated moderate resilience, with PSNR degradation of 9.6 dB for high-motion content, up to 14.36 dB for intermediate-motion content, and approximately 11.46 dB for low-motion content. The <em>IPPP</em> configuration exhibited the greatest vulnerability, with PSNR degradations of 12.66 dB, 18.7 dB, and 11.95 dB for Crowd_run, Mobcal, and FourPeople, respectively, due to extensive error propagation inherent in its open GOP structure. The findings advance the understanding of error resilience in video compression and offer practical guidelines for maximizing video quality in real-world streaming scenarios over lossy IP networks.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105911"},"PeriodicalIF":3.0,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039210","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Model selection method based on the neural networks for signal processing 基于神经网络的模型选择方法进行信号处理

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-14 DOI: 10.1016/j.dsp.2026.105908

Z.M. Kurdoshev , E.A. Pchelintsev

The paper considers the optimal filtering of square integrable signals in Gaussian noise of small intensity. The problem is studied under the condition that the observed process is available only at discrete time moments. This study aims to develop an automated and data-driven model selection procedure (MSP) based on sharp oracle inequalities for optimal estimation of an unknown signal by determining the best combination of smoothness parameters that minimizes the mean square error. We propose a novel hybrid neural network architecture that combines statistical estimation theory with deep learning. A dedicated neural MSP layer is designed to generate a wide range of potential parameter combinations. For each combination, a weighted least squares estimate of the signal is calculated. A gateway network, inspired by the mixture of experts paradigm, is then used to dynamically select the most accurate estimate from this set of candidates. The entire system is trained on a variety of synthetic datasets of clean and noisy signal pairs containing different waveforms, using the mean square error. The proposed MSP demonstrates high performance over a wide range of noise levels. The mean square error for elementary signals remained below 0.5 even in high-noise scenarios. The method also proved to be robust for complex signal combinations, hybrid waveforms, ECG and CWRU signals, successfully reconstructing them with satisfactory accuracy. The gating network effectively learned to set optimal parameters by continuously selecting values within stable ranges. The developed MSP-NN system provides a robust automated solution for nonparametric signals estimation from noisy discrete observations. It successfully bridges the gap between theoretical statistical efficiency and practical application by automating the important and previously manual step of parameter selection. This work paves the way for the development of intelligent data-driven signal processing systems that can operate reliably in the presence of noise uncertainty.

研究了小强度高斯噪声中平方可积信号的最优滤波问题。在观测过程只在离散时刻可用的条件下，研究了该问题。本研究旨在开发一种基于尖锐oracle不等式的自动化数据驱动模型选择程序（MSP），通过确定平滑参数的最佳组合来最小化均方误差，从而对未知信号进行最佳估计。我们提出了一种新的混合神经网络架构，将统计估计理论与深度学习相结合。设计了一个专用的神经网络MSP层来生成广泛的潜在参数组合。对于每个组合，计算信号的加权最小二乘估计。然后使用一个网关网络，受混合专家范式的启发，从这组候选者中动态选择最准确的估计。整个系统在包含不同波形的干净和噪声信号对的各种合成数据集上进行训练，使用均方误差。建议的MSP在广泛的噪音水平范围内表现出高性能。即使在高噪声情况下，基本信号的均方误差也保持在0.5以下。该方法对复杂的信号组合、混合波形、心电和CWRU信号也具有较好的鲁棒性，并成功地以令人满意的精度重建了它们。门控网络通过连续选择稳定范围内的值，有效地学习设置最优参数。所开发的MSP-NN系统为从噪声离散观测中估计非参数信号提供了鲁棒的自动化解决方案。它成功地通过自动化重要的和以前手动的参数选择步骤，弥合了理论统计效率和实际应用之间的差距。这项工作为智能数据驱动信号处理系统的发展铺平了道路，该系统可以在存在噪声不确定性的情况下可靠地运行。

{"title":"Model selection method based on the neural networks for signal processing","authors":"Z.M. Kurdoshev , E.A. Pchelintsev","doi":"10.1016/j.dsp.2026.105908","DOIUrl":"10.1016/j.dsp.2026.105908","url":null,"abstract":"<div><div>The paper considers the optimal filtering of square integrable signals in Gaussian noise of small intensity. The problem is studied under the condition that the observed process is available only at discrete time moments. This study aims to develop an automated and data-driven model selection procedure (MSP) based on sharp oracle inequalities for optimal estimation of an unknown signal by determining the best combination of smoothness parameters that minimizes the mean square error. We propose a novel hybrid neural network architecture that combines statistical estimation theory with deep learning. A dedicated neural MSP layer is designed to generate a wide range of potential parameter combinations. For each combination, a weighted least squares estimate of the signal is calculated. A gateway network, inspired by the mixture of experts paradigm, is then used to dynamically select the most accurate estimate from this set of candidates. The entire system is trained on a variety of synthetic datasets of clean and noisy signal pairs containing different waveforms, using the mean square error. The proposed MSP demonstrates high performance over a wide range of noise levels. The mean square error for elementary signals remained below 0.5 even in high-noise scenarios. The method also proved to be robust for complex signal combinations, hybrid waveforms, ECG and CWRU signals, successfully reconstructing them with satisfactory accuracy. The gating network effectively learned to set optimal parameters by continuously selecting values within stable ranges. The developed MSP-NN system provides a robust automated solution for nonparametric signals estimation from noisy discrete observations. It successfully bridges the gap between theoretical statistical efficiency and practical application by automating the important and previously manual step of parameter selection. This work paves the way for the development of intelligent data-driven signal processing systems that can operate reliably in the presence of noise uncertainty.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105908"},"PeriodicalIF":3.0,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039112","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

HAIR-GLMB: Hybrid appearance-IoU reinforced GLMB filter for UAV-based multi-target tracking HAIR-GLMB：用于无人机多目标跟踪的混合appearance-IoU增强GLMB滤波器

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-13 DOI: 10.1016/j.dsp.2026.105906

Haiyi Tong, Dekang Zhu, Zhou Zhang

This paper presents HAIR-GLMB, a Hybrid Appearance and IoU Reinforced Generalized Labeled Multi-Bernoulli (GLMB) filter tailored for multi-target tracking in challenging unmanned aerial vehicle (UAV) scenarios. To address frequent association ambiguities caused by dense target distributions, we propose an adaptive hybrid cost matrix that integrates Intersection-over-Union (IoU) spatial cues with appearance similarity. Specifically, an entropy-based adaptive weighting mechanism dynamically balances spatial and appearance information, thereby enhancing association reliability. We further develop a reinforced likelihood computation within the GLMB recursion, explicitly embedding spatial and appearance information into the update process. A motion-aware adaptive survival probability model is also proposed, effectively sustaining track continuity for inward-moving targets near the boundaries of the camera’s field of view. To improve efficiency, the Gibbs sampler is initialized with an assignment obtained by the Hungarian algorithm on the hybrid cost matrix, placing the Markov chain near high-probability regions and reducing sampling overhead under a limited computational budget. Experiments on challenging UAV benchmarks (VisDrone2019, UAVDT) show that HAIR-GLMB consistently outperforms a GLMB baseline relying only on IoU, yielding higher tracking accuracy, fewer identity switches, and reduced fragmentation.

本文提出了HAIR-GLMB滤波器，这是一种专为具有挑战性的无人机场景中的多目标跟踪而设计的混合外观和IoU增强广义标记多伯努利（GLMB）滤波器。为了解决密集目标分布引起的频繁关联模糊，我们提出了一个自适应混合成本矩阵，该矩阵将交叉-超联合（IoU）空间线索与外观相似性集成在一起。具体而言，基于熵的自适应加权机制动态平衡空间和外观信息，从而提高关联的可靠性。我们进一步在GLMB递归中开发了强化的似然计算，明确地将空间和外观信息嵌入到更新过程中。提出了一种运动感知自适应生存概率模型，有效地维持了摄像机视场边界附近向内运动目标的轨迹连续性。为了提高效率，Gibbs采样器使用匈牙利算法在混合代价矩阵上得到的赋值进行初始化，将马尔可夫链放置在高概率区域附近，在有限的计算预算下减少采样开销。在具有挑战性的无人机基准测试（VisDrone2019， UAVDT）上进行的实验表明，HAIR-GLMB始终优于仅依赖IoU的GLMB基线，具有更高的跟踪精度，更少的身份切换和更少的碎片化。

{"title":"HAIR-GLMB: Hybrid appearance-IoU reinforced GLMB filter for UAV-based multi-target tracking","authors":"Haiyi Tong, Dekang Zhu, Zhou Zhang","doi":"10.1016/j.dsp.2026.105906","DOIUrl":"10.1016/j.dsp.2026.105906","url":null,"abstract":"<div><div>This paper presents HAIR-GLMB, a Hybrid Appearance and IoU Reinforced Generalized Labeled Multi-Bernoulli (GLMB) filter tailored for multi-target tracking in challenging unmanned aerial vehicle (UAV) scenarios. To address frequent association ambiguities caused by dense target distributions, we propose an adaptive hybrid cost matrix that integrates Intersection-over-Union (IoU) spatial cues with appearance similarity. Specifically, an entropy-based adaptive weighting mechanism dynamically balances spatial and appearance information, thereby enhancing association reliability. We further develop a reinforced likelihood computation within the GLMB recursion, explicitly embedding spatial and appearance information into the update process. A motion-aware adaptive survival probability model is also proposed, effectively sustaining track continuity for inward-moving targets near the boundaries of the camera’s field of view. To improve efficiency, the Gibbs sampler is initialized with an assignment obtained by the Hungarian algorithm on the hybrid cost matrix, placing the Markov chain near high-probability regions and reducing sampling overhead under a limited computational budget. Experiments on challenging UAV benchmarks (VisDrone2019, UAVDT) show that HAIR-GLMB consistently outperforms a GLMB baseline relying only on IoU, yielding higher tracking accuracy, fewer identity switches, and reduced fragmentation.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105906"},"PeriodicalIF":3.0,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145981795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

A true target signal extraction method for defending against dense false target jamming in multistatic radar systems 多基地雷达系统中防御密集假目标干扰的真目标信号提取方法

IF 3 3区工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC

Digital Signal Processing

Pub Date : 2026-01-13 DOI: 10.1016/j.dsp.2026.105910

Dingli Lou, Tuo Fu, Defeng Chen, Huawei Cao

Dense false target jamming (DFTJ) is a typical form of active jamming that generates numerous false targets along the radar line of sight, significantly degrading the detection and tracking performance of radar systems. In multistatic radar systems with spatially separated receivers, jamming signals originating from the same source become highly correlated across various receivers after compensating for their delay and Doppler frequency differences, whereas true target echoes remain weakly correlated because of varying observation geometries. On the basis of these differences, we propose a method for extracting true target signals from jammed echoes. First, the jamming signals are aligned across different receivers by compensating for their amplitude, delay, and Doppler frequency differences. The compensated and pulse-compressed echoes are then stacked into a signal matrix, where the false targets remain nearly invariant across different columns and thus form a low-rank component, while the true targets exhibit amplitude, delay, and Doppler frequency variations, manifesting as sparse high-rank components. Based on this structural distinction, we formulate a robust principal component analysis problem for extracting the true target signals and solve it using the block coordinate descent approach. To satisfy real-time processing demands, we further develop a sequential processing-based version of the proposed method. The numerical simulation results demonstrate the effectiveness of the proposed method, which shows stable performance under different DFTJ strategies, jamming parameters and target characteristics.

密集假目标干扰（DFTJ）是一种典型的有源干扰形式，它会在雷达瞄准线沿线产生大量假目标，严重影响雷达系统的探测和跟踪性能。在具有空间分离接收机的多基地雷达系统中，来自同一源的干扰信号在补偿延迟和多普勒频率差异后，在不同接收机之间变得高度相关，而真实目标回波由于观测几何形状的变化而保持弱相关。基于这些差异，我们提出了一种从干扰回波中提取真实目标信号的方法。首先，通过补偿干扰信号的幅度、延迟和多普勒频率差异，使干扰信号在不同的接收器上对齐。然后将经过补偿和脉冲压缩的回波叠加到信号矩阵中，其中假目标在不同列之间保持几乎不变，从而形成低秩分量，而真实目标表现出幅度、延迟和多普勒频率变化，表现为稀疏的高秩分量。基于这种结构上的区别，我们提出了一个鲁棒的主成分分析问题，用于提取真实目标信号，并使用块坐标下降法进行求解。为了满足实时处理需求，我们进一步开发了基于顺序处理的方法。数值仿真结果验证了该方法的有效性，在不同的DFTJ策略、干扰参数和目标特性下均表现出稳定的性能。

{"title":"A true target signal extraction method for defending against dense false target jamming in multistatic radar systems","authors":"Dingli Lou, Tuo Fu, Defeng Chen, Huawei Cao","doi":"10.1016/j.dsp.2026.105910","DOIUrl":"10.1016/j.dsp.2026.105910","url":null,"abstract":"<div><div>Dense false target jamming (DFTJ) is a typical form of active jamming that generates numerous false targets along the radar line of sight, significantly degrading the detection and tracking performance of radar systems. In multistatic radar systems with spatially separated receivers, jamming signals originating from the same source become highly correlated across various receivers after compensating for their delay and Doppler frequency differences, whereas true target echoes remain weakly correlated because of varying observation geometries. On the basis of these differences, we propose a method for extracting true target signals from jammed echoes. First, the jamming signals are aligned across different receivers by compensating for their amplitude, delay, and Doppler frequency differences. The compensated and pulse-compressed echoes are then stacked into a signal matrix, where the false targets remain nearly invariant across different columns and thus form a low-rank component, while the true targets exhibit amplitude, delay, and Doppler frequency variations, manifesting as sparse high-rank components. Based on this structural distinction, we formulate a robust principal component analysis problem for extracting the true target signals and solve it using the block coordinate descent approach. To satisfy real-time processing demands, we further develop a sequential processing-based version of the proposed method. The numerical simulation results demonstrate the effectiveness of the proposed method, which shows stable performance under different DFTJ strategies, jamming parameters and target characteristics.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"173 ","pages":"Article 105910"},"PeriodicalIF":3.0,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146039120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0