首页 > 最新文献

IEEE Signal Processing Letters最新文献

英文 中文
On the Performance of Discrete Papoulis–Gerchberg Type Iterative Reconstruction 离散Papoulis-Gerchberg型迭代重构的性能研究
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-21 DOI: 10.1109/LSP.2026.3656465
Li Cho;Shi-Wei Chang
The Papoulis–Gerchberg iterative reconstruction (PGIR) algorithm has been widely applied across diverse signal processing tasks, and reliable performance prediction during the system design stage is crucial for ensuring its effectiveness. However, predicting PGIR performance for arbitrary signal lengths and observation patterns has long been computationally intractable due to the combinatorial explosion of possible configurations. This letter addresses the problem by analyzing convergence conditions and modeling reconstruction error distributions in both noise-free and noisy scenarios. The derived closed-form probability laws enable accurate prediction for individual geometries, and the observed concentration of the operator's spectral radius with increasing signal length further allows performance characterization based only on loss and knowledge ratios. Tresulting probabilistic framework thus provides the first scalable tool for predicting PGIR performance, validated through case studies in multicarrier communication systems.
Papoulis-Gerchberg迭代重建(PGIR)算法已广泛应用于各种信号处理任务,在系统设计阶段进行可靠的性能预测是保证其有效性的关键。然而,预测任意信号长度和观测模式下的pir性能长期以来一直难以计算,因为可能的结构组合爆炸。本文通过分析无噪声和有噪声情况下的收敛条件和建模重建误差分布来解决这个问题。推导出的封闭式概率定律能够对单个几何形状进行精确预测,并且随着信号长度的增加,操作员的频谱半径的浓度进一步允许仅基于损失和知识比进行性能表征。由此产生的概率框架为预测PGIR性能提供了第一个可扩展的工具,并通过多载波通信系统的案例研究进行了验证。
{"title":"On the Performance of Discrete Papoulis–Gerchberg Type Iterative Reconstruction","authors":"Li Cho;Shi-Wei Chang","doi":"10.1109/LSP.2026.3656465","DOIUrl":"https://doi.org/10.1109/LSP.2026.3656465","url":null,"abstract":"The Papoulis–Gerchberg iterative reconstruction (PGIR) algorithm has been widely applied across diverse signal processing tasks, and reliable performance prediction during the system design stage is crucial for ensuring its effectiveness. However, predicting PGIR performance for arbitrary signal lengths and observation patterns has long been computationally intractable due to the combinatorial explosion of possible configurations. This letter addresses the problem by analyzing convergence conditions and modeling reconstruction error distributions in both noise-free and noisy scenarios. The derived closed-form probability laws enable accurate prediction for individual geometries, and the observed concentration of the operator's spectral radius with increasing signal length further allows performance characterization based only on loss and knowledge ratios. Tresulting probabilistic framework thus provides the first scalable tool for predicting PGIR performance, validated through case studies in multicarrier communication systems.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"718-722"},"PeriodicalIF":3.9,"publicationDate":"2026-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082161","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
DBA-PCGC: Dual-Domain Boundary Aware for Task-Friendly Point Cloud Geometry Compression 面向任务友好型点云几何压缩的双域边界感知
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-20 DOI: 10.1109/LSP.2026.3656054
Minjian Chen;Liquan Shen;Qi Teng;Shiwei Wang;Feifeng Wang
Compressed point clouds are increasingly used in machine vision tasks, which rely on key semantic regions of the point cloud such as geometric details and structural boundaries. However, existing point cloud compression methods for machine vision lack explicit awareness of geometrically induced semantic boundaries, causing semantic ambiguity in certain boundary regions during compression, thereby degrading machine vision performance. To address this issue, we propose a Dual-domain Boundary Aware Point Cloud Geometry Compression (DBA-PCGC) method that explicitly preserves semantic geometric boundaries from complementary spatial and frequency perspectives, enabling beneficial for machine vision tasks. Specifically, a Structure Aware Transform Module (SATM) exploits Gram matrix traces on local graphs to capture structural variations and highlight high-variation boundary regions, while compactly encoding smooth areas. In parallel, a Frequency Aware Transform Module (FATM) applies Chebyshev high-pass filtering to enhance high-frequency components corresponding to semantic geometric boundaries and suppress redundant low-frequency content. Experimental results on point cloud machine vision tasks demonstrate that our method achieves superior performance compared with existing compression approaches.
压缩点云越来越多地应用于机器视觉任务,这依赖于点云的关键语义区域,如几何细节和结构边界。然而,现有的机器视觉点云压缩方法缺乏对几何诱导的语义边界的明确感知,在压缩过程中会导致某些边界区域的语义模糊,从而降低机器视觉性能。为了解决这个问题,我们提出了一种双域边界感知点云几何压缩(DBA-PCGC)方法,该方法从互补的空间和频率角度明确地保留语义几何边界,从而有利于机器视觉任务。具体来说,结构感知变换模块(SATM)利用局部图上的Gram矩阵跟踪来捕获结构变化并突出显示高变化的边界区域,同时紧凑地编码光滑区域。同时,频率感知变换模块(FATM)利用切比雪夫高通滤波增强与语义几何边界对应的高频分量,抑制冗余的低频内容。在点云机器视觉任务上的实验结果表明,与现有的压缩方法相比,我们的方法具有更好的性能。
{"title":"DBA-PCGC: Dual-Domain Boundary Aware for Task-Friendly Point Cloud Geometry Compression","authors":"Minjian Chen;Liquan Shen;Qi Teng;Shiwei Wang;Feifeng Wang","doi":"10.1109/LSP.2026.3656054","DOIUrl":"https://doi.org/10.1109/LSP.2026.3656054","url":null,"abstract":"Compressed point clouds are increasingly used in machine vision tasks, which rely on key semantic regions of the point cloud such as geometric details and structural boundaries. However, existing point cloud compression methods for machine vision lack explicit awareness of geometrically induced semantic boundaries, causing semantic ambiguity in certain boundary regions during compression, thereby degrading machine vision performance. To address this issue, we propose a Dual-domain Boundary Aware Point Cloud Geometry Compression (DBA-PCGC) method that explicitly preserves semantic geometric boundaries from complementary spatial and frequency perspectives, enabling beneficial for machine vision tasks. Specifically, a Structure Aware Transform Module (SATM) exploits Gram matrix traces on local graphs to capture structural variations and highlight high-variation boundary regions, while compactly encoding smooth areas. In parallel, a Frequency Aware Transform Module (FATM) applies Chebyshev high-pass filtering to enhance high-frequency components corresponding to semantic geometric boundaries and suppress redundant low-frequency content. Experimental results on point cloud machine vision tasks demonstrate that our method achieves superior performance compared with existing compression approaches.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"713-717"},"PeriodicalIF":3.9,"publicationDate":"2026-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prototype Distance and Local Manifold Guided Sample-Weighted Kernel Clustering 原型距离和局部流形引导的样本加权核聚类
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-19 DOI: 10.1109/LSP.2026.3655339
Chang Wu;Pengxin Xu;Zhaohu Liu;Luyun Wang;Yong Peng
Conventional clustering algorithms, such as $k$-means and its variants, often assume that data are linearly separable and that all samples contribute equally to the clustering process. However, real-world data usually lies on nonlinear manifolds and contains noisy or ambiguous samples, making such assumptions unrealistic. To address these challenges, we incorporate a Sample-Weighting mechanism into the Kernel Clustering model, which is based on the strategy of coupling Prototype Distance with Local Manifold information together (PDLM-SWKC). Specifically, PDLM-SWKC performs clustering in kernel space to capture nonlinear structures, while adaptively assigning sample weights according to both their proximity to cluster centers and their local manifold connectivity; Besides, the learned sample weights in turn guide graph affinity matrix learning to generate better topological relation matrix, achieving tight coupling between sample-weighted kernel clustering and topological manifold learning. This dual-driven weighting mechanism enhances the robustness and structural consistency, effectively emphasizing reliable samples and suppressing outliers. Extensive experiments on eight benchmark datasets demonstrate that PDLM-SWKC achieves superior performance compared with state-of-the-art clustering methods. Moreover, convergence and visualization analyses confirm its stability, interpretability, and strong capability in modeling complex nonlinear data distributions.
传统的聚类算法,如$k$-means及其变体,通常假设数据是线性可分的,并且所有样本对聚类过程的贡献相同。然而,现实世界的数据通常存在于非线性流形上,并且包含有噪声或模糊的样本,使得这样的假设不现实。为了解决这些问题,我们在核聚类模型中引入了样本加权机制,该模型基于原型距离与局部流形信息耦合的策略(PDLM-SWKC)。具体来说,PDLM-SWKC在核空间中执行聚类来捕获非线性结构,同时根据它们与聚类中心的接近程度和它们的局部流形连通性自适应分配样本权重;此外,学习到的样本权重反过来引导图亲和矩阵学习生成更好的拓扑关系矩阵,实现了样本加权核聚类与拓扑流形学习的紧密耦合。这种双重驱动的加权机制增强了鲁棒性和结构一致性,有效地强调了可靠样本并抑制了异常值。在8个基准数据集上进行的大量实验表明,与最先进的聚类方法相比,PDLM-SWKC具有优越的性能。通过收敛性和可视化分析,验证了该方法的稳定性、可解释性和对复杂非线性数据分布的建模能力。
{"title":"Prototype Distance and Local Manifold Guided Sample-Weighted Kernel Clustering","authors":"Chang Wu;Pengxin Xu;Zhaohu Liu;Luyun Wang;Yong Peng","doi":"10.1109/LSP.2026.3655339","DOIUrl":"https://doi.org/10.1109/LSP.2026.3655339","url":null,"abstract":"Conventional clustering algorithms, such as <inline-formula><tex-math>$k$</tex-math></inline-formula>-means and its variants, often assume that data are linearly separable and that all samples contribute equally to the clustering process. However, real-world data usually lies on nonlinear manifolds and contains noisy or ambiguous samples, making such assumptions unrealistic. To address these challenges, we incorporate a Sample-Weighting mechanism into the Kernel Clustering model, which is based on the strategy of coupling Prototype Distance with Local Manifold information together (PDLM-SWKC). Specifically, PDLM-SWKC performs clustering in kernel space to capture nonlinear structures, while adaptively assigning sample weights according to both their proximity to cluster centers and their local manifold connectivity; Besides, the learned sample weights in turn guide graph affinity matrix learning to generate better topological relation matrix, achieving tight coupling between sample-weighted kernel clustering and topological manifold learning. This dual-driven weighting mechanism enhances the robustness and structural consistency, effectively emphasizing reliable samples and suppressing outliers. Extensive experiments on eight benchmark datasets demonstrate that PDLM-SWKC achieves superior performance compared with state-of-the-art clustering methods. Moreover, convergence and visualization analyses confirm its stability, interpretability, and strong capability in modeling complex nonlinear data distributions.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"658-662"},"PeriodicalIF":3.9,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Deep Fixed Projector: Fast Projection Network for Image Denoising via Frozen Weights and Inter-Inference Consistency 深度固定投影仪:快速投影网络图像去噪通过冻结权值和内部推理一致性
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-19 DOI: 10.1109/LSP.2026.3655611
Shaoping Xu;Hanyang Hu;Wuyong Tao
Unsupervised methods like deep image prior (DIP) leverage network priors for denoising without labeled data but suffer from slow convergence and overfitting, while deep random projector (DRP) improves efficiency via fixed weights and a random seed yet remains limited by its fully random initialization. In this work, we propose deep fixed projector (DFP), an enhanced DRP-based framework featuring three synergistic improvements: (1) initializing the seed with the noisy image to align optimization with the clean image manifold, (2) using pre-trained clean-to-clean encoder-decoder weights to embed structural priors and accelerate convergence, and (3) introducing inter-inference consistency (IIC), a self-supervised regularization that enforces output stability under input perturbations to suppress noise and reduce overfitting. Experiments show DFP consistently surpasses DIP, DRP, and recent variants in PSNR, while achieving faster convergence and robust denoising quality.
像深度图像先验(DIP)这样的无监督方法利用网络先验在没有标记数据的情况下进行去噪,但存在缓慢收敛和过拟合的问题,而深度随机投影(DRP)通过固定权重和随机种子提高效率,但仍然受到其完全随机初始化的限制。在这项工作中,我们提出了深度固定投影仪(DFP),这是一种增强的基于drp的框架,具有三个协同改进:(1)用噪声图像初始化种子,使优化与干净图像流形对齐;(2)使用预训练的clean-to-clean编码器-解码器权值嵌入结构先验并加速收敛;(3)引入自监督正则化,在输入扰动下强制输出稳定性,以抑制噪声并减少过拟合。实验表明,DFP始终优于DIP, DRP和PSNR的最新变体,同时实现更快的收敛和鲁棒的去噪质量。
{"title":"Deep Fixed Projector: Fast Projection Network for Image Denoising via Frozen Weights and Inter-Inference Consistency","authors":"Shaoping Xu;Hanyang Hu;Wuyong Tao","doi":"10.1109/LSP.2026.3655611","DOIUrl":"https://doi.org/10.1109/LSP.2026.3655611","url":null,"abstract":"Unsupervised methods like deep image prior (DIP) leverage network priors for denoising without labeled data but suffer from slow convergence and overfitting, while deep random projector (DRP) improves efficiency via fixed weights and a random seed yet remains limited by its fully random initialization. In this work, we propose deep fixed projector (DFP), an enhanced DRP-based framework featuring three synergistic improvements: (1) initializing the seed with the noisy image to align optimization with the clean image manifold, (2) using pre-trained clean-to-clean encoder-decoder weights to embed structural priors and accelerate convergence, and (3) introducing inter-inference consistency (IIC), a self-supervised regularization that enforces output stability under input perturbations to suppress noise and reduce overfitting. Experiments show DFP consistently surpasses DIP, DRP, and recent variants in PSNR, while achieving faster convergence and robust denoising quality.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"708-712"},"PeriodicalIF":3.9,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082042","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
R3VQ: Redundancy-Reduced Residual Vector Quantization for Low-Bitrate Neural Speech Coding R3VQ:用于低比特率神经语音编码的冗余减残差矢量量化
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-19 DOI: 10.1109/LSP.2026.3655351
Eunkyun Lee;Jongwook Chae;Sooyoung Park;Jong Won Shin
Neural speech and audio codecs have demonstrated decent quality of the decoded audio at low bitrates. They consist of three parts, an encoder, a decoder, and a quantizer. Residual vector quantization (RVQ) or multi-stage vector quantization in which the residual signal from the previous stage is quantized in the next stage is employed in many neural speech codecs and has exhibited good performance while providing bitrate scalability. In this letter, we propose the redundancy-reduced residual vector quantization (R3VQ) which improves the RVQ by inserting a neural network called a refiner. The role of the refiner is to reduce the power of the residual signal to be quantized by enhancing the estimate of the original speech from the quantized signals in the previous stages. We also present a part-wise (PW) training scheme suitable for the training of the neural speech codec with the R3VQ. Experimental results showed that the proposed R3VQ trained with a PW training scheme outperformed the RVQ in both objective measures for speech quality and subjective MUltiple Stimuli with Hidden Reference and Anchor (MUSHRA) test.
神经语音和音频编解码器已经证明在低比特率下解码的音频质量不错。它们由三部分组成:编码器、解码器和量化器。残差矢量量化(RVQ)或多级矢量量化(将前一级的残差信号在下一级进行量化)在许多神经语音编解码器中得到了应用,在提供比特率可扩展性的同时表现出了良好的性能。在这封信中,我们提出了冗余减少残差矢量量化(R3VQ),它通过插入一个称为细化器的神经网络来改善RVQ。细化器的作用是通过增强前一阶段量化信号对原始语音的估计来降低待量化残余信号的功率。我们还提出了一种适用于R3VQ训练神经语音编解码器的部分智能(PW)训练方案。实验结果表明,采用PW训练方案训练的R3VQ在语音质量的客观测量和基于隐藏参考和锚点的主观多重刺激(MUSHRA)测试上都优于RVQ。
{"title":"R3VQ: Redundancy-Reduced Residual Vector Quantization for Low-Bitrate Neural Speech Coding","authors":"Eunkyun Lee;Jongwook Chae;Sooyoung Park;Jong Won Shin","doi":"10.1109/LSP.2026.3655351","DOIUrl":"https://doi.org/10.1109/LSP.2026.3655351","url":null,"abstract":"Neural speech and audio codecs have demonstrated decent quality of the decoded audio at low bitrates. They consist of three parts, an encoder, a decoder, and a quantizer. Residual vector quantization (RVQ) or multi-stage vector quantization in which the residual signal from the previous stage is quantized in the next stage is employed in many neural speech codecs and has exhibited good performance while providing bitrate scalability. In this letter, we propose the redundancy-reduced residual vector quantization (R3VQ) which improves the RVQ by inserting a neural network called a refiner. The role of the refiner is to reduce the power of the residual signal to be quantized by enhancing the estimate of the original speech from the quantized signals in the previous stages. We also present a part-wise (PW) training scheme suitable for the training of the neural speech codec with the R3VQ. Experimental results showed that the proposed R3VQ trained with a PW training scheme outperformed the RVQ in both objective measures for speech quality and subjective MUltiple Stimuli with Hidden Reference and Anchor (MUSHRA) test.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"693-697"},"PeriodicalIF":3.9,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Semantic-Aware and Semi-Fragile Diffusion Watermarking for Proactive Deepfake Detection 基于语义感知和半脆弱扩散水印的主动深度伪造检测
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-19 DOI: 10.1109/LSP.2026.3655346
Rui Sun;Yifan Zhang;Xiaolu Yu;Yuwei Dai;Yaofei Wang
The rapid progress of deepfake technology, which primarily manipulates facial identity and image semantics, has made detection and defense critically important. Conventional global watermarking methods offer limited capacity for protecting key semantic content, as they typically rely on uniformly distributed watermarks across the entire image. This letter presents a method that weave watermarks as intrinsic components into the semantic content of images (facial regions) in the latent space. By aligning watermark embedding regions with facial content, we establish an inherent fragility mechanism wherein any deepfake manipulation that modifies facial semantics inevitably disrupts the watermark, enabling precise detection. Simultaneously, adversarial training of the extractor ensures robustness against conventional signal processing operations. A local entropy perception module dynamically adjusts embedding intensity based on regional texture complexity, maintaining high perceptual fidelity. Extensive experiments indicate that compared to advanced methods, the proposed approach maintains robustness against conventional benign operations while achieving reliable detection of deepfake forgeries, thereby enabling precise protection of image semantic content.
深度伪造技术的快速发展,主要是操纵面部身份和图像语义,使得检测和防御至关重要。传统的全局水印方法对关键语义内容的保护能力有限,因为它们通常依赖于整个图像中均匀分布的水印。本文提出了一种将水印作为内在成分编织到潜在空间中图像(面部区域)的语义内容中的方法。通过将水印嵌入区域与面部内容对齐,我们建立了一种固有的脆弱性机制,其中任何修改面部语义的深度伪造操作都不可避免地会破坏水印,从而实现精确检测。同时,提取器的对抗性训练确保了对传统信号处理操作的鲁棒性。局部熵感知模块根据区域纹理复杂度动态调整嵌入强度,保持较高的感知保真度。大量的实验表明,与先进的方法相比,本文提出的方法在对传统良性操作保持鲁棒性的同时,实现了对深度伪造伪造的可靠检测,从而能够精确保护图像语义内容。
{"title":"Semantic-Aware and Semi-Fragile Diffusion Watermarking for Proactive Deepfake Detection","authors":"Rui Sun;Yifan Zhang;Xiaolu Yu;Yuwei Dai;Yaofei Wang","doi":"10.1109/LSP.2026.3655346","DOIUrl":"https://doi.org/10.1109/LSP.2026.3655346","url":null,"abstract":"The rapid progress of deepfake technology, which primarily manipulates facial identity and image semantics, has made detection and defense critically important. Conventional global watermarking methods offer limited capacity for protecting key semantic content, as they typically rely on uniformly distributed watermarks across the entire image. This letter presents a method that weave watermarks as intrinsic components into the semantic content of images (facial regions) in the latent space. By aligning watermark embedding regions with facial content, we establish an inherent fragility mechanism wherein any deepfake manipulation that modifies facial semantics inevitably disrupts the watermark, enabling precise detection. Simultaneously, adversarial training of the extractor ensures robustness against conventional signal processing operations. A local entropy perception module dynamically adjusts embedding intensity based on regional texture complexity, maintaining high perceptual fidelity. Extensive experiments indicate that compared to advanced methods, the proposed approach maintains robustness against conventional benign operations while achieving reliable detection of deepfake forgeries, thereby enabling precise protection of image semantic content.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"688-692"},"PeriodicalIF":3.9,"publicationDate":"2026-01-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082204","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Suppression of Nyquist Ringing in FFT-Based Sample Rate Conversion 基于fft的采样率转换中奈奎斯特振铃的抑制
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-15 DOI: 10.1109/LSP.2026.3654546
Roope Salmi;Vesa Välimäki
Sample rate conversion, a common task in audio signal processing, can be performed with high quality using the fast Fourier transform (FFT) on the whole audio file. Before returning to the time domain using the inverse FFT, the sample rate of the signal is changed by either truncating or zero-padding the frequency-domain buffer. This operation leaves a discontinuity in the spectrum, which causes time-domain ringing at that frequency. The ringing can be suppressed by tapering the highest frequency bins. This letter introduces the double Dolph-Chebyshev window, a frequency-domain tapering function with a configurable level of ringing outside its main lobe in the transform domain. In comparison to basic cosine tapering, the proposed method provides, for example, a 150-dB suppression 91% faster. This letter improves the accuracy of FFT-based sample rate conversion, making it a practical tool for signal processing.
采样率转换是音频信号处理中常见的任务,利用快速傅里叶变换(FFT)可以对整个音频文件进行高质量的采样率转换。在使用逆FFT返回到时域之前,通过截断或零填充频域缓冲区来改变信号的采样率。这个操作在频谱中留下一个不连续,这导致该频率的时域振铃。振铃可以通过使最高频率的箱子变细来抑制。本文介绍了双道尔夫-切比雪夫窗口,这是一种频域锥形函数,在变换域中具有可配置的主瓣外振铃电平。与基本的余弦渐变相比,所提出的方法提供了,例如,提高了91%的150 db抑制速度。这封信提高了基于fft的采样率转换的精度,使其成为信号处理的实用工具。
{"title":"Suppression of Nyquist Ringing in FFT-Based Sample Rate Conversion","authors":"Roope Salmi;Vesa Välimäki","doi":"10.1109/LSP.2026.3654546","DOIUrl":"https://doi.org/10.1109/LSP.2026.3654546","url":null,"abstract":"Sample rate conversion, a common task in audio signal processing, can be performed with high quality using the fast Fourier transform (FFT) on the whole audio file. Before returning to the time domain using the inverse FFT, the sample rate of the signal is changed by either truncating or zero-padding the frequency-domain buffer. This operation leaves a discontinuity in the spectrum, which causes time-domain ringing at that frequency. The ringing can be suppressed by tapering the highest frequency bins. This letter introduces the double Dolph-Chebyshev window, a frequency-domain tapering function with a configurable level of ringing outside its main lobe in the transform domain. In comparison to basic cosine tapering, the proposed method provides, for example, a 150-dB suppression 91% faster. This letter improves the accuracy of FFT-based sample rate conversion, making it a practical tool for signal processing.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"683-687"},"PeriodicalIF":3.9,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11354502","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082045","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Asymptotic MSE-Optimality of Parametric Bayesian Channel Estimation in mmWave Systems 毫米波系统参数贝叶斯信道估计的渐近mse最优性
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-15 DOI: 10.1109/LSP.2026.3654532
Franz Weißer;Wolfgang Utschick
The mean square error (MSE)-optimal estimator is known to be the conditional mean estimator (CME). This letter introduces a parametric channel estimation technique based on Bayesian estimation. This technique uses the estimated channel parameters to parameterize the well-known LMMSE channel estimator. We first derive an asymptotic CME formulation that holds for a wide range of priors on the channel parameters. Based on this, we show that parametric Bayesian channel estimation is MSE-optimal for high signal-to-noise ratio (SNR) and/or long coherence intervals, i.e., many noisy observations provided within one coherence interval. Numerical simulations validate the derived formulations.
均方误差(MSE)最优估计量被称为条件平均估计量(CME)。本文介绍了一种基于贝叶斯估计的参数信道估计技术。该技术使用估计的信道参数来参数化众所周知的LMMSE信道估计器。我们首先推导了一个渐近CME公式,该公式适用于通道参数的大范围先验。基于此,我们证明了参数贝叶斯信道估计对于高信噪比(SNR)和/或长相干间隔(即在一个相干间隔内提供许多噪声观测)是mse最优的。数值模拟验证了推导出的公式。
{"title":"On the Asymptotic MSE-Optimality of Parametric Bayesian Channel Estimation in mmWave Systems","authors":"Franz Weißer;Wolfgang Utschick","doi":"10.1109/LSP.2026.3654532","DOIUrl":"https://doi.org/10.1109/LSP.2026.3654532","url":null,"abstract":"The mean square error (MSE)-optimal estimator is known to be the conditional mean estimator (CME). This letter introduces a parametric channel estimation technique based on Bayesian estimation. This technique uses the estimated channel parameters to parameterize the well-known LMMSE channel estimator. We first derive an asymptotic CME formulation that holds for a wide range of priors on the channel parameters. Based on this, we show that parametric Bayesian channel estimation is MSE-optimal for high signal-to-noise ratio (SNR) and/or long coherence intervals, i.e., many noisy observations provided within one coherence interval. Numerical simulations validate the derived formulations.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"653-657"},"PeriodicalIF":3.9,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11354545","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082047","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CWSNet: A Building Layout Sensing Network With Corner and Wall Information Fusion From Through-the-Wall Radar CWSNet:一种基于穿墙雷达的角与墙信息融合的建筑布局传感网络
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-15 DOI: 10.1109/LSP.2026.3654540
Shichao Zhong;Zhongjie Ma;Xiaolu Zeng;Renjie Liu;Xiaopeng Yang
Building layout sensing of through-the-wall radar (TWR) plays a vital role in fields such as counter-terrorism operations and post-disaster rescue. Existing layout sensing methods based on TWR typically focus solely on either corner information or wall surface features, neglecting the complementarity between the two, which leads to low sensing accuracy in complex environments. To address this issue, we propose a Corner-Wall Sensing Network (CWSNet), a building layout sensing network that fuses corner and wall surface information. First, deep convolutional networks are used to extract wall and corner features from TWR images. Then, these complementary structural features are fused to form an integrated representation. Finally, a transformer-based dynamic graph reasoning module (DGRM) captures their spatial relationships, enabling high-precision layout sensing. Both simulated and real-world experimental datasets demonstrate that CWSNet significantly outperforms existing methods across multiple evaluation metrics, achieving superior wall localization accuracy and layout connectivity, while also exhibiting strong robustness and generalization capabilities.
穿墙雷达的建筑布局感知在反恐行动和灾后救援等领域发挥着至关重要的作用。现有的基于TWR的布局感知方法通常只关注边角信息或墙体表面特征,忽略了两者之间的互补性,导致在复杂环境下的感知精度较低。为了解决这一问题,我们提出了一种角墙传感网络(CWSNet),一种融合角墙表面信息的建筑布局传感网络。首先,利用深度卷积网络提取TWR图像的边角特征。然后,将这些互补的结构特征融合在一起,形成一个完整的表征。最后,基于变压器的动态图推理模块(DGRM)捕获它们的空间关系,实现高精度布局感知。模拟和现实世界的实验数据集表明,CWSNet在多个评估指标上都明显优于现有方法,实现了卓越的墙壁定位精度和布局连通性,同时还表现出强大的鲁棒性和泛化能力。
{"title":"CWSNet: A Building Layout Sensing Network With Corner and Wall Information Fusion From Through-the-Wall Radar","authors":"Shichao Zhong;Zhongjie Ma;Xiaolu Zeng;Renjie Liu;Xiaopeng Yang","doi":"10.1109/LSP.2026.3654540","DOIUrl":"https://doi.org/10.1109/LSP.2026.3654540","url":null,"abstract":"Building layout sensing of through-the-wall radar (TWR) plays a vital role in fields such as counter-terrorism operations and post-disaster rescue. Existing layout sensing methods based on TWR typically focus solely on either corner information or wall surface features, neglecting the complementarity between the two, which leads to low sensing accuracy in complex environments. To address this issue, we propose a Corner-Wall Sensing Network (CWSNet), a building layout sensing network that fuses corner and wall surface information. First, deep convolutional networks are used to extract wall and corner features from TWR images. Then, these complementary structural features are fused to form an integrated representation. Finally, a transformer-based dynamic graph reasoning module (DGRM) captures their spatial relationships, enabling high-precision layout sensing. Both simulated and real-world experimental datasets demonstrate that CWSNet significantly outperforms existing methods across multiple evaluation metrics, achieving superior wall localization accuracy and layout connectivity, while also exhibiting strong robustness and generalization capabilities.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"703-707"},"PeriodicalIF":3.9,"publicationDate":"2026-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082177","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MTT Resource Allocation in Space-Based Netted MIMO Radar Under Main-Lobe Clutter 主瓣杂波条件下天基组网MIMO雷达MTT资源分配
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-13 DOI: 10.1109/LSP.2026.3653400
Zhifu Jiang;Jianxin Wu;Lei Zhang
High mobility of space-based radar (SBR) platforms risks target velocities falling below the minimum detectable velocity (MDV), rendering them undetectable in main-lobe clutter. Aiming at multi-target tracking (MTT) in space-based multiple-input multiple-output (MIMO) radar systems, this paper proposes a joint beam and dwell time allocation (JBTA) strategy. This strategy incorporates the MDV constraint and adopts the Bayesian Cramér-Rao Lower Bound (BCRLB) as the performance metric, where BCRLB is a lower bound for the mean square error (MSE) of target state estimation. To solve the non-convex mixed-integer optimization problem of JBTA, a two-step decomposition approach is designed. Numerical results verify that JBTA effectively improves global MTT performance.
天基雷达(SBR)平台的高机动性使目标速度低于最小可探测速度(MDV),使其在主瓣杂波中无法被探测到。针对天基多输入多输出(MIMO)雷达系统中的多目标跟踪问题,提出了一种波束与停留时间联合分配(JBTA)策略。该策略结合MDV约束,采用Bayesian cram - rao下界(BCRLB)作为性能指标,BCRLB是目标状态估计均方误差(MSE)的下界。针对JBTA的非凸混合整数优化问题,设计了一种两步分解方法。数值结果验证了JBTA有效地提高了全局MTT性能。
{"title":"MTT Resource Allocation in Space-Based Netted MIMO Radar Under Main-Lobe Clutter","authors":"Zhifu Jiang;Jianxin Wu;Lei Zhang","doi":"10.1109/LSP.2026.3653400","DOIUrl":"https://doi.org/10.1109/LSP.2026.3653400","url":null,"abstract":"High mobility of space-based radar (SBR) platforms risks target velocities falling below the minimum detectable velocity (MDV), rendering them undetectable in main-lobe clutter. Aiming at multi-target tracking (MTT) in space-based multiple-input multiple-output (MIMO) radar systems, this paper proposes a joint beam and dwell time allocation (JBTA) strategy. This strategy incorporates the MDV constraint and adopts the Bayesian Cramér-Rao Lower Bound (BCRLB) as the performance metric, where BCRLB is a lower bound for the mean square error (MSE) of target state estimation. To solve the non-convex mixed-integer optimization problem of JBTA, a two-step decomposition approach is designed. Numerical results verify that JBTA effectively improves global MTT performance.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"624-628"},"PeriodicalIF":3.9,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082079","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Signal Processing Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1