首页 > 最新文献

IEEE Signal Processing Letters最新文献

英文 中文
Low-Complexity Approximations of the WECS Method for SAR Change Detection 基于wcs的SAR变化检测方法的低复杂度逼近
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3653395
Luan Portella;R. J. Cintra;Aluísio Pinheiro
This letter introduces low-complexity approximations for the Wavelet Energy Correlation Screening (WECS) method, which aims at change detection in multitemporal SAR images. The WECS method relies on the non-decimated discrete wavelet transform (ND-DWT) to compute approximation coefficients employed in a feature screening process based on the Pearson correlation. Although effective, WECS presents a high computational cost due to its repeated wavelet filtering stage. To overcome this drawback, we propose two approximations for the wavelet filter coefficients, obtained by truncating their canonical signed digit (CSD) representation, which significantly reduces the number of arithmetic operations. Numerical experiments using both simulated and real-world datasets demonstrate that the proposed methods not only maintain the performance of the original WECS but also achieve computational gains, even outperforming it in certain scenarios.
本文介绍了小波能量相关筛选(WECS)方法的低复杂度近似,该方法旨在对多时相SAR图像进行变化检测。WECS方法依靠非抽取离散小波变换(ND-DWT)来计算基于Pearson相关性的特征筛选过程中使用的近似系数。小波滤波虽然有效,但由于小波滤波阶段重复,计算量大。为了克服这个缺点,我们提出了两个小波滤波器系数的近似,通过截断它们的正则符号数(CSD)表示来获得,这大大减少了算术运算的次数。使用模拟和真实数据集进行的数值实验表明,所提出的方法不仅保持了原始WECS的性能,而且还获得了计算增益,甚至在某些情况下优于原始WECS。
{"title":"Low-Complexity Approximations of the WECS Method for SAR Change Detection","authors":"Luan Portella;R. J. Cintra;Aluísio Pinheiro","doi":"10.1109/LSP.2026.3653395","DOIUrl":"https://doi.org/10.1109/LSP.2026.3653395","url":null,"abstract":"This letter introduces low-complexity approximations for the Wavelet Energy Correlation Screening (WECS) method, which aims at change detection in multitemporal SAR images. The WECS method relies on the non-decimated discrete wavelet transform (ND-DWT) to compute approximation coefficients employed in a feature screening process based on the Pearson correlation. Although effective, WECS presents a high computational cost due to its repeated wavelet filtering stage. To overcome this drawback, we propose two approximations for the wavelet filter coefficients, obtained by truncating their canonical signed digit (CSD) representation, which significantly reduces the number of arithmetic operations. Numerical experiments using both simulated and real-world datasets demonstrate that the proposed methods not only maintain the performance of the original WECS but also achieve computational gains, even outperforming it in certain scenarios.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"643-647"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146081993","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Boosting Small Object Detection via High-Frequency Feature Oriented Network 基于高频特征网络的小目标检测
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3652955
Min Li;Zhaofei Hao;Gang Li;Jin Wan;Delong Han;Mingle Zhou
Small Object Detection (SOD) aims to accurately identify and locate small objects in images. However, existing methods usually focus on exploring spatial domain features, neglecting high-frequency features that preserve fine-grained details such as texture and edge information. To overcome this limitation, we propose a High-Frequency Feature-Oriented Network (HFFO-Net). First, we introduce the Channel-wise Frequency Modulation Module (CFMM), which leverages the 2D Discrete Cosine Transform (DCT) to accentuate salient frequency components while mitigating noise interference. Second, we design a High-Frequency Oriented Module (HFOM), which utilizes the Channel Selection Branch (CSB) and Spatial Selection Branch (SSB) to highlight small objects in the channel and spatial region. Third, we introduce a Dual-Query Attention Fusion Mechanism (DQAFM), which reduces the semantic gap between spatial and frequency features and achieves better feature fusion through bidirectional cross-attention. Extensive experiments are implemented, and the corresponding results demonstrate that HFFO-Net excels at detecting small objects.
小目标检测(Small Object Detection, SOD)旨在准确识别和定位图像中的小目标。然而,现有方法通常侧重于探索空间域特征,而忽略了保留纹理和边缘信息等细粒度细节的高频特征。为了克服这一限制,我们提出了高频特征导向网络(HFFO-Net)。首先,我们介绍了信道调频模块(CFMM),它利用二维离散余弦变换(DCT)来突出显著频率分量,同时减轻噪声干扰。其次,我们设计了一个高频定向模块(hfm),该模块利用信道选择分支(CSB)和空间选择分支(SSB)来突出显示信道和空间区域中的小目标。第三,引入双查询注意融合机制(Dual-Query Attention Fusion Mechanism, DQAFM),减小空间特征和频率特征之间的语义差距,通过双向交叉注意实现更好的特征融合。大量的实验结果表明,HFFO-Net在检测小目标方面表现优异。
{"title":"Boosting Small Object Detection via High-Frequency Feature Oriented Network","authors":"Min Li;Zhaofei Hao;Gang Li;Jin Wan;Delong Han;Mingle Zhou","doi":"10.1109/LSP.2026.3652955","DOIUrl":"https://doi.org/10.1109/LSP.2026.3652955","url":null,"abstract":"Small Object Detection (SOD) aims to accurately identify and locate small objects in images. However, existing methods usually focus on exploring spatial domain features, neglecting high-frequency features that preserve fine-grained details such as texture and edge information. To overcome this limitation, we propose a High-Frequency Feature-Oriented Network (HFFO-Net). First, we introduce the Channel-wise Frequency Modulation Module (CFMM), which leverages the 2D Discrete Cosine Transform (DCT) to accentuate salient frequency components while mitigating noise interference. Second, we design a High-Frequency Oriented Module (HFOM), which utilizes the Channel Selection Branch (CSB) and Spatial Selection Branch (SSB) to highlight small objects in the channel and spatial region. Third, we introduce a Dual-Query Attention Fusion Mechanism (DQAFM), which reduces the semantic gap between spatial and frequency features and achieves better feature fusion through bidirectional cross-attention. Extensive experiments are implemented, and the corresponding results demonstrate that HFFO-Net excels at detecting small objects.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"584-588"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
HIER: Heterogeneous Information Bottleneck and Expert Routing for Social Bot Detection 基于异构信息瓶颈和专家路由的社交机器人检测
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3652127
Kun Lu;Hongli Zhang;Yuchen Yang;Chao Meng;Binxing Fang
Social bots constitute a substantial fraction of active accounts on digital platforms, fundamentally threatening information authenticity and democratic discourse. Contemporary detection methods confront critical limitations: information imbalance across heterogeneous relations, computational challenges in processing massive neighborhoods, and inadequate multi-scale representation learning. We propose HIER (Heterogeneous Information Bottleneck and Expert Routing), a pioneering framework that integrates variational information theory with mixture-of-experts paradigms for social network analysis. HIER introduces relation-aware variational information bottleneck for optimal compression across relationship types, dynamic sparse expert routing that extends mixture-of-experts to edge-level graph processing, and dual-scale mutual information maximization enhancing representation discriminability through neighborhood consistency and graph-level contrastive learning. Experimental validation demonstrates HIER’s superior performance across real-world datasets, establishing new benchmarks for heterogeneous social bot detection.
社交机器人在数字平台上的活跃账户中占很大比例,从根本上威胁着信息的真实性和民主话语。当代检测方法面临着严重的局限性:跨异构关系的信息不平衡,处理大量邻域的计算挑战,以及多尺度表示学习的不足。我们提出了HIER(异构信息瓶颈和专家路由),这是一个开创性的框架,将变分信息理论与专家混合范式集成在一起,用于社会网络分析。HIER引入了关系感知的变分信息瓶颈,用于跨关系类型的最佳压缩,将专家混合扩展到边缘级图处理的动态稀疏专家路由,以及通过邻域一致性和图级对比学习增强表征判别性的双尺度互信息最大化。实验验证证明了HIER在真实世界数据集上的卓越性能,为异构社交机器人检测建立了新的基准。
{"title":"HIER: Heterogeneous Information Bottleneck and Expert Routing for Social Bot Detection","authors":"Kun Lu;Hongli Zhang;Yuchen Yang;Chao Meng;Binxing Fang","doi":"10.1109/LSP.2026.3652127","DOIUrl":"https://doi.org/10.1109/LSP.2026.3652127","url":null,"abstract":"Social bots constitute a substantial fraction of active accounts on digital platforms, fundamentally threatening information authenticity and democratic discourse. Contemporary detection methods confront critical limitations: information imbalance across heterogeneous relations, computational challenges in processing massive neighborhoods, and inadequate multi-scale representation learning. We propose HIER (Heterogeneous Information Bottleneck and Expert Routing), a pioneering framework that integrates variational information theory with mixture-of-experts paradigms for social network analysis. HIER introduces relation-aware variational information bottleneck for optimal compression across relationship types, dynamic sparse expert routing that extends mixture-of-experts to edge-level graph processing, and dual-scale mutual information maximization enhancing representation discriminability through neighborhood consistency and graph-level contrastive learning. Experimental validation demonstrates HIER’s superior performance across real-world datasets, establishing new benchmarks for heterogeneous social bot detection.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"521-525"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk-Aware Low-Pass Importance Sampling for Graph Signals Under Heterogeneous Noise and Model Mismatch 非均匀噪声和模型失配下图信号风险感知低通重要性采样
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3652954
Kai-Wei Peng
We study sampling of smooth/bandlimited graph signals when (i) sensor noise is heterogeneous across vertices and (ii) the graph used to design the sampler can be mildly mismatched to the true topology.We propose a risk-aware variant of local low-pass importance sampling that scores each vertex via a Hutchinson estimator of the diagonal of a graph heat-kernel operator and reweights the score by the inverse noise variance. The sampler selects without replacement according to these risk-aware scores. Reconstruction is performed with standard decoders (Tikhonov, Bandlimited, and a Chebyshev data-consistent smoother), enabling fair comparisons to prior work. On grid, Erdős–Rényi (ER), and Barabási–Albert (BA) graphs, our approach consistently reduces the normalized root-mean-square error (NRMSE) compared to random sampling; the gain increases with the sampling rate and persists under selection-graph mismatch. The method is simple, eigendecomposition-free, and scales linearly in the number of edges per Hutchinson probe.
我们研究了当(i)传感器噪声在各个顶点之间是异构的,以及(ii)用于设计采样器的图可能与真实拓扑有轻微的不匹配时,平滑/限带图信号的采样。我们提出了一种局部低通重要性采样的风险感知变体,通过图热核算子对角线的Hutchinson估计器对每个顶点进行评分,并通过逆噪声方差重新加权得分。抽样者根据这些风险意识得分进行选择而不进行替换。使用标准解码器(Tikhonov, Bandlimited和Chebyshev数据一致性平滑器)进行重建,可以与先前的工作进行公平比较。在网格、Erdős-Rényi (ER)和Barabási-Albert (BA)图上,与随机抽样相比,我们的方法一致地降低了归一化均方根误差(NRMSE);增益随采样率的增加而增加,并在选择图不匹配的情况下保持不变。该方法简单,无特征分解,且每个哈钦森探针的边数呈线性扩展。
{"title":"Risk-Aware Low-Pass Importance Sampling for Graph Signals Under Heterogeneous Noise and Model Mismatch","authors":"Kai-Wei Peng","doi":"10.1109/LSP.2026.3652954","DOIUrl":"https://doi.org/10.1109/LSP.2026.3652954","url":null,"abstract":"We study sampling of smooth/bandlimited graph signals when (i) sensor noise is heterogeneous across vertices and (ii) the graph used to design the sampler can be mildly mismatched to the true topology.We propose a risk-aware variant of local low-pass importance sampling that scores each vertex via a Hutchinson estimator of the diagonal of a graph heat-kernel operator and reweights the score by the inverse noise variance. The sampler selects without replacement according to these risk-aware scores. Reconstruction is performed with standard decoders (<sc>Tikhonov</small>, <sc>Bandlimited</small>, and a Chebyshev data-consistent smoother), enabling fair comparisons to prior work. On grid, Erdős–Rényi (ER), and Barabási–Albert (BA) graphs, our approach consistently reduces the normalized root-mean-square error (NRMSE) compared to random sampling; the gain increases with the sampling rate and persists under selection-graph mismatch. The method is simple, eigendecomposition-free, and scales linearly in the number of edges per Hutchinson probe.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"556-558"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026402","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent-Level Enhancement With Flow Matching for Robust Automatic Speech Recognition 基于流匹配的潜在级增强鲁棒自动语音识别
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3653238
Da-Hee Yang;Joon-Hyuk Chang
Noise-robust automatic speech recognition (ASR) has been commonly addressed by applying speech enhancement (SE) at the waveform level before recognition. However, speech-level enhancement does not always translate into consistent recognition improvements due to residual distortions and mismatches with the latent space of the ASR encoder. In this letter, we introduce a complementary strategy termed latent-level enhancement, where distorted representations are refined during ASR inference. Specifically, we propose a plug-and-play Flow Matching Refinement module (FM-Refiner) that operates on the output latents of a pretrained CTC-based ASR encoder. Trained to map imperfect latents—either directly from noisy inputs or from enhanced-but-imperfect speech—toward their clean counterparts, the FM-Refiner is applied only at inference, without fine-tuning ASR parameters. Experiments show that FM-Refiner consistently reduces word error rate, both when directly applied to noisy inputs and when combined with conventional SE front-ends. These results demonstrate that latent-level refinement via flow matching provides a lightweight and effective complement to existing SE approaches for robust ASR.
噪声鲁棒性自动语音识别(ASR)通常通过在识别前的波形级应用语音增强(SE)来解决。然而,由于残余的失真和与ASR编码器的潜在空间不匹配,语音水平的增强并不总是转化为一致的识别改进。在这封信中,我们介绍了一种称为潜在级增强的互补策略,其中在ASR推理期间对扭曲表示进行了细化。具体来说,我们提出了一个即插即用的流匹配细化模块(FM-Refiner),它在预训练的基于ctc的ASR编码器的输出电位上运行。经过训练,FM-Refiner可以直接从噪声输入或从增强但不完美的语音中映射不完美的潜在信号,并将其映射到干净的对应信号中,它只应用于推理,不需要对ASR参数进行微调。实验表明,FM-Refiner无论是直接应用于噪声输入还是与传统SE前端结合使用,都能持续降低单词错误率。这些结果表明,通过流匹配进行的潜在级细化为鲁棒ASR的现有SE方法提供了轻量级和有效的补充。
{"title":"Latent-Level Enhancement With Flow Matching for Robust Automatic Speech Recognition","authors":"Da-Hee Yang;Joon-Hyuk Chang","doi":"10.1109/LSP.2026.3653238","DOIUrl":"https://doi.org/10.1109/LSP.2026.3653238","url":null,"abstract":"Noise-robust automatic speech recognition (ASR) has been commonly addressed by applying speech enhancement (SE) at the waveform level before recognition. However, speech-level enhancement does not always translate into consistent recognition improvements due to residual distortions and mismatches with the latent space of the ASR encoder. In this letter, we introduce a complementary strategy termed latent-level enhancement, where distorted representations are refined during ASR inference. Specifically, we propose a plug-and-play Flow Matching Refinement module (FM-Refiner) that operates on the output latents of a pretrained CTC-based ASR encoder. Trained to map imperfect latents—either directly from noisy inputs or from enhanced-but-imperfect speech—toward their clean counterparts, the FM-Refiner is applied only at inference, without fine-tuning ASR parameters. Experiments show that FM-Refiner consistently reduces word error rate, both when directly applied to noisy inputs and when combined with conventional SE front-ends. These results demonstrate that latent-level refinement via flow matching provides a lightweight and effective complement to existing SE approaches for robust ASR.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"589-593"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026484","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MIMO Through-the-Wall Radar Micro-Doppler Signature Augmentation Method Based on Multi-Channel Information Fusion 基于多通道信息融合的MIMO穿墙雷达微多普勒特征增强方法
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3652951
Weicheng Gao;Shui Liu;Jinshuo Wang;Xiaodong Qu;Xiaopeng Yang
Through-the-wall radar (TWR) can monitor and analyze the motion characteristics and activity patterns of indoor human targets, with the advantages of non-contact, high flexibility and privacy protection. However, existing TWR human activity recognition (HAR) techniques developed based on single-channel radar contain limited Doppler information, making it difficult to achieve accurate recognition on data where the direction of human motion is not parallel to the radar observation. To solve this problem, in this letter, a multi-input-multi-output (MIMO) TWR micro-Doppler signature augmentation method based on multi-channel information fusion is proposed. First, a multi-channel Doppler profile feature fusion method based on multi-scale wavelets with low-rank decomposition is presented. Then, a motion parameter estimation method based on Broyden-Fletcher-Goldfarb-Shanno (BFGS) global optimization is proposed, and the fused Doppler profile transformation is implemented using the obtained orientation of human motion. Numerical simulated and measured experiments demonstrate the effectiveness of the proposed method.
穿墙雷达(TWR)可以监测和分析室内人体目标的运动特性和活动模式,具有非接触、高灵活性和隐私保护等优点。然而,现有的基于单通道雷达的TWR人体活动识别(HAR)技术包含有限的多普勒信息,这使得在人体运动方向与雷达观测不平行的数据上难以实现准确识别。针对这一问题,本文提出了一种基于多通道信息融合的多输入多输出(MIMO) TWR微多普勒特征增强方法。首先,提出了一种基于低秩分解的多尺度小波多普勒特征融合方法。然后,提出了一种基于BFGS (Broyden-Fletcher-Goldfarb-Shanno)全局优化的运动参数估计方法,并利用得到的人体运动方向进行融合多普勒轮廓变换。数值模拟和实测实验验证了该方法的有效性。
{"title":"MIMO Through-the-Wall Radar Micro-Doppler Signature Augmentation Method Based on Multi-Channel Information Fusion","authors":"Weicheng Gao;Shui Liu;Jinshuo Wang;Xiaodong Qu;Xiaopeng Yang","doi":"10.1109/LSP.2026.3652951","DOIUrl":"https://doi.org/10.1109/LSP.2026.3652951","url":null,"abstract":"Through-the-wall radar (TWR) can monitor and analyze the motion characteristics and activity patterns of indoor human targets, with the advantages of non-contact, high flexibility and privacy protection. However, existing TWR human activity recognition (HAR) techniques developed based on single-channel radar contain limited Doppler information, making it difficult to achieve accurate recognition on data where the direction of human motion is not parallel to the radar observation. To solve this problem, in this letter, a multi-input-multi-output (MIMO) TWR micro-Doppler signature augmentation method based on multi-channel information fusion is proposed. First, a multi-channel Doppler profile feature fusion method based on multi-scale wavelets with low-rank decomposition is presented. Then, a motion parameter estimation method based on Broyden-Fletcher-Goldfarb-Shanno (BFGS) global optimization is proposed, and the fused Doppler profile transformation is implemented using the obtained orientation of human motion. Numerical simulated and measured experiments demonstrate the effectiveness of the proposed method.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"579-583"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
LRCPN: A Lightweight Parallel Scheme for Underwater Acoustic Modulation Recognition LRCPN:一种轻量级的水声调制识别并行方案
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3652122
Bingzhang Wu;Shaoxuan Li;Ziyao Pan;Rongxin Zhang;Wei Su
This letter proposes a lightweight parallel recurrent–convolutional scheme to improve generalization capability and recognition accuracy while maintaining low computational complexity in resource-constrained underwater acoustic channels. In this scheme, the lightweight convolutional network is used to extract time–frequency features, and the lightweight recurrent network with gated recurrent units is used to capture long-term temporal phase correlations, thereby alleviating the Doppler-induced phase rotation and inter-symbol interference in time-varying multipath underwater acoustic channels. Sea-trial data are collected during shallow-water sea trials with strictly separated training and evaluation datasets. Experimental results on ten underwater acoustic modulation types show that the proposed scheme improves recognition accuracy by 6.2% and reduces computational cost by 22.4%, while exhibiting stronger generalization capability compared with benchmark schemes.
本文提出了一种轻量级的并行递归卷积方案,在资源受限的水声信道中提高泛化能力和识别精度,同时保持较低的计算复杂度。该方案采用轻量级卷积网络提取时频特征,采用带门控递归单元的轻量级递归网络捕获长期时间相位相关性,从而减轻时变多径水声信道中多普勒诱导的相位旋转和符号间干扰。海试数据是在浅水海试期间收集的,训练数据集和评估数据集严格分离。在10种水声调制类型下的实验结果表明,与基准方案相比,该方案的识别精度提高了6.2%,计算成本降低了22.4%,同时具有更强的泛化能力。
{"title":"LRCPN: A Lightweight Parallel Scheme for Underwater Acoustic Modulation Recognition","authors":"Bingzhang Wu;Shaoxuan Li;Ziyao Pan;Rongxin Zhang;Wei Su","doi":"10.1109/LSP.2026.3652122","DOIUrl":"https://doi.org/10.1109/LSP.2026.3652122","url":null,"abstract":"This letter proposes a lightweight parallel recurrent–convolutional scheme to improve generalization capability and recognition accuracy while maintaining low computational complexity in resource-constrained underwater acoustic channels. In this scheme, the lightweight convolutional network is used to extract time–frequency features, and the lightweight recurrent network with gated recurrent units is used to capture long-term temporal phase correlations, thereby alleviating the Doppler-induced phase rotation and inter-symbol interference in time-varying multipath underwater acoustic channels. Sea-trial data are collected during shallow-water sea trials with strictly separated training and evaluation datasets. Experimental results on ten underwater acoustic modulation types show that the proposed scheme improves recognition accuracy by 6.2% and reduces computational cost by 22.4%, while exhibiting stronger generalization capability compared with benchmark schemes.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"516-520"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026512","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multimodal Cosine Similarity Transformer for Gloss-Guided Sign Language Recognition 多模态余弦相似度转换器用于光泽引导的手语识别
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3653403
Lu Li;Qinkun Xiao;Peiran Liu
Continuous sign language recognition (CSLR) requires fine-grained alignment between visual sequences and gloss annotations under weak supervision, which is challenged by modality heterogeneity and ambiguous frame-to-gloss correspondence. We propose a Multimodal Cosine Similarity Transformer (MMCST) to address these issues. MMCST integrates RGB and keypoint heatmap features via gated fusion, and aligns them with gloss embeddings through a Gloss-Conditioned Cosine-Normalized Attention (GCNA) mechanism that stabilizes cross-modal alignment. To further enhance semantic consistency, we introduce Gloss-aware Contrastive Regularization (GLCR). The fused representation is modeled by a cosine-similarity Transformer and decoded with CTC. Experimental results show that MMCST achieves consistent improvements over strong baselines, and ablation studies confirm the effectiveness of gated fusion, GCNA, and GLCR in improving semantic alignment and yielding smoother training dynamics.
连续手语识别(CSLR)需要在弱监督下对视觉序列和语义注释进行细粒度对齐,但这一过程存在模态异质性和模糊的帧-语义对应关系。我们提出了一个多模态余弦相似变压器(MMCST)来解决这些问题。MMCST通过门控融合集成了RGB和关键点热图特征,并通过光泽条件余弦归一化注意(GCNA)机制将它们与光泽嵌入对齐,该机制稳定了跨模态对齐。为了进一步增强语义一致性,我们引入了感知光泽的对比正则化(GLCR)。融合表示采用余弦相似度变压器建模,并用CTC进行解码。实验结果表明,MMCST在强基线上取得了一致的改进,消融研究证实了门控融合、GCNA和GLCR在改善语义对齐和产生更平滑的训练动态方面的有效性。
{"title":"Multimodal Cosine Similarity Transformer for Gloss-Guided Sign Language Recognition","authors":"Lu Li;Qinkun Xiao;Peiran Liu","doi":"10.1109/LSP.2026.3653403","DOIUrl":"https://doi.org/10.1109/LSP.2026.3653403","url":null,"abstract":"Continuous sign language recognition (CSLR) requires fine-grained alignment between visual sequences and gloss annotations under weak supervision, which is challenged by modality heterogeneity and ambiguous frame-to-gloss correspondence. We propose a Multimodal Cosine Similarity Transformer (MMCST) to address these issues. MMCST integrates RGB and keypoint heatmap features via gated fusion, and aligns them with gloss embeddings through a Gloss-Conditioned Cosine-Normalized Attention (GCNA) mechanism that stabilizes cross-modal alignment. To further enhance semantic consistency, we introduce Gloss-aware Contrastive Regularization (GLCR). The fused representation is modeled by a cosine-similarity Transformer and decoded with CTC. Experimental results show that MMCST achieves consistent improvements over strong baselines, and ablation studies confirm the effectiveness of gated fusion, GCNA, and GLCR in improving semantic alignment and yielding smoother training dynamics.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"673-677"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082110","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Local-CGFC: A Local Cumulant Generating Function Classification Rule 局部累积量生成函数分类规则Local- cgfc
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-12 DOI: 10.1109/LSP.2026.3652119
Bo Tang;Steven Kay;Kaushallya Adhikari
A classification rule based on the cumulant generating function of the training data, called the Cumulant Generating Function Classifier (CGFC), has been recently proposed, and has shown promising performance in terms of improved classification accuracy and robustness against noises. This paper first presents a new information-theoretical explanation of CGFC which indeed makes a classification by minimizing sample mutual information. The original CGFC is a type of global model, and a new variant, called Local-CGFC, is further introduced in this paper to achieve a local classification rule. Experimental studies on real-life datasets demonstrate the effectiveness of the proposed classifier and further illustrate its great potential for a number of real-world applications.
最近提出了一种基于训练数据的累积量生成函数的分类规则,称为累积量生成函数分类器(CGFC),并在提高分类精度和对噪声的鲁棒性方面表现出了良好的性能。本文首先提出了一种新的信息理论解释,即通过最小化样本互信息进行分类。原来的CGFC是一种全局模型,本文进一步引入了一种新的变体local -CGFC来实现局部分类规则。对现实数据集的实验研究证明了所提出分类器的有效性,并进一步说明了其在许多现实应用中的巨大潜力。
{"title":"Local-CGFC: A Local Cumulant Generating Function Classification Rule","authors":"Bo Tang;Steven Kay;Kaushallya Adhikari","doi":"10.1109/LSP.2026.3652119","DOIUrl":"https://doi.org/10.1109/LSP.2026.3652119","url":null,"abstract":"A classification rule based on the cumulant generating function of the training data, called the Cumulant Generating Function Classifier (CGFC), has been recently proposed, and has shown promising performance in terms of improved classification accuracy and robustness against noises. This paper first presents a new information-theoretical explanation of CGFC which indeed makes a classification by minimizing sample mutual information. The original CGFC is a type of global model, and a new variant, called Local-CGFC, is further introduced in this paper to achieve a local classification rule. Experimental studies on real-life datasets demonstrate the effectiveness of the proposed classifier and further illustrate its great potential for a number of real-world applications.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"546-550"},"PeriodicalIF":3.9,"publicationDate":"2026-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026463","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward Detecting Hidden Functionalities in Deep Learning Models 探索深度学习模型中的隐藏功能
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-06 DOI: 10.1109/LSP.2026.3651083
Guobiao Li;Sheng Li;Zhenxing Qian;Xinpeng Zhang
Deep functionality hiding is an emerging technique that embeds confidential or sensitive functions within seemingly benign deep learning models (DLMs), which perform ordinary machine learning tasks. This enables such models to execute covert tasks while remaining undetected. Despite the rapid progress in deep functionality hiding, countermeasures remain unexplored. In this paper, we propose Distribution Offset Analysis (DOA), a novel method for detecting hidden functionalities in DLMs. Our key insight is that the weight distribution of a benign DLM typically follows a Gaussian distribution, whereas a container DLM with hidden functionalities exhibits notable statistical deviations from this Gaussian pattern. In our methodology, we first compute the distributional distance (i.e., offsets) between the model's weights and an ideal Gaussian distribution. We then fuse these offsets with weight features into a unified representation, which is subsequently used to train a meta-classifier for hidden functionality detection. Through extensive experiments, we demonstrate the effectiveness of the proposed DOA method, which achieves an average detection rate of over 87% against existing state-of-the-art deep functionality hiding techniques.
深度功能隐藏是一种新兴技术,它将机密或敏感功能嵌入到执行普通机器学习任务的看似良性的深度学习模型(dlm)中。这使得这些模型能够在不被发现的情况下执行隐蔽任务。尽管在深度功能隐藏方面进展迅速,但对策仍未探索。本文提出了一种检测dlm中隐藏功能的新方法——分布偏移分析(DOA)。我们的关键见解是,良性DLM的权重分布通常遵循高斯分布,而具有隐藏功能的容器DLM则表现出与这种高斯模式的显著统计偏差。在我们的方法中,我们首先计算模型权重和理想高斯分布之间的分布距离(即偏移量)。然后,我们将这些偏移量与权重特征融合成一个统一的表示,随后用于训练用于隐藏功能检测的元分类器。通过大量的实验,我们证明了所提出的DOA方法的有效性,相对于现有的最先进的深度功能隐藏技术,该方法的平均检测率超过87%。
{"title":"Toward Detecting Hidden Functionalities in Deep Learning Models","authors":"Guobiao Li;Sheng Li;Zhenxing Qian;Xinpeng Zhang","doi":"10.1109/LSP.2026.3651083","DOIUrl":"https://doi.org/10.1109/LSP.2026.3651083","url":null,"abstract":"Deep functionality hiding is an emerging technique that embeds confidential or sensitive functions within seemingly benign deep learning models (DLMs), which perform ordinary machine learning tasks. This enables such models to execute covert tasks while remaining undetected. Despite the rapid progress in deep functionality hiding, countermeasures remain unexplored. In this paper, we propose Distribution Offset Analysis (DOA), a novel method for detecting hidden functionalities in DLMs. Our key insight is that the weight distribution of a benign DLM typically follows a Gaussian distribution, whereas a container DLM with hidden functionalities exhibits notable statistical deviations from this Gaussian pattern. In our methodology, we first compute the distributional distance (<italic>i.e.,</i> offsets) between the model's weights and an ideal Gaussian distribution. We then fuse these offsets with weight features into a unified representation, which is subsequently used to train a meta-classifier for hidden functionality detection. Through extensive experiments, we demonstrate the effectiveness of the proposed DOA method, which achieves an average detection rate of over 87% against existing state-of-the-art deep functionality hiding techniques.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"541-545"},"PeriodicalIF":3.9,"publicationDate":"2026-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Signal Processing Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1