首页 > 最新文献

IEEE Signal Processing Letters最新文献

英文 中文
Interpreting the Trispectrum as the Cross-Spectrum of the Wigner-Ville Distribution. 将三谱解释为维格纳-维尔分布的交叉谱。
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-01 Epub Date: 2025-12-05 DOI: 10.1109/lsp.2025.3640510
Christopher K Kovach, Stephen V Gliske, Erin M Radcliffe, Sam Shipley, John A Thompson, Aviva Abosch

The fourth-order time-invariant spectrum, or trispectrum, has a simple derivation as the cross-spectrum among frequency bands in the Wigner-Ville distribution (WVD). Viewed this way, the trispectrum gains intuitive meaning as a measure of the linear dependence of power across frequencies, which yields some insight into its structure and interpretation. We highlight, in particular, a two-dimensional subdomain as useful for identifying modulated oscillations when the modulating envelope is non-negative or lowpass. Spectral characteristics of the carrier and modulating signals are revealed along separate axes of a two-dimensional representation of this domain. The application of this framework, combined with a previously described additive decomposition technique for higher-order spectra, is demonstrated by the blind identification and separation of sleep spindles and beta bursts in EEG.

四阶时不变谱,或三谱,在Wigner-Ville分布(WVD)中有一个简单的推导作为频带间的交叉谱。从这个角度来看,三谱作为功率在频率上的线性依赖的度量,获得了直观的意义,这对其结构和解释产生了一些见解。我们特别强调,当调制包络线是非负或低通时,二维子域对于识别调制振荡是有用的。载波和调制信号的频谱特性沿着该域的二维表示的单独轴显示。结合先前描述的高阶谱的加性分解技术,通过脑电睡眠纺锤波和β爆发的盲识别和分离证明了该框架的应用。
{"title":"Interpreting the Trispectrum as the Cross-Spectrum of the Wigner-Ville Distribution.","authors":"Christopher K Kovach, Stephen V Gliske, Erin M Radcliffe, Sam Shipley, John A Thompson, Aviva Abosch","doi":"10.1109/lsp.2025.3640510","DOIUrl":"10.1109/lsp.2025.3640510","url":null,"abstract":"<p><p>The fourth-order time-invariant spectrum, or trispectrum, has a simple derivation as the cross-spectrum among frequency bands in the Wigner-Ville distribution (WVD). Viewed this way, the trispectrum gains intuitive meaning as a measure of the linear dependence of power across frequencies, which yields some insight into its structure and interpretation. We highlight, in particular, a two-dimensional subdomain as useful for identifying modulated oscillations when the modulating envelope is non-negative or lowpass. Spectral characteristics of the carrier and modulating signals are revealed along separate axes of a two-dimensional representation of this domain. The application of this framework, combined with a previously described additive decomposition technique for higher-order spectra, is demonstrated by the blind identification and separation of sleep spindles and beta bursts in EEG.</p>","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"221-225"},"PeriodicalIF":3.9,"publicationDate":"2026-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12829976/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146046600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Language-Prompted Dynamic Learning for Semi-Supervised Medical Image Segmentation 半监督医学图像分割的语言提示动态学习
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-31 DOI: 10.1109/LSP.2025.3649590
Ying Zeng;Jialong Zhu
Medical image segmentation is fundamental to clinical diagnosis and treatment planning, yet existing models are constrained by the scarcity of annotated data, which are costly and labor-intensive to obtain. Semi-supervised learning (SSL) mitigates this issue by leveraging large volumes of unlabeled data, but most SSL methods rely solely on visual cues and often fail to capture subtle structures or low-contrast regions common in medical imaging. To address this limitation, we present LanDy, a Language-Prompted Dynamic Learning framework for semi-supervised medical image segmentation. LanDy introduces textual semantics from medical descriptions to enrich visual representations and reduce the ambiguity of pseudo-labels. Concretely, textual embeddings dynamically modulate convolutional filters to provide context-aware feature extraction, while a text-guided refinement mechanism improves the reliability of pseudo-labels on unlabeled data. Extensive experiments on benchmark datasets demonstrate that LanDy consistently outperforms state-of-the-art SSL methods, delivering more accurate and robust segmentation under annotation-efficient settings.
医学图像分割是临床诊断和治疗计划的基础,但现有的模型受到标注数据稀缺的限制,这些数据的获取成本高,劳动强度大。半监督学习(SSL)通过利用大量未标记的数据缓解了这个问题,但是大多数SSL方法仅仅依赖于视觉线索,通常无法捕获医学成像中常见的细微结构或低对比度区域。为了解决这一限制,我们提出了LanDy,一个用于半监督医学图像分割的语言提示动态学习框架。LanDy从医学描述中引入文本语义,丰富了视觉表征,减少了伪标签的歧义。具体而言,文本嵌入动态调制卷积过滤器以提供上下文感知的特征提取,而文本引导的细化机制提高了未标记数据上伪标签的可靠性。在基准数据集上的大量实验表明,LanDy始终优于最先进的SSL方法,在注释高效的设置下提供更准确和健壮的分割。
{"title":"Language-Prompted Dynamic Learning for Semi-Supervised Medical Image Segmentation","authors":"Ying Zeng;Jialong Zhu","doi":"10.1109/LSP.2025.3649590","DOIUrl":"https://doi.org/10.1109/LSP.2025.3649590","url":null,"abstract":"Medical image segmentation is fundamental to clinical diagnosis and treatment planning, yet existing models are constrained by the scarcity of annotated data, which are costly and labor-intensive to obtain. Semi-supervised learning (SSL) mitigates this issue by leveraging large volumes of unlabeled data, but most SSL methods rely solely on visual cues and often fail to capture subtle structures or low-contrast regions common in medical imaging. To address this limitation, we present <bold>LanDy</b>, a Language-Prompted Dynamic Learning framework for semi-supervised medical image segmentation. LanDy introduces textual semantics from medical descriptions to enrich visual representations and reduce the ambiguity of pseudo-labels. Concretely, textual embeddings dynamically modulate convolutional filters to provide context-aware feature extraction, while a text-guided refinement mechanism improves the reliability of pseudo-labels on unlabeled data. Extensive experiments on benchmark datasets demonstrate that LanDy consistently outperforms state-of-the-art SSL methods, delivering more accurate and robust segmentation under annotation-efficient settings.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"668-672"},"PeriodicalIF":3.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146082076","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scenario-Based Distributed Fusion Estimation for Uncertain Systems With Bounded Noise 具有有界噪声的不确定系统基于场景的分布式融合估计
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-31 DOI: 10.1109/LSP.2025.3649602
Changyong Xu;Bo Chen;Rusheng Wang;Zheming Wang
This letter investigates the distributed fusion estimation problem for uncertain systems, where noise statistics are unavailable. A scenario optimization framework is employed to handle model uncertainties, in which sampled uncertainty realizations are transformed into linear matrix inequality (LMI) constraints. By solving the resulting convex problems, local estimator gains are obtained, ensuring bounded mean-square error. Furthermore, an explicit upper bound for the fusion error is derived, and optimal fusion weights are determined through an LMI-based criterion. Finally, target tracking systems are provided to demonstrate the advantages and effectiveness of the proposed methods. The influence of the violation and confidence parameters on estimation accuracy and computational complexity is further analyzed.
本文研究了不可用噪声统计量的不确定系统的分布式融合估计问题。采用场景优化框架处理模型不确定性,将采样的不确定性实现转化为线性矩阵不等式约束。通过求解得到的凸问题,得到了局部估计量增益,保证了均方误差有界。进一步推导了融合误差的显式上界,并通过基于lmi的准则确定了最优融合权重。最后,以目标跟踪系统为例,说明了所提方法的优越性和有效性。进一步分析了违和参数和置信参数对估计精度和计算复杂度的影响。
{"title":"Scenario-Based Distributed Fusion Estimation for Uncertain Systems With Bounded Noise","authors":"Changyong Xu;Bo Chen;Rusheng Wang;Zheming Wang","doi":"10.1109/LSP.2025.3649602","DOIUrl":"https://doi.org/10.1109/LSP.2025.3649602","url":null,"abstract":"This letter investigates the distributed fusion estimation problem for uncertain systems, where noise statistics are unavailable. A scenario optimization framework is employed to handle model uncertainties, in which sampled uncertainty realizations are transformed into linear matrix inequality (LMI) constraints. By solving the resulting convex problems, local estimator gains are obtained, ensuring bounded mean-square error. Furthermore, an explicit upper bound for the fusion error is derived, and optimal fusion weights are determined through an LMI-based criterion. Finally, target tracking systems are provided to demonstrate the advantages and effectiveness of the proposed methods. The influence of the violation and confidence parameters on estimation accuracy and computational complexity is further analyzed.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"569-573"},"PeriodicalIF":3.9,"publicationDate":"2025-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026472","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Event-Based Dynamic Turbulence Mitigation 基于事件的动态湍流缓解
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-26 DOI: 10.1109/LSP.2025.3648967
Haoyi Zhao;Zeyu Xiao;Zihan Qi;Yang Zhao;Wei Jia
Atmospheric turbulence induces coupled spatio-temporal distortions, including blur, geometric deformation, and temporal jitter, which severely degrade image quality.We propose EvTurM, a practical framework leveraging event camera data for dynamic turbulence mitigation with precise motion cues and stable temporal modeling. Leveraging the high temporal resolution and dynamic range of events, EvTurM achieves robust restoration under diverse turbulence conditions. EvTurM comprises two key modules: (1) the event-aware modality enhancement module, which uses event-derived motion to enrich RGB features and recover structural details, and (2) the bidirectional modality calibration module, which jointly aligns RGB and event features in forward and backward propagation to reduce misalignment and enhance temporal consistency. Extensive experiments show EvTurM consistently surpasses existing methods and achieves superior performance.
大气湍流引起的时空耦合畸变包括模糊、几何变形和时间抖动,严重降低了图像质量。我们提出了EvTurM,这是一个实用的框架,利用事件相机数据进行动态湍流缓解,具有精确的运动线索和稳定的时间建模。利用事件的高时间分辨率和动态范围,EvTurM在各种湍流条件下实现了鲁棒恢复。EvTurM包括两个关键模块:(1)事件感知模态增强模块,该模块利用事件衍生的运动来丰富RGB特征并恢复结构细节;(2)双向模态校准模块,该模块将RGB和事件特征在前向和后向传播中联合对齐,以减少不对齐并增强时间一致性。大量的实验表明,EvTurM不断超越现有的方法,并取得了卓越的性能。
{"title":"Event-Based Dynamic Turbulence Mitigation","authors":"Haoyi Zhao;Zeyu Xiao;Zihan Qi;Yang Zhao;Wei Jia","doi":"10.1109/LSP.2025.3648967","DOIUrl":"https://doi.org/10.1109/LSP.2025.3648967","url":null,"abstract":"Atmospheric turbulence induces coupled spatio-temporal distortions, including blur, geometric deformation, and temporal jitter, which severely degrade image quality.We propose EvTurM, a practical framework leveraging event camera data for dynamic turbulence mitigation with precise motion cues and stable temporal modeling. Leveraging the high temporal resolution and dynamic range of events, EvTurM achieves robust restoration under diverse turbulence conditions. EvTurM comprises two key modules: (1) the event-aware modality enhancement module, which uses event-derived motion to enrich RGB features and recover structural details, and (2) the bidirectional modality calibration module, which jointly aligns RGB and event features in forward and backward propagation to reduce misalignment and enhance temporal consistency. Extensive experiments show EvTurM consistently surpasses existing methods and achieves superior performance.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"564-568"},"PeriodicalIF":3.9,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146026352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multi-Kernel Maximum Asymmetric Correntropy Criterion: Foundation and Analysis 多核最大不对称熵准则:基础与分析
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-26 DOI: 10.1109/LSP.2025.3648910
Xiaoqiang Long;Haiquan Zhao;Xinyan Hou
Traditional single-kernel or fixed-center multi-kernel collaborative correntropies fundamentally assume that errors primarily cluster around a central point (typically zero). However, in real-world complex noise environments—such as those generated by mixed interference sources with diverse mechanisms—errors may exhibit multi-modal or highly asymmetric statistical characteristics. In such cases, a single central point or multi-kernels fixed at the origin cannot effectively capture the true shape of the error distribution. To address these problems, this letter proposes a novel robust learning algorithm by introducing variable-center multi-kernel correntropy into an asymmetric correntropy framework, where the kernel centers can be positioned at arbitrary locations. Compared with the maximum asymmetric correntropy criterion (MACC) algorithm, the proposed approach offers a more generalized formulation that enhances its capability to handle more complex error distributions, thereby improving algorithm performance. Notably, existing literature has not yet provided theoretical analysis for such variable-center multi-kernel asymmetric correntropy robust algorithms. Therefore, the main contributions of this work include: conducting the first theoretical analysis of the proposed algorithm, and validating the effectiveness of the analytical methodology.
传统的单核或固定中心多核协同熵从根本上假设错误主要围绕一个中心点(通常为零)聚集。然而,在现实世界的复杂噪声环境中,例如由多种机制的混合干扰源产生的噪声,误差可能表现出多模态或高度不对称的统计特征。在这种情况下,单一中心点或固定在原点的多核不能有效地捕捉误差分布的真实形状。为了解决这些问题,本文提出了一种新的鲁棒学习算法,通过将变中心多核熵引入非对称熵框架,其中核中心可以定位在任意位置。与最大不对称熵准则(MACC)算法相比,该方法提供了更广义的公式,增强了其处理更复杂误差分布的能力,从而提高了算法性能。值得注意的是,现有文献尚未对这种变中心多核不对称熵鲁棒算法进行理论分析。因此,本工作的主要贡献包括:对所提出的算法进行第一次理论分析,并验证分析方法的有效性。
{"title":"Multi-Kernel Maximum Asymmetric Correntropy Criterion: Foundation and Analysis","authors":"Xiaoqiang Long;Haiquan Zhao;Xinyan Hou","doi":"10.1109/LSP.2025.3648910","DOIUrl":"https://doi.org/10.1109/LSP.2025.3648910","url":null,"abstract":"Traditional single-kernel or fixed-center multi-kernel collaborative correntropies fundamentally assume that errors primarily cluster around a central point (typically zero). However, in real-world complex noise environments—such as those generated by mixed interference sources with diverse mechanisms—errors may exhibit multi-modal or highly asymmetric statistical characteristics. In such cases, a single central point or multi-kernels fixed at the origin cannot effectively capture the true shape of the error distribution. To address these problems, this letter proposes a novel robust learning algorithm by introducing variable-center multi-kernel correntropy into an asymmetric correntropy framework, where the kernel centers can be positioned at arbitrary locations. Compared with the maximum asymmetric correntropy criterion (MACC) algorithm, the proposed approach offers a more generalized formulation that enhances its capability to handle more complex error distributions, thereby improving algorithm performance. Notably, existing literature has not yet provided theoretical analysis for such variable-center multi-kernel asymmetric correntropy robust algorithms. Therefore, the main contributions of this work include: conducting the first theoretical analysis of the proposed algorithm, and validating the effectiveness of the analytical methodology.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"411-415"},"PeriodicalIF":3.9,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929407","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Fractional-Order Cauchy Penalty With Enhanced Adaptability for Signal Recovery 具有增强信号恢复适应性的分数阶柯西惩罚
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-26 DOI: 10.1109/LSP.2025.3648640
Zhen Gao;Yiping Jiang;Rong Yang;Xingqun Zhan
Recovering signals from noisy observations remains challenging due to the ill-posedness of inverse problems. While non-convex regularization methods like the standard Cauchy penalty improve estimation accuracy, it lacks adaptability across diverse scenarios. In response, this letter proposes a fractional-order Cauchy (q-Cauchy) penalty inspired by the Lq maximum likelihood estimation. By introducing the parameter $q$, the q-Cauchy penalty achieves greater adaptability in diverse scenarios. Specifically, we also derive sufficient convexity conditions for its proximal operator and propose a forward-backward solver. Simulation results demonstrate that the q-Cauchy with the appropriate $q$ outperforms the baseline methods in both 1D signal denoising and 2D image deblurring tasks.
由于逆问题的病态性,从噪声观测中恢复信号仍然具有挑战性。虽然像标准柯西惩罚这样的非凸正则化方法提高了估计精度,但它缺乏对不同场景的适应性。作为回应,这封信提出了一个分数阶柯西(q-Cauchy)惩罚,灵感来自Lq最大似然估计。通过引入参数$q$, q- cauchy penalty在不同的场景下具有更强的适应性。具体地说,我们还得到了它的近算子的充分凸性条件,并提出了一个正向向后求解器。仿真结果表明,适当的q- cauchy方法在一维信号去噪和二维图像去模糊任务中都优于基线方法。
{"title":"A Fractional-Order Cauchy Penalty With Enhanced Adaptability for Signal Recovery","authors":"Zhen Gao;Yiping Jiang;Rong Yang;Xingqun Zhan","doi":"10.1109/LSP.2025.3648640","DOIUrl":"https://doi.org/10.1109/LSP.2025.3648640","url":null,"abstract":"Recovering signals from noisy observations remains challenging due to the ill-posedness of inverse problems. While non-convex regularization methods like the standard Cauchy penalty improve estimation accuracy, it lacks adaptability across diverse scenarios. In response, this letter proposes a fractional-order Cauchy (q-Cauchy) penalty inspired by the Lq maximum likelihood estimation. By introducing the parameter <inline-formula><tex-math>$q$</tex-math></inline-formula>, the q-Cauchy penalty achieves greater adaptability in diverse scenarios. Specifically, we also derive sufficient convexity conditions for its proximal operator and propose a forward-backward solver. Simulation results demonstrate that the q-Cauchy with the appropriate <inline-formula><tex-math>$q$</tex-math></inline-formula> outperforms the baseline methods in both 1D signal denoising and 2D image deblurring tasks.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"456-460"},"PeriodicalIF":3.9,"publicationDate":"2025-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian Hybrid Attention Module for Underwater Acoustic Target Recognition 一种用于水声目标识别的贝叶斯混合注意模块
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-25 DOI: 10.1109/LSP.2025.3648249
Menghui Lei;Xiangyang Zeng;Mingmin Zeng;Anqi Jin
The attention mechanism improves underwater acoustic target recognition (UATR) by suppressing irrelevant features. However, due to the uncertainty and scarcity of underwater acoustic target (UWAT) signals, complicated deterministic attention modules increase the risk of model overfitting, resulting in limited improvement or even degradation in the performance of UATR. This letter proposes a Bayesian Hybrid Attention Module (BHAM) that enhances UATR based on time–frequency (T–F) features. BHAM models attention weights as random variables following Beta and Dirichlet distributions to capture uncertainty of UWAT signals and mitigate overfitting, while strengthening T–F feature representation via Bayesian channel attention and Bayesian T–F attention. By learning attention distributions in a Bayesian manner, BHAM effectively models complex dependencies in UWAT signals. Experiments on the DeepShip dataset demonstrate that BHAM alleviates overfitting and generalizes well across different network backbones.
注意机制通过抑制无关特征来提高水声目标识别能力。然而,由于水声目标(UWAT)信号的不确定性和稀缺性,复杂的确定性注意模块增加了模型过拟合的风险,导致UATR性能的提高有限甚至下降。这封信提出了一种基于时频(T-F)特征增强UATR的贝叶斯混合注意模块(BHAM)。BHAM将注意力权重建模为遵循Beta和Dirichlet分布的随机变量,以捕获UWAT信号的不确定性并减轻过拟合,同时通过贝叶斯信道注意和贝叶斯T-F注意加强T-F特征表示。通过以贝叶斯方式学习注意力分布,BHAM有效地模拟了UWAT信号中的复杂依赖关系。在DeepShip数据集上进行的实验表明,该方法可以很好地缓解过拟合问题,并能很好地泛化不同网络骨干网。
{"title":"A Bayesian Hybrid Attention Module for Underwater Acoustic Target Recognition","authors":"Menghui Lei;Xiangyang Zeng;Mingmin Zeng;Anqi Jin","doi":"10.1109/LSP.2025.3648249","DOIUrl":"https://doi.org/10.1109/LSP.2025.3648249","url":null,"abstract":"The attention mechanism improves underwater acoustic target recognition (UATR) by suppressing irrelevant features. However, due to the uncertainty and scarcity of underwater acoustic target (UWAT) signals, complicated deterministic attention modules increase the risk of model overfitting, resulting in limited improvement or even degradation in the performance of UATR. This letter proposes a Bayesian Hybrid Attention Module (BHAM) that enhances UATR based on time–frequency (T–F) features. BHAM models attention weights as random variables following Beta and Dirichlet distributions to capture uncertainty of UWAT signals and mitigate overfitting, while strengthening T–F feature representation via Bayesian channel attention and Bayesian T–F attention. By learning attention distributions in a Bayesian manner, BHAM effectively models complex dependencies in UWAT signals. Experiments on the DeepShip dataset demonstrate that BHAM alleviates overfitting and generalizes well across different network backbones.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"441-445"},"PeriodicalIF":3.9,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient CRB Estimation for Linear Models via Expectation Propagation and Monte Carlo Sampling 基于期望传播和蒙特卡罗抽样的线性模型有效CRB估计
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-25 DOI: 10.1109/LSP.2025.3648638
Fangqing Xiao;Dirk Slock
The Cramér–Rao bound (CRB) quantifies the variance lower bound for unbiased estimators, but it is intractable to evaluate in linear hierarchical Bayesian models with non-Gaussian priors due to the intractable marginal likelihood. Existing methods, including variational Bayes and Markov chain Monte Carlo (MCMC)-based approaches, often have high computational cost and slow convergence. We propose an efficient framework to approximate the Fisher information matrix (FIM) and the CRB by expressing the gradient of the log marginal likelihood as a posterior expectation. Expectation propagation (EP) is used to approximate the posterior as a Gaussian, enabling accurate moment estimation compared to pure sampling-based methods. Numerical experiments on small-scale sparse models show that the EP-based CRB approximation achieves lower average normalized mean squared error (NMSE) and faster convergence than classical baselines in non-Gaussian settings.
cram - rao界(CRB)量化了无偏估计量的方差下界,但在具有非高斯先验的线性层次贝叶斯模型中由于难以处理的边际似然而难以估计。现有的方法,包括变分贝叶斯和基于马尔可夫链蒙特卡罗(MCMC)的方法,往往计算成本高,收敛速度慢。通过将对数边际似然的梯度表示为后验期望,我们提出了一个有效的框架来近似Fisher信息矩阵(FIM)和CRB。期望传播(EP)用于将后验近似为高斯,与纯基于抽样的方法相比,能够实现准确的矩估计。在小尺度稀疏模型上的数值实验表明,在非高斯环境下,基于ep的CRB近似比经典基线具有更低的平均归一化均方误差(NMSE)和更快的收敛速度。
{"title":"Efficient CRB Estimation for Linear Models via Expectation Propagation and Monte Carlo Sampling","authors":"Fangqing Xiao;Dirk Slock","doi":"10.1109/LSP.2025.3648638","DOIUrl":"https://doi.org/10.1109/LSP.2025.3648638","url":null,"abstract":"The Cramér–Rao bound (CRB) quantifies the variance lower bound for unbiased estimators, but it is intractable to evaluate in linear hierarchical Bayesian models with non-Gaussian priors due to the intractable marginal likelihood. Existing methods, including variational Bayes and Markov chain Monte Carlo (MCMC)-based approaches, often have high computational cost and slow convergence. We propose an efficient framework to approximate the Fisher information matrix (FIM) and the CRB by expressing the gradient of the log marginal likelihood as a posterior expectation. Expectation propagation (EP) is used to approximate the posterior as a Gaussian, enabling accurate moment estimation compared to pure sampling-based methods. Numerical experiments on small-scale sparse models show that the EP-based CRB approximation achieves lower average normalized mean squared error (NMSE) and faster convergence than classical baselines in non-Gaussian settings.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"451-455"},"PeriodicalIF":3.9,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929514","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Video Face Super-Resolution With High-Precision Identity Preservation 具有高精度身份保存的视频人脸超分辨率
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-25 DOI: 10.1109/LSP.2025.3648639
Chaoliang Wu;Ting Zhang;Xianbin Zhang;Nian He;Yiwen Xu
As an emerging technology, Video Face Super-Resolution (VFSR) aims to reconstruct high-resolution facial images from low-quality video sequences while maintaining identity consistency, which makes it applicable to scenarios such as surveillance, video conferencing, and film restoration. Compared with image-based face restoration and general video super-resolution, VFSR is more challenging because it requires accurate facial detail reconstruction, strict identity preservation, and computational efficiency under varying poses and expressions. To address these challenges, we propose a High-Precision Identity Preserving VFSR framework (HPIP), which integrates a Multi-Scale Prediction Module (MPM) and an Identity Preservation Module (IPM). The MPM focuses on identity-critical facial regions (e.g., eyes, nose, and mouth) and leverages multi-scale feature prediction to improve reconstruction accuracy and robustness while maintaining computational efficiency. The IPM further projects features into a latent representation space, generating temporally consistent dictionary features and enhancing temporal coherence. Extensive experiments demonstrate that HPIP achieves superior performance in both qualitative and quantitative evaluations, producing visually pleasing facial details while maintaining an efficient and lightweight design.
视频人脸超分辨率(Video Face Super-Resolution, VFSR)是一项新兴技术,旨在从低质量的视频序列中重建高分辨率的人脸图像,同时保持身份一致性,适用于监控、视频会议和电影修复等场景。与基于图像的人脸恢复和一般的视频超分辨率相比,VFSR需要精确的人脸细节重建,严格的身份保持,以及不同姿态和表情下的计算效率,更具挑战性。为了解决这些问题,我们提出了一个高精度身份保持VFSR框架(HPIP),该框架集成了一个多尺度预测模块(MPM)和一个身份保持模块(IPM)。MPM专注于身份关键面部区域(例如,眼睛,鼻子和嘴巴),并利用多尺度特征预测来提高重建精度和鲁棒性,同时保持计算效率。IPM进一步将特征投射到潜在的表示空间中,生成时间一致的字典特征并增强时间一致性。大量的实验表明,HPIP在定性和定量评估中都取得了卓越的性能,在保持高效和轻量化设计的同时,产生了视觉上令人愉悦的面部细节。
{"title":"Video Face Super-Resolution With High-Precision Identity Preservation","authors":"Chaoliang Wu;Ting Zhang;Xianbin Zhang;Nian He;Yiwen Xu","doi":"10.1109/LSP.2025.3648639","DOIUrl":"https://doi.org/10.1109/LSP.2025.3648639","url":null,"abstract":"As an emerging technology, Video Face Super-Resolution (VFSR) aims to reconstruct high-resolution facial images from low-quality video sequences while maintaining identity consistency, which makes it applicable to scenarios such as surveillance, video conferencing, and film restoration. Compared with image-based face restoration and general video super-resolution, VFSR is more challenging because it requires accurate facial detail reconstruction, strict identity preservation, and computational efficiency under varying poses and expressions. To address these challenges, we propose a High-Precision Identity Preserving VFSR framework (HPIP), which integrates a Multi-Scale Prediction Module (MPM) and an Identity Preservation Module (IPM). The MPM focuses on identity-critical facial regions (<italic>e.g.</i>, eyes, nose, and mouth) and leverages multi-scale feature prediction to improve reconstruction accuracy and robustness while maintaining computational efficiency. The IPM further projects features into a latent representation space, generating temporally consistent dictionary features and enhancing temporal coherence. Extensive experiments demonstrate that HPIP achieves superior performance in both qualitative and quantitative evaluations, producing visually pleasing facial details while maintaining an efficient and lightweight design.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"406-410"},"PeriodicalIF":3.9,"publicationDate":"2025-12-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
High-Capacity Image Steganography via Latent Diffusion Models 基于潜在扩散模型的大容量图像隐写
IF 3.9 2区 工程技术 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-12-24 DOI: 10.1109/LSP.2025.3647567
Ruijie Du;Na Wang;Cheng Xiong;Chuan Qin;Xinpeng Zhang
Generative steganography has recently attracted considerable attention due to its superior security properties. However, most existing approaches suffer from limited hiding capacity. To address this issue, this paper proposes a high-capacity image steganography framework that integrates an encoder–decoder architecture with a latent diffusion model. Specifically, a message encoder is designed to transform binary secret messages into latent-space representations through a series of ResDense modules, enabling efficient hiding of large-scale information. The encoded latent features are then guided by the latent diffusion model to synthesize visually realistic stego images. During message extraction, the stego image undergoes iterative noise addition within the diffusion process to reconstruct the latent representation, from which a message decoder accurately recovers the hidden message. Extensive experimental results demonstrate that the proposed method achieves a high hiding capacity of over 30,000 bits, outperforming state-of-the-art methods while ensuring reliable message recovery under common image storage formats such as JPEG and PNG.
生成隐写术由于其优越的安全性能,近年来引起了人们的广泛关注。然而,大多数现有的方法都存在隐藏能力有限的问题。为了解决这个问题,本文提出了一个高容量的图像隐写框架,该框架将编码器-解码器架构与潜在扩散模型集成在一起。具体来说,设计了一个消息编码器,通过一系列ResDense模块将二进制秘密消息转换为潜在空间表示,从而实现大规模信息的有效隐藏。然后利用潜扩散模型对编码后的潜特征进行引导,合成视觉逼真的隐写图像。在信息提取过程中,隐写图像在扩散过程中进行迭代噪声相加,重建潜在表示,信息解码器从中准确地恢复隐藏信息。大量的实验结果表明,该方法实现了超过30,000位的高隐藏容量,在保证常见图像存储格式(如JPEG和PNG)下可靠的消息恢复的同时,优于目前最先进的方法。
{"title":"High-Capacity Image Steganography via Latent Diffusion Models","authors":"Ruijie Du;Na Wang;Cheng Xiong;Chuan Qin;Xinpeng Zhang","doi":"10.1109/LSP.2025.3647567","DOIUrl":"https://doi.org/10.1109/LSP.2025.3647567","url":null,"abstract":"Generative steganography has recently attracted considerable attention due to its superior security properties. However, most existing approaches suffer from limited hiding capacity. To address this issue, this paper proposes a high-capacity image steganography framework that integrates an encoder–decoder architecture with a latent diffusion model. Specifically, a message encoder is designed to transform binary secret messages into latent-space representations through a series of ResDense modules, enabling efficient hiding of large-scale information. The encoded latent features are then guided by the latent diffusion model to synthesize visually realistic stego images. During message extraction, the stego image undergoes iterative noise addition within the diffusion process to reconstruct the latent representation, from which a message decoder accurately recovers the hidden message. Extensive experimental results demonstrate that the proposed method achieves a high hiding capacity of over 30,000 bits, outperforming state-of-the-art methods while ensuring reliable message recovery under common image storage formats such as JPEG and PNG.","PeriodicalId":13154,"journal":{"name":"IEEE Signal Processing Letters","volume":"33 ","pages":"401-405"},"PeriodicalIF":3.9,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145929556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE Signal Processing Letters
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1