首页 > 最新文献

IEEE open journal of signal processing最新文献

英文 中文
Robust Kalman Filtering via Correntropy-Based Higher-Order Moment Adaptation and Variable Structure Gains 基于相关熵的高阶矩自适应和变结构增益鲁棒卡尔曼滤波
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-02-23 DOI: 10.1109/OJSP.2026.3667079
Waleed Hilal;Alex McCafferty-Leroux;John Yawney;S. Andrew Gadsden
This paper proposes two novel filtering strategies as sub-optimal robust solutions for state estimation in systems affected by non-Gaussian noise, outliers, or modeling uncertainties. The moments-based Kalman filter (MKF) and moments-based innovation filter (MIF) replace the mean squared error criterion with a correntropy-based cost function that incorporates higher-order statistical moments of the innovation sequence. Through Taylor series expansion of the Gaussian kernel, correntropy inherently captures all even-order moments—including variance, kurtosis, and higher-order statistics—providing natural robustness to heavy-tailed and asymmetric noise distributions. An adaptive kernel bandwidth mechanism uses real-time estimates of innovation skewness and kurtosis to automatically balance efficiency and robustness. The MIF augments this framework with variable structure control theory, incorporating a saturation-based gain that bounds corrective action during large disturbances. Both methods employ fixed-point iteration with correntropy-weighted covariance matrices in their predictor-corrector algorithms. Mathematical derivations and stability proofs are provided for both filters. The approaches extend to nonlinear systems through first-order Taylor series linearization, yielding the extended MKF (EMKF) and extended MIF (EMIF). To validate their robustness relative to the conventional Kalman filter, the proposed methods are applied to both linear and nonlinear representations of a simulated electrohydrostatic actuator (EHA) experiencing leakage faults. Computational experiments demonstrate that the MKF and MIF achieve superior estimation accuracy compared to the KF under non-Gaussian conditions, more faithfully representing faulty system behavior.
本文提出了两种新的滤波策略,作为受非高斯噪声、异常值或建模不确定性影响的系统状态估计的次优鲁棒解。基于矩的卡尔曼滤波器(MKF)和基于矩的创新滤波器(MIF)用包含创新序列高阶统计矩的基于熵的成本函数取代均方误差准则。通过高斯核的泰勒级数展开,相关熵固有地捕获所有偶阶矩——包括方差、峰度和高阶统计量——为重尾和非对称噪声分布提供自然的鲁棒性。自适应核带宽机制利用创新偏度和峰度的实时估计来自动平衡效率和鲁棒性。MIF用变结构控制理论增强了这一框架,结合了基于饱和的增益,在大扰动时限制校正动作。这两种方法的预测校正算法都采用了熵权加权协方差矩阵的不动点迭代。给出了两种滤波器的数学推导和稳定性证明。该方法通过一阶泰勒级数线性化扩展到非线性系统,得到了扩展MKF (EMKF)和扩展MIF (EMIF)。为了验证其相对于传统卡尔曼滤波的鲁棒性,将所提出的方法应用于模拟电静液执行器(EHA)泄漏故障的线性和非线性表示。计算实验表明,与非高斯条件下的KF相比,MKF和MIF具有更高的估计精度,更忠实地反映了系统的故障行为。
{"title":"Robust Kalman Filtering via Correntropy-Based Higher-Order Moment Adaptation and Variable Structure Gains","authors":"Waleed Hilal;Alex McCafferty-Leroux;John Yawney;S. Andrew Gadsden","doi":"10.1109/OJSP.2026.3667079","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3667079","url":null,"abstract":"This paper proposes two novel filtering strategies as sub-optimal robust solutions for state estimation in systems affected by non-Gaussian noise, outliers, or modeling uncertainties. The moments-based Kalman filter (MKF) and moments-based innovation filter (MIF) replace the mean squared error criterion with a correntropy-based cost function that incorporates higher-order statistical moments of the innovation sequence. Through Taylor series expansion of the Gaussian kernel, correntropy inherently captures all even-order moments—including variance, kurtosis, and higher-order statistics—providing natural robustness to heavy-tailed and asymmetric noise distributions. An adaptive kernel bandwidth mechanism uses real-time estimates of innovation skewness and kurtosis to automatically balance efficiency and robustness. The MIF augments this framework with variable structure control theory, incorporating a saturation-based gain that bounds corrective action during large disturbances. Both methods employ fixed-point iteration with correntropy-weighted covariance matrices in their predictor-corrector algorithms. Mathematical derivations and stability proofs are provided for both filters. The approaches extend to nonlinear systems through first-order Taylor series linearization, yielding the extended MKF (EMKF) and extended MIF (EMIF). To validate their robustness relative to the conventional Kalman filter, the proposed methods are applied to both linear and nonlinear representations of a simulated electrohydrostatic actuator (EHA) experiencing leakage faults. Computational experiments demonstrate that the MKF and MIF achieve superior estimation accuracy compared to the KF under non-Gaussian conditions, more faithfully representing faulty system behavior.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"343-355"},"PeriodicalIF":2.7,"publicationDate":"2026-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11406897","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Diverse Subset Selection via Norm-Based Sampling and Orthogonality 基于范数采样和正交性的多样化子集选择
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-02-20 DOI: 10.1109/OJSP.2026.3666822
Noga Bar;Raja Giryes
Large annotated datasets are crucial for the success of deep learning, but labeling data can be prohibitively expensive in domains such as medical imaging. This work tackles the subset selection problem: selecting a small set of the most informative examples from a large unlabeled pool for annotation. We propose a simple and effective method that combines feature norms, randomization, and orthogonality (via the Gram–Schmidt process) to select diverse and informative samples. Feature norms serve as a proxy for informativeness, while randomization and orthogonalization reduce redundancy and encourage coverage of the feature space. Extensive experiments on image and text benchmarks, including CIFAR-10/100, Tiny ImageNet, ImageNet, OrganAMNIST, and Yelp, show that our method consistently improves subset selection performance, both as a standalone approach and when integrated with existing techniques.
大型标注数据集对于深度学习的成功至关重要,但在医学成像等领域,标注数据可能过于昂贵。这项工作解决了子集选择问题:从一个大型未标记池中选择一小部分最有信息的示例进行注释。我们提出了一种简单有效的方法,结合特征规范、随机化和正交性(通过Gram-Schmidt过程)来选择多样化和信息丰富的样本。特征规范作为信息性的代理,而随机化和正交化则减少冗余并鼓励特征空间的覆盖。在图像和文本基准测试(包括CIFAR-10/100、Tiny ImageNet、ImageNet、OrganAMNIST和Yelp)上进行的大量实验表明,我们的方法无论是作为独立方法还是与现有技术集成时,都能持续提高子集选择性能。
{"title":"Diverse Subset Selection via Norm-Based Sampling and Orthogonality","authors":"Noga Bar;Raja Giryes","doi":"10.1109/OJSP.2026.3666822","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3666822","url":null,"abstract":"Large annotated datasets are crucial for the success of deep learning, but labeling data can be prohibitively expensive in domains such as medical imaging. This work tackles the <italic>subset selection problem:</i> selecting a small set of the most informative examples from a large unlabeled pool for annotation. We propose a simple and effective method that combines feature norms, randomization, and orthogonality (via the Gram–Schmidt process) to select diverse and informative samples. Feature norms serve as a proxy for informativeness, while randomization and orthogonalization reduce redundancy and encourage coverage of the feature space. Extensive experiments on image and text benchmarks, including CIFAR-10/100, Tiny ImageNet, ImageNet, OrganAMNIST, and Yelp, show that our method consistently improves subset selection performance, both as a standalone approach and when integrated with existing techniques.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"333-342"},"PeriodicalIF":2.7,"publicationDate":"2026-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11404417","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440547","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Robust Localization in Modern Cellular Networks Using Global Map Features 基于全局地图特征的现代蜂窝网络鲁棒定位
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-02-16 DOI: 10.1109/OJSP.2026.3665385
Junshi Chen;Xuhong Li;Russ Whiton;Erik Leitinger;Fredrik Tufvesson
Radio frequency (RF) signal-based localization using modern cellular networks has emerged as a promising solution to accurately locate objects in challenging environments. One of the most promising solutions for situations involving obstructed-line-of-sight (OLoS) and multipath propagation is multipath-based simultaneous localization and mapping (MP-SLAM) that employs map features (MFs), such as virtual anchors. This paper presents an extended MP-SLAM method that is augmented with a global map feature (GMF) repository. This repository stores consistent MFs of high quality that are collected during prior traversals. We integrate these GMFs back into the MP-SLAM framework via a probability hypothesis density (PHD) filter, which propagates GMF intensity functions over time. Extensive simulations, together with a challenging real-world experiment using LTE RF signals in a dense urban scenario with severe multipath propagation and inter-cell interference, demonstrate that our framework achieves robust and accurate localization, thereby showcasing its effectiveness in realistic modern cellular networks such as 5G or future 6G networks. It outperforms conventional proprioceptive sensor-based localization and conventional MP-SLAM methods, and achieves reliable localization even under adverse signal conditions.
使用现代蜂窝网络的射频(RF)信号定位已经成为一种有前途的解决方案,可以在具有挑战性的环境中准确定位物体。针对视线受阻(OLoS)和多路径传播的情况,最有希望的解决方案之一是基于多路径的同步定位和映射(MP-SLAM),该解决方案采用地图特征(mf),如虚拟锚点。本文提出了一种扩展的MP-SLAM方法,该方法增加了一个全局地图特征库(GMF)。此存储库存储在先前遍历期间收集的一致的高质量mf。我们通过概率假设密度(PHD)过滤器将这些GMF整合回MP-SLAM框架,该过滤器随时间传播GMF强度函数。广泛的模拟,以及在具有严重多径传播和蜂窝间干扰的密集城市场景中使用LTE RF信号的具有挑战性的现实世界实验,表明我们的框架实现了鲁棒和准确的定位,从而展示了其在现实的现代蜂窝网络(如5G或未来6G网络)中的有效性。它优于传统的基于本体感觉传感器的定位方法和传统的MP-SLAM方法,即使在不利的信号条件下也能实现可靠的定位。
{"title":"Robust Localization in Modern Cellular Networks Using Global Map Features","authors":"Junshi Chen;Xuhong Li;Russ Whiton;Erik Leitinger;Fredrik Tufvesson","doi":"10.1109/OJSP.2026.3665385","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3665385","url":null,"abstract":"Radio frequency (RF) signal-based localization using modern cellular networks has emerged as a promising solution to accurately locate objects in challenging environments. One of the most promising solutions for situations involving obstructed-line-of-sight (OLoS) and multipath propagation is multipath-based simultaneous localization and mapping (MP-SLAM) that employs map features (MFs), such as virtual anchors. This paper presents an extended MP-SLAM method that is augmented with a global map feature (GMF) repository. This repository stores consistent MFs of high quality that are collected during prior traversals. We integrate these GMFs back into the MP-SLAM framework via a probability hypothesis density (PHD) filter, which propagates GMF intensity functions over time. Extensive simulations, together with a challenging real-world experiment using LTE RF signals in a dense urban scenario with severe multipath propagation and inter-cell interference, demonstrate that our framework achieves robust and accurate localization, thereby showcasing its effectiveness in realistic modern cellular networks such as 5G or future 6G networks. It outperforms conventional proprioceptive sensor-based localization and conventional MP-SLAM methods, and achieves reliable localization even under adverse signal conditions.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"356-372"},"PeriodicalIF":2.7,"publicationDate":"2026-02-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11397371","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147440679","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sampling Method for Generalized Graph Signals With Pre-Selected Vertices via DC Optimization 基于DC优化的带预选点的广义图信号采样方法
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-02-13 DOI: 10.1109/OJSP.2026.3664335
Keitaro Yamashita;Kazuki Naganuma;Shunsuke Ono
This paper proposes a method for vertex-wise aggregation sampling of a broad class of graph signals, designed to attain the best possible recovery based on the generalized sampling theory. This is achieved by designing a sampling operator by an optimization problem, which is inherently non-convex, as the best possible recovery imposes a rank constraint. An existing method for vertex-wise aggregation sampling is able to control the number of active vertices but cannot incorporate prior knowledge of mandatory or avoided vertices. To address these challenges, we formulate the operator design as a problem that handles a constraint on the number of active vertices and prior knowledge on specific vertices for sampling, mandatory inclusion or exclusion. We transformed this constrained problem into a difference-of-convex (DC) optimization problem by using the nuclear norm and a DC penalty for vertex selection. To solve this, we develop a convergent solver based on the general double-proximal gradient DC algorithm. The effectiveness of our method is demonstrated through experiments on various graph signal models, including real-world data, showing superior performance in the recovery accuracy compared to existing methods.
本文提出了一种基于广义采样理论的对一类图信号进行逐点聚集采样的方法,旨在获得尽可能好的恢复。这是通过一个优化问题设计一个采样算子来实现的,这个优化问题本质上是非凸的,因为最佳可能的恢复施加了秩约束。现有的基于顶点的聚合采样方法能够控制活动顶点的数量,但不能包含强制顶点或避免顶点的先验知识。为了解决这些挑战,我们将算子设计表述为一个问题,该问题处理对活动顶点数量的约束和对特定顶点的先验知识进行采样,强制包含或排除。通过核范数和顶点选择的凸差惩罚,将该约束问题转化为凸差优化问题。为了解决这个问题,我们开发了一个基于一般双近端梯度DC算法的收敛求解器。通过对各种图形信号模型(包括实际数据)的实验证明了我们方法的有效性,与现有方法相比,在恢复精度方面表现出优越的性能。
{"title":"Sampling Method for Generalized Graph Signals With Pre-Selected Vertices via DC Optimization","authors":"Keitaro Yamashita;Kazuki Naganuma;Shunsuke Ono","doi":"10.1109/OJSP.2026.3664335","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3664335","url":null,"abstract":"This paper proposes a method for vertex-wise aggregation sampling of a broad class of graph signals, designed to attain the best possible recovery based on the generalized sampling theory. This is achieved by designing a sampling operator by an optimization problem, which is inherently non-convex, as the best possible recovery imposes a rank constraint. An existing method for vertex-wise aggregation sampling is able to control the number of active vertices but cannot incorporate prior knowledge of mandatory or avoided vertices. To address these challenges, we formulate the operator design as a problem that handles a constraint on the number of active vertices and prior knowledge on specific vertices for sampling, mandatory inclusion or exclusion. We transformed this constrained problem into a difference-of-convex (DC) optimization problem by using the nuclear norm and a DC penalty for vertex selection. To solve this, we develop a convergent solver based on the general double-proximal gradient DC algorithm. The effectiveness of our method is demonstrated through experiments on various graph signal models, including real-world data, showing superior performance in the recovery accuracy compared to existing methods.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"314-323"},"PeriodicalIF":2.7,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11395613","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147299610","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Alternatives to Sine Carrier in Auditory BCI: Exploring Machine Learning Strategies for Assessing Modulation Detectability in EEG 听觉脑机接口中正弦载波的替代方案:探索脑电调制可检测性评估的机器学习策略
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-02-13 DOI: 10.1109/OJSP.2026.3664271
Lenaïg Guého;Henrique Lefundes da Silva;Cyril Plapous;Laurent Bougrain;Patrick Hénaff;Rozenn Nicol
In this paper, the use of non-sinusoidal amplitude-modulated stimuli is assessed for Brain-Computer Interfaces (BCIs) based on Steady-State Auditory Evoked Potentials (SSAEPs). Three different stimuli are compared to the frequently used 1-kHz pure tone: Brownian noise, cicada song and cat's purr. While these alternative sounds are intended to be more pleasant for listeners, they may impact the detectability of the modulation frequency in ElectroEncephaloGraphic (EEG) signals. Stimuli are equalized in loudness using an Head And Torso Simulator (HATS). The experiment is conducted at two loudness levels (50 and 56 phons), with 24 subjects participating in each condition. Hearing capacity is assessed prior to the experiment, using an audiometry test and questionnaires. For each stimulus, detection is performed by using 10 different classifiers: a linear discriminant analysis, deep learning networks and Riemannian classifiers including tangent space-based algorithms. These latter consistently outperformed alternative approaches. Pure tones provide the highest accuracy of detection (above 83%), whereas cicada song only achieve 60%. Classification using the proposed models fails for Brownian noise and cat's purr, with accuracy at a chance level. Additionally, increasing the loudness of the stimuli does not enhance the detectability of the modulation frequency for any stimulus. Amplitude modulation, frequency content and temporal characteristics of stimuli are further analyzed for explanation. These findings provide practical recommendations for auditory BCI classification and audio stimuli design.
在本文中,基于稳态听觉诱发电位(SSAEPs)评估了非正弦调幅刺激在脑机接口(bci)中的应用。将三种不同的刺激与常用的1 khz纯音进行比较:布朗噪声、蝉鸣和猫的咕噜声。虽然这些替代声音旨在让听者更愉快,但它们可能会影响脑电图(EEG)信号中调制频率的可探测性。使用头部和躯干模拟器(HATS)来平衡刺激的响度。实验在两个响度水平(50和56个声部)下进行,每个条件下有24名受试者参与。听力能力在实验前进行评估,使用听力测试和问卷调查。对于每个刺激,使用10种不同的分类器进行检测:线性判别分析、深度学习网络和黎曼分类器,包括基于切线空间的算法。后者的表现一直优于其他方法。纯音的准确率最高(83%以上),而蝉鸣的准确率只有60%。使用所提出的模型对布朗噪声和猫的咕噜声进行分类是失败的,准确率在偶然的水平上。此外,增加刺激的响度并不能提高任何刺激调制频率的可探测性。进一步分析了刺激的幅度调制、频率内容和时间特征。这些发现为听觉脑接口分类和听觉刺激设计提供了实用的建议。
{"title":"Alternatives to Sine Carrier in Auditory BCI: Exploring Machine Learning Strategies for Assessing Modulation Detectability in EEG","authors":"Lenaïg Guého;Henrique Lefundes da Silva;Cyril Plapous;Laurent Bougrain;Patrick Hénaff;Rozenn Nicol","doi":"10.1109/OJSP.2026.3664271","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3664271","url":null,"abstract":"In this paper, the use of non-sinusoidal amplitude-modulated stimuli is assessed for Brain-Computer Interfaces (BCIs) based on Steady-State Auditory Evoked Potentials (SSAEPs). Three different stimuli are compared to the frequently used 1-kHz pure tone: Brownian noise, cicada song and cat's purr. While these alternative sounds are intended to be more pleasant for listeners, they may impact the detectability of the modulation frequency in ElectroEncephaloGraphic (EEG) signals. Stimuli are equalized in loudness using an Head And Torso Simulator (HATS). The experiment is conducted at two loudness levels (50 and 56 phons), with 24 subjects participating in each condition. Hearing capacity is assessed prior to the experiment, using an audiometry test and questionnaires. For each stimulus, detection is performed by using 10 different classifiers: a linear discriminant analysis, deep learning networks and Riemannian classifiers including tangent space-based algorithms. These latter consistently outperformed alternative approaches. Pure tones provide the highest accuracy of detection (above 83%), whereas cicada song only achieve 60%. Classification using the proposed models fails for Brownian noise and cat's purr, with accuracy at a chance level. Additionally, increasing the loudness of the stimuli does not enhance the detectability of the modulation frequency for any stimulus. Amplitude modulation, frequency content and temporal characteristics of stimuli are further analyzed for explanation. These findings provide practical recommendations for auditory BCI classification and audio stimuli design.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"324-332"},"PeriodicalIF":2.7,"publicationDate":"2026-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11395630","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147299575","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EcDiff-LLIE: Event-Conditional Diffusion Model for Structure-Preserving Low-Light Image Enhancement EcDiff-LLIE:保结构弱光图像增强的事件条件扩散模型
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-02-06 DOI: 10.1109/OJSP.2026.3662627
Ramna Maqsood;Paulo Nunes;Luís Ducla Soares;Caroline Conti
Low-light image enhancement (LLIE) aims to restore the visual quality of poorly illuminated images by recovering fine details and textures while suppressing noise and artifacts. Recently, diffusion models have shown superior generative capabilities for LLIE. However, existing diffusion-based methods condition the denoising process only on low-light images or features derived from them (e.g., structural or illumination maps). Since the low-light images are severely degraded, this limits the denoising model’s ability to restore fine structure and reduce artifacts. In this work, we show that the event data captured simultaneously with the low-light images provides complementary high-dynamic-range and high-temporal-resolution structural information that can overcome this limitation. Therefore, we propose EcDiff-LLIE, a novel event-conditional diffusion framework for LLIE. At its core, we introduce a multimodality denoising network that conditions on both low-light images and concurrent event streams. To effectively fuse the two modalities, we design a cross-modality attention block that bridge their domain differences, while also enabling long-range dependency modeling for improved structural preservation. Experiments on the synthetic SDSD and real-world SDE datasets show significant improvements in quantitative evaluation metrics. Furthermore, evaluation on the high-resolution real-world HUE dataset further shows the generalization ability of the proposed framework.
低光图像增强(LLIE)旨在通过恢复精细细节和纹理,同时抑制噪声和伪影,恢复光照不足图像的视觉质量。近年来,扩散模型对LLIE表现出了优越的生成能力。然而,现有的基于扩散的方法仅对低光图像或从中衍生的特征(例如,结构或照明地图)进行去噪处理。由于弱光图像严重退化,这限制了去噪模型恢复精细结构和减少伪影的能力。在这项工作中,我们表明,与低光图像同时捕获的事件数据提供了互补的高动态范围和高时间分辨率的结构信息,可以克服这一限制。因此,我们提出了一种新的LLIE事件条件扩散框架ecff -LLIE。在其核心,我们引入了一个多模态去噪网络,该网络同时适用于低光图像和并发事件流。为了有效地融合这两种模式,我们设计了一个跨模式的注意力块,以弥合它们的领域差异,同时还支持远程依赖建模,以改进结构保存。在合成SDSD和真实SDE数据集上进行的实验表明,该方法在定量评价指标方面有显著改善。此外,对高分辨率真实HUE数据集的评估进一步表明了该框架的泛化能力。
{"title":"EcDiff-LLIE: Event-Conditional Diffusion Model for Structure-Preserving Low-Light Image Enhancement","authors":"Ramna Maqsood;Paulo Nunes;Luís Ducla Soares;Caroline Conti","doi":"10.1109/OJSP.2026.3662627","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3662627","url":null,"abstract":"Low-light image enhancement (LLIE) aims to restore the visual quality of poorly illuminated images by recovering fine details and textures while suppressing noise and artifacts. Recently, diffusion models have shown superior generative capabilities for LLIE. However, existing diffusion-based methods condition the denoising process only on low-light images or features derived from them (e.g., structural or illumination maps). Since the low-light images are severely degraded, this limits the denoising model’s ability to restore fine structure and reduce artifacts. In this work, we show that the event data captured simultaneously with the low-light images provides complementary high-dynamic-range and high-temporal-resolution structural information that can overcome this limitation. Therefore, we propose EcDiff-LLIE, a novel event-conditional diffusion framework for LLIE. At its core, we introduce a multimodality denoising network that conditions on both low-light images and concurrent event streams. To effectively fuse the two modalities, we design a cross-modality attention block that bridge their domain differences, while also enabling long-range dependency modeling for improved structural preservation. Experiments on the synthetic SDSD and real-world SDE datasets show significant improvements in quantitative evaluation metrics. Furthermore, evaluation on the high-resolution real-world HUE dataset further shows the generalization ability of the proposed framework.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"266-275"},"PeriodicalIF":2.7,"publicationDate":"2026-02-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11373830","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223561","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VCP: Visible Context Propagation for Electrocardiogram Recovery VCP:心电图恢复的可见上下文传播
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-28 DOI: 10.1109/OJSP.2026.3657696
Jaeho Park;Yong-Yeon Jo;Jong-Hwan Jang;Jin Yu;Joon-myoung Kwon;Junho Song
Electrocardiograms (ECGs) remain widely archived as paper ECG charts. In the 12-lead paper ECG chart layout, each lead shows only 2.5-second visible segments. Therefore, digitized charts are incomplete, leaving most of the 10-second recording invisible and misaligned with the digital standard required by ECG-AI models. Previous work has attempted to recover these invisible segments but has shown markedly lower performance than visible segments. We propose the Visible Context Propagation (VCP) architecture, an extension of ECGrecover, which leverages the quasi-periodic structure of ECGs and employs cross-attention to propagate contextual information from visible to invisible segments. Our model consistently outperformed ECGrecover, the strongest baseline, reducing RMSE by 32.4% overall, including 12.0% on invisible segments. Beyond recovery accuracy, evaluations on ECG applications demonstrated that recovered ECGs achieved performance comparable to raw ECGs in both diagnostic classification and ECG feature measurement. These results highlight the effectiveness of explicitly modeling the propagation of visible-to-invisible context and establish VCP as a robust solution for recovering incomplete paper-based ECGs, enabling reliable surrogates for clinical and analytical use.
心电图(ECGs)仍以纸质心电图的形式被广泛存档。在12导联的纸质心电图布局中,每个导联只能显示2.5秒的可见片段。因此,数字化图表是不完整的,使得大多数10秒的记录不可见,并且与ECG-AI模型所需的数字标准不一致。以前的工作试图恢复这些不可见的段,但表现出明显低于可见段的性能。我们提出了可视上下文传播(VCP)架构,这是ECGrecover的扩展,它利用了ecg的准周期结构,并采用交叉注意将上下文信息从可见片段传播到不可见片段。我们的模型始终优于最强基线ECGrecover,总体上RMSE降低了32.4%,其中不可见部分降低了12.0%。除了恢复的准确性之外,对心电图应用的评估表明,恢复后的心电图在诊断分类和心电图特征测量方面的性能与原始心电图相当。这些结果强调了明确建模可见到不可见上下文传播的有效性,并将VCP建立为恢复不完整纸质心电图的稳健解决方案,为临床和分析使用提供可靠的替代品。
{"title":"VCP: Visible Context Propagation for Electrocardiogram Recovery","authors":"Jaeho Park;Yong-Yeon Jo;Jong-Hwan Jang;Jin Yu;Joon-myoung Kwon;Junho Song","doi":"10.1109/OJSP.2026.3657696","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3657696","url":null,"abstract":"Electrocardiograms (ECGs) remain widely archived as paper ECG charts. In the 12-lead paper ECG chart layout, each lead shows only 2.5-second visible segments. Therefore, digitized charts are incomplete, leaving most of the 10-second recording invisible and misaligned with the digital standard required by ECG-AI models. Previous work has attempted to recover these invisible segments but has shown markedly lower performance than visible segments. We propose the <bold>Visible Context Propagation (VCP)</b> architecture, an extension of ECGrecover, which leverages the quasi-periodic structure of ECGs and employs cross-attention to propagate contextual information from visible to invisible segments. Our model consistently outperformed ECGrecover, the strongest baseline, reducing RMSE by 32.4% overall, including 12.0% on invisible segments. Beyond recovery accuracy, evaluations on ECG applications demonstrated that recovered ECGs achieved performance comparable to raw ECGs in both diagnostic classification and ECG feature measurement. These results highlight the effectiveness of explicitly modeling the propagation of visible-to-invisible context and establish VCP as a robust solution for recovering incomplete paper-based ECGs, enabling reliable surrogates for clinical and analytical use.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"185-194"},"PeriodicalIF":2.7,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11363447","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Self-Labeling Sounds Using Optimal Transport 使用最佳传输的自标签声音
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-28 DOI: 10.1109/OJSP.2026.3659053
Manu Harju;Frederic Font;Annamaria Mesaros
Self-labeling is a method to simultaneously learn representations and classes using unlabeled data. The naive approach to self-labeling leads to a degenerate solution, and the model-generated labels require regularization to serve as useful training targets. In this work, we adapt a self-labeling method using optimal transport to the audio domain using the FSD50K dataset. We analyze the structure of the learned representations and compare the emergent classes with the reference annotations. We compare the learned representations with the ones produced using Bootstrap Your Own Latent for Audio (BYOL-A) across several downstream tasks. Our findings indicate that the method learns to group perceptually similar sounds without supervision. The results show that the method is a viable approach for audio representation learning, and that the learned embeddings are as effective for downstream tasks as the ones obtained with the benchmark method. As an additional outcome, the generated classifications give valuable insight into what the model learns, promoting explainability in feature learning.
自标记是一种使用未标记数据同时学习表示和类的方法。朴素的自标记方法会导致退化的解决方案,并且模型生成的标签需要正则化才能作为有用的训练目标。在这项工作中,我们使用FSD50K数据集将一种使用最佳传输的自标记方法应用到音频域。我们分析了学习到的表示的结构,并将紧急类与参考注释进行了比较。我们将学习到的表示与使用Bootstrap Your Own Latent for Audio (BYOL-A)在几个下游任务中产生的表示进行比较。我们的研究结果表明,这种方法可以在没有监督的情况下学习对感知相似的声音进行分组。结果表明,该方法是一种可行的音频表示学习方法,并且学习到的嵌入与使用基准方法获得的嵌入对下游任务同样有效。作为一个额外的结果,生成的分类对模型学习的内容提供了有价值的见解,促进了特征学习的可解释性。
{"title":"Self-Labeling Sounds Using Optimal Transport","authors":"Manu Harju;Frederic Font;Annamaria Mesaros","doi":"10.1109/OJSP.2026.3659053","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3659053","url":null,"abstract":"Self-labeling is a method to simultaneously learn representations and classes using unlabeled data. The naive approach to self-labeling leads to a degenerate solution, and the model-generated labels require regularization to serve as useful training targets. In this work, we adapt a self-labeling method using optimal transport to the audio domain using the FSD50K dataset. We analyze the structure of the learned representations and compare the emergent classes with the reference annotations. We compare the learned representations with the ones produced using Bootstrap Your Own Latent for Audio (BYOL-A) across several downstream tasks. Our findings indicate that the method learns to group perceptually similar sounds without supervision. The results show that the method is a viable approach for audio representation learning, and that the learned embeddings are as effective for downstream tasks as the ones obtained with the benchmark method. As an additional outcome, the generated classifications give valuable insight into what the model learns, promoting explainability in feature learning.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"116-124"},"PeriodicalIF":2.7,"publicationDate":"2026-01-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11366927","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175951","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
From Informed Independent Vector Extraction to Hybrid Architectures for Target Source Extraction 从知情独立矢量提取到目标源提取的混合体系结构
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-26 DOI: 10.1109/OJSP.2026.3657698
Zbyněk Koldovský;Jiří Málek;Martin Vrátný;Tereza Vrbová;Jaroslav Čmejla;Stephen O'Regan
This article revises informed independent vector extraction (iIVE) as a framework for connecting model-based blind source extraction (BSE) with deep learning. We introduce the contrast function for iIVE, which is derived by extending IVE with beamforming-based constraints, enabling an interpretable use of reference signals. We also show that structured mixing models implementing physical knowledge can be integrated, which is demonstrated by two far-field models. With the contrast functions, rapidly converging second-order algorithms are developed, whose performance is first verified through simulations. In the experimental part, we refine iIVE by training models containing unrolled iterations of the developed algorithm. The resulting structures achieve performance comparable to state-of-the-art networks while requiring two orders of magnitude fewer trainable parameters and exhibiting strong generalization to unseen conditions.
本文将知情独立向量提取(iIVE)修改为连接基于模型的盲源提取(BSE)与深度学习的框架。我们引入了iIVE的对比度函数,该函数是通过使用基于波束形成的约束对IVE进行扩展而得到的,从而可以解释参考信号的使用。我们还通过两个远场模型证明了实现物理知识的结构化混合模型是可以集成的。结合对比函数,提出了快速收敛的二阶算法,并通过仿真验证了算法的性能。在实验部分,我们通过训练包含所开发算法的展开迭代的模型来改进ive。所得到的结构实现了与最先进的网络相当的性能,同时需要的可训练参数减少了两个数量级,并且对未知条件表现出很强的泛化能力。
{"title":"From Informed Independent Vector Extraction to Hybrid Architectures for Target Source Extraction","authors":"Zbyněk Koldovský;Jiří Málek;Martin Vrátný;Tereza Vrbová;Jaroslav Čmejla;Stephen O'Regan","doi":"10.1109/OJSP.2026.3657698","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3657698","url":null,"abstract":"This article revises informed independent vector extraction (iIVE) as a framework for connecting model-based blind source extraction (BSE) with deep learning. We introduce the contrast function for iIVE, which is derived by extending IVE with beamforming-based constraints, enabling an interpretable use of reference signals. We also show that structured mixing models implementing physical knowledge can be integrated, which is demonstrated by two far-field models. With the contrast functions, rapidly converging second-order algorithms are developed, whose performance is first verified through simulations. In the experimental part, we refine iIVE by training models containing unrolled iterations of the developed algorithm. The resulting structures achieve performance comparable to state-of-the-art networks while requiring two orders of magnitude fewer trainable parameters and exhibiting strong generalization to unseen conditions.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"195-212"},"PeriodicalIF":2.7,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11363453","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146175985","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Better Naturalness Evaluation of TTS Systems TTS系统更好的自然度评价
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2026-01-23 DOI: 10.1109/OJSP.2026.3657142
Sajad Shirali-Shahreza;Gerald Penn
One of the main goals of Text-To-Speech systems is to generate natural speech. Therefore, a major evaluation criterion of TTS outputs is their naturalness, usually measured through a Mean Opinion Score (MOS). Naturalness is not a well-defined property, however. This paper decomposes naturalness into eight specific dimensions, based on how judges define the term. We then evaluate the outputs of the systems submitted to the Blizzard 2025 Challenge based on these new dimensions and compare the results with alternative evaluations of naturalness. This includes recent subjective human evaluations that were performed by the 2025 Blizzard Challenge organizers, as well as various automatic MOS methods. Based on this analysis, we propose to use five dimensions in place of a single, primitive notion of naturalness: Clarity, Fluency, Human-vs-Computer, Pronunciation, and Understandability. We propose that these would serve as the basis of a better evaluation framework for advanced TTS systems.
文本-语音转换系统的主要目标之一是生成自然语音。因此,TTS输出的一个主要评价标准是其自然度,通常通过平均意见得分(Mean Opinion Score, MOS)来衡量。然而,自然性并不是一个定义明确的属性。本文根据法官对自然性的定义,将自然性分解为八个具体维度。然后,我们根据这些新维度评估提交给暴雪2025挑战赛的系统的输出,并将结果与其他自然性评估进行比较。这包括最近由2025暴雪挑战赛组织者进行的主观人类评估,以及各种自动MOS方法。基于这一分析,我们建议使用五个维度来代替单一的、原始的自然性概念:清晰性、流畅性、人机、发音和可理解性。我们建议这些将作为先进TTS系统更好的评估框架的基础。
{"title":"Better Naturalness Evaluation of TTS Systems","authors":"Sajad Shirali-Shahreza;Gerald Penn","doi":"10.1109/OJSP.2026.3657142","DOIUrl":"https://doi.org/10.1109/OJSP.2026.3657142","url":null,"abstract":"One of the main goals of Text-To-Speech systems is to generate natural speech. Therefore, a major evaluation criterion of TTS outputs is their naturalness, usually measured through a Mean Opinion Score (MOS). Naturalness is not a well-defined property, however. This paper decomposes naturalness into eight specific dimensions, based on how judges define the term. We then evaluate the outputs of the systems submitted to the Blizzard 2025 Challenge based on these new dimensions and compare the results with alternative evaluations of naturalness. This includes recent subjective human evaluations that were performed by the 2025 Blizzard Challenge organizers, as well as various automatic MOS methods. Based on this analysis, we propose to use five dimensions in place of a single, primitive notion of naturalness: Clarity, Fluency, Human-vs-Computer, Pronunciation, and Understandability. We propose that these would serve as the basis of a better evaluation framework for advanced TTS systems.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"7 ","pages":"296-304"},"PeriodicalIF":2.7,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11362964","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146223698","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of signal processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1