首页 > 最新文献

IEEE open journal of signal processing最新文献

英文 中文
Constrained Cramér-Rao Bound for Higher-Order Singular Value Decomposition 高阶奇异值分解的约束cram<s:1> - rao界
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-09-08 DOI: 10.1109/OJSP.2025.3607278
Metin Calis;Massimo Mischi;Alle-Jan van der Veen;Raj Thilak Rajan;Borbàla Hunyadi
Tensor decomposition methods for signal processing applications are an active area of research. Real data are often low-rank, noisy, and come in a higher-order format. As such, low-rank tensor approximation methods that account for the high-order structure of the data are often used for denoising. One way to represent a tensor in a low-rank form is to decompose the tensor into a set of orthonormal factor matrices and an all-orthogonal core tensor using a higher-order singular value decomposition. Under noisy measurements, the lower bound for recovering the factor matrices and the core tensor is unknown. In this paper, we exploit the well-studied constrained Cramér-Rao bound to calculate a lower bound on the mean squared error of the unbiased estimates of the components of the multilinear singular value decomposition under additive white Gaussian noise, and we validate our approach through simulations.
张量分解方法在信号处理中的应用是一个活跃的研究领域。真实数据通常是低秩的、有噪声的,并且以高阶格式出现。因此,考虑到数据的高阶结构的低秩张量近似方法通常用于去噪。以低秩形式表示张量的一种方法是使用高阶奇异值分解将张量分解为一组标准正交因子矩阵和一个全正交核心张量。在噪声测量下,恢复因子矩阵和核心张量的下界是未知的。本文利用已被广泛研究的约束cram r- rao界,计算了加性高斯白噪声下多元线性奇异值分解分量的无偏估计均方误差的下界,并通过仿真验证了该方法。
{"title":"Constrained Cramér-Rao Bound for Higher-Order Singular Value Decomposition","authors":"Metin Calis;Massimo Mischi;Alle-Jan van der Veen;Raj Thilak Rajan;Borbàla Hunyadi","doi":"10.1109/OJSP.2025.3607278","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3607278","url":null,"abstract":"Tensor decomposition methods for signal processing applications are an active area of research. Real data are often low-rank, noisy, and come in a higher-order format. As such, low-rank tensor approximation methods that account for the high-order structure of the data are often used for denoising. One way to represent a tensor in a low-rank form is to decompose the tensor into a set of orthonormal factor matrices and an all-orthogonal core tensor using a higher-order singular value decomposition. Under noisy measurements, the lower bound for recovering the factor matrices and the core tensor is unknown. In this paper, we exploit the well-studied constrained Cramér-Rao bound to calculate a lower bound on the mean squared error of the unbiased estimates of the components of the multilinear singular value decomposition under additive white Gaussian noise, and we validate our approach through simulations.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1048-1055"},"PeriodicalIF":2.7,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11153050","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145090152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gaussian Filtering Using a Spherical-Radial Double Exponential Cubature 利用球-径向双指数模型进行高斯滤波
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-28 DOI: 10.1109/OJSP.2025.3604381
Quade Butler;Youssef Ziada;S. Andrew Gadsden
Gaussian filters use quadrature rules or cubature rules to recursively solve Gaussian-weighted integrals. Classical and contemporary methods use stable rules with a minimal number of cubature points to achieve the highest accuracy. Gaussian quadrature is widely believed to be optimal due to its polynomial degree of exactness and higher degree cubature methods often require complex optimization to solve moment equations. In this paper, Gaussian-weighted integrals and Gaussian filtering are approached using a double exponential (DE) transformation and the trapezoidal rule. The DE rule is principled in high rates of convergence for certain integrands and the DE transform ensures that the trapezoidal rule maximizes its performance. A novel spherical-radial cubature rule is derived for Gaussian-weighted integrals where it is shown to be perfectly stable and highly efficient. A new Gaussian filter is then built on top of this cubature rule. The filter is shown to be stable with bounded estimation error. The effect of varying the number of cubature points on filter stability and convergence is also examined. The advantages of the DE method over comparable Gaussian filters and their cubature methods are outlined. These advantages are realized in two numerical examples: a challenging non-polynomial integral and a benchmark filtering problem. The results show that simple and fundamental cubature methods can lead to great improvements in performance when applied correctly.
高斯滤波器使用正交规则或培养规则递归地求解高斯加权积分。经典和现代的方法使用稳定的规则与最小数量的培养点,以达到最高的精度。高斯正交由于其多项式精度而被广泛认为是最优的,而更高次的培养方法往往需要复杂的优化来求解力矩方程。本文利用双指数变换和梯形规则研究高斯加权积分和高斯滤波。DE规则在某些积分的高收敛率方面是原则性的,并且DE变换确保梯形规则最大化其性能。导出了一种新的球-径向定殖规则,并证明了该规则是完全稳定和高效的。然后在这个培养规则的基础上建立一个新的高斯滤波器。该滤波器在估计误差有界的情况下是稳定的。研究了不同培养点个数对滤波器稳定性和收敛性的影响。概述了DE方法相对于可比较的高斯滤波器及其培养方法的优点。这些优点在两个数值例子中得到了体现:一个具有挑战性的非多项式积分和一个基准滤波问题。结果表明,简单而基本的培养方法在正确应用的情况下可以大大提高性能。
{"title":"Gaussian Filtering Using a Spherical-Radial Double Exponential Cubature","authors":"Quade Butler;Youssef Ziada;S. Andrew Gadsden","doi":"10.1109/OJSP.2025.3604381","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3604381","url":null,"abstract":"Gaussian filters use quadrature rules or cubature rules to recursively solve Gaussian-weighted integrals. Classical and contemporary methods use stable rules with a minimal number of cubature points to achieve the highest accuracy. Gaussian quadrature is widely believed to be optimal due to its polynomial degree of exactness and higher degree cubature methods often require complex optimization to solve moment equations. In this paper, Gaussian-weighted integrals and Gaussian filtering are approached using a double exponential (DE) transformation and the <italic>trapezoidal rule</i>. The DE rule is principled in high rates of convergence for certain integrands and the DE transform ensures that the trapezoidal rule maximizes its performance. A novel spherical-radial cubature rule is derived for Gaussian-weighted integrals where it is shown to be perfectly stable and highly efficient. A new Gaussian filter is then built on top of this cubature rule. The filter is shown to be stable with bounded estimation error. The effect of varying the number of cubature points on filter stability and convergence is also examined. The advantages of the DE method over comparable Gaussian filters and their cubature methods are outlined. These advantages are realized in two numerical examples: a challenging non-polynomial integral and a benchmark filtering problem. The results show that simple and fundamental cubature methods can lead to great improvements in performance when applied correctly.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1056-1076"},"PeriodicalIF":2.7,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11144509","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Test-Time Cost-and-Quality Controllable Arbitrary-Scale Super-Resolution With Variable Fourier Components 可变傅立叶分量的测试时间成本和质量可控任意尺度超分辨率
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-25 DOI: 10.1109/OJSP.2025.3602742
Kazutoshi Akita;Norimichi Ukita
Super-resolution (SR) with arbitrary scale factor and cost-and-quality controllability at test time is essential for various applications. While several arbitrary-scale SR methods have been proposed, these methods require us to modify the model structure and retrain it to control the computational cost and SR quality. To address this limitation, we propose a novel SR method using a Recurrent Neural Network (RNN) with the Fourier representation. In our method, the RNN sequentially estimates Fourier components, each consisting of frequency and amplitude, and aggregates these components to reconstruct an SR image. Since the RNN can adjust the number of recurrences at test time, we can control the computational cost and SR quality in a single model: fewer recurrences (i.e., fewer Fourier components) lead to lower cost but lower quality, while more recurrences (i.e., more Fourier components) lead to better quality but more cost. Experimental results prove that more Fourier components improve the PSNR score. Furthermore, even with fewer Fourier components, our method achieves a lower PSNR drop than other state-of-the-art arbitrary-scale SR methods.
在各种应用中,具有任意比例因子和测试时成本和质量可控性的超分辨率(SR)是必不可少的。虽然已经提出了几种任意尺度的SR方法,但这些方法需要我们修改模型结构并对其进行重新训练,以控制计算成本和SR质量。为了解决这一限制,我们提出了一种使用傅里叶表示的递归神经网络(RNN)的新颖SR方法。在我们的方法中,RNN依次估计傅里叶分量,每个分量由频率和幅度组成,并将这些分量聚合以重建SR图像。由于RNN可以在测试时调整递归的数量,我们可以控制单个模型的计算成本和SR质量:更少的递归(即更少的傅里叶分量)导致更低的成本但更低的质量,而更多的递归(即更多的傅里叶分量)导致更好的质量但更多的成本。实验结果表明,增加傅里叶分量可以提高PSNR分数。此外,即使使用较少的傅里叶分量,我们的方法也比其他最先进的任意尺度SR方法实现了更低的PSNR下降。
{"title":"Test-Time Cost-and-Quality Controllable Arbitrary-Scale Super-Resolution With Variable Fourier Components","authors":"Kazutoshi Akita;Norimichi Ukita","doi":"10.1109/OJSP.2025.3602742","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3602742","url":null,"abstract":"Super-resolution (SR) with arbitrary scale factor and cost-and-quality controllability at test time is essential for various applications. While several arbitrary-scale SR methods have been proposed, these methods require us to modify the model structure and retrain it to control the computational cost and SR quality. To address this limitation, we propose a novel SR method using a Recurrent Neural Network (RNN) with the Fourier representation. In our method, the RNN sequentially estimates Fourier components, each consisting of frequency and amplitude, and aggregates these components to reconstruct an SR image. Since the RNN can adjust the number of recurrences at test time, we can control the computational cost and SR quality in a single model: fewer recurrences (i.e., fewer Fourier components) lead to lower cost but lower quality, while more recurrences (i.e., more Fourier components) lead to better quality but more cost. Experimental results prove that more Fourier components improve the PSNR score. Furthermore, even with fewer Fourier components, our method achieves a lower PSNR drop than other state-of-the-art arbitrary-scale SR methods.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1017-1030"},"PeriodicalIF":2.7,"publicationDate":"2025-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11141341","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145036198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparsity Apprised Logarithmic Hyperbolic Tan Adaptive Filters for Nonlinear System Identification and Acoustic Feedback Cancellation 稀疏度通知对数双曲Tan自适应滤波器用于非线性系统辨识和声反馈抵消
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-20 DOI: 10.1109/OJSP.2025.3600904
Neetu Chikyal;Vasundhara;Chayan Bhar;Asutosh Kar;Mads Græsbøll Christensen
Recently, various robust algorithms based on hyperbolic cosine and sine functions, such as hyperbolic cosine (HCAF), exponential hyperbolic cosine, joint logarithmic hyperbolic cosine adaptive filter, etc., have been predominantly employed for different aspects of adaptive filtering, including nonlinear-system-identification. Further, in this manuscript, an attempt is made to elevate the performance of nonlinear system identification in the wake of impulsive noise interference along with consideration of a sparse environment. Henceforth, in lieu of this, the present paper introduces a new sparsity-apprised logarithmic hyperbolic tan adaptive filter (SA-LHTAF) to handle impulsive noise while dealing with sparse systems. It utilizes a $l_{1}$ norm-related sparsity penalty factor in the robust cost function constructed with a logarithmic hyperbolic tangent function. Further, an improved SA-LHTAF (ISA-LHTAF) is introduced for varying sparsity or moderately sparse systems employing the log sum penalty factor in the proposed technique. The weight update for the proposed technique has been derived from the modified cost function. In addition, the conditions for the upper bound on the convergence factor have been derived. The efficacy of the developed robust techniques is demonstrated for identifying nonlinear systems along with feedback paths of behind-the-ear (BTE) hearing aid. In addition, the proposed techniques are evaluated for training an acoustic feedback canceller for hearing aids.
近年来,各种基于双曲余弦和正弦函数的鲁棒算法,如双曲余弦(HCAF)、指数双曲余弦、联合对数双曲余弦自适应滤波器等,已主要用于自适应滤波的各个方面,包括非线性系统辨识。此外,在本文中,尝试在考虑稀疏环境的情况下,提高脉冲噪声干扰后非线性系统识别的性能。因此,本文引入了一种新的稀疏性通知对数双曲tan自适应滤波器(SA-LHTAF)来处理稀疏系统中的脉冲噪声。它在由对数双曲正切函数构造的鲁棒代价函数中利用了一个$l_{1}$规范相关的稀疏惩罚因子。此外,在提出的技术中,采用对数和惩罚因子,引入了一种改进的SA-LHTAF (ISA-LHTAF),用于变稀疏或中等稀疏的系统。该方法的权值更新由修正后的代价函数推导而来。此外,还导出了收敛因子上界的条件。所开发的鲁棒技术在识别非线性系统以及耳后助听器(BTE)反馈路径方面的有效性得到了证明。此外,所提出的技术被评估用于训练助听器的声反馈消除器。
{"title":"Sparsity Apprised Logarithmic Hyperbolic Tan Adaptive Filters for Nonlinear System Identification and Acoustic Feedback Cancellation","authors":"Neetu Chikyal;Vasundhara;Chayan Bhar;Asutosh Kar;Mads Græsbøll Christensen","doi":"10.1109/OJSP.2025.3600904","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3600904","url":null,"abstract":"Recently, various robust algorithms based on hyperbolic cosine and sine functions, such as hyperbolic cosine (HCAF), exponential hyperbolic cosine, joint logarithmic hyperbolic cosine adaptive filter, etc., have been predominantly employed for different aspects of adaptive filtering, including nonlinear-system-identification. Further, in this manuscript, an attempt is made to elevate the performance of nonlinear system identification in the wake of impulsive noise interference along with consideration of a sparse environment. Henceforth, in lieu of this, the present paper introduces a new sparsity-apprised logarithmic hyperbolic tan adaptive filter (SA-LHTAF) to handle impulsive noise while dealing with sparse systems. It utilizes a <inline-formula><tex-math>$l_{1}$</tex-math></inline-formula> norm-related sparsity penalty factor in the robust cost function constructed with a logarithmic hyperbolic tangent function. Further, an improved SA-LHTAF (ISA-LHTAF) is introduced for varying sparsity or moderately sparse systems employing the log sum penalty factor in the proposed technique. The weight update for the proposed technique has been derived from the modified cost function. In addition, the conditions for the upper bound on the convergence factor have been derived. The efficacy of the developed robust techniques is demonstrated for identifying nonlinear systems along with feedback paths of behind-the-ear (BTE) hearing aid. In addition, the proposed techniques are evaluated for training an acoustic feedback canceller for hearing aids.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1031-1047"},"PeriodicalIF":2.7,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11130900","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145090171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ULDepth: Transform Self-Supervised Depth Estimation to Unpaired Multi-Domain Learning ULDepth:将自监督深度估计转化为非配对多域学习
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-11 DOI: 10.1109/OJSP.2025.3597873
Phan Thi Huyen Thanh;Trung Thai Tran;The Hiep Nguyen;Minh Huy Vu Nguyen;Tran Vu Pham;Truong Vinh Truong Duy;Duc Dung Nguyen
This paper introduces a general plug-in framework designed to enhance the robustness and cross-domain generalization of self-supervised depth estimation models. Current models often struggle with real-world deployment due to their limited ability to generalize across diverse domains, such as varying lighting and weather conditions. Single-domain models are optimized for specific scenarios while existing multi-domain approaches typically rely on paired images, which are rarely available in real-world datasets. Our framework addresses these limitations by training directly on unpaired real images from multiple domains. Daytime images serve as a reference to guide the model in learning consistent depth distributions across these diverse domains through adversarial training, eliminating the need for paired images. To refine regions prone to artifacts, we augment the discriminator with positional encoding, which is combined with the predicted depth maps. We also incorporate a dynamic normalization mechanism to capture shared depth features across domains, removing the requirement for separate domain-specific encoders. Furthermore, we introduce a new benchmark designed for a more comprehensive evaluation, encompassing previously unaddressed real-world scenarios. By focusing on unpaired real data, our framework significantly improves the generalization capabilities of existing models, enabling them to better adapt to the complexities and authentic data encountered in real-world environments.
本文介绍了一个通用的插件框架,旨在提高自监督深度估计模型的鲁棒性和跨域泛化。当前的模型由于其在不同领域(如不同的照明和天气条件)的泛化能力有限,常常难以在现实世界中部署。单域模型针对特定场景进行了优化,而现有的多域方法通常依赖于配对图像,这在现实世界的数据集中很少可用。我们的框架通过直接训练来自多个域的未配对的真实图像来解决这些限制。白天的图像可以作为参考,指导模型通过对抗性训练学习这些不同领域的一致深度分布,从而消除对成对图像的需求。为了细化容易产生伪影的区域,我们使用位置编码增强鉴别器,该编码与预测的深度图相结合。我们还结合了一个动态规范化机制来捕获跨域的共享深度特征,从而消除了对单独的特定于域的编码器的需求。此外,我们引入了一个新的基准,用于更全面的评估,包括以前未解决的现实世界场景。通过关注未配对的真实数据,我们的框架显著提高了现有模型的泛化能力,使它们能够更好地适应现实环境中遇到的复杂性和真实数据。
{"title":"ULDepth: Transform Self-Supervised Depth Estimation to Unpaired Multi-Domain Learning","authors":"Phan Thi Huyen Thanh;Trung Thai Tran;The Hiep Nguyen;Minh Huy Vu Nguyen;Tran Vu Pham;Truong Vinh Truong Duy;Duc Dung Nguyen","doi":"10.1109/OJSP.2025.3597873","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3597873","url":null,"abstract":"This paper introduces a general plug-in framework designed to enhance the robustness and cross-domain generalization of self-supervised depth estimation models. Current models often struggle with real-world deployment due to their limited ability to generalize across diverse domains, such as varying lighting and weather conditions. Single-domain models are optimized for specific scenarios while existing multi-domain approaches typically rely on paired images, which are rarely available in real-world datasets. Our framework addresses these limitations by training directly on unpaired real images from multiple domains. Daytime images serve as a reference to guide the model in learning consistent depth distributions across these diverse domains through adversarial training, eliminating the need for paired images. To refine regions prone to artifacts, we augment the discriminator with positional encoding, which is combined with the predicted depth maps. We also incorporate a dynamic normalization mechanism to capture shared depth features across domains, removing the requirement for separate domain-specific encoders. Furthermore, we introduce a new benchmark designed for a more comprehensive evaluation, encompassing previously unaddressed real-world scenarios. By focusing on unpaired real data, our framework significantly improves the generalization capabilities of existing models, enabling them to better adapt to the complexities and authentic data encountered in real-world environments.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1004-1016"},"PeriodicalIF":2.7,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11122640","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144928877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking Diffusion Annealing-Based Bayesian Inverse Problem Solvers 基于基准扩散退火的贝叶斯反问题求解方法
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-11 DOI: 10.1109/OJSP.2025.3597867
Evan Scope Crafts;Umberto Villa
In recent years, the ascendance of diffusion modeling as a state-of-the-art generative modeling approach has spurred significant interest in their use as priors in Bayesian inverse problems. However, it is unclear how to optimally integrate a diffusion model trained on the prior distribution with a given likelihood function to obtain posterior samples. While algorithms developed for this purpose can produce high-quality, diverse point estimates of the unknown parameters of interest, they are often tested on problems where the prior distribution is analytically unknown, making it difficult to assess their performance in providing rigorous uncertainty quantification. Motivated by this challenge, this work introduces three benchmark problems for evaluating the performance of diffusion model based samplers. The benchmark problems, which are inspired by problems in image inpainting, x-ray tomography, and phase retrieval, have a posterior density that is analytically known. In this setting, approximate ground-truth posterior samples can be obtained, enabling principled evaluation of the performance of posterior sampling algorithms. This work also introduces a general framework for diffusion model based posterior sampling, Bayesian Inverse Problem Solvers through Diffusion Annealing (BIPSDA). This framework unifies several recently proposed diffusion-model-based posterior sampling algorithms and contains novel algorithms that can be realized through flexible combinations of design choices. We tested the performance of a set of BIPSDA algorithms, including previously proposed state-of-the-art approaches, on the proposed benchmark problems. The results provide insight into the strengths and limitations of existing diffusion-model based posterior samplers, while the benchmark problems provide a testing ground for future algorithmic developments.
近年来,扩散建模作为最先进的生成建模方法的优势激发了人们对将其用作贝叶斯反问题先验的极大兴趣。然而,目前尚不清楚如何将先验分布训练的扩散模型与给定的似然函数最佳地整合以获得后验样本。虽然为此目的开发的算法可以产生高质量的、不同的未知感兴趣参数的点估计,但它们经常在先验分布分析未知的问题上进行测试,这使得很难评估它们在提供严格的不确定性量化方面的性能。在这一挑战的激励下,本文引入了三个基准问题来评估基于扩散模型的采样器的性能。基准问题的灵感来自图像绘制、x射线断层扫描和相位检索中的问题,它们具有分析已知的后验密度。在这种情况下,可以获得近似的真值后验样本,从而可以对后验抽样算法的性能进行原则性评估。本工作还介绍了基于后验抽样的扩散模型的一般框架,即通过扩散退火的贝叶斯反问题求解器(BIPSDA)。该框架统一了最近提出的几种基于扩散模型的后验抽样算法,并包含可以通过灵活组合设计选择来实现的新算法。我们在提出的基准问题上测试了一组BIPSDA算法的性能,包括以前提出的最先进的方法。结果提供了对现有的基于扩散模型的后验采样器的优势和局限性的洞察,而基准问题为未来的算法发展提供了一个试验场。
{"title":"Benchmarking Diffusion Annealing-Based Bayesian Inverse Problem Solvers","authors":"Evan Scope Crafts;Umberto Villa","doi":"10.1109/OJSP.2025.3597867","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3597867","url":null,"abstract":"In recent years, the ascendance of diffusion modeling as a state-of-the-art generative modeling approach has spurred significant interest in their use as priors in Bayesian inverse problems. However, it is unclear how to optimally integrate a diffusion model trained on the prior distribution with a given likelihood function to obtain posterior samples. While algorithms developed for this purpose can produce high-quality, diverse point estimates of the unknown parameters of interest, they are often tested on problems where the prior distribution is analytically unknown, making it difficult to assess their performance in providing rigorous uncertainty quantification. Motivated by this challenge, this work introduces three benchmark problems for evaluating the performance of diffusion model based samplers. The benchmark problems, which are inspired by problems in image inpainting, x-ray tomography, and phase retrieval, have a posterior density that is analytically known. In this setting, approximate ground-truth posterior samples can be obtained, enabling principled evaluation of the performance of posterior sampling algorithms. This work also introduces a general framework for diffusion model based posterior sampling, Bayesian Inverse Problem Solvers through Diffusion Annealing (BIPSDA). This framework unifies several recently proposed diffusion-model-based posterior sampling algorithms and contains novel algorithms that can be realized through flexible combinations of design choices. We tested the performance of a set of BIPSDA algorithms, including previously proposed state-of-the-art approaches, on the proposed benchmark problems. The results provide insight into the strengths and limitations of existing diffusion-model based posterior samplers, while the benchmark problems provide a testing ground for future algorithmic developments.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"975-991"},"PeriodicalIF":2.7,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11122619","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144891005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Low-Complexity Peak-Power-Assisted Data-Aided Channel Estimation Scheme for MIMO-OFDM Wireless Systems MIMO-OFDM无线系统中一种新的低复杂度峰值功率辅助数据辅助信道估计方案
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-01 DOI: 10.1109/OJSP.2025.3595039
Inaamullah Khan;Mohammad Mahmudul Hasan;Michael Cheffena
Low-complexity channel estimation techniques are key to enabling efficient, reliable, and real-time communication in modern wireless devices operating under resource and energy constraints. This paper presents for the first time a low-complexity peak-power-assisted data-aided channel estimation (DACE) scheme for both single-input single-output (SISO) and multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) wireless systems. In OFDM, high peak-power levels occur when the subcarriers align in phase and constructively interfere with each other. The research proposes a peak-power-assisted channel estimation scheme that accurately selects peak-power carriers at the transmitter of an OFDM system and uses them as reliable carriers for the DACE scheme. By incorporating these reliable carriers with known pilot symbols as additional pilot signals, channel estimation accuracy significantly improves in MIMO-OFDM systems. This eliminates the need to determine reliable data symbols at the receiver, thereby significantly reducing the computational complexity of the system. However, high peak-powers are considered a major drawback in OFDM. In this work, we incorporate a companding technique to mitigate this issue and provide sufficient margin for the DACE scheme. The performance of the proposed DACE scheme is evaluated using both least square (LS) and linear minimum mean square error (LMMSE) channel estimators. In this regard, the proposed technique not only improves channel estimation accuracy but also enhances the spectral efficiency of the wireless system. It outperforms traditional channel estimators in terms of system mean square error (MSE) and bit-error-rate (BER) performance. It also reduces the pilot overhead by 50$%$ compared to traditional channel estimators and provides bandwidth optimization for MIMO-OFDM systems. This makes it a promising solution for enhancing the performance and efficiency of next-generation wireless communication systems across diverse applications.
低复杂度信道估计技术是在资源和能源有限的现代无线设备中实现高效、可靠和实时通信的关键。本文首次提出了一种适用于单输入单输出(SISO)和多输入多输出正交频分复用(MIMO-OFDM)无线系统的低复杂度峰值功率辅助数据辅助信道估计(DACE)方案。在OFDM中,当子载波相位排列并相互产生建设性干扰时,会出现高峰值功率水平。研究提出了一种峰值功率辅助信道估计方案,该方案可以准确地选择OFDM系统发射机处的峰值功率载波,并将其作为DACE方案的可靠载波。通过将这些具有已知导频符号的可靠载波作为附加导频信号,可以显著提高MIMO-OFDM系统的信道估计精度。这消除了在接收端确定可靠数据符号的需要,从而大大降低了系统的计算复杂性。然而,高峰值功率被认为是OFDM的一个主要缺点。在这项工作中,我们采用了一种扩展技术来缓解这个问题,并为DACE方案提供足够的余量。采用最小二乘(LS)和线性最小均方误差(LMMSE)信道估计器对所提DACE方案的性能进行了评估。因此,该技术不仅提高了信道估计精度,而且提高了无线系统的频谱效率。它在系统均方误差(MSE)和误码率(BER)性能方面优于传统的信道估计器。与传统的信道估计器相比,它还减少了50%的导频开销,并为MIMO-OFDM系统提供了带宽优化。这使得它成为一种有前途的解决方案,用于提高跨各种应用的下一代无线通信系统的性能和效率。
{"title":"A Novel Low-Complexity Peak-Power-Assisted Data-Aided Channel Estimation Scheme for MIMO-OFDM Wireless Systems","authors":"Inaamullah Khan;Mohammad Mahmudul Hasan;Michael Cheffena","doi":"10.1109/OJSP.2025.3595039","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3595039","url":null,"abstract":"Low-complexity channel estimation techniques are key to enabling efficient, reliable, and real-time communication in modern wireless devices operating under resource and energy constraints. This paper presents for the first time a low-complexity peak-power-assisted data-aided channel estimation (DACE) scheme for both single-input single-output (SISO) and multiple-input multiple-output orthogonal frequency division multiplexing (MIMO-OFDM) wireless systems. In OFDM, high peak-power levels occur when the subcarriers align in phase and constructively interfere with each other. The research proposes a peak-power-assisted channel estimation scheme that accurately selects peak-power carriers at the transmitter of an OFDM system and uses them as reliable carriers for the DACE scheme. By incorporating these reliable carriers with known pilot symbols as additional pilot signals, channel estimation accuracy significantly improves in MIMO-OFDM systems. This eliminates the need to determine reliable data symbols at the receiver, thereby significantly reducing the computational complexity of the system. However, high peak-powers are considered a major drawback in OFDM. In this work, we incorporate a companding technique to mitigate this issue and provide sufficient margin for the DACE scheme. The performance of the proposed DACE scheme is evaluated using both least square (LS) and linear minimum mean square error (LMMSE) channel estimators. In this regard, the proposed technique not only improves channel estimation accuracy but also enhances the spectral efficiency of the wireless system. It outperforms traditional channel estimators in terms of system mean square error (MSE) and bit-error-rate (BER) performance. It also reduces the pilot overhead by 50<inline-formula><tex-math>$%$</tex-math></inline-formula> compared to traditional channel estimators and provides bandwidth optimization for MIMO-OFDM systems. This makes it a promising solution for enhancing the performance and efficiency of next-generation wireless communication systems across diverse applications.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"992-1003"},"PeriodicalIF":2.7,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11106695","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144914241","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multispectral Extended Depth-of-Field Imaging via Stochastic Wavefront Optimization 随机波前优化的多光谱扩展景深成像
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-01 DOI: 10.1109/OJSP.2025.3595046
Exequiel Oliva;Nelson Díaz;Samuel Pinilla;Esteban Vera
Extended depth-of-field (EDoF) is a desirable attribute for imaging systems where all features in the scene are in focus despite their relative distance. Traditional imaging systems can achieve EDoF by reducing the aperture size at the expense of signal-to-noise ratio, particularly relevant in spectral imaging systems where incoming light is further divided. By designing and integrating diffractive optical elements (DOEs) placed at the aperture plane of the imaging system, wavefront coding has enabled EDoF while maintaining a larger aperture size at the expense of post-processing. Nevertheless, chromatic aberrations may appear and can often be confused by defocus, jeopardizing the fidelity of the reconstructions. This work presents a novel design approach for a multispectral-aware DOE for EDoF. By considering and modeling a refractive-diffractive optical setup, our proposed system uses the stochastic optimization framework to optimize DOE patterns to preserve spectral fidelity while extending the depth-of-field simultaneously. The optimization process exploits the covariance matrix adaptation evolution strategy (CMA-ES), efficiently exploring complex, high-dimensional phase configurations without the need for explicit gradient information. The optimized DOE is constantly evaluated in a simulated imaging pipeline where the EDoF multispectral datacube is deblurred using Richardson-Lucy deconvolution. Both qualitative and quantitative results demonstrate that the proposed DOE significantly improves depth invariance and spectral fidelity of the reconstructed datacubes compared to conventional and state-of-the-art DOE designs, making it a cost-effective solution for real-world multispectral EDoF applications.
扩展景深(EDoF)是成像系统的理想属性,其中场景中的所有特征尽管相对距离较远,但仍能对焦。传统成像系统可以通过降低孔径尺寸来实现EDoF,但代价是降低信噪比,特别是在光谱成像系统中,入射光被进一步分割。通过设计和集成放置在成像系统孔径平面上的衍射光学元件(do),波前编码实现了EDoF,同时以后处理为代价保持更大的孔径尺寸。然而,色差可能会出现,并且经常会因散焦而混淆,从而危及重建的保真度。本文提出了一种新的多光谱感知DOE的设计方法。通过考虑和建模折射-衍射光学装置,我们提出的系统使用随机优化框架来优化DOE模式,以保持光谱保真度,同时扩展景深。优化过程利用协方差矩阵自适应进化策略(CMA-ES),在不需要显式梯度信息的情况下有效地探索复杂的高维相位配置。优化后的DOE在模拟成像管道中不断进行评估,其中使用Richardson-Lucy反卷积对EDoF多光谱数据进行去模糊处理。定性和定量结果表明,与传统和最先进的DOE设计相比,所提出的DOE显着提高了重建数据的深度不变性和光谱保真度,使其成为现实世界中多光谱EDoF应用的经济有效的解决方案。
{"title":"Multispectral Extended Depth-of-Field Imaging via Stochastic Wavefront Optimization","authors":"Exequiel Oliva;Nelson Díaz;Samuel Pinilla;Esteban Vera","doi":"10.1109/OJSP.2025.3595046","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3595046","url":null,"abstract":"Extended depth-of-field (EDoF) is a desirable attribute for imaging systems where all features in the scene are in focus despite their relative distance. Traditional imaging systems can achieve EDoF by reducing the aperture size at the expense of signal-to-noise ratio, particularly relevant in spectral imaging systems where incoming light is further divided. By designing and integrating diffractive optical elements (DOEs) placed at the aperture plane of the imaging system, wavefront coding has enabled EDoF while maintaining a larger aperture size at the expense of post-processing. Nevertheless, chromatic aberrations may appear and can often be confused by defocus, jeopardizing the fidelity of the reconstructions. This work presents a novel design approach for a multispectral-aware DOE for EDoF. By considering and modeling a refractive-diffractive optical setup, our proposed system uses the stochastic optimization framework to optimize DOE patterns to preserve spectral fidelity while extending the depth-of-field simultaneously. The optimization process exploits the covariance matrix adaptation evolution strategy (CMA-ES), efficiently exploring complex, high-dimensional phase configurations without the need for explicit gradient information. The optimized DOE is constantly evaluated in a simulated imaging pipeline where the EDoF multispectral datacube is deblurred using Richardson-Lucy deconvolution. Both qualitative and quantitative results demonstrate that the proposed DOE significantly improves depth invariance and spectral fidelity of the reconstructed datacubes compared to conventional and state-of-the-art DOE designs, making it a cost-effective solution for real-world multispectral EDoF applications.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"965-974"},"PeriodicalIF":2.7,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11106763","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144842966","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Spatial Sigma-Delta Approach to Mitigation of Power Amplifier Distortions in Massive MIMO Downlink 空间Sigma-Delta方法缓解大规模MIMO下行链路中功率放大器失真
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-07-23 DOI: 10.1109/OJSP.2025.3589747
Yatao Liu;Mingjie Shao;Wing-Kin Ma
In massive multiple-input multiple-output (MIMO) downlink systems, the physical implementation of the base stations (BSs) requires the use of cheap and power-efficient power amplifiers (PAs) to avoid high hardware cost and high power consumption. However, such PAs usually have limited linear amplification ranges. Nonlinear distortions arising from operation beyond the linear amplification ranges can significantly degrade system performance. Existing approaches to handle the nonlinear distortions, such as digital predistortion (DPD), typically require accurate knowledge, or acquisition, of the PA transfer function. In this paper, we present a new concept for mitigation of the PA distortions. Assuming a uniform linear array (ULA) at the BS, the idea is to apply a Sigma-Delta ($Sigma Delta$) modulator to spatially shape the PA distortions to the high-angle region. By having the system operating in the low-angle region, the received signals are less affected by the PA distortions. To demonstrate the potential of this spatial $Sigma Delta$ approach, we study the application of our approach to the multi-user MIMO-orthogonal frequency division modulation (OFDM) downlink scenario. A symbol-level precoding (SLP) scheme and a zero-forcing (ZF) precoding scheme, with the new design requirement by the spatial $Sigma Delta$ approach being taken into account, are developed. Numerical simulations are performed to show the effectiveness of the developed $Sigma Delta$ precoding schemes.
在大规模多输入多输出(MIMO)下行链路系统中,为了避免高硬件成本和高功耗,基站(BSs)的物理实现需要使用廉价且节能的功率放大器(pa)。然而,这种放大器通常具有有限的线性放大范围。超出线性放大范围的非线性失真会显著降低系统性能。现有的处理非线性失真的方法,如数字预失真(DPD),通常需要准确地了解或获取PA传递函数。在本文中,我们提出了一个新的概念,以减轻PA失真。假设在BS处有一个均匀线性阵列(ULA),我们的想法是应用Sigma-Delta ($Sigma Delta$)调制器在空间上塑造高角度区域的PA扭曲。通过使系统工作在低角度区域,接收到的信号受扩频失真的影响较小。为了证明这种空间$Sigma Delta$方法的潜力,我们研究了我们的方法在多用户mimo -正交频分调制(OFDM)下行场景中的应用。考虑到空间$Sigma Delta$方法的新设计要求,提出了符号级预编码(SLP)方案和强制零预编码(ZF)方案。通过数值仿真验证了$Sigma Delta$预编码方案的有效性。
{"title":"A Spatial Sigma-Delta Approach to Mitigation of Power Amplifier Distortions in Massive MIMO Downlink","authors":"Yatao Liu;Mingjie Shao;Wing-Kin Ma","doi":"10.1109/OJSP.2025.3589747","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3589747","url":null,"abstract":"In massive multiple-input multiple-output (MIMO) downlink systems, the physical implementation of the base stations (BSs) requires the use of cheap and power-efficient power amplifiers (PAs) to avoid high hardware cost and high power consumption. However, such PAs usually have limited linear amplification ranges. Nonlinear distortions arising from operation beyond the linear amplification ranges can significantly degrade system performance. Existing approaches to handle the nonlinear distortions, such as digital predistortion (DPD), typically require accurate knowledge, or acquisition, of the PA transfer function. In this paper, we present a new concept for mitigation of the PA distortions. Assuming a uniform linear array (ULA) at the BS, the idea is to apply a Sigma-Delta (<inline-formula><tex-math>$Sigma Delta$</tex-math></inline-formula>) modulator to spatially shape the PA distortions to the high-angle region. By having the system operating in the low-angle region, the received signals are less affected by the PA distortions. To demonstrate the potential of this spatial <inline-formula><tex-math>$Sigma Delta$</tex-math></inline-formula> approach, we study the application of our approach to the multi-user MIMO-orthogonal frequency division modulation (OFDM) downlink scenario. A symbol-level precoding (SLP) scheme and a zero-forcing (ZF) precoding scheme, with the new design requirement by the spatial <inline-formula><tex-math>$Sigma Delta$</tex-math></inline-formula> approach being taken into account, are developed. Numerical simulations are performed to show the effectiveness of the developed <inline-formula><tex-math>$Sigma Delta$</tex-math></inline-formula> precoding schemes.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"900-916"},"PeriodicalIF":2.7,"publicationDate":"2025-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11091482","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144750802","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cross-Dataset Head-Related Transfer Function Harmonization Based on Perceptually Relevant Loss Function 基于感知相关损失函数的跨数据集头部相关传递函数协调
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-07-17 DOI: 10.1109/OJSP.2025.3590248
Jiale Zhao;Dingding Yao;Junfeng Li
Head-Related Transfer Functions (HRTFs) play a vital role in binaural spatial audio rendering. With the release of numerous HRTF datasets in recent years, abundant data has become available to support HRTF-related research based on deep learning. However, measurement discrepancies across different datasets introduce significant variations in the data and directly merging these datasets may lead to systematic biases. The recent Listener Acoustic Personalization Challenge 2024 (European Signal Processing Conference) dealt with this issue, with the task of harmonizing different datasets to achieve lower classification accuracy while meeting thresholds over various localization metrics. To mitigate cross-dataset differences, this paper proposes a neural network-based HRTF harmonization approach aimed at eliminating dataset-specific properties embedded in the original measurements. The proposed method utilizes a perceptually relevant loss function, which jointly constrains multiple objectives, including interaural level differences, auditory-filter excitation patterns, and classification accuracy. Experimental results based on eight datasets demonstrate that the proposed approach can effectively minimize distributional disparities between datasets while mostly preserving localization performance. The classification accuracy for harmonized HRTFs between different datasets is reduced to as low as 31%, indicating a significant reduction in cross-dataset discrepancies. The proposed method ranked first in this challenge, which validates its effectiveness.
头部相关传递函数(hrtf)在双耳空间音频渲染中起着至关重要的作用。近年来,随着大量HRTF数据集的发布,为基于深度学习的HRTF相关研究提供了丰富的数据支持。然而,不同数据集之间的测量差异会导致数据的显著变化,直接合并这些数据集可能会导致系统偏差。最近的听众声学个性化挑战2024(欧洲信号处理会议)处理了这个问题,其任务是协调不同的数据集,以达到较低的分类精度,同时满足各种定位指标的阈值。为了减轻跨数据集的差异,本文提出了一种基于神经网络的HRTF协调方法,旨在消除嵌入在原始测量中的数据集特定属性。该方法利用感知相关损失函数,共同约束多个目标,包括耳间电平差异、听觉滤波激励模式和分类精度。基于8个数据集的实验结果表明,该方法可以有效地减少数据集之间的分布差异,同时基本保持定位性能。不同数据集之间协调hrtf的分类准确率降至31%,表明跨数据集差异显著降低。该方法在本次挑战中排名第一,验证了其有效性。
{"title":"Cross-Dataset Head-Related Transfer Function Harmonization Based on Perceptually Relevant Loss Function","authors":"Jiale Zhao;Dingding Yao;Junfeng Li","doi":"10.1109/OJSP.2025.3590248","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3590248","url":null,"abstract":"Head-Related Transfer Functions (HRTFs) play a vital role in binaural spatial audio rendering. With the release of numerous HRTF datasets in recent years, abundant data has become available to support HRTF-related research based on deep learning. However, measurement discrepancies across different datasets introduce significant variations in the data and directly merging these datasets may lead to systematic biases. The recent Listener Acoustic Personalization Challenge 2024 (European Signal Processing Conference) dealt with this issue, with the task of harmonizing different datasets to achieve lower classification accuracy while meeting thresholds over various localization metrics. To mitigate cross-dataset differences, this paper proposes a neural network-based HRTF harmonization approach aimed at eliminating dataset-specific properties embedded in the original measurements. The proposed method utilizes a perceptually relevant loss function, which jointly constrains multiple objectives, including interaural level differences, auditory-filter excitation patterns, and classification accuracy. Experimental results based on eight datasets demonstrate that the proposed approach can effectively minimize distributional disparities between datasets while mostly preserving localization performance. The classification accuracy for harmonized HRTFs between different datasets is reduced to as low as 31%, indicating a significant reduction in cross-dataset discrepancies. The proposed method ranked first in this challenge, which validates its effectiveness.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"865-875"},"PeriodicalIF":2.7,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11082560","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144739816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of signal processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1