首页 > 最新文献

IEEE open journal of signal processing最新文献

英文 中文
Anderson Accelerated Operator Splitting Methods for Convex-Nonconvex Regularized Problems 凸-非凸正则化问题的Anderson加速算子分裂方法
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-10-06 DOI: 10.1109/OJSP.2025.3618583
Qiang Heng;Xiaoqian Liu;Eric C. Chi
Convex–nonconvex (CNC) regularization is a novel paradigm that employs a nonconvex penalty function while preserving the convexity of the overall objective function. It has found successful applications in signal processing, statistics, and machine learning. Despite its wide applicability, the computation of CNC-regularized problems is still dominated by the forward–backward splitting method, which can be computationally slow in practice and is restricted to handling a single regularizer. To address these limitations, we develop a unified Anderson acceleration framework that encompasses multiple prevalent operator-splitting schemes, thereby enabling the efficient solution of a broad class of CNC-regularized problems with a quadratic data-fidelity term. We establish global convergence of the proposed algorithm to an optimal point and demonstrate its substantial speed-ups across diverse applications.
凸-非凸正则化(CNC)是一种采用非凸惩罚函数同时保持整体目标函数的凸性的新范式。它已经在信号处理、统计学和机器学习中得到了成功的应用。尽管具有广泛的适用性,但cnc正则化问题的计算仍然以前向向后分裂方法为主,这种方法在实际中计算速度很慢,并且仅限于处理单个正则化器。为了解决这些限制,我们开发了一个统一的安德森加速框架,该框架包含多个流行的算子分裂方案,从而能够有效地解决具有二次数据保真度项的广泛类别的cnc正则化问题。我们建立了该算法到最优点的全局收敛性,并证明了其在不同应用中的显著加速。
{"title":"Anderson Accelerated Operator Splitting Methods for Convex-Nonconvex Regularized Problems","authors":"Qiang Heng;Xiaoqian Liu;Eric C. Chi","doi":"10.1109/OJSP.2025.3618583","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3618583","url":null,"abstract":"Convex–nonconvex (CNC) regularization is a novel paradigm that employs a nonconvex penalty function while preserving the convexity of the overall objective function. It has found successful applications in signal processing, statistics, and machine learning. Despite its wide applicability, the computation of CNC-regularized problems is still dominated by the forward–backward splitting method, which can be computationally slow in practice and is restricted to handling a single regularizer. To address these limitations, we develop a unified Anderson acceleration framework that encompasses multiple prevalent operator-splitting schemes, thereby enabling the efficient solution of a broad class of CNC-regularized problems with a quadratic data-fidelity term. We establish global convergence of the proposed algorithm to an optimal point and demonstrate its substantial speed-ups across diverse applications.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1094-1108"},"PeriodicalIF":2.7,"publicationDate":"2025-10-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11194222","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145352016","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Parameter-Efficient Multi-Task and Multi-Domain Learning Using Factorized Tensor Networks 基于分解张量网络的参数高效多任务多域学习
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-09-22 DOI: 10.1109/OJSP.2025.3613142
Yash Garg;Nebiyou Yismaw;Rakib Hyder;Ashley Prater-Bennette;Amit Roy-Chowdhury;M. Salman Asif
Multi-task and multi-domain learning methods seek to learn multiple tasks/domains, jointly or one after another, using a single unified network. The primary challenge and opportunity lie in leveraging shared information across these tasks and domains to enhance the efficiency of the unified network. The efficiency can be in terms of accuracy, storage cost, computation, or sample complexity. In this paper, we introduce a factorized tensor network (FTN) designed to achieve accuracy comparable to that of independent single-task or single-domain networks, while introducing a minimal number of additional parameters. The FTN approach entails incorporating task- or domain-specific low-rank tensor factors into a shared frozen network derived from a source model. This strategy allows for adaptation to numerous target domains and tasks without encountering catastrophic forgetting. Furthermore, FTN requires a significantly smaller number of task-specific parameters compared to existing methods. We performed experiments on widely used multi-domain and multi-task datasets. We show the experiments on convolutional-based architecture with different backbones and on transformer-based architecture. Our findings indicate that FTN attains similar accuracy as single-task or single-domain methods while using only a fraction of additional parameters per task.
多任务和多领域学习方法寻求使用一个统一的网络,共同或依次学习多个任务/领域。主要的挑战和机遇在于利用这些任务和领域之间的共享信息来提高统一网络的效率。效率可以体现在准确性、存储成本、计算或样本复杂性方面。在本文中,我们引入了一个分解张量网络(FTN),旨在达到与独立的单任务或单域网络相当的精度,同时引入了最少数量的附加参数。FTN方法需要将任务或领域特定的低秩张量因子合并到从源模型派生的共享冻结网络中。这种策略允许适应许多目标领域和任务,而不会遇到灾难性的遗忘。此外,与现有方法相比,FTN需要的任务特定参数数量要少得多。我们在广泛使用的多领域和多任务数据集上进行了实验。我们展示了基于不同主干的卷积架构和基于变压器的架构的实验。我们的研究结果表明,FTN在每个任务只使用一小部分额外参数的情况下,获得了与单任务或单域方法相似的准确性。
{"title":"Parameter-Efficient Multi-Task and Multi-Domain Learning Using Factorized Tensor Networks","authors":"Yash Garg;Nebiyou Yismaw;Rakib Hyder;Ashley Prater-Bennette;Amit Roy-Chowdhury;M. Salman Asif","doi":"10.1109/OJSP.2025.3613142","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3613142","url":null,"abstract":"Multi-task and multi-domain learning methods seek to learn multiple tasks/domains, jointly or one after another, using a single unified network. The primary challenge and opportunity lie in leveraging shared information across these tasks and domains to enhance the efficiency of the unified network. The efficiency can be in terms of accuracy, storage cost, computation, or sample complexity. In this paper, we introduce a factorized tensor network (FTN) designed to achieve accuracy comparable to that of independent single-task or single-domain networks, while introducing a minimal number of additional parameters. The FTN approach entails incorporating task- or domain-specific low-rank tensor factors into a shared frozen network derived from a source model. This strategy allows for adaptation to numerous target domains and tasks without encountering catastrophic forgetting. Furthermore, FTN requires a significantly smaller number of task-specific parameters compared to existing methods. We performed experiments on widely used multi-domain and multi-task datasets. We show the experiments on convolutional-based architecture with different backbones and on transformer-based architecture. Our findings indicate that FTN attains similar accuracy as single-task or single-domain methods while using only a fraction of additional parameters per task.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1077-1085"},"PeriodicalIF":2.7,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11175489","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145255920","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial Upsampling of Head-Related Impulse Responses via Elevation-Wise Encoder-Decoder Networks 通过高程方向编码器-解码器网络的头部相关脉冲响应空间上采样
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-09-22 DOI: 10.1109/OJSP.2025.3613209
Camilo Arevalo;Julián Villegas
A method for performing spatial upsampling of Head-Related Impulse Responses (HRIRs) from sparse measurements is introduced. Based on a supervised elevation-wise encoder-decoder network design, we present two variants: one that performs progressive reconstructions with feed-forward connections from higher to lower elevations, and another that excludes these connections. The variants were evaluated in terms of the errors in interaural time and level differences, as well as the spectral distortion in the ipsilateral and contralateral ears. The additional complexity introduced by the variant with feed-forward connections does not always translate into accuracy gains, making the simpler variant preferable for efficiency. Performance generally improved as the number of available measurements increased. However, accuracy was also found to strongly depend on the spatial distribution of those measurements. Compared to an average non-personalized HRIRs, interaural time differences remain similar, while the proposed method achieves higher spectral and level accuracy, highlighting its practical use for HRIR upsampling.
介绍了一种利用稀疏测量对头部相关脉冲响应进行空间上采样的方法。基于监督海拔方向的编码器-解码器网络设计,我们提出了两种变体:一种是通过从高海拔到低海拔的前馈连接执行渐进式重建,另一种是排除这些连接。根据耳间时间误差和水平差,以及同侧和对侧耳的频谱失真来评估变异。带有前馈连接的变体所带来的额外复杂性并不总是转化为准确性的提高,这使得更简单的变体更有利于效率。性能通常随着可用度量的增加而提高。然而,人们还发现,准确性在很大程度上取决于这些测量的空间分布。与非个性化的平均HRIR相比,该方法保持了相似的时间差异,同时获得了更高的光谱和水平精度,突出了其在HRIR上采样中的实用性。
{"title":"Spatial Upsampling of Head-Related Impulse Responses via Elevation-Wise Encoder-Decoder Networks","authors":"Camilo Arevalo;Julián Villegas","doi":"10.1109/OJSP.2025.3613209","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3613209","url":null,"abstract":"A method for performing spatial upsampling of Head-Related Impulse Responses (HRIRs) from sparse measurements is introduced. Based on a supervised elevation-wise encoder-decoder network design, we present two variants: one that performs progressive reconstructions with feed-forward connections from higher to lower elevations, and another that excludes these connections. The variants were evaluated in terms of the errors in interaural time and level differences, as well as the spectral distortion in the ipsilateral and contralateral ears. The additional complexity introduced by the variant with feed-forward connections does not always translate into accuracy gains, making the simpler variant preferable for efficiency. Performance generally improved as the number of available measurements increased. However, accuracy was also found to strongly depend on the spatial distribution of those measurements. Compared to an average non-personalized HRIRs, interaural time differences remain similar, while the proposed method achieves higher spectral and level accuracy, highlighting its practical use for HRIR upsampling.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1086-1093"},"PeriodicalIF":2.7,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11175513","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145351900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Spatial Upsampling of Head-Related Transfer Function Using Neural Network Conditioned on Source Position and Frequency 基于源位置和频率的神经网络头部相关传递函数空间上采样
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-09-22 DOI: 10.1109/OJSP.2025.3613132
Yuki Ito;Tomohiko Nakamura;Shoichi Koyama;Shuichi Sakamoto;Hiroshi Saruwatari
A spatial upsampling method for the head-related transfer function (HRTF) using deep neural networks (DNNs), consisting of an autoencoder conditioned on the source position and frequency, is proposed. On the basis of our finding that the conventional regularized linear regression (RLR)-based upsampling method can be reinterpreted as a linear autoencoder, we designed our network architecture as a nonlinear extension of the RLR-based method, whose key features are the encoder and decoder weights depending on the source positions and the latent variables independent of the source positions. We also extend this architecture to upsample HRTFs and interaural time differences (ITDs) in a single network, which allows us to efficiently obtain head-related impulse responses (HRIRs). Experimental results on upsampling accuracy and perceptual quality indicated that our proposed method can upsample HRTFs from sparse measurements with sufficient quality.
提出了一种基于深度神经网络(dnn)的头部相关传递函数(HRTF)空间上采样方法,该方法由一个以源位置和频率为条件的自编码器组成。基于传统的正则化线性回归(RLR)上采样方法可以被重新解释为线性自编码器,我们将网络架构设计为基于RLR方法的非线性扩展,其关键特征是编码器和解码器的权重取决于源位置和独立于源位置的潜在变量。我们还扩展了该架构,在单个网络中对hrtf和内部时差(ITDs)进行上采样,这使我们能够有效地获得头部相关脉冲响应(HRIRs)。上采样精度和感知质量的实验结果表明,该方法能够以足够的质量从稀疏测量中上采样hrtf。
{"title":"Spatial Upsampling of Head-Related Transfer Function Using Neural Network Conditioned on Source Position and Frequency","authors":"Yuki Ito;Tomohiko Nakamura;Shoichi Koyama;Shuichi Sakamoto;Hiroshi Saruwatari","doi":"10.1109/OJSP.2025.3613132","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3613132","url":null,"abstract":"A spatial upsampling method for the head-related transfer function (HRTF) using deep neural networks (DNNs), consisting of an autoencoder conditioned on the source position and frequency, is proposed. On the basis of our finding that the conventional regularized linear regression (RLR)-based upsampling method can be reinterpreted as a linear autoencoder, we designed our network architecture as a nonlinear extension of the RLR-based method, whose key features are the encoder and decoder weights depending on the source positions and the latent variables independent of the source positions. We also extend this architecture to upsample HRTFs and interaural time differences (ITDs) in a single network, which allows us to efficiently obtain head-related impulse responses (HRIRs). Experimental results on upsampling accuracy and perceptual quality indicated that our proposed method can upsample HRTFs from sparse measurements with sufficient quality.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1109-1123"},"PeriodicalIF":2.7,"publicationDate":"2025-09-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11175492","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145405295","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Constrained Cramér-Rao Bound for Higher-Order Singular Value Decomposition 高阶奇异值分解的约束cram<s:1> - rao界
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-09-08 DOI: 10.1109/OJSP.2025.3607278
Metin Calis;Massimo Mischi;Alle-Jan van der Veen;Raj Thilak Rajan;Borbàla Hunyadi
Tensor decomposition methods for signal processing applications are an active area of research. Real data are often low-rank, noisy, and come in a higher-order format. As such, low-rank tensor approximation methods that account for the high-order structure of the data are often used for denoising. One way to represent a tensor in a low-rank form is to decompose the tensor into a set of orthonormal factor matrices and an all-orthogonal core tensor using a higher-order singular value decomposition. Under noisy measurements, the lower bound for recovering the factor matrices and the core tensor is unknown. In this paper, we exploit the well-studied constrained Cramér-Rao bound to calculate a lower bound on the mean squared error of the unbiased estimates of the components of the multilinear singular value decomposition under additive white Gaussian noise, and we validate our approach through simulations.
张量分解方法在信号处理中的应用是一个活跃的研究领域。真实数据通常是低秩的、有噪声的,并且以高阶格式出现。因此,考虑到数据的高阶结构的低秩张量近似方法通常用于去噪。以低秩形式表示张量的一种方法是使用高阶奇异值分解将张量分解为一组标准正交因子矩阵和一个全正交核心张量。在噪声测量下,恢复因子矩阵和核心张量的下界是未知的。本文利用已被广泛研究的约束cram r- rao界,计算了加性高斯白噪声下多元线性奇异值分解分量的无偏估计均方误差的下界,并通过仿真验证了该方法。
{"title":"Constrained Cramér-Rao Bound for Higher-Order Singular Value Decomposition","authors":"Metin Calis;Massimo Mischi;Alle-Jan van der Veen;Raj Thilak Rajan;Borbàla Hunyadi","doi":"10.1109/OJSP.2025.3607278","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3607278","url":null,"abstract":"Tensor decomposition methods for signal processing applications are an active area of research. Real data are often low-rank, noisy, and come in a higher-order format. As such, low-rank tensor approximation methods that account for the high-order structure of the data are often used for denoising. One way to represent a tensor in a low-rank form is to decompose the tensor into a set of orthonormal factor matrices and an all-orthogonal core tensor using a higher-order singular value decomposition. Under noisy measurements, the lower bound for recovering the factor matrices and the core tensor is unknown. In this paper, we exploit the well-studied constrained Cramér-Rao bound to calculate a lower bound on the mean squared error of the unbiased estimates of the components of the multilinear singular value decomposition under additive white Gaussian noise, and we validate our approach through simulations.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1048-1055"},"PeriodicalIF":2.7,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11153050","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145090152","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Gaussian Filtering Using a Spherical-Radial Double Exponential Cubature 利用球-径向双指数模型进行高斯滤波
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-28 DOI: 10.1109/OJSP.2025.3604381
Quade Butler;Youssef Ziada;S. Andrew Gadsden
Gaussian filters use quadrature rules or cubature rules to recursively solve Gaussian-weighted integrals. Classical and contemporary methods use stable rules with a minimal number of cubature points to achieve the highest accuracy. Gaussian quadrature is widely believed to be optimal due to its polynomial degree of exactness and higher degree cubature methods often require complex optimization to solve moment equations. In this paper, Gaussian-weighted integrals and Gaussian filtering are approached using a double exponential (DE) transformation and the trapezoidal rule. The DE rule is principled in high rates of convergence for certain integrands and the DE transform ensures that the trapezoidal rule maximizes its performance. A novel spherical-radial cubature rule is derived for Gaussian-weighted integrals where it is shown to be perfectly stable and highly efficient. A new Gaussian filter is then built on top of this cubature rule. The filter is shown to be stable with bounded estimation error. The effect of varying the number of cubature points on filter stability and convergence is also examined. The advantages of the DE method over comparable Gaussian filters and their cubature methods are outlined. These advantages are realized in two numerical examples: a challenging non-polynomial integral and a benchmark filtering problem. The results show that simple and fundamental cubature methods can lead to great improvements in performance when applied correctly.
高斯滤波器使用正交规则或培养规则递归地求解高斯加权积分。经典和现代的方法使用稳定的规则与最小数量的培养点,以达到最高的精度。高斯正交由于其多项式精度而被广泛认为是最优的,而更高次的培养方法往往需要复杂的优化来求解力矩方程。本文利用双指数变换和梯形规则研究高斯加权积分和高斯滤波。DE规则在某些积分的高收敛率方面是原则性的,并且DE变换确保梯形规则最大化其性能。导出了一种新的球-径向定殖规则,并证明了该规则是完全稳定和高效的。然后在这个培养规则的基础上建立一个新的高斯滤波器。该滤波器在估计误差有界的情况下是稳定的。研究了不同培养点个数对滤波器稳定性和收敛性的影响。概述了DE方法相对于可比较的高斯滤波器及其培养方法的优点。这些优点在两个数值例子中得到了体现:一个具有挑战性的非多项式积分和一个基准滤波问题。结果表明,简单而基本的培养方法在正确应用的情况下可以大大提高性能。
{"title":"Gaussian Filtering Using a Spherical-Radial Double Exponential Cubature","authors":"Quade Butler;Youssef Ziada;S. Andrew Gadsden","doi":"10.1109/OJSP.2025.3604381","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3604381","url":null,"abstract":"Gaussian filters use quadrature rules or cubature rules to recursively solve Gaussian-weighted integrals. Classical and contemporary methods use stable rules with a minimal number of cubature points to achieve the highest accuracy. Gaussian quadrature is widely believed to be optimal due to its polynomial degree of exactness and higher degree cubature methods often require complex optimization to solve moment equations. In this paper, Gaussian-weighted integrals and Gaussian filtering are approached using a double exponential (DE) transformation and the <italic>trapezoidal rule</i>. The DE rule is principled in high rates of convergence for certain integrands and the DE transform ensures that the trapezoidal rule maximizes its performance. A novel spherical-radial cubature rule is derived for Gaussian-weighted integrals where it is shown to be perfectly stable and highly efficient. A new Gaussian filter is then built on top of this cubature rule. The filter is shown to be stable with bounded estimation error. The effect of varying the number of cubature points on filter stability and convergence is also examined. The advantages of the DE method over comparable Gaussian filters and their cubature methods are outlined. These advantages are realized in two numerical examples: a challenging non-polynomial integral and a benchmark filtering problem. The results show that simple and fundamental cubature methods can lead to great improvements in performance when applied correctly.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1056-1076"},"PeriodicalIF":2.7,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11144509","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145141718","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Test-Time Cost-and-Quality Controllable Arbitrary-Scale Super-Resolution With Variable Fourier Components 可变傅立叶分量的测试时间成本和质量可控任意尺度超分辨率
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-25 DOI: 10.1109/OJSP.2025.3602742
Kazutoshi Akita;Norimichi Ukita
Super-resolution (SR) with arbitrary scale factor and cost-and-quality controllability at test time is essential for various applications. While several arbitrary-scale SR methods have been proposed, these methods require us to modify the model structure and retrain it to control the computational cost and SR quality. To address this limitation, we propose a novel SR method using a Recurrent Neural Network (RNN) with the Fourier representation. In our method, the RNN sequentially estimates Fourier components, each consisting of frequency and amplitude, and aggregates these components to reconstruct an SR image. Since the RNN can adjust the number of recurrences at test time, we can control the computational cost and SR quality in a single model: fewer recurrences (i.e., fewer Fourier components) lead to lower cost but lower quality, while more recurrences (i.e., more Fourier components) lead to better quality but more cost. Experimental results prove that more Fourier components improve the PSNR score. Furthermore, even with fewer Fourier components, our method achieves a lower PSNR drop than other state-of-the-art arbitrary-scale SR methods.
在各种应用中,具有任意比例因子和测试时成本和质量可控性的超分辨率(SR)是必不可少的。虽然已经提出了几种任意尺度的SR方法,但这些方法需要我们修改模型结构并对其进行重新训练,以控制计算成本和SR质量。为了解决这一限制,我们提出了一种使用傅里叶表示的递归神经网络(RNN)的新颖SR方法。在我们的方法中,RNN依次估计傅里叶分量,每个分量由频率和幅度组成,并将这些分量聚合以重建SR图像。由于RNN可以在测试时调整递归的数量,我们可以控制单个模型的计算成本和SR质量:更少的递归(即更少的傅里叶分量)导致更低的成本但更低的质量,而更多的递归(即更多的傅里叶分量)导致更好的质量但更多的成本。实验结果表明,增加傅里叶分量可以提高PSNR分数。此外,即使使用较少的傅里叶分量,我们的方法也比其他最先进的任意尺度SR方法实现了更低的PSNR下降。
{"title":"Test-Time Cost-and-Quality Controllable Arbitrary-Scale Super-Resolution With Variable Fourier Components","authors":"Kazutoshi Akita;Norimichi Ukita","doi":"10.1109/OJSP.2025.3602742","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3602742","url":null,"abstract":"Super-resolution (SR) with arbitrary scale factor and cost-and-quality controllability at test time is essential for various applications. While several arbitrary-scale SR methods have been proposed, these methods require us to modify the model structure and retrain it to control the computational cost and SR quality. To address this limitation, we propose a novel SR method using a Recurrent Neural Network (RNN) with the Fourier representation. In our method, the RNN sequentially estimates Fourier components, each consisting of frequency and amplitude, and aggregates these components to reconstruct an SR image. Since the RNN can adjust the number of recurrences at test time, we can control the computational cost and SR quality in a single model: fewer recurrences (i.e., fewer Fourier components) lead to lower cost but lower quality, while more recurrences (i.e., more Fourier components) lead to better quality but more cost. Experimental results prove that more Fourier components improve the PSNR score. Furthermore, even with fewer Fourier components, our method achieves a lower PSNR drop than other state-of-the-art arbitrary-scale SR methods.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1017-1030"},"PeriodicalIF":2.7,"publicationDate":"2025-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11141341","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145036198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sparsity Apprised Logarithmic Hyperbolic Tan Adaptive Filters for Nonlinear System Identification and Acoustic Feedback Cancellation 稀疏度通知对数双曲Tan自适应滤波器用于非线性系统辨识和声反馈抵消
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-20 DOI: 10.1109/OJSP.2025.3600904
Neetu Chikyal;Vasundhara;Chayan Bhar;Asutosh Kar;Mads Græsbøll Christensen
Recently, various robust algorithms based on hyperbolic cosine and sine functions, such as hyperbolic cosine (HCAF), exponential hyperbolic cosine, joint logarithmic hyperbolic cosine adaptive filter, etc., have been predominantly employed for different aspects of adaptive filtering, including nonlinear-system-identification. Further, in this manuscript, an attempt is made to elevate the performance of nonlinear system identification in the wake of impulsive noise interference along with consideration of a sparse environment. Henceforth, in lieu of this, the present paper introduces a new sparsity-apprised logarithmic hyperbolic tan adaptive filter (SA-LHTAF) to handle impulsive noise while dealing with sparse systems. It utilizes a $l_{1}$ norm-related sparsity penalty factor in the robust cost function constructed with a logarithmic hyperbolic tangent function. Further, an improved SA-LHTAF (ISA-LHTAF) is introduced for varying sparsity or moderately sparse systems employing the log sum penalty factor in the proposed technique. The weight update for the proposed technique has been derived from the modified cost function. In addition, the conditions for the upper bound on the convergence factor have been derived. The efficacy of the developed robust techniques is demonstrated for identifying nonlinear systems along with feedback paths of behind-the-ear (BTE) hearing aid. In addition, the proposed techniques are evaluated for training an acoustic feedback canceller for hearing aids.
近年来,各种基于双曲余弦和正弦函数的鲁棒算法,如双曲余弦(HCAF)、指数双曲余弦、联合对数双曲余弦自适应滤波器等,已主要用于自适应滤波的各个方面,包括非线性系统辨识。此外,在本文中,尝试在考虑稀疏环境的情况下,提高脉冲噪声干扰后非线性系统识别的性能。因此,本文引入了一种新的稀疏性通知对数双曲tan自适应滤波器(SA-LHTAF)来处理稀疏系统中的脉冲噪声。它在由对数双曲正切函数构造的鲁棒代价函数中利用了一个$l_{1}$规范相关的稀疏惩罚因子。此外,在提出的技术中,采用对数和惩罚因子,引入了一种改进的SA-LHTAF (ISA-LHTAF),用于变稀疏或中等稀疏的系统。该方法的权值更新由修正后的代价函数推导而来。此外,还导出了收敛因子上界的条件。所开发的鲁棒技术在识别非线性系统以及耳后助听器(BTE)反馈路径方面的有效性得到了证明。此外,所提出的技术被评估用于训练助听器的声反馈消除器。
{"title":"Sparsity Apprised Logarithmic Hyperbolic Tan Adaptive Filters for Nonlinear System Identification and Acoustic Feedback Cancellation","authors":"Neetu Chikyal;Vasundhara;Chayan Bhar;Asutosh Kar;Mads Græsbøll Christensen","doi":"10.1109/OJSP.2025.3600904","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3600904","url":null,"abstract":"Recently, various robust algorithms based on hyperbolic cosine and sine functions, such as hyperbolic cosine (HCAF), exponential hyperbolic cosine, joint logarithmic hyperbolic cosine adaptive filter, etc., have been predominantly employed for different aspects of adaptive filtering, including nonlinear-system-identification. Further, in this manuscript, an attempt is made to elevate the performance of nonlinear system identification in the wake of impulsive noise interference along with consideration of a sparse environment. Henceforth, in lieu of this, the present paper introduces a new sparsity-apprised logarithmic hyperbolic tan adaptive filter (SA-LHTAF) to handle impulsive noise while dealing with sparse systems. It utilizes a <inline-formula><tex-math>$l_{1}$</tex-math></inline-formula> norm-related sparsity penalty factor in the robust cost function constructed with a logarithmic hyperbolic tangent function. Further, an improved SA-LHTAF (ISA-LHTAF) is introduced for varying sparsity or moderately sparse systems employing the log sum penalty factor in the proposed technique. The weight update for the proposed technique has been derived from the modified cost function. In addition, the conditions for the upper bound on the convergence factor have been derived. The efficacy of the developed robust techniques is demonstrated for identifying nonlinear systems along with feedback paths of behind-the-ear (BTE) hearing aid. In addition, the proposed techniques are evaluated for training an acoustic feedback canceller for hearing aids.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1031-1047"},"PeriodicalIF":2.7,"publicationDate":"2025-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11130900","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145090171","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ULDepth: Transform Self-Supervised Depth Estimation to Unpaired Multi-Domain Learning ULDepth:将自监督深度估计转化为非配对多域学习
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-11 DOI: 10.1109/OJSP.2025.3597873
Phan Thi Huyen Thanh;Trung Thai Tran;The Hiep Nguyen;Minh Huy Vu Nguyen;Tran Vu Pham;Truong Vinh Truong Duy;Duc Dung Nguyen
This paper introduces a general plug-in framework designed to enhance the robustness and cross-domain generalization of self-supervised depth estimation models. Current models often struggle with real-world deployment due to their limited ability to generalize across diverse domains, such as varying lighting and weather conditions. Single-domain models are optimized for specific scenarios while existing multi-domain approaches typically rely on paired images, which are rarely available in real-world datasets. Our framework addresses these limitations by training directly on unpaired real images from multiple domains. Daytime images serve as a reference to guide the model in learning consistent depth distributions across these diverse domains through adversarial training, eliminating the need for paired images. To refine regions prone to artifacts, we augment the discriminator with positional encoding, which is combined with the predicted depth maps. We also incorporate a dynamic normalization mechanism to capture shared depth features across domains, removing the requirement for separate domain-specific encoders. Furthermore, we introduce a new benchmark designed for a more comprehensive evaluation, encompassing previously unaddressed real-world scenarios. By focusing on unpaired real data, our framework significantly improves the generalization capabilities of existing models, enabling them to better adapt to the complexities and authentic data encountered in real-world environments.
本文介绍了一个通用的插件框架,旨在提高自监督深度估计模型的鲁棒性和跨域泛化。当前的模型由于其在不同领域(如不同的照明和天气条件)的泛化能力有限,常常难以在现实世界中部署。单域模型针对特定场景进行了优化,而现有的多域方法通常依赖于配对图像,这在现实世界的数据集中很少可用。我们的框架通过直接训练来自多个域的未配对的真实图像来解决这些限制。白天的图像可以作为参考,指导模型通过对抗性训练学习这些不同领域的一致深度分布,从而消除对成对图像的需求。为了细化容易产生伪影的区域,我们使用位置编码增强鉴别器,该编码与预测的深度图相结合。我们还结合了一个动态规范化机制来捕获跨域的共享深度特征,从而消除了对单独的特定于域的编码器的需求。此外,我们引入了一个新的基准,用于更全面的评估,包括以前未解决的现实世界场景。通过关注未配对的真实数据,我们的框架显著提高了现有模型的泛化能力,使它们能够更好地适应现实环境中遇到的复杂性和真实数据。
{"title":"ULDepth: Transform Self-Supervised Depth Estimation to Unpaired Multi-Domain Learning","authors":"Phan Thi Huyen Thanh;Trung Thai Tran;The Hiep Nguyen;Minh Huy Vu Nguyen;Tran Vu Pham;Truong Vinh Truong Duy;Duc Dung Nguyen","doi":"10.1109/OJSP.2025.3597873","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3597873","url":null,"abstract":"This paper introduces a general plug-in framework designed to enhance the robustness and cross-domain generalization of self-supervised depth estimation models. Current models often struggle with real-world deployment due to their limited ability to generalize across diverse domains, such as varying lighting and weather conditions. Single-domain models are optimized for specific scenarios while existing multi-domain approaches typically rely on paired images, which are rarely available in real-world datasets. Our framework addresses these limitations by training directly on unpaired real images from multiple domains. Daytime images serve as a reference to guide the model in learning consistent depth distributions across these diverse domains through adversarial training, eliminating the need for paired images. To refine regions prone to artifacts, we augment the discriminator with positional encoding, which is combined with the predicted depth maps. We also incorporate a dynamic normalization mechanism to capture shared depth features across domains, removing the requirement for separate domain-specific encoders. Furthermore, we introduce a new benchmark designed for a more comprehensive evaluation, encompassing previously unaddressed real-world scenarios. By focusing on unpaired real data, our framework significantly improves the generalization capabilities of existing models, enabling them to better adapt to the complexities and authentic data encountered in real-world environments.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"1004-1016"},"PeriodicalIF":2.7,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11122640","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144928877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Benchmarking Diffusion Annealing-Based Bayesian Inverse Problem Solvers 基于基准扩散退火的贝叶斯反问题求解方法
IF 2.7 Q2 ENGINEERING, ELECTRICAL & ELECTRONIC Pub Date : 2025-08-11 DOI: 10.1109/OJSP.2025.3597867
Evan Scope Crafts;Umberto Villa
In recent years, the ascendance of diffusion modeling as a state-of-the-art generative modeling approach has spurred significant interest in their use as priors in Bayesian inverse problems. However, it is unclear how to optimally integrate a diffusion model trained on the prior distribution with a given likelihood function to obtain posterior samples. While algorithms developed for this purpose can produce high-quality, diverse point estimates of the unknown parameters of interest, they are often tested on problems where the prior distribution is analytically unknown, making it difficult to assess their performance in providing rigorous uncertainty quantification. Motivated by this challenge, this work introduces three benchmark problems for evaluating the performance of diffusion model based samplers. The benchmark problems, which are inspired by problems in image inpainting, x-ray tomography, and phase retrieval, have a posterior density that is analytically known. In this setting, approximate ground-truth posterior samples can be obtained, enabling principled evaluation of the performance of posterior sampling algorithms. This work also introduces a general framework for diffusion model based posterior sampling, Bayesian Inverse Problem Solvers through Diffusion Annealing (BIPSDA). This framework unifies several recently proposed diffusion-model-based posterior sampling algorithms and contains novel algorithms that can be realized through flexible combinations of design choices. We tested the performance of a set of BIPSDA algorithms, including previously proposed state-of-the-art approaches, on the proposed benchmark problems. The results provide insight into the strengths and limitations of existing diffusion-model based posterior samplers, while the benchmark problems provide a testing ground for future algorithmic developments.
近年来,扩散建模作为最先进的生成建模方法的优势激发了人们对将其用作贝叶斯反问题先验的极大兴趣。然而,目前尚不清楚如何将先验分布训练的扩散模型与给定的似然函数最佳地整合以获得后验样本。虽然为此目的开发的算法可以产生高质量的、不同的未知感兴趣参数的点估计,但它们经常在先验分布分析未知的问题上进行测试,这使得很难评估它们在提供严格的不确定性量化方面的性能。在这一挑战的激励下,本文引入了三个基准问题来评估基于扩散模型的采样器的性能。基准问题的灵感来自图像绘制、x射线断层扫描和相位检索中的问题,它们具有分析已知的后验密度。在这种情况下,可以获得近似的真值后验样本,从而可以对后验抽样算法的性能进行原则性评估。本工作还介绍了基于后验抽样的扩散模型的一般框架,即通过扩散退火的贝叶斯反问题求解器(BIPSDA)。该框架统一了最近提出的几种基于扩散模型的后验抽样算法,并包含可以通过灵活组合设计选择来实现的新算法。我们在提出的基准问题上测试了一组BIPSDA算法的性能,包括以前提出的最先进的方法。结果提供了对现有的基于扩散模型的后验采样器的优势和局限性的洞察,而基准问题为未来的算法发展提供了一个试验场。
{"title":"Benchmarking Diffusion Annealing-Based Bayesian Inverse Problem Solvers","authors":"Evan Scope Crafts;Umberto Villa","doi":"10.1109/OJSP.2025.3597867","DOIUrl":"https://doi.org/10.1109/OJSP.2025.3597867","url":null,"abstract":"In recent years, the ascendance of diffusion modeling as a state-of-the-art generative modeling approach has spurred significant interest in their use as priors in Bayesian inverse problems. However, it is unclear how to optimally integrate a diffusion model trained on the prior distribution with a given likelihood function to obtain posterior samples. While algorithms developed for this purpose can produce high-quality, diverse point estimates of the unknown parameters of interest, they are often tested on problems where the prior distribution is analytically unknown, making it difficult to assess their performance in providing rigorous uncertainty quantification. Motivated by this challenge, this work introduces three benchmark problems for evaluating the performance of diffusion model based samplers. The benchmark problems, which are inspired by problems in image inpainting, x-ray tomography, and phase retrieval, have a posterior density that is analytically known. In this setting, approximate ground-truth posterior samples can be obtained, enabling principled evaluation of the performance of posterior sampling algorithms. This work also introduces a general framework for diffusion model based posterior sampling, Bayesian Inverse Problem Solvers through Diffusion Annealing (BIPSDA). This framework unifies several recently proposed diffusion-model-based posterior sampling algorithms and contains novel algorithms that can be realized through flexible combinations of design choices. We tested the performance of a set of BIPSDA algorithms, including previously proposed state-of-the-art approaches, on the proposed benchmark problems. The results provide insight into the strengths and limitations of existing diffusion-model based posterior samplers, while the benchmark problems provide a testing ground for future algorithmic developments.","PeriodicalId":73300,"journal":{"name":"IEEE open journal of signal processing","volume":"6 ","pages":"975-991"},"PeriodicalIF":2.7,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11122619","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144891005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
IEEE open journal of signal processing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1