Physically Architected Recurrent Neural Networks for Nonlinear Dynamical Loudspeaker Modeling

IF 5.8 2区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Signal Processing Pub Date : 2024-10-14 DOI:10.1109/TSP.2024.3480321

Christian Gruber;Gerald Enzner;Rainer Martin

{"title":"Physically Architected Recurrent Neural Networks for Nonlinear Dynamical Loudspeaker Modeling","authors":"Christian Gruber;Gerald Enzner;Rainer Martin","doi":"10.1109/TSP.2024.3480321","DOIUrl":null,"url":null,"abstract":"The nonlinear behavior of loudspeakers is of great interest in a number of audio processing algorithms, as it may have a detrimental effect on their performance. These algorithms may be further enhanced when an accurate model of the loudspeaker's input-output behavior is available. A variety of approaches has been investigated in the past to solve this task via nonlinear adaptive system identification. Their modeling capabilities are often limited due to a mismatch with electroacoustic principles of real loudspeakers. This paper therefore presents a novel approach using recurrent neural networks (RNN) specifically designed to match the dynamical loudspeaker's physical model behavior. By means of the physical model and its corresponding state-space representation, we derive three conceptually different RNN architectures, which are initially trained on synthetic audio data in order to gain insights into the required training procedure and limitations. Further training and evaluation of the preferred architecture on real loudspeaker recordings shows consistent improvements of the mean-squared modeling error compared to a linear system model and to nonlinear baseline algorithms.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"72 ","pages":"5371-5387"},"PeriodicalIF":5.8000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10716735/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

The nonlinear behavior of loudspeakers is of great interest in a number of audio processing algorithms, as it may have a detrimental effect on their performance. These algorithms may be further enhanced when an accurate model of the loudspeaker's input-output behavior is available. A variety of approaches has been investigated in the past to solve this task via nonlinear adaptive system identification. Their modeling capabilities are often limited due to a mismatch with electroacoustic principles of real loudspeakers. This paper therefore presents a novel approach using recurrent neural networks (RNN) specifically designed to match the dynamical loudspeaker's physical model behavior. By means of the physical model and its corresponding state-space representation, we derive three conceptually different RNN architectures, which are initially trained on synthetic audio data in order to gain insights into the required training procedure and limitations. Further training and evaluation of the preferred architecture on real loudspeaker recordings shows consistent improvements of the mean-squared modeling error compared to a linear system model and to nonlinear baseline algorithms.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用于非线性动态扬声器建模的物理架构递归神经网络

扬声器的非线性行为是许多音频处理算法非常感兴趣的问题，因为它可能对扬声器的性能产生不利影响。当扬声器输入输出行为的精确模型可用时，这些算法可能会进一步增强。过去已经研究了各种方法来通过非线性自适应系统辨识来解决这一任务。由于与真实扬声器的电声原理不匹配，它们的建模能力往往受到限制。因此，本文提出了一种新颖的方法，使用递归神经网络（RNN）来匹配动态扬声器的物理模型行为。通过物理模型及其相应的状态空间表示，我们推导出三种概念上不同的RNN架构，它们最初在合成音频数据上进行训练，以便深入了解所需的训练过程和局限性。在真实扬声器记录上对首选架构进行进一步的训练和评估表明，与线性系统模型和非线性基线算法相比，均方建模误差得到了一致的改善。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Signal Processing 工程技术-工程：电子与电气

CiteScore

11.20

自引率

9.30%

发文量

310

审稿时长

3.0 months

期刊介绍： The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.

期刊最新文献

Asymptotically Optimal Search for a Change Point Anomaly under a Composite Hypothesis Model Where Prior Learning Can and Can’t Work in Unsupervised Inverse Problems A Fast Robust Adaptive Filter using Improved Data-Reuse Method Einstein from Noise: Statistical Analysis A Large-dimensional Analysis of ESPRIT DoA Estimation: Inconsistency and a Correction via RMT