Physically Architected Recurrent Neural Networks for Nonlinear Dynamical Loudspeaker Modeling

IF 5.8 2区 工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Signal Processing Pub Date : 2024-10-14 DOI:10.1109/TSP.2024.3480321
Christian Gruber;Gerald Enzner;Rainer Martin
{"title":"Physically Architected Recurrent Neural Networks for Nonlinear Dynamical Loudspeaker Modeling","authors":"Christian Gruber;Gerald Enzner;Rainer Martin","doi":"10.1109/TSP.2024.3480321","DOIUrl":null,"url":null,"abstract":"The nonlinear behavior of loudspeakers is of great interest in a number of audio processing algorithms, as it may have a detrimental effect on their performance. These algorithms may be further enhanced when an accurate model of the loudspeaker's input-output behavior is available. A variety of approaches has been investigated in the past to solve this task via nonlinear adaptive system identification. Their modeling capabilities are often limited due to a mismatch with electroacoustic principles of real loudspeakers. This paper therefore presents a novel approach using recurrent neural networks (RNN) specifically designed to match the dynamical loudspeaker's physical model behavior. By means of the physical model and its corresponding state-space representation, we derive three conceptually different RNN architectures, which are initially trained on synthetic audio data in order to gain insights into the required training procedure and limitations. Further training and evaluation of the preferred architecture on real loudspeaker recordings shows consistent improvements of the mean-squared modeling error compared to a linear system model and to nonlinear baseline algorithms.","PeriodicalId":13330,"journal":{"name":"IEEE Transactions on Signal Processing","volume":"72 ","pages":"5371-5387"},"PeriodicalIF":5.8000,"publicationDate":"2024-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10716735/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

The nonlinear behavior of loudspeakers is of great interest in a number of audio processing algorithms, as it may have a detrimental effect on their performance. These algorithms may be further enhanced when an accurate model of the loudspeaker's input-output behavior is available. A variety of approaches has been investigated in the past to solve this task via nonlinear adaptive system identification. Their modeling capabilities are often limited due to a mismatch with electroacoustic principles of real loudspeakers. This paper therefore presents a novel approach using recurrent neural networks (RNN) specifically designed to match the dynamical loudspeaker's physical model behavior. By means of the physical model and its corresponding state-space representation, we derive three conceptually different RNN architectures, which are initially trained on synthetic audio data in order to gain insights into the required training procedure and limitations. Further training and evaluation of the preferred architecture on real loudspeaker recordings shows consistent improvements of the mean-squared modeling error compared to a linear system model and to nonlinear baseline algorithms.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
用于非线性动态扬声器建模的物理架构递归神经网络
扬声器的非线性行为是许多音频处理算法非常感兴趣的问题,因为它可能对扬声器的性能产生不利影响。当扬声器输入输出行为的精确模型可用时,这些算法可能会进一步增强。过去已经研究了各种方法来通过非线性自适应系统辨识来解决这一任务。由于与真实扬声器的电声原理不匹配,它们的建模能力往往受到限制。因此,本文提出了一种新颖的方法,使用递归神经网络(RNN)来匹配动态扬声器的物理模型行为。通过物理模型及其相应的状态空间表示,我们推导出三种概念上不同的RNN架构,它们最初在合成音频数据上进行训练,以便深入了解所需的训练过程和局限性。在真实扬声器记录上对首选架构进行进一步的训练和评估表明,与线性系统模型和非线性基线算法相比,均方建模误差得到了一致的改善。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Signal Processing
IEEE Transactions on Signal Processing 工程技术-工程:电子与电气
CiteScore
11.20
自引率
9.30%
发文量
310
审稿时长
3.0 months
期刊介绍: The IEEE Transactions on Signal Processing covers novel theory, algorithms, performance analyses and applications of techniques for the processing, understanding, learning, retrieval, mining, and extraction of information from signals. The term “signal” includes, among others, audio, video, speech, image, communication, geophysical, sonar, radar, medical and musical signals. Examples of topics of interest include, but are not limited to, information processing and the theory and application of filtering, coding, transmitting, estimating, detecting, analyzing, recognizing, synthesizing, recording, and reproducing signals.
期刊最新文献
Asymptotically Optimal Search for a Change Point Anomaly under a Composite Hypothesis Model Where Prior Learning Can and Can’t Work in Unsupervised Inverse Problems A Fast Robust Adaptive Filter using Improved Data-Reuse Method Einstein from Noise: Statistical Analysis A Large-dimensional Analysis of ESPRIT DoA Estimation: Inconsistency and a Correction via RMT
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1