Journal of the Audio Engineering Society最新文献

英文中文

Loudspeaker Equalization for a Moving Listener 适用于移动听众的扬声器均衡

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0020

Joel Lindfors, Juho Liski, V. Välimäki

When a person listens to loudspeakers, the perceived sound is affected not only by the loudspeaker properties but also by the acoustics of the surroundings. Loudspeaker equalization can be used to correct the loudspeaker-room response. However, when the listener moves in front of the loudspeakers, both the loudspeaker response and room effect change. In order for the best correction to be achieved at all times, adaptive equalization is proposed in this paper. A loudspeaker-correction system using the listener’s current location to determine the correction parameters is proposed. The position of the listener’s head is located using a depth- sensing camera, and suitable equalizer settings are then selected based on measurements and interpolation.Aftercorrectingfortheloudspeaker’sresponseatmultiplelocationsandchangingtheequalizationinrealtimebasedontheuser’slocation,aloudspeakerresponsewithreducedcolorationisachievedcomparedtonocalibrationorconventionalcalibrationmethods,withthemagnitude-responsedeviationsdecreasingfrom10.0to5.6dBwithinthepassbandofahigh-qualityloudspeaker.Theproposedmethodcanimprovetheaudiomonitoringinmusicstudiosandotheroccasionsinwhichasinglelistenerismovinginarestrictedspace.

当一个人听扬声器时，感知的声音不仅受到扬声器特性的影响，还受到周围环境的声学影响。扬声器均衡可用于校正扬声器室的响应。然而，当听众移动到扬声器前面时，扬声器的响应和房间效果都会发生变化。为了在任何时候都能获得最佳的校正，本文提出了自适应均衡。提出了一种使用收听者的当前位置来确定校正参数的扬声器校正系统。听众头部的位置是使用深度感应相机定位的，然后根据测量和插值选择合适的均衡器设置。在校正了扬声器的响应多个位置并根据用户的位置实时改变相等值后，与无校准或传统校准方法相比，扬声器响应降低了色度，在高质量扬声器的通带内，识别响应偏差从10.0降至5.6dB。所提出的方法可以改善音乐研究中的音频监控，也可以改善在有限空间内移动音乐播放器的情况。

{"title":"Loudspeaker Equalization for a Moving Listener","authors":"Joel Lindfors, Juho Liski, V. Välimäki","doi":"10.17743/jaes.2022.0020","DOIUrl":"https://doi.org/10.17743/jaes.2022.0020","url":null,"abstract":"When a person listens to loudspeakers, the perceived sound is affected not only by the loudspeaker properties but also by the acoustics of the surroundings. Loudspeaker equalization can be used to correct the loudspeaker-room response. However, when the listener moves in front of the loudspeakers, both the loudspeaker response and room effect change. In order for the best correction to be achieved at all times, adaptive equalization is proposed in this paper. A loudspeaker-correction system using the listener’s current location to determine the correction parameters is proposed. The position of the listener’s head is located using a depth- sensing camera, and suitable equalizer settings are then selected based on measurements and interpolation.Aftercorrectingfortheloudspeaker’sresponseatmultiplelocationsandchangingtheequalizationinrealtimebasedontheuser’slocation,aloudspeakerresponsewithreducedcolorationisachievedcomparedtonocalibrationorconventionalcalibrationmethods,withthemagnitude-responsedeviationsdecreasingfrom10.0to5.6dBwithinthepassbandofahigh-qualityloudspeaker.Theproposedmethodcanimprovetheaudiomonitoringinmusicstudiosandotheroccasionsinwhichasinglelistenerismovinginarestrictedspace.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"49538954","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Dual-Residual Transformer Network for Speech Recognition 用于语音识别的双残差变换器网络

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0029

Zhikui Duan, Guozhi Gao, Jiawei Chen, Shiren Li, Jinbiao Ruan, Guangguang Yang, Xinmei Yu

引用次数: 4

A Multi-Angle, Multi-Distance Dataset of Microphone Impulse Responses 多角度、多距离麦克风脉冲响应数据集

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0027

Juan Carlos Franco Hernández, Bogdan Baǎcilǎ, Tim S. Brookes, E. De Sena

A new publicly available dataset of microphone impulse responses (IRs) has been gener- ated. The dataset covers 25 microphones, including a Class-1 measurement microphone and polar pattern variations for seven of the microphones. Microphones that were included had omnidirectional, cardioid, supercardioid, and bidirectional polar patterns; condenser, moving-coil, and ribbon transduction types; single and dual diaphragms; multiple body and head basket shapes;smallandlargediaphragms;andend-addressandside-addressdesigns.Usingacustom-developedcomputer-controlledprecisionturntable,IRswerecapturedquasi-anechoicallyatincidentanglesfrom0 ◦ to 355 ◦ in steps of 5 ◦ and at source-to-microphone distances of 0.5, 1.25, and 5 m. The resulting dataset is suitable for perceptual and objective studies related to the incident-angle–dependent response of microphones and for the development of tools for predicting and emulating on-axis and off-axis microphone characteristics. The captured IRs allow generation of frequency response plots with a degree of detail not commonly available in manufacturer-supplied data sheets and are also particularly well-suited to harmonic distortion analysis.

已经生成了一个新的公开可用的麦克风脉冲响应（IR）数据集。该数据集涵盖25个麦克风，包括一个1类测量麦克风和其中7个麦克风的极性模式变化。包括的麦克风具有全向、心形、超心形和双向极性模式；冷凝器、动圈和带状换能类型；单隔膜和双隔膜；多个身体和头部篮子形状；小光圈和大光圈；以及端地址和侧地址设计。使用自定义开发的计算机控制的决策转台，IR可以从0中选择性地获取决策数据◦ 至355◦ 步骤5◦ 源到麦克风的距离分别为0.5、1.25和5米。所得数据集适用于与麦克风的入射角相关响应相关的感知和客观研究，也适用于开发预测和模拟同轴和离轴麦克风特性的工具。捕获的IR允许生成具有制造商提供的数据表中不常见的详细程度的频率响应图，并且也特别适合谐波失真分析。

{"title":"A Multi-Angle, Multi-Distance Dataset of Microphone Impulse Responses","authors":"Juan Carlos Franco Hernández, Bogdan Baǎcilǎ, Tim S. Brookes, E. De Sena","doi":"10.17743/jaes.2022.0027","DOIUrl":"https://doi.org/10.17743/jaes.2022.0027","url":null,"abstract":"A new publicly available dataset of microphone impulse responses (IRs) has been gener- ated. The dataset covers 25 microphones, including a Class-1 measurement microphone and polar pattern variations for seven of the microphones. Microphones that were included had omnidirectional, cardioid, supercardioid, and bidirectional polar patterns; condenser, moving-coil, and ribbon transduction types; single and dual diaphragms; multiple body and head basket shapes;smallandlargediaphragms;andend-addressandside-addressdesigns.Usingacustom-developedcomputer-controlledprecisionturntable,IRswerecapturedquasi-anechoicallyatincidentanglesfrom0 ◦ to 355 ◦ in steps of 5 ◦ and at source-to-microphone distances of 0.5, 1.25, and 5 m. The resulting dataset is suitable for perceptual and objective studies related to the incident-angle–dependent response of microphones and for the development of tools for predicting and emulating on-axis and off-axis microphone characteristics. The captured IRs allow generation of frequency response plots with a degree of detail not commonly available in manufacturer-supplied data sheets and are also particularly well-suited to harmonic distortion analysis.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42003071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 2

Aluminum-Based Push-Pull Electrostatic MEMS Transducer for Earphones 耳机用铝基推挽式静电MEMS传感器

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-11-02 DOI: 10.17743/jaes.2022.0035

Aviad Zamir, G. Seiden, H. Kupershmidt

引用次数: 0

Temporal Trends in the Practice Pattern for Sleep-Disordered Breathing in Patients With Cardiovascular Diseases in Japan　- Insights From the Japanese Registry of All Cardiac and Vascular Diseases - Diagnosis Procedure Combination. 日本心血管疾病患者睡眠呼吸障碍诊疗模式的时间趋势--来自日本所有心脏和血管疾病登记处的启示--诊断程序组合。

IF 3.1 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-08-25 Epub Date: 2022-04-27 DOI: 10.1253/circj.CJ-22-0082

Ryohei Takeishi, Akiomi Yoshihisa, Yu Hotsuki, Fumiya Anzai, Yu Sato, Yoko Sumita, Michikazu Nakai, Tomofumi Misaka, Yasuchika Takeishi

Background: After the publication of the Japanese Circulation Society guideline of sleep-disordered breathing (SDB) in 2010, with new evidence and changes to the health insurance system, trends in the practice pattern for SDB in patients with cardiovascular disease (CVD) might have changed.

Methods and results: This study evaluated the temporal changes in the practice pattern for SDB by using a nationwide claim database, the Japanese Registry of All Cardiac and Vascular Diseases - Diagnosis Procedure Combination (JROAD-DPC), from 2012 to 2019. The main findings were: (1) the number of CVD patients diagnosed with SDB increased (especially those with atrial fibrillation [AF] and heart failure [HF]); (2) the number of diagnostic tests for SDB performed during hospitalization increased for AF patients (from 1.3% in 2012 to 1.8% in 2019), whereas it decreased for other CVD patients; (3) the number of patients diagnosed with SDB increased in each type of CVD, except for patients with acute myocardial infarction (AMI); (4) continuous positive airway pressure (CPAP) treatment increased for AF patients (from 15.2% to 17.5%); (5) CPAP treatment decreased for patients with angina pectoris (AP) and AMI, and any treatment decreased for HF patients (from 46.1% to 39.7%); and (6) SDB was treated more often in HF patients than in AF, AP, and AMI patients (41.7% vs. 17.2%, 19.1% and 20.4%, respectively).

Conclusions: The practice pattern for SDB in CVD patients has changed from 2012 to 2019.

背景：2010年日本循环学会发布睡眠呼吸障碍（SDB）指南后，随着新证据的出现和医疗保险制度的变化，心血管疾病（CVD）患者SDB诊疗模式的趋势可能发生了变化：本研究利用日本所有心脏和血管疾病登记-诊断程序组合（JROAD-DPC）这一全国性索赔数据库，评估了 2012 年至 2019 年 SDB 治疗模式的时间变化。主要研究结果如下(1）确诊为 SDB 的心血管疾病患者人数增加（尤其是心房颤动 [AF] 和心力衰竭 [HF]）；（2）心房颤动患者住院期间进行的 SDB 诊断检查次数增加（从 2012 年的 1.3% 增加到 2019 年的 1.8%），而其他心血管疾病患者则有所下降；（3）除急性心肌梗死（AMI）患者外，每种心血管疾病类型中确诊为 SDB 的患者人数均有所增加；（4）心房颤动患者接受持续气道正压（CPAP）治疗的人数有所增加（从 15.2%增至17.5%）；（5）心绞痛（AP）和急性心肌梗死（AMI）患者的CPAP治疗减少，而高血压患者的任何治疗均减少（从46.1%降至39.7%）；（6）高血压患者比心房颤动、心绞痛和急性心肌梗死患者更常接受SDB治疗（分别为41.7%对17.2%、19.1%和20.4%）：结论：从2012年到2019年，心血管疾病患者SDB的治疗模式发生了变化。

{"title":"Temporal Trends in the Practice Pattern for Sleep-Disordered Breathing in Patients With Cardiovascular Diseases in Japan　- Insights From the Japanese Registry of All Cardiac and Vascular Diseases - Diagnosis Procedure Combination.","authors":"Ryohei Takeishi, Akiomi Yoshihisa, Yu Hotsuki, Fumiya Anzai, Yu Sato, Yoko Sumita, Michikazu Nakai, Tomofumi Misaka, Yasuchika Takeishi","doi":"10.1253/circj.CJ-22-0082","DOIUrl":"10.1253/circj.CJ-22-0082","url":null,"abstract":"Background: After the publication of the Japanese Circulation Society guideline of sleep-disordered breathing (SDB) in 2010, with new evidence and changes to the health insurance system, trends in the practice pattern for SDB in patients with cardiovascular disease (CVD) might have changed.Methods and results: This study evaluated the temporal changes in the practice pattern for SDB by using a nationwide claim database, the Japanese Registry of All Cardiac and Vascular Diseases - Diagnosis Procedure Combination (JROAD-DPC), from 2012 to 2019. The main findings were: (1) the number of CVD patients diagnosed with SDB increased (especially those with atrial fibrillation [AF] and heart failure [HF]); (2) the number of diagnostic tests for SDB performed during hospitalization increased for AF patients (from 1.3% in 2012 to 1.8% in 2019), whereas it decreased for other CVD patients; (3) the number of patients diagnosed with SDB increased in each type of CVD, except for patients with acute myocardial infarction (AMI); (4) continuous positive airway pressure (CPAP) treatment increased for AF patients (from 15.2% to 17.5%); (5) CPAP treatment decreased for patients with angina pectoris (AP) and AMI, and any treatment decreased for HF patients (from 46.1% to 39.7%); and (6) SDB was treated more often in HF patients than in AF, AP, and AMI patients (41.7% vs. 17.2%, 19.1% and 20.4%, respectively).Conclusions: The practice pattern for SDB in CVD patients has changed from 2012 to 2019.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":"1 1","pages":"1428-1436"},"PeriodicalIF":3.1,"publicationDate":"2022-08-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"85721214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Research on Additive Margin Softmax Speaker Recognition Based on Convolutional and Gated Recurrent Neural Networks 基于卷积和门控递归神经网络的加性余量Softmax说话人识别研究

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0018

Chaofeng Lan, Yuqiao Wang, Lei Zhang, Hongyun Zhao

引用次数: 0

Near-Field Evaluation of Reproducible Speech Sources 可复制语音源的近场评价

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0022

Raimundo Gonzalez, Thomas McKenzie, A. Politis, T. Lokki

The spatial speech reproduction capabilities of a KEMAR mouth simulator, a loudspeaker, the piston on sphere model and a circular harmonic ﬁtting are evaluated in the near-ﬁeld. The speech directivity of 24 human subjects, both male and female, is measured using a semi-circular microphone array of radius 36.5 cm in the horizontal plane. Impulse responses are captured for the two devices and ﬁlters are generated for the two numerical models to emulate their directional effect on speech reproduction. The four repeatable speech sources are evaluated through comparison to the recorded human speech both objectively, through directivity pattern and spectral magnitude differences, and subjectively, through a listening test on perceived coloration. Results show that the repeatable sources perform relatively well under the metric of directivity but irregularities in their directivity patterns introduce audible coloration for off-axis directions.

引用次数: 2

Linear-Phase Octave Graphic Equalizer 线性相位八度图形均衡器

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0014

V. Bruschi, V. Välimäki, Juho Liski, S. Cecchi

A computationally efficient octave-band graphic equalizer having a linear-phase response is introduced. The linear-phase graphic equalizer is useful in audio applications in which phase distortion is not tolerated, such as in multichannel equalization, parallel processing, phase compatibility of audio equipment, and crossover network design. The structure is based on the interpolated finite impulse response (IFIR) philosophy. The proposed octave-band graphic equalizer uses one prototype low-pass filter, which is a half-band FIR filter designed using the window method. Stretched versions of the prototype filter and its complementary high-pass filter implement all ten band filters needed. The graphic equalizer is realized in the parallel form, in which the outputs of all band filters, scaled with their individual command gain, are added to compute the equalized output signal. The command gains can be used directly as filter band gains. The number of operations needed per sample is only slightly more than that needed for the graphic equalizer based on minimum-phase recursive filters. A comparison with other implementation approaches demonstrates that the proposed structure requires 99% fewer operations than a high-order FIR filter. The proposed filter uses 39% fewer operations per sample than the fast Fourier transform–based filtering method and causes over 78% less latency.

介绍了一种计算效率高的具有线性相位响应的倍频带图形均衡器。线性相位图形均衡器在相位失真不能容忍的音频应用中非常有用，例如多通道均衡、并行处理、音频设备的相位兼容性和交叉网络设计。该结构基于插值有限脉冲响应(IFIR)原理。所提出的八倍频带图形均衡器使用了一个原型低通滤波器，该滤波器是采用窗法设计的半带FIR滤波器。原型滤波器的扩展版本及其互补高通滤波器实现了所需的所有十个频带滤波器。图形均衡器以并行形式实现，其中所有带滤波器的输出，按其单独的命令增益缩放，添加以计算均衡输出信号。命令增益可以直接用作滤波器带增益。每个样本所需的操作数量仅略多于基于最小相位递归滤波器的图形均衡器所需的操作数量。与其他实现方法的比较表明，该结构所需的操作比高阶FIR滤波器少99%。与基于傅立叶变换的快速滤波方法相比，该滤波器每个样本的运算次数减少39%，延迟减少78%以上。

{"title":"Linear-Phase Octave Graphic Equalizer","authors":"V. Bruschi, V. Välimäki, Juho Liski, S. Cecchi","doi":"10.17743/jaes.2022.0014","DOIUrl":"https://doi.org/10.17743/jaes.2022.0014","url":null,"abstract":"A computationally efficient octave-band graphic equalizer having a linear-phase response is introduced. The linear-phase graphic equalizer is useful in audio applications in which phase distortion is not tolerated, such as in multichannel equalization, parallel processing, phase compatibility of audio equipment, and crossover network design. The structure is based on the interpolated finite impulse response (IFIR) philosophy. The proposed octave-band graphic equalizer uses one prototype low-pass filter, which is a half-band FIR filter designed using the window method. Stretched versions of the prototype filter and its complementary high-pass filter implement all ten band filters needed. The graphic equalizer is realized in the parallel form, in which the outputs of all band filters, scaled with their individual command gain, are added to compute the equalized output signal. The command gains can be used directly as filter band gains. The number of operations needed per sample is only slightly more than that needed for the graphic equalizer based on minimum-phase recursive filters. A comparison with other implementation approaches demonstrates that the proposed structure requires 99% fewer operations than a high-order FIR filter. The proposed filter uses 39% fewer operations per sample than the fast Fourier transform–based filtering method and causes over 78% less latency.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"42701641","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

Real-Time Transient Reduction in Higher-Order Time-Varying Musical Filters 高阶时变音乐滤波器的实时瞬态抑制

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0015

Nikhil Deshpande, Russell Wedelich

引用次数: 0

Resynthesis of Spatial Room Impulse Response Tails With Anisotropic Multi-Slope Decays 具有各向异性多斜率衰减的空间房间脉冲响应尾的再合成

IF 1.4 4区工程技术 Q3 ACOUSTICS

Journal of the Audio Engineering Society

Pub Date : 2022-07-25 DOI: 10.17743/jaes.2022.0017

C. Hold, Thomas McKenzie, Georg Götz, Sebastian J. Schlecht, V. Pulkki

Spatial room impulse responses (SRIRs) capture room acoustics with directional information. SRIRs measured in coupled rooms and spaces with non-uniform absorption distribution may exhibit anisotropic reverberation decays and multiple decay slopes. However, noisy measurements with low signal-to-noise ratios pose issues in analysis and reproduction in practice. This paper presents a method for resynthesis of the late decay of anisotropic SRIRs, effectively removing noise from SRIR measurements. The method accounts for both multi-slope decays and directional reverberation. A spherical filter bank extracts directionally constrained signals from Ambisonic input, which are then analyzed and parameterized in terms of multiple exponential decays and a noise floor. The noisy late reverberation is then resynthesized from the estimated parameters using modal synthesis, and the restored SRIR is reconstructed as Ambisonic signals. The method is evaluated both numerically and perceptually, which shows that SRIRs can be denoised with minimal error as long as parts of the decay slope are above the noise level, with signal-to-noise ratios as low as 40 dB in the presented experiment. The method can be used to increase the perceived spatial audio quality of noise-impaired SRIRs.

空间房间脉冲响应（SRIR）利用方向信息捕捉房间声学。在具有非均匀吸收分布的耦合房间和空间中测量的SRIR可能表现出各向异性混响衰减和多个衰减斜率。然而，具有低信噪比的噪声测量在实践中的分析和再现中提出了问题。本文提出了一种重新合成各向异性SRIR延迟衰减的方法，有效地去除了SRIR测量中的噪声。该方法同时考虑了多斜率衰减和定向混响。球形滤波器组从Ambisonic输入中提取定向约束信号，然后根据多个指数衰减和本底噪声对这些信号进行分析和参数化。然后使用模态合成从估计的参数中重新合成噪声后期混响，并将恢复的SRIR重建为Ambisonic信号。对该方法进行了数值和感知评估，表明只要部分衰减斜率高于噪声水平，SRIR就可以以最小的误差去噪，在所提出的实验中，信噪比低至40dB。该方法可用于提高噪声受损SRIR的感知空间音频质量。

{"title":"Resynthesis of Spatial Room Impulse Response Tails With Anisotropic Multi-Slope Decays","authors":"C. Hold, Thomas McKenzie, Georg Götz, Sebastian J. Schlecht, V. Pulkki","doi":"10.17743/jaes.2022.0017","DOIUrl":"https://doi.org/10.17743/jaes.2022.0017","url":null,"abstract":"Spatial room impulse responses (SRIRs) capture room acoustics with directional information. SRIRs measured in coupled rooms and spaces with non-uniform absorption distribution may exhibit anisotropic reverberation decays and multiple decay slopes. However, noisy measurements with low signal-to-noise ratios pose issues in analysis and reproduction in practice. This paper presents a method for resynthesis of the late decay of anisotropic SRIRs, effectively removing noise from SRIR measurements. The method accounts for both multi-slope decays and directional reverberation. A spherical filter bank extracts directionally constrained signals from Ambisonic input, which are then analyzed and parameterized in terms of multiple exponential decays and a noise floor. The noisy late reverberation is then resynthesized from the estimated parameters using modal synthesis, and the restored SRIR is reconstructed as Ambisonic signals. The method is evaluated both numerically and perceptually, which shows that SRIRs can be denoised with minimal error as long as parts of the decay slope are above the noise level, with signal-to-noise ratios as low as 40 dB in the presented experiment. The method can be used to increase the perceived spatial audio quality of noise-impaired SRIRs.","PeriodicalId":50008,"journal":{"name":"Journal of the Audio Engineering Society","volume":" ","pages":""},"PeriodicalIF":1.4,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45070778","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 5

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Journal of the Audio Engineering Society

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀