宽带静态声源人体定向定位的贝叶斯模型

IF 1 3区物理与天体物理 Q4 ACOUSTICS Acta Acustica Pub Date : 2023-01-01 DOI:10.1051/aacus/2023006

Roberto Barumerli, Piotr Majdak, Michele Geronazzo, David Meijer, Federico Avanzini, Robert Baumgartner

{"title":"宽带静态声源人体定向定位的贝叶斯模型","authors":"Roberto Barumerli, Piotr Majdak, Michele Geronazzo, David Meijer, Federico Avanzini, Robert Baumgartner","doi":"10.1051/aacus/2023006","DOIUrl":null,"url":null,"abstract":"Humans estimate sound-source directions by combining prior beliefs with sensory evidence. Prior beliefs represent statistical knowledge about the environment, and the sensory evidence consists of auditory features such as interaural disparities and monaural spectral shapes. Models of directional sound localization often impose constraints on the contribution of these features to either the horizontal or vertical dimension. Instead, we propose a Bayesian model that flexibly incorporates each feature according to its spatial precision and integrates prior beliefs in the inference process. The model estimates the direction of a single, broadband, stationary sound source presented to a static human listener in an anechoic environment. We simplified interaural features to be broadband and compared two model variants, each considering a different type of monaural spectral features: magnitude profiles and gradient profiles. Both model variants were fitted to the baseline performance of five listeners and evaluated on the effects of localizing with non-individual head-related transfer functions (HRTFs) and sounds with rippled spectrum. We found that the variant equipped with spectral gradient profiles outperformed other localization models. The proposed model appears particularly useful for the evaluation of HRTFs and may serve as a basis for future extensions towards modeling dynamic listening conditions.","PeriodicalId":48486,"journal":{"name":"Acta Acustica","volume":"4 1","pages":"0"},"PeriodicalIF":1.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Bayesian model for human directional localization of broadband static sound sources\",\"authors\":\"Roberto Barumerli, Piotr Majdak, Michele Geronazzo, David Meijer, Federico Avanzini, Robert Baumgartner\",\"doi\":\"10.1051/aacus/2023006\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Humans estimate sound-source directions by combining prior beliefs with sensory evidence. Prior beliefs represent statistical knowledge about the environment, and the sensory evidence consists of auditory features such as interaural disparities and monaural spectral shapes. Models of directional sound localization often impose constraints on the contribution of these features to either the horizontal or vertical dimension. Instead, we propose a Bayesian model that flexibly incorporates each feature according to its spatial precision and integrates prior beliefs in the inference process. The model estimates the direction of a single, broadband, stationary sound source presented to a static human listener in an anechoic environment. We simplified interaural features to be broadband and compared two model variants, each considering a different type of monaural spectral features: magnitude profiles and gradient profiles. Both model variants were fitted to the baseline performance of five listeners and evaluated on the effects of localizing with non-individual head-related transfer functions (HRTFs) and sounds with rippled spectrum. We found that the variant equipped with spectral gradient profiles outperformed other localization models. The proposed model appears particularly useful for the evaluation of HRTFs and may serve as a basis for future extensions towards modeling dynamic listening conditions.\",\"PeriodicalId\":48486,\"journal\":{\"name\":\"Acta Acustica\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":1.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Acta Acustica\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1051/aacus/2023006\",\"RegionNum\":3,\"RegionCategory\":\"物理与天体物理\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Acustica","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1051/aacus/2023006","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 1

摘要

人类通过结合先验信念和感官证据来估计声源的方向。先验信念代表了对环境的统计知识，感官证据由听觉特征组成，如耳间差异和单耳频谱形状。定向声音定位模型通常对这些特征对水平或垂直维度的贡献施加限制。相反，我们提出了一种贝叶斯模型，根据其空间精度灵活地融合每个特征，并在推理过程中集成先验信念。该模型估计了一个单一的、宽带的、固定的声源在消声环境中呈现给静态的人类听众的方向。我们将声间特征简化为宽带，并比较了两种模型变体，每种模型变体都考虑了不同类型的单声光谱特征:星等剖面和梯度剖面。这两种模型变体都拟合了五名听众的基线表现，并评估了非个体头部相关传递函数(hrtf)和波纹频谱声音的定位效果。我们发现配备光谱梯度轮廓的变体优于其他定位模型。所提出的模型似乎对hrtf的评估特别有用，并且可以作为未来扩展到动态聆听条件建模的基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

A Bayesian model for human directional localization of broadband static sound sources

Humans estimate sound-source directions by combining prior beliefs with sensory evidence. Prior beliefs represent statistical knowledge about the environment, and the sensory evidence consists of auditory features such as interaural disparities and monaural spectral shapes. Models of directional sound localization often impose constraints on the contribution of these features to either the horizontal or vertical dimension. Instead, we propose a Bayesian model that flexibly incorporates each feature according to its spatial precision and integrates prior beliefs in the inference process. The model estimates the direction of a single, broadband, stationary sound source presented to a static human listener in an anechoic environment. We simplified interaural features to be broadband and compared two model variants, each considering a different type of monaural spectral features: magnitude profiles and gradient profiles. Both model variants were fitted to the baseline performance of five listeners and evaluated on the effects of localizing with non-individual head-related transfer functions (HRTFs) and sounds with rippled spectrum. We found that the variant equipped with spectral gradient profiles outperformed other localization models. The proposed model appears particularly useful for the evaluation of HRTFs and may serve as a basis for future extensions towards modeling dynamic listening conditions.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Acta Acustica ACOUSTICS-

CiteScore

2.80

自引率

21.40%

发文量

审稿时长

12 weeks

期刊介绍： Acta Acustica, the Journal of the European Acoustics Association (EAA). After the publication of its Journal Acta Acustica from 1993 to 1995, the EAA published Acta Acustica united with Acustica from 1996 to 2019. From 2020, the EAA decided to publish a journal in full Open Access. See Article Processing charges. Acta Acustica reports on original scientific research in acoustics and on engineering applications. The journal considers review papers, scientific papers, technical and applied papers, short communications, letters to the editor. From time to time, special issues and review articles are also published. For book reviews or doctoral thesis abstracts, please contact the Editor in Chief.