Effects of spectral peaks and notches in head-related transfer function on median plane sound localization with dynamic binaural playback

IF 3.4 2区 物理与天体物理 Q1 ACOUSTICS Applied Acoustics Pub Date : 2024-11-21 DOI:10.1016/j.apacoust.2024.110424
Yunan Wang , Dingding Yao , Zhi Zhou , Daocheng Chen , Wenquan Feng , Junfeng Li
{"title":"Effects of spectral peaks and notches in head-related transfer function on median plane sound localization with dynamic binaural playback","authors":"Yunan Wang ,&nbsp;Dingding Yao ,&nbsp;Zhi Zhou ,&nbsp;Daocheng Chen ,&nbsp;Wenquan Feng ,&nbsp;Junfeng Li","doi":"10.1016/j.apacoust.2024.110424","DOIUrl":null,"url":null,"abstract":"<div><div>Spectral peaks and notches in the head-related transfer function (HRTF) are considered pivotal for elevation perception in virtual auditory displays (VAD), especially during static binaural signal playback. However, studies on dynamic binaural signal playback in VAD have shown that the auditory system can still utilize dynamic cues for elevation localization, even when these high-frequency spectral components are missing, although this may compromise localization accuracy. This study investigated the effects of spectral peaks and notches in dynamic playback, examining how distorting these features and their contrasts at various levels (33%, 66%, and 100% removal) influenced elevation localization along different rotational axes (yaw and pitch rotation). The results revealed that at the same distortion level, the impact of these features on median plane localization decreased sequentially from spectral contrast, to peaks, to notches. At a distortion level of 33%, notch removal enabled dynamic playback results that were not significantly different from control conditions. As distortion levels increased to 66% and 100%, localization performance progressively deteriorated, including increased localization errors and up-down confusion with head yaw rotation as well as front-back and up-down confusion with head pitch rotation. Simultaneously, localization performance with head pitch rotation exhibited poorer performance compared to yaw rotation, particularly in cases involving peak removal and contrast compression. The experimental results further revealed that auditory elevation localization benefits from multiple localization cues generated by head movements, including dynamic spectral cues produced during large head rotations when all spectral cues are available or distorted at a level of 33%.</div></div>","PeriodicalId":55506,"journal":{"name":"Applied Acoustics","volume":"230 ","pages":"Article 110424"},"PeriodicalIF":3.4000,"publicationDate":"2024-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Acoustics","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0003682X24005759","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Spectral peaks and notches in the head-related transfer function (HRTF) are considered pivotal for elevation perception in virtual auditory displays (VAD), especially during static binaural signal playback. However, studies on dynamic binaural signal playback in VAD have shown that the auditory system can still utilize dynamic cues for elevation localization, even when these high-frequency spectral components are missing, although this may compromise localization accuracy. This study investigated the effects of spectral peaks and notches in dynamic playback, examining how distorting these features and their contrasts at various levels (33%, 66%, and 100% removal) influenced elevation localization along different rotational axes (yaw and pitch rotation). The results revealed that at the same distortion level, the impact of these features on median plane localization decreased sequentially from spectral contrast, to peaks, to notches. At a distortion level of 33%, notch removal enabled dynamic playback results that were not significantly different from control conditions. As distortion levels increased to 66% and 100%, localization performance progressively deteriorated, including increased localization errors and up-down confusion with head yaw rotation as well as front-back and up-down confusion with head pitch rotation. Simultaneously, localization performance with head pitch rotation exhibited poorer performance compared to yaw rotation, particularly in cases involving peak removal and contrast compression. The experimental results further revealed that auditory elevation localization benefits from multiple localization cues generated by head movements, including dynamic spectral cues produced during large head rotations when all spectral cues are available or distorted at a level of 33%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
头部相关传递函数中的频谱峰值和凹槽对动态双耳重放中值平面声音定位的影响
头部相关传递函数(HRTF)中的频谱峰值和凹槽被认为是虚拟听觉显示(VAD)中海拔感知的关键,尤其是在静态双耳信号播放时。然而,对 VAD 中动态双耳信号回放的研究表明,即使缺少这些高频频谱成分,听觉系统仍可利用动态线索进行海拔定位,尽管这可能会影响定位的准确性。本研究调查了动态回放中频谱峰值和凹槽的影响,研究了在不同程度上(33%、66% 和 100%)扭曲这些特征及其对比度对沿不同旋转轴(偏航和俯仰旋转)进行海拔定位的影响。结果显示,在相同的失真度下,这些特征对中值平面定位的影响从光谱对比度、峰值到凹槽依次减小。在失真度为 33% 的情况下,去除凹槽后的动态回放结果与对照条件下的结果没有显著差异。当失真度增加到 66% 和 100% 时,定位性能逐渐下降,包括定位误差增加、头部偏航旋转时上下混淆以及头部俯仰旋转时前后和上下混淆。同时,与偏航旋转相比,头部俯仰旋转时的定位性能更差,尤其是在峰值移除和对比度压缩的情况下。实验结果进一步表明,听觉高程定位可从头部运动产生的多种定位线索中获益,包括头部大幅旋转时产生的动态频谱线索,此时所有频谱线索均可用或失真度为 33%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Applied Acoustics
Applied Acoustics 物理-声学
CiteScore
7.40
自引率
11.80%
发文量
618
审稿时长
7.5 months
期刊介绍: Since its launch in 1968, Applied Acoustics has been publishing high quality research papers providing state-of-the-art coverage of research findings for engineers and scientists involved in applications of acoustics in the widest sense. Applied Acoustics looks not only at recent developments in the understanding of acoustics but also at ways of exploiting that understanding. The Journal aims to encourage the exchange of practical experience through publication and in so doing creates a fund of technological information that can be used for solving related problems. The presentation of information in graphical or tabular form is especially encouraged. If a report of a mathematical development is a necessary part of a paper it is important to ensure that it is there only as an integral part of a practical solution to a problem and is supported by data. Applied Acoustics encourages the exchange of practical experience in the following ways: • Complete Papers • Short Technical Notes • Review Articles; and thereby provides a wealth of technological information that can be used to solve related problems. Manuscripts that address all fields of applications of acoustics ranging from medicine and NDT to the environment and buildings are welcome.
期刊最新文献
Development of a code-switched Hindi-Marathi dataset and transformer-based architecture for enhanced speech recognition using dynamic switching algorithms Eco-design of airborne sound insulation in Recycled Lightweight Concrete walls for Brazilian social housing: A reliability-based approach Does loudspeaker directivity really influence the reconstructed indoor temperature quality using Acoustic travel-time TOMography? A new deep learning forward BSS (D-FBSS) algorithm for acoustic noise reduction and speech enhancement Source depth classification in shallow sea negative thermocline waveguide with small aperture vertical arrays
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1