The influence of informational masking in complex real-world environments

Adam Westermann, J. Buchholz
{"title":"The influence of informational masking in complex real-world environments","authors":"Adam Westermann, J. Buchholz","doi":"10.1109/WASPAA.2013.6701873","DOIUrl":null,"url":null,"abstract":"Spatial release from masking (SRM) is believed to be an essential auditory mechanism aiding listeners in reverberant multi-talker environments. However, SRM is often measured in simplified spatial configurations using speech corpora with exaggerated talker and/or context confusions. Besides energetic better-ear listening and binaural unmasking, the perceived spatial separation of target and masking speech signals is thought to aid listener's segregation of speech signals, resulting in a so-called release from informational masking. This study aims to estimate the amount of informational masking that is apparent in complex real-world environments. Speech reception thresholds (SRTs) were measured by presenting Bamford-Kowal-Bench (BKB) sentences in a simulated cafeteria environment recreated by a spherical array of 41 loudspeakers placed in an anechoic chamber. Three maskers with varying degree of informational masking were realized: one with talkers different from the target, one with an unintelligible noise vocoder (minimal informational masking) and one with the same talker as the target (maximum informational masking). The maskers were constructed with either two or seven two-talker conversations and were either spatially distributed in the simulated cafeteria or colocated with the target. Seven normal hearing listeners were tested. All conditions showed improved thresholds for the spatialized condition compared to the colocated condition. However there was no significant difference between the different talker speech and vocoded masker. Only the same talker masker showed increased thresholds and this was only substantial in the two conversation colocated condition. These results suggest that informational masking is of low relevance in real-life listening and is exaggerated in listening tests by target/masker similarities and the colocated spatial configuration. However, this may be different in (aided) hearing impaired listeners where spectral and spatial cues can be significantly disturbed.","PeriodicalId":341888,"journal":{"name":"2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics","volume":"117 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WASPAA.2013.6701873","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Spatial release from masking (SRM) is believed to be an essential auditory mechanism aiding listeners in reverberant multi-talker environments. However, SRM is often measured in simplified spatial configurations using speech corpora with exaggerated talker and/or context confusions. Besides energetic better-ear listening and binaural unmasking, the perceived spatial separation of target and masking speech signals is thought to aid listener's segregation of speech signals, resulting in a so-called release from informational masking. This study aims to estimate the amount of informational masking that is apparent in complex real-world environments. Speech reception thresholds (SRTs) were measured by presenting Bamford-Kowal-Bench (BKB) sentences in a simulated cafeteria environment recreated by a spherical array of 41 loudspeakers placed in an anechoic chamber. Three maskers with varying degree of informational masking were realized: one with talkers different from the target, one with an unintelligible noise vocoder (minimal informational masking) and one with the same talker as the target (maximum informational masking). The maskers were constructed with either two or seven two-talker conversations and were either spatially distributed in the simulated cafeteria or colocated with the target. Seven normal hearing listeners were tested. All conditions showed improved thresholds for the spatialized condition compared to the colocated condition. However there was no significant difference between the different talker speech and vocoded masker. Only the same talker masker showed increased thresholds and this was only substantial in the two conversation colocated condition. These results suggest that informational masking is of low relevance in real-life listening and is exaggerated in listening tests by target/masker similarities and the colocated spatial configuration. However, this may be different in (aided) hearing impaired listeners where spectral and spatial cues can be significantly disturbed.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
复杂现实环境中信息掩蔽的影响
空间掩蔽释放(SRM)被认为是混响多语环境中帮助听者的重要听觉机制。然而,SRM通常是在简化的空间配置中测量的,使用的是带有夸张说话者和/或上下文混淆的语音语料库。除了精力充沛的好耳聆听和双耳揭开外,目标和掩蔽语音信号的感知空间分离被认为有助于听者对语音信号的分离,从而导致所谓的从信息掩蔽中释放。本研究旨在估计在复杂的现实世界环境中明显的信息掩蔽量。语音接收阈值(srt)是通过在一个模拟自助餐厅环境中呈现Bamford-Kowal-Bench (BKB)句子来测量的,该环境由放置在消声室中的41个球形扬声器阵列重建。实现了三种不同程度信息掩蔽的掩蔽器:一种是与目标不同的说话者,一种是不可理解的噪声声码器(最小信息掩蔽),一种是与目标相同的说话者(最大信息掩蔽)。面具由两个或七个双说话者的对话组成,并在空间上分布在模拟的自助餐厅中,或者与目标重合。对7名听力正常的听众进行了测试。所有条件下,空间化条件的阈值都比并置条件有所提高。然而,不同说话者的语音和语音编码掩模之间没有显著差异。只有相同的谈话掩蔽器显示出阈值的增加,这只有在两个谈话同时发生的情况下才明显。这些结果表明,信息掩蔽在现实听力中的相关性较低,并且在听力测试中被目标/掩蔽物的相似性和空间配置所夸大。然而,在(辅助)听力受损的听众中,频谱和空间线索可能会受到严重干扰,这可能会有所不同。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Using articulation index band correlations to objectively estimate speech intelligibility consistent with the modified rhyme test Roomprints for forensic audio applications The geometry of sound-source localization using non-coplanar microphone arrays Sparse representation and epoch estimation of voiced speech Spotforming using distributed microphone arrays
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1