The influence of informational masking in complex real-world environments

2013 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics. Pub Date: 2013-10-01. DOI: 10.1109/WASPAA.2013.6701873

Adam Westermann, J. Buchholz

Abstract

Spatial release from masking (SRM) is believed to be an essential auditory mechanism aiding listeners in reverberant multi-talker environments. However, SRM is often measured in simplified spatial configurations using speech corpora with exaggerated talker and/or context confusions. Besides energetic better-ear listening and binaural unmasking, the perceived spatial separation of target and masking speech signals is thought to aid listeners' segregation of speech signals, resulting in a so-called release from informational masking. This study aims to estimate the amount of informational masking that is apparent in complex real-world environments. Speech reception thresholds (SRTs) were measured by presenting Bamford-Kowal-Bench (BKB) sentences in a simulated cafeteria environment recreated by a spherical array of 41 loudspeakers placed in an anechoic chamber. Three maskers with varying degrees of informational masking were realized: one with talkers different from the target, one with an unintelligible noise vocoder (minimal informational masking), and one with the same talker as the target (maximum informational masking). The maskers were constructed with either two or seven two-talker conversations and were either spatially distributed in the simulated cafeteria or colocated with the target. Seven normal-hearing listeners were tested. All masker conditions showed improved thresholds in the spatialized configuration compared to the colocated configuration. However, there was no significant difference between the different-talker speech masker and the vocoded masker. Only the same-talker masker showed increased thresholds, and this increase was substantial only in the two-conversation colocated condition. These results suggest that informational masking is of low relevance in real-life listening and is exaggerated in listening tests by target/masker similarities and the colocated spatial configuration. However, this may be different in (aided) hearing-impaired listeners, for whom spectral and spatial cues can be significantly disturbed.
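
As a rough illustration of the design described in the abstract, the sketch below enumerates the three maskers, two conversation counts, and two spatial layouts, and computes SRM in the conventional way, as the colocated SRT minus the spatialized SRT. The condition labels, function names, and threshold values are all assumptions made for illustration; the numbers are placeholders and are not results from the study.

```python
from itertools import product

# Factors taken from the abstract; the string labels are illustrative names.
MASKERS = ("different_talker", "vocoded", "same_talker")
CONVERSATIONS = (2, 7)
LAYOUTS = ("colocated", "spatialized")


def design():
    """Enumerate every masker x conversation-count x layout condition."""
    return list(product(MASKERS, CONVERSATIONS, LAYOUTS))


def spatial_release(srt_db):
    """Return SRM in dB for each (masker, conversations) pair.

    srt_db maps (masker, conversations, layout) -> SRT in dB.
    SRM = colocated SRT - spatialized SRT, so a positive value means
    the threshold was lower (better) when the maskers were spread out.
    """
    return {
        (m, n): srt_db[(m, n, "colocated")] - srt_db[(m, n, "spatialized")]
        for m, n in product(MASKERS, CONVERSATIONS)
    }


if __name__ == "__main__":
    # Placeholder thresholds, NOT values from the study:
    # 0 dB for every colocated condition, -3 dB for every spatialized one.
    placeholder = {c: (0.0 if c[2] == "colocated" else -3.0) for c in design()}
    for key, srm in spatial_release(placeholder).items():
        print(key, f"{srm:+.1f} dB")
```

With real data, each listener's measured SRT would replace the placeholder entries, and the per-pair SRM values could then be compared across masker types.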
