Release from same-talker speech-in-speech masking: Effects of masker intelligibility and other contributing factorsa).

IF 2.1 2区 物理与天体物理 Q2 ACOUSTICS Journal of the Acoustical Society of America Pub Date : 2024-11-01 DOI:10.1121/10.0034235
Mingyue Huo, Yinglun Sun, Daniel Fogerty, Yan Tang
{"title":"Release from same-talker speech-in-speech masking: Effects of masker intelligibility and other contributing factorsa).","authors":"Mingyue Huo, Yinglun Sun, Daniel Fogerty, Yan Tang","doi":"10.1121/10.0034235","DOIUrl":null,"url":null,"abstract":"<p><p>Human speech perception declines in the presence of masking speech, particularly when the masker is intelligible and acoustically similar to the target. A prior investigation demonstrated a substantial reduction in masking when the intelligibility of competing speech was reduced by corrupting voiced segments with noise [Huo, Sun, Fogerty, and Tang (2023), \"Quantifying informational masking due to masker intelligibility in same-talker speech-in-speech perception,\" in Interspeech 2023, pp. 1783-1787]. As this processing also reduced the prominence of voiced segments, it was unclear whether the unmasking was due to reduced linguistic content, acoustic similarity, or both. The current study compared the masking of original competing speech (high intelligibility) to competing speech with time reversal of voiced segments (VS-reversed, low intelligibility) at various target-to-masker ratios. Modeling results demonstrated similar energetic masking between the two maskers. However, intelligibility of the target speech was considerably better with the VS-reversed masker compared to the original masker, likely due to the reduced linguistic content. Further corrupting the masker's voiced segments resulted in additional release from masking. Acoustic analyses showed that the portion of target voiced segments overlapping with masker voiced segments and the similarity between target and masker overlapped voiced segments impacted listeners' speech recognition. Evidence also suggested modulation masking in the spectro-temporal domain interferes with listeners' ability to glimpse the target.</p>","PeriodicalId":17168,"journal":{"name":"Journal of the Acoustical Society of America","volume":"156 5","pages":"2960-2973"},"PeriodicalIF":2.1000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Acoustical Society of America","FirstCategoryId":"101","ListUrlMain":"https://doi.org/10.1121/10.0034235","RegionNum":2,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Human speech perception declines in the presence of masking speech, particularly when the masker is intelligible and acoustically similar to the target. A prior investigation demonstrated a substantial reduction in masking when the intelligibility of competing speech was reduced by corrupting voiced segments with noise [Huo, Sun, Fogerty, and Tang (2023), "Quantifying informational masking due to masker intelligibility in same-talker speech-in-speech perception," in Interspeech 2023, pp. 1783-1787]. As this processing also reduced the prominence of voiced segments, it was unclear whether the unmasking was due to reduced linguistic content, acoustic similarity, or both. The current study compared the masking of original competing speech (high intelligibility) to competing speech with time reversal of voiced segments (VS-reversed, low intelligibility) at various target-to-masker ratios. Modeling results demonstrated similar energetic masking between the two maskers. However, intelligibility of the target speech was considerably better with the VS-reversed masker compared to the original masker, likely due to the reduced linguistic content. Further corrupting the masker's voiced segments resulted in additional release from masking. Acoustic analyses showed that the portion of target voiced segments overlapping with masker voiced segments and the similarity between target and masker overlapped voiced segments impacted listeners' speech recognition. Evidence also suggested modulation masking in the spectro-temporal domain interferes with listeners' ability to glimpse the target.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
从同一说话者的语中语掩蔽中解脱出来:掩蔽者可懂度和其他促成因素的影响a)。
人类的语音感知能力在有掩蔽语音的情况下会下降,尤其是当掩蔽者可懂且与目标语音在声学上相似时。之前的一项研究表明,通过用噪声破坏发声片段来降低竞争语音的可懂度时,掩蔽现象会大大减少[Huo, Sun, Fogerty, and Tang (2023),"Quantifying informational masking due to masker intelligibility in same-talker speech-in-speech perception," in Interspeech 2023, pp.1783-1787]。由于这种处理方式也降低了发声片段的突出度,因此尚不清楚解除掩蔽的原因是语言内容减少、声学相似性降低还是两者兼而有之。本研究比较了在不同的目标与掩蔽者比率下,原始竞争语音(高可懂度)与带有发声片段时间反转的竞争语音(VS-反转,低可懂度)的掩蔽情况。建模结果表明,两种掩蔽器之间的能量掩蔽相似。不过,与原始掩蔽器相比,VS 反转掩蔽器的目标语音可懂度要好得多,这可能是由于语言内容减少了。进一步破坏掩蔽者的发声片段可进一步解除掩蔽。声学分析表明,目标发声片段与掩蔽者发声片段的重叠部分以及目标发声片段与掩蔽者重叠发声片段之间的相似性会影响听者的语音识别能力。还有证据表明,在频谱-时间域的调制掩蔽会干扰听者瞥见目标的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.60
自引率
16.70%
发文量
1433
审稿时长
4.7 months
期刊介绍: Since 1929 The Journal of the Acoustical Society of America has been the leading source of theoretical and experimental research results in the broad interdisciplinary study of sound. Subject coverage includes: linear and nonlinear acoustics; aeroacoustics, underwater sound and acoustical oceanography; ultrasonics and quantum acoustics; architectural and structural acoustics and vibration; speech, music and noise; psychology and physiology of hearing; engineering acoustics, transduction; bioacoustics, animal bioacoustics.
期刊最新文献
Ducting of wave-breaking sound by the sea surface bubble layer. Soundscape perception indices (SPIs): Developing context-dependent single value scores of multidimensional soundscape perceptual qualitya). The influence of dialect loss on tone perception: Diminishing voice quality cues in preserved tone contrast. Transcranial ultrasound modeling using the spectral-element method. Noise assessment of multirotor configurations during landing proceduresa).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1