Hearing in categories and speech perception at the "cocktail party".

IF 2.8 3区 综合性期刊 Q1 MULTIDISCIPLINARY SCIENCES PLoS ONE Pub Date : 2025-01-30 eCollection Date: 2025-01-01 DOI:10.1371/journal.pone.0318600
Gavin M Bidelman, Fallon Bernard, Kimberly Skubic
{"title":"Hearing in categories and speech perception at the \"cocktail party\".","authors":"Gavin M Bidelman, Fallon Bernard, Kimberly Skubic","doi":"10.1371/journal.pone.0318600","DOIUrl":null,"url":null,"abstract":"<p><p>We aimed to test whether hearing speech in phonetic categories (as opposed to a continuous/gradient fashion) affords benefits to \"cocktail party\" speech perception. We measured speech perception performance (recognition, localization, and source monitoring) in a simulated 3D cocktail party environment. We manipulated task difficulty by varying the number of additional maskers presented at other spatial locations in the horizontal soundfield (1-4 talkers) and via forward vs. time-reversed maskers, the latter promoting a release from masking. In separate tasks, we measured isolated phoneme categorization using two-alternative forced choice (2AFC) and visual analog scaling (VAS) tasks designed to promote more/less categorical hearing and thus test putative links between categorization and real-world speech-in-noise skills. We first show cocktail party speech recognition accuracy and speed decline with additional competing talkers and amidst forward compared to reverse maskers. Dividing listeners into \"discrete\" vs. \"continuous\" categorizers based on their VAS labeling (i.e., whether responses were binary or continuous judgments), we then show the degree of release from masking experienced at the cocktail party is predicted by their degree of categoricity in phoneme labeling and not high-frequency audiometric thresholds; more discrete listeners make less effective use of time-reversal and show less release from masking than their gradient responding peers. Our results suggest a link between speech categorization skills and cocktail party processing, with a gradient (rather than discrete) listening strategy benefiting degraded speech perception. These findings suggest that less flexibility in binning sounds into categories may be one factor that contributes to figure-ground deficits.</p>","PeriodicalId":20189,"journal":{"name":"PLoS ONE","volume":"20 1","pages":"e0318600"},"PeriodicalIF":2.8000,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11781644/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS ONE","FirstCategoryId":"103","ListUrlMain":"https://doi.org/10.1371/journal.pone.0318600","RegionNum":3,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"MULTIDISCIPLINARY SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

We aimed to test whether hearing speech in phonetic categories (as opposed to a continuous/gradient fashion) affords benefits to "cocktail party" speech perception. We measured speech perception performance (recognition, localization, and source monitoring) in a simulated 3D cocktail party environment. We manipulated task difficulty by varying the number of additional maskers presented at other spatial locations in the horizontal soundfield (1-4 talkers) and via forward vs. time-reversed maskers, the latter promoting a release from masking. In separate tasks, we measured isolated phoneme categorization using two-alternative forced choice (2AFC) and visual analog scaling (VAS) tasks designed to promote more/less categorical hearing and thus test putative links between categorization and real-world speech-in-noise skills. We first show cocktail party speech recognition accuracy and speed decline with additional competing talkers and amidst forward compared to reverse maskers. Dividing listeners into "discrete" vs. "continuous" categorizers based on their VAS labeling (i.e., whether responses were binary or continuous judgments), we then show the degree of release from masking experienced at the cocktail party is predicted by their degree of categoricity in phoneme labeling and not high-frequency audiometric thresholds; more discrete listeners make less effective use of time-reversal and show less release from masking than their gradient responding peers. Our results suggest a link between speech categorization skills and cocktail party processing, with a gradient (rather than discrete) listening strategy benefiting degraded speech perception. These findings suggest that less flexibility in binning sounds into categories may be one factor that contributes to figure-ground deficits.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
分类听力与“鸡尾酒会”上的言语感知。
我们的目的是测试语音分类(而不是连续/梯度方式)是否对“鸡尾酒会”语音感知有好处。我们在模拟的3D鸡尾酒会环境中测量了语音感知性能(识别、定位和源监控)。我们通过改变在水平声场(1-4个说话者)的其他空间位置出现的额外掩蔽物的数量,以及通过正向掩蔽物和时间反转掩蔽物来控制任务难度,后者促进掩蔽的释放。在单独的任务中,我们使用两种选择强迫选择(2AFC)和视觉模拟尺度(VAS)任务来测量孤立音素分类,这些任务旨在促进更多/更少的分类听力,从而测试分类与现实世界语音噪音技能之间的假定联系。我们首先展示了鸡尾酒会语音识别的准确性和速度下降与额外的竞争说话者和在向前与反向掩蔽。将听众分成“离散”vs。“连续”分类器基于他们的VAS标签(即,无论反应是二元判断还是连续判断),我们然后显示,在鸡尾酒会上经历的掩蔽释放程度是由他们在音素标签上的分类程度预测的,而不是高频听力阈值;与梯度响应的监听器相比,更离散的监听器使用时间反转的效率更低,并且从掩蔽中释放的释放更少。我们的研究结果表明,语音分类技能和鸡尾酒会处理之间存在联系,梯度(而不是离散)倾听策略有利于退化的语音感知。这些发现表明,将声音分类的灵活性较差可能是导致图形背景缺陷的一个因素。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
PLoS ONE
PLoS ONE 生物-生物学
CiteScore
6.20
自引率
5.40%
发文量
14242
审稿时长
3.7 months
期刊介绍: PLOS ONE is an international, peer-reviewed, open-access, online publication. PLOS ONE welcomes reports on primary research from any scientific discipline. It provides: * Open-access—freely accessible online, authors retain copyright * Fast publication times * Peer review by expert, practicing researchers * Post-publication tools to indicate quality and impact * Community-based dialogue on articles * Worldwide media coverage
期刊最新文献
Biogenic Silver-Selenium nanocomposite with anticancer activity and potent efficacy against vancomycin-resistant Staphylococcus aureus. Morpho-biochemical diversity and phytochemical profiling of Rubus fruticosus L. landraces. Calculation method and predictive analysis of flexural capacity of reinforced concrete beams strengthened with carbon fiber-reinforced polymer sheets applied to the side surfaces. Cirrhosis outcomes on rurality and weekend admissions revisited: A contemporary analysis of the national inpatient sample. COMBINA: The EAAD four-level approach for depression and suicide prevention and wellbeing promotion in the community and vulnerable populations. A cross-country study protocol from the MENTBEST project.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1