首页 > 最新文献

Trends in Hearing最新文献

英文 中文
Note on the Dual-Task Paradigm and its Use to Measure Listening Effort. 双任务范式及其用于测量听力努力程度的说明。
IF 2.6 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241292215
Stefanie E Kuchinsky, Frederick J Gallun, Adrian K C Lee

People regularly communicate in complex environments, requiring them to flexibly shift their attention across multiple sources of sensory information. Increasing recruitment of the executive functions that support successful speech comprehension in these multitasking settings is thought to contribute to the sense of effort that listeners often experience. One common research method employed to quantify listening effort is the dual-task paradigm in which individuals recognize speech and concurrently perform a secondary (often visual) task. Effort is operationalized as performance decrements on the secondary task as speech processing demands increase. However, recent reviews have noted critical inconsistencies in the results of dual-task experiments, likely in part due to how and when the two tasks place demands on a common set of mental resources and how flexibly individuals can allocate their attention to them. We propose that in order to move forward to address this gap, we need to first look backward: better integrating theoretical models of resource capacity and allocation as well as of task-switching that have been historically developed in domains outside of hearing research (viz., cognitive psychology and neuroscience). With this context in mind, we describe how dual-task experiments could be designed and interpreted such that they provide better and more robust insights into the mechanisms that contribute to effortful listening.

人们经常在复杂的环境中进行交流,这就要求他们在多种感官信息源之间灵活转换注意力。在这些多任务环境中,支持成功理解语音的执行功能被认为是造成听者经常感到费力的原因。量化听力努力程度的一种常用研究方法是双任务范式,即个人在识别语音的同时执行一项次要任务(通常是视觉任务)。随着语音处理需求的增加,听力强度会随着次要任务的完成而降低。然而,最近的评论指出,双任务实验的结果存在严重的不一致性,部分原因可能是这两项任务如何以及何时对一组共同的心理资源提出要求,以及个体如何灵活地将注意力分配给它们。我们建议,为了弥补这一差距,我们首先需要向后看:更好地整合资源能力和分配以及任务切换的理论模型,这些模型历来是在听力研究以外的领域(即认知心理学和神经科学)发展起来的。有鉴于此,我们将介绍如何设计和解释双任务实验,使其能更好、更有力地揭示有助于努力倾听的机制。
{"title":"Note on the Dual-Task Paradigm and its Use to Measure Listening Effort.","authors":"Stefanie E Kuchinsky, Frederick J Gallun, Adrian K C Lee","doi":"10.1177/23312165241292215","DOIUrl":"10.1177/23312165241292215","url":null,"abstract":"<p><p>People regularly communicate in complex environments, requiring them to flexibly shift their attention across multiple sources of sensory information. Increasing recruitment of the executive functions that support successful speech comprehension in these multitasking settings is thought to contribute to the sense of effort that listeners often experience. One common research method employed to quantify listening effort is the dual-task paradigm in which individuals recognize speech and concurrently perform a secondary (often visual) task. Effort is operationalized as performance decrements on the secondary task as speech processing demands increase. However, recent reviews have noted critical inconsistencies in the results of dual-task experiments, likely in part due to how and when the two tasks place demands on a common set of mental resources and how flexibly individuals can allocate their attention to them. We propose that in order to move forward to address this gap, we need to first look backward: better integrating theoretical models of resource capacity and allocation as well as of task-switching that have been historically developed in domains outside of hearing research (viz., cognitive psychology and neuroscience). With this context in mind, we describe how dual-task experiments could be designed and interpreted such that they provide better and more robust insights into the mechanisms that contribute to effortful listening.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241292215"},"PeriodicalIF":2.6,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11626669/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142548411","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Hearing Aid Amplification Improves Postural Control for Older Adults With Hearing Loss When Other Sensory Cues Are Impoverished. 当其他感官线索贫乏时,助听器放大可改善听力损失老年人的姿势控制。
IF 2.7 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241232219
L Behtani, D Paromov, K Moïn-Darbari, M S Houde, B A Bacon, M Maheu, T Leroux, F Champoux

Recent studies suggest that sound amplification via hearing aids can improve postural control in adults with hearing impairments. Unfortunately, only a few studies used well-defined posturography measures to assess balance in adults with hearing loss with and without their hearing aids. Of these, only two examined postural control specifically in the elderly with hearing loss. The present study examined the impact of hearing aid use on postural control during various sensory perturbations in older adults with age-related hearing loss. Thirty individuals with age-related hearing impairments and using hearing aids bilaterally were tested. Participants were asked to perform a modified clinical sensory integration in balance test on a force platform with and without hearing aids. The experiment was conducted in the presence of a broadband noise ranging from 0.1 to 4 kHz presented through a loudspeaker. As expected, hearing aid use had a beneficial impact on postural control, but only when visual and somatosensory inputs were both reduced. Data also suggest that hearing aid use decreases the dependence on somatosensory input for maintaining postural control. This finding can be of particular importance in older adults considering the reduction of tactile and proprioceptive sensitivity and acuity often associated with aging. These results provide an additional argument for encouraging early hearing aid fitting for people with hearing loss.

最近的研究表明,通过助听器放大声音可以改善听力受损成年人的姿势控制。遗憾的是,只有少数研究使用了定义明确的体位测量法来评估有听力损失的成年人在佩戴和未佩戴助听器时的平衡能力。在这些研究中,只有两项专门研究了听力损失老年人的体位控制。本研究考察了助听器的使用对患有老年性听力损失的老年人在各种感觉干扰时的姿势控制的影响。研究人员对 30 名患有老年性听力障碍并双侧使用助听器的人进行了测试。受试者被要求在有助听器和没有助听器的情况下,在力平台上进行改良的临床感觉统合平衡测试。实验是在通过扬声器播放 0.1 至 4 kHz 宽带噪音的情况下进行的。正如预期的那样,使用助听器对姿势控制有好处,但只有当视觉和体感输入都减少时才会产生这种效果。数据还表明,使用助听器可降低维持姿势控制对体感输入的依赖。考虑到触觉和本体感觉的敏感度和敏锐度通常会随着年龄的增长而降低,这一发现对老年人尤为重要。这些结果为鼓励听力损失患者尽早验配助听器提供了新的论据。
{"title":"Hearing Aid Amplification Improves Postural Control for Older Adults With Hearing Loss When Other Sensory Cues Are Impoverished.","authors":"L Behtani, D Paromov, K Moïn-Darbari, M S Houde, B A Bacon, M Maheu, T Leroux, F Champoux","doi":"10.1177/23312165241232219","DOIUrl":"10.1177/23312165241232219","url":null,"abstract":"<p><p>Recent studies suggest that sound amplification via hearing aids can improve postural control in adults with hearing impairments. Unfortunately, only a few studies used well-defined posturography measures to assess balance in adults with hearing loss with and without their hearing aids. Of these, only two examined postural control specifically in the elderly with hearing loss. The present study examined the impact of hearing aid use on postural control during various sensory perturbations in older adults with age-related hearing loss. Thirty individuals with age-related hearing impairments and using hearing aids bilaterally were tested. Participants were asked to perform a modified clinical sensory integration in balance test on a force platform with and without hearing aids. The experiment was conducted in the presence of a broadband noise ranging from 0.1 to 4 kHz presented through a loudspeaker. As expected, hearing aid use had a beneficial impact on postural control, but only when visual and somatosensory inputs were both reduced. Data also suggest that hearing aid use decreases the dependence on somatosensory input for maintaining postural control. This finding can be of particular importance in older adults considering the reduction of tactile and proprioceptive sensitivity and acuity often associated with aging. These results provide an additional argument for encouraging early hearing aid fitting for people with hearing loss.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241232219"},"PeriodicalIF":2.7,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10868491/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139736482","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Relationships Between Speech, Spatial and Qualities of Hearing Short Form SSQ12 Item Scores and their Use in Guiding Rehabilitation for Cochlear Implant Recipients. 言语、空间和听力质量短表 SSQ12 项目得分之间的关系及其在指导人工耳蜗植入者康复中的应用。
IF 2.7 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165231224643
Dianne J Mecklenburg, Petra L Graham, Chris J James

Cochlear implantation successfully improves hearing in most adult recipients. However, in rare cases, post-implant rehabilitation is required to maximize benefit. The primary aim of this investigation was to test if self-reports by cochlear implant users indicate the need for post-implant rehabilitation. Listening performance was assessed with the Speech, Spatial and Qualities short-form SSQ12, which was self-administered via a web-based survey. Subjects included over 2000 adult bilateral or unilateral cochlear implant users with at least one year of experience. A novel application of regression tree analysis identified core SSQ12 items that serve as first steps in establishing a plan for further rehabilitation: items 1, 8, and 11 dealing with single-talker situations, loudness perception, and clarity, respectively. Further regression and classification tree analyses revealed that SSQ12 item scores were weakly related to age, degree of tinnitus, and use of bilateral versus unilateral implants. Conversely, SSQ12 scores were strongly associated with self-rated satisfaction and confidence in using their cochlear implant. The SSQ12 total scores did not vary significantly over 1-9 or more years' experience. These findings suggest that the SSQ12 may be a useful tool to guide rehabilitation at any time after cochlear implantation. Identification of poor performance may have implications for timely management to improve the outcomes, through various techniques such as device fitting adjustments, counseling, active sound exposure, and training spatial hearing.

人工耳蜗植入术能成功改善大多数成年受术者的听力。然而,在极少数情况下,需要进行植入后康复训练才能最大限度地获益。这项调查的主要目的是测试人工耳蜗用户的自我报告是否表明需要进行植入后康复。听力表现通过言语、空间和素质短式 SSQ12 进行评估,SSQ12 通过基于网络的调查进行自我管理。调查对象包括 2000 多名使用人工耳蜗至少一年的成年双侧或单侧用户。通过回归树分析的新颖应用,确定了 SSQ12 的核心项目,这些项目是制定进一步康复计划的第一步:项目 1、8 和 11 分别涉及单声道情况、响度感知和清晰度。进一步的回归和分类树分析表明,SSQ12 项目得分与年龄、耳鸣程度以及使用双侧与单侧植入体的关系不大。相反,SSQ12 分数与使用人工耳蜗的自评满意度和信心密切相关。SSQ12 总分在 1-9 年或更长时间内没有显著变化。这些研究结果表明,SSQ12 可能是指导人工耳蜗植入术后任何时间康复的有用工具。通过各种技术,如设备安装调整、咨询、主动声音暴露和空间听力训练等,识别不良表现可能对及时处理以改善结果有影响。
{"title":"Relationships Between Speech, Spatial and Qualities of Hearing Short Form SSQ12 Item Scores and their Use in Guiding Rehabilitation for Cochlear Implant Recipients.","authors":"Dianne J Mecklenburg, Petra L Graham, Chris J James","doi":"10.1177/23312165231224643","DOIUrl":"10.1177/23312165231224643","url":null,"abstract":"<p><p>Cochlear implantation successfully improves hearing in most adult recipients. However, in rare cases, post-implant rehabilitation is required to maximize benefit. The primary aim of this investigation was to test if self-reports by cochlear implant users indicate the need for post-implant rehabilitation. Listening performance was assessed with the Speech, Spatial and Qualities short-form SSQ12, which was self-administered via a web-based survey. Subjects included over 2000 adult bilateral or unilateral cochlear implant users with at least one year of experience. A novel application of regression tree analysis identified core SSQ12 items that serve as first steps in establishing a plan for further rehabilitation: items 1, 8, and 11 dealing with single-talker situations, loudness perception, and clarity, respectively. Further regression and classification tree analyses revealed that SSQ12 item scores were weakly related to age, degree of tinnitus, and use of bilateral versus unilateral implants. Conversely, SSQ12 scores were strongly associated with self-rated satisfaction and confidence in using their cochlear implant. The SSQ12 total scores did not vary significantly over 1-9 or more years' experience. These findings suggest that the SSQ12 may be a useful tool to guide rehabilitation at any time after cochlear implantation. Identification of poor performance may have implications for timely management to improve the outcomes, through various techniques such as device fitting adjustments, counseling, active sound exposure, and training spatial hearing.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165231224643"},"PeriodicalIF":2.7,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10874150/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139742321","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Amplitude Compression for Preventing Rollover at Above-Conversational Speech Levels. 压缩振幅,防止在高于对话语音水平时出现翻转。
IF 2.7 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165231224597
Michal Fereczkowski, Raul H Sanchez-Lopez, Stine Christiansen, Tobias Neher

Hearing aids provide nonlinear amplification to improve speech audibility and loudness perception. While more audibility typically increases speech intelligibility at low levels, the same is not true for above-conversational levels, where decreases in intelligibility ("rollover") can occur. In a previous study, we found rollover in speech intelligibility measurements made in quiet for 35 out of 74 test ears with a hearing loss. Furthermore, we found rollover occurrence in quiet to be associated with poorer speech intelligibility in noise as measured with linear amplification. Here, we retested 16 participants with rollover with three amplitude-compression settings. Two were designed to prevent rollover by applying slow- or fast-acting compression with a 5:1 compression ratio around the "sweet spot," that is, the area in an individual performance-intensity function with high intelligibility and listening comfort. The third, reference setting used gains and compression ratios prescribed by the "National Acoustic Laboratories Non-Linear 1" rule. Speech intelligibility was assessed in quiet and in noise. Pairwise preference judgments were also collected. For speech levels of 70 dB SPL and above, slow-acting sweet-spot compression gave better intelligibility in quiet and noise than the reference setting. Additionally, the participants clearly preferred slow-acting sweet-spot compression over the other settings. At lower levels, the three settings gave comparable speech intelligibility, and the participants preferred the reference setting over both sweet-spot settings. Overall, these results suggest that, for listeners with rollover, slow-acting sweet-spot compression is beneficial at 70 dB SPL and above, while at lower levels clinically established gain targets are more suited.

助听器提供非线性放大,以改善语言可听性和响度感知。虽然更高的可听度通常会提高低声级的言语清晰度,但高于对话声级的言语清晰度却并非如此,此时会出现清晰度下降("翻转")的情况。在之前的一项研究中,我们发现在 74 只听力损失的测试耳中,有 35 只在安静环境下进行的语言清晰度测量中出现了滚转现象。此外,我们还发现,在安静环境下出现的翻转现象与线性放大测量的噪音环境下较低的言语清晰度有关。在这里,我们用三种振幅压缩设置重新测试了 16 名有翻转现象的参与者。其中两种设置的目的是通过在 "甜点 "周围以 5:1 的压缩比进行慢速或快速压缩来防止翻转,"甜点 "是指个人性能强度函数中具有高清晰度和听觉舒适度的区域。第三种参考设置使用 "国家声学实验室非线性 1 "规则规定的增益和压缩比。语音清晰度在安静和噪音环境下进行评估。同时还收集了两两之间的偏好判断。对于 70 dB SPL 及以上的语音水平,慢动作甜点压缩在安静和噪音环境下的清晰度均优于参考设置。此外,与其他设置相比,参与者明显更喜欢慢动作甜点压缩。在较低的音量下,三种设置的语音清晰度不相上下,与两种甜点设置相比,学员更喜欢参考设置。总之,这些结果表明,对于有滚动现象的听者来说,在 70 dB SPL 及以上时,慢动作甜点压缩是有益的,而在较低水平时,临床确定的增益目标更适合。
{"title":"Amplitude Compression for Preventing Rollover at Above-Conversational Speech Levels.","authors":"Michal Fereczkowski, Raul H Sanchez-Lopez, Stine Christiansen, Tobias Neher","doi":"10.1177/23312165231224597","DOIUrl":"10.1177/23312165231224597","url":null,"abstract":"<p><p>Hearing aids provide nonlinear amplification to improve speech audibility and loudness perception. While more audibility typically increases speech intelligibility at low levels, the same is not true for above-conversational levels, where decreases in intelligibility (\"rollover\") can occur. In a previous study, we found rollover in speech intelligibility measurements made in quiet for 35 out of 74 test ears with a hearing loss. Furthermore, we found rollover occurrence in quiet to be associated with poorer speech intelligibility in noise as measured with linear amplification. Here, we retested 16 participants with rollover with three amplitude-compression settings. Two were designed to prevent rollover by applying slow- or fast-acting compression with a 5:1 compression ratio around the \"sweet spot,\" that is, the area in an individual performance-intensity function with high intelligibility and listening comfort. The third, reference setting used gains and compression ratios prescribed by the \"National Acoustic Laboratories Non-Linear 1\" rule. Speech intelligibility was assessed in quiet and in noise. Pairwise preference judgments were also collected. For speech levels of 70 dB SPL and above, slow-acting sweet-spot compression gave better intelligibility in quiet and noise than the reference setting. Additionally, the participants clearly preferred slow-acting sweet-spot compression over the other settings. At lower levels, the three settings gave comparable speech intelligibility, and the participants preferred the reference setting over both sweet-spot settings. Overall, these results suggest that, for listeners with rollover, slow-acting sweet-spot compression is beneficial at 70 dB SPL and above, while at lower levels clinically established gain targets are more suited.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165231224597"},"PeriodicalIF":2.7,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10771052/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139099037","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Perceptual Consequences of Cochlear Deafferentation in Humans. 人类耳蜗失聪的感知后果
IF 2.7 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241239541
Naomi F Bramhall, Garnett P McMillan

Cochlear synaptopathy, a form of cochlear deafferentation, has been demonstrated in a number of animal species, including non-human primates. Both age and noise exposure contribute to synaptopathy in animal models, indicating that it may be a common type of auditory dysfunction in humans. Temporal bone and auditory physiological data suggest that age and occupational/military noise exposure also lead to synaptopathy in humans. The predicted perceptual consequences of synaptopathy include tinnitus, hyperacusis, and difficulty with speech-in-noise perception. However, confirming the perceptual impacts of this form of cochlear deafferentation presents a particular challenge because synaptopathy can only be confirmed through post-mortem temporal bone analysis and auditory perception is difficult to evaluate in animals. Animal data suggest that deafferentation leads to increased central gain, signs of tinnitus and abnormal loudness perception, and deficits in temporal processing and signal-in-noise detection. If equivalent changes occur in humans following deafferentation, this would be expected to increase the likelihood of developing tinnitus, hyperacusis, and difficulty with speech-in-noise perception. Physiological data from humans is consistent with the hypothesis that deafferentation is associated with increased central gain and a greater likelihood of tinnitus perception, while human data on the relationship between deafferentation and hyperacusis is extremely limited. Many human studies have investigated the relationship between physiological correlates of deafferentation and difficulty with speech-in-noise perception, with mixed findings. A non-linear relationship between deafferentation and speech perception may have contributed to the mixed results. When differences in sample characteristics and study measurements are considered, the findings may be more consistent.

耳蜗突触病是耳蜗失聪的一种形式,已在包括非人灵长类动物在内的许多动物物种中得到证实。在动物模型中,年龄和噪音暴露都会导致突触病,这表明突触病可能是人类听觉功能障碍的一种常见类型。颞骨和听觉生理数据表明,年龄和职业/军事噪声暴露也会导致人类突触病。根据预测,突触病的感知后果包括耳鸣、听觉障碍和噪音中的语言感知困难。然而,确认这种形式的耳蜗失聪对知觉的影响是一项特殊的挑战,因为突触病只能通过尸检颞骨分析来确认,而听觉知觉很难在动物身上进行评估。动物数据表明,耳蜗失聪会导致中枢增益增加、耳鸣症状和响度感知异常,以及时间处理和噪声信号检测缺陷。如果人类在去发声后发生了同样的变化,预计会增加出现耳鸣、听觉障碍和噪声中语音感知困难的可能性。来自人类的生理学数据与下述假设相一致,即去发声与中枢增益增加和耳鸣感知可能性增大有关,而关于去发声与听觉亢进之间关系的人类数据则极为有限。许多人类研究调查了去感受化的生理相关性与噪声中言语感知困难之间的关系,结果不一。失音与言语感知之间的非线性关系可能是导致结果不一的原因之一。如果考虑到样本特征和研究测量的差异,研究结果可能会更加一致。
{"title":"Perceptual Consequences of Cochlear Deafferentation in Humans.","authors":"Naomi F Bramhall, Garnett P McMillan","doi":"10.1177/23312165241239541","DOIUrl":"10.1177/23312165241239541","url":null,"abstract":"<p><p>Cochlear synaptopathy, a form of cochlear deafferentation, has been demonstrated in a number of animal species, including non-human primates. Both age and noise exposure contribute to synaptopathy in animal models, indicating that it may be a common type of auditory dysfunction in humans. Temporal bone and auditory physiological data suggest that age and occupational/military noise exposure also lead to synaptopathy in humans. The predicted perceptual consequences of synaptopathy include tinnitus, hyperacusis, and difficulty with speech-in-noise perception. However, confirming the perceptual impacts of this form of cochlear deafferentation presents a particular challenge because synaptopathy can only be confirmed through post-mortem temporal bone analysis and auditory perception is difficult to evaluate in animals. Animal data suggest that deafferentation leads to increased central gain, signs of tinnitus and abnormal loudness perception, and deficits in temporal processing and signal-in-noise detection. If equivalent changes occur in humans following deafferentation, this would be expected to increase the likelihood of developing tinnitus, hyperacusis, and difficulty with speech-in-noise perception. Physiological data from humans is consistent with the hypothesis that deafferentation is associated with increased central gain and a greater likelihood of tinnitus perception, while human data on the relationship between deafferentation and hyperacusis is extremely limited. Many human studies have investigated the relationship between physiological correlates of deafferentation and difficulty with speech-in-noise perception, with mixed findings. A non-linear relationship between deafferentation and speech perception may have contributed to the mixed results. When differences in sample characteristics and study measurements are considered, the findings may be more consistent.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241239541"},"PeriodicalIF":2.7,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11092548/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"140913237","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Is Recognition of Speech in Noise Related to Memory Disruption Caused by Irrelevant Sound? 噪音中的语音识别与无关声音造成的记忆中断有关吗?
IF 2.6 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241262517
Daniel Oberfeld, Katharina Staab, Florian Kattner, Wolfgang Ellermeier

Listeners with normal audiometric thresholds show substantial variability in their ability to understand speech in noise (SiN). These individual differences have been reported to be associated with a range of auditory and cognitive abilities. The present study addresses the association between SiN processing and the individual susceptibility of short-term memory to auditory distraction (i.e., the irrelevant sound effect [ISE]). In a sample of 67 young adult participants with normal audiometric thresholds, we measured speech recognition performance in a spatial listening task with two interfering talkers (speech-in-speech identification), audiometric thresholds, binaural sensitivity to the temporal fine structure (interaural phase differences [IPD]), serial memory with and without interfering talkers, and self-reported noise sensitivity. Speech-in-speech processing was not significantly associated with the ISE. The most important predictors of high speech-in-speech recognition performance were a large short-term memory span, low IPD thresholds, bilaterally symmetrical audiometric thresholds, and low individual noise sensitivity. Surprisingly, the susceptibility of short-term memory to irrelevant sound accounted for a substantially smaller amount of variance in speech-in-speech processing than the nondisrupted short-term memory capacity. The data confirm the role of binaural sensitivity to the temporal fine structure, although its association to SiN recognition was weaker than in some previous studies. The inverse association between self-reported noise sensitivity and SiN processing deserves further investigation.

听力阈值正常的听者在理解噪声语音(SiN)的能力上存在很大差异。据报道,这些个体差异与一系列听觉和认知能力有关。本研究探讨了噪音语言处理能力与个体短期记忆对听觉干扰(即无关声音效应 [ISE])的敏感性之间的关联。我们以听阈正常的 67 名年轻成年参与者为样本,测量了在有两个干扰说话者的空间听力任务中的语音识别成绩(语音中的语音识别)、听阈、对时间精细结构的双耳敏感度(耳间相位差 [IPD])、有干扰说话者和无干扰说话者的序列记忆以及自我报告的噪声敏感度。语音中的语音处理与 ISE 没有明显关联。短期记忆跨度大、IPD阈值低、双侧听力阈值对称和个体噪声敏感度低是预测高语音识别能力的最重要因素。令人惊讶的是,短期记忆对无关声音的易感性在语音-语音处理中造成的差异远远小于未受干扰的短期记忆能力。这些数据证实了双耳对时间精细结构的敏感性所起的作用,尽管它与 SiN 识别的关联性比以前的一些研究要弱。自我报告的噪声敏感度与 SiN 处理之间的反向关联值得进一步研究。
{"title":"Is Recognition of Speech in Noise Related to Memory Disruption Caused by Irrelevant Sound?","authors":"Daniel Oberfeld, Katharina Staab, Florian Kattner, Wolfgang Ellermeier","doi":"10.1177/23312165241262517","DOIUrl":"10.1177/23312165241262517","url":null,"abstract":"<p><p>Listeners with normal audiometric thresholds show substantial variability in their ability to understand speech in noise (SiN). These individual differences have been reported to be associated with a range of auditory and cognitive abilities. The present study addresses the association between SiN processing and the individual susceptibility of short-term memory to auditory distraction (i.e., the irrelevant sound effect [ISE]). In a sample of 67 young adult participants with normal audiometric thresholds, we measured speech recognition performance in a spatial listening task with two interfering talkers (speech-in-speech identification), audiometric thresholds, binaural sensitivity to the temporal fine structure (interaural phase differences [IPD]), serial memory with and without interfering talkers, and self-reported noise sensitivity. Speech-in-speech processing was not significantly associated with the ISE. The most important predictors of high speech-in-speech recognition performance were a large short-term memory span, low IPD thresholds, bilaterally symmetrical audiometric thresholds, and low individual noise sensitivity. Surprisingly, the susceptibility of short-term memory to irrelevant sound accounted for a substantially smaller amount of variance in speech-in-speech processing than the nondisrupted short-term memory capacity. The data confirm the role of binaural sensitivity to the temporal fine structure, although its association to SiN recognition was weaker than in some previous studies. The inverse association between self-reported noise sensitivity and SiN processing deserves further investigation.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241262517"},"PeriodicalIF":2.6,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11273587/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141761865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms. 多通道语音增强算法的音素尺度评价。
IF 2.6 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241292205
Nasser-Eddine Monir, Paul Magron, Romain Serizel

In the intricate acoustic landscapes where speech intelligibility is challenged by noise and reverberation, multichannel speech enhancement emerges as a promising solution for individuals with hearing loss. Such algorithms are commonly evaluated at the utterance scale. However, this approach overlooks the granular acoustic nuances revealed by phoneme-specific analysis, potentially obscuring key insights into their performance. This paper presents an in-depth phoneme-scale evaluation of three state-of-the-art multichannel speech enhancement algorithms. These algorithms-filter-and-sum network, minimum variance distortionless response, and Tango-are here extensively evaluated across different noise conditions and spatial setups, employing realistic acoustic simulations with measured room impulse responses, and leveraging diversity offered by multiple microphones in a binaural hearing setup. The study emphasizes the fine-grained phoneme-scale analysis, revealing that while some phonemes like plosives are heavily impacted by environmental acoustics and challenging to deal with by the algorithms, others like nasals and sibilants see substantial improvements after enhancement. These investigations demonstrate important improvements in phoneme clarity in noisy conditions, with insights that could drive the development of more personalized and phoneme-aware hearing aid technologies. Additionally, while this study provides extensive data on the physical metrics of processed speech, these physical metrics do not necessarily imitate human perceptions of speech, and the impact of the findings presented would have to be investigated through listening tests.

在复杂的声学环境中,语音清晰度受到噪声和混响的挑战,多通道语音增强成为听力损失患者的一种有希望的解决方案。这种算法通常在话语尺度上进行评估。然而,这种方法忽略了音素特定分析所揭示的颗粒声学细微差别,潜在地模糊了对其性能的关键见解。本文对三种最先进的多通道语音增强算法进行了深入的音素尺度评估。这些算法——滤波和网络、最小方差无失真响应和tango——在不同的噪声条件和空间设置下进行了广泛的评估,采用了真实的声学模拟和测量的房间脉冲响应,并利用了双耳听力设置中多个麦克风提供的多样性。该研究强调了细粒度的音位尺度分析,揭示了一些音位,如爆破音,受到环境声学的严重影响,很难通过算法处理,而其他音位,如鼻音和硅音,在增强后得到了实质性的改善。这些研究表明,在嘈杂条件下,音素清晰度有了重要的提高,其见解可以推动更加个性化和音素感知助听器技术的发展。此外,虽然这项研究提供了大量关于处理语音的物理指标的数据,但这些物理指标并不一定模仿人类对语音的感知,并且所提出的研究结果的影响必须通过听力测试来调查。
{"title":"A Phoneme-Scale Assessment of Multichannel Speech Enhancement Algorithms.","authors":"Nasser-Eddine Monir, Paul Magron, Romain Serizel","doi":"10.1177/23312165241292205","DOIUrl":"10.1177/23312165241292205","url":null,"abstract":"<p><p>In the intricate acoustic landscapes where speech intelligibility is challenged by noise and reverberation, multichannel speech enhancement emerges as a promising solution for individuals with hearing loss. Such algorithms are commonly evaluated at the utterance scale. However, this approach overlooks the granular acoustic nuances revealed by phoneme-specific analysis, potentially obscuring key insights into their performance. This paper presents an in-depth phoneme-scale evaluation of three state-of-the-art multichannel speech enhancement algorithms. These algorithms-filter-and-sum network, minimum variance distortionless response, and Tango-are here extensively evaluated across different noise conditions and spatial setups, employing realistic acoustic simulations with measured room impulse responses, and leveraging diversity offered by multiple microphones in a binaural hearing setup. The study emphasizes the fine-grained phoneme-scale analysis, revealing that while some phonemes like plosives are heavily impacted by environmental acoustics and challenging to deal with by the algorithms, others like nasals and sibilants see substantial improvements after enhancement. These investigations demonstrate important improvements in phoneme clarity in noisy conditions, with insights that could drive the development of more personalized and phoneme-aware hearing aid technologies. Additionally, while this study provides extensive data on the physical metrics of processed speech, these physical metrics do not necessarily imitate human perceptions of speech, and the impact of the findings presented would have to be investigated through listening tests.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241292205"},"PeriodicalIF":2.6,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11638999/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142814772","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Factors Influencing Stream Segregation Based on Interaural Phase Difference Cues. 基于耳间相位差线索的流偏析影响因素。
IF 2.6 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241293787
Nicholas R Haywood, David McAlpine, Deborah Vickers, Brian Roberts

Interaural time differences are often considered a weak cue for stream segregation. We investigated this claim with headphone-presented pure tones differing in a related form of interaural configuration-interaural phase differences (ΔIPD)-or/and in frequency (ΔF). In experiment 1, sequences comprised 5 × ABA- repetitions (A and B = 80-ms tones, "-" = 160-ms silence), and listeners reported whether integration or segregation was heard. Envelope shape was varied but remained constant across all tones within a trial. Envelopes were either quasi-trapezoidal or had a fast attack and slow release (FA-SR) or vice versa (SA-FR). The FA-SR envelope caused more segregation than SA-FR in a task where only ΔIPD cues were present, but not in a corresponding ΔF-only task. In experiment 2, interstimulus interval (ISI) was varied (0-60 ms) between FA-SR tones. ΔF-based segregation decreased with increasing ISI, whereas ΔIPD-based segregation increased. This suggests that binaural temporal integration may limit segregation at short ISIs. In another task, ΔF and ΔIPD cues were presented alone or in combination. Here, ΔIPD-based segregation was greatly reduced, suggesting ΔIPD-based segregation is highly sensitive to experimental context. Experiments 1-2 demonstrate that ΔIPD can promote segregation in optimized stimuli/tasks. Experiment 3 employed a task requiring integration for good performance. Listeners detected a delay on the final four B tones of an 8 × ABA- sequence. Although performance worsened with increasing ΔF, increasing ΔIPD had only a marginal impact. This suggests that, even in stimuli optimized for ΔIPD-based segregation, listeners remained mostly able to disregard ΔIPD when segregation was detrimental to performance.

内部时差通常被认为是流分离的弱提示。我们对这一说法进行了调查,耳机呈现的纯音在耳间配置的相关形式-耳间相位差(ΔIPD)-或/和频率(ΔF)中有所不同。在实验1中,序列由5 × ABA-重复组成(A和B = 80 ms音调,“-”= 160 ms沉默),听者报告是否听到融合或隔离。在一次试验中,包络形状各不相同,但在所有音调中保持不变。包膜呈准梯形或快攻缓释(FA-SR),反之亦然(SA-FR)。在只有ΔIPD线索的任务中,FA-SR包膜比SA-FR引起更多的隔离,而在相应的ΔF-only任务中则没有。在实验2中,FA-SR音调间刺激间隔(ISI)在0-60 ms之间变化。随着ISI的增加,ΔF-based偏析减少,而ΔIPD-based偏析增加。这表明双耳时间整合可能限制短ISIs的分离。在另一项任务中,ΔF和ΔIPD线索被单独或组合呈现。在这里,ΔIPD-based隔离大大减少,表明ΔIPD-based隔离对实验环境高度敏感。实验1-2表明ΔIPD可以促进优化刺激/任务的隔离。实验3采用了一个需要整合才能获得良好表现的任务。听众在8 × ABA序列的最后四个B音上检测到延迟。虽然性能随着ΔF的增加而恶化,但增加ΔIPD的影响微乎其微。这表明,即使在针对ΔIPD-based隔离进行优化的刺激中,当隔离对表现有害时,听众仍然大多能够忽略ΔIPD。
{"title":"Factors Influencing Stream Segregation Based on Interaural Phase Difference Cues.","authors":"Nicholas R Haywood, David McAlpine, Deborah Vickers, Brian Roberts","doi":"10.1177/23312165241293787","DOIUrl":"10.1177/23312165241293787","url":null,"abstract":"<p><p>Interaural time differences are often considered a weak cue for stream segregation. We investigated this claim with headphone-presented pure tones differing in a related form of interaural configuration-interaural phase differences (ΔIPD)-or/and in frequency (ΔF). In experiment 1, sequences comprised 5 × ABA- repetitions (A and B = 80-ms tones, \"-\" = 160-ms silence), and listeners reported whether integration or segregation was heard. Envelope shape was varied but remained constant across all tones within a trial. Envelopes were either quasi-trapezoidal or had a fast attack and slow release (FA-SR) or vice versa (SA-FR). The FA-SR envelope caused more segregation than SA-FR in a task where only ΔIPD cues were present, but not in a corresponding ΔF-only task. In experiment 2, interstimulus interval (ISI) was varied (0-60 ms) between FA-SR tones. ΔF-based segregation decreased with increasing ISI, whereas ΔIPD-based segregation increased. This suggests that binaural temporal integration may limit segregation at short ISIs. In another task, ΔF and ΔIPD cues were presented alone or in combination. Here, ΔIPD-based segregation was greatly reduced, suggesting ΔIPD-based segregation is highly sensitive to experimental context. Experiments 1-2 demonstrate that ΔIPD can promote segregation in optimized stimuli/tasks. Experiment 3 employed a task requiring integration for good performance. Listeners detected a delay on the final four B tones of an 8 × ABA- sequence. Although performance worsened with increasing ΔF, increasing ΔIPD had only a marginal impact. This suggests that, even in stimuli optimized for ΔIPD-based segregation, listeners remained mostly able to disregard ΔIPD when segregation was detrimental to performance.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241293787"},"PeriodicalIF":2.6,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11629429/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142802838","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Performance and Reliability Evaluation of an Automated Bone-Conduction Audiometry Using Machine Learning. 利用机器学习对自动骨导听力计的性能和可靠性进行评估。
IF 2.6 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241286456
Nicolas Wallaert, Antoine Perry, Hadrien Jean, Gwenaelle Creff, Benoit Godey, Nihaad Paraouty

To date, pure-tone audiometry remains the gold standard for clinical auditory testing. However, pure-tone audiometry is time-consuming and only provides a discrete estimate of hearing acuity. Here, we aim to address these two main drawbacks by developing a machine learning (ML)-based approach for fully automated bone-conduction (BC) audiometry tests with forehead vibrator placement. Study 1 examines the occlusion effects when the headphones are positioned on both ears during BC forehead testing. Study 2 describes the ML-based approach for BC audiometry, with automated contralateral masking rules, compensation for occlusion effects and forehead-mastoid corrections. Next, the performance of ML-audiometry is examined in comparison to manual and conventional BC audiometry with mastoid placement. Finally, Study 3 examines the test-retest reliability of ML-audiometry. Our results show no significant performance difference between automated ML-audiometry and manual conventional audiometry. High test-retest reliability is achieved with the automated ML-audiometry. Together, our findings demonstrate the performance and reliability of the automated ML-based BC audiometry for both normal-hearing and hearing-impaired adult listeners with mild to severe hearing losses.

迄今为止,纯音测听仍是临床听觉测试的黄金标准。然而,纯音测听耗时较长,而且只能提供离散的听敏度估计值。在此,我们旨在通过开发一种基于机器学习(ML)的方法来解决这两个主要缺点,即使用前额振动器进行全自动骨传导(BC)听力测试。研究 1 探讨了 BC 前额测试中耳机置于双耳时的闭塞效应。研究 2 介绍了基于 ML 的 BC 听力测量方法,包括自动对侧掩蔽规则、闭塞效应补偿和前额-乳突校正。接下来,研究人员将 ML 测听法的性能与手动测听法和乳突置位的传统 BC 测听法进行了比较。最后,研究 3 检验了 ML 听力测定法的重复测试可靠性。研究结果表明,自动 ML 听力测定法与手动传统听力测定法之间没有明显的性能差异。自动 ML 听力测定法的测试再测可靠性很高。总之,我们的研究结果表明,对于听力正常和听力受损的轻度至重度听力损失的成年听众,基于 ML 的自动 BC 听力测定法都具有良好的性能和可靠性。
{"title":"Performance and Reliability Evaluation of an Automated Bone-Conduction Audiometry Using Machine Learning.","authors":"Nicolas Wallaert, Antoine Perry, Hadrien Jean, Gwenaelle Creff, Benoit Godey, Nihaad Paraouty","doi":"10.1177/23312165241286456","DOIUrl":"10.1177/23312165241286456","url":null,"abstract":"<p><p>To date, pure-tone audiometry remains the gold standard for clinical auditory testing. However, pure-tone audiometry is time-consuming and only provides a discrete estimate of hearing acuity. Here, we aim to address these two main drawbacks by developing a machine learning (ML)-based approach for fully automated bone-conduction (BC) audiometry tests with forehead vibrator placement. Study 1 examines the occlusion effects when the headphones are positioned on both ears during BC forehead testing. Study 2 describes the ML-based approach for BC audiometry, with automated contralateral masking rules, compensation for occlusion effects and forehead-mastoid corrections. Next, the performance of ML-audiometry is examined in comparison to manual and conventional BC audiometry with mastoid placement. Finally, Study 3 examines the test-retest reliability of ML-audiometry. Our results show no significant performance difference between automated ML-audiometry and manual conventional audiometry. High test-retest reliability is achieved with the automated ML-audiometry. Together, our findings demonstrate the performance and reliability of the automated ML-based BC audiometry for both normal-hearing and hearing-impaired adult listeners with mild to severe hearing losses.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241286456"},"PeriodicalIF":2.6,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11703668/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142570248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals. ADT 网络:从脑电图信号中解码语音包络的新型非线性方法
IF 2.6 2区 医学 Q1 AUDIOLOGY & SPEECH-LANGUAGE PATHOLOGY Pub Date : 2024-01-01 DOI: 10.1177/23312165241282872
Ruixiang Liu, Chang Liu, Dan Cui, Huan Zhang, Xinmeng Xu, Yuxin Duan, Yihu Chao, Xianzheng Sha, Limin Sun, Xiulan Ma, Shuo Li, Shijie Chang

Decoding speech envelopes from electroencephalogram (EEG) signals holds potential as a research tool for objectively assessing auditory processing, which could contribute to future developments in hearing loss diagnosis. However, current methods struggle to meet both high accuracy and interpretability. We propose a deep learning model called the auditory decoding transformer (ADT) network for speech envelope reconstruction from EEG signals to address these issues. The ADT network uses spatio-temporal convolution for feature extraction, followed by a transformer decoder to decode the speech envelopes. Through anticausal masking, the ADT considers only the current and future EEG features to match the natural relationship of speech and EEG. Performance evaluation shows that the ADT network achieves average reconstruction scores of 0.168 and 0.167 on the SparrKULee and DTU datasets, respectively, rivaling those of other nonlinear models. Furthermore, by visualizing the weights of the spatio-temporal convolution layer as time-domain filters and brain topographies, combined with an ablation study of the temporal convolution kernels, we analyze the behavioral patterns of the ADT network in decoding speech envelopes. The results indicate that low- (0.5-8 Hz) and high-frequency (14-32 Hz) EEG signals are more critical for envelope reconstruction and that the active brain regions are primarily distributed bilaterally in the auditory cortex, consistent with previous research. Visualization of attention scores further validated previous research. In summary, the ADT network balances high performance and interpretability, making it a promising tool for studying neural speech envelope tracking.

从脑电图(EEG)信号中解码语音包络线有望成为客观评估听觉处理过程的研究工具,这将有助于未来听力损失诊断的发展。然而,目前的方法很难同时满足高准确性和可解释性的要求。为了解决这些问题,我们提出了一种名为听觉解码转换器(ADT)网络的深度学习模型,用于从脑电图信号重建语音包络。ADT 网络使用时空卷积进行特征提取,然后使用变压器解码器对语音包络进行解码。通过反因果掩蔽,ADT 只考虑当前和未来的脑电图特征,以符合语音和脑电图的自然关系。性能评估结果表明,ADT 网络在 SparrKULee 和 DTU 数据集上的平均重建分数分别达到了 0.168 和 0.167,可与其他非线性模型相媲美。此外,通过将时空卷积层的权重可视化为时域滤波器和脑拓扑图,并结合对时空卷积核的消融研究,我们分析了 ADT 网络在解码语音包络时的行为模式。结果表明,低频(0.5-8 Hz)和高频(14-32 Hz)脑电信号对包络重构更为关键,而活跃的脑区主要分布在听觉皮层的双侧,这与之前的研究一致。注意力得分的可视化进一步验证了之前的研究。总之,ADT 网络兼顾了高性能和可解释性,是研究神经语音包络跟踪的理想工具。
{"title":"ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.","authors":"Ruixiang Liu, Chang Liu, Dan Cui, Huan Zhang, Xinmeng Xu, Yuxin Duan, Yihu Chao, Xianzheng Sha, Limin Sun, Xiulan Ma, Shuo Li, Shijie Chang","doi":"10.1177/23312165241282872","DOIUrl":"https://doi.org/10.1177/23312165241282872","url":null,"abstract":"<p><p>Decoding speech envelopes from electroencephalogram (EEG) signals holds potential as a research tool for objectively assessing auditory processing, which could contribute to future developments in hearing loss diagnosis. However, current methods struggle to meet both high accuracy and interpretability. We propose a deep learning model called the auditory decoding transformer (ADT) network for speech envelope reconstruction from EEG signals to address these issues. The ADT network uses spatio-temporal convolution for feature extraction, followed by a transformer decoder to decode the speech envelopes. Through anticausal masking, the ADT considers only the current and future EEG features to match the natural relationship of speech and EEG. Performance evaluation shows that the ADT network achieves average reconstruction scores of 0.168 and 0.167 on the SparrKULee and DTU datasets, respectively, rivaling those of other nonlinear models. Furthermore, by visualizing the weights of the spatio-temporal convolution layer as time-domain filters and brain topographies, combined with an ablation study of the temporal convolution kernels, we analyze the behavioral patterns of the ADT network in decoding speech envelopes. The results indicate that low- (0.5-8 Hz) and high-frequency (14-32 Hz) EEG signals are more critical for envelope reconstruction and that the active brain regions are primarily distributed bilaterally in the auditory cortex, consistent with previous research. Visualization of attention scores further validated previous research. In summary, the ADT network balances high performance and interpretability, making it a promising tool for studying neural speech envelope tracking.</p>","PeriodicalId":48678,"journal":{"name":"Trends in Hearing","volume":"28 ","pages":"23312165241282872"},"PeriodicalIF":2.6,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11489951/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142478206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Trends in Hearing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1