语音理解中对有意义子词单元的主动和预测处理的神经基础

IF 4 2区医学 Q1 NEUROSCIENCES Journal of Neuroscience Pub Date : 2025-02-12 DOI:10.1523/JNEUROSCI.0781-24.2024

Suhail Matar, Alec Marantz

{"title":"语音理解中对有意义子词单元的主动和预测处理的神经基础","authors":"Suhail Matar, Alec Marantz","doi":"10.1523/JNEUROSCI.0781-24.2024","DOIUrl":null,"url":null,"abstract":"To comprehend speech, human brains identify meaningful units, like words, in the speech stream. But whereas the English 'She believed him.' has three words, the Arabic equivalent 'ṣaddaqathu' forms one word with three meaningful subword units, called morphemes: a verb stem ('ṣaddaqa'), a subject suffix ('-t-'), and a direct object pronoun ('-hu'). It remains unclear whether and how speech comprehension involves morpheme processing, above and beyond other language units. Here, we propose and test hierarchically nested encoding models of speech comprehension: a naïve model with word-, syllable-, and sound-level information; a bottom-up model with additional morpheme boundary information; and predictive models that process morphemes before these boundaries. We recorded MEG data as 27 participants (16 female) listened to Arabic sentences like 'ṣaddaqathu .' A temporal response function analysis revealed that in temporal and left inferior frontal regions, predictive models outperform the bottom-up model, which outperforms the naïve model. Moreover, verb stems were either length-ambiguous (e.g., 'ṣaddaqa' is initially mistakable for the shorter stem 'ṣadda', meaning 'blocked') or length-unambiguous (e.g., 'qayyama', meaning 'evaluated', cannot be mistaken for a shorter stem) but shared a uniqueness point, beyond which stem identity is disambiguated. Evoked analyses revealed differences between conditions before the uniqueness point, suggesting that, rather than await disambiguation, the brain employs proactive predictive strategies, processing accumulated input as soon as any possible stem is identifiable, even if not uniquely. These findings highlight the role of morphemes in speech and the importance of including morpheme-level information in neural and computational models of speech comprehension.","PeriodicalId":50114,"journal":{"name":"Journal of Neuroscience","volume":" ","pages":""},"PeriodicalIF":4.0000,"publicationDate":"2025-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823338/pdf/","citationCount":"0","resultStr":"{\"title\":\"Neural Bases of Proactive and Predictive Processing of Meaningful Subword Units in Speech Comprehension.\",\"authors\":\"Suhail Matar, Alec Marantz\",\"doi\":\"10.1523/JNEUROSCI.0781-24.2024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To comprehend speech, human brains identify meaningful units, like words, in the speech stream. But whereas the English 'She believed him.' has three words, the Arabic equivalent 'ṣaddaqathu' forms one word with three meaningful subword units, called morphemes: a verb stem ('ṣaddaqa'), a subject suffix ('-t-'), and a direct object pronoun ('-hu'). It remains unclear whether and how speech comprehension involves morpheme processing, above and beyond other language units. Here, we propose and test hierarchically nested encoding models of speech comprehension: a naïve model with word-, syllable-, and sound-level information; a bottom-up model with additional morpheme boundary information; and predictive models that process morphemes before these boundaries. We recorded MEG data as 27 participants (16 female) listened to Arabic sentences like 'ṣaddaqathu .' A temporal response function analysis revealed that in temporal and left inferior frontal regions, predictive models outperform the bottom-up model, which outperforms the naïve model. Moreover, verb stems were either length-ambiguous (e.g., 'ṣaddaqa' is initially mistakable for the shorter stem 'ṣadda', meaning 'blocked') or length-unambiguous (e.g., 'qayyama', meaning 'evaluated', cannot be mistaken for a shorter stem) but shared a uniqueness point, beyond which stem identity is disambiguated. Evoked analyses revealed differences between conditions before the uniqueness point, suggesting that, rather than await disambiguation, the brain employs proactive predictive strategies, processing accumulated input as soon as any possible stem is identifiable, even if not uniquely. These findings highlight the role of morphemes in speech and the importance of including morpheme-level information in neural and computational models of speech comprehension.\",\"PeriodicalId\":50114,\"journal\":{\"name\":\"Journal of Neuroscience\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":4.0000,\"publicationDate\":\"2025-02-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823338/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1523/JNEUROSCI.0781-24.2024\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1523/JNEUROSCI.0781-24.2024","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

为了理解语音，人脑需要识别语音流中有意义的单元。但是，英语 "She believed him. "有 3 个单词单位，而阿拉伯语对应的 "ṣaddaqathu. "是一个单词单位，包含 3 个有意义的子单词单位，称为语素：一个动词词干（'ṣaddaqa'）、一个主语后缀（'-t-'）和一个直接宾语代词（'-hu'）。目前仍不清楚在语音理解过程中，大脑是否以及如何在其他语言单位之外处理语素。在此，我们提出并测试了语音理解的分层嵌套编码模型：一个包含单词、音节和声音级信息的天真模型；一个包含额外语素边界信息的自下而上模型；以及在这些边界之前处理语素的预测模型。我们记录了 27 名参与者（16 名女性）聆听阿拉伯语句子 "ṣaddaqathu. "时的脑磁图（MEG）数据。时间反应函数（TRF）分析表明，在颞叶和左下额区，预测模型优于自下而上模型，而自下而上模型优于天真模型。此外，动词词干要么是长度模糊的（例如，'ṣaddaqa'最初可能被误认为是较短的词干'ṣadda'='blocked'），要么是长度不模糊的（例如，'qayyama'='evaluated'不会被误认为是较短的词干），但都有一个唯一性点，过了这个唯一性点，词干的身份就完全不模糊了。诱发分析表明，在唯一性点之前，不同条件之间存在差异，这表明大脑不是等待消歧，而是采用主动预测策略，一旦任何可能的词干可以识别，即使不是唯一的，也会立即处理累积的输入。这些发现凸显了语素在语音中的作用，以及在语音理解的神经和计算模型中包含语素级信息的重要性。但是，语言在单词单位中包含的意义量方面存在很大差异。这项研究提出了包含有意义的子单词单位信息的语音理解模型，这些单词单位被称为词素（例如 "烘焙 "中的 "bake-"和"-ing"），研究结果表明，与不包含词素信息的模型相比，这些模型能解释更多的神经活动。我们还展示了大脑是如何预测性地处理语素信息的。这些发现突出了语素在语音理解中的作用，并强调了语素级信息论度量（如惊奇和熵等）的贡献。我们的发现可用于更新当前语音理解的神经、认知和计算模型，并为完善这些模型以适应自然、连贯的语音迈出了一步。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Neural Bases of Proactive and Predictive Processing of Meaningful Subword Units in Speech Comprehension.

To comprehend speech, human brains identify meaningful units, like words, in the speech stream. But whereas the English 'She believed him.' has three words, the Arabic equivalent 'ṣaddaqathu' forms one word with three meaningful subword units, called morphemes: a verb stem ('ṣaddaqa'), a subject suffix ('-t-'), and a direct object pronoun ('-hu'). It remains unclear whether and how speech comprehension involves morpheme processing, above and beyond other language units. Here, we propose and test hierarchically nested encoding models of speech comprehension: a naïve model with word-, syllable-, and sound-level information; a bottom-up model with additional morpheme boundary information; and predictive models that process morphemes before these boundaries. We recorded MEG data as 27 participants (16 female) listened to Arabic sentences like 'ṣaddaqathu .' A temporal response function analysis revealed that in temporal and left inferior frontal regions, predictive models outperform the bottom-up model, which outperforms the naïve model. Moreover, verb stems were either length-ambiguous (e.g., 'ṣaddaqa' is initially mistakable for the shorter stem 'ṣadda', meaning 'blocked') or length-unambiguous (e.g., 'qayyama', meaning 'evaluated', cannot be mistaken for a shorter stem) but shared a uniqueness point, beyond which stem identity is disambiguated. Evoked analyses revealed differences between conditions before the uniqueness point, suggesting that, rather than await disambiguation, the brain employs proactive predictive strategies, processing accumulated input as soon as any possible stem is identifiable, even if not uniquely. These findings highlight the role of morphemes in speech and the importance of including morpheme-level information in neural and computational models of speech comprehension.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Neuroscience 医学-神经科学

CiteScore

9.30

自引率

3.80%

发文量

1164

审稿时长

12 months

期刊介绍： JNeurosci (ISSN 0270-6474) is an official journal of the Society for Neuroscience. It is published weekly by the Society, fifty weeks a year, one volume a year. JNeurosci publishes papers on a broad range of topics of general interest to those working on the nervous system. Authors now have an Open Choice option for their published articles