论话语连接词对人类和语言模型预测的影响。

IF 2.4 3区 医学 Q3 NEUROSCIENCES Frontiers in Human Neuroscience Pub Date : 2024-09-30 eCollection Date: 2024-01-01 DOI:10.3389/fnhum.2024.1363120
James Britton, Yan Cong, Yu-Yin Hsu, Emmanuele Chersoni, Philippe Blache
{"title":"论话语连接词对人类和语言模型预测的影响。","authors":"James Britton, Yan Cong, Yu-Yin Hsu, Emmanuele Chersoni, Philippe Blache","doi":"10.3389/fnhum.2024.1363120","DOIUrl":null,"url":null,"abstract":"<p><p>Psycholinguistic literature has consistently shown that humans rely on a rich and organized understanding of event knowledge to predict the forthcoming linguistic input during online sentence comprehension. We, the authors, expect sentences to maintain coherence with the preceding context, making congruent sentence sequences easier to process than incongruent ones. It is widely known that discourse relations between sentences (e.g., temporal, contingency, comparison) are generally made explicit through specific particles, known as <i>discourse connectives</i>, (e.g., <i>and, but, because, after</i>). However, some relations that are easily accessible to the speakers, given their event knowledge, can also be left implicit. The goal of this paper is to investigate the importance of discourse connectives in the prediction of events in human language processing and pretrained language models, with a specific focus on concessives and contrastives, which signal to comprehenders that their event-related predictions have to be <i>reversed</i>. Inspired by previous work, we built a comprehensive set of story stimuli in Italian and Mandarin Chinese that differ in the plausibility and coherence of the situation being described and the presence or absence of a discourse connective. We collected plausibility judgments and reading times from native speakers for the stimuli. Moreover, we correlated the results of the experiments with the predictions given by computational modeling, using Surprisal scores obtained via Transformer-based language models. The human judgements were collected using a seven-point Likert scale and analyzed using cumulative link mixed modeling (CLMM), while the human reading times and language model surprisal scores were analyzed using linear mixed effects regression (LMER). We found that Chinese NLMs are sensitive to plausibility and connectives, although they struggle to reproduce expectation reversal effects due to a connective changing the plausibility of a given scenario; Italian results are even less aligned with human data, with no effects of either plausibility and connectives on Surprisal.</p>","PeriodicalId":12536,"journal":{"name":"Frontiers in Human Neuroscience","volume":"18 ","pages":"1363120"},"PeriodicalIF":2.4000,"publicationDate":"2024-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471541/pdf/","citationCount":"0","resultStr":"{\"title\":\"On the influence of discourse connectives on the predictions of humans and language models.\",\"authors\":\"James Britton, Yan Cong, Yu-Yin Hsu, Emmanuele Chersoni, Philippe Blache\",\"doi\":\"10.3389/fnhum.2024.1363120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Psycholinguistic literature has consistently shown that humans rely on a rich and organized understanding of event knowledge to predict the forthcoming linguistic input during online sentence comprehension. We, the authors, expect sentences to maintain coherence with the preceding context, making congruent sentence sequences easier to process than incongruent ones. It is widely known that discourse relations between sentences (e.g., temporal, contingency, comparison) are generally made explicit through specific particles, known as <i>discourse connectives</i>, (e.g., <i>and, but, because, after</i>). However, some relations that are easily accessible to the speakers, given their event knowledge, can also be left implicit. The goal of this paper is to investigate the importance of discourse connectives in the prediction of events in human language processing and pretrained language models, with a specific focus on concessives and contrastives, which signal to comprehenders that their event-related predictions have to be <i>reversed</i>. Inspired by previous work, we built a comprehensive set of story stimuli in Italian and Mandarin Chinese that differ in the plausibility and coherence of the situation being described and the presence or absence of a discourse connective. We collected plausibility judgments and reading times from native speakers for the stimuli. Moreover, we correlated the results of the experiments with the predictions given by computational modeling, using Surprisal scores obtained via Transformer-based language models. The human judgements were collected using a seven-point Likert scale and analyzed using cumulative link mixed modeling (CLMM), while the human reading times and language model surprisal scores were analyzed using linear mixed effects regression (LMER). We found that Chinese NLMs are sensitive to plausibility and connectives, although they struggle to reproduce expectation reversal effects due to a connective changing the plausibility of a given scenario; Italian results are even less aligned with human data, with no effects of either plausibility and connectives on Surprisal.</p>\",\"PeriodicalId\":12536,\"journal\":{\"name\":\"Frontiers in Human Neuroscience\",\"volume\":\"18 \",\"pages\":\"1363120\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2024-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11471541/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Frontiers in Human Neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.3389/fnhum.2024.1363120\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q3\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Human Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fnhum.2024.1363120","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q3","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

心理语言学文献一致表明,在在线句子理解过程中,人类依靠对事件知识丰富而有序的理解来预测即将出现的语言输入。我们--作者--期望句子与前面的上下文保持一致,从而使一致的句子序列比不一致的句子序列更容易处理。众所周知,句子之间的话语关系(如时间关系、或然关系、比较关系)一般通过特定的微粒(即话语连接词,如and、but、because、after)来明确表达。然而,有些关系是说话者根据自己的事件知识很容易获得的,但也可能是隐含的。本文的目的是研究在人类语言处理和预训练语言模型中,话语连接词在事件预测中的重要性,特别关注让步词和对比词,它们向理解者发出的信号是,其与事件相关的预测必须颠倒过来。受先前工作的启发,我们用意大利语和普通话建立了一套全面的故事刺激,这些故事刺激在所描述情况的可信度和连贯性以及是否存在话语连接词方面各不相同。我们收集了母语人士对这些刺激的可信度判断和阅读时间。此外,我们还利用基于 Transformer 的语言模型获得的 Surprisal 分数,将实验结果与计算模型给出的预测结果进行了关联。我们使用七分李克特量表收集了人类的判断,并使用累积链接混合建模(CLMM)进行了分析,同时使用线性混合效应回归(LMER)分析了人类的阅读时间和语言模型的意外得分。我们发现,中文 NLMs 对可信度和连接词很敏感,但它们很难再现由于连接词改变了给定情景的可信度而导致的期望反转效应;意大利语的结果与人类数据更不一致,可信度和连接词对惊奇度都没有影响。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
On the influence of discourse connectives on the predictions of humans and language models.

Psycholinguistic literature has consistently shown that humans rely on a rich and organized understanding of event knowledge to predict the forthcoming linguistic input during online sentence comprehension. We, the authors, expect sentences to maintain coherence with the preceding context, making congruent sentence sequences easier to process than incongruent ones. It is widely known that discourse relations between sentences (e.g., temporal, contingency, comparison) are generally made explicit through specific particles, known as discourse connectives, (e.g., and, but, because, after). However, some relations that are easily accessible to the speakers, given their event knowledge, can also be left implicit. The goal of this paper is to investigate the importance of discourse connectives in the prediction of events in human language processing and pretrained language models, with a specific focus on concessives and contrastives, which signal to comprehenders that their event-related predictions have to be reversed. Inspired by previous work, we built a comprehensive set of story stimuli in Italian and Mandarin Chinese that differ in the plausibility and coherence of the situation being described and the presence or absence of a discourse connective. We collected plausibility judgments and reading times from native speakers for the stimuli. Moreover, we correlated the results of the experiments with the predictions given by computational modeling, using Surprisal scores obtained via Transformer-based language models. The human judgements were collected using a seven-point Likert scale and analyzed using cumulative link mixed modeling (CLMM), while the human reading times and language model surprisal scores were analyzed using linear mixed effects regression (LMER). We found that Chinese NLMs are sensitive to plausibility and connectives, although they struggle to reproduce expectation reversal effects due to a connective changing the plausibility of a given scenario; Italian results are even less aligned with human data, with no effects of either plausibility and connectives on Surprisal.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Frontiers in Human Neuroscience
Frontiers in Human Neuroscience 医学-神经科学
CiteScore
4.70
自引率
6.90%
发文量
830
审稿时长
2-4 weeks
期刊介绍: Frontiers in Human Neuroscience is a first-tier electronic journal devoted to understanding the brain mechanisms supporting cognitive and social behavior in humans, and how these mechanisms might be altered in disease states. The last 25 years have seen an explosive growth in both the methods and the theoretical constructs available to study the human brain. Advances in electrophysiological, neuroimaging, neuropsychological, psychophysical, neuropharmacological and computational approaches have provided key insights into the mechanisms of a broad range of human behaviors in both health and disease. Work in human neuroscience ranges from the cognitive domain, including areas such as memory, attention, language and perception to the social domain, with this last subject addressing topics, such as interpersonal interactions, social discourse and emotional regulation. How these processes unfold during development, mature in adulthood and often decline in aging, and how they are altered in a host of developmental, neurological and psychiatric disorders, has become increasingly amenable to human neuroscience research approaches. Work in human neuroscience has influenced many areas of inquiry ranging from social and cognitive psychology to economics, law and public policy. Accordingly, our journal will provide a forum for human research spanning all areas of human cognitive, social, developmental and translational neuroscience using any research approach.
期刊最新文献
Brain spatial reconciliation through multisensory integration in the impact of pandemic fatigue on workplace. Cognitive dysfunction in Moyamoya disease: latest developments and future directions. Development and validation of a two-dimensional pseudorandom balance perturbation test. Effects of fully immersive virtual reality training on cognitive function in patients with mild cognitive impairment: a systematic review and meta-analysis. Alterations in large-scale resting-state network nodes following transcranial focused ultrasound of deep brain structures.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1