Determining the importance of frequency and contextual diversity in the lexical organization of multiword expressions.

IF 1.1 4区 心理学 Q4 PSYCHOLOGY, EXPERIMENTAL Canadian Journal of Experimental Psychology-Revue Canadienne De Psychologie Experimentale Pub Date : 2022-06-01 Epub Date: 2022-02-10 DOI:10.1037/cep0000271
Marco S G Senaldi, Debra A Titone, Brendan T Johns
{"title":"Determining the importance of frequency and contextual diversity in the lexical organization of multiword expressions.","authors":"Marco S G Senaldi,&nbsp;Debra A Titone,&nbsp;Brendan T Johns","doi":"10.1037/cep0000271","DOIUrl":null,"url":null,"abstract":"<p><p>Corpus-based models of lexical strength have called into question the role of word frequency as an organizing principle of the lexicon, revealing that contextual and semantic diversity measures provide a closer fit to lexical behavior data (Adelman et al., 2006; Jones et al., 2012). Contextual diversity measures modify word frequency by ignoring word repetition in context, while semantic diversity measures consider the semantic consistency of contextual word occurrence. Recent research has shown that a better account of lexical organization data is provided by socially based measures of semantic diversity, which encode the communication patterns of individuals across discourses (Johns, 2021b). While most research on contextual diversity has focused on single words, recent corpus-based and experimental evidence suggests that an integral part of language use involves recurrent and more structurally complex units, such as multiword phrases and idioms. The aim of the present work was to determine if contextual and semantic diversity drive lexical organization at the level of multiword units (here, operationalized as idiomatic expressions), in addition to single words. To this end, we analyzed normative ratings of familiarity for 210 English idioms (Libben & Titone, 2008) using a set of contextual, semantic, and socially based diversity measures that were computed from a 55-billion word corpus of Reddit comments. The results confirm the superiority of diversity measures over frequency for multiword expressions, suggesting that multiword units, such as idiomatic phrases, show similar lexical organization dynamics as single words. (PsycInfo Database Record (c) 2022 APA, all rights reserved).</p>","PeriodicalId":51529,"journal":{"name":"Canadian Journal of Experimental Psychology-Revue Canadienne De Psychologie Experimentale","volume":"76 2","pages":"87-98"},"PeriodicalIF":1.1000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Canadian Journal of Experimental Psychology-Revue Canadienne De Psychologie Experimentale","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/cep0000271","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2022/2/10 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"PSYCHOLOGY, EXPERIMENTAL","Score":null,"Total":0}
引用次数: 2

Abstract

Corpus-based models of lexical strength have called into question the role of word frequency as an organizing principle of the lexicon, revealing that contextual and semantic diversity measures provide a closer fit to lexical behavior data (Adelman et al., 2006; Jones et al., 2012). Contextual diversity measures modify word frequency by ignoring word repetition in context, while semantic diversity measures consider the semantic consistency of contextual word occurrence. Recent research has shown that a better account of lexical organization data is provided by socially based measures of semantic diversity, which encode the communication patterns of individuals across discourses (Johns, 2021b). While most research on contextual diversity has focused on single words, recent corpus-based and experimental evidence suggests that an integral part of language use involves recurrent and more structurally complex units, such as multiword phrases and idioms. The aim of the present work was to determine if contextual and semantic diversity drive lexical organization at the level of multiword units (here, operationalized as idiomatic expressions), in addition to single words. To this end, we analyzed normative ratings of familiarity for 210 English idioms (Libben & Titone, 2008) using a set of contextual, semantic, and socially based diversity measures that were computed from a 55-billion word corpus of Reddit comments. The results confirm the superiority of diversity measures over frequency for multiword expressions, suggesting that multiword units, such as idiomatic phrases, show similar lexical organization dynamics as single words. (PsycInfo Database Record (c) 2022 APA, all rights reserved).

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
确定频率和语境多样性在多词表达词汇组织中的重要性。
基于语料库的词汇强度模型对词频作为词汇组织原则的作用提出了质疑,揭示了上下文和语义多样性测量更适合词汇行为数据(Adelman et al., 2006;Jones et al., 2012)。语境多样性测量通过忽略词在语境中的重复来修改词频,而语义多样性测量则考虑语境词出现的语义一致性。最近的研究表明,基于社会的语义多样性测量可以更好地解释词汇组织数据,它编码了跨语篇个体的交流模式(Johns, 2021b)。虽然大多数关于语境多样性的研究都集中在单个单词上,但最近基于语料库和实验的证据表明,语言使用的一个组成部分涉及到反复出现的、结构更复杂的单元,如多词短语和习语。本研究的目的是确定上下文和语义的多样性是否在多词单位(在这里,作为习惯表达进行操作)水平上驱动词汇组织,而不是单个单词。为此,我们分析了210个英语习语的熟悉度的标准评级(Libben & Titone, 2008),使用了一套上下文、语义和基于社会的多样性措施,这些措施是从550亿字的Reddit评论语料库中计算出来的。研究结果证实了多样性测量在多词表达中的优势,表明多词单位(如习语短语)表现出与单个词相似的词汇组织动态。(PsycInfo Database Record (c) 2022 APA,版权所有)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
2.30
自引率
7.70%
发文量
40
期刊介绍: The Canadian Journal of Experimental Psychology publishes original research papers that advance understanding of the field of experimental psychology, broadly considered. This includes, but is not restricted to, cognition, perception, motor performance, attention, memory, learning, language, decision making, development, comparative psychology, and neuroscience. The journal publishes - papers reporting empirical results that advance knowledge in a particular research area; - papers describing theoretical, methodological, or conceptual advances that are relevant to the interpretation of empirical evidence in the field; - brief reports (less than 2,500 words for the main text) that describe new results or analyses with clear theoretical or methodological import.
期刊最新文献
Math attitudes and verbal memory in multilingual younger adults. Multiple constraint network classification reveals functional brain networks distinguishing 0-back and 2-back task. Personal likelihood and event familiarity influence the simulation of future events. It is a "small world": Relations between performance on five spatial tasks and five mathematical tasks in undergraduate students. Proactive and reactive cognitive control in the absence of learning and memory confounds: Evidence from a cross-modal trial-unique Stroop task.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1