{"title":"CAOSS中的模式:分布式表示预测了熟悉和新复合词关系解释的变化","authors":"Fritz Günther , Marco Marelli","doi":"10.1016/j.cogpsych.2022.101471","DOIUrl":null,"url":null,"abstract":"<div><p><span>While distributional semantic models that represent word meanings as high-dimensional vectors induced from large text corpora have been shown to successfully predict human behavior across a wide range of tasks, they have also received criticism from different directions. These include concerns over their interpretability (how can numbers specifying abstract, latent dimensions represent meaning?) and their ability to capture variation in meaning (how can a single vector representation capture multiple different interpretations for the same expression?). Here, we demonstrate that semantic vectors can indeed rise up to these challenges, by training a mapping system (a simple linear regression) that predicts inter-individual variation in relational interpretations for compounds such as </span><em>wood brush</em> (for example <em>brush FOR wood</em>, or <em>brush MADE OF wood</em>) from (compositional) semantic vectors representing the meanings of these compounds. These predictions consistently beat different random baselines, both for familiar compounds (<em>moon light</em>, Experiment 1) as well as novel compounds (<em>wood brush</em>, Experiment 2), demonstrating that distributional semantic vectors encode variations in qualitative interpretations that can be decoded using techniques as simple as linear regression.</p></div>","PeriodicalId":50669,"journal":{"name":"Cognitive Psychology","volume":"134 ","pages":"Article 101471"},"PeriodicalIF":3.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Patterns in CAOSS: Distributed representations predict variation in relational interpretations for familiar and novel compound words\",\"authors\":\"Fritz Günther , Marco Marelli\",\"doi\":\"10.1016/j.cogpsych.2022.101471\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p><span>While distributional semantic models that represent word meanings as high-dimensional vectors induced from large text corpora have been shown to successfully predict human behavior across a wide range of tasks, they have also received criticism from different directions. These include concerns over their interpretability (how can numbers specifying abstract, latent dimensions represent meaning?) and their ability to capture variation in meaning (how can a single vector representation capture multiple different interpretations for the same expression?). Here, we demonstrate that semantic vectors can indeed rise up to these challenges, by training a mapping system (a simple linear regression) that predicts inter-individual variation in relational interpretations for compounds such as </span><em>wood brush</em> (for example <em>brush FOR wood</em>, or <em>brush MADE OF wood</em>) from (compositional) semantic vectors representing the meanings of these compounds. These predictions consistently beat different random baselines, both for familiar compounds (<em>moon light</em>, Experiment 1) as well as novel compounds (<em>wood brush</em>, Experiment 2), demonstrating that distributional semantic vectors encode variations in qualitative interpretations that can be decoded using techniques as simple as linear regression.</p></div>\",\"PeriodicalId\":50669,\"journal\":{\"name\":\"Cognitive Psychology\",\"volume\":\"134 \",\"pages\":\"Article 101471\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2022-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cognitive Psychology\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010028522000093\",\"RegionNum\":2,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PSYCHOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cognitive Psychology","FirstCategoryId":"102","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010028522000093","RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PSYCHOLOGY","Score":null,"Total":0}
引用次数: 3
摘要
虽然从大型文本语料库中提取的以高维向量表示词义的分布式语义模型已经被证明可以成功地预测人类在各种任务中的行为,但它们也受到了来自不同方向的批评。这些问题包括它们的可解释性(数字如何指定抽象的、潜在的维度来表示意义?)和它们捕获意义变化的能力(单个向量表示如何捕获同一表达的多个不同解释?)在这里,我们证明了语义向量确实可以应对这些挑战,通过训练一个映射系统(一个简单的线性回归),该映射系统可以从代表这些化合物含义的(组成)语义向量中预测化合物(例如木刷)(例如刷子for wood,或刷子MADE OF wood)的关系解释中的个体间变化。这些预测始终优于不同的随机基线,无论是熟悉的化合物(月光,实验1)还是新化合物(木刷,实验2),这表明分布语义向量在定性解释中编码变化,可以使用线性回归等简单技术解码。
Patterns in CAOSS: Distributed representations predict variation in relational interpretations for familiar and novel compound words
While distributional semantic models that represent word meanings as high-dimensional vectors induced from large text corpora have been shown to successfully predict human behavior across a wide range of tasks, they have also received criticism from different directions. These include concerns over their interpretability (how can numbers specifying abstract, latent dimensions represent meaning?) and their ability to capture variation in meaning (how can a single vector representation capture multiple different interpretations for the same expression?). Here, we demonstrate that semantic vectors can indeed rise up to these challenges, by training a mapping system (a simple linear regression) that predicts inter-individual variation in relational interpretations for compounds such as wood brush (for example brush FOR wood, or brush MADE OF wood) from (compositional) semantic vectors representing the meanings of these compounds. These predictions consistently beat different random baselines, both for familiar compounds (moon light, Experiment 1) as well as novel compounds (wood brush, Experiment 2), demonstrating that distributional semantic vectors encode variations in qualitative interpretations that can be decoded using techniques as simple as linear regression.
期刊介绍:
Cognitive Psychology is concerned with advances in the study of attention, memory, language processing, perception, problem solving, and thinking. Cognitive Psychology specializes in extensive articles that have a major impact on cognitive theory and provide new theoretical advances.
Research Areas include:
• Artificial intelligence
• Developmental psychology
• Linguistics
• Neurophysiology
• Social psychology.