首页 > 最新文献

Intelligence最新文献

英文 中文
Individual differences in spatial navigation and working memory 空间导航和工作记忆的个体差异
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-06-18 DOI: 10.1016/j.intell.2025.101932
Alexa K. Bushinski, Thomas S. Redick
Spatial navigation is a complex skill that relies on many aspects of cognition. Our study aims to clarify the role of working memory in spatial navigation, and particularly, the potentially separate contributions of verbal and visuospatial working memory. We leverage individual differences to understand how working memory differs among types of navigators and the predictive utility of verbal and visuospatial working memory. Data were analyzed from N = 253 healthy, young adults. Participants completed multiple measures of verbal and visuospatial working memory and a spatial navigation task called Virtual Silcton. We found that better navigators may rely more on visuospatial working memory. Additionally, using a relative weights analysis, we found that visuospatial working memory accounts for a large majority of variance in spatial navigation when compared to verbal working memory. Our results suggest individual differences in working memory are domain-specific in this context of spatial navigation, with visuospatial working memory being the primary contributor.
空间导航是一项复杂的技能,它依赖于认知的许多方面。我们的研究旨在阐明工作记忆在空间导航中的作用,特别是语言和视觉空间工作记忆的潜在独立贡献。我们利用个体差异来理解工作记忆在不同类型的导航者之间的差异,以及语言和视觉空间工作记忆的预测效用。数据分析来自253名健康的年轻成年人。参与者完成了多项口头和视觉空间工作记忆测试,以及一项名为“虚拟记忆”的空间导航任务。我们发现,更好的导航员可能更多地依赖于视觉空间工作记忆。此外,使用相对权重分析,我们发现与语言工作记忆相比,视觉空间工作记忆占了空间导航变异的大部分。我们的研究结果表明,在空间导航的背景下,工作记忆的个体差异是特定领域的,视觉空间工作记忆是主要的贡献者。
{"title":"Individual differences in spatial navigation and working memory","authors":"Alexa K. Bushinski,&nbsp;Thomas S. Redick","doi":"10.1016/j.intell.2025.101932","DOIUrl":"10.1016/j.intell.2025.101932","url":null,"abstract":"<div><div>Spatial navigation is a complex skill that relies on many aspects of cognition. Our study aims to clarify the role of working memory in spatial navigation, and particularly, the potentially separate contributions of verbal and visuospatial working memory. We leverage individual differences to understand how working memory differs among types of navigators and the predictive utility of verbal and visuospatial working memory. Data were analyzed from <em>N</em> = 253 healthy, young adults. Participants completed multiple measures of verbal and visuospatial working memory and a spatial navigation task called Virtual Silcton. We found that better navigators may rely more on visuospatial working memory. Additionally, using a relative weights analysis, we found that visuospatial working memory accounts for a large majority of variance in spatial navigation when compared to verbal working memory. Our results suggest individual differences in working memory are domain-specific in this context of spatial navigation, with visuospatial working memory being the primary contributor.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"111 ","pages":"Article 101932"},"PeriodicalIF":3.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144306984","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Heritability of metacognitive judgement of intelligence: A twin study on the Dunning-Kruger effect 智力元认知判断的遗传力:邓宁-克鲁格效应的双胞胎研究
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-06-04 DOI: 10.1016/j.intell.2025.101931
Joseph L. Nedelec , Curtis S. Dunkel , Dimitri van der Linden
Metacognition is a process that relates to thinking about thinking. Observed variation in metacognitive processes related to intelligence have often been referred to as the Dunning-Kruger effect (DKE). The DKE describes how individuals often overestimate their competence in a field where they lack expertise, while experts tend to slightly underestimate their competence. Applied to general intelligence, the DKE suggests discrepancies between self-assessed intelligence (SAI) and objective measures of intelligence. Recently, however, the methods used to assess the DKE have been subject to critique. The current study innovatively assessed the DKE by using a mechanistic and genetically informed approach. ACE decomposition models were estimated on a large sample of twins (n = 920; [nMZ = 388; nDZ = 532]) drawn from the restricted version of the National Longitudinal Study of Adolescent to Adult Health. Findings illustrated that about 44 % of the variance in a traditional measure of the DKE (difference scores: SAI – objective IQ) was accounted for by genetic factors in the full sample. However, the pattern differed over quartiles of objective IQ where genetic factors accounted for less of the variance in the lower quartiles (about 30 %) and increased to over 75 % of the variance in the highest quartile (remaining variance was due to nonshared environmental factors). Limitations notwithstanding (including a weak and relatively isolated DKE), the current study adds potential support for the validity of the DKE.
元认知是一个与思考有关的过程。观察到的与智力相关的元认知过程的变化通常被称为邓宁-克鲁格效应(DKE)。DKE描述了个人在缺乏专业知识的领域往往高估自己的能力,而专家往往会略微低估自己的能力。应用于一般智力,DKE表明了自我评估智力(SAI)和客观智力测量之间的差异。然而,最近用于评估DKE的方法受到了批评。目前的研究创新性地通过使用机械和遗传信息的方法评估了DKE。ACE分解模型在大量双胞胎样本上进行了估计(n = 920;[nMZ = 388;nDZ = 532]),摘自国家青少年到成人健康纵向研究的限制性版本。研究结果表明,在DKE(差异分数:SAI -客观智商)的传统测量中,约44%的方差是由整个样本中的遗传因素造成的。然而,这种模式在客观智商的四分位数上有所不同,遗传因素在较低的四分位数中占较少的方差(约30%),在最高的四分位数中增加到75%以上的方差(其余的方差是由于非共享的环境因素)。尽管存在局限性(包括弱且相对孤立的DKE),但当前的研究为DKE的有效性增加了潜在的支持。
{"title":"Heritability of metacognitive judgement of intelligence: A twin study on the Dunning-Kruger effect","authors":"Joseph L. Nedelec ,&nbsp;Curtis S. Dunkel ,&nbsp;Dimitri van der Linden","doi":"10.1016/j.intell.2025.101931","DOIUrl":"10.1016/j.intell.2025.101931","url":null,"abstract":"<div><div>Metacognition is a process that relates to thinking about thinking. Observed variation in metacognitive processes related to intelligence have often been referred to as the Dunning-Kruger effect (DKE). The DKE describes how individuals often overestimate their competence in a field where they lack expertise, while experts tend to slightly underestimate their competence. Applied to general intelligence, the DKE suggests discrepancies between self-assessed intelligence (SAI) and objective measures of intelligence. Recently, however, the methods used to assess the DKE have been subject to critique. The current study innovatively assessed the DKE by using a mechanistic and genetically informed approach. ACE decomposition models were estimated on a large sample of twins (<em>n</em> = 920; [<em>n</em><sub>MZ</sub> = 388; <em>n</em><sub>DZ</sub> = 532]) drawn from the restricted version of the National Longitudinal Study of Adolescent to Adult Health. Findings illustrated that about 44 % of the variance in a traditional measure of the DKE (difference scores: SAI – objective IQ) was accounted for by genetic factors in the full sample. However, the pattern differed over quartiles of objective IQ where genetic factors accounted for less of the variance in the lower quartiles (about 30 %) and increased to over 75 % of the variance in the highest quartile (remaining variance was due to nonshared environmental factors). Limitations notwithstanding (including a weak and relatively isolated DKE), the current study adds potential support for the validity of the DKE.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"111 ","pages":"Article 101931"},"PeriodicalIF":3.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144205195","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How accurately does self-reported intelligence reflect psychometrically measured IQ? 自我报告的智力反映心理测量学测量的智商有多准确?
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-06-21 DOI: 10.1016/j.intell.2025.101933
Jüri Allik, Helle Pullmann
Studies have demonstrated that people's self-reported intelligence (SRI) is only weakly correlated with their psychometrically measured IQ, which challenges the idea that asking someone how intelligent they are can serve as a reliable proxy for formal ability testing. Data collected from a large sample of Estonian schoolchildren aged 7 to 18 years (N = 4544) showed that only by around age of 10 do children's cognitive abilities develop to a level that allows them to make reasonably accurate self-assessments, as measured by the Raven Standard Progressive Matrices (RSPM). We conclude that meaningful comparisons of one's intellectual abilities with those of peers are only possible once general cognitive development has reached a stage of reflective intelligence, capable of using formal operations and aligning mental concepts with reality. One way to improve the agreement between the SRI and IQ test scores is to increase the reliability of subjective ratings, either by using more items or by enhancing inter-item correlations. However, this agreement has an upper limit, as discrepancies remain between psychologists' definitions of intelligence and lay conceptions, which often conflate intelligence with self-esteem and other unrelated constructs.
研究表明,人们的自我报告智力(SRI)与他们的心理测量学测量的智商只有微弱的相关性,这挑战了问一个人有多聪明可以作为正式能力测试的可靠代表的想法。从爱沙尼亚7至18岁的学童(N = 4544)的大样本中收集的数据表明,只有在10岁左右,儿童的认知能力才会发展到允许他们做出合理准确的自我评估的水平,这是由Raven标准渐进矩阵(RSPM)衡量的。我们得出的结论是,只有当一般认知发展达到反思智力的阶段,能够使用形式运算并使心理概念与现实保持一致时,才有可能与同龄人进行有意义的智力能力比较。提高SRI和IQ测试成绩之间一致性的一种方法是通过使用更多的项目或通过增强项目间的相关性来提高主观评分的可靠性。然而,这种一致是有上限的,因为心理学家对智力的定义和外行的概念之间仍然存在差异,后者经常将智力与自尊和其他不相关的概念混为一谈。
{"title":"How accurately does self-reported intelligence reflect psychometrically measured IQ?","authors":"Jüri Allik,&nbsp;Helle Pullmann","doi":"10.1016/j.intell.2025.101933","DOIUrl":"10.1016/j.intell.2025.101933","url":null,"abstract":"<div><div>Studies have demonstrated that people's self-reported intelligence (SRI) is only weakly correlated with their psychometrically measured IQ, which challenges the idea that asking someone how intelligent they are can serve as a reliable proxy for formal ability testing. Data collected from a large sample of Estonian schoolchildren aged 7 to 18 years (<em>N</em> = 4544) showed that only by around age of 10 do children's cognitive abilities develop to a level that allows them to make reasonably accurate self-assessments, as measured by the Raven Standard Progressive Matrices (RSPM). We conclude that meaningful comparisons of one's intellectual abilities with those of peers are only possible once general cognitive development has reached a stage of reflective intelligence, capable of using formal operations and aligning mental concepts with reality. One way to improve the agreement between the SRI and IQ test scores is to increase the reliability of subjective ratings, either by using more items or by enhancing inter-item correlations. However, this agreement has an upper limit, as discrepancies remain between psychologists' definitions of intelligence and lay conceptions, which often conflate intelligence with self-esteem and other unrelated constructs.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"111 ","pages":"Article 101933"},"PeriodicalIF":3.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144329659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reciprocal effects between information and communication technology literacy and conventional literacies 信息和通信技术素养与传统素养之间的相互影响
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-07-12 DOI: 10.1016/j.intell.2025.101936
Timo Gnambs
Information and communication technology (ICT) literacy encompasses a range of cognitive abilities that facilitate the effective use of digital technologies. Two studies on German students investigated the role of reading comprehension and mathematical competence in the development of ICT literacy in adolescence. A variance decomposition analysis (N = 13,335) revealed that both competence domains together accounted for nearly half of the explained item variances in two ICT literacy assessments. Additionally, a cross-lagged panel analysis (N = 4,872) demonstrated that reading and mathematical competencies predicted ICT literacy growth over three years, while ICT literacy also had reciprocal effects on domain-specific competencies. These findings emphasize that ICT literacy is not merely a technical skill set but is also closely related to other cognitive abilities.
信息和通信技术素养包括促进有效使用数字技术的一系列认知能力。两项针对德国学生的研究调查了阅读理解和数学能力在青少年ICT素养发展中的作用。方差分解分析(N = 13,335)显示,这两个能力领域共同占了两个ICT素养评估中解释项目方差的近一半。此外,交叉滞后面板分析(N = 4,872)表明,阅读和数学能力预测了三年内ICT素养的增长,而ICT素养对特定领域的能力也有相互影响。这些发现强调,信息通信技术素养不仅是一项技术技能,而且与其他认知能力密切相关。
{"title":"Reciprocal effects between information and communication technology literacy and conventional literacies","authors":"Timo Gnambs","doi":"10.1016/j.intell.2025.101936","DOIUrl":"10.1016/j.intell.2025.101936","url":null,"abstract":"<div><div>Information and communication technology (ICT) literacy encompasses a range of cognitive abilities that facilitate the effective use of digital technologies. Two studies on German students investigated the role of reading comprehension and mathematical competence in the development of ICT literacy in adolescence. A variance decomposition analysis (<em>N</em> = 13,335) revealed that both competence domains together accounted for nearly half of the explained item variances in two ICT literacy assessments. Additionally, a cross-lagged panel analysis (<em>N</em> = 4,872) demonstrated that reading and mathematical competencies predicted ICT literacy growth over three years, while ICT literacy also had reciprocal effects on domain-specific competencies. These findings emphasize that ICT literacy is not merely a technical skill set but is also closely related to other cognitive abilities.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"111 ","pages":"Article 101936"},"PeriodicalIF":3.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144604507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validating Rules: A non-verbal free fluid intelligence test 验证规则:非语言自由流体智力测试
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-05-29 DOI: 10.1016/j.intell.2025.101923
Sofie Van Cauwenberghe , Stijn Schelfhout , Elisabeth Roels , Jordi Heeren , Lieve De Wachter , Wouter Duyck , Nicolas Dirix
Intelligence is one of the strongest predictors of academic achievement. Fluid intelligence is one part of the construct, that can be measured by deductive and inductive reasoning. We set up a validation study of a free, non-verbal fluid intelligence test (Rules) in the context of study orientation. In this study, we investigate the reliability, distribution and structural validity of Rules, consisting of 28 items. Evidence from confirmatory multidimensional item response theory models suggests structural validity of the non-verbal reasoning test. For construct validity, a cross-validation between Rules and Raven's 2 Progressive Matrices in a sample of 235 last-year secondary school students resulted in a correlation of 0.62. Furthermore, we analyzed the predictive validity of the non-verbal reasoning test, which was administered to 32,585 last-year secondary school students. A standardized mathematics and language test were administered as a proxy for academic achievement scores. The results confirmed the predictive validity of the non-verbal reasoning test for cognitive achievement, with correlations of r = 0.61 for mathematics and r = 0.41 for language. Findings support the use of Rules in psychological practice, in particular for large-scale study exploration tools and low-stakes testing as a proxy for cognition or fluid reasoning.
智力是学术成就最有力的预测因素之一。流动智力是这个结构的一部分,可以通过演绎推理和归纳推理来衡量。我们在学习取向的背景下建立了一个自由的、非语言的流体智力测验(规则)的验证研究。在本研究中,我们研究了由28个条目组成的规则的信度、分布和结构效度。来自验证性多维项目反应理论模型的证据表明非言语推理测试具有结构效度。在构念效度方面,以235名初中生为样本,对规则与Raven’s 2递进矩阵进行交叉验证,相关系数为0.62。此外,我们分析了非语言推理测试的预测效度,该测试对32,585名高三学生进行了测试。一项标准化的数学和语言测试作为学业成绩的代表。结果证实了非语言推理测试对认知成就的预测有效性,数学和语言的相关性r = 0.61和r = 0.41。研究结果支持在心理学实践中使用规则,特别是在大规模研究探索工具和低风险测试中作为认知或流动推理的代理。
{"title":"Validating Rules: A non-verbal free fluid intelligence test","authors":"Sofie Van Cauwenberghe ,&nbsp;Stijn Schelfhout ,&nbsp;Elisabeth Roels ,&nbsp;Jordi Heeren ,&nbsp;Lieve De Wachter ,&nbsp;Wouter Duyck ,&nbsp;Nicolas Dirix","doi":"10.1016/j.intell.2025.101923","DOIUrl":"10.1016/j.intell.2025.101923","url":null,"abstract":"<div><div>Intelligence is one of the strongest predictors of academic achievement. Fluid intelligence is one part of the construct, that can be measured by deductive and inductive reasoning. We set up a validation study of a free, non-verbal fluid intelligence test (Rules) in the context of study orientation. In this study, we investigate the reliability, distribution and structural validity of Rules, consisting of 28 items. Evidence from confirmatory multidimensional item response theory models suggests structural validity of the non-verbal reasoning test. For construct validity, a cross-validation between Rules and Raven's 2 Progressive Matrices in a sample of 235 last-year secondary school students resulted in a correlation of 0.62. Furthermore, we analyzed the predictive validity of the non-verbal reasoning test, which was administered to 32,585 last-year secondary school students. A standardized mathematics and language test were administered as a proxy for academic achievement scores. The results confirmed the predictive validity of the non-verbal reasoning test for cognitive achievement, with correlations of <em>r</em> = 0.61 for mathematics and <em>r</em> = 0.41 for language. Findings support the use of Rules in psychological practice, in particular for large-scale study exploration tools and low-stakes testing as a proxy for cognition or fluid reasoning.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"111 ","pages":"Article 101923"},"PeriodicalIF":3.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144170440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Age and ability-based emotional intelligence: Evidence from the Geneva Emotional Competence Test 基于年龄和能力的情绪智力:来自日内瓦情绪能力测试的证据
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-07-02 DOI: 10.1016/j.intell.2025.101934
Gilles E. Gignac , Katja Schlegel
One criterion for considering emotional intelligence (EI) a true intelligence is the observation of an increase in EI across age. However, findings in this area have been mixed and predominantly based on a single measure, the MSCEIT. This study examined the relationship between age and ability-based emotional intelligence (EI) using the Geneva Emotional Competence Test (GECo) in a sample of 456 adults. Results indicated that total EI increases from early adulthood to approximately age 40 (≈ 9 EQ points), after which it plateaus and shows a modest decline in later adulthood. Notably, the emotion regulation subdimension showed no evidence of decline. Overall, these findings support the view that EI may be considered an intelligence, one that may be shaped by gains in crystallized abilities, but also potentially susceptible to later declines in fluid cognitive functioning.
将情绪智力(EI)视为真正智力的一个标准是观察到情商在不同年龄阶段的增长。然而,这一领域的研究结果参差不齐,主要基于单一的衡量标准,即MSCEIT。这项研究利用日内瓦情绪能力测试(GECo)对456名成年人进行了抽样调查,考察了年龄和基于能力的情绪智力(EI)之间的关系。结果表明,从成年早期到大约40岁(≈9 EQ点),总EI增加,之后趋于平稳,并在成年后期呈现适度下降。值得注意的是,情绪调节子维度没有显示出下降的迹象。总的来说,这些发现支持了这样一种观点,即情商可能被认为是一种智力,一种可能由结晶能力的增长形成的智力,但也可能容易受到后来液态认知功能下降的影响。
{"title":"Age and ability-based emotional intelligence: Evidence from the Geneva Emotional Competence Test","authors":"Gilles E. Gignac ,&nbsp;Katja Schlegel","doi":"10.1016/j.intell.2025.101934","DOIUrl":"10.1016/j.intell.2025.101934","url":null,"abstract":"<div><div>One criterion for considering emotional intelligence (EI) a true intelligence is the observation of an increase in EI across age. However, findings in this area have been mixed and predominantly based on a single measure, the MSCEIT. This study examined the relationship between age and ability-based emotional intelligence (EI) using the Geneva Emotional Competence Test (GECo) in a sample of 456 adults. Results indicated that total EI increases from early adulthood to approximately age 40 (≈ 9 EQ points), after which it plateaus and shows a modest decline in later adulthood. Notably, the emotion regulation subdimension showed no evidence of decline. Overall, these findings support the view that EI may be considered an intelligence, one that may be shaped by gains in crystallized abilities, but also potentially susceptible to later declines in fluid cognitive functioning.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"111 ","pages":"Article 101934"},"PeriodicalIF":3.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144524339","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cognitive performance from infancy to adulthood in Japan from 1998–2001 to 2014–2019: A study using the Kyoto Scale of Psychological Development 1998-2001年至2014-2019年日本婴儿期至成年期的认知表现:使用京都心理发展量表的研究
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-07-01 Epub Date: 2025-07-12 DOI: 10.1016/j.intell.2025.101937
Hideyo Goma , Shun Tanaka , Toshiki Matsuoka , Hiroyuki Shimizu , Satomi Shimizu , Yui Zen , Emi Adachi , Motoko Ishikawa , Karri Silventoinen
Declines in cognitive performance have been observed in many Western countries in the 21st century, but it is still unclear whether similar declines are occurring in East Asian countries. We examined trends in cognitive performance over a 20-year period in Japan and analyzed how these trends varied from infancy to adulthood and between males and females. Data were collected between 1998 and 2001 (n = 2677; 49 % females) and 2014 and 2019 (n = 3243; 51 % females) using the Kyoto Scale of Psychological Development and analyzed using linear regression models. We found that cognitive performance had improved over the years for boys and girls aged 3 to 10 years old, with only minor changes observed at younger and older ages. Girls showed an advantage in cognitive development over boys from 1 to 3 years of age, while boys outperformed girls from 10 years of age into adulthood. We found suggestive evidence of slightly faster improvement in cognitive development over the years in boys compared to girls. These results were roughly similar for general cognitive development, cognitive-adaptive development, and language-social development. These findings suggest that the cognitive performance of Japanese children has continued to improve over the past two decades. Japan provides an example that the decline in cognitive performance observed in many industrialized countries in the 21st century is not inevitable.
在21世纪,许多西方国家都观察到认知能力的下降,但尚不清楚东亚国家是否也出现了类似的下降。我们研究了日本20年来认知表现的趋势,并分析了这些趋势从婴儿期到成年期以及男性和女性之间的变化。数据收集于1998年至2001年(n = 2677;49%女性)以及2014年和2019年(n = 3243;使用京都心理发展量表,并使用线性回归模型进行分析。我们发现,随着时间的推移,3到10岁的男孩和女孩的认知能力有所提高,在更小和更大的年龄只有微小的变化。从1岁到3岁,女孩在认知发展方面比男孩有优势,而从10岁到成年,男孩的表现优于女孩。我们发现了启发性的证据,表明多年来男孩在认知发展方面的改善略快于女孩。这些结果在一般认知发展、认知适应性发展和语言社会发展方面大致相似。这些发现表明,在过去的二十年里,日本儿童的认知能力一直在不断提高。日本提供了一个例子,说明21世纪许多工业化国家的认知能力下降并不是不可避免的。
{"title":"Cognitive performance from infancy to adulthood in Japan from 1998–2001 to 2014–2019: A study using the Kyoto Scale of Psychological Development","authors":"Hideyo Goma ,&nbsp;Shun Tanaka ,&nbsp;Toshiki Matsuoka ,&nbsp;Hiroyuki Shimizu ,&nbsp;Satomi Shimizu ,&nbsp;Yui Zen ,&nbsp;Emi Adachi ,&nbsp;Motoko Ishikawa ,&nbsp;Karri Silventoinen","doi":"10.1016/j.intell.2025.101937","DOIUrl":"10.1016/j.intell.2025.101937","url":null,"abstract":"<div><div>Declines in cognitive performance have been observed in many Western countries in the 21st century, but it is still unclear whether similar declines are occurring in East Asian countries. We examined trends in cognitive performance over a 20-year period in Japan and analyzed how these trends varied from infancy to adulthood and between males and females. Data were collected between 1998 and 2001 (<em>n</em> = 2677; 49 % females) and 2014 and 2019 (<em>n</em> = 3243; 51 % females) using the Kyoto Scale of Psychological Development and analyzed using linear regression models. We found that cognitive performance had improved over the years for boys and girls aged 3 to 10 years old, with only minor changes observed at younger and older ages. Girls showed an advantage in cognitive development over boys from 1 to 3 years of age, while boys outperformed girls from 10 years of age into adulthood. We found suggestive evidence of slightly faster improvement in cognitive development over the years in boys compared to girls. These results were roughly similar for general cognitive development, cognitive-adaptive development, and language-social development. These findings suggest that the cognitive performance of Japanese children has continued to improve over the past two decades. Japan provides an example that the decline in cognitive performance observed in many industrialized countries in the 21st century is not inevitable.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"111 ","pages":"Article 101937"},"PeriodicalIF":3.3,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144611600","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Psychometrically derived 60-question benchmarks: Substantial efficiencies and the possibility of human-AI comparisons 心理测量学衍生的60个问题基准:实质性的效率和人类与人工智能比较的可能性
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-05-01 Epub Date: 2025-05-22 DOI: 10.1016/j.intell.2025.101922
Gilles E. Gignac , David Ilić
Large Language Model (LLM) benchmark evaluation tests often comprise thousands of questions. Based on psychometric principles, reliable and valid benchmark tests can likely be developed with as few as 60 items, comparable to human intelligence tests, which typically include only 15 to 60 items. The establishment of shorter benchmark tests offers numerous potential benefits, including more efficient evaluation of LLMs, the practical feasibility of creating parallel forms, and the ability to directly compare LLM performance with human capabilities. Consequently, we analysed the performance of 591 LLMs across three widely recognized benchmarks—HellaSwag, Winogrande, and GSM8K—and developed short-forms (≈ 60 questions each) using psychometric principles. The short-forms exhibited high internal consistency reliability, with coefficient omega values ranging from 0.96 for Winogrande to 0.99 for HellaSwag and GSM8K. Additionally, strong correlations between short- and long-form scores (r ≈ 0.90) provided evidence of concurrent validity. Finally, model size (number of parameters) was a slightly stronger predictor of overall LLM performance for the short-forms compared to the long-forms, indicating that the short forms exhibited comparable, if not slightly superior, convergent validity. It is concluded that shorter benchmarks may accelerate AI development by enabling more efficient evaluations. Additionally, research into the nature of intelligence may be facilitated by benchmark short-forms by enabling direct comparisons between AI and human performance.
大型语言模型(LLM)基准评估测试通常包含数千个问题。根据心理测量学原理,可靠和有效的基准测试可能只需要60个项目,而人类智力测试通常只包括15到60个项目。建立更短的基准测试提供了许多潜在的好处,包括更有效地评估LLM、创建并行表单的实际可行性,以及直接将LLM性能与人类能力进行比较的能力。因此,我们分析了591名法学硕士在三个广泛认可的基准(hellaswag、Winogrande和gsm8k)上的表现,并使用心理测量学原理开发了简短的表格(每个约60个问题)。短格式具有较高的内部一致性信度,Winogrande的欧米茄系数为0.96,HellaSwag和GSM8K的欧米茄系数为0.99。此外,短格式和长格式分数之间的强相关性(r≈0.90)提供了并发效度的证据。最后,与长格式相比,模型大小(参数数量)对短格式的整体LLM性能的预测略强,这表明短格式表现出相当的收敛效度,如果不是略优的话。结论是,更短的基准可以通过更有效的评估来加速人工智能的发展。此外,通过直接比较人工智能和人类的表现,基准简短形式可能会促进对智能本质的研究。
{"title":"Psychometrically derived 60-question benchmarks: Substantial efficiencies and the possibility of human-AI comparisons","authors":"Gilles E. Gignac ,&nbsp;David Ilić","doi":"10.1016/j.intell.2025.101922","DOIUrl":"10.1016/j.intell.2025.101922","url":null,"abstract":"<div><div>Large Language Model (LLM) benchmark evaluation tests often comprise thousands of questions. Based on psychometric principles, reliable and valid benchmark tests can likely be developed with as few as 60 items, comparable to human intelligence tests, which typically include only 15 to 60 items. The establishment of shorter benchmark tests offers numerous potential benefits, including more efficient evaluation of LLMs, the practical feasibility of creating parallel forms, and the ability to directly compare LLM performance with human capabilities. Consequently, we analysed the performance of 591 LLMs across three widely recognized benchmarks—HellaSwag, Winogrande, and GSM8K—and developed short-forms (≈ 60 questions each) using psychometric principles. The short-forms exhibited high internal consistency reliability, with coefficient omega values ranging from 0.96 for Winogrande to 0.99 for HellaSwag and GSM8K. Additionally, strong correlations between short- and long-form scores (<em>r</em> ≈ 0.90) provided evidence of concurrent validity. Finally, model size (number of parameters) was a slightly stronger predictor of overall LLM performance for the short-forms compared to the long-forms, indicating that the short forms exhibited comparable, if not slightly superior, convergent validity. It is concluded that shorter benchmarks may accelerate AI development by enabling more efficient evaluations. Additionally, research into the nature of intelligence may be facilitated by benchmark short-forms by enabling direct comparisons between AI and human performance.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"110 ","pages":"Article 101922"},"PeriodicalIF":3.3,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144115844","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reevaluating the Flynn effect, and the reversal: Temporal trends and measurement invariance in Norwegian armed forces intelligence scores 重新评估弗林效应和逆转:挪威武装部队智力得分的时间趋势和测量不变性
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-05-01 Epub Date: 2025-04-01 DOI: 10.1016/j.intell.2025.101909
Morten Nordmo , Tore Nøttestad Norrøne , Ole Christian Lang-Ree
Since 1954, the Norwegian Armed Forces have annually administered an unchanged general mental ability test to male cohorts, comprising figure matrices, word similarities, and mathematical reasoning tests. These stable and representative data have supported various claims about shifts in general mental ability (GMA) levels, notably the Flynn effect and its reversal, influencing extensive research linking these scores with health and other outcomes. This study examines whether observed temporal trends in scores reflect changes in latent intelligence or are confounded by evolving test characteristics and specific test-taking abilities in numerical reasoning, word comprehension, and figure matrices reasoning. Our findings, using multiple-group factor analysis and multiple indicator multiple cause (MIMIC) models, indicate that while there was a general upward trend in observed scores until 1993, this was predominantly driven by enhancements in the fluid intelligence task, specifically figure matrices reasoning. Notably, these gains do not uniformly translate to a rise in underlying GMA, suggesting the presence of domain-specific improvements and test characteristic changes over time. Conversely, the observed decline is primarily due to decreases in word comprehension and numerical reasoning tests, also reflecting specific abilities not attributable to changes in the latent GMA factor. Our findings further challenge the validity of claims that changes in the general factor drive the Flynn effect and its reversal. Furthermore, they caution against using these scores for longitudinal studies without accounting for changes in test characteristics.
自1954年以来,挪威武装部队每年对男性进行一项不变的一般心理能力测试,包括图形矩阵、单词相似度和数学推理测试。这些稳定且具有代表性的数据支持了关于一般心理能力(GMA)水平变化的各种说法,特别是弗林效应及其逆转,影响了将这些分数与健康和其他结果联系起来的广泛研究。本研究探讨了观察到的分数的时间趋势是否反映了潜在智力的变化,或者是由不断发展的测试特征和特定的测试能力在数值推理、文字理解和图形矩阵推理中混淆。我们使用多组因素分析和多指标多原因(MIMIC)模型的研究结果表明,尽管在1993年之前观察到的分数总体呈上升趋势,但这主要是由流体智力任务的增强所驱动的,特别是图形矩阵推理。值得注意的是,这些收益并没有统一地转化为潜在GMA的上升,这表明随着时间的推移存在特定领域的改进和测试特征的变化。相反,观察到的下降主要是由于单词理解和数字推理测试的下降,也反映了不归因于潜在GMA因素变化的特定能力。我们的研究结果进一步挑战了一般因素的变化驱动弗林效应及其逆转的说法的有效性。此外,他们警告不要在没有考虑测试特征变化的情况下将这些分数用于纵向研究。
{"title":"Reevaluating the Flynn effect, and the reversal: Temporal trends and measurement invariance in Norwegian armed forces intelligence scores","authors":"Morten Nordmo ,&nbsp;Tore Nøttestad Norrøne ,&nbsp;Ole Christian Lang-Ree","doi":"10.1016/j.intell.2025.101909","DOIUrl":"10.1016/j.intell.2025.101909","url":null,"abstract":"<div><div>Since 1954, the Norwegian Armed Forces have annually administered an unchanged general mental ability test to male cohorts, comprising figure matrices, word similarities, and mathematical reasoning tests. These stable and representative data have supported various claims about shifts in general mental ability (GMA) levels, notably the Flynn effect and its reversal, influencing extensive research linking these scores with health and other outcomes. This study examines whether observed temporal trends in scores reflect changes in latent intelligence or are confounded by evolving test characteristics and specific test-taking abilities in numerical reasoning, word comprehension, and figure matrices reasoning. Our findings, using multiple-group factor analysis and multiple indicator multiple cause (MIMIC) models, indicate that while there was a general upward trend in observed scores until 1993, this was predominantly driven by enhancements in the fluid intelligence task, specifically figure matrices reasoning. Notably, these gains do not uniformly translate to a rise in underlying GMA, suggesting the presence of domain-specific improvements and test characteristic changes over time. Conversely, the observed decline is primarily due to decreases in word comprehension and numerical reasoning tests, also reflecting specific abilities not attributable to changes in the latent GMA factor. Our findings further challenge the validity of claims that changes in the general factor drive the Flynn effect and its reversal. Furthermore, they caution against using these scores for longitudinal studies without accounting for changes in test characteristics.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"110 ","pages":"Article 101909"},"PeriodicalIF":3.3,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143748461","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The other half of intelligence: An obstacle-racecourse performance-based model of intelligence in action 智力的另一半以障碍赛跑成绩为基础的行动智力模型
IF 3.3 2区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2025-05-01 Epub Date: 2025-04-18 DOI: 10.1016/j.intell.2025.101919
Robert J. Sternberg
Intelligence in action, or as performance, is almost inevitably at a different level from intelligence as some theoretical level of competence. The article considers a metaphor of an obstacle race as illustrating how intelligence functions in everyday practice. Intelligence as performance always includes responses to the obstacles one confronts. The article opens with an introduction explaining the racetrack metaphor. The article then considers the various obstacles to the deployment of intelligence, including both primarily internal and primarily external obstacles. The article then considers three models for the relationship between intelligence as competence and intelligence as performance. Finally, the article draws some conclusions about intelligence as it acts in the everyday world.
行动中的智力,或作为表现的智力,几乎不可避免地与作为某种理论能力水平的智力处于不同的水平。这篇文章用一个障碍赛跑的比喻来说明智力在日常实践中是如何发挥作用的。智力作为一种表现,总是包括对所面临的障碍的反应。文章以介绍赛马场比喻作为开头。然后,文章考虑了部署情报的各种障碍,主要包括内部和外部障碍。然后,本文考虑了作为能力的智力和作为绩效的智力之间关系的三种模型。最后,文章得出了一些关于智能在日常生活中的作用的结论。
{"title":"The other half of intelligence: An obstacle-racecourse performance-based model of intelligence in action","authors":"Robert J. Sternberg","doi":"10.1016/j.intell.2025.101919","DOIUrl":"10.1016/j.intell.2025.101919","url":null,"abstract":"<div><div>Intelligence in action, or as performance, is almost inevitably at a different level from intelligence as some theoretical level of competence. The article considers a metaphor of an obstacle race as illustrating how intelligence functions in everyday practice. Intelligence as performance always includes responses to the obstacles one confronts. The article opens with an introduction explaining the racetrack metaphor. The article then considers the various obstacles to the deployment of intelligence, including both primarily internal and primarily external obstacles. The article then considers three models for the relationship between intelligence as competence and intelligence as performance. Finally, the article draws some conclusions about intelligence as it acts in the everyday world.</div></div>","PeriodicalId":13862,"journal":{"name":"Intelligence","volume":"110 ","pages":"Article 101919"},"PeriodicalIF":3.3,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143848083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Intelligence
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1