首页 > 最新文献

Behavior Research Methods最新文献

英文 中文
The Typability Index: A tool for measuring and controlling for typing difficulty in text stimuli. 可打字性指数:测量和控制文本刺激输入困难的工具。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-02-12 DOI: 10.3758/s13428-025-02877-y
Emily A Williams, Matthew Warburton, Martin Krzywinski, Faisal Mushtaq

In typing proficiency tests, like those used in job recruitment or research studies, individuals are evaluated based on their speed and accuracy. However, the difficulty of the typed text, its 'typability', can impact typing performance, introducing variability that is unrelated to skill. To ensure valid comparisons across individuals, time, and conditions, it is crucial to control for this variation in text difficulty. To address this issue, we develop the Typability Index, a model that predicts the relative typing speed of text. Building on earlier attempts to quantify typing difficulty from the 1940s, we create a more advanced typability model using the 136 Million (136 M) Keystrokes Dataset (Dhakal et al., Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1-12, 2018), where over 168,000 participants each typed 15 sentences from a pool of 1,525 items. Through random forest regression, we identify eight key predictors from 30 candidate variables, including the proportion of lowercase letters, word frequency, and syllables per word. Trained on 80% of the dataset and validated on the remaining 20% and a novel dataset, the Typability Index explained 68-88% of the variance in typability, compared to the 34% explained by an earlier leading model (Bell, Unpublished Doctor's Dissertation, University of Oklahoma, 1949). To promote higher control in typing research and assessments, we introduce a web-based tool to facilitate accurate measurement and fair comparisons of text typability.

在打字能力测试中,就像在招聘或研究中使用的测试一样,个人的评估是基于他们的速度和准确性。然而,输入文本的难度,即“可排版性”,会影响打字性能,引入与技能无关的可变性。为了确保在个体、时间和条件之间进行有效的比较,控制文本难度的这种变化是至关重要的。为了解决这个问题,我们开发了Typability Index,这是一个预测文本相对打字速度的模型。在20世纪40年代早期量化打字难度的尝试的基础上,我们使用1.36亿(136 M)击键数据集(Dhakal等人,2018年CHI计算系统人为因素会议记录,1- 12,2018)创建了一个更高级的可分类性模型,其中超过168,000名参与者每人从1,525个项目池中键入15个句子。通过随机森林回归,我们从30个候选变量中确定了8个关键预测因子,包括小写字母的比例、词频和每个单词的音节。在80%的数据集上进行训练,并在剩下的20%和一个新的数据集上进行验证,可Typability指数解释了68-88%的可Typability差异,而早期的领先模型解释了34% (Bell,未发表的博士论文,University of Oklahoma, 1949)。为了加强对打字研究和评估的控制,我们引入了一个基于网络的工具,以方便准确测量和公平比较文本可打字性。
{"title":"The Typability Index: A tool for measuring and controlling for typing difficulty in text stimuli.","authors":"Emily A Williams, Matthew Warburton, Martin Krzywinski, Faisal Mushtaq","doi":"10.3758/s13428-025-02877-y","DOIUrl":"10.3758/s13428-025-02877-y","url":null,"abstract":"<p><p>In typing proficiency tests, like those used in job recruitment or research studies, individuals are evaluated based on their speed and accuracy. However, the difficulty of the typed text, its 'typability', can impact typing performance, introducing variability that is unrelated to skill. To ensure valid comparisons across individuals, time, and conditions, it is crucial to control for this variation in text difficulty. To address this issue, we develop the Typability Index, a model that predicts the relative typing speed of text. Building on earlier attempts to quantify typing difficulty from the 1940s, we create a more advanced typability model using the 136 Million (136 M) Keystrokes Dataset (Dhakal et al., Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1-12, 2018), where over 168,000 participants each typed 15 sentences from a pool of 1,525 items. Through random forest regression, we identify eight key predictors from 30 candidate variables, including the proportion of lowercase letters, word frequency, and syllables per word. Trained on 80% of the dataset and validated on the remaining 20% and a novel dataset, the Typability Index explained 68-88% of the variance in typability, compared to the 34% explained by an earlier leading model (Bell, Unpublished Doctor's Dissertation, University of Oklahoma, 1949). To promote higher control in typing research and assessments, we introduce a web-based tool to facilitate accurate measurement and fair comparisons of text typability.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"61"},"PeriodicalIF":3.9,"publicationDate":"2026-02-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12901113/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146177629","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Distinguishing abstraction from abstractness: Specificity norms for 8,500 English words. 抽象与抽象性的区别:8500个英语单词的特异性规范。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-02-11 DOI: 10.3758/s13428-026-02949-7
Emiko J Muraki, Penny M Pexman

Psycholinguistic norms quantify dimensions of word form, use, and/or meaning and are important for studies involving word stimuli. Among the most commonly used norms are those for concreteness, quantifying the degree to which a word's referent can be understood through the senses. The dimension of concreteness has been found to be related to performance in language processing and memory tasks, yet recent research suggests that (1) ratings of concreteness may be conflated with how specific a word's meaning is, and (2) concreteness and specificity may have opposite effects on lexical-semantic processing. In the present study, we collected specificity scores for a large sample of English words, assessed their reliability and validity, and tested their relationships to lexical processing and memory accuracy. We found that specificity and concreteness are indeed correlated and have different behavioural effects. This large dataset of specificity scores will be an important resource for future lexical-semantic studies.

心理语言规范量化了词汇形式、用法和/或意义的维度,这对涉及词汇刺激的研究很重要。最常用的规范是具体规范,量化一个词的所指物可以通过感官理解的程度。具体的维度已经被发现与语言处理和记忆任务的表现有关,但最近的研究表明:(1)具体的评级可能与单词的含义的具体程度相混淆;(2)具体和具体可能对词汇语义处理产生相反的影响。在本研究中,我们收集了大量英语单词的特异性分数,评估了它们的信度和效度,并测试了它们与词汇加工和记忆准确性的关系。我们发现,特异性和具体性确实是相关的,并有不同的行为影响。这一庞大的特异性分数数据集将成为未来词汇语义研究的重要资源。
{"title":"Distinguishing abstraction from abstractness: Specificity norms for 8,500 English words.","authors":"Emiko J Muraki, Penny M Pexman","doi":"10.3758/s13428-026-02949-7","DOIUrl":"10.3758/s13428-026-02949-7","url":null,"abstract":"<p><p>Psycholinguistic norms quantify dimensions of word form, use, and/or meaning and are important for studies involving word stimuli. Among the most commonly used norms are those for concreteness, quantifying the degree to which a word's referent can be understood through the senses. The dimension of concreteness has been found to be related to performance in language processing and memory tasks, yet recent research suggests that (1) ratings of concreteness may be conflated with how specific a word's meaning is, and (2) concreteness and specificity may have opposite effects on lexical-semantic processing. In the present study, we collected specificity scores for a large sample of English words, assessed their reliability and validity, and tested their relationships to lexical processing and memory accuracy. We found that specificity and concreteness are indeed correlated and have different behavioural effects. This large dataset of specificity scores will be an important resource for future lexical-semantic studies.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"60"},"PeriodicalIF":3.9,"publicationDate":"2026-02-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146163777","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The Chameleon Paradigm: An effective method for masking biological motion stimuli. 变色龙范式:掩盖生物运动刺激的有效方法。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-02-04 DOI: 10.3758/s13428-025-02924-8
Jiaxu Zhao, Xin He, Yi Jiang, Min Bao

Continuous flash suppression (CFS) is widely used in research on unconscious visual processing due to its long-lasting masking. While CFS effectively masks static stimuli, its application to motion stimuli remains challenging. To resolve this issue, our previous work developed the Chameleon-1 paradigm (Zhao & Bao, 2022), an enhanced CFS technique that enables robust masking of translational motion stimuli for up to 10 s through precise spatiotemporal matching of color dynamics between target and masking stimuli. The current study systematically evaluated and optimized this paradigm through three studies. We first assessed the masking efficacy of the Chameleon-1 paradigm across different motion parameters and patterns (Study 1). Because Chameleon-1 failed to effectively mask biological motion (BM) stimuli, we then upgraded the paradigm to accommodate BM stimulus characteristics (Study 2). The results demonstrated that this Chameleon-2 paradigm achieved superior masking efficacy for BM stimuli, with average breakthrough time extended by over two-fold compared to Chameleon-1 and breakthrough ratios approximately 75% for upright and 45% for inverted BM stimuli during 10-s of BM presentation. We further employed this paradigm to investigate the neural correlates of conscious and unconscious BM processing using functional near-infrared spectroscopy in Study 3. Our work establishes a robust paradigm for sustained masking of BM stimuli and validates its utility in unconscious processing research. We also provide new insights into the neural mechanisms underlying unconscious BM perception.

连续闪光抑制(CFS)由于其持久的掩蔽作用而被广泛应用于无意识视觉加工的研究中。虽然CFS有效地掩盖了静态刺激,但它在运动刺激中的应用仍然具有挑战性。为了解决这个问题,我们之前的工作开发了变色龙-1范式(Zhao & Bao, 2022),这是一种增强的CFS技术,通过精确的目标和掩蔽刺激之间的颜色动态时空匹配,可以对平移运动刺激进行长达10秒的稳健掩蔽。本研究通过三项研究对这一范式进行了系统的评价和优化。我们首先评估了变色龙-1范式在不同运动参数和模式下的掩蔽效果(研究1)。由于变色龙-1未能有效地掩盖生物运动(BM)刺激,我们随后升级了范式以适应BM刺激特征(研究2)。结果表明,“变色龙-2”模式对脑刺激的掩蔽效果较“变色龙-1”模式好,平均突破时间比“变色龙-1”模式延长了2倍以上,在10秒内,直立脑刺激的突破率约为75%,倒转脑刺激的突破率约为45%。在研究3中,我们进一步利用功能近红外光谱研究了有意识和无意识脑电信号加工的神经相关因素。我们的工作为脑脑刺激的持续掩蔽建立了一个强大的范式,并验证了其在无意识加工研究中的实用性。我们也为无意识脑损伤感知的神经机制提供了新的见解。
{"title":"The Chameleon Paradigm: An effective method for masking biological motion stimuli.","authors":"Jiaxu Zhao, Xin He, Yi Jiang, Min Bao","doi":"10.3758/s13428-025-02924-8","DOIUrl":"10.3758/s13428-025-02924-8","url":null,"abstract":"<p><p>Continuous flash suppression (CFS) is widely used in research on unconscious visual processing due to its long-lasting masking. While CFS effectively masks static stimuli, its application to motion stimuli remains challenging. To resolve this issue, our previous work developed the Chameleon-1 paradigm (Zhao & Bao, 2022), an enhanced CFS technique that enables robust masking of translational motion stimuli for up to 10 s through precise spatiotemporal matching of color dynamics between target and masking stimuli. The current study systematically evaluated and optimized this paradigm through three studies. We first assessed the masking efficacy of the Chameleon-1 paradigm across different motion parameters and patterns (Study 1). Because Chameleon-1 failed to effectively mask biological motion (BM) stimuli, we then upgraded the paradigm to accommodate BM stimulus characteristics (Study 2). The results demonstrated that this Chameleon-2 paradigm achieved superior masking efficacy for BM stimuli, with average breakthrough time extended by over two-fold compared to Chameleon-1 and breakthrough ratios approximately 75% for upright and 45% for inverted BM stimuli during 10-s of BM presentation. We further employed this paradigm to investigate the neural correlates of conscious and unconscious BM processing using functional near-infrared spectroscopy in Study 3. Our work establishes a robust paradigm for sustained masking of BM stimuli and validates its utility in unconscious processing research. We also provide new insights into the neural mechanisms underlying unconscious BM perception.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"59"},"PeriodicalIF":3.9,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146117507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
How well do large language models mirror human cognition of word concepts?: A comparison of psychological ratings for early-acquired English words. 大型语言模型在多大程度上反映了人类对单词概念的认知?早期习得英语词汇的心理评分比较。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-02-02 DOI: 10.3758/s13428-025-02938-2
Hiromichi Hagihara, Kazuki Miyazawa

This study examined how well large language models (LLMs) approximate human psychological ratings for early-acquired English words. We used four state-of-the-art LLMs, including GPT-4o and Meta-Llama-3.1, to evaluate 21 static psychological features for 695 words and compared these estimates with human norms. The results showed that LLMs aligned well with human ratings for some features (e.g., Concreteness, Bodily Interactiveness) in terms of rank correlations (rs > .82) and distributional similarities but diverged notably for others (e.g., Iconicity, Arousal; rs < .48). Compared with content words, function words showed more pronounced discrepancies between human and LLM ratings. We also assessed how similarly human- and LLM-derived psychological features predicted words' age of acquisition (AoA), revealing both strong correspondences and systematic biases, depending on the model (differences in correlations ranged from -.27 to .28). Based on these analyses, we identified which features may be reliably estimated using LLMs, which require further refinement, and what methodological considerations are necessary for applying LLM-based measures in cognitive science. We discuss the implications of using LLMs as methodological tools in psychology and cognitive science, highlighting both their practical advantages (e.g., data coverage and data collection efficiency) and theoretical relevance. The present study provides a novel framework for evaluating the cognitive plausibility of LLMs by using lexical psychological features, complementing existing benchmarks.

本研究考察了大型语言模型(llm)在多大程度上近似人类对早期习得英语单词的心理评分。我们使用了四个最先进的llm,包括gpt - 40和meta - lama-3.1,来评估695个单词的21个静态心理特征,并将这些估计与人类标准进行比较。结果表明,llm与人类对某些特征(例如,具体性,身体互动性)的评级在等级相关性(rs - 0.82)和分布相似性方面保持一致,但在其他特征(例如,象似性,唤醒;rs - 0.82)方面存在显著差异
{"title":"How well do large language models mirror human cognition of word concepts?: A comparison of psychological ratings for early-acquired English words.","authors":"Hiromichi Hagihara, Kazuki Miyazawa","doi":"10.3758/s13428-025-02938-2","DOIUrl":"10.3758/s13428-025-02938-2","url":null,"abstract":"<p><p>This study examined how well large language models (LLMs) approximate human psychological ratings for early-acquired English words. We used four state-of-the-art LLMs, including GPT-4o and Meta-Llama-3.1, to evaluate 21 static psychological features for 695 words and compared these estimates with human norms. The results showed that LLMs aligned well with human ratings for some features (e.g., Concreteness, Bodily Interactiveness) in terms of rank correlations (rs > .82) and distributional similarities but diverged notably for others (e.g., Iconicity, Arousal; rs < .48). Compared with content words, function words showed more pronounced discrepancies between human and LLM ratings. We also assessed how similarly human- and LLM-derived psychological features predicted words' age of acquisition (AoA), revealing both strong correspondences and systematic biases, depending on the model (differences in correlations ranged from -.27 to .28). Based on these analyses, we identified which features may be reliably estimated using LLMs, which require further refinement, and what methodological considerations are necessary for applying LLM-based measures in cognitive science. We discuss the implications of using LLMs as methodological tools in psychology and cognitive science, highlighting both their practical advantages (e.g., data coverage and data collection efficiency) and theoretical relevance. The present study provides a novel framework for evaluating the cognitive plausibility of LLMs by using lexical psychological features, complementing existing benchmarks.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"58"},"PeriodicalIF":3.9,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12864368/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Drawings of THINGS: A large-scale drawing dataset of 1854 object concepts. drawing of THINGS: 1854个物体概念的大规模绘图数据集。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-01-30 DOI: 10.3758/s13428-025-02887-w
Kushin Mukherjee, Holly Huey, Laura M Stoinski, Martin N Hebart, Judith E Fan, Wilma A Bainbridge

The development of large datasets of natural images has galvanized progress in psychology, neuroscience, and computer science. Notably, the THINGS database constitutes a collective effort towards understanding of human visual knowledge by accumulating rich data on a shared set of visual object concepts across several studies. In this paper, we introduce Drawing of THINGS ( DoT ), a novel dataset of 28,627 human drawings of 1854 diverse object concepts, sampled systematically from concrete picturable and nameable nouns in the American English language, mirroring the structure of the THINGS image database. In addition to data on drawings' stroke history, we further collected fine-grained recognition data for each drawing, along with metadata on participant demographics, drawing ability, and mental imagery. We characterize people's ability to communicate and recognize semantic information encoded in drawings and compare this ability to their ability to recognize real-world images of the same visual objects. We also explore the relationship between drawing understanding and the memorability and typicality of the objects contained in THINGS. In sum, we envision DoT as a powerful tool that builds on the THINGS database to advance understanding of how humans express knowledge about visual concepts.

自然图像的大型数据集的发展促进了心理学、神经科学和计算机科学的进步。值得注意的是,THINGS数据库通过在多个研究中共享的视觉对象概念集上积累丰富的数据,构成了对人类视觉知识理解的集体努力。在本文中,我们介绍了一个新的数据集——物图(DoT),该数据集包含28,627张人类绘制的1854种不同物体概念,系统地从美国英语中具体的可描绘和可命名名词中取样,反映了物图图像数据库的结构。除了绘画笔画历史数据外,我们还进一步收集了每幅绘画的细粒度识别数据,以及参与者人口统计、绘画能力和心理意象的元数据。我们描述了人们交流和识别图形中编码的语义信息的能力,并将这种能力与他们识别相同视觉对象的现实世界图像的能力进行了比较。我们还探讨了绘画理解与《事物》中包含的物体的可记忆性和典型性之间的关系。总之,我们设想DoT是一个强大的工具,它建立在THINGS数据库的基础上,以促进对人类如何表达视觉概念知识的理解。
{"title":"Drawings of THINGS: A large-scale drawing dataset of 1854 object concepts.","authors":"Kushin Mukherjee, Holly Huey, Laura M Stoinski, Martin N Hebart, Judith E Fan, Wilma A Bainbridge","doi":"10.3758/s13428-025-02887-w","DOIUrl":"10.3758/s13428-025-02887-w","url":null,"abstract":"<p><p>The development of large datasets of natural images has galvanized progress in psychology, neuroscience, and computer science. Notably, the THINGS database constitutes a collective effort towards understanding of human visual knowledge by accumulating rich data on a shared set of visual object concepts across several studies. In this paper, we introduce Drawing of THINGS ( DoT ), a novel dataset of 28,627 human drawings of 1854 diverse object concepts, sampled systematically from concrete picturable and nameable nouns in the American English language, mirroring the structure of the THINGS image database. In addition to data on drawings' stroke history, we further collected fine-grained recognition data for each drawing, along with metadata on participant demographics, drawing ability, and mental imagery. We characterize people's ability to communicate and recognize semantic information encoded in drawings and compare this ability to their ability to recognize real-world images of the same visual objects. We also explore the relationship between drawing understanding and the memorability and typicality of the objects contained in THINGS. In sum, we envision DoT as a powerful tool that builds on the THINGS database to advance understanding of how humans express knowledge about visual concepts.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"57"},"PeriodicalIF":3.9,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12858628/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146091869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scale abbreviation with supervised machine learning: A comparison of feature selection techniques. 尺度缩写与监督机器学习:特征选择技术的比较。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-01-29 DOI: 10.3758/s13428-025-02913-x
Wenshuo Li, Okan Bulut, Mark J Gierl, Sheng Zhang

Scale abbreviation is a crucial task for researchers aiming to reduce response burden and optimize data collection when using self-report instruments such as online surveys and questionnaires. Among various data-driven strategies available for scale abbreviation, supervised machine learning (SML) algorithms have emerged as a prominent approach due to their accuracy in predicting total scores from the original instrument. However, previous studies offer limited insights into how SML-abbreviated scales can be evaluated using both SML and psychometric metrics across different feature selection techniques. To address this gap, the current study aims to evaluate the effectiveness of seven feature selection methods: item-total-correlation-based filters (ITC), Minimum-Redundancy-Maximum-Relevance (MRMR), Lasso, Sequential Forward Selection (SFS), Sequential Backward Selection (SBS), Genetic Algorithms (GA), and Non-dominated Sorting Genetic Algorithms-II (NSGA-II), all used in conjunction with SML. Additionally, the psychometric properties of these SML methods are compared with two non-SML approaches. Using simulated datasets varying in sample size, model error, and factorial correlations, the study examines predictive accuracy, reliability, and the ability to recover both inter-subscale correlations and external criterion correlations. The findings indicate that no single method consistently excels across all conditions, with specific feature selection techniques performing better under certain circumstances. Key insights are provided to guide researchers in selecting appropriate feature selection methods based on their specific dataset characteristics and goals.

量表缩写是研究人员在使用在线调查和问卷等自我报告工具时,减轻响应负担和优化数据收集的关键任务。在各种可用于缩尺的数据驱动策略中,监督机器学习(SML)算法已成为一种突出的方法,因为它们在预测原始仪器的总分方面具有准确性。然而,先前的研究对如何在不同的特征选择技术中使用SML和心理测量指标来评估SML-缩略量表提供了有限的见解。为了解决这一差距,本研究旨在评估7种特征选择方法的有效性:基于项目总相关性的过滤器(ITC)、最小冗余-最大相关性(MRMR)、Lasso、顺序正向选择(SFS)、顺序向后选择(SBS)、遗传算法(GA)和非支配排序遗传算法- ii (NSGA-II),它们都与SML一起使用。此外,将这些方法与两种非SML方法的心理测量特性进行了比较。本研究使用不同样本量、模型误差和因子相关性的模拟数据集,检验了预测的准确性、可靠性以及恢复亚尺度间相关性和外部标准相关性的能力。研究结果表明,没有一种方法在所有条件下都表现出色,特定的特征选择技术在某些情况下表现更好。提供了关键的见解,以指导研究人员根据其特定的数据集特征和目标选择适当的特征选择方法。
{"title":"Scale abbreviation with supervised machine learning: A comparison of feature selection techniques.","authors":"Wenshuo Li, Okan Bulut, Mark J Gierl, Sheng Zhang","doi":"10.3758/s13428-025-02913-x","DOIUrl":"10.3758/s13428-025-02913-x","url":null,"abstract":"<p><p>Scale abbreviation is a crucial task for researchers aiming to reduce response burden and optimize data collection when using self-report instruments such as online surveys and questionnaires. Among various data-driven strategies available for scale abbreviation, supervised machine learning (SML) algorithms have emerged as a prominent approach due to their accuracy in predicting total scores from the original instrument. However, previous studies offer limited insights into how SML-abbreviated scales can be evaluated using both SML and psychometric metrics across different feature selection techniques. To address this gap, the current study aims to evaluate the effectiveness of seven feature selection methods: item-total-correlation-based filters (ITC), Minimum-Redundancy-Maximum-Relevance (MRMR), Lasso, Sequential Forward Selection (SFS), Sequential Backward Selection (SBS), Genetic Algorithms (GA), and Non-dominated Sorting Genetic Algorithms-II (NSGA-II), all used in conjunction with SML. Additionally, the psychometric properties of these SML methods are compared with two non-SML approaches. Using simulated datasets varying in sample size, model error, and factorial correlations, the study examines predictive accuracy, reliability, and the ability to recover both inter-subscale correlations and external criterion correlations. The findings indicate that no single method consistently excels across all conditions, with specific feature selection techniques performing better under certain circumstances. Key insights are provided to guide researchers in selecting appropriate feature selection methods based on their specific dataset characteristics and goals.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"55"},"PeriodicalIF":3.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146083942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A comprehensive psychometrics of cognitive ability measures: Reliability, practice effects, and the stability of latent factor structures across retesting. 认知能力测量的综合心理测量学:可靠性、实践效果和潜在因素结构在复测中的稳定性。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-01-29 DOI: 10.3758/s13428-025-02897-8
Matthew K Robison, Stephen Campbell, Lauren D Garner, Ciara Sibley, Joseph Coyne

The present study examined individual differences in 24 measures of cognitive ability in a sample of young adults (N = 255). Each measure was completed twice, separated by a period of 2 weeks, to assess test-retest reliability and retesting (i.e., practice) effects. Latent variable modeling was used to assess the convergent and discriminant validity of the measures, as they were selected to measure seven different cognitive constructs (attention control, processing speed, working memory, primary memory, secondary memory, fluid intelligence, and spatial ability). The measures showed adequate to high intrasession and intersession reliability. Construct-level estimates were highly reliable, and the measurement structure was invariant across the two testing occasions. In several instances, correlations among latent variables warranted further testing to ensure adequate discriminability. Finally, latent state-trait modeling indicated that the majority of systematic variance in cognitive measures is due to latent traits, rather than state-specific or task-specific factors. We discuss the practical and theoretical implications of these findings.

目前的研究检查了年轻人(N = 255)的24项认知能力指标的个体差异。每个测量完成两次,间隔2周,以评估测试-重测试的可靠性和重测试(即实践)的效果。使用潜变量模型来评估这些测量的收敛效度和判别效度,因为它们被选择用于测量七个不同的认知结构(注意控制、处理速度、工作记忆、初级记忆、次级记忆、流体智力和空间能力)。测量结果表明,该方法具有较高的会话内和会话间可靠性。构造级估计是高度可靠的,并且度量结构在两个测试场合中是不变的。在一些情况下,潜在变量之间的相关性需要进一步测试,以确保充分的可区分性。最后,潜在状态-特质模型表明,认知测量的大部分系统方差是由潜在特质引起的,而不是由特定状态或特定任务因素引起的。我们讨论了这些发现的实践和理论意义。
{"title":"A comprehensive psychometrics of cognitive ability measures: Reliability, practice effects, and the stability of latent factor structures across retesting.","authors":"Matthew K Robison, Stephen Campbell, Lauren D Garner, Ciara Sibley, Joseph Coyne","doi":"10.3758/s13428-025-02897-8","DOIUrl":"10.3758/s13428-025-02897-8","url":null,"abstract":"<p><p>The present study examined individual differences in 24 measures of cognitive ability in a sample of young adults (N = 255). Each measure was completed twice, separated by a period of 2 weeks, to assess test-retest reliability and retesting (i.e., practice) effects. Latent variable modeling was used to assess the convergent and discriminant validity of the measures, as they were selected to measure seven different cognitive constructs (attention control, processing speed, working memory, primary memory, secondary memory, fluid intelligence, and spatial ability). The measures showed adequate to high intrasession and intersession reliability. Construct-level estimates were highly reliable, and the measurement structure was invariant across the two testing occasions. In several instances, correlations among latent variables warranted further testing to ensure adequate discriminability. Finally, latent state-trait modeling indicated that the majority of systematic variance in cognitive measures is due to latent traits, rather than state-specific or task-specific factors. We discuss the practical and theoretical implications of these findings.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"56"},"PeriodicalIF":3.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12855318/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146083978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Publisher Correction: Movement tracking of psychological processes: A tutorial using mousetrap. 出版商更正:心理过程的运动跟踪:一个使用捕鼠器的教程。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-01-26 DOI: 10.3758/s13428-025-02894-x
Dirk U Wulff, Pascal J Kieslich, Felix Henninger, Jonas M B Haslbeck, Michael Schulte-Mecklenbeck
{"title":"Publisher Correction: Movement tracking of psychological processes: A tutorial using mousetrap.","authors":"Dirk U Wulff, Pascal J Kieslich, Felix Henninger, Jonas M B Haslbeck, Michael Schulte-Mecklenbeck","doi":"10.3758/s13428-025-02894-x","DOIUrl":"10.3758/s13428-025-02894-x","url":null,"abstract":"","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"53"},"PeriodicalIF":3.9,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12835042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A database of overlapping ambiguous strings in Chinese reading. 中文阅读中重叠歧义字串数据库。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-01-26 DOI: 10.3758/s13428-025-02935-5
Linjieqiong Huang, Chenxi Li, Xingshan Li

In the absence of inter-word spaces, Chinese text sometimes presents word boundary ambiguity. One common case is the overlapping ambiguous string (OAS), a three-character string (ABC) where the middle character can form distinct words with both the character to its left (AB) and the character to its right (BC), creating segmentation ambiguity between AB-C and A-BC. This structure makes OASs a valuable tool for investigating the cognitive mechanisms of Chinese word segmentation. We introduce a comprehensive OAS database consisting of 952,497 OASs, each with 43 types of linguistic information at the character, word, and OAS levels. To illustrate how to use the database, we conducted an eye-tracking reading experiment manipulating whether the first character of the OAS (i.e., character A) could stand alone in sentences. Results showed that when character A could not stand alone, readers were more likely to group it with the next character B, leading to an AB-C segmentation. These findings validate the utility of the OAS database in understanding word segmentation during Chinese reading. The potential applications of the database in artificial intelligence, education, and writing system reform are discussed.

在缺乏词间空间的情况下,汉语文本有时会出现词界歧义。一种常见的情况是重叠的模糊字符串(OAS),这是一个三字符字符串(ABC),其中中间的字符可以用其左边的字符(AB)和右边的字符(BC)组成不同的单词,从而在AB- c和a -BC之间产生分割歧义。这种结构使得oas成为研究汉语分词认知机制的重要工具。我们介绍了一个由952,497个OAS组成的综合OAS数据库,每个OAS具有字符、单词和OAS级别的43种语言信息。为了说明如何使用数据库,我们进行了一个眼球追踪阅读实验,操纵OAS的第一个字符(即字符A)是否可以单独存在于句子中。结果表明,当字符A不能单独存在时,读者更有可能将其与下一个字符B组合在一起,从而导致AB-C分词。这些发现验证了OAS数据库在理解汉语阅读分词过程中的实用性。讨论了数据库在人工智能、教育和书写系统改革方面的潜在应用。
{"title":"A database of overlapping ambiguous strings in Chinese reading.","authors":"Linjieqiong Huang, Chenxi Li, Xingshan Li","doi":"10.3758/s13428-025-02935-5","DOIUrl":"10.3758/s13428-025-02935-5","url":null,"abstract":"<p><p>In the absence of inter-word spaces, Chinese text sometimes presents word boundary ambiguity. One common case is the overlapping ambiguous string (OAS), a three-character string (ABC) where the middle character can form distinct words with both the character to its left (AB) and the character to its right (BC), creating segmentation ambiguity between AB-C and A-BC. This structure makes OASs a valuable tool for investigating the cognitive mechanisms of Chinese word segmentation. We introduce a comprehensive OAS database consisting of 952,497 OASs, each with 43 types of linguistic information at the character, word, and OAS levels. To illustrate how to use the database, we conducted an eye-tracking reading experiment manipulating whether the first character of the OAS (i.e., character A) could stand alone in sentences. Results showed that when character A could not stand alone, readers were more likely to group it with the next character B, leading to an AB-C segmentation. These findings validate the utility of the OAS database in understanding word segmentation during Chinese reading. The potential applications of the database in artificial intelligence, education, and writing system reform are discussed.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"51"},"PeriodicalIF":3.9,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Unifying temporal preparation: The temporal preparation task (TEP-Task). 统一时间准备:时间准备任务(TEP-Task)。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2026-01-26 DOI: 10.3758/s13428-025-02908-8
Mariagrazia Capizzi, Lucie Attout, Giovanna Mioni, Pom Charras

The dynamic nature of our environment allows us to anticipate the onset of relevant events, enhancing our responses to them. Temporal preparation can be assessed in the laboratory using various tasks, including foreperiod tasks, temporal orienting tasks, and rhythmic tasks. However, the existing literature lacks a unified task to measure the most common temporal preparation effects (i.e., foreperiod, sequential, temporal orienting, and rhythmic effects) in a single session. The main goal of the present study was to fill this gap by devising the temporal preparation task (TEP-Task) to measure temporal preparation effects in a single 35-min testing session. Besides its utility in single-session assessments, the TEP-Task may also serve for future research across diverse populations and experimental demands.

环境的动态特性使我们能够预测相关事件的发生,增强我们对它们的反应。时间准备可以在实验室中使用各种任务进行评估,包括周期前任务、时间定向任务和节奏任务。然而,现有文献缺乏一个统一的任务来测量单次会话中最常见的时间准备效应(即前期效应、顺序效应、时间定向效应和节奏效应)。本研究的主要目的是通过设计时间准备任务(TEP-Task)来测量35分钟的时间准备效应,从而填补这一空白。除了在单会话评估中的效用外,TEP-Task也可以用于未来不同人群和实验需求的研究。
{"title":"Unifying temporal preparation: The temporal preparation task (TEP-Task).","authors":"Mariagrazia Capizzi, Lucie Attout, Giovanna Mioni, Pom Charras","doi":"10.3758/s13428-025-02908-8","DOIUrl":"10.3758/s13428-025-02908-8","url":null,"abstract":"<p><p>The dynamic nature of our environment allows us to anticipate the onset of relevant events, enhancing our responses to them. Temporal preparation can be assessed in the laboratory using various tasks, including foreperiod tasks, temporal orienting tasks, and rhythmic tasks. However, the existing literature lacks a unified task to measure the most common temporal preparation effects (i.e., foreperiod, sequential, temporal orienting, and rhythmic effects) in a single session. The main goal of the present study was to fill this gap by devising the temporal preparation task (TEP-Task) to measure temporal preparation effects in a single 35-min testing session. Besides its utility in single-session assessments, the TEP-Task may also serve for future research across diverse populations and experimental demands.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"52"},"PeriodicalIF":3.9,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Behavior Research Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1