Pub Date : 2026-02-12 DOI: 10.3758/s13428-025-02877-y
Emily A Williams, Matthew Warburton, Martin Krzywinski, Faisal Mushtaq
In typing proficiency tests, like those used in job recruitment or research studies, individuals are evaluated based on their speed and accuracy. However, the difficulty of the typed text, its 'typability', can impact typing performance, introducing variability that is unrelated to skill. To ensure valid comparisons across individuals, time, and conditions, it is crucial to control for this variation in text difficulty. To address this issue, we develop the Typability Index, a model that predicts the relative typing speed of text. Building on earlier attempts to quantify typing difficulty from the 1940s, we create a more advanced typability model using the 136 Million (136 M) Keystrokes Dataset (Dhakal et al., Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems, 1-12, 2018), where over 168,000 participants each typed 15 sentences from a pool of 1,525 items. Through random forest regression, we identify eight key predictors from 30 candidate variables, including the proportion of lowercase letters, word frequency, and syllables per word. Trained on 80% of the dataset and validated on the remaining 20% and a novel dataset, the Typability Index explained 68-88% of the variance in typability, compared to the 34% explained by an earlier leading model (Bell, Unpublished Doctor's Dissertation, University of Oklahoma, 1949). To promote higher control in typing research and assessments, we introduce a web-based tool to facilitate accurate measurement and fair comparisons of text typability.
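Two quantities in this abstract can be made concrete with a minimal Python sketch: one of the named predictors (the proportion of lowercase letters) and the "variance explained" figure used to compare models. The function names are hypothetical, and the denominator chosen for the proportion (all characters in the text) is an assumption; the paper's exact definition may differ.

```python
def proportion_lowercase(text: str) -> float:
    """Share of all characters in `text` that are lowercase letters.

    Assumption: the denominator is the total character count,
    including spaces and punctuation.
    """
    if not text:
        return 0.0
    return sum(ch.islower() for ch in text) / len(text)


def r_squared(observed, predicted):
    """Proportion of variance in `observed` accounted for by `predicted`."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot
```

A model whose predictions match the observed typing speeds exactly gives an `r_squared` of 1.0, while a model that always predicts the mean gives 0.0.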
Title: The Typability Index: A tool for measuring and controlling for typing difficulty in text stimuli.
Journal: Behavior Research Methods, 58(2), 61. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12901113/pdf/
Pub Date : 2026-02-11 DOI: 10.3758/s13428-026-02949-7
Emiko J Muraki, Penny M Pexman
Psycholinguistic norms quantify dimensions of word form, use, and/or meaning and are important for studies involving word stimuli. Among the most commonly used norms are those for concreteness, quantifying the degree to which a word's referent can be understood through the senses. The dimension of concreteness has been found to be related to performance in language processing and memory tasks, yet recent research suggests that (1) ratings of concreteness may be conflated with how specific a word's meaning is, and (2) concreteness and specificity may have opposite effects on lexical-semantic processing. In the present study, we collected specificity scores for a large sample of English words, assessed their reliability and validity, and tested their relationships to lexical processing and memory accuracy. We found that specificity and concreteness are indeed correlated and have different behavioural effects. This large dataset of specificity scores will be an important resource for future lexical-semantic studies.
Title: Distinguishing abstraction from abstractness: Specificity norms for 8,500 English words.
Journal: Behavior Research Methods, 58(2), 60.
Pub Date : 2026-02-04 DOI: 10.3758/s13428-025-02924-8
Jiaxu Zhao, Xin He, Yi Jiang, Min Bao
Continuous flash suppression (CFS) is widely used in research on unconscious visual processing due to its long-lasting masking. While CFS effectively masks static stimuli, its application to motion stimuli remains challenging. To resolve this issue, our previous work developed the Chameleon-1 paradigm (Zhao & Bao, 2022), an enhanced CFS technique that enables robust masking of translational motion stimuli for up to 10 s through precise spatiotemporal matching of color dynamics between target and masking stimuli. The current study systematically evaluated and optimized this paradigm through three studies. We first assessed the masking efficacy of the Chameleon-1 paradigm across different motion parameters and patterns (Study 1). Because Chameleon-1 failed to effectively mask biological motion (BM) stimuli, we then upgraded the paradigm to accommodate BM stimulus characteristics (Study 2). The results demonstrated that this Chameleon-2 paradigm achieved superior masking efficacy for BM stimuli, with average breakthrough time extended more than twofold compared to Chameleon-1 and breakthrough ratios of approximately 75% for upright and 45% for inverted BM stimuli during 10 s of BM presentation. We further employed this paradigm to investigate the neural correlates of conscious and unconscious BM processing using functional near-infrared spectroscopy in Study 3. Our work establishes a robust paradigm for sustained masking of BM stimuli and validates its utility in unconscious processing research. We also provide new insights into the neural mechanisms underlying unconscious BM perception.
Title: The Chameleon Paradigm: An effective method for masking biological motion stimuli.
Journal: Behavior Research Methods, 58(2), 59.
Pub Date : 2026-02-02 DOI: 10.3758/s13428-025-02938-2
Hiromichi Hagihara, Kazuki Miyazawa
This study examined how well large language models (LLMs) approximate human psychological ratings for early-acquired English words. We used four state-of-the-art LLMs, including GPT-4o and Meta-Llama-3.1, to evaluate 21 static psychological features for 695 words and compared these estimates with human norms. The results showed that LLMs aligned well with human ratings for some features (e.g., Concreteness, Bodily Interactiveness) in terms of rank correlations (rs > .82) and distributional similarities but diverged notably for others (e.g., Iconicity, Arousal; rs < .48). Compared with content words, function words showed more pronounced discrepancies between human and LLM ratings. We also assessed how similarly human- and LLM-derived psychological features predicted words' age of acquisition (AoA), revealing both strong correspondences and systematic biases, depending on the model (differences in correlations ranged from -.27 to .28). Based on these analyses, we identified which features may be reliably estimated using LLMs, which require further refinement, and what methodological considerations are necessary for applying LLM-based measures in cognitive science. We discuss the implications of using LLMs as methodological tools in psychology and cognitive science, highlighting both their practical advantages (e.g., data coverage and data collection efficiency) and theoretical relevance. The present study provides a novel framework for evaluating the cognitive plausibility of LLMs by using lexical psychological features, complementing existing benchmarks.
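The rank correlations reported here compare the ordering of words under human ratings with their ordering under LLM ratings. A minimal sketch of a Spearman correlation in plain Python follows; the function names are hypothetical, and this is a generic textbook computation rather than the authors' analysis code.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(human_ratings, llm_ratings):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(human_ratings), average_ranks(llm_ratings))
```

Because only ranks enter the calculation, an LLM that uses a compressed or shifted rating scale can still reach a high correlation as long as it orders the words the way human raters do.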
Title: How well do large language models mirror human cognition of word concepts?: A comparison of psychological ratings for early-acquired English words.
Journal: Behavior Research Methods, 58(2), 58. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12864368/pdf/
Pub Date : 2026-01-30 DOI: 10.3758/s13428-025-02887-w
Kushin Mukherjee, Holly Huey, Laura M Stoinski, Martin N Hebart, Judith E Fan, Wilma A Bainbridge
The development of large datasets of natural images has galvanized progress in psychology, neuroscience, and computer science. Notably, the THINGS database constitutes a collective effort towards understanding human visual knowledge by accumulating rich data on a shared set of visual object concepts across several studies. In this paper, we introduce Drawings of THINGS (DoT), a novel dataset of 28,627 human drawings of 1854 diverse object concepts, sampled systematically from concrete picturable and nameable nouns in the American English language, mirroring the structure of the THINGS image database. In addition to data on drawings' stroke history, we further collected fine-grained recognition data for each drawing, along with metadata on participant demographics, drawing ability, and mental imagery. We characterize people's ability to communicate and recognize semantic information encoded in drawings and compare this ability to their ability to recognize real-world images of the same visual objects. We also explore the relationship between drawing understanding and the memorability and typicality of the objects contained in THINGS. In sum, we envision DoT as a powerful tool that builds on the THINGS database to advance understanding of how humans express knowledge about visual concepts.
Title: Drawings of THINGS: A large-scale drawing dataset of 1854 object concepts.
Journal: Behavior Research Methods, 58(2), 57. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12858628/pdf/
Pub Date : 2026-01-29 DOI: 10.3758/s13428-025-02913-x
Wenshuo Li, Okan Bulut, Mark J Gierl, Sheng Zhang
Scale abbreviation is a crucial task for researchers aiming to reduce response burden and optimize data collection when using self-report instruments such as online surveys and questionnaires. Among various data-driven strategies available for scale abbreviation, supervised machine learning (SML) algorithms have emerged as a prominent approach due to their accuracy in predicting total scores from the original instrument. However, previous studies offer limited insights into how SML-abbreviated scales can be evaluated using both SML and psychometric metrics across different feature selection techniques. To address this gap, the current study aims to evaluate the effectiveness of seven feature selection methods: item-total-correlation-based filters (ITC), Minimum-Redundancy-Maximum-Relevance (MRMR), Lasso, Sequential Forward Selection (SFS), Sequential Backward Selection (SBS), Genetic Algorithms (GA), and Non-dominated Sorting Genetic Algorithms-II (NSGA-II), all used in conjunction with SML. Additionally, the psychometric properties of these SML methods are compared with two non-SML approaches. Using simulated datasets varying in sample size, model error, and factorial correlations, the study examines predictive accuracy, reliability, and the ability to recover both inter-subscale correlations and external criterion correlations. The findings indicate that no single method consistently excels across all conditions, with specific feature selection techniques performing better under certain circumstances. Key insights are provided to guide researchers in selecting appropriate feature selection methods based on their specific dataset characteristics and goals.
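Of the seven feature selection methods listed, the item-total-correlation filter is the simplest to illustrate. The sketch below keeps the k items that correlate most strongly with the sum of the remaining items. Using the corrected (rest-score) correlation is an assumption made here, the function names are hypothetical, and the study's actual filter may be defined differently.

```python
def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def select_items_by_itc(responses, k):
    """Return the indices of the k items with the highest corrected
    item-total correlation.

    `responses` is a list of rows, one per respondent, each a list of
    item scores. The "corrected" total excludes the item being evaluated
    (an assumption of this sketch).
    """
    n_items = len(responses[0])
    totals = [sum(row) for row in responses]
    itc = []
    for j in range(n_items):
        item = [row[j] for row in responses]
        rest = [t - s for t, s in zip(totals, item)]
        itc.append(pearson(item, rest))
    ranked = sorted(range(n_items), key=lambda j: itc[j], reverse=True)
    return sorted(ranked[:k])


# Made-up responses: items 0 and 1 move together, item 2 runs against them.
toy_responses = [
    [1, 1, 5],
    [2, 2, 4],
    [3, 3, 1],
    [4, 4, 3],
    [5, 5, 2],
]
selected = select_items_by_itc(toy_responses, 2)
```

A filter like this scores each item independently, which is why the wrapper and evolutionary methods in the list (SFS, SBS, GA, NSGA-II) can behave differently: they evaluate candidate subsets of items jointly.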
Title: Scale abbreviation with supervised machine learning: A comparison of feature selection techniques.
Journal: Behavior Research Methods, 58(2), 55.
Pub Date : 2026-01-29 DOI: 10.3758/s13428-025-02897-8
Matthew K Robison, Stephen Campbell, Lauren D Garner, Ciara Sibley, Joseph Coyne
The present study examined individual differences in 24 measures of cognitive ability in a sample of young adults (N = 255). Each measure was completed twice, separated by a period of 2 weeks, to assess test-retest reliability and retesting (i.e., practice) effects. Latent variable modeling was used to assess the convergent and discriminant validity of the measures, as they were selected to measure seven different cognitive constructs (attention control, processing speed, working memory, primary memory, secondary memory, fluid intelligence, and spatial ability). The measures showed adequate to high intrasession and intersession reliability. Construct-level estimates were highly reliable, and the measurement structure was invariant across the two testing occasions. In several instances, correlations among latent variables warranted further testing to ensure adequate discriminability. Finally, latent state-trait modeling indicated that the majority of systematic variance in cognitive measures is due to latent traits, rather than state-specific or task-specific factors. We discuss the practical and theoretical implications of these findings.
Title: A comprehensive psychometrics of cognitive ability measures: Reliability, practice effects, and the stability of latent factor structures across retesting.
Journal: Behavior Research Methods, 58(2), 56. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12855318/pdf/
Pub Date : 2026-01-26 DOI: 10.3758/s13428-025-02894-x
Dirk U Wulff, Pascal J Kieslich, Felix Henninger, Jonas M B Haslbeck, Michael Schulte-Mecklenbeck
Title: Publisher Correction: Movement tracking of psychological processes: A tutorial using mousetrap.
Journal: Behavior Research Methods, 58(2), 53. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12835042/pdf/
Pub Date : 2026-01-26 DOI: 10.3758/s13428-025-02935-5
Linjieqiong Huang, Chenxi Li, Xingshan Li
In the absence of inter-word spaces, Chinese text sometimes presents word boundary ambiguity. One common case is the overlapping ambiguous string (OAS), a three-character string (ABC) where the middle character can form distinct words with both the character to its left (AB) and the character to its right (BC), creating segmentation ambiguity between AB-C and A-BC. This structure makes OASs a valuable tool for investigating the cognitive mechanisms of Chinese word segmentation. We introduce a comprehensive OAS database consisting of 952,497 OASs, each with 43 types of linguistic information at the character, word, and OAS levels. To illustrate how to use the database, we conducted an eye-tracking reading experiment manipulating whether the first character of the OAS (i.e., character A) could stand alone in sentences. Results showed that when character A could not stand alone, readers were more likely to group it with the next character B, leading to an AB-C segmentation. These findings validate the utility of the OAS database in understanding word segmentation during Chinese reading. The potential applications of the database in artificial intelligence, education, and writing system reform are discussed.
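The OAS definition above translates directly into a sliding-window check: a three-character string ABC qualifies when both AB and BC are words. The sketch below is a minimal illustration with a hypothetical function name and a two-word toy lexicon; it is not the procedure used to build the database.

```python
def find_oas(text, lexicon):
    """Return (ABC, AB, BC) for every three-character window in `text`
    where both AB and BC appear in `lexicon`."""
    hits = []
    for i in range(len(text) - 2):
        a, b, c = text[i], text[i + 1], text[i + 2]
        if a + b in lexicon and b + c in lexicon:
            hits.append((a + b + c, a + b, b + c))
    return hits


# Toy lexicon (assumption): 花生 "peanut" and 生长 "to grow" share the character 生.
toy_lexicon = {"花生", "生长"}
matches = find_oas("花生长", toy_lexicon)
```

For this input the only window is 花生长, which can be segmented either as 花生-长 (AB-C) or as 花-生长 (A-BC).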
Title: A database of overlapping ambiguous strings in Chinese reading.
Journal: Behavior Research Methods, 58(2), 51.
Pub Date : 2026-01-26 DOI: 10.3758/s13428-025-02908-8
Mariagrazia Capizzi, Lucie Attout, Giovanna Mioni, Pom Charras
The dynamic nature of our environment allows us to anticipate the onset of relevant events, enhancing our responses to them. Temporal preparation can be assessed in the laboratory using various tasks, including foreperiod tasks, temporal orienting tasks, and rhythmic tasks. However, the existing literature lacks a unified task to measure the most common temporal preparation effects (i.e., foreperiod, sequential, temporal orienting, and rhythmic effects) in a single session. The main goal of the present study was to fill this gap by devising the temporal preparation task (TEP-Task) to measure temporal preparation effects in a single 35-min testing session. Besides its utility in single-session assessments, the TEP-Task may also serve future research across diverse populations and experimental demands.
Title: Unifying temporal preparation: The temporal preparation task (TEP-Task).
Journal: Behavior Research Methods, 58(2), 52.