Pub Date : 2026-02-04DOI: 10.3758/s13428-025-02924-8
Jiaxu Zhao, Xin He, Yi Jiang, Min Bao
Continuous flash suppression (CFS) is widely used in research on unconscious visual processing due to its long-lasting masking. While CFS effectively masks static stimuli, its application to motion stimuli remains challenging. To resolve this issue, our previous work developed the Chameleon-1 paradigm (Zhao & Bao, 2022), an enhanced CFS technique that enables robust masking of translational motion stimuli for up to 10 s through precise spatiotemporal matching of color dynamics between target and masking stimuli. The current study systematically evaluated and optimized this paradigm through three studies. We first assessed the masking efficacy of the Chameleon-1 paradigm across different motion parameters and patterns (Study 1). Because Chameleon-1 failed to effectively mask biological motion (BM) stimuli, we then upgraded the paradigm to accommodate BM stimulus characteristics (Study 2). The results demonstrated that this Chameleon-2 paradigm achieved superior masking efficacy for BM stimuli, with average breakthrough time extended by over two-fold compared to Chameleon-1 and breakthrough ratios approximately 75% for upright and 45% for inverted BM stimuli during 10-s of BM presentation. We further employed this paradigm to investigate the neural correlates of conscious and unconscious BM processing using functional near-infrared spectroscopy in Study 3. Our work establishes a robust paradigm for sustained masking of BM stimuli and validates its utility in unconscious processing research. We also provide new insights into the neural mechanisms underlying unconscious BM perception.
{"title":"The Chameleon Paradigm: An effective method for masking biological motion stimuli.","authors":"Jiaxu Zhao, Xin He, Yi Jiang, Min Bao","doi":"10.3758/s13428-025-02924-8","DOIUrl":"https://doi.org/10.3758/s13428-025-02924-8","url":null,"abstract":"<p><p>Continuous flash suppression (CFS) is widely used in research on unconscious visual processing due to its long-lasting masking. While CFS effectively masks static stimuli, its application to motion stimuli remains challenging. To resolve this issue, our previous work developed the Chameleon-1 paradigm (Zhao & Bao, 2022), an enhanced CFS technique that enables robust masking of translational motion stimuli for up to 10 s through precise spatiotemporal matching of color dynamics between target and masking stimuli. The current study systematically evaluated and optimized this paradigm through three studies. We first assessed the masking efficacy of the Chameleon-1 paradigm across different motion parameters and patterns (Study 1). Because Chameleon-1 failed to effectively mask biological motion (BM) stimuli, we then upgraded the paradigm to accommodate BM stimulus characteristics (Study 2). The results demonstrated that this Chameleon-2 paradigm achieved superior masking efficacy for BM stimuli, with average breakthrough time extended by over two-fold compared to Chameleon-1 and breakthrough ratios approximately 75% for upright and 45% for inverted BM stimuli during 10-s of BM presentation. We further employed this paradigm to investigate the neural correlates of conscious and unconscious BM processing using functional near-infrared spectroscopy in Study 3. Our work establishes a robust paradigm for sustained masking of BM stimuli and validates its utility in unconscious processing research. We also provide new insights into the neural mechanisms underlying unconscious BM perception.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"59"},"PeriodicalIF":3.9,"publicationDate":"2026-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146117507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-02-02DOI: 10.3758/s13428-025-02938-2
Hiromichi Hagihara, Kazuki Miyazawa
This study examined how well large language models (LLMs) approximate human psychological ratings for early-acquired English words. We used four state-of-the-art LLMs, including GPT-4o and Meta-Llama-3.1, to evaluate 21 static psychological features for 695 words and compared these estimates with human norms. The results showed that LLMs aligned well with human ratings for some features (e.g., Concreteness, Bodily Interactiveness) in terms of rank correlations (rs > .82) and distributional similarities but diverged notably for others (e.g., Iconicity, Arousal; rs < .48). Compared with content words, function words showed more pronounced discrepancies between human and LLM ratings. We also assessed how similarly human- and LLM-derived psychological features predicted words' age of acquisition (AoA), revealing both strong correspondences and systematic biases, depending on the model (differences in correlations ranged from -.27 to .28). Based on these analyses, we identified which features may be reliably estimated using LLMs, which require further refinement, and what methodological considerations are necessary for applying LLM-based measures in cognitive science. We discuss the implications of using LLMs as methodological tools in psychology and cognitive science, highlighting both their practical advantages (e.g., data coverage and data collection efficiency) and theoretical relevance. The present study provides a novel framework for evaluating the cognitive plausibility of LLMs by using lexical psychological features, complementing existing benchmarks.
{"title":"How well do large language models mirror human cognition of word concepts?: A comparison of psychological ratings for early-acquired English words.","authors":"Hiromichi Hagihara, Kazuki Miyazawa","doi":"10.3758/s13428-025-02938-2","DOIUrl":"10.3758/s13428-025-02938-2","url":null,"abstract":"<p><p>This study examined how well large language models (LLMs) approximate human psychological ratings for early-acquired English words. We used four state-of-the-art LLMs, including GPT-4o and Meta-Llama-3.1, to evaluate 21 static psychological features for 695 words and compared these estimates with human norms. The results showed that LLMs aligned well with human ratings for some features (e.g., Concreteness, Bodily Interactiveness) in terms of rank correlations (rs > .82) and distributional similarities but diverged notably for others (e.g., Iconicity, Arousal; rs < .48). Compared with content words, function words showed more pronounced discrepancies between human and LLM ratings. We also assessed how similarly human- and LLM-derived psychological features predicted words' age of acquisition (AoA), revealing both strong correspondences and systematic biases, depending on the model (differences in correlations ranged from -.27 to .28). Based on these analyses, we identified which features may be reliably estimated using LLMs, which require further refinement, and what methodological considerations are necessary for applying LLM-based measures in cognitive science. We discuss the implications of using LLMs as methodological tools in psychology and cognitive science, highlighting both their practical advantages (e.g., data coverage and data collection efficiency) and theoretical relevance. The present study provides a novel framework for evaluating the cognitive plausibility of LLMs by using lexical psychological features, complementing existing benchmarks.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"58"},"PeriodicalIF":3.9,"publicationDate":"2026-02-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12864368/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146103563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-30DOI: 10.3758/s13428-025-02887-w
Kushin Mukherjee, Holly Huey, Laura M Stoinski, Martin N Hebart, Judith E Fan, Wilma A Bainbridge
The development of large datasets of natural images has galvanized progress in psychology, neuroscience, and computer science. Notably, the THINGS database constitutes a collective effort towards understanding of human visual knowledge by accumulating rich data on a shared set of visual object concepts across several studies. In this paper, we introduce Drawing of THINGS ( DoT ), a novel dataset of 28,627 human drawings of 1854 diverse object concepts, sampled systematically from concrete picturable and nameable nouns in the American English language, mirroring the structure of the THINGS image database. In addition to data on drawings' stroke history, we further collected fine-grained recognition data for each drawing, along with metadata on participant demographics, drawing ability, and mental imagery. We characterize people's ability to communicate and recognize semantic information encoded in drawings and compare this ability to their ability to recognize real-world images of the same visual objects. We also explore the relationship between drawing understanding and the memorability and typicality of the objects contained in THINGS. In sum, we envision DoT as a powerful tool that builds on the THINGS database to advance understanding of how humans express knowledge about visual concepts.
{"title":"Drawings of THINGS: A large-scale drawing dataset of 1854 object concepts.","authors":"Kushin Mukherjee, Holly Huey, Laura M Stoinski, Martin N Hebart, Judith E Fan, Wilma A Bainbridge","doi":"10.3758/s13428-025-02887-w","DOIUrl":"10.3758/s13428-025-02887-w","url":null,"abstract":"<p><p>The development of large datasets of natural images has galvanized progress in psychology, neuroscience, and computer science. Notably, the THINGS database constitutes a collective effort towards understanding of human visual knowledge by accumulating rich data on a shared set of visual object concepts across several studies. In this paper, we introduce Drawing of THINGS ( DoT ), a novel dataset of 28,627 human drawings of 1854 diverse object concepts, sampled systematically from concrete picturable and nameable nouns in the American English language, mirroring the structure of the THINGS image database. In addition to data on drawings' stroke history, we further collected fine-grained recognition data for each drawing, along with metadata on participant demographics, drawing ability, and mental imagery. We characterize people's ability to communicate and recognize semantic information encoded in drawings and compare this ability to their ability to recognize real-world images of the same visual objects. We also explore the relationship between drawing understanding and the memorability and typicality of the objects contained in THINGS. In sum, we envision DoT as a powerful tool that builds on the THINGS database to advance understanding of how humans express knowledge about visual concepts.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"57"},"PeriodicalIF":3.9,"publicationDate":"2026-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12858628/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146091869","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-29DOI: 10.3758/s13428-025-02913-x
Wenshuo Li, Okan Bulut, Mark J Gierl, Sheng Zhang
Scale abbreviation is a crucial task for researchers aiming to reduce response burden and optimize data collection when using self-report instruments such as online surveys and questionnaires. Among various data-driven strategies available for scale abbreviation, supervised machine learning (SML) algorithms have emerged as a prominent approach due to their accuracy in predicting total scores from the original instrument. However, previous studies offer limited insights into how SML-abbreviated scales can be evaluated using both SML and psychometric metrics across different feature selection techniques. To address this gap, the current study aims to evaluate the effectiveness of seven feature selection methods: item-total-correlation-based filters (ITC), Minimum-Redundancy-Maximum-Relevance (MRMR), Lasso, Sequential Forward Selection (SFS), Sequential Backward Selection (SBS), Genetic Algorithms (GA), and Non-dominated Sorting Genetic Algorithms-II (NSGA-II), all used in conjunction with SML. Additionally, the psychometric properties of these SML methods are compared with two non-SML approaches. Using simulated datasets varying in sample size, model error, and factorial correlations, the study examines predictive accuracy, reliability, and the ability to recover both inter-subscale correlations and external criterion correlations. The findings indicate that no single method consistently excels across all conditions, with specific feature selection techniques performing better under certain circumstances. Key insights are provided to guide researchers in selecting appropriate feature selection methods based on their specific dataset characteristics and goals.
量表缩写是研究人员在使用在线调查和问卷等自我报告工具时,减轻响应负担和优化数据收集的关键任务。在各种可用于缩尺的数据驱动策略中,监督机器学习(SML)算法已成为一种突出的方法,因为它们在预测原始仪器的总分方面具有准确性。然而,先前的研究对如何在不同的特征选择技术中使用SML和心理测量指标来评估SML-缩略量表提供了有限的见解。为了解决这一差距,本研究旨在评估7种特征选择方法的有效性:基于项目总相关性的过滤器(ITC)、最小冗余-最大相关性(MRMR)、Lasso、顺序正向选择(SFS)、顺序向后选择(SBS)、遗传算法(GA)和非支配排序遗传算法- ii (NSGA-II),它们都与SML一起使用。此外,将这些方法与两种非SML方法的心理测量特性进行了比较。本研究使用不同样本量、模型误差和因子相关性的模拟数据集,检验了预测的准确性、可靠性以及恢复亚尺度间相关性和外部标准相关性的能力。研究结果表明,没有一种方法在所有条件下都表现出色,特定的特征选择技术在某些情况下表现更好。提供了关键的见解,以指导研究人员根据其特定的数据集特征和目标选择适当的特征选择方法。
{"title":"Scale abbreviation with supervised machine learning: A comparison of feature selection techniques.","authors":"Wenshuo Li, Okan Bulut, Mark J Gierl, Sheng Zhang","doi":"10.3758/s13428-025-02913-x","DOIUrl":"https://doi.org/10.3758/s13428-025-02913-x","url":null,"abstract":"<p><p>Scale abbreviation is a crucial task for researchers aiming to reduce response burden and optimize data collection when using self-report instruments such as online surveys and questionnaires. Among various data-driven strategies available for scale abbreviation, supervised machine learning (SML) algorithms have emerged as a prominent approach due to their accuracy in predicting total scores from the original instrument. However, previous studies offer limited insights into how SML-abbreviated scales can be evaluated using both SML and psychometric metrics across different feature selection techniques. To address this gap, the current study aims to evaluate the effectiveness of seven feature selection methods: item-total-correlation-based filters (ITC), Minimum-Redundancy-Maximum-Relevance (MRMR), Lasso, Sequential Forward Selection (SFS), Sequential Backward Selection (SBS), Genetic Algorithms (GA), and Non-dominated Sorting Genetic Algorithms-II (NSGA-II), all used in conjunction with SML. Additionally, the psychometric properties of these SML methods are compared with two non-SML approaches. Using simulated datasets varying in sample size, model error, and factorial correlations, the study examines predictive accuracy, reliability, and the ability to recover both inter-subscale correlations and external criterion correlations. The findings indicate that no single method consistently excels across all conditions, with specific feature selection techniques performing better under certain circumstances. Key insights are provided to guide researchers in selecting appropriate feature selection methods based on their specific dataset characteristics and goals.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"55"},"PeriodicalIF":3.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146083942","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-29DOI: 10.3758/s13428-025-02897-8
Matthew K Robison, Stephen Campbell, Lauren D Garner, Ciara Sibley, Joseph Coyne
The present study examined individual differences in 24 measures of cognitive ability in a sample of young adults (N = 255). Each measure was completed twice, separated by a period of 2 weeks, to assess test-retest reliability and retesting (i.e., practice) effects. Latent variable modeling was used to assess the convergent and discriminant validity of the measures, as they were selected to measure seven different cognitive constructs (attention control, processing speed, working memory, primary memory, secondary memory, fluid intelligence, and spatial ability). The measures showed adequate to high intrasession and intersession reliability. Construct-level estimates were highly reliable, and the measurement structure was invariant across the two testing occasions. In several instances, correlations among latent variables warranted further testing to ensure adequate discriminability. Finally, latent state-trait modeling indicated that the majority of systematic variance in cognitive measures is due to latent traits, rather than state-specific or task-specific factors. We discuss the practical and theoretical implications of these findings.
{"title":"A comprehensive psychometrics of cognitive ability measures: Reliability, practice effects, and the stability of latent factor structures across retesting.","authors":"Matthew K Robison, Stephen Campbell, Lauren D Garner, Ciara Sibley, Joseph Coyne","doi":"10.3758/s13428-025-02897-8","DOIUrl":"10.3758/s13428-025-02897-8","url":null,"abstract":"<p><p>The present study examined individual differences in 24 measures of cognitive ability in a sample of young adults (N = 255). Each measure was completed twice, separated by a period of 2 weeks, to assess test-retest reliability and retesting (i.e., practice) effects. Latent variable modeling was used to assess the convergent and discriminant validity of the measures, as they were selected to measure seven different cognitive constructs (attention control, processing speed, working memory, primary memory, secondary memory, fluid intelligence, and spatial ability). The measures showed adequate to high intrasession and intersession reliability. Construct-level estimates were highly reliable, and the measurement structure was invariant across the two testing occasions. In several instances, correlations among latent variables warranted further testing to ensure adequate discriminability. Finally, latent state-trait modeling indicated that the majority of systematic variance in cognitive measures is due to latent traits, rather than state-specific or task-specific factors. We discuss the practical and theoretical implications of these findings.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"56"},"PeriodicalIF":3.9,"publicationDate":"2026-01-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12855318/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146083978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-26DOI: 10.3758/s13428-025-02894-x
Dirk U Wulff, Pascal J Kieslich, Felix Henninger, Jonas M B Haslbeck, Michael Schulte-Mecklenbeck
{"title":"Publisher Correction: Movement tracking of psychological processes: A tutorial using mousetrap.","authors":"Dirk U Wulff, Pascal J Kieslich, Felix Henninger, Jonas M B Haslbeck, Michael Schulte-Mecklenbeck","doi":"10.3758/s13428-025-02894-x","DOIUrl":"10.3758/s13428-025-02894-x","url":null,"abstract":"","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"53"},"PeriodicalIF":3.9,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12835042/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050125","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-26DOI: 10.3758/s13428-025-02935-5
Linjieqiong Huang, Chenxi Li, Xingshan Li
In the absence of inter-word spaces, Chinese text sometimes presents word boundary ambiguity. One common case is the overlapping ambiguous string (OAS), a three-character string (ABC) where the middle character can form distinct words with both the character to its left (AB) and the character to its right (BC), creating segmentation ambiguity between AB-C and A-BC. This structure makes OASs a valuable tool for investigating the cognitive mechanisms of Chinese word segmentation. We introduce a comprehensive OAS database consisting of 952,497 OASs, each with 43 types of linguistic information at the character, word, and OAS levels. To illustrate how to use the database, we conducted an eye-tracking reading experiment manipulating whether the first character of the OAS (i.e., character A) could stand alone in sentences. Results showed that when character A could not stand alone, readers were more likely to group it with the next character B, leading to an AB-C segmentation. These findings validate the utility of the OAS database in understanding word segmentation during Chinese reading. The potential applications of the database in artificial intelligence, education, and writing system reform are discussed.
{"title":"A database of overlapping ambiguous strings in Chinese reading.","authors":"Linjieqiong Huang, Chenxi Li, Xingshan Li","doi":"10.3758/s13428-025-02935-5","DOIUrl":"https://doi.org/10.3758/s13428-025-02935-5","url":null,"abstract":"<p><p>In the absence of inter-word spaces, Chinese text sometimes presents word boundary ambiguity. One common case is the overlapping ambiguous string (OAS), a three-character string (ABC) where the middle character can form distinct words with both the character to its left (AB) and the character to its right (BC), creating segmentation ambiguity between AB-C and A-BC. This structure makes OASs a valuable tool for investigating the cognitive mechanisms of Chinese word segmentation. We introduce a comprehensive OAS database consisting of 952,497 OASs, each with 43 types of linguistic information at the character, word, and OAS levels. To illustrate how to use the database, we conducted an eye-tracking reading experiment manipulating whether the first character of the OAS (i.e., character A) could stand alone in sentences. Results showed that when character A could not stand alone, readers were more likely to group it with the next character B, leading to an AB-C segmentation. These findings validate the utility of the OAS database in understanding word segmentation during Chinese reading. The potential applications of the database in artificial intelligence, education, and writing system reform are discussed.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"51"},"PeriodicalIF":3.9,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050169","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-26DOI: 10.3758/s13428-025-02931-9
Chen Feng, Song Wang, Su Li
To facilitate research into the language development of preschool children, this article presents the Chinese Preschool Children's Spoken Lexical Database (CPCSLD), which is a lexical database built from a corpus of spontaneous speech production by 648 Chinese children aged between 3 and 6 years. The corpus comprises 1,199,851 word tokens, which include 21,372 unique words, 1,147 unique tonal syllables, and 400 unique atonal syllables. CPCSLD provides multiple distributional characteristics of both word-level and syllable-level information, including word frequency, token frequency, word length, word syntactic categories, tonal syllable frequency, and atonal syllable frequency, for words in the entire corpus as well as in the three grade-level sub-corpora (K1, K2, and K3). Using CPCSLD, we describe the developmental changes in word syntactic categories and frequency across K1, K2, and K3. Validation analyses showed an advantage for CPCSLD over existing databases in predicting children's picture-naming performance, but not in the semantic decision task. Correlation analyses further revealed distinct developmental patterns in lexical properties: CPCSLD's syllable frequencies were highly correlated with those from other child-based databases, whereas its word frequencies showed low correlations. These findings highlight CPCSLD's sensitivity to early lexical development and its value for research on language production in 3-6-year-old children.
{"title":"CPCSLD: A lexical database of Chinese preschool children's spoken words.","authors":"Chen Feng, Song Wang, Su Li","doi":"10.3758/s13428-025-02931-9","DOIUrl":"https://doi.org/10.3758/s13428-025-02931-9","url":null,"abstract":"<p><p>To facilitate research into the language development of preschool children, this article presents the Chinese Preschool Children's Spoken Lexical Database (CPCSLD), which is a lexical database built from a corpus of spontaneous speech production by 648 Chinese children aged between 3 and 6 years. The corpus comprises 1,199,851 word tokens, which include 21,372 unique words, 1,147 unique tonal syllables, and 400 unique atonal syllables. CPCSLD provides multiple distributional characteristics of both word-level and syllable-level information, including word frequency, token frequency, word length, word syntactic categories, tonal syllable frequency, and atonal syllable frequency, for words in the entire corpus as well as in the three grade-level sub-corpora (K1, K2, and K3). Using CPCSLD, we describe the developmental changes in word syntactic categories and frequency across K1, K2, and K3. Validation analyses showed an advantage for CPCSLD over existing databases in predicting children's picture-naming performance, but not in the semantic decision task. Correlation analyses further revealed distinct developmental patterns in lexical properties: CPCSLD's syllable frequencies were highly correlated with those from other child-based databases, whereas its word frequencies showed low correlations. These findings highlight CPCSLD's sensitivity to early lexical development and its value for research on language production in 3-6-year-old children.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"54"},"PeriodicalIF":3.9,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050120","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The dynamic nature of our environment allows us to anticipate the onset of relevant events, enhancing our responses to them. Temporal preparation can be assessed in the laboratory using various tasks, including foreperiod tasks, temporal orienting tasks, and rhythmic tasks. However, the existing literature lacks a unified task to measure the most common temporal preparation effects (i.e., foreperiod, sequential, temporal orienting, and rhythmic effects) in a single session. The main goal of the present study was to fill this gap by devising the temporal preparation task (TEP-Task) to measure temporal preparation effects in a single 35-min testing session. Besides its utility in single-session assessments, the TEP-Task may also serve for future research across diverse populations and experimental demands.
{"title":"Unifying temporal preparation: The temporal preparation task (TEP-Task).","authors":"Mariagrazia Capizzi, Lucie Attout, Giovanna Mioni, Pom Charras","doi":"10.3758/s13428-025-02908-8","DOIUrl":"https://doi.org/10.3758/s13428-025-02908-8","url":null,"abstract":"<p><p>The dynamic nature of our environment allows us to anticipate the onset of relevant events, enhancing our responses to them. Temporal preparation can be assessed in the laboratory using various tasks, including foreperiod tasks, temporal orienting tasks, and rhythmic tasks. However, the existing literature lacks a unified task to measure the most common temporal preparation effects (i.e., foreperiod, sequential, temporal orienting, and rhythmic effects) in a single session. The main goal of the present study was to fill this gap by devising the temporal preparation task (TEP-Task) to measure temporal preparation effects in a single 35-min testing session. Besides its utility in single-session assessments, the TEP-Task may also serve for future research across diverse populations and experimental demands.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"52"},"PeriodicalIF":3.9,"publicationDate":"2026-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146050142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2026-01-23DOI: 10.3758/s13428-025-02925-7
Anna Gowenlock, Jennifer Rodd, Beth Malory, Courtenay Norbury
A growing number of psycholinguistic studies use methods from corpus linguistics to examine the language that children encounter in their environment, to understand how they might acquire different aspects of linguistic knowledge. Many of these studies focus on child-directed speech or children's literature, while there is a paucity of work focusing on children's television and video media. We describe the creation and contents of the Corpus of Children's Video Media (CCVM), a specialised corpus designed to represent the spoken language in television and online videos popular among 3-5-year-old children in the UK (available as a scrambled database of tokens). The CCVM was designed to be comparable to an existing corpus of child-directed speech (CDS). We used a dual sampling approach: inclusion decisions were guided by (a) a survey of parents with children in our target age group, and (b) a survey of programmes available on popular streaming platforms. The corpus consists of 233,471 tokens across 161 transcripts (43.12 h of video) and is available on the Open Science Framework (OSF) as a scrambled database of tokens (including gloss, stem, and lemma forms, and part-of-speech tags), organised within transcripts, together with relevant metadata for each transcript. We discuss the challenges of creating a corpus that is comparable to existing datasets and highlight the importance of transparency in this process. We take an open science approach, sharing a detailed data collection and processing protocol, code, and data so that the corpus can be evaluated, extended, and used appropriately by other research teams.
{"title":"Constructing the Corpus of Children's Video Media (CCVM): A new resource and guidelines for constructing comparable and reusable corpora.","authors":"Anna Gowenlock, Jennifer Rodd, Beth Malory, Courtenay Norbury","doi":"10.3758/s13428-025-02925-7","DOIUrl":"10.3758/s13428-025-02925-7","url":null,"abstract":"<p><p>A growing number of psycholinguistic studies use methods from corpus linguistics to examine the language that children encounter in their environment, to understand how they might acquire different aspects of linguistic knowledge. Many of these studies focus on child-directed speech or children's literature, while there is a paucity of work focusing on children's television and video media. We describe the creation and contents of the Corpus of Children's Video Media (CCVM), a specialised corpus designed to represent the spoken language in television and online videos popular among 3-5-year-old children in the UK (available as a scrambled database of tokens). The CCVM was designed to be comparable to an existing corpus of child-directed speech (CDS). We used a dual sampling approach: inclusion decisions were guided by (a) a survey of parents with children in our target age group, and (b) a survey of programmes available on popular streaming platforms. The corpus consists of 233,471 tokens across 161 transcripts (43.12 h of video) and is available on the Open Science Framework (OSF) as a scrambled database of tokens (including gloss, stem, and lemma forms, and part-of-speech tags), organised within transcripts, together with relevant metadata for each transcript. We discuss the challenges of creating a corpus that is comparable to existing datasets and highlight the importance of transparency in this process. We take an open science approach, sharing a detailed data collection and processing protocol, code, and data so that the corpus can be evaluated, extended, and used appropriately by other research teams.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"58 2","pages":"49"},"PeriodicalIF":3.9,"publicationDate":"2026-01-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12830416/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146040329","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}