首页 > 最新文献

Behavior Research Methods最新文献

英文 中文
Continuous Rating Scale Analytics (CoRSA): A tool for analyzing continuous and discrete data with item response theory. 连续评定量表分析(CoRSA):用项目反应理论分析连续和离散数据的工具。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-11-04 DOI: 10.3758/s13428-025-02848-3
Yeh-Tai Chou, Yao-Ting Sung, Wei-Hung Yang

The use of continuous rating scales such as the visual analogue scale (VAS) in research has increased, yet they are less popular than discrete scales like the Likert scale. The non-popularity of continuous scales is primarily due to the lack of validated analytical tools and user-friendly interfaces, which have also jointly resulted in a lack of sufficient theoretical and empirical research supporting confidence in using continuous rating formats. This research aims to address these gaps through four studies. The first study proposed an algorithm and developed the Continuous Rating Scale Analytics (CoRSA) to estimate parameters for the continuous rating scale model (Müller, Psychometrika, 52, 165-181, 1987). The second study evaluated CoRSA's efficacy in analyzing continuous scores compared to pcIRT (Hohensinn, Journal of Statistical Software, 84, 1-14, 2018) and discrete scores against ConQuest (Adams et al., 2020). Results showed superior parameter recovery with CoRSA for continuous data and comparable outcomes for discrete data. The third study analyzed empirical data from career interest and work value assessments using both VAS and Likert scales with CoRSA, demonstrating good model-data fit and validating CoRSA's effectiveness in rescaling data to interval measurements. Finally, the fourth study integrated CoRSA into the VAS-RRP 2.0 platform (Sung & Wu, Behavior Research Methods, 50, 1694-1715, 2018) to enhance accessibility and usability, allowing researchers and practitioners unfamiliar with statistical procedures to easily analyze continuous data. These findings confirm CoRSA as a valid tool for analyzing both continuous and discrete data, enhancing the utility of continuous rating formats in diverse research contexts.

像视觉模拟量表(VAS)这样的连续评定量表在研究中的使用越来越多,但它们不如像李克特量表这样的离散量表受欢迎。连续量表不受欢迎的主要原因是缺乏经过验证的分析工具和用户友好的界面,这也共同导致缺乏足够的理论和实证研究来支持使用连续评分格式的信心。本研究旨在通过四项研究来解决这些差距。第一项研究提出了一种算法,并开发了连续评定量表分析(CoRSA)来估计连续评定量表模型的参数(m ller, Psychometrika, 52, 165-181, 1987)。第二项研究评估了CoRSA与pcIRT (Hohensinn, Journal Statistical Software, 84,1 - 14,2018)和ConQuest (Adams et al., 2020)相比在分析连续得分方面的有效性。结果显示,连续数据的CoRSA参数恢复优于离散数据的可比较结果。第三项研究分析了职业兴趣和工作价值评估的实证数据,使用VAS和Likert量表与CoRSA,证明了良好的模型数据拟合,并验证了CoRSA在将数据重新缩放到区间测量方面的有效性。最后,第四项研究将CoRSA集成到VAS-RRP 2.0平台中(Sung & Wu, Behavior Research Methods, 50, 1694-1715, 2018),以增强可访问性和可用性,使不熟悉统计程序的研究人员和从业人员能够轻松分析连续数据。这些发现证实了CoRSA是分析连续和离散数据的有效工具,增强了连续评级格式在不同研究背景下的效用。
{"title":"Continuous Rating Scale Analytics (CoRSA): A tool for analyzing continuous and discrete data with item response theory.","authors":"Yeh-Tai Chou, Yao-Ting Sung, Wei-Hung Yang","doi":"10.3758/s13428-025-02848-3","DOIUrl":"10.3758/s13428-025-02848-3","url":null,"abstract":"<p><p>The use of continuous rating scales such as the visual analogue scale (VAS) in research has increased, yet they are less popular than discrete scales like the Likert scale. The non-popularity of continuous scales is primarily due to the lack of validated analytical tools and user-friendly interfaces, which have also jointly resulted in a lack of sufficient theoretical and empirical research supporting confidence in using continuous rating formats. This research aims to address these gaps through four studies. The first study proposed an algorithm and developed the Continuous Rating Scale Analytics (CoRSA) to estimate parameters for the continuous rating scale model (Müller, Psychometrika, 52, 165-181, 1987). The second study evaluated CoRSA's efficacy in analyzing continuous scores compared to pcIRT (Hohensinn, Journal of Statistical Software, 84, 1-14, 2018) and discrete scores against ConQuest (Adams et al., 2020). Results showed superior parameter recovery with CoRSA for continuous data and comparable outcomes for discrete data. The third study analyzed empirical data from career interest and work value assessments using both VAS and Likert scales with CoRSA, demonstrating good model-data fit and validating CoRSA's effectiveness in rescaling data to interval measurements. Finally, the fourth study integrated CoRSA into the VAS-RRP 2.0 platform (Sung & Wu, Behavior Research Methods, 50, 1694-1715, 2018) to enhance accessibility and usability, allowing researchers and practitioners unfamiliar with statistical procedures to easily analyze continuous data. These findings confirm CoRSA as a valid tool for analyzing both continuous and discrete data, enhancing the utility of continuous rating formats in diverse research contexts.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"333"},"PeriodicalIF":3.9,"publicationDate":"2025-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12586417/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145443831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Autoscribe: An automated tool for creating transcribed TextGrids from audio-recorded conversations. Autoscribe:一个自动工具,用于从音频录制的对话中创建转录文本网格。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-11-03 DOI: 10.3758/s13428-025-02850-9
Tyson S Barrett, Camille J Wynn, Lotte Eijk, Katerina A Tetzloff, Stephanie A Borrie

One major difficulty in conversational research is the time required to segment and transcribe conversational recordings. While recent advances have improved automatic speech recognition technologies, one limitation of current tools is that they are generally catered toward speech that occurs in monologues rather than conversation. Accordingly, the purpose of this project was to develop and validate an automated user-friendly tool for transcribing conversations. This tool, called Autoscribe, converts dyadic conversational audio recordings into Praat TextGrids with time-aligned turn boundaries between speech and non-speech segments and transcripts of all spoken dialogue output. Here we describe the development of this tool as well as its validation on two conversational corpora. Results showed that Autoscribe decreased the amount of active working time needed for TextGrid creation by over 70%. Average transcription accuracy was 92% and average utterance boundary placement of 95%. Thus, Autoscribe affords a practical research tool that drastically reduces the time and resource intensitivity needed for conversational segmentation and transcription.

会话研究的一个主要困难是对会话录音进行分段和转录所需的时间。虽然最近的进步已经改进了自动语音识别技术,但现有工具的一个限制是,它们通常是针对独白而不是对话中的语音。因此,这个项目的目的是开发和验证一个自动化的用户友好工具,用于记录对话。这个工具,称为Autoscribe,转换双向会话录音到Praat文本网格与时间对齐的回合边界之间的语音和非语音段和所有口语对话输出的文本。在这里,我们描述了这个工具的开发以及它在两个会话语料库上的验证。结果表明,Autoscribe将创建TextGrid所需的活动工作时间减少了70%以上。平均转录准确率为92%,平均话语边界位置为95%。因此,Autoscribe提供了一个实用的研究工具,大大减少了会话分割和转录所需的时间和资源密集度。
{"title":"Autoscribe: An automated tool for creating transcribed TextGrids from audio-recorded conversations.","authors":"Tyson S Barrett, Camille J Wynn, Lotte Eijk, Katerina A Tetzloff, Stephanie A Borrie","doi":"10.3758/s13428-025-02850-9","DOIUrl":"10.3758/s13428-025-02850-9","url":null,"abstract":"<p><p>One major difficulty in conversational research is the time required to segment and transcribe conversational recordings. While recent advances have improved automatic speech recognition technologies, one limitation of current tools is that they are generally catered toward speech that occurs in monologues rather than conversation. Accordingly, the purpose of this project was to develop and validate an automated user-friendly tool for transcribing conversations. This tool, called Autoscribe, converts dyadic conversational audio recordings into Praat TextGrids with time-aligned turn boundaries between speech and non-speech segments and transcripts of all spoken dialogue output. Here we describe the development of this tool as well as its validation on two conversational corpora. Results showed that Autoscribe decreased the amount of active working time needed for TextGrid creation by over 70%. Average transcription accuracy was 92% and average utterance boundary placement of 95%. Thus, Autoscribe affords a practical research tool that drastically reduces the time and resource intensitivity needed for conversational segmentation and transcription.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"332"},"PeriodicalIF":3.9,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12583283/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145437003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Publisher Correction: A systematic review of latent class analysis in psychology: Examining the gap between guidelines and research practice. 出版商更正:心理学中潜在阶级分析的系统回顾:检查指南与研究实践之间的差距。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-11-03 DOI: 10.3758/s13428-025-02871-4
Angela Sorgente, Rossella Caliciuri, Matteo Robba, Margherita Lanz, Bruno D Zumbo
{"title":"Publisher Correction: A systematic review of latent class analysis in psychology: Examining the gap between guidelines and research practice.","authors":"Angela Sorgente, Rossella Caliciuri, Matteo Robba, Margherita Lanz, Bruno D Zumbo","doi":"10.3758/s13428-025-02871-4","DOIUrl":"10.3758/s13428-025-02871-4","url":null,"abstract":"","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"331"},"PeriodicalIF":3.9,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12583348/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145436976","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Joint modeling with generalized item response theory model family and response time model: Enhancing model structural flexibility and data-fitting adequacy. 广义项目反应理论模型族和反应时间模型的联合建模:提高模型结构的灵活性和数据拟合的充分性。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-11-03 DOI: 10.3758/s13428-025-02855-4
Jing Lu, Xue Wang, Jiwei Zhang

In this study, we propose a joint hierarchical model that combines a family of item response theory (IRT) models with a log-normal response time (RT) model to analyze item responses and response times. By incorporating RTs as auxiliary information, we improve the accuracy of latent trait estimation, thereby facilitating a deeper understanding of examinee performance. Additionally, we explore the use of either identical or distinct link functions across different items, allowing us to optimize IRT models for each item and improve overall model fit. We further investigate scenarios in which the joint distribution of speed and ability is nonlinear by integrating the generalized logit-linked IRT model with the log-normal random quadratic variable speed model. Compared to the traditional hierarchical model by van der Linden (Psychometrika, 72, 287-308 2007), this integration yields more accurate estimates of ability, item difficulty, and discrimination parameters. Additionally, Bayesian model comparison reveals that the new joint hierarchical model provides a better fit than various models combining item responses and RTs, particularly when the data are derived from a joint RT and two-parameter IRT model with both symmetric and asymmetric link functions. Finally, a comprehensive analysis of data from the computer-based Program for International Student Assessment (PISA) science examination from 2015 is conducted to illustrate the proposed methodology.

在这项研究中,我们提出了一个联合层次模型,该模型将项目反应理论(IRT)家族模型与对数正态反应时间(RT)模型相结合,以分析项目反应和反应时间。通过将RTs作为辅助信息,我们提高了潜在特质估计的准确性,从而有助于更深入地了解考生的表现。此外,我们探索了在不同项目之间使用相同或不同的链接函数,使我们能够优化每个项目的IRT模型并改善整体模型拟合。通过将广义对数链IRT模型与对数正态随机二次变量速度模型相结合,进一步研究了速度和能力的非线性联合分布。与van der Linden (Psychometrika, 72, 287-308 2007)的传统层次模型相比,这种整合可以更准确地估计能力、项目难度和辨别参数。此外,贝叶斯模型比较表明,新的联合分层模型比将项目反应和RTs结合在一起的各种模型提供了更好的拟合,特别是当数据来自具有对称和非对称链接函数的联合RT和双参数IRT模型时。最后,对2015年基于计算机的国际学生评估项目(PISA)科学考试的数据进行了全面分析,以说明所提出的方法。
{"title":"Joint modeling with generalized item response theory model family and response time model: Enhancing model structural flexibility and data-fitting adequacy.","authors":"Jing Lu, Xue Wang, Jiwei Zhang","doi":"10.3758/s13428-025-02855-4","DOIUrl":"10.3758/s13428-025-02855-4","url":null,"abstract":"<p><p>In this study, we propose a joint hierarchical model that combines a family of item response theory (IRT) models with a log-normal response time (RT) model to analyze item responses and response times. By incorporating RTs as auxiliary information, we improve the accuracy of latent trait estimation, thereby facilitating a deeper understanding of examinee performance. Additionally, we explore the use of either identical or distinct link functions across different items, allowing us to optimize IRT models for each item and improve overall model fit. We further investigate scenarios in which the joint distribution of speed and ability is nonlinear by integrating the generalized logit-linked IRT model with the log-normal random quadratic variable speed model. Compared to the traditional hierarchical model by van der Linden (Psychometrika, 72, 287-308 2007), this integration yields more accurate estimates of ability, item difficulty, and discrimination parameters. Additionally, Bayesian model comparison reveals that the new joint hierarchical model provides a better fit than various models combining item responses and RTs, particularly when the data are derived from a joint RT and two-parameter IRT model with both symmetric and asymmetric link functions. Finally, a comprehensive analysis of data from the computer-based Program for International Student Assessment (PISA) science examination from 2015 is conducted to illustrate the proposed methodology.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"330"},"PeriodicalIF":3.9,"publicationDate":"2025-11-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145437005","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Cause for concern: Omitted cross-loadings in measurement models of nonlinear structural equation models. 引起关注的原因:非线性结构方程模型的测量模型中忽略了交叉载荷。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-10-30 DOI: 10.3758/s13428-025-02792-2
Karina Navarro, Karin Schermelleh-Engel

Cross-loadings on non-target factors in measurement models of linear structural equation models (SEM) are often observed in empirical research but frequently disregarded. Previous research on linear SEM has already shown that omitted positive cross-loadings result in overestimated covariances of the latent predictor variables and distorted linear effects. For nonlinear SEM with interaction and quadratic effects, omitting cross-loadings has not been investigated. This study examines the consequences of omitted cross-loadings in both linear and nonlinear SEM using a single empirical dataset and a small simulation study. We focus on the bias patterns that emerge when cross-loadings-reflecting the multidimensionality of items-are either positive or negative and assess how these biases vary with the level of the latent predictor covariance. The empirical analysis reveals that constraining theoretically justified cross-loadings to zero results in systematic over- and underestimation of factor loadings and structural parameters, with more pronounced effects in the nonlinear component of the model, thereby altering the functional form of the relationships between the latent variables. The simulation study further illustrates that the direction and magnitude of bias in both linear and nonlinear SEM depend jointly on the sign of the cross-loadings and the level of the latent predictor covariance. These findings underscore the critical importance of incorporating cross-loadings only theory-driven to maintain an accurate representation of the functional relationships between latent constructs. Practical implications and challenges of including cross-loadings in the model are discussed.

线性结构方程模型(SEM)测量模型中非目标因子的交叉载荷在实证研究中经常被观察到,但往往被忽视。先前对线性扫描电镜的研究已经表明,忽略正向交叉载荷会导致潜在预测变量协方差的高估和线性效应的扭曲。对于具有相互作用和二次效应的非线性扫描电镜,没有研究忽略交叉载荷。本研究使用单个经验数据集和小型模拟研究,检查了线性和非线性SEM中忽略交叉载荷的后果。我们关注的是当交叉加载(反映项目的多维性)是积极或消极时出现的偏差模式,并评估这些偏差如何随着潜在预测因子协方差的水平而变化。实证分析表明,将理论上合理的交叉载荷约束为零,会导致对因子载荷和结构参数的系统性高估和低估,对模型的非线性分量的影响更为明显,从而改变潜在变量之间关系的函数形式。模拟研究进一步表明,线性和非线性SEM中偏差的方向和大小共同取决于交叉载荷的符号和潜在预测因子协方差的水平。这些发现强调了将交叉加载纳入理论驱动的关键重要性,以保持潜在构念之间功能关系的准确表示。讨论了在模型中包含交叉加载的实际意义和挑战。
{"title":"Cause for concern: Omitted cross-loadings in measurement models of nonlinear structural equation models.","authors":"Karina Navarro, Karin Schermelleh-Engel","doi":"10.3758/s13428-025-02792-2","DOIUrl":"10.3758/s13428-025-02792-2","url":null,"abstract":"<p><p>Cross-loadings on non-target factors in measurement models of linear structural equation models (SEM) are often observed in empirical research but frequently disregarded. Previous research on linear SEM has already shown that omitted positive cross-loadings result in overestimated covariances of the latent predictor variables and distorted linear effects. For nonlinear SEM with interaction and quadratic effects, omitting cross-loadings has not been investigated. This study examines the consequences of omitted cross-loadings in both linear and nonlinear SEM using a single empirical dataset and a small simulation study. We focus on the bias patterns that emerge when cross-loadings-reflecting the multidimensionality of items-are either positive or negative and assess how these biases vary with the level of the latent predictor covariance. The empirical analysis reveals that constraining theoretically justified cross-loadings to zero results in systematic over- and underestimation of factor loadings and structural parameters, with more pronounced effects in the nonlinear component of the model, thereby altering the functional form of the relationships between the latent variables. The simulation study further illustrates that the direction and magnitude of bias in both linear and nonlinear SEM depend jointly on the sign of the cross-loadings and the level of the latent predictor covariance. These findings underscore the critical importance of incorporating cross-loadings only theory-driven to maintain an accurate representation of the functional relationships between latent constructs. Practical implications and challenges of including cross-loadings in the model are discussed.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"328"},"PeriodicalIF":3.9,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145408003","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Do eye trackers estimate eyeball rotation? The relationship between tracked eye image feature and estimated saccadic waveform. 眼动仪能估计眼球转动吗?跟踪眼图像特征与估计眼动波形的关系。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-10-30 DOI: 10.3758/s13428-025-02862-5
Marcus Nyström, Diederick C Niehorster, Roy S Hessels, Richard Andersson, Marta K Skrok, Robert Konklewski, Patrycjusz Stremplewski, Maciej Nowakowski, Jakub Lipiński, Szymon Tamborski, Anna Szkulmowska, Maciej Szkulmowski, Ignace T C Hooge

The eyeball is not rigid and deforms during saccades. As a consequence, the saccade waveform recorded by an eye tracker may depend on which structure of the eye is used to estimate eyeball rotation. Here, we systematically describe and compare signals co-recorded from the retina, the cornea (corneal reflection, CR), the pupil, and the lens (fourth Purkinje reflection, P4) during saccades. We found that several commonly used parameters for saccade characterization differ systematically across the signals. For instance, saccades in the retinal signal had earlier onsets compared to saccades in the pupil and the P4 signals. The retinal signal had the smallest saccade amplitude and reached the peak saccade velocity earlier compared to the other signals. At the end of saccades, the retinal signal came to a stop faster than the other signals. We discuss possible explanations that may account for the relationship between the retinal signal and the other signals.

眼球不是刚性的,在扫视过程中会变形。因此,眼动仪记录的扫视波形可能取决于用来估计眼球旋转的眼睛结构。在这里,我们系统地描述并比较了在扫视过程中视网膜、角膜(角膜反射,CR)、瞳孔和晶状体(第四浦肯野反射,P4)共同记录的信号。我们发现几个常用的眼跳表征参数在不同的信号中有系统的不同。例如,与瞳孔和P4信号中的扫视相比,视网膜信号中的扫视出现得更早。与其他信号相比,视网膜信号具有最小的眼跳幅度和较早达到眼跳速度峰值。在扫视结束时,视网膜信号比其他信号更快地停止。我们讨论可能的解释,可能解释视网膜信号和其他信号之间的关系。
{"title":"Do eye trackers estimate eyeball rotation? The relationship between tracked eye image feature and estimated saccadic waveform.","authors":"Marcus Nyström, Diederick C Niehorster, Roy S Hessels, Richard Andersson, Marta K Skrok, Robert Konklewski, Patrycjusz Stremplewski, Maciej Nowakowski, Jakub Lipiński, Szymon Tamborski, Anna Szkulmowska, Maciej Szkulmowski, Ignace T C Hooge","doi":"10.3758/s13428-025-02862-5","DOIUrl":"10.3758/s13428-025-02862-5","url":null,"abstract":"<p><p>The eyeball is not rigid and deforms during saccades. As a consequence, the saccade waveform recorded by an eye tracker may depend on which structure of the eye is used to estimate eyeball rotation. Here, we systematically describe and compare signals co-recorded from the retina, the cornea (corneal reflection, CR), the pupil, and the lens (fourth Purkinje reflection, P4) during saccades. We found that several commonly used parameters for saccade characterization differ systematically across the signals. For instance, saccades in the retinal signal had earlier onsets compared to saccades in the pupil and the P4 signals. The retinal signal had the smallest saccade amplitude and reached the peak saccade velocity earlier compared to the other signals. At the end of saccades, the retinal signal came to a stop faster than the other signals. We discuss possible explanations that may account for the relationship between the retinal signal and the other signals.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"329"},"PeriodicalIF":3.9,"publicationDate":"2025-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12575526/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145408001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MATCH: A toolbox to assess the primary color of real-world objects and generate color-matching stimuli. MATCH:一个评估现实世界物体的原色并产生颜色匹配刺激的工具箱。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-10-29 DOI: 10.3758/s13428-025-02856-3
Jessica N Goetz, Mark B Neider

Real-world stimuli can be difficult to manipulate and control in experimental psychology studies. Color information is frequently used as a variable, and researchers often rely on subjective color labels that imprecisely describe the color information within real-world objects. Here, we describe a new toolbox called MATCH (Matching And Transforming Closely Hued objects) that can easily and objectively quantify and manipulate color information within real-world objects to generate object pairs that match in color. MATCH was designed incorporating theoretical frameworks and conceptual understanding from visual cognition research. Additionally, MATCH provides critical information on the distribution of color and the specific color values of any stimulus set. We also present two experimental studies to validate whether MATCH produces images that are consistent with human visual perception. In the first study, we provide evidence that the stimuli generated by MATCH are perceptually closer in color to a reference object compared to human categorization of object-color pairs. In the second study, we investigated the search for real-world objects with distractors generated by MATCH that matched the target object's color. We found patterns of data that are consistent with current theories of human search behavior. In summary, MATCH allows researchers to carefully control the color of real-world stimuli used in their studies.

在实验心理学研究中,现实世界的刺激是难以操纵和控制的。颜色信息经常被用作变量,研究人员经常依赖于主观的颜色标签,这些标签不精确地描述了现实世界中物体的颜色信息。在这里,我们描述了一个名为MATCH(匹配和转换紧密色调对象)的新工具箱,它可以轻松客观地量化和操作现实世界对象中的颜色信息,以生成颜色匹配的对象对。MATCH的设计结合了视觉认知研究的理论框架和概念理解。此外,MATCH还提供了关于颜色分布和任何刺激集的特定颜色值的关键信息。我们还提出了两个实验研究来验证MATCH是否产生与人类视觉感知一致的图像。在第一项研究中,我们提供的证据表明,与人类对物体-颜色对的分类相比,MATCH产生的刺激在颜色上更接近参考物体。在第二项研究中,我们研究了使用MATCH生成的与目标物体颜色匹配的干扰物来搜索现实世界中的物体。我们发现了与当前人类搜索行为理论相一致的数据模式。总之,MATCH允许研究人员仔细控制他们研究中使用的真实世界刺激的颜色。
{"title":"MATCH: A toolbox to assess the primary color of real-world objects and generate color-matching stimuli.","authors":"Jessica N Goetz, Mark B Neider","doi":"10.3758/s13428-025-02856-3","DOIUrl":"10.3758/s13428-025-02856-3","url":null,"abstract":"<p><p>Real-world stimuli can be difficult to manipulate and control in experimental psychology studies. Color information is frequently used as a variable, and researchers often rely on subjective color labels that imprecisely describe the color information within real-world objects. Here, we describe a new toolbox called MATCH (Matching And Transforming Closely Hued objects) that can easily and objectively quantify and manipulate color information within real-world objects to generate object pairs that match in color. MATCH was designed incorporating theoretical frameworks and conceptual understanding from visual cognition research. Additionally, MATCH provides critical information on the distribution of color and the specific color values of any stimulus set. We also present two experimental studies to validate whether MATCH produces images that are consistent with human visual perception. In the first study, we provide evidence that the stimuli generated by MATCH are perceptually closer in color to a reference object compared to human categorization of object-color pairs. In the second study, we investigated the search for real-world objects with distractors generated by MATCH that matched the target object's color. We found patterns of data that are consistent with current theories of human search behavior. In summary, MATCH allows researchers to carefully control the color of real-world stimuli used in their studies.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"327"},"PeriodicalIF":3.9,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145399573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Errors-in-variables regression as a viable approach to mediation analysis with random error-tainted measurements: Estimation, effectiveness, and an easy-to-use implementation. 变量中的误差回归作为具有随机误差污染度量的中介分析的可行方法:估计、有效性和易于使用的实现。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-10-29 DOI: 10.3758/s13428-025-02783-3
Andrew F Hayes, Paul D Allison, Sean M Alexander

Mediation analysis, popular in many disciplines that rely on behavioral science data analysis techniques, is often conducted using ordinary least squares (OLS) regression analysis methods. Given that one of OLS regression's weaknesses is its susceptibility to estimation bias resulting from unaccounted-for random measurement error in variables on the right-hand sides of the equation, many published mediation analyses certainly contain some and perhaps substantial bias in the direct, indirect, and total effects. In this manuscript, we offer errors-in-variables (EIV) regression as an easy-to-use alternative when a researcher has reasonable estimates of the reliability of the variables in the analysis. In three real-data examples, we show that EIV regression-based mediation analysis produces estimates that are equivalent to those obtained using an alternative, more analytically complex approach that accounts for measurement error-single-indicator latent variable structural equation modeling-yet quite different from the results generated by standard OLS regression that ignores random measurement error. In a small-scale simulation, we also establish that EIV regression successfully recovers the parameters of a mediation model involving variables adulterated by random measurement error while OLS regression generates biased estimates. To facilitate the adoption of EIV regression, we describe an implementation in the PROCESS macro for SPSS, SAS, and R that we believe eliminates most any excuse one can conjure for not accounting for random measurement error when conducting a mediation analysis.

中介分析在许多依赖行为科学数据分析技术的学科中很流行,通常使用普通最小二乘(OLS)回归分析方法进行。考虑到OLS回归的弱点之一是易受估计偏差的影响,这是由于方程右侧变量中未解释的随机测量误差造成的,许多已发表的中介分析肯定在直接、间接和总影响中包含一些甚至可能是实质性的偏差。在这份手稿中,我们提供误差变量(EIV)回归作为一个易于使用的替代方案,当研究人员有合理的估计在分析变量的可靠性。在三个实际数据示例中,我们表明基于EIV回归的中介分析产生的估计值与使用另一种更复杂的分析方法(单指标潜变量结构方程模型)获得的估计值相当,但与忽略随机测量误差的标准OLS回归产生的结果大不相同。在小规模模拟中,我们还建立了EIV回归成功地恢复了包含随机测量误差的变量的中介模型的参数,而OLS回归产生了有偏估计。为了促进EIV回归的采用,我们在PROCESS宏中描述了一个用于SPSS、SAS和R的实现,我们认为它消除了在进行中介分析时不考虑随机测量误差的大多数借口。
{"title":"Errors-in-variables regression as a viable approach to mediation analysis with random error-tainted measurements: Estimation, effectiveness, and an easy-to-use implementation.","authors":"Andrew F Hayes, Paul D Allison, Sean M Alexander","doi":"10.3758/s13428-025-02783-3","DOIUrl":"10.3758/s13428-025-02783-3","url":null,"abstract":"<p><p>Mediation analysis, popular in many disciplines that rely on behavioral science data analysis techniques, is often conducted using ordinary least squares (OLS) regression analysis methods. Given that one of OLS regression's weaknesses is its susceptibility to estimation bias resulting from unaccounted-for random measurement error in variables on the right-hand sides of the equation, many published mediation analyses certainly contain some and perhaps substantial bias in the direct, indirect, and total effects. In this manuscript, we offer errors-in-variables (EIV) regression as an easy-to-use alternative when a researcher has reasonable estimates of the reliability of the variables in the analysis. In three real-data examples, we show that EIV regression-based mediation analysis produces estimates that are equivalent to those obtained using an alternative, more analytically complex approach that accounts for measurement error-single-indicator latent variable structural equation modeling-yet quite different from the results generated by standard OLS regression that ignores random measurement error. In a small-scale simulation, we also establish that EIV regression successfully recovers the parameters of a mediation model involving variables adulterated by random measurement error while OLS regression generates biased estimates. To facilitate the adoption of EIV regression, we describe an implementation in the PROCESS macro for SPSS, SAS, and R that we believe eliminates most any excuse one can conjure for not accounting for random measurement error when conducting a mediation analysis.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"323"},"PeriodicalIF":3.9,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145399606","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
ClozCHI: A cloze test for measuring L2 Chinese proficiency from novice to advanced levels. 完形填空测试,用于测量从初级到高级的第二语言汉语水平。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-10-29 DOI: 10.3758/s13428-025-02834-9
Jianyong Cai, Yuting Han, Xin Jiang

For decades, there has been a dearth of efficient tools for researchers to measure L2 Chinese proficiency. This study introduces ClozCHI, a cloze test developed to assess L2 Chinese proficiency across a range of levels from novice to advanced. Unlike existing Chinese cloze tests, ClozCHI comprises three passages with varying levels of difficulty. Its effectiveness was assessed with 225 L2 Chinese learners who participated in the Hanyu Shuiping Kaoshi (HSK) at Levels 3 to 6 and completed ClozCHI within 2 weeks before or after their HSK tests. Additionally, supplementary data were collected from 97 learners below HSK Level 3 without HSK testing. The psychometric analysis of the ClozCHI using both classical test theory (CTT) and item response theory (IRT) revealed that the test demonstrated appropriate difficulty, good discrimination, and high reliability from novice to advanced levels. ClozCHI scores showed strong correlations with HSK levels, demonstrating criterion-related validity. Confirmatory factor analysis (CFA) further supported its unidimensional structure. ClozCHI was more effective for assessing reading than listening or writing. These findings suggested that ClozCHI is a reliable and valid instrument for assessing L2 Chinese proficiency in research settings. ClozCHI is freely available for researchers from the Open Science Framework repository: https://osf.io/5kcrq/ .

几十年来,研究人员一直缺乏有效的工具来衡量第二语言汉语的熟练程度。本研究介绍了一种名为ClozCHI的完形填空测试,用于评估从初级到高级的二语汉语水平。与现有的汉语完形填空测试不同,ClozCHI由三个不同难度的段落组成。本研究以225名参加汉语水平考试(HSK) 3 - 6级并在HSK考试前后2周内完成ClozCHI测试的汉语学习者为对象,对其有效性进行了评估。此外,还收集了97名HSK三级以下未参加HSK考试的学习者的补充数据。运用经典测试理论(CTT)和项目反应理论(IRT)对ClozCHI量表进行了心理测量分析,结果表明,该量表从初级到高级水平具有适当的难度、良好的判别性和较高的信度。ClozCHI得分与HSK水平有较强的相关性,证明了标准相关的效度。验证性因子分析(CFA)进一步支持其单维结构。ClozCHI在评估阅读方面比听力或写作更有效。这些研究结果表明,ClozCHI是一个可靠和有效的评估第二语言汉语水平的工具。研究人员可以从开放科学框架存储库(https://osf.io/5kcrq/)免费获得ClozCHI。
{"title":"ClozCHI: A cloze test for measuring L2 Chinese proficiency from novice to advanced levels.","authors":"Jianyong Cai, Yuting Han, Xin Jiang","doi":"10.3758/s13428-025-02834-9","DOIUrl":"10.3758/s13428-025-02834-9","url":null,"abstract":"<p><p>For decades, there has been a dearth of efficient tools for researchers to measure L2 Chinese proficiency. This study introduces ClozCHI, a cloze test developed to assess L2 Chinese proficiency across a range of levels from novice to advanced. Unlike existing Chinese cloze tests, ClozCHI comprises three passages with varying levels of difficulty. Its effectiveness was assessed with 225 L2 Chinese learners who participated in the Hanyu Shuiping Kaoshi (HSK) at Levels 3 to 6 and completed ClozCHI within 2 weeks before or after their HSK tests. Additionally, supplementary data were collected from 97 learners below HSK Level 3 without HSK testing. The psychometric analysis of the ClozCHI using both classical test theory (CTT) and item response theory (IRT) revealed that the test demonstrated appropriate difficulty, good discrimination, and high reliability from novice to advanced levels. ClozCHI scores showed strong correlations with HSK levels, demonstrating criterion-related validity. Confirmatory factor analysis (CFA) further supported its unidimensional structure. ClozCHI was more effective for assessing reading than listening or writing. These findings suggested that ClozCHI is a reliable and valid instrument for assessing L2 Chinese proficiency in research settings. ClozCHI is freely available for researchers from the Open Science Framework repository: https://osf.io/5kcrq/ .</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"325"},"PeriodicalIF":3.9,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145399576","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Contrast analysis for competing hypotheses: A tutorial using the R package cofad. 竞争性假设的对比分析:使用R包cofad的教程。
IF 3.9 2区 心理学 Q1 PSYCHOLOGY, EXPERIMENTAL Pub Date : 2025-10-29 DOI: 10.3758/s13428-025-02833-w
Mirka Henninger, Simone Malejka, Johannes Titz

Researchers in psychology traditionally use analysis of variance to examine differences between multiple groups or conditions. A less well-known, but valuable alternative is contrast analysis - a simple statistical method for testing directional, theoretically motivated hypotheses that are defined prior to data collection. In this article, we review the core concepts of contrast analysis for testing hypotheses in between-subjects and within-subjects designs. We also outline and demonstrate the largely unknown possibility of directly testing two competing contrasts against each other. In the tutorial part of the article, we show how such competing-contrast analyses can be conducted in the free, open-source software R using the package cofad. Because competing-contrast analysis is a straightforward, flexible, highly powered, and hypothesis-driven approach, it is a valuable tool to extend the understanding of cognitive and behavioral processes in psychological research.

心理学研究人员传统上使用方差分析来检查多个群体或条件之间的差异。对比分析是一种不太知名但很有价值的替代方法,它是一种简单的统计方法,用于测试在数据收集之前定义的定向的、理论上有动机的假设。在本文中,我们回顾了对比分析的核心概念,以检验受试者之间和受试者内设计的假设。我们还概述并展示了直接测试两个相互竞争的对比的很大程度上未知的可能性。在本文的教程部分,我们将展示如何使用cofad包在免费的开源软件R中进行这种竞争对比分析。因为竞争对比分析是一种直接、灵活、强大和假设驱动的方法,它是一种有价值的工具,可以扩展对心理学研究中认知和行为过程的理解。
{"title":"Contrast analysis for competing hypotheses: A tutorial using the R package cofad.","authors":"Mirka Henninger, Simone Malejka, Johannes Titz","doi":"10.3758/s13428-025-02833-w","DOIUrl":"10.3758/s13428-025-02833-w","url":null,"abstract":"<p><p>Researchers in psychology traditionally use analysis of variance to examine differences between multiple groups or conditions. A less well-known, but valuable alternative is contrast analysis - a simple statistical method for testing directional, theoretically motivated hypotheses that are defined prior to data collection. In this article, we review the core concepts of contrast analysis for testing hypotheses in between-subjects and within-subjects designs. We also outline and demonstrate the largely unknown possibility of directly testing two competing contrasts against each other. In the tutorial part of the article, we show how such competing-contrast analyses can be conducted in the free, open-source software R using the package cofad. Because competing-contrast analysis is a straightforward, flexible, highly powered, and hypothesis-driven approach, it is a valuable tool to extend the understanding of cognitive and behavioral processes in psychological research.</p>","PeriodicalId":8717,"journal":{"name":"Behavior Research Methods","volume":"57 12","pages":"326"},"PeriodicalIF":3.9,"publicationDate":"2025-10-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12572084/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145399527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Behavior Research Methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1