首页 > 最新文献

Psychometrika最新文献

英文 中文
Analysis of Log Data From an International Online Educational Assessment System: A Multi-State Survival Modeling Approach to Reaction Time Between and Across Action Sequence. 国际在线教育评估系统日志数据分析:动作序列间和跨动作序列反应时间的多状态生存建模方法。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-01 DOI: 10.1017/psy.2025.10043
Jina Park, Ick Hoon Jin, Minjeong Jeon

With increasingly available computer-based or online assessments, researchers have shown keen interest in analyzing log data to improve our understanding of test takers' problem-solving processes. In this article, we propose a multi-state survival model (MSM) to action sequence data from log files, focusing on modeling test takers' reaction times between actions, in order to investigate which factors and how they influence test takers' transition speed between actions. We specifically identify the key actions that differentiate correct and incorrect answers, compare transition probabilities between these groups, and analyze their distinct problem-solving patterns. Through simulation studies and sensitivity analyses, we evaluate the robustness of our proposed model. We demonstrate the proposed approach using problem-solving items from the Programme for the International Assessment of Adult Competencies (PIAAC).

随着越来越多的基于计算机或在线的评估,研究人员对分析日志数据以提高我们对考生解决问题过程的理解表现出了浓厚的兴趣。本文针对日志文件中的动作序列数据,提出了一个多状态生存模型(MSM),重点对考生动作间的反应时间进行建模,以探讨哪些因素以及这些因素如何影响考生动作间的过渡速度。我们明确了区分正确和错误答案的关键动作,比较了这些组之间的转移概率,并分析了他们独特的解决问题的模式。通过仿真研究和敏感性分析,我们评估了所提出模型的鲁棒性。我们使用国际成人能力评估项目(PIAAC)中的问题解决项目来演示拟议的方法。
{"title":"Analysis of Log Data From an International Online Educational Assessment System: A Multi-State Survival Modeling Approach to Reaction Time Between and Across Action Sequence.","authors":"Jina Park, Ick Hoon Jin, Minjeong Jeon","doi":"10.1017/psy.2025.10043","DOIUrl":"10.1017/psy.2025.10043","url":null,"abstract":"<p><p>With increasingly available computer-based or online assessments, researchers have shown keen interest in analyzing log data to improve our understanding of test takers' problem-solving processes. In this article, we propose a multi-state survival model (MSM) to action sequence data from log files, focusing on modeling test takers' reaction times between actions, in order to investigate which factors and how they influence test takers' transition speed between actions. We specifically identify the key actions that differentiate correct and incorrect answers, compare transition probabilities between these groups, and analyze their distinct problem-solving patterns. Through simulation studies and sensitivity analyses, we evaluate the robustness of our proposed model. We demonstrate the proposed approach using problem-solving items from the Programme for the International Assessment of Adult Competencies (PIAAC).</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1506-1535"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660000/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144978850","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian Nonparametric Models for Multiple Raters: A General Statistical Framework. 多评分者的贝叶斯非参数模型:一个通用的统计框架。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-01 Epub Date: 2025-08-11 DOI: 10.1017/psy.2025.10035
Giuseppe Mignemi, Ioanna Manolopoulou

Rating procedure is crucial in many applied fields (e.g., educational, clinical, emergency). In these contexts, a rater (e.g., teacher, doctor) scores a subject (e.g., student, doctor) on a rating scale. Given raters' variability, several statistical methods have been proposed for assessing and improving the quality of ratings. The analysis and the estimate of the Intraclass Correlation Coefficient (ICC) are major concerns in such cases. As evidenced by the literature, ICC might differ across different subgroups of raters and might be affected by contextual factors and subject heterogeneity. Model estimation in the presence of heterogeneity has been one of the recent challenges in this research line. Consequently, several methods have been proposed to address this issue under a parametric multilevel modelling framework, in which strong distributional assumptions are made. We propose a more flexible model under the Bayesian nonparametric (BNP) framework, in which most of those assumptions are relaxed. By eliciting hierarchical discrete nonparametric priors, the model accommodates clusters among raters and subjects, naturally accounts for heterogeneity, and improves estimates' accuracy. We propose a general BNP heteroscedastic framework to analyze continuous and coarse rating data and possible latent differences among subjects and raters. The estimated densities are used to make inferences about the rating process and the quality of the ratings. By exploiting a stick-breaking representation of the discrete nonparametric priors, a general class of ICC indices might be derived for these models. Our method allows us to independently identify latent similarities between subjects and raters and can be applied in precise education to improve personalized teaching programs or interventions. Theoretical results about the ICC are provided together with computational strategies. Simulations and a real-world application are presented, and possible future directions are discussed.

评级程序在许多应用领域(如教育、临床、急救)至关重要。在这些情况下,评价者(例如,老师,医生)在评价表上给一个主体(例如,学生,医生)打分。鉴于评级者的可变性,已经提出了几种统计方法来评估和提高评级的质量。在这种情况下,类内相关系数(ICC)的分析和估计是主要关注的问题。正如文献所证明的那样,ICC可能在不同的评分者亚组中有所不同,并可能受到背景因素和受试者异质性的影响。存在异质性的模型估计是这一研究领域最近面临的挑战之一。因此,提出了几种方法在参数化多层建模框架下解决这一问题,其中做出了强分布假设。我们在贝叶斯非参数框架下提出了一个更灵活的模型,其中大多数假设都是宽松的。通过引出分层离散非参数先验,该模型适应了评分者和受试者之间的聚类,自然地解释了异质性,提高了估计的准确性。我们提出了一个通用的BNP异方差框架来分析连续和粗糙的评分数据以及受试者和评分者之间可能存在的潜在差异。估计的密度用于对评级过程和评级质量进行推断。通过利用离散非参数先验的断裂表示,可以为这些模型导出一般类型的ICC指标。我们的方法使我们能够独立地识别受试者和评分者之间潜在的相似性,并可应用于精确教育,以改进个性化的教学计划或干预措施。给出了有关ICC的理论结果和计算策略。给出了仿真和实际应用,并讨论了可能的未来发展方向。
{"title":"Bayesian Nonparametric Models for Multiple Raters: A General Statistical Framework.","authors":"Giuseppe Mignemi, Ioanna Manolopoulou","doi":"10.1017/psy.2025.10035","DOIUrl":"10.1017/psy.2025.10035","url":null,"abstract":"<p><p>Rating procedure is crucial in many applied fields (e.g., educational, clinical, emergency). In these contexts, a rater (e.g., teacher, doctor) scores a subject (e.g., student, doctor) on a rating scale. Given raters' variability, several statistical methods have been proposed for assessing and improving the quality of ratings. The analysis and the estimate of the Intraclass Correlation Coefficient (ICC) are major concerns in such cases. As evidenced by the literature, ICC might differ across different subgroups of raters and might be affected by contextual factors and subject heterogeneity. Model estimation in the presence of heterogeneity has been one of the recent challenges in this research line. Consequently, several methods have been proposed to address this issue under a parametric multilevel modelling framework, in which strong distributional assumptions are made. We propose a more flexible model under the Bayesian nonparametric (BNP) framework, in which most of those assumptions are relaxed. By eliciting hierarchical discrete nonparametric priors, the model accommodates clusters among raters and subjects, naturally accounts for heterogeneity, and improves estimates' accuracy. We propose a general BNP heteroscedastic framework to analyze continuous and coarse rating data and possible latent differences among subjects and raters. The estimated densities are used to make inferences about the rating process and the quality of the ratings. By exploiting a stick-breaking representation of the discrete nonparametric priors, a general class of ICC indices might be derived for these models. Our method allows us to independently identify latent similarities between subjects and raters and can be applied in <i>precise education</i> to improve personalized teaching programs or interventions. Theoretical results about the ICC are provided together with computational strategies. Simulations and a real-world application are presented, and possible future directions are discussed.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1445-1480"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660027/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144818305","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A New Fit Assessment Framework for Common Factor Models Using Generalized Residuals. 基于广义残差的共因子模型拟合评估新框架。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-01 Epub Date: 2025-08-07 DOI: 10.1017/psy.2025.10037
Youjin Sung, Youngjin Han, Yang Liu

Assessing fit in common factor models solely through the lens of mean and covariance structures, as is commonly done with conventional goodness-of-fit (GOF) assessments, may overlook critical aspects of misfit, potentially leading to misleading conclusions. To achieve more flexible fit assessment, we extend the theory of generalized residuals (Haberman & Sinharay, 2013), originally developed for models with categorical data, to encompass more general measurement models. Within this extended framework, we propose several fit test statistics designed to evaluate various parametric assumptions involved in common factor models. The examples include assessing the distributional assumptions of latent variables and the functional form assumptions of individual manifest variables. The performance of the proposed statistics is examined through simulation studies and an empirical data analysis. Our findings suggest that generalized residuals are promising tools for detecting misfit in measurement models, often masked when assessed by conventional GOF testing methods.

仅仅通过均值和协方差结构来评估共同因素模型的拟合,就像传统的拟合优度(GOF)评估一样,可能会忽略不拟合的关键方面,从而可能导致误导性结论。为了实现更灵活的拟合评估,我们扩展了广义残差理论(Haberman & Sinharay, 2013),该理论最初是为具有分类数据的模型开发的,以涵盖更一般的测量模型。在这个扩展框架内,我们提出了几个拟合检验统计,旨在评估公共因素模型中涉及的各种参数假设。这些例子包括评估潜在变量的分布假设和单个显变量的函数形式假设。通过模拟研究和实证数据分析来检验所提出的统计数据的性能。我们的研究结果表明,广义残差是检测测量模型中不拟合的有希望的工具,通常在传统的GOF测试方法评估时被掩盖。
{"title":"A New Fit Assessment Framework for Common Factor Models Using Generalized Residuals.","authors":"Youjin Sung, Youngjin Han, Yang Liu","doi":"10.1017/psy.2025.10037","DOIUrl":"10.1017/psy.2025.10037","url":null,"abstract":"<p><p>Assessing fit in common factor models solely through the lens of mean and covariance structures, as is commonly done with conventional goodness-of-fit (GOF) assessments, may overlook critical aspects of misfit, potentially leading to misleading conclusions. To achieve more flexible fit assessment, we extend the theory of generalized residuals (Haberman & Sinharay, 2013), originally developed for models with categorical data, to encompass more general measurement models. Within this extended framework, we propose several fit test statistics designed to evaluate various parametric assumptions involved in common factor models. The examples include assessing the distributional assumptions of latent variables and the functional form assumptions of individual manifest variables. The performance of the proposed statistics is examined through simulation studies and an empirical data analysis. Our findings suggest that generalized residuals are promising tools for detecting misfit in measurement models, often masked when assessed by conventional GOF testing methods.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1419-1444"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660002/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144796128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Empathic Accuracy: Penalized Functional Alignment Method to Correct Temporal Misalignment in Real-Time Emotional Perception. 增强共情准确性:校正实时情绪知觉时间错位的惩罚性功能对齐方法。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-01 Epub Date: 2025-09-05 DOI: 10.1017/psy.2025.10040
Linh H Nghiem, Jing Cao, Chrystyna D Kouros, Chul Moon

Empathic accuracy (EA) is the ability to accurately understand another person's thoughts and feelings, which is crucial for social and psychological interactions. Traditionally, EA is assessed by comparing a perceiver's moment-to-moment ratings of a target's emotional state with the target's own self-reported ratings at corresponding time points. However, misalignments between these two sequences are common due to the complexity of emotional interpretation and individual differences in behavioral responses. Conventional methods often ignore or oversimplify these misalignments, for instance by assuming a fixed time lag, which can introduce bias into EA estimates. To address this, we propose a novel alignment approach that captures a wide range of misalignment patterns. Our method leverages the square-root velocity framework to decompose emotional rating trajectories into amplitude and phase components. To ensure realistic alignment, we introduce a regularization constraint that limits temporal shifts to ranges consistent with human perceptual capabilities. This alignment is efficiently implemented using a constrained dynamic programming algorithm. We validate our method through simulations and real-world applications involving video and music datasets, demonstrating its superior performance over traditional techniques.

移情准确性(EA)是一种准确理解他人想法和感受的能力,这对社会和心理互动至关重要。传统上,EA的评估是通过比较感知者对目标情绪状态的即时评分和目标在相应时间点的自我报告评分来进行的。然而,由于情绪解释的复杂性和行为反应的个体差异,这两种序列之间的错位是常见的。传统的方法经常忽略或过度简化这些偏差,例如,通过假设一个固定的时间滞后,这可能会在EA估计中引入偏差。为了解决这个问题,我们提出了一种新的校准方法,可以捕获广泛的不校准模式。我们的方法利用平方根速度框架将情绪评级轨迹分解为振幅和相位分量。为了确保现实的一致性,我们引入了一个正则化约束,将时间偏移限制在与人类感知能力一致的范围内。使用约束动态规划算法有效地实现了这种对齐。我们通过模拟和涉及视频和音乐数据集的实际应用验证了我们的方法,证明了其优于传统技术的性能。
{"title":"Enhancing Empathic Accuracy: Penalized Functional Alignment Method to Correct Temporal Misalignment in Real-Time Emotional Perception.","authors":"Linh H Nghiem, Jing Cao, Chrystyna D Kouros, Chul Moon","doi":"10.1017/psy.2025.10040","DOIUrl":"10.1017/psy.2025.10040","url":null,"abstract":"<p><p>Empathic accuracy (EA) is the ability to accurately understand another person's thoughts and feelings, which is crucial for social and psychological interactions. Traditionally, EA is assessed by comparing a perceiver's moment-to-moment ratings of a target's emotional state with the target's own self-reported ratings at corresponding time points. However, misalignments between these two sequences are common due to the complexity of emotional interpretation and individual differences in behavioral responses. Conventional methods often ignore or oversimplify these misalignments, for instance by assuming a fixed time lag, which can introduce bias into EA estimates. To address this, we propose a novel alignment approach that captures a wide range of misalignment patterns. Our method leverages the square-root velocity framework to decompose emotional rating trajectories into amplitude and phase components. To ensure realistic alignment, we introduce a regularization constraint that limits temporal shifts to ranges consistent with human perceptual capabilities. This alignment is efficiently implemented using a constrained dynamic programming algorithm. We validate our method through simulations and real-world applications involving video and music datasets, demonstrating its superior performance over traditional techniques.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1536-1557"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660001/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145001943","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Designing Learning Intervention Studies: Identifiability of Heterogeneous Hidden Markov Models. 设计学习干预研究:异质隐马尔可夫模型的可辨识性。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-09-01 Epub Date: 2025-07-22 DOI: 10.1017/psy.2025.10024
Ying Liu, Steven Culpepper

Hidden Markov models (HMMs) are popular for modeling complex, longitudinal data. Existing identifiability theory for conventional HMMs assume emission probabilities are constant over time and the Markov chain governing transitions among the hidden states is irreducible, which are assumptions that may not be applicable in all educational and psychological research settings. We generalize existing conditions on homogeneous HMMs by considering heterogeneous HMMs with time-varying emission probabilities and the potential for absorbing states. Researchers are investigating a family of models known as restricted HMMs (RHMMs), which combine HMMs and restricted latent class models (RLCMs) to provide fine-grained classification of educationally and psychologically relevant attribute profiles over time. These RHMMs leverage the benefits of RLCMs and HMMs to understand changes in attribute profiles within longitudinal designs. The identifiability of RHMM parameters is a critical issue for ensuring successful applications and accurate statistical inference regarding factors that impact outcomes in intervention studies. We establish identifiability conditions for RHMMs. The new identifiability conditions for heterogeneous HMMs and RHMMs provide researchers insights for designing interventions. We discuss different types of assessment designs and the implications for practice. We present an application of a heterogeneous HMM to daily measures of positive and negative affect.

隐马尔可夫模型(hmm)在复杂的纵向数据建模中非常流行。传统hmm的现有可识别性理论假设发射概率随时间不变,并且控制隐藏状态之间转换的马尔可夫链是不可约的,这些假设可能不适用于所有的教育和心理学研究环境。通过考虑具有随时间变化的发射概率和吸收态势的非均匀hmm,推广了均匀hmm的现有条件。研究人员正在研究一类被称为受限hmm模型(rhmm)的模型,它将hmm模型和受限潜在类别模型(rlcm)结合起来,以提供随时间推移的教育和心理相关属性概况的细粒度分类。这些rhmm利用rlcm和hmm的优点来理解纵向设计中属性概要文件的变化。RHMM参数的可识别性是确保干预研究中影响结果的因素成功应用和准确统计推断的关键问题。我们建立了rhmm的可识别性条件。异质hmm和非均匀hmm的新识别条件为研究人员设计干预措施提供了新的见解。我们将讨论不同类型的评估设计及其对实践的影响。我们提出了一个异质HMM应用于积极和消极影响的日常测量。
{"title":"Designing Learning Intervention Studies: Identifiability of Heterogeneous Hidden Markov Models.","authors":"Ying Liu, Steven Culpepper","doi":"10.1017/psy.2025.10024","DOIUrl":"10.1017/psy.2025.10024","url":null,"abstract":"<p><p>Hidden Markov models (HMMs) are popular for modeling complex, longitudinal data. Existing identifiability theory for conventional HMMs assume emission probabilities are constant over time and the Markov chain governing transitions among the hidden states is irreducible, which are assumptions that may not be applicable in all educational and psychological research settings. We generalize existing conditions on homogeneous HMMs by considering heterogeneous HMMs with time-varying emission probabilities and the potential for absorbing states. Researchers are investigating a family of models known as restricted HMMs (RHMMs), which combine HMMs and restricted latent class models (RLCMs) to provide fine-grained classification of educationally and psychologically relevant attribute profiles over time. These RHMMs leverage the benefits of RLCMs and HMMs to understand changes in attribute profiles within longitudinal designs. The identifiability of RHMM parameters is a critical issue for ensuring successful applications and accurate statistical inference regarding factors that impact outcomes in intervention studies. We establish identifiability conditions for RHMMs. The new identifiability conditions for heterogeneous HMMs and RHMMs provide researchers insights for designing interventions. We discuss different types of assessment designs and the implications for practice. We present an application of a heterogeneous HMM to daily measures of positive and negative affect.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1258-1283"},"PeriodicalIF":3.1,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12660025/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144683608","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explaining Performance Gaps with Problem-Solving Process Data via Latent Class Mediation Analysis. 用潜在类中介分析解释问题解决过程数据的绩效差距。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-11 DOI: 10.1017/psy.2025.10038
Sunbeom Kwon, Susu Zhang

Process data, in particular, log data collected from a computerized test, documents the sequence of actions performed by an examinee in pursuit of solving a problem, affording an opportunity to understand test-taking behavioral patterns that account for demographic group differences in key outcomes of interest, for instance, final score on a cognitive item. Addressing this aim, this article proposes a latent class mediation analysis procedure. Using continuous process features extracted from action sequence data as indicators, latent classes underlying the test-taking behavior are identified in a latent class mediation model, where an examinee's nominal latent class membership enters as the mediator between the observed grouping and outcome variables. A headlong search algorithm for selecting the subset of process features that maximizes the total indirect effect of the latent class mediator is implemented. The proposed procedure is validated with a series of simulations. An application to a large-scale assessment highlights how the proposed method can be used to explain performance gaps between students with learning disability and their typically developing peers on the National Assessment of Educational Progress (NAEP) math assessment.

过程数据,特别是从计算机化测试中收集的日志数据,记录了考生在追求解决问题的过程中所采取的一系列行动,提供了一个机会来了解考试行为模式,这些行为模式解释了关键结果的人口统计学组差异,例如,认知项目的最终分数。针对这一目的,本文提出了一个潜在类中介分析程序。使用从动作序列数据中提取的连续过程特征作为指标,在潜在类别中介模型中识别出考试行为背后的潜在类别,其中考生的名义潜在类别成员作为观察到的分组和结果变量之间的中介。实现了一种快速搜索算法,用于选择使潜在类中介的总间接效应最大化的过程特征子集。通过一系列的仿真验证了该方法的有效性。一项大规模评估的应用突出了所提出的方法如何用于解释在国家教育进步评估(NAEP)数学评估中有学习障碍的学生与正常发展的同龄人之间的表现差距。
{"title":"Explaining Performance Gaps with Problem-Solving Process Data via Latent Class Mediation Analysis.","authors":"Sunbeom Kwon, Susu Zhang","doi":"10.1017/psy.2025.10038","DOIUrl":"10.1017/psy.2025.10038","url":null,"abstract":"<p><p>Process data, in particular, log data collected from a computerized test, documents the sequence of actions performed by an examinee in pursuit of solving a problem, affording an opportunity to understand test-taking behavioral patterns that account for demographic group differences in key outcomes of interest, for instance, final score on a cognitive item. Addressing this aim, this article proposes a latent class mediation analysis procedure. Using continuous process features extracted from action sequence data as indicators, latent classes underlying the test-taking behavior are identified in a latent class mediation model, where an examinee's nominal latent class membership enters as the mediator between the observed grouping and outcome variables. A headlong search algorithm for selecting the subset of process features that maximizes the total indirect effect of the latent class mediator is implemented. The proposed procedure is validated with a series of simulations. An application to a large-scale assessment highlights how the proposed method can be used to explain performance gaps between students with learning disability and their typically developing peers on the National Assessment of Educational Progress (NAEP) math assessment.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-29"},"PeriodicalIF":3.1,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805201/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144818319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Detecting Differential Item Functioning across Multiple Groups using Group Pairwise Penalty. 用组对惩罚法检测不同组的不同项目功能。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-08-11 DOI: 10.1017/psy.2025.10034
Weicong Lyu, Chun Wang, Gongjun Xu
{"title":"Detecting Differential Item Functioning across Multiple Groups using Group Pairwise Penalty.","authors":"Weicong Lyu, Chun Wang, Gongjun Xu","doi":"10.1017/psy.2025.10034","DOIUrl":"10.1017/psy.2025.10034","url":null,"abstract":"","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-41"},"PeriodicalIF":3.1,"publicationDate":"2025-08-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12805199/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144818318","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explaining Person-by-Item Responses using Person- and Item-Level Predictors via Random Forests and Interpretable Machine Learning in Explanatory Item Response Models. 通过随机森林和解释性项目反应模型中的可解释机器学习,使用个人和项目水平预测因子解释个人对项目的反应。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-07-31 DOI: 10.1017/psy.2025.10032
Sun-Joo Cho, Goodwin Amanda, Jorge Salas, Sophia Mueller

This study incorporates a random forest (RF) approach to probe complex interactions and nonlinearity among predictors into an item response model with the goal of using a hybrid approach to outperform either an RF or explanatory item response model (EIRM) only in explaining item responses. In the specified model, called EIRM-RF, predicted values using RF are added as a predictor in EIRM to model the nonlinear and interaction effects of person- and item-level predictors in person-by-item response data, while accounting for random effects over persons and items. The results of the EIRM-RF are probed with interpretable machine learning (ML) methods, including feature importance measures, partial dependence plots, accumulated local effect plots, and the H-statistic. The EIRM-RF and the interpretable methods are illustrated using an empirical data set to explain differences in reading comprehension in digital versus paper mediums, and the results of EIRM-RF are compared with those of EIRM and RF to show empirical differences in modeling the effects of predictors and random effects among EIRM, RF, and EIRM-RF. In addition, simulation studies are conducted to compare model accuracy among the three models and to evaluate the performance of interpretable ML methods.

本研究采用随机森林(RF)方法来探索预测因子之间的复杂相互作用和非线性,并将其纳入项目反应模型,目的是使用混合方法在解释项目反应方面优于随机森林或解释性项目反应模型(EIRM)。在指定的模型中,称为EIRM-RF,使用RF的预测值被添加为EIRM中的预测因子,以模拟个人和项目层面预测因子在个人-项目响应数据中的非线性和相互作用效应,同时考虑到人员和项目的随机效应。EIRM-RF的结果用可解释的机器学习(ML)方法进行了探测,包括特征重要性度量、部分依赖图、累积局部效应图和h统计量。本文使用一组经验数据来说明EIRM-RF和可解释方法,以解释数字媒介与纸质媒介在阅读理解方面的差异,并将EIRM-RF的结果与EIRM和RF的结果进行比较,以显示EIRM、RF和EIRM-RF在预测因子和随机效应建模方面的经验差异。此外,还进行了仿真研究,比较了三种模型之间的模型精度,并评估了可解释ML方法的性能。
{"title":"Explaining Person-by-Item Responses using Person- and Item-Level Predictors via Random Forests and Interpretable Machine Learning in Explanatory Item Response Models.","authors":"Sun-Joo Cho, Goodwin Amanda, Jorge Salas, Sophia Mueller","doi":"10.1017/psy.2025.10032","DOIUrl":"10.1017/psy.2025.10032","url":null,"abstract":"<p><p>This study incorporates a random forest (RF) approach to probe complex interactions and nonlinearity among predictors into an item response model with the goal of using a hybrid approach to outperform either an RF or explanatory item response model (EIRM) only in explaining item responses. In the specified model, called EIRM-RF, predicted values using RF are added as a predictor in EIRM to model the nonlinear and interaction effects of person- and item-level predictors in person-by-item response data, while accounting for random effects over persons and items. The results of the EIRM-RF are probed with interpretable machine learning (ML) methods, including feature importance measures, partial dependence plots, accumulated local effect plots, and the <i>H</i>-statistic. The EIRM-RF and the interpretable methods are illustrated using an empirical data set to explain differences in reading comprehension in digital versus paper mediums, and the results of EIRM-RF are compared with those of EIRM and RF to show empirical differences in modeling the effects of predictors and random effects among EIRM, RF, and EIRM-RF. In addition, simulation studies are conducted to compare model accuracy among the three models and to evaluate the performance of interpretable ML methods.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-38"},"PeriodicalIF":3.1,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144755132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Item Response Models for Rating Relational Data. 评价关系数据的项目反应模型。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-06-30 DOI: 10.1017/psy.2025.10016
Chih-Han Leng, Ulf Böckenholt, Hsuan-Wei Lee, Grace Yao

This article introduces item response models for rating relational data. The relational data are obtained via ratings of senders and receivers in a directed network. The proposed models allow comparisons of senders and receivers on a one-dimensional latent scale while accounting for unobserved homophilic relationships. We show that the approach effectively captures reciprocity and clustering phenomena in the relational data. We estimate model parameters using a Bayesian specification and utilize Markov Chain Monte Carlo methods to approximate the full conditional posterior distributions. Simulation studies demonstrate that model parameters can be recovered satisfactorily even when the dimensionality of the network is small. We also present an extensive empirical application to illustrate the usefulness of the proposed models for complete and incomplete networks.

本文介绍了评价关系数据的项目响应模型。关系数据是通过有向网络中发送方和接收方的评级获得的。提出的模型允许在一维潜在尺度上比较发送者和接收者,同时考虑未观察到的同性关系。我们表明,该方法有效地捕获了关系数据中的互惠和聚类现象。我们使用贝叶斯规范估计模型参数,并利用马尔可夫链蒙特卡罗方法近似全条件后验分布。仿真研究表明,当网络的维数较小时,模型参数也能得到满意的恢复。我们还提出了一个广泛的经验应用,以说明所提出的模型对完全和不完全网络的有用性。
{"title":"Item Response Models for Rating Relational Data.","authors":"Chih-Han Leng, Ulf Böckenholt, Hsuan-Wei Lee, Grace Yao","doi":"10.1017/psy.2025.10016","DOIUrl":"10.1017/psy.2025.10016","url":null,"abstract":"<p><p>This article introduces item response models for rating relational data. The relational data are obtained via ratings of senders and receivers in a directed network. The proposed models allow comparisons of senders and receivers on a one-dimensional latent scale while accounting for unobserved homophilic relationships. We show that the approach effectively captures reciprocity and clustering phenomena in the relational data. We estimate model parameters using a Bayesian specification and utilize Markov Chain Monte Carlo methods to approximate the full conditional posterior distributions. Simulation studies demonstrate that model parameters can be recovered satisfactorily even when the dimensionality of the network is small. We also present an extensive empirical application to illustrate the usefulness of the proposed models for complete and incomplete networks.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-30"},"PeriodicalIF":3.1,"publicationDate":"2025-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483722/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144531135","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Continuous-Time Dynamic Factor Model for Intensive Longitudinal Data Arising from Mobile Health Studies. 流动健康研究中密集纵向数据的连续时间动态因子模型。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-06-16 DOI: 10.1017/psy.2025.10023
Madeline R Abbott, Walter H Dempsey, Inbal Nahum-Shani, Cho Y Lam, David W Wetter, Jeremy M G Taylor

Intensive longitudinal data (ILD) collected in mobile health (mHealth) studies contain rich information on the dynamics of multiple outcomes measured frequently over time. Motivated by an mHealth study in which participants self-report the intensity of many emotions multiple times per day, we describe a dynamic factor model that summarizes ILD as a low-dimensional, interpretable latent process. This model consists of (i) a measurement submodel-a factor model-that summarizes the multivariate longitudinal outcome as lower-dimensional latent variables and (ii) a structural submodel-an Ornstein-Uhlenbeck (OU) stochastic process-that captures the dynamics of the multivariate latent process in continuous time. We derive a closed-form likelihood for the marginal distribution of the outcome and the computationally-simpler sparse precision matrix for the OU process. We propose a block coordinate descent algorithm for estimation and use simulation studies to show that it has good statistical properties with ILD. Then, we use our method to analyze data from the mHealth study. We summarize the dynamics of 18 emotions using models with one, two, and three time-varying latent factors, which correspond to different behavioral science theories of emotions. We demonstrate how results can be interpreted to help improve behavioral science theories of momentary emotions, latent psychological states, and their dynamics.

在移动健康(mHealth)研究中收集的密集纵向数据(ILD)包含了随时间频繁测量的多种结果动态的丰富信息。在一项移动健康研究的激励下,参与者每天多次自我报告许多情绪的强度,我们描述了一个动态因素模型,将ILD总结为一个低维的、可解释的潜在过程。该模型包括(i)一个测量子模型-一个将多变量纵向结果总结为低维潜在变量的因子模型和(ii)一个结构子模型-一个Ornstein-Uhlenbeck (OU)随机过程-捕捉连续时间内多变量潜在过程的动态。我们导出了结果的边际分布的封闭似然形式和OU过程的计算更简单的稀疏精度矩阵。我们提出了一种块坐标下降算法用于估计,并通过仿真研究表明它具有良好的ILD统计性能。然后,我们使用我们的方法来分析来自移动健康研究的数据。我们使用具有一个、两个和三个时变潜在因素的模型总结了18种情绪的动态,这些模型对应于不同的情绪行为科学理论。我们展示了如何解释结果,以帮助改进瞬间情绪、潜在心理状态及其动态的行为科学理论。
{"title":"A Continuous-Time Dynamic Factor Model for Intensive Longitudinal Data Arising from Mobile Health Studies.","authors":"Madeline R Abbott, Walter H Dempsey, Inbal Nahum-Shani, Cho Y Lam, David W Wetter, Jeremy M G Taylor","doi":"10.1017/psy.2025.10023","DOIUrl":"10.1017/psy.2025.10023","url":null,"abstract":"<p><p>Intensive longitudinal data (ILD) collected in mobile health (mHealth) studies contain rich information on the dynamics of multiple outcomes measured frequently over time. Motivated by an mHealth study in which participants self-report the intensity of many emotions multiple times per day, we describe a dynamic factor model that summarizes ILD as a low-dimensional, interpretable latent process. This model consists of (i) a measurement submodel-a factor model-that summarizes the multivariate longitudinal outcome as lower-dimensional latent variables and (ii) a structural submodel-an Ornstein-Uhlenbeck (OU) stochastic process-that captures the dynamics of the multivariate latent process in continuous time. We derive a closed-form likelihood for the marginal distribution of the outcome and the computationally-simpler sparse precision matrix for the OU process. We propose a block coordinate descent algorithm for estimation and use simulation studies to show that it has good statistical properties with ILD. Then, we use our method to analyze data from the mHealth study. We summarize the dynamics of 18 emotions using models with one, two, and three time-varying latent factors, which correspond to different behavioral science theories of emotions. We demonstrate how results can be interpreted to help improve behavioral science theories of momentary emotions, latent psychological states, and their dynamics.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-22"},"PeriodicalIF":3.1,"publicationDate":"2025-06-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12439307/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144303611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Psychometrika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1