首页 > 最新文献

Psychometrika最新文献

英文 中文
Testing of Reverse Causality Using Semi-Supervised Machine Learning. 使用半监督机器学习测试反向因果关系。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-04-07 DOI: 10.1017/psy.2025.13
Nan Zhang, Heng Xu, Manuel J Vaulont, Zhen Zhang

Two potential obstacles stand between the observation of a statistical correlation and the design (and deployment) of an effective intervention, omitted variable bias and reverse causality. Whereas the former has received ample attention, comparably scant focus has been devoted to the latter in the methodological literature. Many existing methods for reverse causality testing commence by postulating a structural model that may suffer from widely recognized issues such as the difficulty of properly setting temporal lags, which are critical to model validity. In this article, we draw upon advances in machine learning, specifically the recently established link between causal direction and the effectiveness of semi-supervised learning algorithms, to develop a novel method for reverse causality testing that circumvents many of the assumptions required by traditional methods. Mathematical analysis and simulation studies were carried out to demonstrate the effectiveness of our method. We also performed tests over a real-world dataset to show how our method may be used to identify causal relationships in practice.

在统计相关性的观察和有效干预的设计(和部署)之间存在两个潜在的障碍,即遗漏的变量偏差和反向因果关系。虽然前者得到了充分的关注,但在方法论文献中,对后者的关注相对较少。许多现有的反向因果关系检验方法都是从假设一个结构模型开始的,该模型可能存在广泛认识到的问题,例如难以适当设置对模型有效性至关重要的时间滞后。在本文中,我们借鉴了机器学习的进展,特别是最近建立的因果方向与半监督学习算法有效性之间的联系,开发了一种新的反向因果检验方法,该方法绕过了传统方法所需的许多假设。数学分析和仿真研究证明了该方法的有效性。我们还对真实世界的数据集进行了测试,以显示我们的方法如何在实践中用于识别因果关系。
{"title":"Testing of Reverse Causality Using Semi-Supervised Machine Learning.","authors":"Nan Zhang, Heng Xu, Manuel J Vaulont, Zhen Zhang","doi":"10.1017/psy.2025.13","DOIUrl":"10.1017/psy.2025.13","url":null,"abstract":"<p><p>Two potential obstacles stand between the observation of a statistical correlation and the design (and deployment) of an effective intervention, <i>omitted variable bias</i> and <i>reverse causality</i>. Whereas the former has received ample attention, comparably scant focus has been devoted to the latter in the methodological literature. Many existing methods for reverse causality testing commence by postulating a structural model that may suffer from widely recognized issues such as the difficulty of properly setting temporal lags, which are critical to model validity. In this article, we draw upon advances in machine learning, specifically the recently established link between causal direction and the effectiveness of semi-supervised learning algorithms, to develop a novel method for reverse causality testing that circumvents many of the assumptions required by traditional methods. Mathematical analysis and simulation studies were carried out to demonstrate the effectiveness of our method. We also performed tests over a real-world dataset to show how our method may be used to identify causal relationships in practice.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-25"},"PeriodicalIF":3.1,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483713/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143796873","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Logistic Multidimensional Data Analysis for Ordinal Response Variables Using a Cumulative Link Function. 基于累积链接函数的有序响应变量的Logistic多维数据分析。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-03-27 DOI: 10.1017/psy.2025.10
Mark de Rooij, Ligaya Breemer, Dion Woestenburg, Frank Busing

We present a multidimensional data analysis framework for the analysis of ordinal response variables. Underlying the ordinal variables, we assume a continuous latent variable, leading to cumulative logit models. The framework includes unsupervised methods, when no predictor variables are available, and supervised methods, when predictor variables are available. We distinguish between dominance variables and proximity variables, where dominance variables are analyzed using inner product models, whereas the proximity variables are analyzed using distance models. An expectation-majorization-minimization algorithm is derived for estimation of the parameters of the models. We illustrate our methodology with three empirical data sets highlighting the advantages of the proposed framework. A simulation study is conducted to evaluate the performance of the algorithm.

我们提出了一个多维数据分析框架,用于分析有序响应变量。在有序变量的基础上,我们假设一个连续的潜在变量,导致累积logit模型。该框架包括无监督方法(当没有可用的预测变量时)和监督方法(当可用的预测变量时)。我们区分优势变量和接近变量,其中优势变量使用内积模型进行分析,而接近变量使用距离模型进行分析。推导了一种期望-最大化-最小化算法来估计模型的参数。我们用三个经验数据集来说明我们的方法,突出了所提议框架的优势。通过仿真研究对该算法的性能进行了评价。
{"title":"Logistic Multidimensional Data Analysis for Ordinal Response Variables Using a Cumulative Link Function.","authors":"Mark de Rooij, Ligaya Breemer, Dion Woestenburg, Frank Busing","doi":"10.1017/psy.2025.10","DOIUrl":"10.1017/psy.2025.10","url":null,"abstract":"<p><p>We present a multidimensional data analysis framework for the analysis of ordinal response variables. Underlying the ordinal variables, we assume a continuous latent variable, leading to cumulative logit models. The framework includes unsupervised methods, when no predictor variables are available, and supervised methods, when predictor variables are available. We distinguish between dominance variables and proximity variables, where dominance variables are analyzed using inner product models, whereas the proximity variables are analyzed using distance models. An expectation-majorization-minimization algorithm is derived for estimation of the parameters of the models. We illustrate our methodology with three empirical data sets highlighting the advantages of the proposed framework. A simulation study is conducted to evaluate the performance of the algorithm.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-37"},"PeriodicalIF":3.1,"publicationDate":"2025-03-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483703/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144026859","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Multiphase Structured Latent Curve Models for Count Response Data: A Re-Analysis of the Acquisition of Morphology in English. 计数反应数据的多相结构潜曲线模型:英语形态学习得的再分析。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-03-18 DOI: 10.1017/psy.2025.8
Marian M Strazzeri, Jeffrey R Harring, Nan Bernstein Ratner

Structured latent curve models (SLCMs) for continuous repeated measures data have been the subject of considerable recent research activity. In this article, we develop a first-order SLCM for repeated measures count data where the underlying change process is theorized to develop in distinct phases. Parameters of the multiphase or piecewise growth model, including changepoints, are allowed to vary across individuals. Exposure is allowed to vary across both individuals and time. We demonstrate our modeling approach on empirical expressive language data (grammatical morpheme counts) drawn from multiple distinct corpora available in the Child Language Data Exchange System (CHILDES), where the acquisition of grammatical morphology is understood to occur in distinct phases in typically developing children. A multiphase SLCM is fit to summarize individuals' data as well as the average developmental pattern. Change in time-varying dispersion (unexplained variability in morpheme counts) over the course of early childhood is modeled concurrently to provide additional insights into acquisition. Unique characteristics of count data create modeling, identification, estimation, and diagnostic challenges that are exacerbated by incorporating growth models with nonlinear random effects. These are discussed at length. We provide annotated software code for each of models used in the empirical example.

用于连续重复测量数据的结构化潜在曲线模型(SLCMs)已成为近期大量研究活动的主题。在本文中,我们为重复测量计数数据开发了一阶SLCM,其中潜在的变化过程被理论化为在不同的阶段中发展。多阶段或分段增长模型的参数(包括变更点)允许在个体之间变化。暴露量可以因个人和时间而异。我们在儿童语言数据交换系统(CHILDES)中从多个不同的语料库中提取的经验表达性语言数据(语法语素计数)上展示了我们的建模方法,在该系统中,语法形态学的习得被理解为发生在正常发育儿童的不同阶段。多相SLCM适合于总结个体数据以及平均发展模式。在儿童早期的过程中,随时间变化的分散(语素计数的不明变异)的变化同时建模,以提供对习得的额外见解。计数数据的独特特性带来了建模、识别、估计和诊断方面的挑战,而结合非线性随机效应的增长模型则加剧了这些挑战。详细讨论了这些问题。我们为经验示例中使用的每个模型提供了注释的软件代码。
{"title":"Multiphase Structured Latent Curve Models for Count Response Data: A Re-Analysis of the Acquisition of Morphology in English.","authors":"Marian M Strazzeri, Jeffrey R Harring, Nan Bernstein Ratner","doi":"10.1017/psy.2025.8","DOIUrl":"10.1017/psy.2025.8","url":null,"abstract":"<p><p>Structured latent curve models (SLCMs) for continuous repeated measures data have been the subject of considerable recent research activity. In this article, we develop a first-order SLCM for repeated measures count data where the underlying change process is theorized to develop in distinct phases. Parameters of the multiphase or piecewise growth model, including changepoints, are allowed to vary across individuals. Exposure is allowed to vary across both individuals and time. We demonstrate our modeling approach on empirical expressive language data (grammatical morpheme counts) drawn from multiple distinct corpora available in the Child Language Data Exchange System (CHILDES), where the acquisition of grammatical morphology is understood to occur in distinct phases in typically developing children. A multiphase SLCM is fit to summarize individuals' data as well as the average developmental pattern. Change in time-varying dispersion (unexplained variability in morpheme counts) over the course of early childhood is modeled concurrently to provide additional insights into acquisition. Unique characteristics of count data create modeling, identification, estimation, and diagnostic challenges that are exacerbated by incorporating growth models with nonlinear random effects. These are discussed at length. We provide annotated software code for each of models used in the empirical example.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-40"},"PeriodicalIF":3.1,"publicationDate":"2025-03-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483717/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144053925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis. 认知诊断中一般非参数分类方法的一致性理论。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-03-17 DOI: 10.1017/psy.2025.9
Chengyu Cui, Yanlong Liu, Gongjun Xu

Cognitive diagnosis models (CDMs) have been popularly used in fields such as education, psychology, and social sciences. While parametric likelihood estimation is a prevailing method for fitting CDMs, nonparametric methodologies are attracting increasing attention due to their ease of implementation and robustness, particularly when sample sizes are relatively small. However, existing consistency results of the nonparametric estimation methods often rely on certain restrictive conditions, which may not be easily satisfied in practice. In this article, the consistency theory for the general nonparametric classification method is reestablished under weaker and more practical conditions.

认知诊断模型(CDMs)已广泛应用于教育、心理学和社会科学等领域。虽然参数似然估计是拟合cdm的一种流行方法,但非参数方法由于易于实现和鲁棒性而越来越受到关注,特别是在样本量相对较小的情况下。然而,现有的非参数估计方法的一致性结果往往依赖于一定的限制条件,在实际应用中不容易得到满足。本文在较弱和较实际的条件下,重新建立了一般非参数分类方法的一致性理论。
{"title":"Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis.","authors":"Chengyu Cui, Yanlong Liu, Gongjun Xu","doi":"10.1017/psy.2025.9","DOIUrl":"10.1017/psy.2025.9","url":null,"abstract":"<p><p>Cognitive diagnosis models (CDMs) have been popularly used in fields such as education, psychology, and social sciences. While parametric likelihood estimation is a prevailing method for fitting CDMs, nonparametric methodologies are attracting increasing attention due to their ease of implementation and robustness, particularly when sample sizes are relatively small. However, existing consistency results of the nonparametric estimation methods often rely on certain restrictive conditions, which may not be easily satisfied in practice. In this article, the consistency theory for the general nonparametric classification method is reestablished under weaker and more practical conditions.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-17"},"PeriodicalIF":3.1,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483721/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143796697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stacking Models of Growth: A Methodology for Predicting the Pace of Progress to the Education Sustainable Development Targets Using International Large-Scale Assessments. 增长的叠加模型:利用国际大规模评估预测教育可持续发展目标进展速度的方法。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-02-13 DOI: 10.1017/psy.2025.2
David Kaplan, Kjorte Harra, Jonas Stampka, Nina Jude

To assess country-level progress toward these educational goals it is important to monitor trends in educational outcomes over time. The purpose of this article is to demonstrate how optimally predictive growth models can be constructed to monitor the pace of progress at which countries are moving toward (or way from) the education sustainable development goals as specified by the United Nations. A number of growth curve models can be specified to estimate the pace of progress, however, choosing one model and using it for predictive purposes assumes that the chosen model is the one that generated the data, and this choice runs the risk of "over-confident inferences and decisions that are more risky than one thinks they are" (Hoeting et al., 1999). To mitigate this problem, we adapt and apply Bayesian stacking to form mixtures of predictive distributions from an ensemble of individual models specified to predict country-level pace of progress. We demonstrate Bayesian stacking using country-level data from the Program on International Student Assessment. Our results show that Bayesian stacking yields better predictive accuracy than any single model as measured by the Kullback-Leibler divergence. Issues of Bayesian model identification and estimation for growth models are also discussed.

为了评估国家在实现这些教育目标方面取得的进展,重要的是监测一段时间以来教育成果的趋势。本文的目的是演示如何构建最优预测增长模型,以监测各国朝着(或远离)联合国指定的教育可持续发展目标前进的进度。可以指定许多增长曲线模型来估计进展的速度,然而,选择一个模型并将其用于预测目的,假设所选择的模型是生成数据的模型,这种选择有“过度自信的推断和决策比人们认为的风险更大”的风险(hoting等人,1999)。为了缓解这一问题,我们调整并应用贝叶斯叠加来形成预测分布的混合物,这些预测分布来自特定的单个模型集合,用于预测国家层面的进展速度。我们使用来自国际学生评估项目的国家级数据来演示贝叶斯堆叠。我们的研究结果表明,贝叶斯叠加比任何单一的Kullback-Leibler散度模型都具有更好的预测精度。本文还讨论了贝叶斯模型辨识和增长模型估计的问题。
{"title":"Stacking Models of Growth: A Methodology for Predicting the Pace of Progress to the Education Sustainable Development Targets Using International Large-Scale Assessments.","authors":"David Kaplan, Kjorte Harra, Jonas Stampka, Nina Jude","doi":"10.1017/psy.2025.2","DOIUrl":"10.1017/psy.2025.2","url":null,"abstract":"<p><p>To assess country-level progress toward these educational goals it is important to monitor trends in educational outcomes over time. The purpose of this article is to demonstrate how optimally predictive growth models can be constructed to monitor the pace of progress at which countries are moving toward (or way from) the education sustainable development goals as specified by the United Nations. A number of growth curve models can be specified to estimate the pace of progress, however, choosing one model and using it for predictive purposes assumes that the chosen model is the one that generated the data, and this choice runs the risk of \"over-confident inferences and decisions that are more risky than one thinks they are\" (Hoeting et al., 1999). To mitigate this problem, we adapt and apply <i>Bayesian stacking</i> to form mixtures of predictive distributions from an ensemble of individual models specified to predict country-level pace of progress. We demonstrate Bayesian stacking using country-level data from the Program on International Student Assessment. Our results show that Bayesian stacking yields better predictive accuracy than any single model as measured by the Kullback-Leibler divergence. Issues of Bayesian model identification and estimation for growth models are also discussed.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-29"},"PeriodicalIF":3.1,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483712/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144058550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Generalized Factor Rotation Framework with Customized Regularization. 具有自定义正则化的广义因子旋转框架。
IF 3.1 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-27 DOI: 10.1017/psy.2025.1
Yongfeng Wu, Xiangyi Liao, Qizhai Li

Factor rotation is a crucial step in interpreting the results of exploratory factor analysis. Several rotation methods have been developed for simple structure solutions, but their extensions to bi-factor analysis are often not well established. In this article, we propose a mathematical framework that incorporates customized factor structure as a regularization to produce the optimal orthogonal or oblique rotation. We demonstrate the utility of the framework using examples of simple structure rotation and bi-factor rotation. Through detailed simulations, we show that the new method is accurate and robust in recovering the factor structures and latent correlations when bi-factor analysis is applied. The new method is applied to a test data and a Quality of Life survey data. Results show that our method can reveal bi-factor structures that are consistent with the theories.

因子轮换是解释探索性因子分析结果的关键步骤。对于简单结构解,已经开发了几种旋转方法,但它们对双因素分析的扩展往往没有很好地建立。在本文中,我们提出了一个数学框架,其中包含自定义因子结构作为正则化,以产生最佳的正交或斜旋转。我们使用简单结构旋转和双因素旋转的例子来演示框架的效用。仿真结果表明,在双因子分析中,新方法在恢复因子结构和潜在相关性方面具有较好的准确性和鲁棒性。将该方法应用于一个测试数据和一个生活质量调查数据。结果表明,我们的方法可以揭示与理论一致的双因素结构。
{"title":"A Generalized Factor Rotation Framework with Customized Regularization.","authors":"Yongfeng Wu, Xiangyi Liao, Qizhai Li","doi":"10.1017/psy.2025.1","DOIUrl":"10.1017/psy.2025.1","url":null,"abstract":"<p><p>Factor rotation is a crucial step in interpreting the results of exploratory factor analysis. Several rotation methods have been developed for simple structure solutions, but their extensions to bi-factor analysis are often not well established. In this article, we propose a mathematical framework that incorporates customized factor structure as a regularization to produce the optimal orthogonal or oblique rotation. We demonstrate the utility of the framework using examples of simple structure rotation and bi-factor rotation. Through detailed simulations, we show that the new method is accurate and robust in recovering the factor structures and latent correlations when bi-factor analysis is applied. The new method is applied to a test data and a Quality of Life survey data. Results show that our method can reveal bi-factor structures that are consistent with the theories.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1-25"},"PeriodicalIF":3.1,"publicationDate":"2025-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12483704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144112840","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Large-Scale Educational Assessment with a "Divide-and-Conquer" Strategy: Fast and Efficient Distributed Bayesian Inference in IRT Models. 用 "分而治之 "策略优化大规模教育评估:快速高效的 IRT 模型分布式贝叶斯推理。
IF 2.9 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2024-05-30 DOI: 10.1007/s11336-024-09978-1
Sainan Xu, Jing Lu, Jiwei Zhang, Chun Wang, Gongjun Xu

With the growing attention on large-scale educational testing and assessment, the ability to process substantial volumes of response data becomes crucial. Current estimation methods within item response theory (IRT), despite their high precision, often pose considerable computational burdens with large-scale data, leading to reduced computational speed. This study introduces a novel "divide- and-conquer" parallel algorithm built on the Wasserstein posterior approximation concept, aiming to enhance computational speed while maintaining accurate parameter estimation. This algorithm enables drawing parameters from segmented data subsets in parallel, followed by an amalgamation of these parameters via Wasserstein posterior approximation. Theoretical support for the algorithm is established through asymptotic optimality under certain regularity assumptions. Practical validation is demonstrated using real-world data from the Programme for International Student Assessment. Ultimately, this research proposes a transformative approach to managing educational big data, offering a scalable, efficient, and precise alternative that promises to redefine traditional practices in educational assessments.

随着大规模教育测试和评估日益受到关注,处理大量反应数据的能力变得至关重要。目前项目反应理论(IRT)中的估计方法尽管精度很高,但在处理大规模数据时往往会带来相当大的计算负担,导致计算速度下降。本研究介绍了一种基于 Wasserstein 后验近似概念的新型 "分而治之 "并行算法,旨在提高计算速度的同时保持准确的参数估计。该算法可以并行地从分段数据子集中提取参数,然后通过瓦瑟斯坦后验近似合并这些参数。在一定的规则性假设下,通过渐近最优性为该算法提供了理论支持。利用国际学生评估项目的真实数据进行了实际验证。最终,这项研究提出了一种管理教育大数据的变革方法,提供了一种可扩展、高效和精确的替代方案,有望重新定义教育评估的传统做法。
{"title":"Optimizing Large-Scale Educational Assessment with a \"Divide-and-Conquer\" Strategy: Fast and Efficient Distributed Bayesian Inference in IRT Models.","authors":"Sainan Xu, Jing Lu, Jiwei Zhang, Chun Wang, Gongjun Xu","doi":"10.1007/s11336-024-09978-1","DOIUrl":"10.1007/s11336-024-09978-1","url":null,"abstract":"<p><p>With the growing attention on large-scale educational testing and assessment, the ability to process substantial volumes of response data becomes crucial. Current estimation methods within item response theory (IRT), despite their high precision, often pose considerable computational burdens with large-scale data, leading to reduced computational speed. This study introduces a novel \"divide- and-conquer\" parallel algorithm built on the Wasserstein posterior approximation concept, aiming to enhance computational speed while maintaining accurate parameter estimation. This algorithm enables drawing parameters from segmented data subsets in parallel, followed by an amalgamation of these parameters via Wasserstein posterior approximation. Theoretical support for the algorithm is established through asymptotic optimality under certain regularity assumptions. Practical validation is demonstrated using real-world data from the Programme for International Student Assessment. Ultimately, this research proposes a transformative approach to managing educational big data, offering a scalable, efficient, and precise alternative that promises to redefine traditional practices in educational assessments.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1119-1147"},"PeriodicalIF":2.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141176735","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Are Sum Scores a Great Accomplishment of Psychometrics or Intuitive Test Theory? 总分是心理测量学还是直觉测验理论的伟大成就?
IF 2.9 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2024-10-22 DOI: 10.1007/s11336-024-10003-8
Robert J Mislevy

Sijtsma, Ellis, and Borsboom (Psychometrika, 89:84-117, 2024. https://doi.org/10.1007/s11336-024-09964-7 ) provide a thoughtful treatment in Psychometrika of the value and properties of sum scores and classical test theory at a depth at which few practicing psychometricians are familiar. In this note, I offer comments on their article from the perspective of evidentiary reasoning.

Sijtsma、Ellis 和 Borsboom (Psychometrika, 89:84-117, 2024. https://doi.org/10.1007/s11336-024-09964-7 ) 在《心理测量学》上对总分的价值和属性以及经典测验理论进行了深入的探讨,很少有实践心理测量学家会对这些内容感到熟悉。在本说明中,我将从证据推理的角度对他们的文章发表评论。
{"title":"Are Sum Scores a Great Accomplishment of Psychometrics or Intuitive Test Theory?","authors":"Robert J Mislevy","doi":"10.1007/s11336-024-10003-8","DOIUrl":"10.1007/s11336-024-10003-8","url":null,"abstract":"<p><p>Sijtsma, Ellis, and Borsboom (Psychometrika, 89:84-117, 2024. https://doi.org/10.1007/s11336-024-09964-7 ) provide a thoughtful treatment in Psychometrika of the value and properties of sum scores and classical test theory at a depth at which few practicing psychometricians are familiar. In this note, I offer comments on their article from the perspective of evidentiary reasoning.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1170-1174"},"PeriodicalIF":2.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142481089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
New Paradigm of Identifiable General-response Cognitive Diagnostic Models: Beyond Categorical Data. 可识别的一般反应认知诊断模型新范例:超越分类数据
IF 2.9 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2024-07-05 DOI: 10.1007/s11336-024-09983-4
Seunghyun Lee, Yuqi Gu

Cognitive diagnostic models (CDMs) are a popular family of discrete latent variable models that model students' mastery or deficiency of multiple fine-grained skills. CDMs have been most widely used to model categorical item response data such as binary or polytomous responses. With advances in technology and the emergence of varying test formats in modern educational assessments, new response types, including continuous responses such as response times, and count-valued responses from tests with repetitive tasks or eye-tracking sensors, have also become available. Variants of CDMs have been proposed recently for modeling such responses. However, whether these extended CDMs are identifiable and estimable is entirely unknown. We propose a very general cognitive diagnostic modeling framework for arbitrary types of multivariate responses with minimal assumptions, and establish identifiability in this general setting. Surprisingly, we prove that our general-response CDMs are identifiable under Q -matrix-based conditions similar to those for traditional categorical-response CDMs. Our conclusions set up a new paradigm of identifiable general-response CDMs. We propose an EM algorithm to efficiently estimate a broad class of exponential family-based general-response CDMs. We conduct simulation studies under various response types. The simulation results not only corroborate our identifiability theory, but also demonstrate the superior empirical performance of our estimation algorithms. We illustrate our methodology by applying it to a TIMSS 2019 response time dataset.

认知诊断模型(CDM)是一种流行的离散潜变量模型,用于模拟学生掌握或缺乏多种精细技能的情况。认知诊断模型最广泛地应用于对二元或多态响应等分类项目响应数据建模。随着技术的进步和现代教育评估中不同测试形式的出现,新的反应类型也已出现,包括连续反应(如反应时间)和来自重复任务或眼动传感器测试的计数值反应。最近有人提出了 CDM 的变体,用于对这些反应建模。然而,这些扩展的 CDM 是否可以识别和估算还完全未知。我们为任意类型的多变量反应提出了一个非常通用的认知诊断建模框架,假设条件极少,并在这一通用环境中建立了可识别性。令人惊讶的是,我们证明了我们的一般反应 CDM 在基于 Q 矩阵的条件下是可识别的,这与传统分类反应 CDM 的条件相似。我们的结论为可识别的一般响应 CDM 树立了一个新范例。我们提出了一种 EM 算法,用于有效估计一大类基于指数族的一般响应 CDM。我们对各种反应类型进行了模拟研究。模拟结果不仅证实了我们的可识别性理论,还证明了我们的估计算法具有卓越的经验性能。我们将我们的方法应用于 TIMSS 2019 反应时间数据集,以说明我们的方法。
{"title":"New Paradigm of Identifiable General-response Cognitive Diagnostic Models: Beyond Categorical Data.","authors":"Seunghyun Lee, Yuqi Gu","doi":"10.1007/s11336-024-09983-4","DOIUrl":"10.1007/s11336-024-09983-4","url":null,"abstract":"<p><p>Cognitive diagnostic models (CDMs) are a popular family of discrete latent variable models that model students' mastery or deficiency of multiple fine-grained skills. CDMs have been most widely used to model categorical item response data such as binary or polytomous responses. With advances in technology and the emergence of varying test formats in modern educational assessments, new response types, including continuous responses such as response times, and count-valued responses from tests with repetitive tasks or eye-tracking sensors, have also become available. Variants of CDMs have been proposed recently for modeling such responses. However, whether these extended CDMs are identifiable and estimable is entirely unknown. We propose a very general cognitive diagnostic modeling framework for arbitrary types of multivariate responses with minimal assumptions, and establish identifiability in this general setting. Surprisingly, we prove that our general-response CDMs are identifiable under <math><mi>Q</mi></math> -matrix-based conditions similar to those for traditional categorical-response CDMs. Our conclusions set up a new paradigm of identifiable general-response CDMs. We propose an EM algorithm to efficiently estimate a broad class of exponential family-based general-response CDMs. We conduct simulation studies under various response types. The simulation results not only corroborate our identifiability theory, but also demonstrate the superior empirical performance of our estimation algorithms. We illustrate our methodology by applying it to a TIMSS 2019 response time dataset.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1304-1336"},"PeriodicalIF":2.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141535981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Ordinal Outcome State-Space Models for Intensive Longitudinal Data. 用于密集纵向数据的序数结果状态空间模型。
IF 2.9 2区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-01 Epub Date: 2024-06-11 DOI: 10.1007/s11336-024-09984-3
Teague R Henry, Lindley R Slipetz, Ami Falk, Jiaxing Qiu, Meng Chen

Intensive longitudinal (IL) data are increasingly prevalent in psychological science, coinciding with technological advancements that make it simple to deploy study designs such as daily diary and ecological momentary assessments. IL data are characterized by a rapid rate of data collection (1+ collections per day), over a period of time, allowing for the capture of the dynamics that underlie psychological and behavioral processes. One powerful framework for analyzing IL data is state-space modeling, where observed variables are considered measurements for underlying states (i.e., latent variables) that change together over time. However, state-space modeling has typically relied on continuous measurements, whereas psychological data often come in the form of ordinal measurements such as Likert scale items. In this manuscript, we develop a general estimation approach for state-space models with ordinal measurements, specifically focusing on a graded response model for Likert scale items. We evaluate the performance of our model and estimator against that of the commonly used "linear approximation" model, which treats ordinal measurements as though they are continuous. We find that our model resulted in unbiased estimates of the state dynamics, while the linear approximation resulted in strongly biased estimates of the state dynamics. Finally, we develop an approximate standard error, termed slice standard errors and show that these approximate standard errors are more liberal than true standard errors (i.e., smaller) at a consistent bias.

密集纵向(IL)数据在心理科学中日益盛行,与此同时,技术的进步使日常日记和生态瞬间评估等研究设计的部署变得简单。纵向数据的特点是在一段时间内快速收集数据(每天收集 1 次以上),从而捕捉到心理和行为过程的动态变化。状态空间建模是分析 IL 数据的一个强大框架,其中观察变量被视为随时间变化的潜在状态(即潜在变量)的测量值。然而,状态空间建模通常依赖于连续测量,而心理数据通常采用李克特量表项目等序数测量形式。在本手稿中,我们为具有顺序测量的状态空间模型开发了一种通用估算方法,尤其侧重于李克特量表项目的分级反应模型。我们评估了我们的模型和估计方法与常用的 "线性近似 "模型的性能,后者将序数测量视为连续测量。我们发现,我们的模型对状态动态的估计没有偏差,而线性近似模型对状态动态的估计偏差很大。最后,我们提出了一种近似标准误差,称为切片标准误差,并证明在偏差一致的情况下,这些近似标准误差比真实标准误差更宽松(即更小)。
{"title":"Ordinal Outcome State-Space Models for Intensive Longitudinal Data.","authors":"Teague R Henry, Lindley R Slipetz, Ami Falk, Jiaxing Qiu, Meng Chen","doi":"10.1007/s11336-024-09984-3","DOIUrl":"10.1007/s11336-024-09984-3","url":null,"abstract":"<p><p>Intensive longitudinal (IL) data are increasingly prevalent in psychological science, coinciding with technological advancements that make it simple to deploy study designs such as daily diary and ecological momentary assessments. IL data are characterized by a rapid rate of data collection (1+ collections per day), over a period of time, allowing for the capture of the dynamics that underlie psychological and behavioral processes. One powerful framework for analyzing IL data is state-space modeling, where observed variables are considered measurements for underlying states (i.e., latent variables) that change together over time. However, state-space modeling has typically relied on continuous measurements, whereas psychological data often come in the form of ordinal measurements such as Likert scale items. In this manuscript, we develop a general estimation approach for state-space models with ordinal measurements, specifically focusing on a graded response model for Likert scale items. We evaluate the performance of our model and estimator against that of the commonly used \"linear approximation\" model, which treats ordinal measurements as though they are continuous. We find that our model resulted in unbiased estimates of the state dynamics, while the linear approximation resulted in strongly biased estimates of the state dynamics. Finally, we develop an approximate standard error, termed slice standard errors and show that these approximate standard errors are more liberal than true standard errors (i.e., smaller) at a consistent bias.</p>","PeriodicalId":54534,"journal":{"name":"Psychometrika","volume":" ","pages":"1203-1229"},"PeriodicalIF":2.9,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11582181/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141302095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Psychometrika
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1