首页 > 最新文献

British Journal of Mathematical & Statistical Psychology最新文献

英文 中文
Optimizing calibration designs with uncertainty in abilities 在能力不确定的情况下优化校准设计。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-03-10 DOI: 10.1111/bmsp.12387
Jonas Bjermo, Ellinor Fackle-Fornius, Frank Miller

Before items can be implemented in a test, the item characteristics need to be calibrated through pretesting. To achieve high-quality tests, it's crucial to maximize the precision of estimates obtained during item calibration. Higher precision can be attained if calibration items are allocated to examinees based on their individual abilities. Methods from optimal experimental design can be used to derive an optimal ability-matched calibration design. However, such an optimal design assumes known abilities of the examinees. In practice, the abilities are unknown and estimated based on a limited number of operational items. We develop the theory for handling the uncertainty in abilities in a proper way and show how the optimal calibration design can be derived when taking account of this uncertainty. We demonstrate that the derived designs are more robust when the uncertainty in abilities is acknowledged. Additionally, the method has been implemented in the R-package optical.

在测试中实施项目之前,项目特征需要通过预测试进行校准。为了实现高质量的测试,在项目校准期间获得的估计的精度最大化是至关重要的。如果根据考生的个人能力分配校正项目,可获得更高的精度。最优实验设计方法可用于导出最优能力匹配校准设计。然而,这种最优设计假定考生的能力是已知的。在实践中,这些能力是未知的,是基于有限数量的操作项目来估计的。我们发展了以适当的方式处理能力不确定性的理论,并展示了如何在考虑这种不确定性的情况下推导出最优的校准设计。我们证明,当能力的不确定性被承认时,推导出的设计具有更强的鲁棒性。此外,该方法已在r封装光学器件中实现。
{"title":"Optimizing calibration designs with uncertainty in abilities","authors":"Jonas Bjermo,&nbsp;Ellinor Fackle-Fornius,&nbsp;Frank Miller","doi":"10.1111/bmsp.12387","DOIUrl":"10.1111/bmsp.12387","url":null,"abstract":"<p>Before items can be implemented in a test, the item characteristics need to be calibrated through pretesting. To achieve high-quality tests, it's crucial to maximize the precision of estimates obtained during item calibration. Higher precision can be attained if calibration items are allocated to examinees based on their individual abilities. Methods from optimal experimental design can be used to derive an optimal ability-matched calibration design. However, such an optimal design assumes known abilities of the examinees. In practice, the abilities are unknown and estimated based on a limited number of operational items. We develop the theory for handling the uncertainty in abilities in a proper way and show how the optimal calibration design can be derived when taking account of this uncertainty. We demonstrate that the derived designs are more robust when the uncertainty in abilities is acknowledged. Additionally, the method has been implemented in the R-package <span>optical</span>.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"889-910"},"PeriodicalIF":1.8,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12387","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143598524","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integer programming in psychology: A review and directions for future research 心理学中的整数规划:综述与未来研究方向。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-02-26 DOI: 10.1111/bmsp.12386
Michael Brusco, Douglas Steinley, Ashley L. Watts

Integer programming (IP) is an extension of linear programming (LP) whereby the goal is to determine values for a set of decision variables (some or all of which have integer restrictions) so as to maximize or minimize a linear objective function of the variables subject to a set of linear constraints involving the variables. Although the psychological literature is replete with applications of multivariate statistics, implementations of mathematical modelling methods such as IP are comparatively far fewer. Nevertheless, over the decades, there have been a variety of important applications and the vast majority of these fall within the IP rather than the LP category. In this paper, we offer a brief overview of the history of IP methodology. We subsequently review some domains where IP has been gainfully applied in psychology, such as test assembly, cluster analysis and classification and seriation and unidimensional scaling. An illustrative example of using IP to cluster respondents measured on items pertaining to substance abuse disorder is provided. Finally, we identify areas where IP might be applied in emerging areas of psychology, such as in the domain of network psychometrics.

整数规划(IP)是线性规划(LP)的扩展,其目标是确定一组决策变量(其中一些或全部具有整数限制)的值,以便在涉及变量的一组线性约束下最大化或最小化变量的线性目标函数。尽管心理学文献中充满了多元统计的应用,但数学建模方法(如IP)的实现相对要少得多。然而,在过去的几十年里,出现了各种各样的重要应用,其中绝大多数属于IP而不是LP类别。在本文中,我们简要概述了知识产权方法论的历史。我们随后回顾了知识产权在心理学中的一些有益应用领域,如测试装配、聚类分析、分类、序列化和一维标度。提供了一个使用知识产权对有关药物滥用障碍的项目进行测量的受访者进行聚类的说明性示例。最后,我们确定了IP可能应用于心理学新兴领域的领域,例如网络心理测量学领域。
{"title":"Integer programming in psychology: A review and directions for future research","authors":"Michael Brusco,&nbsp;Douglas Steinley,&nbsp;Ashley L. Watts","doi":"10.1111/bmsp.12386","DOIUrl":"10.1111/bmsp.12386","url":null,"abstract":"<p>Integer programming (IP) is an extension of linear programming (LP) whereby the goal is to determine values for a set of decision variables (some or all of which have integer restrictions) so as to maximize or minimize a linear objective function of the variables subject to a set of linear constraints involving the variables. Although the psychological literature is replete with applications of multivariate statistics, implementations of mathematical modelling methods such as IP are comparatively far fewer. Nevertheless, over the decades, there have been a variety of important applications and the vast majority of these fall within the IP rather than the LP category. In this paper, we offer a brief overview of the history of IP methodology. We subsequently review some domains where IP has been gainfully applied in psychology, such as test assembly, cluster analysis and classification and seriation and unidimensional scaling. An illustrative example of using IP to cluster respondents measured on items pertaining to substance abuse disorder is provided. Finally, we identify areas where IP might be applied in emerging areas of psychology, such as in the domain of network psychometrics.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"79 1","pages":"1-30"},"PeriodicalIF":1.8,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143517464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A general dynamic learning model framework for cognitive diagnosis 认知诊断的通用动态学习模型框架。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-02-22 DOI: 10.1111/bmsp.12384
Zichu Liu, Shiyu Wang, Houping Xiao, Shumei Zhang, Tao Qiu

Understanding students' learning trajectories is crucial for educators to effectively monitor and enhance progress. With the rise of computer-based testing, researchers now have access to rich datasets that provide deeper insights into student performance. This study introduces a general dynamic learning model framework that integrates response accuracy and response times to capture different test-taking behaviors and estimate learning trajectories related to polytomous attributes over time. A Bayesian estimation method is proposed to estimate model parameters. Rigorous validation through simulation studies confirms the effectiveness of the MCMC algorithm in parameter recovery and highlights the model's utility in understanding learning trajectories and detecting different test-taking behaviors in a learning environment. Applied to real data, the model demonstrates practical value in educational settings. Overall, this comprehensive and validated model offers educators and researchers nuanced insights into student learning progress and behavioral dynamics.

了解学生的学习轨迹对教育者有效地监控和促进进步至关重要。随着计算机测试的兴起,研究人员现在可以访问丰富的数据集,从而更深入地了解学生的表现。本研究引入了一个通用的动态学习模型框架,该模型集成了响应精度和响应时间,以捕获不同的应试行为,并估计与多同构属性相关的学习轨迹。提出了一种贝叶斯估计方法来估计模型参数。通过仿真研究的严格验证证实了MCMC算法在参数恢复方面的有效性,并强调了该模型在理解学习轨迹和检测学习环境中不同的考试行为方面的实用性。通过对实际数据的分析,证明了该模型在教育领域的实用价值。总的来说,这个全面而有效的模型为教育工作者和研究人员提供了对学生学习进展和行为动态的细致入微的见解。
{"title":"A general dynamic learning model framework for cognitive diagnosis","authors":"Zichu Liu,&nbsp;Shiyu Wang,&nbsp;Houping Xiao,&nbsp;Shumei Zhang,&nbsp;Tao Qiu","doi":"10.1111/bmsp.12384","DOIUrl":"10.1111/bmsp.12384","url":null,"abstract":"<p>Understanding students' learning trajectories is crucial for educators to effectively monitor and enhance progress. With the rise of computer-based testing, researchers now have access to rich datasets that provide deeper insights into student performance. This study introduces a general dynamic learning model framework that integrates response accuracy and response times to capture different test-taking behaviors and estimate learning trajectories related to polytomous attributes over time. A Bayesian estimation method is proposed to estimate model parameters. Rigorous validation through simulation studies confirms the effectiveness of the MCMC algorithm in parameter recovery and highlights the model's utility in understanding learning trajectories and detecting different test-taking behaviors in a learning environment. Applied to real data, the model demonstrates practical value in educational settings. Overall, this comprehensive and validated model offers educators and researchers nuanced insights into student learning progress and behavioral dynamics.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"856-888"},"PeriodicalIF":1.8,"publicationDate":"2025-02-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12384","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143477198","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to “A new Q-matrix validation method based on signal detection theory” 修正“一种新的基于信号检测理论的q矩阵验证方法”。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-02-21 DOI: 10.1111/bmsp.12385
<p>Li, J., & Chen, P. (2024). A new Q-matrix validation method based on signal detection theory. <i>British Journal of Mathematical and Statistical Psychology</i>, 00, 1–33. https://doi.org/10.1111/bmsp.12371</p><p>In the third paragraph of “Search algorithm equipped with AIC” section, the text “… and the Stepwise method has a time complexity of <span></span><math> <mi>O</mi> <mfenced> <mrow> <mi>K</mi> <mo>·</mo> <mfenced> <mrow> <mi>K</mi> <mo>−</mo> <mn>1</mn> </mrow> </mfenced> </mrow> </mfenced></math>. It is clear that <span></span><math> <mi>O</mi> <mfenced> <mrow> <msup> <mn>2</mn> <mrow> <mi>K</mi> <mo>−</mo> <mn>2</mn> </mrow> </msup> <mo>+</mo> <mn>1</mn> <mo>+</mo> <mi>K</mi> </mrow> </mfenced> <mo><</mo> <mi>O</mi> <mfenced> <mrow> <mi>K</mi> <mo>·</mo> <mfenced> <mrow> <mi>K</mi> <mo>−</mo> <mn>1</mn> </mrow> </mfenced> </mrow> </mfenced> <mo><</mo> <mi>O</mi> <mfenced> <mrow> <msup> <mn>2</mn> <mi>K</mi> </msup> <mo>−</mo> <mn>1</mn> </mrow> </mfenced></math> when <span></span><math> <mi>K</mi> <mo>></mo> <mn>3</mn></math>. Therefore, the new search algorithm is not only effective but also efficient” was incorrect. This should have read “… and the time complexity of the Stepwise method is between <span></span><math> <mi>O</mi> <mfenced> <mrow> <mi>K</mi> <mo>·</mo> <mfenced> <mrow> <mi>K</mi> <mo>−</mo> <mn>1</mn> </mrow> </mfenced> <mo>+</mo> <mi>K</mi> </mrow> </mfenced></mat
李,J, &;陈P.(2024)。基于信号检测理论的q矩阵验证新方法。心理科学学报,2009,33 - 33。https://doi.org/10.1111/bmsp.12371In“配备AIC的搜索算法”一节的第三段,文本“…”和Stepwise方法的时间复杂度为O K·K−1。很明显,o2k−2 + 1 + K <;O K·K−1 <;当K >;3. 因此,新的搜索算法“既有效又高效”的说法是不正确的。这应该是“…”,逐步方法的时间复杂度介于O K·K−1 + K和O之间当K≥3时,K·2 K−1。很明显,o2k−2 + 1 + K <;当K≥3时o2k−1,o2k−2+ 1 + K <;当3≤K≤7时,O K·K−1 + K(注意O2 K−2 + 1 + K <;当K≥3时,K·2 K−1)。因此,新的搜索算法不仅有效,但在某些情况下也很有效。”我们为这个错误道歉。
{"title":"Correction to “A new Q-matrix validation method based on signal detection theory”","authors":"","doi":"10.1111/bmsp.12385","DOIUrl":"10.1111/bmsp.12385","url":null,"abstract":"&lt;p&gt;Li, J., &amp; Chen, P. (2024). A new Q-matrix validation method based on signal detection theory. &lt;i&gt;British Journal of Mathematical and Statistical Psychology&lt;/i&gt;, 00, 1–33. https://doi.org/10.1111/bmsp.12371&lt;/p&gt;&lt;p&gt;In the third paragraph of “Search algorithm equipped with AIC” section, the text “… and the Stepwise method has a time complexity of &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;O&lt;/mi&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;·&lt;/mo&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;−&lt;/mo&gt;\u0000 &lt;mn&gt;1&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;&lt;/math&gt;. It is clear that &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;O&lt;/mi&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msup&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;−&lt;/mo&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/msup&gt;\u0000 &lt;mo&gt;+&lt;/mo&gt;\u0000 &lt;mn&gt;1&lt;/mn&gt;\u0000 &lt;mo&gt;+&lt;/mo&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;\u0000 &lt;mo&gt;&lt;&lt;/mo&gt;\u0000 &lt;mi&gt;O&lt;/mi&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;·&lt;/mo&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;−&lt;/mo&gt;\u0000 &lt;mn&gt;1&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;\u0000 &lt;mo&gt;&lt;&lt;/mo&gt;\u0000 &lt;mi&gt;O&lt;/mi&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msup&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;/msup&gt;\u0000 &lt;mo&gt;−&lt;/mo&gt;\u0000 &lt;mn&gt;1&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;&lt;/math&gt; when &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;&gt;&lt;/mo&gt;\u0000 &lt;mn&gt;3&lt;/mn&gt;&lt;/math&gt;. Therefore, the new search algorithm is not only effective but also efficient” was incorrect. This should have read “… and the time complexity of the Stepwise method is between &lt;span&gt;&lt;/span&gt;&lt;math&gt;\u0000 &lt;mi&gt;O&lt;/mi&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;·&lt;/mo&gt;\u0000 &lt;mfenced&gt;\u0000 &lt;mrow&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;mo&gt;−&lt;/mo&gt;\u0000 &lt;mn&gt;1&lt;/mn&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;\u0000 &lt;mo&gt;+&lt;/mo&gt;\u0000 &lt;mi&gt;K&lt;/mi&gt;\u0000 &lt;/mrow&gt;\u0000 &lt;/mfenced&gt;&lt;/mat","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 2","pages":""},"PeriodicalIF":1.8,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12385","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143470013","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Decomposition of WAIC for assessing the information gain with application to educational testing 信息增益评估的WAIC分解及其在教育测试中的应用。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-02-21 DOI: 10.1111/bmsp.12383
Fang Liu, Ming-Hui Chen, Xiaojing Wang, Roeland Hancock

Nowadays, multidimensional data are often available from educational testing. One natural issue is to identify whether more dimensional data are useful in fitting the item response data. To address this important issue, we develop a new decomposition of Widely Applicable Information Criterion (WAIC) via the posterior predictive ordinate (PPO) under the joint model for the response, response time and two additional educational testing scores. Based on this decomposition, a new model assessment criterion is then proposed, which allows us to determine which of the response time and two additional scores are most useful in fitting the response data and whether other dimensional data are further needed given that one of these dimensional data is already included in the joint model with the response data. In addition, an efficient Monte Carlo method is developed to compute PPO. An extensive simulation study is conducted to examine the empirical performance of the proposed joint model and the model assessment criterion in the psychological setting. The proposed methodology is further applied to an analysis of a real dataset from a computerized educational assessment program.

如今,多维数据通常可以从教育测试中获得。一个自然的问题是确定多维数据在拟合项目响应数据时是否有用。为了解决这一重要问题,我们在反应、反应时间和两个额外的教育测试分数的联合模型下,通过后验预测坐标(PPO)开发了一种新的广泛适用信息标准(WAIC)分解方法。在此基础上,提出了一个新的模型评估标准,该标准允许我们确定响应时间和两个附加分数中哪一个在拟合响应数据时最有用,以及如果这些维度数据中的一个已经包含在与响应数据的联合模型中,是否还需要其他维度数据。此外,还提出了一种有效的蒙特卡罗方法来计算PPO。通过广泛的模拟研究来检验所提出的联合模型和模型评估标准在心理环境中的经验性能。提出的方法进一步应用于计算机化教育评估程序的真实数据集的分析。
{"title":"Decomposition of WAIC for assessing the information gain with application to educational testing","authors":"Fang Liu,&nbsp;Ming-Hui Chen,&nbsp;Xiaojing Wang,&nbsp;Roeland Hancock","doi":"10.1111/bmsp.12383","DOIUrl":"10.1111/bmsp.12383","url":null,"abstract":"<p>Nowadays, multidimensional data are often available from educational testing. One natural issue is to identify whether more dimensional data are useful in fitting the item response data. To address this important issue, we develop a new decomposition of Widely Applicable Information Criterion (WAIC) via the posterior predictive ordinate (PPO) under the joint model for the response, response time and two additional educational testing scores. Based on this decomposition, a new model assessment criterion is then proposed, which allows us to determine which of the response time and two additional scores are most useful in fitting the response data and whether other dimensional data are further needed given that one of these dimensional data is already included in the joint model with the response data. In addition, an efficient Monte Carlo method is developed to compute PPO. An extensive simulation study is conducted to examine the empirical performance of the proposed joint model and the model assessment criterion in the psychological setting. The proposed methodology is further applied to an analysis of a real dataset from a computerized educational assessment program.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"830-855"},"PeriodicalIF":1.8,"publicationDate":"2025-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143470029","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Jointly modeling responses and omitted items by a competing risk model: A survival analysis approach 通过竞争风险模型联合建模响应和遗漏项目:一种生存分析方法。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-30 DOI: 10.1111/bmsp.12382
Jinxin Guo, Xin Xu, Guanhua Fang, Zhiliang Ying, Susu Zhang

Item response theory models are commonly adopted in educational assessment and psychological measurement. Such models need to be modified to accommodate practical situations when statistical sampling assumptions are violated. Omission is a common phenomenon in educational testing. In modern computer-based testing, we have not only examinees' responses but also their response times. This paper utilizes response time and develops a joint model of responses and response times. The new approach is analogous to those developed in survival analysis for dealing with right-censored data. In particular, a key ingredient is the introduction of the omission time (OT), which corresponds to the censoring time in survival analysis. By competing risk formulation, the proposed method provides an alternative narrative to how an item becomes answered versus omitted, depending on the competing relationship of response time and OT, so that the likelihood function can be constructed properly. The maximum likelihood estimator can be computed via the expectation-maximization algorithm. Simulation studies were conducted to evaluate the performance of the proposed method and its robustness against various mis-specifications. The method was applied to a dataset from the PISA 2015 Science Test.

项目反应理论模型是教育评价和心理测量中常用的模型。当统计抽样假设被违反时,这些模型需要被修改以适应实际情况。遗漏是教育测试中普遍存在的现象。在现代的计算机测试中,我们不仅有考生的反应,还有他们的反应时间。本文利用响应时间,建立了响应和响应时间的联合模型。这种新方法类似于在生存分析中开发的用于处理右审查数据的方法。其中一个关键因素是省略时间(OT)的引入,它对应于生存分析中的审查时间。通过竞争风险公式,该方法根据响应时间和OT的竞争关系,提供了一个项目如何被回答或被省略的替代叙述,从而可以正确构建似然函数。最大似然估计量可以通过期望最大化算法来计算。通过仿真研究来评估该方法的性能及其对各种错误规范的鲁棒性。该方法应用于2015年PISA科学测试的数据集。
{"title":"Jointly modeling responses and omitted items by a competing risk model: A survival analysis approach","authors":"Jinxin Guo,&nbsp;Xin Xu,&nbsp;Guanhua Fang,&nbsp;Zhiliang Ying,&nbsp;Susu Zhang","doi":"10.1111/bmsp.12382","DOIUrl":"10.1111/bmsp.12382","url":null,"abstract":"<p>Item response theory models are commonly adopted in educational assessment and psychological measurement. Such models need to be modified to accommodate practical situations when statistical sampling assumptions are violated. Omission is a common phenomenon in educational testing. In modern computer-based testing, we have not only examinees' responses but also their response times. This paper utilizes response time and develops a joint model of responses and response times. The new approach is analogous to those developed in survival analysis for dealing with right-censored data. In particular, a key ingredient is the introduction of the omission time (OT), which corresponds to the censoring time in survival analysis. By competing risk formulation, the proposed method provides an alternative narrative to how an item becomes answered versus omitted, depending on the competing relationship of response time and OT, so that the likelihood function can be constructed properly. The maximum likelihood estimator can be computed via the expectation-maximization algorithm. Simulation studies were conducted to evaluate the performance of the proposed method and its robustness against various mis-specifications. The method was applied to a dataset from the PISA 2015 Science Test.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"804-829"},"PeriodicalIF":1.8,"publicationDate":"2025-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12382","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143069619","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient and accurate variational inference for multilevel threshold autoregressive models in intensive longitudinal data 纵向数据中多水平阈值自回归模型的高效准确变分推理。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-21 DOI: 10.1111/bmsp.12381
Azizur Rahman, Depeng Jiang, Lisa M. Lix

Recent technological advancements have enabled the collection of intensive longitudinal data (ILD), consisting of repeated measurements from the same individual. The threshold autoregressive (TAR) model is often used to capture the dynamic outcome process in ILD, with autoregressive parameters varying based on outcome variable levels. For ILD from multiple individuals, multilevel TAR (ML-TAR) models have been proposed, with Bayesian approaches typically used for parameter estimation. However, fitting ML-TAR models can be computationally challenging. This study introduces a mean-field variational Bayes (MFVB) algorithm as an alternative to traditional Bayesian inference. By optimizing to approximate posterior densities, variational Bayes aims to find the best approximation within a defined set of distributions. Simulation results demonstrate that our MFVB algorithm is significantly faster than the standard Markov chain Monte Carlo (MCMC) approach. Moreover, increasing the number of individuals or time points enhances the accuracy of the parameter estimates using MFVB, suggesting that sufficient data are crucial for accurate estimation in complex models like ML-TAR models. When applied to real-world data, the MFVB algorithm was significantly more efficient than MCMC and maintained similar accuracy. Thus, the MFVB algorithm is a faster and more consistent alternative to MCMC for large-scale inference in ILD models.

最近的技术进步已经能够收集密集的纵向数据(ILD),包括来自同一个体的重复测量。阈值自回归(TAR)模型通常用于捕获ILD的动态结果过程,其自回归参数根据结果变量水平而变化。对于来自多个个体的ILD,已经提出了多层TAR (ML-TAR)模型,通常使用贝叶斯方法进行参数估计。然而,拟合ML-TAR模型在计算上具有挑战性。本研究引入一种平均场变分贝叶斯(MFVB)算法,作为传统贝叶斯推理的替代方法。通过优化近似后验密度,变分贝叶斯旨在在一组定义的分布中找到最佳近似值。仿真结果表明,我们的MFVB算法比标准的马尔可夫链蒙特卡罗(MCMC)方法要快得多。此外,增加个体或时间点的数量可以提高MFVB参数估计的准确性,这表明在ML-TAR模型等复杂模型中,足够的数据对于准确估计至关重要。当应用于实际数据时,MFVB算法的效率明显高于MCMC,并保持相似的精度。因此,对于ILD模型中的大规模推理,MFVB算法比MCMC更快、更一致。
{"title":"Efficient and accurate variational inference for multilevel threshold autoregressive models in intensive longitudinal data","authors":"Azizur Rahman,&nbsp;Depeng Jiang,&nbsp;Lisa M. Lix","doi":"10.1111/bmsp.12381","DOIUrl":"10.1111/bmsp.12381","url":null,"abstract":"<p>Recent technological advancements have enabled the collection of intensive longitudinal data (ILD), consisting of repeated measurements from the same individual. The threshold autoregressive (TAR) model is often used to capture the dynamic outcome process in ILD, with autoregressive parameters varying based on outcome variable levels. For ILD from multiple individuals, multilevel TAR (ML-TAR) models have been proposed, with Bayesian approaches typically used for parameter estimation. However, fitting ML-TAR models can be computationally challenging. This study introduces a mean-field variational Bayes (MFVB) algorithm as an alternative to traditional Bayesian inference. By optimizing to approximate posterior densities, variational Bayes aims to find the best approximation within a defined set of distributions. Simulation results demonstrate that our MFVB algorithm is significantly faster than the standard Markov chain Monte Carlo (MCMC) approach. Moreover, increasing the number of individuals or time points enhances the accuracy of the parameter estimates using MFVB, suggesting that sufficient data are crucial for accurate estimation in complex models like ML-TAR models. When applied to real-world data, the MFVB algorithm was significantly more efficient than MCMC and maintained similar accuracy. Thus, the MFVB algorithm is a faster and more consistent alternative to MCMC for large-scale inference in ILD models.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"785-803"},"PeriodicalIF":1.8,"publicationDate":"2025-01-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143016865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Data fusion by T3–PCA: A global model for the simultaneous analysis of coupled three-way and two-way real-valued data 通过 T3-PCA 进行数据融合:同时分析三向和双向实值耦合数据的全局模型。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-15 DOI: 10.1111/bmsp.12372
Elisa Frutos-Bernal, Eva Ceulemans, Purificación Galindo-Villardón, Tom F. Wilderjans

In various areas of science, researchers try to gain insight into important processes by jointly analysing different datasets containing information regarding common aspects of these processes. For example, to explain individual differences in personality, researchers collect, for the same set of persons, data regarding behavioural signatures (i.e., the reaction profile of a person across different situations), on the one hand, and traits or dispositions, on the other hand. To uncover the processes underlying such coupled data, to all N-way N-mode data blocks simultaneously a global model is fitted, in which each data block is represented by an N-way N-mode decomposition model (e.g., principal component analysis [PCA], Parafac, Tucker3) and the parameters underlying the common mode are required to be the same for all data blocks this mode belongs to. To estimate the parameters underlying the common mode, a simultaneous strategy is used that pools the information present in all data blocks (i.e., data fusion). In this paper, we propose the T3–PCA model, which represents three- and two-way data with Tucker3 and PCA respectively. This model is less restrictive than the already proposed LMPCA model in which the three-way data block is decomposed according to a Parafac model. To estimate the T3–PCA model parameters, an alternating least-squares algorithm is proposed. The superior performance of the simultaneous T3–PCA strategy over a sequential strategy (i.e., estimating common parameters using information from the three-way data block only) is demonstrated in an extensive simulation study and an application to empirical coupled anxiety data.

在不同的科学领域,研究人员试图通过联合分析包含这些过程的共同方面的信息的不同数据集来深入了解重要的过程。例如,为了解释个性的个体差异,研究人员收集了同一组人的行为特征数据(即,一个人在不同情况下的反应概况),以及另一方面的特征或性格。为了揭示这些耦合数据背后的过程,以所有N-way N $$ N $$ -mode数据块同时拟合一个全局模型,其中每个数据块用N表示 $$ N $$ -way N $$ N $$ -模态分解模型(如主成分分析[PCA]、Parafac、Tucker3)和公共模态的底层参数对于该模态所属的所有数据块都要求相同。为了估计公共模式下的参数,使用了一种同步策略,将所有数据块中的信息集中在一起(即数据融合)。本文提出了T3-PCA模型,分别用Tucker3和PCA表示三向和双向数据。该模型比已经提出的LMPCA模型约束更少,在LMPCA模型中,根据Parafac模型对三向数据块进行分解。为了估计T3-PCA模型参数,提出了一种交替最小二乘算法。在广泛的模拟研究和经验耦合焦虑数据的应用中,证明了同步T3-PCA策略优于顺序策略(即仅使用来自三方数据块的信息估计共同参数)。
{"title":"Data fusion by T3–PCA: A global model for the simultaneous analysis of coupled three-way and two-way real-valued data","authors":"Elisa Frutos-Bernal,&nbsp;Eva Ceulemans,&nbsp;Purificación Galindo-Villardón,&nbsp;Tom F. Wilderjans","doi":"10.1111/bmsp.12372","DOIUrl":"10.1111/bmsp.12372","url":null,"abstract":"<p>In various areas of science, researchers try to gain insight into important processes by jointly analysing different datasets containing information regarding common aspects of these processes. For example, to explain individual differences in personality, researchers collect, for the same set of persons, data regarding behavioural signatures (i.e., the reaction profile of a person across different situations), on the one hand, and traits or dispositions, on the other hand. To uncover the processes underlying such coupled data, to all <i>N</i>-way <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 </semantics></math>-mode data blocks simultaneously a global model is fitted, in which each data block is represented by an <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 </semantics></math>-way <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>N</mi>\u0000 </mrow>\u0000 </semantics></math>-mode decomposition model (e.g., principal component analysis [PCA], Parafac, Tucker3) and the parameters underlying the common mode are required to be the same for all data blocks this mode belongs to. To estimate the parameters underlying the common mode, a simultaneous strategy is used that pools the information present in all data blocks (i.e., data fusion). In this paper, we propose the T3–PCA model, which represents three- and two-way data with Tucker3 and PCA respectively. This model is less restrictive than the already proposed LMPCA model in which the three-way data block is decomposed according to a Parafac model. To estimate the T3–PCA model parameters, an alternating least-squares algorithm is proposed. The superior performance of the simultaneous T3–PCA strategy over a sequential strategy (i.e., estimating common parameters using information from the three-way data block only) is demonstrated in an extensive simulation study and an application to empirical coupled anxiety data.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 2","pages":"672-709"},"PeriodicalIF":1.8,"publicationDate":"2025-01-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12372","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142984780","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of fit of item response theory models: A critical review of the status quo and some future directions 项目反应理论模型的拟合性评价:现状及未来发展方向。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2025-01-06 DOI: 10.1111/bmsp.12378
Sandip Sinharay, Scott Monroe

This paper provides a literature review of assessment of fit of item response theory models. Various types of fit procedures for item response theory models are reviewed, with a focus on their advantages and disadvantages. Real data examples are used to demonstrate some of the fit procedures. Recommendations are provided for researchers and practitioners who are interested in assessing the fit of item response theory models.

本文对项目反应理论模型的拟合评价进行了文献综述。综述了项目反应理论模型的各种拟合程序,重点分析了它们的优缺点。用实际数据的例子来说明一些拟合过程。为有兴趣评估项目反应理论模型契合度的研究者和实践者提供了建议。
{"title":"Assessment of fit of item response theory models: A critical review of the status quo and some future directions","authors":"Sandip Sinharay,&nbsp;Scott Monroe","doi":"10.1111/bmsp.12378","DOIUrl":"10.1111/bmsp.12378","url":null,"abstract":"<p>This paper provides a literature review of assessment of fit of item response theory models. Various types of fit procedures for item response theory models are reviewed, with a focus on their advantages and disadvantages. Real data examples are used to demonstrate some of the fit procedures. Recommendations are provided for researchers and practitioners who are interested in assessing the fit of item response theory models.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"711-733"},"PeriodicalIF":1.8,"publicationDate":"2025-01-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142933352","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The generalized Hausman test for detecting non-normality in the latent variable distribution of the two-parameter IRT model 用于检测双参数IRT模型潜变量分布非正态性的广义Hausman检验。
IF 1.8 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2024-12-26 DOI: 10.1111/bmsp.12379
Lucia Guastadisegni, Silvia Cagnone, Irini Moustaki, Vassilis Vasdekis

This paper introduces the generalized Hausman test as a novel method for detecting the non-normality of the latent variable distribution of the unidimensional latent trait model for binary data. The test utilizes the pairwise maximum likelihood estimator for the parameters of the latent trait model, which assumes normality of the latent variable, and the maximum likelihood estimator obtained under a semi-non-parametric framework, allowing for a more flexible distribution of the latent variable. The performance of the generalized Hausman test is evaluated through a simulation study and compared with other test statistics available in the literature for testing latent variable distribution fit and an overall goodness-of-fit test statistic. Additionally, three information criteria are used to select the best-fitted model. The simulation results show that the generalized Hausman test outperforms the other tests under most conditions. However, the results obtained from the information criteria are somewhat contradictory under certain conditions, suggesting a need for further investigation and interpretation. The proposed test statistics are used in three datasets.

本文介绍了广义Hausman检验作为一种检测二元数据单维潜在特征模型潜在变量分布非正态性的新方法。该检验利用潜在特征模型参数的两两最大似然估计量,假设潜在变量的正态性,以及在半非参数框架下获得的最大似然估计量,允许潜在变量的更灵活的分布。通过模拟研究评估广义Hausman检验的性能,并与文献中用于检验潜在变量分布拟合和总体拟合优度检验统计量的其他检验统计量进行比较。此外,使用三个信息标准来选择最适合的模型。仿真结果表明,广义Hausman测试在大多数情况下都优于其他测试。然而,从信息标准得到的结果在某些条件下有些矛盾,这表明需要进一步的调查和解释。提出的测试统计量在三个数据集中使用。
{"title":"The generalized Hausman test for detecting non-normality in the latent variable distribution of the two-parameter IRT model","authors":"Lucia Guastadisegni,&nbsp;Silvia Cagnone,&nbsp;Irini Moustaki,&nbsp;Vassilis Vasdekis","doi":"10.1111/bmsp.12379","DOIUrl":"10.1111/bmsp.12379","url":null,"abstract":"<p>This paper introduces the generalized Hausman test as a novel method for detecting the non-normality of the latent variable distribution of the unidimensional latent trait model for binary data. The test utilizes the pairwise maximum likelihood estimator for the parameters of the latent trait model, which assumes normality of the latent variable, and the maximum likelihood estimator obtained under a semi-non-parametric framework, allowing for a more flexible distribution of the latent variable. The performance of the generalized Hausman test is evaluated through a simulation study and compared with other test statistics available in the literature for testing latent variable distribution fit and an overall goodness-of-fit test statistic. Additionally, three information criteria are used to select the best-fitted model. The simulation results show that the generalized Hausman test outperforms the other tests under most conditions. However, the results obtained from the information criteria are somewhat contradictory under certain conditions, suggesting a need for further investigation and interpretation. The proposed test statistics are used in three datasets.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"78 3","pages":"734-756"},"PeriodicalIF":1.8,"publicationDate":"2024-12-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12379","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142900633","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
British Journal of Mathematical & Statistical Psychology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1