首页 > 最新文献

British Journal of Mathematical & Statistical Psychology最新文献

英文 中文
Statistical inference for agreement between multiple raters on a binary scale 二元量表上多个评分者之间一致性的统计推断。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-01-17 DOI: 10.1111/bmsp.12333
Sophie Vanbelle

Agreement studies often involve more than two raters or repeated measurements. In the presence of two raters, the proportion of agreement and of positive agreement are simple and popular agreement measures for binary scales. These measures were generalized to agreement studies involving more than two raters with statistical inference procedures proposed on an empirical basis. We present two alternatives. The first is a Wald confidence interval using standard errors obtained by the delta method. The second involves Bayesian statistical inference not requiring any specific Bayesian software. These new procedures show better statistical behaviour than the confidence intervals initially proposed. In addition, we provide analytical formulas to determine the minimum number of persons needed for a given number of raters when planning an agreement study. All methods are implemented in the R package simpleagree and the Shiny app simpleagree.

一致性研究通常涉及两个以上的评分者或重复测量。在有两个评分者的情况下,二元量表的一致比例和积极一致比例是简单而常用的一致度量。根据经验提出的统计推论程序,这些测量方法被推广到涉及两个以上评分者的一致性研究中。我们提出了两种替代方案。第一种是使用德尔塔法获得的标准误差的沃尔德置信区间。第二种涉及贝叶斯统计推断,不需要任何特定的贝叶斯软件。与最初提出的置信区间相比,这些新程序显示出更好的统计性能。此外,我们还提供了分析公式,以便在计划协议研究时确定给定数量的评分者所需的最少人数。所有方法都在 R 软件包 simpleagree 和 Shiny 应用程序 simpleagree 中实现。
{"title":"Statistical inference for agreement between multiple raters on a binary scale","authors":"Sophie Vanbelle","doi":"10.1111/bmsp.12333","DOIUrl":"10.1111/bmsp.12333","url":null,"abstract":"<p>Agreement studies often involve more than two raters or repeated measurements. In the presence of two raters, the proportion of agreement and of positive agreement are simple and popular agreement measures for binary scales. These measures were generalized to agreement studies involving more than two raters with statistical inference procedures proposed on an empirical basis. We present two alternatives. The first is a Wald confidence interval using standard errors obtained by the delta method. The second involves Bayesian statistical inference not requiring any specific Bayesian software. These new procedures show better statistical behaviour than the confidence intervals initially proposed. In addition, we provide analytical formulas to determine the minimum number of persons needed for a given number of raters when planning an agreement study. All methods are implemented in the R package <i>simpleagree</i> and the Shiny app <i>simpleagree</i>.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-01-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12333","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139486878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A cluster differences unfolding method for large datasets of preference ratings on an interval scale: Minimizing the mean squared centred residuals 用于区间尺度偏好评分大型数据集的聚类差异展开法:最小化居中残差均方。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2024-01-11 DOI: 10.1111/bmsp.12332
Rodrigo Macías, J. Fernando Vera, Willem J. Heiser

Clustering and spatial representation methods are often used in combination, to analyse preference ratings when a large number of individuals and/or object is involved. When analysed under an unfolding model, row-conditional linear transformations are usually most appropriate when the goal is to determine clusters of individuals with similar preferences. However, a significant problem with transformations that include both slope and intercept is the occurrence of degenerate solutions. In this paper, we propose a least squares unfolding method that performs clustering of individuals while simultaneously estimating the location of cluster centres and object locations in low-dimensional space. The method is based on minimising the mean squared centred residuals of the preference ratings with respect to the distances between cluster centres and object locations. At the same time, the distances are row-conditionally transformed with optimally estimated slope parameters. It is computationally efficient for large datasets, and does not suffer from the appearance of degenerate solutions. The performance of the method is analysed in an extensive Monte Carlo experiment. It is illustrated for a real data set and the results are compared with those obtained using a two-step clustering and unfolding procedure.

当涉及大量个体和/或对象时,聚类和空间表示方法通常会结合使用,以分析偏好评级。在展开模型下进行分析时,当目标是确定具有相似偏好的个体聚类时,行条件线性变换通常是最合适的。然而,同时包含斜率和截距的变换的一个重要问题是会出现退化解。在本文中,我们提出了一种最小二乘展开法,在对个体进行聚类的同时,还能估计聚类中心的位置和低维空间中的对象位置。该方法基于最小化偏好评级与聚类中心和对象位置之间距离的均方中心残差。同时,利用最优估计的斜率参数对距离进行行条件变换。该方法对大型数据集的计算效率很高,而且不会出现退化解。通过大量的蒙特卡罗实验分析了该方法的性能。对一个真实数据集进行了说明,并将结果与使用两步聚类和展开程序获得的结果进行了比较。
{"title":"A cluster differences unfolding method for large datasets of preference ratings on an interval scale: Minimizing the mean squared centred residuals","authors":"Rodrigo Macías,&nbsp;J. Fernando Vera,&nbsp;Willem J. Heiser","doi":"10.1111/bmsp.12332","DOIUrl":"10.1111/bmsp.12332","url":null,"abstract":"<p>Clustering and spatial representation methods are often used in combination, to analyse preference ratings when a large number of individuals and/or object is involved. When analysed under an unfolding model, row-conditional linear transformations are usually most appropriate when the goal is to determine clusters of individuals with similar preferences. However, a significant problem with transformations that include both slope and intercept is the occurrence of degenerate solutions. In this paper, we propose a least squares unfolding method that performs clustering of individuals while simultaneously estimating the location of cluster centres and object locations in low-dimensional space. The method is based on minimising the mean squared centred residuals of the preference ratings with respect to the distances between cluster centres and object locations. At the same time, the distances are row-conditionally transformed with optimally estimated slope parameters. It is computationally efficient for large datasets, and does not suffer from the appearance of degenerate solutions. The performance of the method is analysed in an extensive Monte Carlo experiment. It is illustrated for a real data set and the results are compared with those obtained using a two-step clustering and unfolding procedure.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2024-01-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139426139","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correcting for measurement error under meta-analysis of z-transformed correlations 在对 z 变形相关性进行元分析时纠正测量误差。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-12-28 DOI: 10.1111/bmsp.12328
Qian Zhang, Qi Wang

This study mainly concerns correction for measurement error using the meta-analysis of Fisher's z-transformed correlations. The disattenuation formula of Spearman (American Journal of Psychology, 15, 1904, 72) is used to correct for individual raw correlations in primary studies. The corrected raw correlations are then used to obtain the corrected z-transformed correlations. What remains little studied, however, is how to best correct for within-study sampling error variances of corrected z-transformed correlations. We focused on three within-study sampling error variance estimators corrected for measurement error that were proposed in earlier studies and is proposed in the current study: (1) the formula given by Hedges (Test validity, Lawrence Erlbaum, 1988) assuming a linear relationship between corrected and uncorrected z-transformed correlations (linear correction), (2) one derived by the first-order delta method based on the average of corrected z-transformed correlations (stabilized first-order correction), and (3) one derived by the second-order delta method based on the average of corrected z-transformed correlations (stabilized second-order correction). Via a simulation study, we compared performance of these estimators and the sampling error variance estimator uncorrected for measurement error in terms of estimation and inference accuracy of the mean correlation as well as the homogeneity test of effect sizes. In obtaining the corrected z-transformed correlations and within-study sampling error variances, coefficient alpha was used as a common reliability coefficient estimate. The results showed that in terms of the estimated mean correlation, sampling error variances with linear correction, the stabilized first-order and second-order corrections, and no correction performed similarly in general. Furthermore, in terms of the homogeneity test, given a relatively large average sample size and normal true scores, the stabilized first-order and second-order corrections had type I error rates that were generally controlled as well as or better than the other estimators. Overall, stabilized first-order and second-order corrections are recommended when true scores are normal, reliabilities are acceptable, the number of items per psychological scale is relatively large, and the average sample size is relatively large.

本研究主要涉及利用费舍尔 z 变形相关系数的元分析来校正测量误差。斯皮尔曼(Spearman)的失调公式(《美国心理学杂志》,15,1904,72)用于校正原始研究中的个体原始相关性。然后使用校正后的原始相关性来获得校正后的 z 转换相关性。然而,对于如何最好地校正校正过的 z 转换相关性的研究内部抽样误差方差的研究仍然很少。我们重点研究了三种校正了测量误差的研究内部抽样误差方差估计方法,这三种方法在以前的研究中提出过,在本次研究中也提出了:(1) Hedges(《测试有效性》,Lawrence Erlbaum,1988 年)给出的公式,假设校正过的和未校正过的 z 转换相关系数之间存在线性关系(线性校正);(2) 根据校正过的 z 转换相关系数的平均值,通过一阶三角法得出的公式(稳定一阶校正);(3) 根据校正过的 z 转换相关系数的平均值,通过二阶三角法得出的公式(稳定二阶校正)。通过模拟研究,我们比较了这些估计器和未修正测量误差的抽样误差方差估计器在平均相关性的估计和推断准确性以及效应大小的同质性检验方面的性能。在获得经校正的 z 变形相关性和研究内部抽样误差方差时,使用了系数 α 作为通用的可靠性系数估计值。结果表明,在估计平均相关性方面,采用线性校正、稳定的一阶和二阶校正以及不采用校正的抽样误差方差总体表现相似。此外,就同质性检验而言,在平均样本量相对较大且真实分数正常的情况下,稳定化一阶和二阶校正的 I 类错误率通常控制得与其他估计器一样好,甚至更好。总体而言,当真实得分正常、信度可接受、每个心理量表的项目数相对较多、平均样本量相对较大时,建议使用稳定一阶和二阶修正法。
{"title":"Correcting for measurement error under meta-analysis of z-transformed correlations","authors":"Qian Zhang,&nbsp;Qi Wang","doi":"10.1111/bmsp.12328","DOIUrl":"10.1111/bmsp.12328","url":null,"abstract":"<p>This study mainly concerns correction for measurement error using the meta-analysis of Fisher's z-transformed correlations. The disattenuation formula of Spearman (American Journal of Psychology, <b>15</b>, 1904, 72) is used to correct for individual raw correlations in primary studies. The corrected raw correlations are then used to obtain the corrected z-transformed correlations. What remains little studied, however, is how to best correct for within-study sampling error variances of corrected z-transformed correlations. We focused on three within-study sampling error variance estimators corrected for measurement error that were proposed in earlier studies and is proposed in the current study: (1) the formula given by Hedges (<i>Test validity</i>, Lawrence Erlbaum, 1988) assuming a linear relationship between corrected and uncorrected z-transformed correlations (linear correction), (2) one derived by the first-order delta method based on the average of corrected z-transformed correlations (stabilized first-order correction), and (3) one derived by the second-order delta method based on the average of corrected z-transformed correlations (stabilized second-order correction). Via a simulation study, we compared performance of these estimators and the sampling error variance estimator uncorrected for measurement error in terms of estimation and inference accuracy of the mean correlation as well as the homogeneity test of effect sizes. In obtaining the corrected z-transformed correlations and within-study sampling error variances, coefficient alpha was used as a common reliability coefficient estimate. The results showed that in terms of the estimated mean correlation, sampling error variances with linear correction, the stabilized first-order and second-order corrections, and no correction performed similarly in general. Furthermore, in terms of the homogeneity test, given a relatively large average sample size and normal true scores, the stabilized first-order and second-order corrections had type I error rates that were generally controlled as well as or better than the other estimators. Overall, stabilized first-order and second-order corrections are recommended when true scores are normal, reliabilities are acceptable, the number of items per psychological scale is relatively large, and the average sample size is relatively large.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-12-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139059109","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mixtures of t $$ t $$ factor analysers with censored responses and external covariates: An application to educational data from Peru 有删减反应和外部协变量的 t 因子分析器混合物:秘鲁教育数据的应用
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-12-14 DOI: 10.1111/bmsp.12329
Wan-Lun Wang, Luis M. Castro, Huei-Jyun Li, Tsung-I Lin

Analysing data from educational tests allows governments to make decisions for improving the quality of life of individuals in a society. One of the key responsibilities of statisticians is to develop models that provide decision-makers with pertinent information about the latent process that educational tests seek to represent. Mixtures of t$$ t $$ factor analysers (MtFA) have emerged as a powerful device for model-based clustering and classification of high-dimensional data containing one or several groups of observations with fatter tails or anomalous outliers. This paper considers an extension of MtFA for robust clustering of censored data, referred to as the MtFAC model, by incorporating external covariates. The enhanced flexibility of including covariates in MtFAC enables cluster-specific multivariate regression analysis of dependent variables with censored responses arising from upper and/or lower detection limits of experimental equipment. An alternating expectation conditional maximization (AECM) algorithm is developed for maximum likelihood estimation of the proposed model. Two simulation experiments are conducted to examine the effectiveness of the techniques presented. Furthermore, the proposed methodology is applied to Peruvian data from the 2007 Early Grade Reading Assessment, and the results obtained from the analysis provide new insights regarding the reading skills of Peruvian students.

分析教育测试的数据使政府能够为提高社会中个人的生活质量作出决定。统计学家的主要职责之一是开发模型,为决策者提供有关教育测试试图代表的潜在过程的相关信息。混合t $$ t $$因子分析器(MtFA)已经成为一种强大的设备,用于基于模型的聚类和高维数据的分类,这些数据包含一组或几组具有较宽尾部或异常异常值的观测值。本文考虑了MtFA的一个扩展,即MtFAC模型,通过合并外部协变量来实现截尾数据的鲁棒聚类。在MtFAC中包含协变量的灵活性增强,可以对因变量进行特定集群的多变量回归分析,这些因变量的响应由实验设备的上限和/或下限检测限引起。针对该模型的极大似然估计,提出了一种交替期望条件最大化(AECM)算法。通过两个仿真实验验证了所提技术的有效性。此外,所提出的方法应用于秘鲁2007年早期阅读评估的数据,从分析中获得的结果为秘鲁学生的阅读技能提供了新的见解。
{"title":"Mixtures of \u0000 \u0000 \u0000 t\u0000 \u0000 $$ t $$\u0000 factor analysers with censored responses and external covariates: An application to educational data from Peru","authors":"Wan-Lun Wang,&nbsp;Luis M. Castro,&nbsp;Huei-Jyun Li,&nbsp;Tsung-I Lin","doi":"10.1111/bmsp.12329","DOIUrl":"10.1111/bmsp.12329","url":null,"abstract":"<p>Analysing data from educational tests allows governments to make decisions for improving the quality of life of individuals in a society. One of the key responsibilities of statisticians is to develop models that provide decision-makers with pertinent information about the latent process that educational tests seek to represent. Mixtures of <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>t</mi>\u0000 </mrow>\u0000 <annotation>$$ t $$</annotation>\u0000 </semantics></math> factor analysers (MtFA) have emerged as a powerful device for model-based clustering and classification of high-dimensional data containing one or several groups of observations with fatter tails or anomalous outliers. This paper considers an extension of MtFA for robust clustering of censored data, referred to as the MtFAC model, by incorporating external covariates. The enhanced flexibility of including covariates in MtFAC enables cluster-specific multivariate regression analysis of dependent variables with censored responses arising from upper and/or lower detection limits of experimental equipment. An alternating expectation conditional maximization (AECM) algorithm is developed for maximum likelihood estimation of the proposed model. Two simulation experiments are conducted to examine the effectiveness of the techniques presented. Furthermore, the proposed methodology is applied to Peruvian data from the 2007 Early Grade Reading Assessment, and the results obtained from the analysis provide new insights regarding the reading skills of Peruvian students.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138630782","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Editorial acknowledgement 编辑致谢
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-12-09 DOI: 10.1111/bmsp.12331
{"title":"Editorial acknowledgement","authors":"","doi":"10.1111/bmsp.12331","DOIUrl":"https://doi.org/10.1111/bmsp.12331","url":null,"abstract":"","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"139435114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Using cross-validation methods to select time series models: Promises and pitfalls 使用交叉验证方法选择时间序列模型:承诺和缺陷。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-12-07 DOI: 10.1111/bmsp.12330
Siwei Liu, Di Jody Zhou

Vector autoregressive (VAR) modelling is widely employed in psychology for time series analyses of dynamic processes. However, the typically short time series in psychological studies can lead to overfitting of VAR models, impairing their predictive ability on unseen samples. Cross-validation (CV) methods are commonly recommended for assessing the predictive ability of statistical models. However, it is unclear how the performance of CV is affected by characteristics of time series data and the fitted models. In this simulation study, we examine the ability of two CV methods, namely,10-fold CV and blocked CV, in estimating the prediction errors of three time series models with increasing complexity (person-mean, AR, and VAR), and evaluate how their performance is affected by data characteristics. We then compare these CV methods to the traditional methods using the Akaike (AIC) and Bayesian (BIC) information criteria in their accuracy of selecting the most predictive models. We find that CV methods tend to underestimate prediction errors of simpler models, but overestimate prediction errors of VAR models, particularly when the number of observations is small. Nonetheless, CV methods, especially blocked CV, generally outperform the AIC and BIC. We conclude our study with a discussion on the implications of the findings and provide helpful guidelines for practice.

向量自回归(VAR)模型在心理学中广泛应用于动态过程的时间序列分析。然而,在心理学研究中,典型的短时间序列会导致VAR模型的过拟合,损害其对未知样本的预测能力。交叉验证(CV)方法通常被推荐用于评估统计模型的预测能力。然而,目前尚不清楚时间序列数据和拟合模型的特征如何影响CV的性能。在这项模拟研究中,我们检验了两种CV方法,即10倍CV和阻塞CV,在估计三种复杂性时间序列模型(人平均、AR和VAR)的预测误差方面的能力,并评估了它们的性能如何受到数据特征的影响。然后,我们将这些CV方法与使用赤池(AIC)和贝叶斯(BIC)信息标准的传统方法在选择最具预测模型的准确性方面进行了比较。我们发现,CV方法往往低估了简单模型的预测误差,而高估了VAR模型的预测误差,特别是在观测数较少的情况下。尽管如此,CV方法,特别是阻塞CV方法,通常优于AIC和BIC方法。最后,我们对研究结果的意义进行了讨论,并为实践提供了有益的指导。
{"title":"Using cross-validation methods to select time series models: Promises and pitfalls","authors":"Siwei Liu,&nbsp;Di Jody Zhou","doi":"10.1111/bmsp.12330","DOIUrl":"10.1111/bmsp.12330","url":null,"abstract":"<p>Vector autoregressive (VAR) modelling is widely employed in psychology for time series analyses of dynamic processes. However, the typically short time series in psychological studies can lead to overfitting of VAR models, impairing their predictive ability on unseen samples. Cross-validation (CV) methods are commonly recommended for assessing the predictive ability of statistical models. However, it is unclear how the performance of CV is affected by characteristics of time series data and the fitted models. In this simulation study, we examine the ability of two CV methods, namely,10-fold CV and blocked CV, in estimating the prediction errors of three time series models with increasing complexity (person-mean, AR, and VAR), and evaluate how their performance is affected by data characteristics. We then compare these CV methods to the traditional methods using the Akaike (AIC) and Bayesian (BIC) information criteria in their accuracy of selecting the most predictive models. We find that CV methods tend to underestimate prediction errors of simpler models, but overestimate prediction errors of VAR models, particularly when the number of observations is small. Nonetheless, CV methods, especially blocked CV, generally outperform the AIC and BIC. We conclude our study with a discussion on the implications of the findings and provide helpful guidelines for practice.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-12-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12330","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138500287","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The effective sample size in Bayesian information criterion for level-specific fixed and random-effect selection in a two-level nested model 两层嵌套模型中特定水平固定和随机效应选择的贝叶斯信息准则的有效样本量
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-12-01 DOI: 10.1111/bmsp.12327
Sun-Joo Cho, Hao Wu, Matthew Naveiras

Popular statistical software provides the Bayesian information criterion (BIC) for multi-level models or linear mixed models. However, it has been observed that the combination of statistical literature and software documentation has led to discrepancies in the formulas of the BIC and uncertainties as to the proper use of the BIC in selecting a multi-level model with respect to level-specific fixed and random effects. These discrepancies and uncertainties result from different specifications of sample size in the BIC's penalty term for multi-level models. In this study, we derive the BIC's penalty term for level-specific fixed- and random-effect selection in a two-level nested design. In this new version of BIC, called BICE1, this penalty term is decomposed into two parts if the random-effect variance–covariance matrix has full rank: (a) a term with the log of average sample size per cluster and (b) the total number of parameters times the log of the total number of clusters. Furthermore, we derive the new version of BIC, called BICE2, in the presence of redundant random effects. We show that the derived formulae, BICE1 and BICE2, adhere to empirical values via numerical demonstration and that BICE (E indicating either

流行的统计软件为多级模型或线性混合模型提供了贝叶斯信息准则(BIC)。然而,据观察,统计文献和软件文件的结合导致了BIC公式的差异,以及在选择关于特定水平的固定效应和随机效应的多层次模型时如何正确使用BIC的不确定性。这些差异和不确定性是由于BIC对多级模型的惩罚项中样本量的不同规格造成的。在本研究中,我们推导了在两水平嵌套设计中特定水平的固定效应和随机效应选择的BIC惩罚项。在这个称为BICE1 $$ {mathrm{BIC}}_{E1} $$的新版本的BIC中,如果随机效应方差-协方差矩阵具有全秩,则该惩罚项被分解为两部分:(a)每个簇的平均样本量的log项和(b)参数总数乘以簇总数的log项。此外,在冗余随机效应存在的情况下,我们推导出新的BIC,称为BICE2 $$ {mathrm{BIC}}_{E2} $$。我们通过数值论证证明了推导公式BICE1 $$ {mathrm{BIC}}_{E1} $$和BICE2 $$ {mathrm{BIC}}_{E2} $$符合经验值,并且通过模拟研究表明,BICE $$ {mathrm{BIC}}_E $$ (E $$ E $$表示E1 $$ E1 $$或E2 $$ E2 $$)是最佳的全局选择标准,因为它在各种多层次条件下的总样本量和聚类数量上的表现至少与BIC一样好。此外,还用一个教科书样例数据集说明了BICE1 $$ {mathrm{BIC}}_{E1} $$的使用。
{"title":"The effective sample size in Bayesian information criterion for level-specific fixed and random-effect selection in a two-level nested model","authors":"Sun-Joo Cho,&nbsp;Hao Wu,&nbsp;Matthew Naveiras","doi":"10.1111/bmsp.12327","DOIUrl":"10.1111/bmsp.12327","url":null,"abstract":"<p>Popular statistical software provides the Bayesian information criterion (BIC) for multi-level models or linear mixed models. However, it has been observed that the combination of statistical literature and software documentation has led to discrepancies in the formulas of the BIC and uncertainties as to the proper use of the BIC in selecting a multi-level model with respect to level-specific fixed and random effects. These discrepancies and uncertainties result from different specifications of sample size in the BIC's penalty term for multi-level models. In this study, we derive the BIC's penalty term for level-specific fixed- and random-effect selection in a two-level nested design. In this new version of BIC, called <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mi>BIC</mi>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 <mn>1</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 </semantics></math>, this penalty term is decomposed into two parts if the random-effect variance–covariance matrix has full rank: (a) a term with the log of average sample size per cluster and (b) the total number of parameters times the log of the total number of clusters. Furthermore, we derive the new version of BIC, called <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mi>BIC</mi>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 </semantics></math>, in the presence of redundant random effects. We show that the derived formulae, <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mi>BIC</mi>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 <mn>1</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 </semantics></math> and <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mi>BIC</mi>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 <mn>2</mn>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 </semantics></math>, adhere to empirical values via numerical demonstration and that <span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <msub>\u0000 <mi>BIC</mi>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 </mrow>\u0000 </msub>\u0000 </mrow>\u0000 </semantics></math> (<span></span><math>\u0000 <semantics>\u0000 <mrow>\u0000 <mi>E</mi>\u0000 </mrow>\u0000 </semantics></math> indicating either <span></s","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"138544364","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On generating plausible values for multilevel modelling with large-scale-assessment data 基于大规模评价数据的多层次模型的可信值生成。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-11-13 DOI: 10.1111/bmsp.12326
Xiaying Zheng

Large-scale assessments (LSAs) routinely employ latent regressions to generate plausible values (PVs) for unbiased estimation of the relationship between examinees' background variables and performance. To handle the clustering effect common in LSA data, multilevel modelling is a popular choice. However, most LSAs use single-level conditioning methods, resulting in a mismatch between the imputation model and the multilevel analytic model. While some LSAs have implemented special techniques in single-level latent regressions to support random-intercept modelling, these techniques are not expected to support random-slope models. To address this gap, this study proposed two new single-level methods to support random-slope estimation. The existing and proposed methods were compared to the theoretically unbiased multilevel latent regression method in terms of their ability to support multilevel models. The findings indicate that the two existing single-level methods can support random-intercept-only models. The multilevel latent regression method provided mostly adequate estimates but was limited by computational burden and did not have the best performance across all conditions. One of our proposed single-level methods presented an efficient alternative to multilevel latent regression and was able to recover acceptable estimates for all parameters. We provide recommendations for situations where each method can be applied, with some caveats.

大规模评估(LSAs)通常使用潜在回归来产生可信值(pv),以无偏估计考生背景变量与表现之间的关系。为了处理LSA数据中常见的聚类效应,多层建模是一种流行的选择。然而,大多数lsa采用单层次条件作用的方法,导致了imputation模型与多层分析模型之间的不匹配。虽然一些lsa在单水平潜在回归中实施了特殊技术来支持随机截距建模,但这些技术并不支持随机斜率模型。为了解决这一差距,本研究提出了两种新的单水平方法来支持随机斜率估计。在支持多水平模型的能力方面,将现有方法和提出的方法与理论上无偏的多水平潜在回归方法进行了比较。结果表明,现有的两种单级方法可以支持随机截取模型。多水平潜回归方法提供了足够的估计,但受计算量的限制,并不是在所有条件下都有最好的性能。我们提出的一种单水平方法提供了一种有效的替代多水平潜在回归的方法,并且能够恢复所有参数的可接受估计。我们为每种方法都可以应用的情况提供了建议,但有一些注意事项。
{"title":"On generating plausible values for multilevel modelling with large-scale-assessment data","authors":"Xiaying Zheng","doi":"10.1111/bmsp.12326","DOIUrl":"10.1111/bmsp.12326","url":null,"abstract":"<p>Large-scale assessments (LSAs) routinely employ latent regressions to generate plausible values (PVs) for unbiased estimation of the relationship between examinees' background variables and performance. To handle the clustering effect common in LSA data, multilevel modelling is a popular choice. However, most LSAs use single-level conditioning methods, resulting in a mismatch between the imputation model and the multilevel analytic model. While some LSAs have implemented special techniques in single-level latent regressions to support random-intercept modelling, these techniques are not expected to support random-slope models. To address this gap, this study proposed two new single-level methods to support random-slope estimation. The existing and proposed methods were compared to the theoretically unbiased multilevel latent regression method in terms of their ability to support multilevel models. The findings indicate that the two existing single-level methods can support random-intercept-only models. The multilevel latent regression method provided mostly adequate estimates but was limited by computational burden and did not have the best performance across all conditions. One of our proposed single-level methods presented an efficient alternative to multilevel latent regression and was able to recover acceptable estimates for all parameters. We provide recommendations for situations where each method can be applied, with some caveats.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"89720647","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correction to ‘a note on computing Louis' observed information matrix identity for IRT and cognitive diagnostic models’ 更正“关于计算Louis的注释”,“IRT和认知诊断模型的观测信息矩阵恒等式”。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-10-25 DOI: 10.1111/bmsp.12325

Liu, C. W., & Chalmers, R. P. (2021). A note on computing Louis' observed information matrix identity for IRT and cognitive diagnostic models. British Journal of Mathematical and Statistical Psychology, 74(1), 118–138. https://doi.org/10.1111/bmsp.12207

The acknowledgement of funding was included in error: the paper was received on 30 April 2020, while the mentioned grant commenced on 1 August 2020. Consequently, there is no overlap between the grant period and the received date, rendering the acknowledgment inaccurate.

We apologize for this error.

Liu, C. W., & Chalmers, R. P. (2021)。关于计算 IRT 和认知诊断模型的路易斯观察信息矩阵同一性的说明。British Journal of Mathematical and Statistical Psychology, 74(1), 118-138. https://doi.org/10.1111/bmsp.12207The 资助确认有误:论文于 2020 年 4 月 30 日收到,而所述资助于 2020 年 8 月 1 日开始。因此,资助期限与收稿日期不存在重叠,导致鸣谢不准确。
{"title":"Correction to ‘a note on computing Louis' observed information matrix identity for IRT and cognitive diagnostic models’","authors":"","doi":"10.1111/bmsp.12325","DOIUrl":"10.1111/bmsp.12325","url":null,"abstract":"<p>Liu, C. W., &amp; Chalmers, R. P. (2021). A note on computing Louis' observed information matrix identity for IRT and cognitive diagnostic models. <i>British Journal of Mathematical and Statistical Psychology</i>, 74(1), 118–138. https://doi.org/10.1111/bmsp.12207</p><p>The acknowledgement of funding was included in error: the paper was received on 30 April 2020, while the mentioned grant commenced on 1 August 2020. Consequently, there is no overlap between the grant period and the received date, rendering the acknowledgment inaccurate.</p><p>We apologize for this error.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-10-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12325","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"50163853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A correlated traits correlated (methods – 1) multitrait-multimethod model for augmented round-robin data 用于扩充循环数据的相关性状相关(方法-1)多性状多方法模型。
IF 2.6 3区 心理学 Q1 Mathematics Pub Date : 2023-10-16 DOI: 10.1111/bmsp.12324
David Jendryczko, Fridtjof W. Nussbeck

We didactically derive a correlated traits correlated (methods – 1) [CTC(M – 1)] multitrait-multimethod (MTMM) model for dyadic round-robin data augmented by self-reports. The model is an extension of the CTC(M – 1) model for cross-classified data and can handle dependencies between raters and targets by including reciprocity covariance parameters that are inherent in augmented round-robin designs. It can be specified as a traditional structural equation model. We present the variance decomposition as well as consistency and reliability coefficients. Moreover, we explain how to evaluate fit of a CTC(M – 1) model for augmented round-robin data. In a simulation study, we explore the properties of the full information maximum likelihood estimation of the model. Model (mis)fit can be quite accurately detected with the test of not close fit and dynamic root mean square errors of approximation. Even with few small round-robin groups, relative parameter estimation bias and coverage rates are satisfactory, but several larger round-robin groups are needed to minimize relative parameter estimation inaccuracy. Further, neglecting the reciprocity covariance-structure of the augmented round-robin data does not severely bias the remaining parameter estimates. All analyses (including data, R scripts, and results) and the simulation study are provided in the Supporting Information. Implications and limitations are discussed.

我们从教学上推导了一个相关特征相关(方法-1)[CTC(M-1)]多特征多方法(MTMM)模型,用于通过自我报告增强的二元循环数据。该模型是交叉分类数据CTC(M-1)模型的扩展,可以通过包括增强循环设计中固有的互易协方差参数来处理评分者和目标之间的相关性。它可以被指定为传统的结构方程模型。我们给出了方差分解以及一致性和可靠性系数。此外,我们还解释了如何评估CTC(M-1)模型对增广循环数据的拟合。在模拟研究中,我们探索了模型的全信息最大似然估计的性质。通过近似的非紧密拟合和动态均方根误差的检验,可以非常准确地检测模型拟合的错误。即使使用很少的小循环组,相对参数估计偏差和覆盖率也是令人满意的,但需要几个较大的循环组来最小化相对参数估计的不准确度。此外,忽略增强的循环数据的互易协方差结构不会严重偏移剩余的参数估计。支持信息中提供了所有分析(包括数据、R脚本和结果)和模拟研究。讨论了影响和局限性。
{"title":"A correlated traits correlated (methods – 1) multitrait-multimethod model for augmented round-robin data","authors":"David Jendryczko,&nbsp;Fridtjof W. Nussbeck","doi":"10.1111/bmsp.12324","DOIUrl":"10.1111/bmsp.12324","url":null,"abstract":"<p>We didactically derive a correlated traits correlated (methods – 1) [CTC(M – 1)] multitrait-multimethod (MTMM) model for dyadic round-robin data augmented by self-reports. The model is an extension of the CTC(M – 1) model for cross-classified data <i>and</i> can handle dependencies between raters and targets by including reciprocity covariance parameters that are inherent in augmented round-robin designs. It can be specified as a traditional structural equation model. We present the variance decomposition as well as consistency and reliability coefficients. Moreover, we explain how to evaluate fit of a CTC(M – 1) model for augmented round-robin data. In a simulation study, we explore the properties of the full information maximum likelihood estimation of the model. Model (mis)fit can be quite accurately detected with the test of not close fit and dynamic root mean square errors of approximation. Even with few small round-robin groups, relative parameter estimation bias and coverage rates are satisfactory, but several larger round-robin groups are needed to minimize relative parameter estimation inaccuracy. Further, neglecting the reciprocity covariance-structure of the augmented round-robin data does not severely bias the remaining parameter estimates. All analyses (including data, R scripts, and results) and the simulation study are provided in the Supporting Information. Implications and limitations are discussed.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":null,"pages":null},"PeriodicalIF":2.6,"publicationDate":"2023-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12324","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41241136","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
British Journal of Mathematical & Statistical Psychology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1