首页 > 最新文献

British Journal of Mathematical & Statistical Psychology最新文献

英文 中文
Complete Q-matrices in conjunctive models on general attribute structures 一般属性结构上合取模型中的完全q矩阵
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-03-20 DOI: 10.1111/bmsp.12266
Jürgen Heller

In cognitive diagnostic assessment a property of the Q-matrix, usually referred to as completeness, warrants that the cognitive attributes underlying the observed behaviour can be uniquely assessed. Characterizations of completeness were first derived under the assumption of independent attributes, and are currently under investigation for interdependent attributes. The dominant approach considers so-called attribute hierarchies, which are conceptualized through a partial order on the set of attributes. The present paper extends previously published results on this issue obtained for conjunctive attribute hierarchy models. Drawing upon results from knowledge structure theory, it provides novel sufficient and necessary conditions for completeness of the Q-matrix, not only for conjunctive models on attribute hierarchies, but also on more general attribute structures.

在认知诊断评估中,q矩阵的一个属性,通常被称为完整性,保证了观察到的行为背后的认知属性可以被唯一地评估。完备性的特征首先是在独立属性的假设下推导出来的,目前正在对相互依赖属性进行研究。占主导地位的方法考虑所谓的属性层次结构,它通过属性集上的偏序来概念化。本文扩展了先前发表的关于连接属性层次模型的结果。利用知识结构理论的结果,它不仅为属性层次上的合取模型,而且为更一般的属性结构上的合取模型提供了新的q矩阵完备性的充要条件。
{"title":"Complete Q-matrices in conjunctive models on general attribute structures","authors":"Jürgen Heller","doi":"10.1111/bmsp.12266","DOIUrl":"10.1111/bmsp.12266","url":null,"abstract":"<p>In cognitive diagnostic assessment a property of the <i>Q</i>-matrix, usually referred to as completeness, warrants that the cognitive attributes underlying the observed behaviour can be uniquely assessed. Characterizations of completeness were first derived under the assumption of independent attributes, and are currently under investigation for interdependent attributes. The dominant approach considers so-called attribute hierarchies, which are conceptualized through a partial order on the set of attributes. The present paper extends previously published results on this issue obtained for conjunctive attribute hierarchy models. Drawing upon results from knowledge structure theory, it provides novel sufficient and necessary conditions for completeness of the <i>Q</i>-matrix, not only for conjunctive models on attribute hierarchies, but also on more general attribute structures.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 3","pages":"522-549"},"PeriodicalIF":2.6,"publicationDate":"2022-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12266","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124910395","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Refinement: Measuring informativeness of ratings in the absence of a gold standard 细化:在没有金标准的情况下衡量评级的信息量
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-03-16 DOI: 10.1111/bmsp.12268
Sheridan Grant, Marina Meilă, Elena Erosheva, Carole Lee

We propose a new metric for evaluating the informativeness of a set of ratings from a single rater on a given scale. Such evaluations are of interest when raters rate numerous comparable items on the same scale, as occurs in hiring, college admissions, and peer review. Our exposition takes the context of peer review, which involves univariate and multivariate cardinal ratings. We draw on this context to motivate an information-theoretic measure of the refinement of a set of ratings – entropic refinement – as well as two secondary measures. A mathematical analysis of the three measures reveals that only the first, which captures the information content of the ratings, possesses properties appropriate to a refinement metric. Finally, we analyse refinement in real-world grant-review data, finding evidence that overall merit scores are more refined than criterion scores.

我们提出了一种新的度量标准,用于评估给定尺度上单个评分者的一组评分的信息性。当评分者在同一尺度上对许多可比较的项目进行评分时,就像在招聘、大学录取和同行评议中发生的那样,这种评估是有意义的。我们的论述以同行评议为背景,其中涉及单变量和多变量基数评级。我们利用这一背景来激发一组评级的改进的信息理论措施-熵改进-以及两个次要措施。对这三个度量的数学分析表明,只有第一个度量(捕获评级的信息内容)具有适合于细化度量的属性。最后,我们分析了现实世界拨款审查数据的细化,发现总体绩效分数比标准分数更细化的证据。
{"title":"Refinement: Measuring informativeness of ratings in the absence of a gold standard","authors":"Sheridan Grant,&nbsp;Marina Meilă,&nbsp;Elena Erosheva,&nbsp;Carole Lee","doi":"10.1111/bmsp.12268","DOIUrl":"10.1111/bmsp.12268","url":null,"abstract":"<p>We propose a new metric for evaluating the informativeness of a set of ratings from a single rater on a given scale. Such evaluations are of interest when raters rate numerous comparable items on the same scale, as occurs in hiring, college admissions, and peer review. Our exposition takes the context of peer review, which involves univariate and multivariate cardinal ratings. We draw on this context to motivate an information-theoretic measure of the <i>refinement</i> of a set of ratings – entropic refinement – as well as two secondary measures. A mathematical analysis of the three measures reveals that only the first, which captures the information content of the ratings, possesses properties appropriate to a refinement metric. Finally, we analyse refinement in real-world grant-review data, finding evidence that overall merit scores are more refined than criterion scores.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 3","pages":"593-615"},"PeriodicalIF":2.6,"publicationDate":"2022-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10860001","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
The evidence interval and the Bayesian evidence value: On a unified theory for Bayesian hypothesis testing and interval estimation 证据区间与贝叶斯证据值:贝叶斯假设检验与区间估计的统一理论
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-03-01 DOI: 10.1111/bmsp.12267
Riko Kelter

Interval estimation is one of the most frequently used methods in statistical science, employed to provide a range of credible values a parameter is located in after taking into account the uncertainty in the data. However, while this interpretation only holds for Bayesian interval estimates, these suffer from two problems. First, Bayesian interval estimates can include values which have not been corroborated by observing the data. Second, Bayesian interval estimates and hypothesis tests can yield contradictory conclusions. In this paper a new theory for Bayesian hypothesis testing and interval estimation is presented. A new interval estimate is proposed, the Bayesian evidence interval, which is inspired by the Pereira–Stern theory of the full Bayesian significance test (FBST). It is shown that the evidence interval is a generalization of existing Bayesian interval estimates, that it solves the problems of standard Bayesian interval estimates and that it unifies Bayesian hypothesis testing and parameter estimation. The Bayesian evidence value is introduced, which quantifies the evidence for the (interval) null and alternative hypothesis. Based on the evidence interval and the evidence value, the (full) Bayesian evidence test (FBET) is proposed as a new, model-independent Bayesian hypothesis test. Additionally, a decision rule for hypothesis testing is derived which shows the relationship to a widely used decision rule based on the region of practical equivalence and Bayesian highest posterior density intervals and to the e-value in the FBST. In summary, the proposed method is universally applicable, computationally efficient, and while the evidence interval can be seen as an extension of existing Bayesian interval estimates, the FBET is a generalization of the FBST and contains it as a special case. Together, the theory developed provides a unification of Bayesian hypothesis testing and interval estimation and is made available in the R package fbst.

区间估计是统计科学中最常用的方法之一,用于在考虑数据的不确定性后提供参数所处的可信值范围。然而,虽然这种解释只适用于贝叶斯区间估计,但它们存在两个问题。首先,贝叶斯区间估计可以包括未被观测数据证实的值。其次,贝叶斯区间估计和假设检验可能会得出相互矛盾的结论。本文提出了一种新的贝叶斯假设检验和区间估计理论。受全贝叶斯显著性检验(FBST)的Pereira-Stern理论的启发,提出了一种新的区间估计——贝叶斯证据区间。证明证据区间是现有贝叶斯区间估计的泛化,解决了标准贝叶斯区间估计的问题,将贝叶斯假设检验和参数估计统一起来。引入贝叶斯证据值,量化(区间)零假设和备择假设的证据。基于证据区间和证据值,提出了一种新的、与模型无关的贝叶斯假设检验(FBET)。此外,推导了假设检验的决策规则,该规则显示了基于实际等价区域和贝叶斯最高后验密度区间的广泛使用的决策规则与FBST中的e值的关系。综上所述,该方法普遍适用,计算效率高,而证据区间可以看作是现有贝叶斯区间估计的扩展,而FBET是对FBST的推广,并将其作为一种特殊情况。总之,所开发的理论提供了贝叶斯假设检验和区间估计的统一,并在R包fbst中提供。
{"title":"The evidence interval and the Bayesian evidence value: On a unified theory for Bayesian hypothesis testing and interval estimation","authors":"Riko Kelter","doi":"10.1111/bmsp.12267","DOIUrl":"10.1111/bmsp.12267","url":null,"abstract":"<p>Interval estimation is one of the most frequently used methods in statistical science, employed to provide a range of credible values a parameter is located in after taking into account the uncertainty in the data. However, while this interpretation only holds for Bayesian interval estimates, these suffer from two problems. First, Bayesian interval estimates can include values which have not been corroborated by observing the data. Second, Bayesian interval estimates and hypothesis tests can yield contradictory conclusions. In this paper a new theory for Bayesian hypothesis testing and interval estimation is presented. A new interval estimate is proposed, the Bayesian <i>evidence interval</i>, which is inspired by the Pereira–Stern theory of the full Bayesian significance test (FBST). It is shown that the evidence interval is a generalization of existing Bayesian interval estimates, that it solves the problems of standard Bayesian interval estimates and that it unifies Bayesian hypothesis testing and parameter estimation. The Bayesian evidence value is introduced, which quantifies the evidence for the (interval) null and alternative hypothesis. Based on the evidence interval and the evidence value, the (full) Bayesian evidence test (FBET) is proposed as a new, model-independent Bayesian hypothesis test. Additionally, a decision rule for hypothesis testing is derived which shows the relationship to a widely used decision rule based on the region of practical equivalence and Bayesian highest posterior density intervals and to the e-value in the FBST. In summary, the proposed method is universally applicable, computationally efficient, and while the evidence interval can be seen as an extension of existing Bayesian interval estimates, the FBET is a generalization of the FBST and contains it as a special case. Together, the theory developed provides a unification of Bayesian hypothesis testing and interval estimation and is made available in the R package <i>fbst</i>.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 3","pages":"550-592"},"PeriodicalIF":2.6,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12267","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"33489369","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Reliability coefficients for multiple group item response theory models 多群体项目反应理论模型的信度系数
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-03-01 DOI: 10.1111/bmsp.12269
Björn Andersson, Hao Luo, Kseniia Marcq

Reliability of scores from psychological or educational assessments provides important information regarding the precision of measurement. The reliability of scores is however population dependent and may vary across groups. In item response theory, this population dependence can be attributed to differential item functioning or to differences in the latent distributions between groups and needs to be accounted for when estimating the reliability of scores for different groups. Here, we introduce group-specific and overall reliability coefficients for sum scores and maximum likelihood ability estimates defined by a multiple group item response theory model. We derive confidence intervals using asymptotic theory and evaluate the empirical properties of estimators and the confidence intervals in a simulation study. The results show that the estimators are largely unbiased and that the confidence intervals are accurate with moderately large sample sizes. We exemplify the approach with the Montreal Cognitive Assessment (MoCA) in two groups defined by education level and give recommendations for applied work.

心理或教育评估分数的可靠性提供了有关测量精度的重要信息。然而,分数的可靠性依赖于人群,并且可能在不同的群体中有所不同。在项目反应理论中,这种群体依赖可以归因于不同的项目功能或群体之间潜在分布的差异,并且在估计不同群体得分的可靠性时需要考虑到这一点。在这里,我们引入群体特定和整体信度系数的总和得分和最大似然能力估计由多群体项目反应理论模型定义。我们利用渐近理论推导置信区间,并在模拟研究中评估了估计量和置信区间的经验性质。结果表明,估计量在很大程度上是无偏的,并且在中等规模的样本量下,置信区间是准确的。我们以蒙特利尔认知评估(MoCA)为例,根据教育水平划分了两组,并给出了应用工作的建议。
{"title":"Reliability coefficients for multiple group item response theory models","authors":"Björn Andersson,&nbsp;Hao Luo,&nbsp;Kseniia Marcq","doi":"10.1111/bmsp.12269","DOIUrl":"10.1111/bmsp.12269","url":null,"abstract":"<p>Reliability of scores from psychological or educational assessments provides important information regarding the precision of measurement. The reliability of scores is however population dependent and may vary across groups. In item response theory, this population dependence can be attributed to differential item functioning or to differences in the latent distributions between groups and needs to be accounted for when estimating the reliability of scores for different groups. Here, we introduce group-specific and overall reliability coefficients for sum scores and maximum likelihood ability estimates defined by a multiple group item response theory model. We derive confidence intervals using asymptotic theory and evaluate the empirical properties of estimators and the confidence intervals in a simulation study. The results show that the estimators are largely unbiased and that the confidence intervals are accurate with moderately large sample sizes. We exemplify the approach with the Montreal Cognitive Assessment (MoCA) in two groups defined by education level and give recommendations for applied work.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 2","pages":"395-410"},"PeriodicalIF":2.6,"publicationDate":"2022-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12269","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127811354","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Remarkable properties for diagnostics and inference of ranking data modelling 排序数据建模诊断和推理的显著特性
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-02-07 DOI: 10.1111/bmsp.12260
Cristina Mollica, Luca Tardella

The Plackett-Luce model (PL) for ranked data assumes the forward order of the ranking process. This hypothesis postulates that the ranking process of the items is carried out by sequentially assigning the positions from the top (most liked) to the bottom (least liked) alternative. This assumption has been recently relaxed with the Extended Plackett-Luce model (EPL) through the introduction of the discrete reference order parameter, describing the rank attribution path. By starting from two formal properties of the EPL, the former related to the inverse ordering of the item probabilities at the first and last stage of the ranking process and the latter well-known as independence of irrelevant alternatives (or Luce's choice axiom), we derive novel diagnostic tools for testing the appropriateness of the EPL assumption as the actual sampling distribution of the observed rankings. These diagnostic tools can help uncovering possible idiosyncratic paths in the sequential choice process. Besides contributing to fill the gap of goodness-of-fit methods for the family of multistage models, we also show how one of the two statistics can be conveniently exploited to construct a heuristic method, that surrogates the maximum likelihood approach for inferring the underlying reference order parameter. The relative performance of the proposals, compared with more conventional approaches, is illustrated by means of extensive simulation studies.

排序数据的Plackett-Luce模型(PL)采用排序过程的前向顺序。这个假设假设项目的排名过程是通过从顶部(最受欢迎)到底部(最不受欢迎)的顺序分配位置来执行的。最近,扩展Plackett-Luce模型(EPL)通过引入离散参考顺序参数来描述等级归因路径,从而放宽了这一假设。从EPL的两个正式性质出发,前者与排名过程的第一和最后阶段的项目概率的逆排序有关,后者被称为无关选项的独立性(或Luce选择公理),我们推导出新的诊断工具,用于测试EPL假设作为观察到的排名的实际抽样分布的适当性。这些诊断工具可以帮助发现顺序选择过程中可能的特殊路径。除了有助于填补多阶段模型家族的拟合优度方法的空白之外,我们还展示了如何方便地利用两种统计量中的一种来构建启发式方法,该方法替代最大似然方法来推断潜在的参考顺序参数。通过大量的仿真研究,说明了这些方法的相对性能,并与更传统的方法进行了比较。
{"title":"Remarkable properties for diagnostics and inference of ranking data modelling","authors":"Cristina Mollica,&nbsp;Luca Tardella","doi":"10.1111/bmsp.12260","DOIUrl":"10.1111/bmsp.12260","url":null,"abstract":"<p>The Plackett-Luce model (PL) for ranked data assumes the forward order of the ranking process. This hypothesis postulates that the ranking process of the items is carried out by sequentially assigning the positions from the top (most liked) to the bottom (least liked) alternative. This assumption has been recently relaxed with the Extended Plackett-Luce model (EPL) through the introduction of the discrete reference order parameter, describing the rank attribution path. By starting from two formal properties of the EPL, the former related to the inverse ordering of the item probabilities at the first and last stage of the ranking process and the latter well-known as independence of irrelevant alternatives (or Luce's choice axiom), we derive novel diagnostic tools for testing the appropriateness of the EPL assumption as the actual sampling distribution of the observed rankings. These diagnostic tools can help uncovering possible idiosyncratic paths in the sequential choice process. Besides contributing to fill the gap of goodness-of-fit methods for the family of multistage models, we also show how one of the two statistics can be conveniently exploited to construct a heuristic method, that surrogates the maximum likelihood approach for inferring the underlying reference order parameter. The relative performance of the proposals, compared with more conventional approaches, is illustrated by means of extensive simulation studies.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 2","pages":"334-362"},"PeriodicalIF":2.6,"publicationDate":"2022-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/a3/fd/BMSP-75-334.PMC9305251.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39898673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Q statistic with constant weights for standardized mean difference 关于标准平均差的定权Q统计量
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-30 DOI: 10.1111/bmsp.12263
Ilyas Bakbergenuly, David C. Hoaglin, Elena Kulinskaya
<p>Cochran's <i>Q</i> statistic is routinely used for testing heterogeneity in meta-analysis. Its expected value is also used in several popular estimators of the between-study variance, <math> <msup> <mi>τ</mi> <mn>2</mn> </msup></math>. Those applications generally have not considered the implications of its use of estimated variances in the inverse-variance weights. Importantly, those weights make approximating the distribution of <i>Q</i> (more explicitly, <math> <msub> <mi>Q</mi> <mtext>IV</mtext> </msub></math>) rather complicated. As an alternative, we investigate a new <i>Q</i> statistic, <math> <msub> <mi>Q</mi> <mi>F</mi> </msub></math>, whose constant weights use only the studies' effective sample sizes. For the standardized mean difference as the measure of effect, we study, by simulation, approximations to distributions of <math> <msub> <mi>Q</mi> <mtext>IV</mtext> </msub></math> and <math> <msub> <mi>Q</mi> <mi>F</mi> </msub></math>, as the basis for tests of heterogeneity and for new point and interval estimators of <math> <msup> <mi>τ</mi> <mn>2</mn> </msup></math>. These include new DerSimonian–Kacker-type moment estimators based on the first moment of <math> <msub> <mi>Q</mi> <mi>F</mi> </msub></math>, and novel median-unbiased estimators. The results show that: an approximation based on an algorithm of Farebrother follows both the null and the alternative distributions of <math> <msub> <mi>Q</mi> <mi>F</mi> </msub></math> reasonably well, whereas the usual chi-squared approximation for the null distribution of <math> <msub> <mi>Q</mi> <mtext>IV</mtext> </msub></math> and the Biggerstaff–Jackson approximation to its alternative distribution are poor; in estimating <math> <msup> <mi>τ</mi> <mn>2</mn> </msup></math>, our moment estimator based on <math> <msub> <mi>Q</mi> <mi>F</mi> </msub></math> is almost unbiased, the Mandel – Paule estimator has some negative bias in some situations, and the DerSimonian–Laird and restricted maximum likelihood estimators have considerable negative bias; and all 95% interval estimators have coverage that is too high when <math> <mrow> <msup> <mi>τ</mi> <mn>2</mn> </msup> <mo>
在荟萃分析中,Cochran’s Q统计量通常用于检验异质性。它的期望值也用于研究间方差τ 2的几个常用估计。这些应用通常没有考虑在反方差权重中使用估计方差的含义。重要的是,这些权重使得近似Q的分布(更明确地说,是Q IV)变得相当复杂。作为一种选择,我们研究了一个新的Q统计量,Q F,其常数权重仅使用研究的有效样本量。对于作为效应度量的标准化平均差,我们通过模拟研究了Q IV和Q F分布的近似值,作为检验异质性和τ 2的新点和区间估计的基础。其中包括基于Q F的一阶矩的新的dersimonan - kacker型矩估计,以及新的中位数无偏估计。结果表明:基于Farebrother算法的近似值较好地遵循了Q F的零分布和备选分布,而Q IV的零分布的常用卡方近似值和备选分布的Biggerstaff-Jackson近似值较差;在估计τ 2时,基于Q F的矩估计几乎是无偏的,Mandel - Paule估计在某些情况下有负偏,dersimonan - laird估计和限制极大似然估计有相当大的负偏;当τ 2 = 0时,所有95%区间估计的覆盖率都太高,但在其他情况下,q -剖面区间表现得非常好。
{"title":"On the Q statistic with constant weights for standardized mean difference","authors":"Ilyas Bakbergenuly,&nbsp;David C. Hoaglin,&nbsp;Elena Kulinskaya","doi":"10.1111/bmsp.12263","DOIUrl":"10.1111/bmsp.12263","url":null,"abstract":"&lt;p&gt;Cochran's &lt;i&gt;Q&lt;/i&gt; statistic is routinely used for testing heterogeneity in meta-analysis. Its expected value is also used in several popular estimators of the between-study variance, &lt;math&gt;\u0000 &lt;msup&gt;\u0000 &lt;mi&gt;τ&lt;/mi&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;/msup&gt;&lt;/math&gt;. Those applications generally have not considered the implications of its use of estimated variances in the inverse-variance weights. Importantly, those weights make approximating the distribution of &lt;i&gt;Q&lt;/i&gt; (more explicitly, &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mtext&gt;IV&lt;/mtext&gt;\u0000 &lt;/msub&gt;&lt;/math&gt;) rather complicated. As an alternative, we investigate a new &lt;i&gt;Q&lt;/i&gt; statistic, &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mi&gt;F&lt;/mi&gt;\u0000 &lt;/msub&gt;&lt;/math&gt;, whose constant weights use only the studies' effective sample sizes. For the standardized mean difference as the measure of effect, we study, by simulation, approximations to distributions of &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mtext&gt;IV&lt;/mtext&gt;\u0000 &lt;/msub&gt;&lt;/math&gt; and &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mi&gt;F&lt;/mi&gt;\u0000 &lt;/msub&gt;&lt;/math&gt;, as the basis for tests of heterogeneity and for new point and interval estimators of &lt;math&gt;\u0000 &lt;msup&gt;\u0000 &lt;mi&gt;τ&lt;/mi&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;/msup&gt;&lt;/math&gt;. These include new DerSimonian–Kacker-type moment estimators based on the first moment of &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mi&gt;F&lt;/mi&gt;\u0000 &lt;/msub&gt;&lt;/math&gt;, and novel median-unbiased estimators. The results show that: an approximation based on an algorithm of Farebrother follows both the null and the alternative distributions of &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mi&gt;F&lt;/mi&gt;\u0000 &lt;/msub&gt;&lt;/math&gt; reasonably well, whereas the usual chi-squared approximation for the null distribution of &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mtext&gt;IV&lt;/mtext&gt;\u0000 &lt;/msub&gt;&lt;/math&gt; and the Biggerstaff–Jackson approximation to its alternative distribution are poor; in estimating &lt;math&gt;\u0000 &lt;msup&gt;\u0000 &lt;mi&gt;τ&lt;/mi&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;/msup&gt;&lt;/math&gt;, our moment estimator based on &lt;math&gt;\u0000 &lt;msub&gt;\u0000 &lt;mi&gt;Q&lt;/mi&gt;\u0000 &lt;mi&gt;F&lt;/mi&gt;\u0000 &lt;/msub&gt;&lt;/math&gt; is almost unbiased, the Mandel – Paule estimator has some negative bias in some situations, and the DerSimonian–Laird and restricted maximum likelihood estimators have considerable negative bias; and all 95% interval estimators have coverage that is too high when &lt;math&gt;\u0000 &lt;mrow&gt;\u0000 &lt;msup&gt;\u0000 &lt;mi&gt;τ&lt;/mi&gt;\u0000 &lt;mn&gt;2&lt;/mn&gt;\u0000 &lt;/msup&gt;\u0000 &lt;mo&gt;","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 3","pages":"444-465"},"PeriodicalIF":2.6,"publicationDate":"2022-01-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12263","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39570860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Distance-based logistic model for cross-classified categorical data 基于距离的分类数据交叉分类逻辑模型
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-25 DOI: 10.1111/bmsp.12264
José Fernando Vera

Logistic regression models are a powerful research tool for the analysis of cross-classified data in which a categorical response variable is involved. In a logistic model, the effect of a covariate refers to odds, and the simple relationship between the coefficients and the odds ratio often makes these the parameters of interest due to their easy interpretation. In this article we present a distance-based logistic model that allows a simple graphical interpretation of the association coefficients using the odds ratio in a contingency table. Two configurations are estimated, one for the rows and one for the columns, as the categories of a polytomous predictor and a nominal response variable respectively, such that the local odds ratio and the distances between the predictor and response categories are inversely related. The associations in terms of the odds ratios, or the ratios of the odds to their geometric means, are interpreted through distances for the most common coding schemes of the predictor variable, and the relationship between the distances related to different codings is investigated in its full dimension. The performance of the estimation procedure is analysed with a Monte Carlo experiment. The interpretation of the model and its performance, as well as its comparison with a two-step procedure involving first a logistic regression and then unfolding, is illustrated using real data sets.

逻辑回归模型是一种强大的研究工具,用于分析涉及分类响应变量的交叉分类数据。在逻辑模型中,协变量的影响指的是几率,而系数和几率比之间的简单关系往往使这些参数成为感兴趣的参数,因为它们易于解释。在本文中,我们提出了一个基于距离的逻辑模型,该模型允许使用列联表中的比值比对关联系数进行简单的图形解释。估计了两种配置,一种用于行,另一种用于列,分别作为多聚预测器和名义响应变量的类别,使得局部比值比和预测器和响应类别之间的距离呈负相关。在比值比方面的关联,或比值与其几何均值的比值,通过预测变量最常见编码方案的距离来解释,并且在其全维度上研究与不同编码相关的距离之间的关系。通过蒙特卡罗实验对估计过程的性能进行了分析。模型的解释和它的性能,以及它与两步过程的比较,首先涉及逻辑回归,然后展开,用实际数据集说明。
{"title":"Distance-based logistic model for cross-classified categorical data","authors":"José Fernando Vera","doi":"10.1111/bmsp.12264","DOIUrl":"10.1111/bmsp.12264","url":null,"abstract":"<p>Logistic regression models are a powerful research tool for the analysis of cross-classified data in which a categorical response variable is involved. In a logistic model, the effect of a covariate refers to odds, and the simple relationship between the coefficients and the odds ratio often makes these the parameters of interest due to their easy interpretation. In this article we present a distance-based logistic model that allows a simple graphical interpretation of the association coefficients using the odds ratio in a contingency table. Two configurations are estimated, one for the rows and one for the columns, as the categories of a polytomous predictor and a nominal response variable respectively, such that the local odds ratio and the distances between the predictor and response categories are inversely related. The associations in terms of the odds ratios, or the ratios of the odds to their geometric means, are interpreted through distances for the most common coding schemes of the predictor variable, and the relationship between the distances related to different codings is investigated in its full dimension. The performance of the estimation procedure is analysed with a Monte Carlo experiment. The interpretation of the model and its performance, as well as its comparison with a two-step procedure involving first a logistic regression and then unfolding, is illustrated using real data sets.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 3","pages":"466-492"},"PeriodicalIF":2.6,"publicationDate":"2022-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12264","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39861347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Editorial acknowledgement 社论承认
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2022-01-12 DOI: 10.1111/bmsp.12262
{"title":"Editorial acknowledgement","authors":"","doi":"10.1111/bmsp.12262","DOIUrl":"https://doi.org/10.1111/bmsp.12262","url":null,"abstract":"","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 1","pages":"198-199"},"PeriodicalIF":2.6,"publicationDate":"2022-01-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://bpspsychub.onlinelibrary.wiley.com/doi/epdf/10.1111/bmsp.12262","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"137822807","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Latent variable selection in multidimensional item response theory models using the expectation model selection algorithm 基于期望模型选择算法的多维项目反应理论模型中潜在变量选择
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2021-12-17 DOI: 10.1111/bmsp.12261
Ping-Feng Xu, Laixu Shang, Qian-Zhen Zheng, Na Shan, Man-Lai Tang

The aim of latent variable selection in multidimensional item response theory (MIRT) models is to identify latent traits probed by test items of a multidimensional test. In this paper the expectation model selection (EMS) algorithm proposed by Jiang et al. (2015) is applied to minimize the Bayesian information criterion (BIC) for latent variable selection in MIRT models with a known number of latent traits. Under mild assumptions, we prove the numerical convergence of the EMS algorithm for model selection by minimizing the BIC of observed data in the presence of missing data. For the identification of MIRT models, we assume that the variances of all latent traits are unity and each latent trait has an item that is only related to it. Under this identifiability assumption, the convergence of the EMS algorithm for latent variable selection in the multidimensional two-parameter logistic (M2PL) models can be verified. We give an efficient implementation of the EMS for the M2PL models. Simulation studies show that the EMS outperforms the EM-based L1 regularization in terms of correctly selected latent variables and computation time. The EMS algorithm is applied to a real data set related to the Eysenck Personality Questionnaire.

在多维项目反应理论模型中,潜在变量选择的目的是识别多维测试项目所探测的潜在特征。本文采用Jiang等人(2015)提出的期望模型选择(EMS)算法,在已知潜在性状数量的MIRT模型中最小化潜在变量选择的贝叶斯信息准则(BIC)。在温和的假设条件下,我们通过最小化观测数据的BIC证明了EMS算法在存在缺失数据时的模型选择的数值收敛性。为了识别MIRT模型,我们假设所有潜在性状的方差是统一的,每个潜在性状都有一个只与它相关的项目。在此可辨识性假设下,验证了EMS算法在多维双参数logistic模型中隐变量选择的收敛性。我们给出了M2PL模型的EMS的有效实现。仿真研究表明,在正确选择潜在变量和计算时间方面,EMS优于基于em的L1正则化。将EMS算法应用于与艾森克人格问卷相关的真实数据集。
{"title":"Latent variable selection in multidimensional item response theory models using the expectation model selection algorithm","authors":"Ping-Feng Xu,&nbsp;Laixu Shang,&nbsp;Qian-Zhen Zheng,&nbsp;Na Shan,&nbsp;Man-Lai Tang","doi":"10.1111/bmsp.12261","DOIUrl":"10.1111/bmsp.12261","url":null,"abstract":"<p>The aim of latent variable selection in multidimensional item response theory (MIRT) models is to identify latent traits probed by test items of a multidimensional test. In this paper the expectation model selection (EMS) algorithm proposed by Jiang et al. (2015) is applied to minimize the Bayesian information criterion (BIC) for latent variable selection in MIRT models with a known number of latent traits. Under mild assumptions, we prove the numerical convergence of the EMS algorithm for model selection by minimizing the BIC of observed data in the presence of missing data. For the identification of MIRT models, we assume that the variances of all latent traits are unity and each latent trait has an item that is only related to it. Under this identifiability assumption, the convergence of the EMS algorithm for latent variable selection in the multidimensional two-parameter logistic (M2PL) models can be verified. We give an efficient implementation of the EMS for the M2PL models. Simulation studies show that the EMS outperforms the EM-based <i>L</i><sub>1</sub> regularization in terms of correctly selected latent variables and computation time. The EMS algorithm is applied to a real data set related to the Eysenck Personality Questionnaire.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 2","pages":"363-394"},"PeriodicalIF":2.6,"publicationDate":"2021-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39847154","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Computing the real solutions of Fleishman's equations for simulating non-normal data 计算模拟非正态数据的Fleishman方程的实解
IF 2.6 3区 心理学 Q3 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2021-11-15 DOI: 10.1111/bmsp.12259
Nathaniel E. Helwig

Fleishman's power method is frequently used to simulate non-normal data with a desired skewness and kurtosis. Fleishman's method requires solving a system of nonlinear equations to find the third-order polynomial weights that transform a standard normal variable into a non-normal variable with desired moments. Most users of the power method seem unaware that Fleishman's equations have multiple solutions for typical combinations of skewness and kurtosis. Furthermore, researchers lack a simple method for exploring the multiple solutions of Fleishman's equations, so most applications only consider a single solution. In this paper, we propose novel methods for finding all real-valued solutions of Fleishman's equations. Additionally, we characterize the solutions in terms of differences in higher order moments. Our theoretical analysis of the power method reveals that there typically exists two solutions of Fleishman's equations that have noteworthy differences in higher order moments. Using simulated examples, we demonstrate that these differences can have remarkable effects on the shape of the non-normal distribution, as well as the sampling distributions of statistics calculated from the data. Some considerations for choosing a solution are discussed, and some recommendations for improved reporting standards are provided.

弗莱什曼幂方法经常用于模拟具有期望偏度和峰度的非正态数据。Fleishman的方法需要求解一个非线性方程组来找到将标准正态变量转换为具有期望矩的非正态变量的三阶多项式权重。大多数幂方法的使用者似乎没有意识到,对于典型的偏度和峰度组合,Fleishman方程有多个解。此外,研究人员缺乏一种简单的方法来探索Fleishman方程的多个解,因此大多数应用只考虑单个解。本文提出了一种求Fleishman方程全实值解的新方法。此外,我们用高阶矩的差异来描述解。我们对幂方法的理论分析表明,通常存在两个具有显著高阶矩差异的Fleishman方程解。通过模拟实例,我们证明了这些差异会对非正态分布的形状以及从数据计算的统计量的抽样分布产生显着影响。讨论了选择解决方案的一些考虑因素,并提供了改进报告标准的一些建议。
{"title":"Computing the real solutions of Fleishman's equations for simulating non-normal data","authors":"Nathaniel E. Helwig","doi":"10.1111/bmsp.12259","DOIUrl":"10.1111/bmsp.12259","url":null,"abstract":"<p>Fleishman's power method is frequently used to simulate non-normal data with a desired skewness and kurtosis. Fleishman's method requires solving a system of nonlinear equations to find the third-order polynomial weights that transform a standard normal variable into a non-normal variable with desired moments. Most users of the power method seem unaware that Fleishman's equations have multiple solutions for typical combinations of skewness and kurtosis. Furthermore, researchers lack a simple method for exploring the multiple solutions of Fleishman's equations, so most applications only consider a single solution. In this paper, we propose novel methods for finding all real-valued solutions of Fleishman's equations. Additionally, we characterize the solutions in terms of differences in higher order moments. Our theoretical analysis of the power method reveals that there typically exists two solutions of Fleishman's equations that have noteworthy differences in higher order moments. Using simulated examples, we demonstrate that these differences can have remarkable effects on the shape of the non-normal distribution, as well as the sampling distributions of statistics calculated from the data. Some considerations for choosing a solution are discussed, and some recommendations for improved reporting standards are provided.</p>","PeriodicalId":55322,"journal":{"name":"British Journal of Mathematical & Statistical Psychology","volume":"75 2","pages":"319-333"},"PeriodicalIF":2.6,"publicationDate":"2021-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"39892569","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
British Journal of Mathematical & Statistical Psychology
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1