首页 > 最新文献

Psychological methods最新文献

英文 中文
Random item slope regression: An alternative measurement model that accounts for both similarities and differences in association with individual items. 随机项目斜率回归:一种替代的测量模型,它可以解释与单个项目相关的相似性和差异性。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-27 DOI: 10.1037/met0000587
Ed Donnellan, Satoshi Usami, Kou Murayama

In psychology, researchers often predict a dependent variable (DV) consisting of multiple measurements (e.g., scale items measuring a concept). To analyze the data, researchers typically aggregate (sum/average) scores across items and use this as a DV. Alternatively, they may define the DV as a common factor using structural equation modeling. However, both approaches neglect the possibility that an independent variable (IV) may have different relationships to individual items. This variance in individual item slopes arises because items are randomly sampled from an infinite pool of items reflecting the construct that the scale purports to measure. Here, we offer a mixed-effects model called random item slope regression, which accounts for both similarities and differences of individual item associations. Critically, we argue that random item slope regression poses an alternative measurement model to common factor models prevalent in psychology. Unlike these models, the proposed model supposes no latent constructs and instead assumes that individual items have direct causal relationships with the IV. Such operationalization is especially useful when researchers want to assess a broad construct with heterogeneous items. Using mathematical proof and simulation, we demonstrate that random item slopes cause inflation of Type I error when not accounted for, particularly when the sample size (number of participants) is large. In real-world data (n = 564 participants) using commonly used surveys and two reaction time tasks, we demonstrate that random item slopes are present at problematic levels. We further demonstrate that common statistical indices are not sufficient to diagnose the presence of random item slopes. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

在心理学中,研究人员经常预测一个由多个测量组成的因变量(DV)(例如,测量一个概念的量表项目)。为了分析数据,研究人员通常会汇总(总和/平均值)各个项目的分数,并将其用作DV。或者,他们可以使用结构方程建模将DV定义为一个公共因素。然而,这两种方法都忽略了自变量(IV)可能与单个项目有不同关系的可能性。单个项目斜率的差异之所以产生,是因为项目是从反映量表旨在测量的结构的无限项目池中随机抽样的。在这里,我们提供了一种混合效应模型,称为随机项目斜率回归,它同时考虑了单个项目关联的相似性和差异性。关键的是,我们认为随机项目斜率回归提出了一种替代测量模型,以共同因素模型普遍存在于心理学。与这些模型不同,所提出的模型不假设潜在构念,而是假设单个项目与IV有直接的因果关系。当研究人员想要评估具有异质项目的广泛构念时,这种操作化特别有用。使用数学证明和模拟,我们证明了随机项目斜率在未考虑的情况下会导致I型误差膨胀,特别是当样本量(参与者数量)很大时。在使用常用调查和两个反应时间任务的真实数据(n = 564名参与者)中,我们证明了随机项目斜率存在于问题水平。我们进一步证明了常用的统计指标不足以诊断随机项目斜率的存在。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Random item slope regression: An alternative measurement model that accounts for both similarities and differences in association with individual items.","authors":"Ed Donnellan,&nbsp;Satoshi Usami,&nbsp;Kou Murayama","doi":"10.1037/met0000587","DOIUrl":"https://doi.org/10.1037/met0000587","url":null,"abstract":"<p><p>In psychology, researchers often predict a dependent variable (DV) consisting of multiple measurements (e.g., scale items measuring a concept). To analyze the data, researchers typically aggregate (sum/average) scores across items and use this as a DV. Alternatively, they may define the DV as a common factor using structural equation modeling. However, both approaches neglect the possibility that an independent variable (IV) may have different relationships to individual items. This variance in individual item slopes arises because items are randomly sampled from an infinite pool of items reflecting the construct that the scale purports to measure. Here, we offer a mixed-effects model called <i>random item slope regression,</i> which accounts for both similarities and differences of individual item associations. Critically, we argue that random item slope regression poses an alternative measurement model to common factor models prevalent in psychology. Unlike these models, the proposed model supposes no latent constructs and instead assumes that individual items have direct causal relationships with the IV. Such operationalization is especially useful when researchers want to assess a broad construct with heterogeneous items. Using mathematical proof and simulation, we demonstrate that random item slopes cause inflation of Type I error when not accounted for, particularly when the sample size (number of participants) is large. In real-world data (<i>n</i> = 564 participants) using commonly used surveys and two reaction time tasks, we demonstrate that random item slopes are present at problematic levels. We further demonstrate that common statistical indices are not sufficient to diagnose the presence of random item slopes. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10259360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correspondence measures for assessing replication success. 用于评估复制成功的对应度量。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-27 DOI: 10.1037/met0000597
Peter M Steiner, Patrick Sheehan, Vivian C Wong

Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

鉴于最近的证据对社会和行为科学结果的可复制性提出了挑战,在比较不同研究的效果估计时,确定复制成功的适当措施提出了关键问题。争论的焦点在于,关于复制成功与否的结论往往取决于用于评估结果一致性的测量方法。尽管选择适当的措施很重要,但对于应该使用哪些措施仍然没有广泛的共识。本文通过正式描述用于评估复制成功的最常用度量,并根据它们的复制概率(即给定特定研究设置的获得复制成功的概率)比较它们在不同上下文中的性能,来解决这些问题。这些措施可以被广泛地描述为基于结论的方法,评估两个独立研究关于效应存在的结论的一致性,以及基于距离的方法,测试两个效应估计的显着差异或等效性。我们还引入了一种评估复制成功的新方法,称为对应测试,它在同一框架中结合了差异测试和等效测试。为了帮助研究人员计划前瞻性的复制工作,我们提供了功率计算的封闭公式,可用于确定每个研究的最小可检测效应大小(从而确定样本量),从而可以实现预定的最小复制概率。最后,我们使用开放科学协作(2015)的复制数据集来证明关于复制成功的结论在多大程度上取决于所选择的对应度量。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Correspondence measures for assessing replication success.","authors":"Peter M Steiner,&nbsp;Patrick Sheehan,&nbsp;Vivian C Wong","doi":"10.1037/met0000597","DOIUrl":"https://doi.org/10.1037/met0000597","url":null,"abstract":"<p><p>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10259359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On estimating the frequency of a target behavior from time-constrained yes/no survey questions: A parametric approach based on the Poisson process. 从时间约束的是/否调查问题中估计目标行为的频率:基于泊松过程的参数化方法。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-20 DOI: 10.1037/met0000588
Benedikt Iberl, Rolf Ulrich

We propose a novel method to analyze time-constrained yes/no questions about a target behavior (e.g., "Did you take sleeping pills during the last 12 months?"). A drawback of these questions is that the relative frequency of answering these questions with "yes" does not allow one to draw definite conclusions about the frequency of the target behavior (i.e., how often sleeping pills were taken) nor about the prevalence of trait carriers (i.e., percentage of people that take sleeping pills). Here we show how this information can be extracted from the results of such questions employing a prevalence curve and a Poisson model. The applicability of the method was evaluated with a survey on everyday behavior, which revealed plausible results and reasonable model fit. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

我们提出了一种新的方法来分析关于目标行为的有时间限制的是/否问题(例如,“你在过去的12个月里吃过安眠药吗?”)。这些问题的一个缺点是,回答“是”的相对频率不能让一个人对目标行为的频率(即,服用安眠药的频率)或特质携带者的流行程度(即,服用安眠药的人的百分比)得出明确的结论。在这里,我们展示了如何利用流行曲线和泊松模型从这些问题的结果中提取这些信息。通过对日常行为的调查评估了该方法的适用性,结果表明该方法的结果合理,模型拟合合理。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"On estimating the frequency of a target behavior from time-constrained yes/no survey questions: A parametric approach based on the Poisson process.","authors":"Benedikt Iberl,&nbsp;Rolf Ulrich","doi":"10.1037/met0000588","DOIUrl":"https://doi.org/10.1037/met0000588","url":null,"abstract":"<p><p>We propose a novel method to analyze time-constrained yes/no questions about a target behavior (e.g., \"Did you take sleeping pills during the last 12 months?\"). A drawback of these questions is that the relative frequency of answering these questions with \"yes\" does not allow one to draw definite conclusions about the frequency of the target behavior (i.e., how often sleeping pills were taken) nor about the prevalence of trait carriers (i.e., percentage of people that take sleeping pills). Here we show how this information can be extracted from the results of such questions employing a prevalence curve and a Poisson model. The applicability of the method was evaluated with a survey on everyday behavior, which revealed plausible results and reasonable model fit. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9838249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing predictive power by unamalgamating multi-item scales. 通过不合并多项目量表来增强预测能力。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-20 DOI: 10.1037/met0000599
David Trafimow, Michael R Hyman, Alena Kostyk

The generally small but touted as "statistically significant" correlation coefficients in the social sciences jeopardize theory testing and prediction. To investigate these small coefficients' underlying causes, traditional equations such as Spearman's (1904) classic attenuation formula, Cronbach's (1951) alpha, and Guilford and Fruchter's (1973) equation for the effect of additional items on a scale's predictive power are considered. These equations' implications differ regarding large interitem correlations enhancing or diminishing predictive power. Contrary to conventional practice, such correlations decrease predictive power when treating items as multi-item scale components but can increase predictive power when treating items separately. The implications are wide-ranging. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

在社会科学中,通常较小但被吹捧为“统计显著”的相关系数危及理论检验和预测。为了研究这些小系数的潜在原因,考虑了传统方程,如Spearman(1904)的经典衰减公式,Cronbach(1951)的alpha,以及Guilford和Fruchter(1973)的附加项对量表预测能力影响的方程。这些方程的含义不同于大的项目间相关性,增强或减弱预测能力。与传统做法相反,当将项目作为多项目量表组件处理时,这种相关性会降低预测能力,但当单独处理项目时,这种相关性会增加预测能力。其影响是广泛的。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Enhancing predictive power by unamalgamating multi-item scales.","authors":"David Trafimow,&nbsp;Michael R Hyman,&nbsp;Alena Kostyk","doi":"10.1037/met0000599","DOIUrl":"https://doi.org/10.1037/met0000599","url":null,"abstract":"<p><p>The generally small but touted as \"statistically significant\" correlation coefficients in the social sciences jeopardize theory testing and prediction. To investigate these small coefficients' underlying causes, traditional equations such as Spearman's (1904) classic attenuation formula, Cronbach's (1951) alpha, and Guilford and Fruchter's (1973) equation for the effect of additional items on a scale's predictive power are considered. These equations' implications differ regarding large interitem correlations enhancing or diminishing predictive power. Contrary to conventional practice, such correlations decrease predictive power when treating items as multi-item scale components but can increase predictive power when treating items separately. The implications are wide-ranging. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9838250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Demystifying omega squared: Practical guidance for effect size in common analysis of variance designs. 揭开欧米茄平方的神秘面纱:方差设计中常见分析效应大小的实用指南。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-20 DOI: 10.1037/met0000581
Antoinette D A Kroes, Jason R Finley

Omega squared (ω^2) is a measure of effect size for analysis of variance (ANOVA) designs. It is less biased than eta squared, but reported less often. This is in part due to lack of clear guidance on how to calculate it. In this paper, we discuss the logic behind effect size measures, the problem with eta squared, the history of omega squared, and why it has been underused. We then provide a user-friendly guide to omega squared and partial omega squared for ANOVA designs with fixed factors, including one-way, two-way, and three-way designs, using within-subjects factors and/or between-subjects factors. We show how to calculate omega squared using output from SPSS. We provide information on the calculation of confidence intervals. We examine the problems of nonadditivity, and intrinsic versus extrinsic factors. We argue that statistical package developers could play an important role in making the calculation of omega squared easier. Finally, we recommend that researchers report the formulas used in calculating effect sizes, include confidence intervals if possible, and include ANOVA tables in the online supplemental materials of their work. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

平方(ω^2)是方差分析(ANOVA)设计的效应大小的度量。它的偏差小于平方,但报告的频率较低。这在一定程度上是由于缺乏关于如何计算的明确指导。在本文中,我们讨论了效应大小测量背后的逻辑,平方的问题,平方的历史,以及为什么它没有得到充分利用。然后,我们为具有固定因素的方差分析设计提供了一个用户友好的omega平方和部分omega平方指南,包括单向,双向和三向设计,使用受试者内因素和/或受试者之间因素。我们展示了如何使用SPSS的输出来计算omega的平方。我们提供了计算置信区间的信息。我们研究了不可加性问题,以及内在因素与外在因素的对比。我们认为统计软件包开发人员可以在简化计算平方方面发挥重要作用。最后,我们建议研究人员报告用于计算效应量的公式,如果可能的话,包括置信区间,并在他们的工作的在线补充材料中包括方差分析表。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Demystifying omega squared: Practical guidance for effect size in common analysis of variance designs.","authors":"Antoinette D A Kroes,&nbsp;Jason R Finley","doi":"10.1037/met0000581","DOIUrl":"https://doi.org/10.1037/met0000581","url":null,"abstract":"<p><p>Omega squared (ω^2) is a measure of effect size for analysis of variance (ANOVA) designs. It is less biased than eta squared, but reported less often. This is in part due to lack of clear guidance on how to calculate it. In this paper, we discuss the logic behind effect size measures, the problem with eta squared, the history of omega squared, and why it has been underused. We then provide a user-friendly guide to omega squared and partial omega squared for ANOVA designs with fixed factors, including one-way, two-way, and three-way designs, using within-subjects factors and/or between-subjects factors. We show how to calculate omega squared using output from SPSS. We provide information on the calculation of confidence intervals. We examine the problems of nonadditivity, and intrinsic versus extrinsic factors. We argue that statistical package developers could play an important role in making the calculation of omega squared easier. Finally, we recommend that researchers report the formulas used in calculating effect sizes, include confidence intervals if possible, and include ANOVA tables in the online supplemental materials of their work. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9840882","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Random item slope regression: An alternative measurement model that accounts for both similarities and differences in association with individual items. 随机项目斜率回归:一种替代测量模型,它考虑了与单个项目相关的相似性和差异。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-17 DOI: 10.1037/met0000587.supp
E. Donnellan, S. Usami, K. Murayama
In psychology, researchers often predict a dependent variable (DV) consisting of multiple measurements (e.g., scale items measuring a concept). To analyze the data, researchers typically aggregate (sum/average) scores across items and use this as a DV. Alternatively, they may define the DV as a common factor using structural equation modeling. However, both approaches neglect the possibility that an independent variable (IV) may have different relationships to individual items. This variance in individual item slopes arises because items are randomly sampled from an infinite pool of items reflecting the construct that the scale purports to measure. Here, we offer a mixed-effects model called random item slope regression, which accounts for both similarities and differences of individual item associations. Critically, we argue that random item slope regression poses an alternative measurement model to common factor models prevalent in psychology. Unlike these models, the proposed model supposes no latent constructs and instead assumes that individual items have direct causal relationships with the IV. Such operationalization is especially useful when researchers want to assess a broad construct with heterogeneous items. Using mathematical proof and simulation, we demonstrate that random item slopes cause inflation of Type I error when not accounted for, particularly when the sample size (number of participants) is large. In real-world data (n = 564 participants) using commonly used surveys and two reaction time tasks, we demonstrate that random item slopes are present at problematic levels. We further demonstrate that common statistical indices are not sufficient to diagnose the presence of random item slopes. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
在心理学中,研究人员经常预测一个由多个测量组成的因变量(DV)(例如,测量一个概念的量表项目)。为了分析数据,研究人员通常会汇总(总和/平均值)各个项目的分数,并将其用作DV。或者,他们可以使用结构方程建模将DV定义为一个公共因素。然而,这两种方法都忽略了自变量(IV)可能与单个项目有不同关系的可能性。单个项目斜率的差异之所以产生,是因为项目是从反映量表旨在测量的结构的无限项目池中随机抽样的。在这里,我们提供了一种混合效应模型,称为随机项目斜率回归,它同时考虑了单个项目关联的相似性和差异性。关键的是,我们认为随机项目斜率回归提出了一种替代测量模型,以共同因素模型普遍存在于心理学。与这些模型不同,所提出的模型不假设潜在构念,而是假设单个项目与IV有直接的因果关系。当研究人员想要评估具有异质项目的广泛构念时,这种操作化特别有用。使用数学证明和模拟,我们证明了随机项目斜率在未考虑的情况下会导致I型误差膨胀,特别是当样本量(参与者数量)很大时。在使用常用调查和两个反应时间任务的真实数据(n = 564名参与者)中,我们证明了随机项目斜率存在于问题水平。我们进一步证明了常用的统计指标不足以诊断随机项目斜率的存在。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Random item slope regression: An alternative measurement model that accounts for both similarities and differences in association with individual items.","authors":"E. Donnellan, S. Usami, K. Murayama","doi":"10.1037/met0000587.supp","DOIUrl":"https://doi.org/10.1037/met0000587.supp","url":null,"abstract":"In psychology, researchers often predict a dependent variable (DV) consisting of multiple measurements (e.g., scale items measuring a concept). To analyze the data, researchers typically aggregate (sum/average) scores across items and use this as a DV. Alternatively, they may define the DV as a common factor using structural equation modeling. However, both approaches neglect the possibility that an independent variable (IV) may have different relationships to individual items. This variance in individual item slopes arises because items are randomly sampled from an infinite pool of items reflecting the construct that the scale purports to measure. Here, we offer a mixed-effects model called random item slope regression, which accounts for both similarities and differences of individual item associations. Critically, we argue that random item slope regression poses an alternative measurement model to common factor models prevalent in psychology. Unlike these models, the proposed model supposes no latent constructs and instead assumes that individual items have direct causal relationships with the IV. Such operationalization is especially useful when researchers want to assess a broad construct with heterogeneous items. Using mathematical proof and simulation, we demonstrate that random item slopes cause inflation of Type I error when not accounted for, particularly when the sample size (number of participants) is large. In real-world data (n = 564 participants) using commonly used surveys and two reaction time tasks, we demonstrate that random item slopes are present at problematic levels. We further demonstrate that common statistical indices are not sufficient to diagnose the presence of random item slopes. (PsycInfo Database Record (c) 2023 APA, all rights reserved).","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"48809969","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
The receiver operating characteristic area under the curve (or mean ridit) as an effect size. 曲线下的接收者工作特征面积(或平均波幅)作为效应大小。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-13 DOI: 10.1037/met0000601
Michael Smithson

Several authors have recommended adopting the receiver operator characteristic (ROC) area under the curve (AUC) or mean ridit as an effect size, arguing that it measures an important and interpretable type of effect that conventional effect-size measures do not. It is base-rate insensitive, robust to outliers, and invariant under order-preserving transformations. However, applications have been limited to group comparisons, and usually just two groups, in line with the popular interpretation of the AUC as measuring the probability that a randomly chosen case from one group will score higher on the dependent variable than a randomly chosen case from another group. This tutorial article shows that the AUC can be used as an effect size for both categorical and continuous predictors in a wide variety of general linear models, whose dependent variables may be ordinal, interval, or ratio level. Thus, the AUC is a general effect-size measure. Demonstrations in this article include linear regression, ordinal logistic regression, gamma regression, and beta regression. The online supplemental materials to this tutorial provide a survey of currently available software resources in R for the AUC and ridits, along with the code and access to the data used in the examples. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

一些作者建议采用接受者算子特征(ROC)曲线下面积(AUC)或平均ridit作为效应量,认为它测量了传统效应量测量无法测量的重要且可解释的效应类型。它对基本速率不敏感,对异常值具有鲁棒性,并且在保序变换下不变。然而,应用仅限于组比较,通常只有两个组,这符合对AUC的流行解释,即测量从一个组中随机选择的病例在因变量上的得分高于从另一个组中随机选择的病例的概率。本文展示了AUC可以用作各种一般线性模型中的分类和连续预测器的效应大小,这些模型的因变量可以是顺序、区间或比率水平。因此,AUC是一种通用的效应大小度量。本文中的演示包括线性回归、有序逻辑回归、gamma回归和beta回归。本教程的在线补充材料提供了当前可用的用于AUC和ridits的R软件资源的调查,以及示例中使用的代码和对数据的访问。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"The receiver operating characteristic area under the curve (or mean ridit) as an effect size.","authors":"Michael Smithson","doi":"10.1037/met0000601","DOIUrl":"https://doi.org/10.1037/met0000601","url":null,"abstract":"<p><p>Several authors have recommended adopting the receiver operator characteristic (ROC) area under the curve (AUC) or mean ridit as an effect size, arguing that it measures an important and interpretable type of effect that conventional effect-size measures do not. It is base-rate insensitive, robust to outliers, and invariant under order-preserving transformations. However, applications have been limited to group comparisons, and usually just two groups, in line with the popular interpretation of the AUC as measuring the probability that a randomly chosen case from one group will score higher on the dependent variable than a randomly chosen case from another group. This tutorial article shows that the AUC can be used as an effect size for both categorical and continuous predictors in a wide variety of general linear models, whose dependent variables may be ordinal, interval, or ratio level. Thus, the AUC is a general effect-size measure. Demonstrations in this article include linear regression, ordinal logistic regression, gamma regression, and beta regression. The online supplemental materials to this tutorial provide a survey of currently available software resources in R for the AUC and ridits, along with the code and access to the data used in the examples. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9776732","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Demystifying omega squared: Practical guidance for effect size in common analysis of variance designs. 揭开欧米茄平方的神秘面纱:方差设计中常见分析效应大小的实用指南。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-10 DOI: 10.1037/met0000581.supp
Antoinette D A Kroes, Jason R. Finley
Omega squared (ω^2) is a measure of effect size for analysis of variance (ANOVA) designs. It is less biased than eta squared, but reported less often. This is in part due to lack of clear guidance on how to calculate it. In this paper, we discuss the logic behind effect size measures, the problem with eta squared, the history of omega squared, and why it has been underused. We then provide a user-friendly guide to omega squared and partial omega squared for ANOVA designs with fixed factors, including one-way, two-way, and three-way designs, using within-subjects factors and/or between-subjects factors. We show how to calculate omega squared using output from SPSS. We provide information on the calculation of confidence intervals. We examine the problems of nonadditivity, and intrinsic versus extrinsic factors. We argue that statistical package developers could play an important role in making the calculation of omega squared easier. Finally, we recommend that researchers report the formulas used in calculating effect sizes, include confidence intervals if possible, and include ANOVA tables in the online supplemental materials of their work. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
Omega平方(ω^2)是方差分析(ANOVA)设计的效应大小的度量。它的偏差小于eta平方,但报告的频率较低。这在一定程度上是由于缺乏关于如何计算它的明确指导。在本文中,我们讨论了效应大小度量背后的逻辑、eta平方的问题、omega平方的历史,以及为什么它没有得到充分利用。然后,我们使用受试者内部因素和/或受试者之间因素,为具有固定因素的方差分析设计(包括单向、双向和三向设计)提供了一个关于ω平方和偏ω平方的用户友好指南。我们展示了如何使用SPSS的输出计算ω平方。我们提供有关置信区间计算的信息。我们研究了非相加性问题,以及内在因素与外在因素的关系。我们认为,统计包开发人员可以在简化ω平方的计算方面发挥重要作用。最后,我们建议研究人员报告用于计算效应大小的公式,如果可能的话,包括置信区间,并在他们工作的在线补充材料中包括方差分析表。(PsycInfo数据库记录(c)2023 APA,保留所有权利)。
{"title":"Demystifying omega squared: Practical guidance for effect size in common analysis of variance designs.","authors":"Antoinette D A Kroes, Jason R. Finley","doi":"10.1037/met0000581.supp","DOIUrl":"https://doi.org/10.1037/met0000581.supp","url":null,"abstract":"Omega squared (ω^2) is a measure of effect size for analysis of variance (ANOVA) designs. It is less biased than eta squared, but reported less often. This is in part due to lack of clear guidance on how to calculate it. In this paper, we discuss the logic behind effect size measures, the problem with eta squared, the history of omega squared, and why it has been underused. We then provide a user-friendly guide to omega squared and partial omega squared for ANOVA designs with fixed factors, including one-way, two-way, and three-way designs, using within-subjects factors and/or between-subjects factors. We show how to calculate omega squared using output from SPSS. We provide information on the calculation of confidence intervals. We examine the problems of nonadditivity, and intrinsic versus extrinsic factors. We argue that statistical package developers could play an important role in making the calculation of omega squared easier. Finally, we recommend that researchers report the formulas used in calculating effect sizes, include confidence intervals if possible, and include ANOVA tables in the online supplemental materials of their work. (PsycInfo Database Record (c) 2023 APA, all rights reserved).","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"46555021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Supplemental Material for Correspondence Measures for Assessing Replication Success 评估复制成功的对应措施的补充材料
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-10 DOI: 10.1037/met0000597.supp
{"title":"Supplemental Material for Correspondence Measures for Assessing Replication Success","authors":"","doi":"10.1037/met0000597.supp","DOIUrl":"https://doi.org/10.1037/met0000597.supp","url":null,"abstract":"","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"45980999","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Bayesian regularization in multiple-indicators multiple-causes models. 多指标多原因模型中的贝叶斯正则化。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-10 DOI: 10.1037/met0000594.supp
Lijin Zhang, Xinya Liang
Integrating regularization methods into structural equation modeling is gaining increasing popularity. The purpose of regularization is to improve variable selection, model estimation, and prediction accuracy. In this study, we aim to: (a) compare Bayesian regularization methods for exploring covariate effects in multiple-indicators multiple-causes models, (b) examine the sensitivity of results to hyperparameter settings of penalty priors, and (c) investigate prediction accuracy through cross-validation. The Bayesian regularization methods examined included: ridge, lasso, adaptive lasso, spike-and-slab prior (SSP) and its variants, and horseshoe and its variants. Sparse solutions were developed for the structural coefficient matrix that contained only a small portion of nonzero path coefficients characterizing the effects of selected covariates on the latent variable. Results from the simulation study showed that compared to diffuse priors, penalty priors were advantageous in handling small sample sizes and collinearity among covariates. Priors with only the global penalty (ridge and lasso) yielded higher model convergence rates and power, whereas priors with both the global and local penalties (horseshoe and SSP) provided more accurate parameter estimates for medium and large covariate effects. The horseshoe and SSP improved accuracy in predicting factor scores, while achieving more parsimonious models. (PsycInfo Database Record (c) 2023 APA, all rights reserved).
将正则化方法集成到结构方程建模中越来越受欢迎。正则化的目的是提高变量选择、模型估计和预测精度。在这项研究中,我们的目标是:(a)比较贝叶斯正则化方法来探索多指标多原因模型中的协变量效应,(b)检验结果对惩罚先验超参数设置的敏感性,以及(c)通过交叉验证来研究预测的准确性。研究的贝叶斯正则化方法包括:脊法、套索法、自适应套索法、钉板先验(SSP)及其变体、马蹄法及其变体。我们为结构系数矩阵开发了稀疏解,该矩阵只包含一小部分表征选定协变量对潜在变量影响的非零路径系数。仿真研究结果表明,与扩散先验相比,惩罚先验在处理小样本量和协变量间共线性方面具有优势。只有全局惩罚的先验(ridge和lasso)产生了更高的模型收敛率和功率,而同时具有全局和局部惩罚的先验(horseshoe和SSP)为中、大协变量效应提供了更准确的参数估计。马蹄形和SSP提高了预测因子得分的准确性,同时实现了更简洁的模型。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Bayesian regularization in multiple-indicators multiple-causes models.","authors":"Lijin Zhang, Xinya Liang","doi":"10.1037/met0000594.supp","DOIUrl":"https://doi.org/10.1037/met0000594.supp","url":null,"abstract":"Integrating regularization methods into structural equation modeling is gaining increasing popularity. The purpose of regularization is to improve variable selection, model estimation, and prediction accuracy. In this study, we aim to: (a) compare Bayesian regularization methods for exploring covariate effects in multiple-indicators multiple-causes models, (b) examine the sensitivity of results to hyperparameter settings of penalty priors, and (c) investigate prediction accuracy through cross-validation. The Bayesian regularization methods examined included: ridge, lasso, adaptive lasso, spike-and-slab prior (SSP) and its variants, and horseshoe and its variants. Sparse solutions were developed for the structural coefficient matrix that contained only a small portion of nonzero path coefficients characterizing the effects of selected covariates on the latent variable. Results from the simulation study showed that compared to diffuse priors, penalty priors were advantageous in handling small sample sizes and collinearity among covariates. Priors with only the global penalty (ridge and lasso) yielded higher model convergence rates and power, whereas priors with both the global and local penalties (horseshoe and SSP) provided more accurate parameter estimates for medium and large covariate effects. The horseshoe and SSP improved accuracy in predicting factor scores, while achieving more parsimonious models. (PsycInfo Database Record (c) 2023 APA, all rights reserved).","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"47361273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Psychological methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1