首页 > 最新文献

Psychological methods最新文献

英文 中文
Bayesian regularization in multiple-indicators multiple-causes models. 多指标多原因模型中的贝叶斯正则化。
IF 7.6 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-08-01 Epub Date: 2023-07-27 DOI: 10.1037/met0000594
Lijin Zhang, Xinya Liang

Integrating regularization methods into structural equation modeling is gaining increasing popularity. The purpose of regularization is to improve variable selection, model estimation, and prediction accuracy. In this study, we aim to: (a) compare Bayesian regularization methods for exploring covariate effects in multiple-indicators multiple-causes models, (b) examine the sensitivity of results to hyperparameter settings of penalty priors, and (c) investigate prediction accuracy through cross-validation. The Bayesian regularization methods examined included: ridge, lasso, adaptive lasso, spike-and-slab prior (SSP) and its variants, and horseshoe and its variants. Sparse solutions were developed for the structural coefficient matrix that contained only a small portion of nonzero path coefficients characterizing the effects of selected covariates on the latent variable. Results from the simulation study showed that compared to diffuse priors, penalty priors were advantageous in handling small sample sizes and collinearity among covariates. Priors with only the global penalty (ridge and lasso) yielded higher model convergence rates and power, whereas priors with both the global and local penalties (horseshoe and SSP) provided more accurate parameter estimates for medium and large covariate effects. The horseshoe and SSP improved accuracy in predicting factor scores, while achieving more parsimonious models. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

将正则化方法集成到结构方程建模中越来越受欢迎。正则化的目的是提高变量选择、模型估计和预测精度。在这项研究中,我们的目标是:(a)比较贝叶斯正则化方法来探索多指标多原因模型中的协变量效应,(b)检验结果对惩罚先验超参数设置的敏感性,以及(c)通过交叉验证来研究预测的准确性。研究的贝叶斯正则化方法包括:脊法、套索法、自适应套索法、钉板先验(SSP)及其变体、马蹄法及其变体。我们为结构系数矩阵开发了稀疏解,该矩阵只包含一小部分表征选定协变量对潜在变量影响的非零路径系数。仿真研究结果表明,与扩散先验相比,惩罚先验在处理小样本量和协变量间共线性方面具有优势。只有全局惩罚的先验(ridge和lasso)产生了更高的模型收敛率和功率,而同时具有全局和局部惩罚的先验(horseshoe和SSP)为中、大协变量效应提供了更准确的参数估计。马蹄形和SSP提高了预测因子得分的准确性,同时实现了更简洁的模型。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Bayesian regularization in multiple-indicators multiple-causes models.","authors":"Lijin Zhang, Xinya Liang","doi":"10.1037/met0000594","DOIUrl":"10.1037/met0000594","url":null,"abstract":"<p><p>Integrating regularization methods into structural equation modeling is gaining increasing popularity. The purpose of regularization is to improve variable selection, model estimation, and prediction accuracy. In this study, we aim to: (a) compare Bayesian regularization methods for exploring covariate effects in multiple-indicators multiple-causes models, (b) examine the sensitivity of results to hyperparameter settings of penalty priors, and (c) investigate prediction accuracy through cross-validation. The Bayesian regularization methods examined included: ridge, lasso, adaptive lasso, spike-and-slab prior (SSP) and its variants, and horseshoe and its variants. Sparse solutions were developed for the structural coefficient matrix that contained only a small portion of nonzero path coefficients characterizing the effects of selected covariates on the latent variable. Results from the simulation study showed that compared to diffuse priors, penalty priors were advantageous in handling small sample sizes and collinearity among covariates. Priors with only the global penalty (ridge and lasso) yielded higher model convergence rates and power, whereas priors with both the global and local penalties (horseshoe and SSP) provided more accurate parameter estimates for medium and large covariate effects. The horseshoe and SSP improved accuracy in predicting factor scores, while achieving more parsimonious models. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"679-703"},"PeriodicalIF":7.6,"publicationDate":"2024-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10241486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing theories with the Ising model of explanatory coherence. 用解释一致性的伊辛模型比较理论。
IF 7.6 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-06-01 Epub Date: 2023-03-02 DOI: 10.1037/met0000543
Maximilian Maier, Noah van Dongen, Denny Borsboom

[Correction Notice: An Erratum for this article was reported in Vol 29(3) of Psychological Methods (see record 2025-28068-002). In the article, the copyright attribution was incorrectly listed, and the Creative Commons CC BY 4.0 license disclaimer was incorrectly omitted from the author note. The correct copyright is "© 2023 The Author(s)," and the omitted disclaimer is below: Open Access funding provided by University College London: This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0; https://creativecommons.org/licenses/by/ 4.0). This license permits copying and redistributing the work in any medium or format, as well as adapting the material for any purpose, even commercially.] Theories are among the most important tools of science. Lewin (1943) already noted "There is nothing as practical as a good theory." Although psychologists discussed problems of theory in their discipline for a long time, weak theories are still widespread in most subfields. One possible reason for this is that psychologists lack the tools to systematically assess the quality of their theories. Thagard (1989) developed a computational model for formal theory evaluation based on the concept of explanatory coherence. However, there are possible improvements to Thagard's (1989) model and it is not available in software that psychologists typically use. Therefore, we developed a new implementation of explanatory coherence based on the Ising model. We demonstrate the capabilities of this new Ising model of Explanatory Coherence (IMEC) on several examples from psychology and other sciences. In addition, we implemented it in the R-package IMEC to assist scientists in evaluating the quality of their theories in practice. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

理论是科学最重要的工具之一。卢因(1943 年)已经指出:"没有什么比一个好的理论更实用了"。尽管心理学家们对本学科的理论问题讨论了很长时间,但在大多数分支领域,理论薄弱的现象仍然普遍存在。其中一个可能的原因是,心理学家缺乏系统评估其理论质量的工具。塔加德(Thagard,1989 年)基于解释一致性的概念,开发了一个用于正式理论评估的计算模型。然而,Thagard(1989 年)的模型还有可能改进,而且心理学家通常使用的软件中也没有这个模型。因此,我们根据伊辛模型开发了一种新的解释一致性实施方法。我们通过心理学和其他科学领域的几个例子,展示了这种新的解释一致性伊辛模型(IMEC)的功能。此外,我们还在 R 软件包 IMEC 中实现了这一模型,以帮助科学家在实践中评估其理论的质量。(PsycInfo Database Record (c) 2023 APA, 版权所有)。
{"title":"Comparing theories with the Ising model of explanatory coherence.","authors":"Maximilian Maier, Noah van Dongen, Denny Borsboom","doi":"10.1037/met0000543","DOIUrl":"10.1037/met0000543","url":null,"abstract":"<p><p>[Correction Notice: An Erratum for this article was reported in Vol 29(3) of <i>Psychological Methods</i> (see record 2025-28068-002). In the article, the copyright attribution was incorrectly listed, and the Creative Commons CC BY 4.0 license disclaimer was incorrectly omitted from the author note. The correct copyright is \"© 2023 The Author(s),\" and the omitted disclaimer is below: Open Access funding provided by University College London: This work is licensed under a Creative Commons Attribution 4.0 International License (CC BY 4.0; https://creativecommons.org/licenses/by/ 4.0). This license permits copying and redistributing the work in any medium or format, as well as adapting the material for any purpose, even commercially.] Theories are among the most important tools of science. Lewin (1943) already noted \"There is nothing as practical as a good theory.\" Although psychologists discussed problems of theory in their discipline for a long time, weak theories are still widespread in most subfields. One possible reason for this is that psychologists lack the tools to systematically assess the quality of their theories. Thagard (1989) developed a computational model for formal theory evaluation based on the concept of explanatory coherence. However, there are possible improvements to Thagard's (1989) model and it is not available in software that psychologists typically use. Therefore, we developed a new implementation of explanatory coherence based on the Ising model. We demonstrate the capabilities of this new Ising model of Explanatory Coherence (IMEC) on several examples from psychology and other sciences. In addition, we implemented it in the R-package IMEC to assist scientists in evaluating the quality of their theories in practice. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"519-536"},"PeriodicalIF":7.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10805750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Let the algorithm speak: How to use neural networks for automatic item generation in psychological scale development. 让算法说话:如何在心理量表开发中使用神经网络自动生成项目。
IF 7.6 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-06-01 Epub Date: 2023-02-16 DOI: 10.1037/met0000540
Friedrich M Götz, Rakoen Maertens, Sahil Loomba, Sander van der Linden

Measurement is at the heart of scientific research. As many-perhaps most-psychological constructs cannot be directly observed, there is a steady demand for reliable self-report scales to assess latent constructs. However, scale development is a tedious process that requires researchers to produce good items in large quantities. In this tutorial, we introduce, explain, and apply the Psychometric Item Generator (PIG), an open-source, free-to-use, self-sufficient natural language processing algorithm that produces large-scale, human-like, customized text output within a few mouse clicks. The PIG is based on the GPT-2, a powerful generative language model, and runs on Google Colaboratory-an interactive virtual notebook environment that executes code on state-of-the-art virtual machines at no cost. Across two demonstrations and a preregistered five-pronged empirical validation with two Canadian samples (NSample 1 = 501, NSample 2 = 773), we show that the PIG is equally well-suited to generate large pools of face-valid items for novel constructs (i.e., wanderlust) and create parsimonious short scales of existing constructs (i.e., Big Five personality traits) that yield strong performances when tested in the wild and benchmarked against current gold standards for assessment. The PIG does not require any prior coding skills or access to computational resources and can easily be tailored to any desired context by simply switching out short linguistic prompts in a single line of code. In short, we present an effective, novel machine learning solution to an old psychological challenge. As such, the PIG will not require you to learn a new language-but instead, speak yours. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

测量是科学研究的核心。由于许多--也许是大多数--心理结构无法被直接观察到,因此人们一直需要可靠的自我报告量表来评估潜在的结构。然而,量表的开发是一个乏味的过程,需要研究人员大量制作优秀的项目。在本教程中,我们将介绍、解释并应用心理测量项目生成器(PIG),它是一种开源、免费使用、自给自足的自然语言处理算法,只需点击几下鼠标就能生成大规模、类似人类的定制文本输出。PIG 基于 GPT-2(一种功能强大的生成语言模型),在 Google Colaboratory 上运行,这是一种交互式虚拟笔记本环境,可在最先进的虚拟机上免费执行代码。通过两次演示和预先注册的两个加拿大样本(NSample 1 = 501,NSample 2 = 773)的五方面经验验证,我们表明 PIG 同样适用于为新结构(如流浪癖)生成大量的面验证项目,并为现有结构(如五大人格特质)创建简明的短量表,这些短量表在野外测试和以当前的黄金评估标准为基准时表现出色。PIG 不需要任何编码技能或计算资源,只需在一行代码中切换出简短的语言提示,就能轻松地根据任何需要的情境进行定制。简而言之,我们为一项古老的心理挑战提供了一种有效、新颖的机器学习解决方案。因此,PIG 不需要你学习新的语言,而是用你的语言说话。(PsycInfo Database Record (c) 2023 APA,保留所有权利)。
{"title":"Let the algorithm speak: How to use neural networks for automatic item generation in psychological scale development.","authors":"Friedrich M Götz, Rakoen Maertens, Sahil Loomba, Sander van der Linden","doi":"10.1037/met0000540","DOIUrl":"10.1037/met0000540","url":null,"abstract":"<p><p>Measurement is at the heart of scientific research. As many-perhaps most-psychological constructs cannot be directly observed, there is a steady demand for reliable self-report scales to assess latent constructs. However, scale development is a tedious process that requires researchers to produce good items in large quantities. In this tutorial, we introduce, explain, and apply the Psychometric Item Generator (PIG), an open-source, free-to-use, self-sufficient natural language processing algorithm that produces large-scale, human-like, customized text output within a few mouse clicks. The PIG is based on the GPT-2, a powerful generative language model, and runs on Google Colaboratory-an interactive virtual notebook environment that executes code on state-of-the-art virtual machines at no cost. Across two demonstrations and a preregistered five-pronged empirical validation with two Canadian samples (<i>N</i><sub>Sample 1</sub> = 501, <i>N</i><sub>Sample 2</sub> = 773), we show that the PIG is equally well-suited to generate large pools of face-valid items for novel constructs (i.e., wanderlust) and create parsimonious short scales of existing constructs (i.e., Big Five personality traits) that yield strong performances when tested in the wild and benchmarked against current gold standards for assessment. The PIG does not require any prior coding skills or access to computational resources and can easily be tailored to any desired context by simply switching out short linguistic prompts in a single line of code. In short, we present an effective, novel machine learning solution to an old psychological challenge. As such, the PIG will not require you to learn a new language-but instead, speak yours. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"494-518"},"PeriodicalIF":7.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10787831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Causal inference for treatment effects in partially nested designs. 部分嵌套设计中治疗效果的因果推断。
IF 7.6 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-06-01 Epub Date: 2023-04-13 DOI: 10.1037/met0000565
Xiao Liu, Fang Liu, Laura Miller-Graff, Kathryn H Howell, Lijuan Wang

artially nested designs (PNDs) are common in intervention studies in psychology and other social sciences. With this design, participants are assigned to treatment and control groups on an individual basis, but clustering occurs in some but not all groups (e.g., the treatment group). In recent years, there has been substantial development of methods for analyzing data from PNDs. However, little research has been done on causal inference for PNDs, especially for PNDs with nonrandomized treatment assignments. To reduce the research gap, in the current study, we used the expanded potential outcomes framework to define and identify the average causal treatment effects in PNDs. Based on the identification results, we formulated the outcome models that could produce treatment effect estimates with causal interpretation and evaluated how alternative model specifications affect the causal interpretation. We also developed an inverse propensity weighted (IPW) estimation approach and proposed a sandwich-type standard error estimator for the IPW-based estimate. Our simulation studies demonstrated that both the outcome modeling and the IPW methods specified following the identification results can yield satisfactory estimates and inferences of the average causal treatment effects. We applied the proposed approaches to data from a real-life pilot study of the Pregnant Moms' Empowerment Program for illustration. The current study provides guidance and insights on causal inference for PNDs and adds to researchers' toolbox of treatment effect estimation with PNDs. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

部分嵌套设计(PND)在心理学和其他社会科学的干预研究中很常见。在这种设计中,参与者以个体为单位被分配到治疗组和对照组,但在某些组而不是所有组(如治疗组)中会出现聚类现象。近年来,分析 PND 数据的方法有了长足的发展。然而,对于 PND 的因果推断,尤其是对于非随机治疗分配的 PND,却鲜有研究。为了缩小研究差距,在本研究中,我们使用了扩展的潜在结果框架来定义和识别 PND 的平均因果治疗效果。根据识别结果,我们建立了能够产生具有因果解释的治疗效果估计值的结果模型,并评估了替代模型规格对因果解释的影响。我们还开发了一种反倾向加权(IPW)估算方法,并为基于 IPW 的估算提出了一种三明治型标准误差估算器。我们的模拟研究表明,根据识别结果指定的结果建模和 IPW 方法都能对平均因果治疗效果做出令人满意的估计和推断。我们将所提出的方法应用于 "孕妇妈妈赋权计划 "的实际试点研究数据,以资说明。本研究为 PND 的因果推断提供了指导和启示,并为研究人员利用 PND 估算治疗效果提供了更多的工具。(PsycInfo Database Record (c) 2024 APA, 版权所有)。
{"title":"Causal inference for treatment effects in partially nested designs.","authors":"Xiao Liu, Fang Liu, Laura Miller-Graff, Kathryn H Howell, Lijuan Wang","doi":"10.1037/met0000565","DOIUrl":"10.1037/met0000565","url":null,"abstract":"<p><p>artially nested designs (PNDs) are common in intervention studies in psychology and other social sciences. With this design, participants are assigned to treatment and control groups on an individual basis, but clustering occurs in some but not all groups (e.g., the treatment group). In recent years, there has been substantial development of methods for analyzing data from PNDs. However, little research has been done on causal inference for PNDs, especially for PNDs with nonrandomized treatment assignments. To reduce the research gap, in the current study, we used the expanded potential outcomes framework to define and identify the average causal treatment effects in PNDs. Based on the identification results, we formulated the outcome models that could produce treatment effect estimates with causal interpretation and evaluated how alternative model specifications affect the causal interpretation. We also developed an inverse propensity weighted (IPW) estimation approach and proposed a sandwich-type standard error estimator for the IPW-based estimate. Our simulation studies demonstrated that both the outcome modeling and the IPW methods specified following the identification results can yield satisfactory estimates and inferences of the average causal treatment effects. We applied the proposed approaches to data from a real-life pilot study of the Pregnant Moms' Empowerment Program for illustration. The current study provides guidance and insights on causal inference for PNDs and adds to researchers' toolbox of treatment effect estimation with PNDs. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"457-479"},"PeriodicalIF":7.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9737459","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
We need to change how we compute RMSEA for nested model comparisons in structural equation modeling. 我们需要改变结构方程建模中嵌套模型比较的 RMSEA 计算方法。
IF 7.6 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-06-01 Epub Date: 2023-01-09 DOI: 10.1037/met0000537
Victoria Savalei, Jordan C Brace, Rachel T Fouladi

Comparison of nested models is common in applications of structural equation modeling (SEM). When two models are nested, model comparison can be done via a chi-square difference test or by comparing indices of approximate fit. The advantage of fit indices is that they permit some amount of misspecification in the additional constraints imposed on the model, which is a more realistic scenario. The most popular index of approximate fit is the root mean square error of approximation (RMSEA). In this article, we argue that the dominant way of comparing RMSEA values for two nested models, which is simply taking their difference, is problematic and will often mask misfit, particularly in model comparisons with large initial degrees of freedom. We instead advocate computing the RMSEA associated with the chi-square difference test, which we call RMSEAD. We are not the first to propose this index, and we review numerous methodological articles that have suggested it. Nonetheless, these articles appear to have had little impact on actual practice. The modification of current practice that we call for may be particularly needed in the context of measurement invariance assessment. We illustrate the difference between the current approach and our advocated approach on three examples, where two involve multiple-group and longitudinal measurement invariance assessment and the third involves comparisons of models with different numbers of factors. We conclude with a discussion of recommendations and future research directions. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

在结构方程建模(SEM)的应用中,嵌套模型的比较很常见。当两个模型嵌套时,可以通过卡方差异检验或比较近似拟合指数来进行模型比较。拟合指数的优势在于,它们允许在对模型施加的额外约束条件中存在一定程度的错误规范,这是更现实的情况。最常用的近似拟合指数是近似均方根误差(RMSEA)。在本文中,我们认为比较两个嵌套模型 RMSEA 值的主流方法,即简单地取它们的差值,是有问题的,往往会掩盖不拟合,特别是在初始自由度较大的模型比较中。我们主张计算与卡方差检验相关的 RMSEA,我们称之为 RMSEAD。我们并不是第一个提出这一指标的人,我们回顾了许多提出这一指标的方法论文章。然而,这些文章似乎对实际操作影响甚微。在测量不变性评估方面,我们呼吁对当前实践进行修改,这可能是特别需要的。我们通过三个例子来说明当前方法与我们所提倡的方法之间的区别,其中两个例子涉及多组和纵向测量不变性评估,第三个例子涉及不同因子数量模型的比较。最后,我们对建议和未来研究方向进行了讨论。(PsycInfo Database Record (c) 2023 APA, 版权所有)。
{"title":"We need to change how we compute RMSEA for nested model comparisons in structural equation modeling.","authors":"Victoria Savalei, Jordan C Brace, Rachel T Fouladi","doi":"10.1037/met0000537","DOIUrl":"10.1037/met0000537","url":null,"abstract":"<p><p>Comparison of nested models is common in applications of structural equation modeling (SEM). When two models are nested, model comparison can be done via a chi-square difference test or by comparing indices of approximate fit. The advantage of fit indices is that they permit some amount of misspecification in the additional constraints imposed on the model, which is a more realistic scenario. The most popular index of approximate fit is the root mean square error of approximation (RMSEA). In this article, we argue that the dominant way of comparing RMSEA values for two nested models, which is simply taking their difference, is problematic and will often mask misfit, particularly in model comparisons with large initial degrees of freedom. We instead advocate computing the RMSEA associated with the chi-square difference test, which we call RMSEA<sub>D</sub>. We are not the first to propose this index, and we review numerous methodological articles that have suggested it. Nonetheless, these articles appear to have had little impact on actual practice. The modification of current practice that we call for may be particularly needed in the context of measurement invariance assessment. We illustrate the difference between the current approach and our advocated approach on three examples, where two involve multiple-group and longitudinal measurement invariance assessment and the third involves comparisons of models with different numbers of factors. We conclude with a discussion of recommendations and future research directions. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"480-493"},"PeriodicalIF":7.6,"publicationDate":"2024-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10495377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Reconsideration of the type I error rate for psychological science in the era of replication. 复制时代对心理科学I型错误率的再思考。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-04-01 Epub Date: 2022-04-11 DOI: 10.1037/met0000490
Michael T Carlin, Mack S Costello, Madisyn A Flansburg, Alyssa Darden

Careful consideration of the tradeoff between Type I and Type II error rates when designing experiments is critical for maximizing statistical decision accuracy. Typically, Type I error rates (e.g., .05) are significantly lower than Type II error rates (e.g., .20 for .80 power) in psychological science. Further, positive findings (true effects and Type I errors) are more likely to be the focus of replication. This conventional approach leads to very high rates of Type II error. Analyses show that increasing the Type I error rate to .10, thereby increasing power and decreasing the Type II error rate for each test, leads to higher overall rates of correct statistical decisions. This increase of Type I error rate is consistent with, and most beneficial in the context of, the replication and "New Statistics" movements in psychology. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

在设计实验时,仔细考虑I型和II型错误率之间的权衡对于最大限度地提高统计决策准确性至关重要。通常,在心理科学中,I型错误率(例如.05)显著低于II型错误率。此外,积极的发现(真实效果和I型错误)更有可能成为复制的焦点。这种传统方法导致非常高的II型错误率。分析表明,将I型错误率提高到.10,从而增加每次测试的功率并降低II型错误率,可以提高统计决策的总体正确率。I型错误率的增加与心理学中的复制和“新统计学”运动相一致,在这种情况下最为有益。(PsycInfo数据库记录(c)2022 APA,保留所有权利)。
{"title":"Reconsideration of the type I error rate for psychological science in the era of replication.","authors":"Michael T Carlin, Mack S Costello, Madisyn A Flansburg, Alyssa Darden","doi":"10.1037/met0000490","DOIUrl":"10.1037/met0000490","url":null,"abstract":"<p><p>Careful consideration of the tradeoff between Type I and Type II error rates when designing experiments is critical for maximizing statistical decision accuracy. Typically, Type I error rates (e.g., .05) are significantly lower than Type II error rates (e.g., .20 for .80 power) in psychological science. Further, positive findings (true effects and Type I errors) are more likely to be the focus of replication. This conventional approach leads to very high rates of Type II error. Analyses show that increasing the Type I error rate to .10, thereby increasing power and decreasing the Type II error rate for each test, leads to higher overall rates of correct statistical decisions. This increase of Type I error rate is consistent with, and most beneficial in the context of, the replication and \"New Statistics\" movements in psychology. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"1 1","pages":"379-387"},"PeriodicalIF":7.0,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41346077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient alternatives for Bayesian hypothesis tests in psychology. 心理学中贝叶斯假设检验的有效替代方案。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-04-01 Epub Date: 2022-04-14 DOI: 10.1037/met0000482
Sandipan Pramanik, Valen E Johnson

Bayesian hypothesis testing procedures have gained increased acceptance in recent years. A key advantage that Bayesian tests have over classical testing procedures is their potential to quantify information in support of true null hypotheses. Ironically, default implementations of Bayesian tests prevent the accumulation of strong evidence in favor of true null hypotheses because associated default alternative hypotheses assign a high probability to data that are most consistent with a null effect. We propose the use of "nonlocal" alternative hypotheses to resolve this paradox. The resulting class of Bayesian hypothesis tests permits more rapid accumulation of evidence in favor of both true null hypotheses and alternative hypotheses that are compatible with standardized effect sizes of most interest in psychology. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

近年来,贝叶斯假设检验程序得到了越来越多的认可。与经典测试程序相比,贝叶斯测试的一个关键优势是它们有可能量化支持真零假设的信息。具有讽刺意味的是,贝叶斯测试的默认实现阻止了有利于真正零假设的有力证据的积累,因为相关的默认替代假设为最符合零效应的数据分配了高概率。我们建议使用“非局部”替代假设来解决这个悖论。由此产生的一类贝叶斯假设测试允许更快速地积累有利于真零假设和替代假设的证据,这些假设与心理学中最感兴趣的标准化效应大小相兼容。(PsycInfo数据库记录(c)2022 APA,保留所有权利)。
{"title":"Efficient alternatives for Bayesian hypothesis tests in psychology.","authors":"Sandipan Pramanik, Valen E Johnson","doi":"10.1037/met0000482","DOIUrl":"10.1037/met0000482","url":null,"abstract":"<p><p>Bayesian hypothesis testing procedures have gained increased acceptance in recent years. A key advantage that Bayesian tests have over classical testing procedures is their potential to quantify information in support of true null hypotheses. Ironically, default implementations of Bayesian tests prevent the accumulation of strong evidence in favor of true null hypotheses because associated default alternative hypotheses assign a high probability to data that are most consistent with a null effect. We propose the use of \"nonlocal\" alternative hypotheses to resolve this paradox. The resulting class of Bayesian hypothesis tests permits more rapid accumulation of evidence in favor of both true null hypotheses and alternative hypotheses that are compatible with standardized effect sizes of most interest in psychology. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"243-261"},"PeriodicalIF":7.0,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9561355/pdf/nihms-1808497.pdf","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41210713","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing measurement invariance with moderated nonlinear factor analysis using the R package OpenMx. 利用 R 软件包 OpenMx 进行调节非线性因子分析,评估测量不变性。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-04-01 Epub Date: 2022-07-04 DOI: 10.1037/met0000501
Laura Kolbe, Dylan Molenaar, Suzanne Jak, Terrence D Jorgensen

Assessing measurement invariance is an important step in establishing a meaningful comparison of measurements of a latent construct across individuals or groups. Most recently, moderated nonlinear factor analysis (MNLFA) has been proposed as a method to assess measurement invariance. In MNLFA models, measurement invariance is examined in a single-group confirmatory factor analysis model by means of parameter moderation. The advantages of MNLFA over other methods is that it (a) accommodates the assessment of measurement invariance across multiple continuous and categorical background variables and (b) accounts for heteroskedasticity by allowing the factor and residual variances to differ as a function of the background variables. In this article, we aim to make MNLFA more accessible to researchers without access to commercial structural equation modeling software by demonstrating how this method can be applied with the open-source R package OpenMx. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

评估测量不变性是对不同个体或群体的潜在构念的测量结果进行有意义比较的重要步骤。最近,有人提出了调节性非线性因子分析(MNLFA)作为一种评估测量不变量的方法。在 MNLFA 模型中,测量不变性是通过参数调节的方式在单组确认性因子分析模型中进行检验的。与其他方法相比,MNLFA 的优点在于:(a) 可以评估多个连续和分类背景变量的测量不变量;(b) 允许因子方差和残差方差随背景变量的变化而变化,从而考虑到异方差。在本文中,我们旨在通过演示如何使用开源 R 软件包 OpenMx 来应用 MNLFA,使无法使用商业结构方程建模软件的研究人员更容易使用 MNLFA。(PsycInfo Database Record (c) 2024 APA, 版权所有)。
{"title":"Assessing measurement invariance with moderated nonlinear factor analysis using the R package OpenMx.","authors":"Laura Kolbe, Dylan Molenaar, Suzanne Jak, Terrence D Jorgensen","doi":"10.1037/met0000501","DOIUrl":"10.1037/met0000501","url":null,"abstract":"<p><p>Assessing measurement invariance is an important step in establishing a meaningful comparison of measurements of a latent construct across individuals or groups. Most recently, moderated nonlinear factor analysis (MNLFA) has been proposed as a method to assess measurement invariance. In MNLFA models, measurement invariance is examined in a single-group confirmatory factor analysis model by means of parameter moderation. The advantages of MNLFA over other methods is that it (a) accommodates the assessment of measurement invariance across multiple continuous and categorical background variables and (b) accounts for heteroskedasticity by allowing the factor and residual variances to differ as a function of the background variables. In this article, we aim to make MNLFA more accessible to researchers without access to commercial structural equation modeling software by demonstrating how this method can be applied with the open-source R package OpenMx. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"388-406"},"PeriodicalIF":7.0,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9577801","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Harnessing the power of excess statistical significance: Weighted and iterative least squares. 利用过度统计显著性的力量:加权和迭代最小二乘法。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-04-01 Epub Date: 2022-05-12 DOI: 10.1037/met0000502
T D Stanley, Hristos Doucouliagos

We introduce a new meta-analysis estimator, the weighted and iterated least squares (WILS), that greatly reduces publication selection bias (PSB) when selective reporting for statistical significance (SSS) is present. WILS is the simple weighted average that has smaller bias and rates of false positives than conventional meta-analysis estimators, the unrestricted weighted least squares (UWLS), and the weighted average of the adequately powered (WAAP) when there is SSS. As a simple weighted average, it is not vulnerable to violations in publication bias corrections models' assumptions too often seen in application. WILS is based on the novel idea of allowing excess statistical significance (ESS), which is a necessary condition of SSS, to identify when and how to reduce PSB. We show in comparisons with large-scale preregistered replications and in evidence-based simulations that the remaining bias is small. The routine application of WILS in the place of random effects would do much to reduce conventional meta-analysis's notable biases and high rates of false positives. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

我们引入了一种新的荟萃分析估计器,加权迭代最小二乘法(WILS),当存在统计显著性选择性报告(SSS)时,它可以大大降低出版物选择偏差(PSB)。WILS是一种简单的加权平均值,与传统的荟萃分析估计量、无限制加权最小二乘法(UWLS)和有SSS时的充分加权平均值(WAAP)相比,其偏差和误报率较小。作为一个简单的加权平均值,它不容易受到应用中经常出现的出版物偏差校正模型假设的违反。WILS基于允许超额统计显著性(ESS)的新思想,这是SSS的必要条件,以确定何时以及如何减少PSB。我们在与大规模预注册复制的比较和循证模拟中表明,剩余的偏差很小。用WILS代替随机效应的常规应用将大大减少传统荟萃分析的显著偏差和高假阳性率。(PsycInfo数据库记录(c)2023 APA,保留所有权利)。
{"title":"Harnessing the power of excess statistical significance: Weighted and iterative least squares.","authors":"T D Stanley, Hristos Doucouliagos","doi":"10.1037/met0000502","DOIUrl":"10.1037/met0000502","url":null,"abstract":"<p><p>We introduce a new meta-analysis estimator, the weighted and iterated least squares (WILS), that greatly reduces publication selection bias (PSB) when selective reporting for statistical significance (SSS) is present. WILS is the simple weighted average that has smaller bias and rates of false positives than conventional meta-analysis estimators, the unrestricted weighted least squares (UWLS), and the weighted average of the adequately powered (WAAP) when there is SSS. As a simple weighted average, it is not vulnerable to violations in publication bias corrections models' assumptions too often seen in application. WILS is based on the novel idea of allowing excess statistical significance (ESS), which is a necessary condition of SSS, to identify when and how to reduce PSB. We show in comparisons with large-scale preregistered replications and in evidence-based simulations that the remaining bias is small. The routine application of WILS in the place of random effects would do much to reduce conventional meta-analysis's notable biases and high rates of false positives. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"407-420"},"PeriodicalIF":7.0,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41210714","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Factor analyzing ordinal items requires substantive knowledge of response marginals. 对序数项目进行因子分析需要对反应边际有实质性的了解。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2024-02-01 Epub Date: 2022-05-19 DOI: 10.1037/met0000495
Steffen Grønneberg, Njål Foldnes

In the social sciences, measurement scales often consist of ordinal items and are commonly analyzed using factor analysis. Either data are treated as continuous, or a discretization framework is imposed in order to take the ordinal scale properly into account. Correlational analysis is central in both approaches, and we review recent theory on correlations obtained from ordinal data. To ensure appropriate estimation, the item distributions prior to discretization should be (approximately) known, or the thresholds should be known to be equally spaced. We refer to such knowledge as substantive because it may not be extracted from the data, but must be rooted in expert knowledge about the data-generating process. An illustrative case is presented where absence of substantive knowledge of the item distributions inevitably leads the analyst to conclude that a truly two-dimensional case is perfectly one-dimensional. Additional studies probe the extent to which violation of the standard assumption of underlying normality leads to bias in correlations and factor models. As a remedy, we propose an adjusted polychoric estimator for ordinal factor analysis that takes substantive knowledge into account. Also, we demonstrate how to use the adjusted estimator in sensitivity analysis when the continuous item distributions are known only approximately. (PsycInfo Database Record (c) 2024 APA, all rights reserved).

在社会科学领域,测量量表通常由序数项目组成,通常使用因子分析进行分析。要么将数据视为连续数据,要么采用离散化框架,以便适当考虑序数量表。相关分析是这两种方法的核心,我们将回顾从序数数据中获得相关性的最新理论。为了确保适当的估计,离散化之前的项目分布应该是(近似)已知的,或者阈值应该是已知的等距分布。我们将这种知识称为实质性知识,因为它可能无法从数据中提取,而必须植根于有关数据生成过程的专家知识。我们将举例说明,如果缺乏关于项目分布的实质性知识,分析人员必然会得出结论,认为一个真正的二维案例完全是一维的。其他研究还探讨了违反基本正态性标准假设在多大程度上会导致相关性和因子模型出现偏差。作为一种补救措施,我们为序数因子分析提出了一种考虑到实质性知识的调整多变量估计器。此外,我们还演示了当连续项目分布仅为近似已知时,如何在敏感性分析中使用调整后的估计器。(PsycInfo Database Record (c) 2024 APA, all rights reserved)。
{"title":"Factor analyzing ordinal items requires substantive knowledge of response marginals.","authors":"Steffen Grønneberg, Njål Foldnes","doi":"10.1037/met0000495","DOIUrl":"10.1037/met0000495","url":null,"abstract":"<p><p>In the social sciences, measurement scales often consist of ordinal items and are commonly analyzed using factor analysis. Either data are treated as continuous, or a discretization framework is imposed in order to take the ordinal scale properly into account. Correlational analysis is central in both approaches, and we review recent theory on correlations obtained from ordinal data. To ensure appropriate estimation, the item distributions prior to discretization should be (approximately) known, or the thresholds should be known to be equally spaced. We refer to such knowledge as substantive because it may not be extracted from the data, but must be rooted in expert knowledge about the data-generating process. An illustrative case is presented where absence of substantive knowledge of the item distributions inevitably leads the analyst to conclude that a truly two-dimensional case is perfectly one-dimensional. Additional studies probe the extent to which violation of the standard assumption of underlying normality leads to bias in correlations and factor models. As a remedy, we propose an adjusted polychoric estimator for ordinal factor analysis that takes substantive knowledge into account. Also, we demonstrate how to use the adjusted estimator in sensitivity analysis when the continuous item distributions are known only approximately. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"65-87"},"PeriodicalIF":7.0,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9171744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Psychological methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1