首页 > 最新文献

Psychological methods最新文献

英文 中文
Which estimation method to choose in network psychometrics? Deriving guidelines for applied researchers. 在网络心理测量中应该选择哪种估计方法?为应用研究人员制定指导方针。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-08-01 DOI: 10.1037/met0000439
Adela-Maria Isvoranu, Sacha Epskamp

The Gaussian graphical model (GGM) has recently grown popular in psychological research, with a large body of estimation methods being proposed and discussed across various fields of study, and several algorithms being identified and recommend as applicable to psychological data sets. Such high-dimensional model estimation, however, is not trivial, and algorithms tend to perform differently in different settings. In addition, psychological research poses unique challenges, including placing a strong focus on weak edges (e.g., bridge edges), handling data measured on ordered scales, and relatively limited sample sizes. As a result, there is currently no consensus regarding which estimation procedure performs best in which setting. In this large-scale simulation study, we aimed to overcome this gap in the literature by comparing the performance of several estimation algorithms suitable for Gaussian and skewed ordered categorical data across a multitude of settings, as to arrive at concrete guidelines from applied researchers. In total, we investigated 60 different metrics across 564,000 simulated data sets. We summarized our findings through a platform that allows for manually exploring simulation results. Overall, we found that an exchange between discovery (e.g., sensitivity, edge weight correlation) and caution (e.g., specificity, precision) should always be expected, and achieving both-which is a requirement for perfect replicability-is difficult. Further, we identified that the estimation method is best chosen in light of each research question and have highlighted, alongside desirable asymptotic properties and low sample size discovery, results according to most common research questions in the field. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

高斯图形模型(GGM)最近在心理学研究中越来越流行,在各个研究领域提出和讨论了大量的估计方法,并确定了几种适用于心理学数据集的算法。然而,这种高维模型估计并不是微不足道的,而且算法在不同的设置中往往表现不同。此外,心理学研究提出了独特的挑战,包括将重点放在弱边缘(例如,桥梁边缘),处理有序尺度上测量的数据,以及相对有限的样本量。因此,目前对于哪种评估过程在哪种环境下表现最好还没有达成一致意见。在这项大规模的模拟研究中,我们的目标是通过比较几种适合高斯和偏序分类数据的估计算法在多种设置中的性能来克服文献中的这一差距,从而得出应用研究人员的具体指导方针。我们总共调查了564,000个模拟数据集中的60个不同指标。我们通过一个允许手动探索模拟结果的平台总结了我们的发现。总的来说,我们发现发现(例如,敏感性,边缘权重相关性)和谨慎(例如,特异性,精度)之间的交换应该始终是预期的,并且实现两者-这是完美可复制性的要求-是困难的。此外,我们确定了根据每个研究问题最好选择估计方法,并根据该领域最常见的研究问题强调了理想的渐近性质和低样本量发现结果。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Which estimation method to choose in network psychometrics? Deriving guidelines for applied researchers.","authors":"Adela-Maria Isvoranu,&nbsp;Sacha Epskamp","doi":"10.1037/met0000439","DOIUrl":"https://doi.org/10.1037/met0000439","url":null,"abstract":"<p><p>The Gaussian graphical model (GGM) has recently grown popular in psychological research, with a large body of estimation methods being proposed and discussed across various fields of study, and several algorithms being identified and recommend as applicable to psychological data sets. Such high-dimensional model estimation, however, is not trivial, and algorithms tend to perform differently in different settings. In addition, psychological research poses unique challenges, including placing a strong focus on weak edges (e.g., bridge edges), handling data measured on ordered scales, and relatively limited sample sizes. As a result, there is currently no consensus regarding which estimation procedure performs best in which setting. In this large-scale simulation study, we aimed to overcome this gap in the literature by comparing the performance of several estimation algorithms suitable for Gaussian and skewed ordered categorical data across a multitude of settings, as to arrive at concrete guidelines from applied researchers. In total, we investigated 60 different metrics across 564,000 simulated data sets. We summarized our findings through a platform that allows for manually exploring simulation results. Overall, we found that an exchange between discovery (e.g., sensitivity, edge weight correlation) and caution (e.g., specificity, precision) should always be expected, and achieving both-which is a requirement for perfect replicability-is difficult. Further, we identified that the estimation method is best chosen in light of each research question and have highlighted, alongside desirable asymptotic properties and low sample size discovery, results according to most common research questions in the field. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"28 4","pages":"925-946"},"PeriodicalIF":7.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10033899","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 33
Bayesian continuous-time hidden Markov models with covariate selection for intensive longitudinal data with measurement error. 针对具有测量误差的密集纵向数据的带有协变量选择的贝叶斯连续时间隐马尔可夫模型。
IF 7.6 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-08-01 Epub Date: 2021-12-20 DOI: 10.1037/met0000433
Mingrui Liang, Matthew D Koslovsky, Emily T Hébert, Darla E Kendzor, Michael S Businelle, Marina Vannucci

Intensive longitudinal data collected with ecological momentary assessment methods capture information on participants' behaviors, feelings, and environment in near real-time. While these methods can reduce recall biases typically present in survey data, they may still suffer from other biases commonly found in self-reported data (e.g., measurement error and social desirability bias). To accommodate potential biases, we develop a Bayesian hidden Markov model to simultaneously identify risk factors for subjects transitioning between discrete latent states as well as risk factors potentially associated with them misreporting their true behaviors. We use simulated data to demonstrate how ignoring potential measurement error can negatively affect variable selection performance and estimation accuracy. We apply our proposed model to smartphone-based ecological momentary assessment data collected within a randomized controlled trial that evaluated the impact of incentivizing abstinence from cigarette smoking among socioeconomically disadvantaged adults. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

使用生态学瞬间评估方法收集的密集纵向数据可近乎实时地获取参与者的行为、感受和环境信息。虽然这些方法可以减少调查数据中通常存在的回忆偏差,但仍可能存在自我报告数据中常见的其他偏差(如测量误差和社会期望偏差)。为了适应潜在的偏差,我们建立了一个贝叶斯隐马尔科夫模型,以同时识别受试者在离散潜伏状态之间转换的风险因素,以及可能与他们误报真实行为相关的风险因素。我们使用模拟数据来证明忽略潜在的测量误差会如何对变量选择性能和估计精度产生负面影响。我们将提出的模型应用于基于智能手机的生态瞬间评估数据,这些数据是在一项随机对照试验中收集的,该试验评估了在社会经济条件较差的成年人中鼓励戒烟的影响。(PsycInfo Database Record (c) 2023 APA, 版权所有)。
{"title":"Bayesian continuous-time hidden Markov models with covariate selection for intensive longitudinal data with measurement error.","authors":"Mingrui Liang, Matthew D Koslovsky, Emily T Hébert, Darla E Kendzor, Michael S Businelle, Marina Vannucci","doi":"10.1037/met0000433","DOIUrl":"10.1037/met0000433","url":null,"abstract":"<p><p>Intensive longitudinal data collected with ecological momentary assessment methods capture information on participants' behaviors, feelings, and environment in near real-time. While these methods can reduce recall biases typically present in survey data, they may still suffer from other biases commonly found in self-reported data (e.g., measurement error and social desirability bias). To accommodate potential biases, we develop a Bayesian hidden Markov model to simultaneously identify risk factors for subjects transitioning between discrete latent states as well as risk factors potentially associated with them misreporting their true behaviors. We use simulated data to demonstrate how ignoring potential measurement error can negatively affect variable selection performance and estimation accuracy. We apply our proposed model to smartphone-based ecological momentary assessment data collected within a randomized controlled trial that evaluated the impact of incentivizing abstinence from cigarette smoking among socioeconomically disadvantaged adults. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"28 4","pages":"880-894"},"PeriodicalIF":7.6,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9207158/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10128719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Bayesian region of measurement equivalence (ROME) approach for establishing measurement invariance. 一种建立测量不变性的贝叶斯测量等价域方法。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-08-01 DOI: 10.1037/met0000455
Yichi Zhang, Mark H C Lai, Gregory J Palardy

Measurement invariance research has focused on identifying biases in test indicators measuring a latent trait across two or more groups. However, relatively little attention has been devoted to the practical implications of noninvariance. An important question is whether noninvariance in indicators or items results in differences in observed composite scores across groups. The current study introduces the Bayesian region of measurement equivalence (ROME) as a framework for visualizing and testing the combined impact of partial invariance on the group difference in observed scores. Under the proposed framework, researchers first compute the highest posterior density intervals (HPDIs)-which contain the most plausible values-for the expected group difference in observed test scores over a range of latent trait levels. By comparing the HPDIs with a predetermined range of values that is practically equivalent to zero (i.e., region of measurement equivalence), researchers can determine whether a test instrument is practically invariant. The proposed ROME method can be used for both continuous indicators and ordinal items. We illustrated ROME using five items measuring mathematics-specific self-efficacy from a nationally representative sample of 10th graders. Whereas conventional invariance testing identifies a partial strict invariance model across gender, the statistically significant noninvariant items were found to have a negligible impact on the comparison of the observed scores. This empirical example demonstrates the utility of the ROME method for assessing practical significance when statistically significant item noninvariance is found. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

测量不变性研究的重点是在两个或多个群体中测量潜在特征的测试指标中识别偏差。然而,对非不变性的实际含义的关注相对较少。一个重要的问题是,指标或项目的不变性是否会导致观察到的组间综合得分的差异。本研究引入贝叶斯测量等效区域(ROME)作为一个框架,用于可视化和测试部分不变性对观察得分组差异的综合影响。在提出的框架下,研究人员首先计算最高后验密度间隔(HPDIs),它包含了最合理的值,用于观察到的测试分数在潜在特征水平范围内的预期组差异。通过将HPDIs与实际等于零的预定值范围(即测量等效区域)进行比较,研究人员可以确定测试仪器是否实际不变。所提出的ROME方法既可用于连续指标,也可用于序数项目。我们从一个具有全国代表性的十年级学生样本中使用五个项目来测量数学特定的自我效能来说明ROME。尽管传统的不变性检验确定了跨性别的部分严格不变性模型,但统计上显著的非不变项被发现对观察得分的比较具有可忽略不计的影响。这个经验例子表明,当发现统计显著项目不变性时,ROME方法用于评估实际意义的效用。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"A Bayesian region of measurement equivalence (ROME) approach for establishing measurement invariance.","authors":"Yichi Zhang,&nbsp;Mark H C Lai,&nbsp;Gregory J Palardy","doi":"10.1037/met0000455","DOIUrl":"https://doi.org/10.1037/met0000455","url":null,"abstract":"<p><p>Measurement invariance research has focused on identifying biases in test indicators measuring a latent trait across two or more groups. However, relatively little attention has been devoted to the practical implications of noninvariance. An important question is whether noninvariance in indicators or items results in differences in observed composite scores across groups. The current study introduces the Bayesian <i>region of measurement equivalence</i> (ROME) as a framework for visualizing and testing the combined impact of partial invariance on the group difference in observed scores. Under the proposed framework, researchers first compute the <i>highest posterior density intervals</i> (HPDIs)-which contain the most plausible values-for the expected group difference in observed test scores over a range of latent trait levels. By comparing the HPDIs with a predetermined range of values that is practically equivalent to zero (i.e., region of measurement equivalence), researchers can determine whether a test instrument is practically invariant. The proposed ROME method can be used for both continuous indicators and ordinal items. We illustrated ROME using five items measuring mathematics-specific self-efficacy from a nationally representative sample of 10th graders. Whereas conventional invariance testing identifies a partial strict invariance model across gender, the statistically significant noninvariant items were found to have a negligible impact on the comparison of the observed scores. This empirical example demonstrates the utility of the ROME method for assessing practical significance when statistically significant item noninvariance is found. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"28 4","pages":"993-1004"},"PeriodicalIF":7.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10409453","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Evaluating predictors' relative importance using Bayes factors in regression models. 在回归模型中使用贝叶斯因子评估预测因子的相对重要性。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-08-01 DOI: 10.1037/met0000431
Xin Gu

This study presents a Bayesian inference approach to evaluate the relative importance of predictors in regression models. Depending on the interpretation of importance, a number of indices are introduced, such as the standardized regression coefficient, the average squared semipartial correlation, and the dominance analysis measure. Researchers' theories about relative importance are represented by order constrained hypotheses. Support for or against the hypothesis is quantified by the Bayes factor, which can be computed from the prior and posterior distributions of the importance index. As the distributions of the indices are often unknown, we specify prior and posterior distributions for the covariance matrix of all variables in the regression model. The prior and posterior distributions of each importance index can be obtained from the prior and posterior samples of the covariance matrix. Simulation studies are conducted to show different inferences resulting from various importance indices and to investigate the performance of the proposed Bayesian testing approach. The procedure of evaluating relative importance using Bayes factors is illustrated using two real data examples. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

本研究提出一种贝叶斯推理方法来评估回归模型中预测因子的相对重要性。根据对重要性的解释,引入了一些指标,如标准化回归系数、平均平方半偏相关和优势分析措施。研究者关于相对重要性的理论表现为顺序约束假设。支持或反对假设是量化的贝叶斯因子,它可以从重要性指数的先验和后验分布计算。由于指标的分布往往是未知的,我们为回归模型中所有变量的协方差矩阵指定了先验和后验分布。从协方差矩阵的先验和后验样本可以得到各重要指标的先验和后验分布。通过仿真研究,展示了不同重要指数所产生的不同推论,并对所提出的贝叶斯测试方法的性能进行了研究。用两个实际数据实例说明了利用贝叶斯因子评价相对重要性的过程。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Evaluating predictors' relative importance using Bayes factors in regression models.","authors":"Xin Gu","doi":"10.1037/met0000431","DOIUrl":"https://doi.org/10.1037/met0000431","url":null,"abstract":"<p><p>This study presents a Bayesian inference approach to evaluate the relative importance of predictors in regression models. Depending on the interpretation of importance, a number of indices are introduced, such as the standardized regression coefficient, the average squared semipartial correlation, and the dominance analysis measure. Researchers' theories about relative importance are represented by order constrained hypotheses. Support for or against the hypothesis is quantified by the Bayes factor, which can be computed from the prior and posterior distributions of the importance index. As the distributions of the indices are often unknown, we specify prior and posterior distributions for the covariance matrix of all variables in the regression model. The prior and posterior distributions of each importance index can be obtained from the prior and posterior samples of the covariance matrix. Simulation studies are conducted to show different inferences resulting from various importance indices and to investigate the performance of the proposed Bayesian testing approach. The procedure of evaluating relative importance using Bayes factors is illustrated using two real data examples. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"28 4","pages":"825-842"},"PeriodicalIF":7.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10408970","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Methods for estimating the sampling variance of the standardized mean difference. 估计标准化平均差抽样方差的方法。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-08-01 DOI: 10.1037/met0000446
Manuel Suero, Juan Botella, Juan I Durán

One of the most widely used effect size indices for meta-analysis in psychology is the standardized mean difference (SMD). The most common way to synthesize a set of estimates of the SMD is to weight them by the inverse of their variances. For this, it is necessary to estimate the corresponding sampling variances. Meta-analysts have a formula for obtaining unbiased estimates of sampling variances, but they often use a variety of alternative, simpler methods. The bias and efficiency of five different methods that have been proposed and that are implemented in different computerized calculation tools are compared and assessed. The data from a set of published meta-analyses are also reanalyzed, calculating the combined estimates and their confidence intervals, as well as estimates of the specific, between-studies variance, using the five estimation methods. This test of sensitivity shows that the results of a meta-analysis can change noticeably depending on the method used to estimate the sampling variance of SMD values, especially under a random-effects model. Some practical recommendations are made about how to choose and implement the methods in calculation resources. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

标准化平均差(SMD)是心理学荟萃分析中应用最广泛的效应大小指标之一。合成一组SMD估计的最常见方法是通过方差的倒数来对它们进行加权。为此,有必要估计相应的抽样方差。元分析者有一个公式来获得抽样方差的无偏估计,但他们经常使用各种替代的、更简单的方法。本文比较和评估了在不同计算机计算工具中提出和实施的五种不同方法的偏倚和效率。我们还重新分析了一组已发表的荟萃分析的数据,计算了综合估计值及其置信区间,以及使用五种估计方法对具体研究间方差的估计。这一敏感性检验表明,元分析的结果可能会因估计SMD值的抽样方差的方法而发生显著变化,特别是在随机效应模型下。对计算资源中方法的选择和实现提出了一些实用的建议。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Methods for estimating the sampling variance of the standardized mean difference.","authors":"Manuel Suero,&nbsp;Juan Botella,&nbsp;Juan I Durán","doi":"10.1037/met0000446","DOIUrl":"https://doi.org/10.1037/met0000446","url":null,"abstract":"<p><p>One of the most widely used effect size indices for meta-analysis in psychology is the <i>standardized mean difference</i> (SMD). The most common way to synthesize a set of estimates of the SMD is to weight them by the inverse of their variances. For this, it is necessary to estimate the corresponding sampling variances. Meta-analysts have a formula for obtaining unbiased estimates of sampling variances, but they often use a variety of alternative, simpler methods. The bias and efficiency of five different methods that have been proposed and that are implemented in different computerized calculation tools are compared and assessed. The data from a set of published meta-analyses are also reanalyzed, calculating the combined estimates and their confidence intervals, as well as estimates of the specific, between-studies variance, using the five estimation methods. This test of sensitivity shows that the results of a meta-analysis can change noticeably depending on the method used to estimate the sampling variance of SMD values, especially under a random-effects model. Some practical recommendations are made about how to choose and implement the methods in calculation resources. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"28 4","pages":"895-904"},"PeriodicalIF":7.0,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10036729","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
How to develop, test, and extend multinomial processing tree models: A tutorial. 如何开发、测试和扩展多项处理树模型:教程。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-27 DOI: 10.1037/met0000561
Oliver Schmidt, Edgar Erdfelder, Daniel W Heck

Many psychological theories assume that observable responses are determined by multiple latent processes. Multinomial processing tree (MPT) models are a class of cognitive models for discrete responses that allow researchers to disentangle and measure such processes. Before applying MPT models to specific psychological theories, it is necessary to tailor a model to specific experimental designs. In this tutorial, we explain how to develop, fit, and test MPT models using the classical pair-clustering model as a running example. The first part covers the required data structures, model equations, identifiability, model validation, maximum-likelihood estimation, hypothesis tests, and power analyses using the software multiTree. The second part introduces hierarchical MPT modeling which allows researchers to account for individual differences and to estimate the correlations of latent processes among each other and with additional covariates using the TreeBUGS package in R. All examples including data and annotated analysis scripts are provided at the Open Science Framework (https://osf.io/24pbm/). (PsycInfo Database Record (c) 2023 APA, all rights reserved).

许多心理学理论认为,可观察到的反应是由多个潜在过程决定的。多项处理树(MPT)模型是一类离散响应的认知模型,允许研究人员解开和测量这些过程。在将MPT模型应用于具体的心理学理论之前,有必要根据具体的实验设计来定制模型。在本教程中,我们将使用经典的配对聚类模型作为运行示例,解释如何开发、拟合和测试MPT模型。第一部分涵盖了所需的数据结构、模型方程、可识别性、模型验证、最大似然估计、假设检验和使用软件multiTree的功率分析。第二部分介绍了分层MPT建模,它允许研究人员使用r中的TreeBUGS包来解释个体差异,并估计潜在过程之间以及与其他协变量之间的相关性。所有示例包括数据和注释分析脚本都在开放科学框架(https://osf.io/24pbm/)上提供。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"How to develop, test, and extend multinomial processing tree models: A tutorial.","authors":"Oliver Schmidt,&nbsp;Edgar Erdfelder,&nbsp;Daniel W Heck","doi":"10.1037/met0000561","DOIUrl":"https://doi.org/10.1037/met0000561","url":null,"abstract":"<p><p>Many psychological theories assume that observable responses are determined by multiple latent processes. Multinomial processing tree (MPT) models are a class of cognitive models for discrete responses that allow researchers to disentangle and measure such processes. Before applying MPT models to specific psychological theories, it is necessary to tailor a model to specific experimental designs. In this tutorial, we explain how to develop, fit, and test MPT models using the classical pair-clustering model as a running example. The first part covers the required data structures, model equations, identifiability, model validation, maximum-likelihood estimation, hypothesis tests, and power analyses using the software multiTree. The second part introduces hierarchical MPT modeling which allows researchers to account for individual differences and to estimate the correlations of latent processes among each other and with additional covariates using the TreeBUGS package in R. All examples including data and annotated analysis scripts are provided at the Open Science Framework (https://osf.io/24pbm/). (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9882826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Random item slope regression: An alternative measurement model that accounts for both similarities and differences in association with individual items. 随机项目斜率回归:一种替代的测量模型,它可以解释与单个项目相关的相似性和差异性。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-27 DOI: 10.1037/met0000587
Ed Donnellan, Satoshi Usami, Kou Murayama

In psychology, researchers often predict a dependent variable (DV) consisting of multiple measurements (e.g., scale items measuring a concept). To analyze the data, researchers typically aggregate (sum/average) scores across items and use this as a DV. Alternatively, they may define the DV as a common factor using structural equation modeling. However, both approaches neglect the possibility that an independent variable (IV) may have different relationships to individual items. This variance in individual item slopes arises because items are randomly sampled from an infinite pool of items reflecting the construct that the scale purports to measure. Here, we offer a mixed-effects model called random item slope regression, which accounts for both similarities and differences of individual item associations. Critically, we argue that random item slope regression poses an alternative measurement model to common factor models prevalent in psychology. Unlike these models, the proposed model supposes no latent constructs and instead assumes that individual items have direct causal relationships with the IV. Such operationalization is especially useful when researchers want to assess a broad construct with heterogeneous items. Using mathematical proof and simulation, we demonstrate that random item slopes cause inflation of Type I error when not accounted for, particularly when the sample size (number of participants) is large. In real-world data (n = 564 participants) using commonly used surveys and two reaction time tasks, we demonstrate that random item slopes are present at problematic levels. We further demonstrate that common statistical indices are not sufficient to diagnose the presence of random item slopes. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

在心理学中,研究人员经常预测一个由多个测量组成的因变量(DV)(例如,测量一个概念的量表项目)。为了分析数据,研究人员通常会汇总(总和/平均值)各个项目的分数,并将其用作DV。或者,他们可以使用结构方程建模将DV定义为一个公共因素。然而,这两种方法都忽略了自变量(IV)可能与单个项目有不同关系的可能性。单个项目斜率的差异之所以产生,是因为项目是从反映量表旨在测量的结构的无限项目池中随机抽样的。在这里,我们提供了一种混合效应模型,称为随机项目斜率回归,它同时考虑了单个项目关联的相似性和差异性。关键的是,我们认为随机项目斜率回归提出了一种替代测量模型,以共同因素模型普遍存在于心理学。与这些模型不同,所提出的模型不假设潜在构念,而是假设单个项目与IV有直接的因果关系。当研究人员想要评估具有异质项目的广泛构念时,这种操作化特别有用。使用数学证明和模拟,我们证明了随机项目斜率在未考虑的情况下会导致I型误差膨胀,特别是当样本量(参与者数量)很大时。在使用常用调查和两个反应时间任务的真实数据(n = 564名参与者)中,我们证明了随机项目斜率存在于问题水平。我们进一步证明了常用的统计指标不足以诊断随机项目斜率的存在。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Random item slope regression: An alternative measurement model that accounts for both similarities and differences in association with individual items.","authors":"Ed Donnellan,&nbsp;Satoshi Usami,&nbsp;Kou Murayama","doi":"10.1037/met0000587","DOIUrl":"https://doi.org/10.1037/met0000587","url":null,"abstract":"<p><p>In psychology, researchers often predict a dependent variable (DV) consisting of multiple measurements (e.g., scale items measuring a concept). To analyze the data, researchers typically aggregate (sum/average) scores across items and use this as a DV. Alternatively, they may define the DV as a common factor using structural equation modeling. However, both approaches neglect the possibility that an independent variable (IV) may have different relationships to individual items. This variance in individual item slopes arises because items are randomly sampled from an infinite pool of items reflecting the construct that the scale purports to measure. Here, we offer a mixed-effects model called <i>random item slope regression,</i> which accounts for both similarities and differences of individual item associations. Critically, we argue that random item slope regression poses an alternative measurement model to common factor models prevalent in psychology. Unlike these models, the proposed model supposes no latent constructs and instead assumes that individual items have direct causal relationships with the IV. Such operationalization is especially useful when researchers want to assess a broad construct with heterogeneous items. Using mathematical proof and simulation, we demonstrate that random item slopes cause inflation of Type I error when not accounted for, particularly when the sample size (number of participants) is large. In real-world data (<i>n</i> = 564 participants) using commonly used surveys and two reaction time tasks, we demonstrate that random item slopes are present at problematic levels. We further demonstrate that common statistical indices are not sufficient to diagnose the presence of random item slopes. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10259360","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Correspondence measures for assessing replication success. 用于评估复制成功的对应度量。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-27 DOI: 10.1037/met0000597
Peter M Steiner, Patrick Sheehan, Vivian C Wong

Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

鉴于最近的证据对社会和行为科学结果的可复制性提出了挑战,在比较不同研究的效果估计时,确定复制成功的适当措施提出了关键问题。争论的焦点在于,关于复制成功与否的结论往往取决于用于评估结果一致性的测量方法。尽管选择适当的措施很重要,但对于应该使用哪些措施仍然没有广泛的共识。本文通过正式描述用于评估复制成功的最常用度量,并根据它们的复制概率(即给定特定研究设置的获得复制成功的概率)比较它们在不同上下文中的性能,来解决这些问题。这些措施可以被广泛地描述为基于结论的方法,评估两个独立研究关于效应存在的结论的一致性,以及基于距离的方法,测试两个效应估计的显着差异或等效性。我们还引入了一种评估复制成功的新方法,称为对应测试,它在同一框架中结合了差异测试和等效测试。为了帮助研究人员计划前瞻性的复制工作,我们提供了功率计算的封闭公式,可用于确定每个研究的最小可检测效应大小(从而确定样本量),从而可以实现预定的最小复制概率。最后,我们使用开放科学协作(2015)的复制数据集来证明关于复制成功的结论在多大程度上取决于所选择的对应度量。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Correspondence measures for assessing replication success.","authors":"Peter M Steiner,&nbsp;Patrick Sheehan,&nbsp;Vivian C Wong","doi":"10.1037/met0000597","DOIUrl":"https://doi.org/10.1037/met0000597","url":null,"abstract":"<p><p>Given recent evidence challenging the replicability of results in the social and behavioral sciences, critical questions have been raised about appropriate measures for determining replication success in comparing effect estimates across studies. At issue is the fact that conclusions about replication success often depend on the measure used for evaluating correspondence in results. Despite the importance of choosing an appropriate measure, there is still no widespread agreement about which measures should be used. This article addresses these questions by describing formally the most commonly used measures for assessing replication success, and by comparing their performance in different contexts according to their replication probabilities-that is, the probability of obtaining replication success given study-specific settings. The measures may be characterized broadly as conclusion-based approaches, which assess the congruence of two independent studies' conclusions about the presence of an effect, and distance-based approaches, which test for a significant difference or equivalence of two effect estimates. We also introduce a new measure for assessing replication success called the correspondence test, which combines a difference and equivalence test in the same framework. To help researchers plan prospective replication efforts, we provide closed formulas for power calculations that can be used to determine the minimum detectable effect size (and thus, sample sizes) for each study so that a predetermined minimum replication probability can be achieved. Finally, we use a replication data set from the Open Science Collaboration (2015) to demonstrate the extent to which conclusions about replication success depend on the correspondence measure selected. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10259359","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On estimating the frequency of a target behavior from time-constrained yes/no survey questions: A parametric approach based on the Poisson process. 从时间约束的是/否调查问题中估计目标行为的频率:基于泊松过程的参数化方法。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-20 DOI: 10.1037/met0000588
Benedikt Iberl, Rolf Ulrich

We propose a novel method to analyze time-constrained yes/no questions about a target behavior (e.g., "Did you take sleeping pills during the last 12 months?"). A drawback of these questions is that the relative frequency of answering these questions with "yes" does not allow one to draw definite conclusions about the frequency of the target behavior (i.e., how often sleeping pills were taken) nor about the prevalence of trait carriers (i.e., percentage of people that take sleeping pills). Here we show how this information can be extracted from the results of such questions employing a prevalence curve and a Poisson model. The applicability of the method was evaluated with a survey on everyday behavior, which revealed plausible results and reasonable model fit. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

我们提出了一种新的方法来分析关于目标行为的有时间限制的是/否问题(例如,“你在过去的12个月里吃过安眠药吗?”)。这些问题的一个缺点是,回答“是”的相对频率不能让一个人对目标行为的频率(即,服用安眠药的频率)或特质携带者的流行程度(即,服用安眠药的人的百分比)得出明确的结论。在这里,我们展示了如何利用流行曲线和泊松模型从这些问题的结果中提取这些信息。通过对日常行为的调查评估了该方法的适用性,结果表明该方法的结果合理,模型拟合合理。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"On estimating the frequency of a target behavior from time-constrained yes/no survey questions: A parametric approach based on the Poisson process.","authors":"Benedikt Iberl,&nbsp;Rolf Ulrich","doi":"10.1037/met0000588","DOIUrl":"https://doi.org/10.1037/met0000588","url":null,"abstract":"<p><p>We propose a novel method to analyze time-constrained yes/no questions about a target behavior (e.g., \"Did you take sleeping pills during the last 12 months?\"). A drawback of these questions is that the relative frequency of answering these questions with \"yes\" does not allow one to draw definite conclusions about the frequency of the target behavior (i.e., how often sleeping pills were taken) nor about the prevalence of trait carriers (i.e., percentage of people that take sleeping pills). Here we show how this information can be extracted from the results of such questions employing a prevalence curve and a Poisson model. The applicability of the method was evaluated with a survey on everyday behavior, which revealed plausible results and reasonable model fit. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9838249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing predictive power by unamalgamating multi-item scales. 通过不合并多项目量表来增强预测能力。
IF 7 1区 心理学 Q1 PSYCHOLOGY, MULTIDISCIPLINARY Pub Date : 2023-07-20 DOI: 10.1037/met0000599
David Trafimow, Michael R Hyman, Alena Kostyk

The generally small but touted as "statistically significant" correlation coefficients in the social sciences jeopardize theory testing and prediction. To investigate these small coefficients' underlying causes, traditional equations such as Spearman's (1904) classic attenuation formula, Cronbach's (1951) alpha, and Guilford and Fruchter's (1973) equation for the effect of additional items on a scale's predictive power are considered. These equations' implications differ regarding large interitem correlations enhancing or diminishing predictive power. Contrary to conventional practice, such correlations decrease predictive power when treating items as multi-item scale components but can increase predictive power when treating items separately. The implications are wide-ranging. (PsycInfo Database Record (c) 2023 APA, all rights reserved).

在社会科学中,通常较小但被吹捧为“统计显著”的相关系数危及理论检验和预测。为了研究这些小系数的潜在原因,考虑了传统方程,如Spearman(1904)的经典衰减公式,Cronbach(1951)的alpha,以及Guilford和Fruchter(1973)的附加项对量表预测能力影响的方程。这些方程的含义不同于大的项目间相关性,增强或减弱预测能力。与传统做法相反,当将项目作为多项目量表组件处理时,这种相关性会降低预测能力,但当单独处理项目时,这种相关性会增加预测能力。其影响是广泛的。(PsycInfo数据库记录(c) 2023 APA,版权所有)。
{"title":"Enhancing predictive power by unamalgamating multi-item scales.","authors":"David Trafimow,&nbsp;Michael R Hyman,&nbsp;Alena Kostyk","doi":"10.1037/met0000599","DOIUrl":"https://doi.org/10.1037/met0000599","url":null,"abstract":"<p><p>The generally small but touted as \"statistically significant\" correlation coefficients in the social sciences jeopardize theory testing and prediction. To investigate these small coefficients' underlying causes, traditional equations such as Spearman's (1904) classic attenuation formula, Cronbach's (1951) alpha, and Guilford and Fruchter's (1973) equation for the effect of additional items on a scale's predictive power are considered. These equations' implications differ regarding large interitem correlations enhancing or diminishing predictive power. Contrary to conventional practice, such correlations decrease predictive power when treating items as multi-item scale components but can increase predictive power when treating items separately. The implications are wide-ranging. (PsycInfo Database Record (c) 2023 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.0,"publicationDate":"2023-07-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9838250","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Psychological methods
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1