首页 > 最新文献

Multivariate Behavioral Research最新文献

英文 中文
Unique Variable Analysis: A Network Psychometrics Method to Detect Local Dependence. 独特变量分析:一种检测局部依赖性的网络心理计量学方法。
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-11-01 Epub Date: 2023-05-04 DOI: 10.1080/00273171.2023.2194606
Alexander P Christensen, Luis Eduardo Garrido, Hudson Golino

The local independence assumption states that variables are unrelated after conditioning on a latent variable. Common problems that arise from violations of this assumption include model misspecification, biased model parameters, and inaccurate estimates of internal structure. These problems are not limited to latent variable models but also apply to network psychometrics. This paper proposes a novel network psychometric approach to detect locally dependent pairs of variables using network modeling and a graph theory measure called weighted topological overlap (wTO). Using simulation, this approach is compared to contemporary local dependence detection methods such as exploratory structural equation modeling with standardized expected parameter change and a recently developed approach using partial correlations and a resampling procedure. Different approaches to determine local dependence using statistical significance and cutoff values are also compared. Continuous, polytomous (5-point Likert scale), and dichotomous (binary) data were generated with skew across a variety of conditions. Our results indicate that cutoff values work better than significance approaches. Overall, the network psychometrics approaches using wTO with graphical least absolute shrinkage and selector operator with extended Bayesian information criterion and wTO with Bayesian Gaussian graphical model were the best performing local dependence detection methods overall.

局部独立假设是指变量在以潜在变量为条件后是不相关的。违反这一假设所产生的常见问题包括模型不规范、模型参数有偏差以及对内部结构的估计不准确。这些问题不仅限于潜变量模型,也适用于网络心理测量学。本文提出了一种新颖的网络心理测量方法,利用网络建模和一种称为加权拓扑重叠(wTO)的图论测量方法来检测局部依赖变量对。通过模拟,本文将这种方法与当代的局部依赖性检测方法进行了比较,如采用标准化预期参数变化的探索性结构方程建模,以及最近开发的一种使用偏相关性和重采样程序的方法。此外,还比较了使用统计显著性和截断值确定局部依赖性的不同方法。我们生成了连续、多态(5 点李克特量表)和二态(二进制)数据,这些数据在各种条件下都有偏差。结果表明,截止值比显著性方法更有效。总体而言,使用具有图形最小绝对收缩的 wTO 和具有扩展贝叶斯信息准则的选择算子的网络心理计量学方法,以及具有贝叶斯高斯图形模型的 wTO 是总体上表现最好的局部依赖性检测方法。
{"title":"Unique Variable Analysis: A Network Psychometrics Method to Detect Local Dependence.","authors":"Alexander P Christensen, Luis Eduardo Garrido, Hudson Golino","doi":"10.1080/00273171.2023.2194606","DOIUrl":"10.1080/00273171.2023.2194606","url":null,"abstract":"<p><p>The local independence assumption states that variables are unrelated after conditioning on a latent variable. Common problems that arise from violations of this assumption include model misspecification, biased model parameters, and inaccurate estimates of internal structure. These problems are not limited to latent variable models but also apply to network psychometrics. This paper proposes a novel network psychometric approach to detect locally dependent pairs of variables using network modeling and a graph theory measure called weighted topological overlap (wTO). Using simulation, this approach is compared to contemporary local dependence detection methods such as exploratory structural equation modeling with standardized expected parameter change and a recently developed approach using partial correlations and a resampling procedure. Different approaches to determine local dependence using statistical significance and cutoff values are also compared. Continuous, polytomous (5-point Likert scale), and dichotomous (binary) data were generated with skew across a variety of conditions. Our results indicate that cutoff values work better than significance approaches. Overall, the network psychometrics approaches using wTO with graphical least absolute shrinkage and selector operator with extended Bayesian information criterion and wTO with Bayesian Gaussian graphical model were the best performing local dependence detection methods overall.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9406819","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Reorienting Latent Variable Modeling for Supervised Learning. 面向监督学习的潜在变量建模。
IF 5.3 3区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 Epub Date: 2023-05-25 DOI: 10.1080/00273171.2023.2182753
Booil Jo, Trevor J Hastie, Zetan Li, Eric A Youngstrom, Robert L Findling, Sarah McCue Horwitz

Despite its potentials benefits, using prediction targets generated based on latent variable (LV) modeling is not a common practice in supervised learning, a dominating framework for developing prediction models. In supervised learning, it is typically assumed that the outcome to be predicted is clear and readily available, and therefore validating outcomes before predicting them is a foreign concept and an unnecessary step. The usual goal of LV modeling is inference, and therefore using it in supervised learning and in the prediction context requires a major conceptual shift. This study lays out methodological adjustments and conceptual shifts necessary for integrating LV modeling into supervised learning. It is shown that such integration is possible by combining the traditions of LV modeling, psychometrics, and supervised learning. In this interdisciplinary learning framework, generating practical outcomes using LV modeling and systematically validating them based on clinical validators are the two main strategies. In the example using the data from the Longitudinal Assessment of Manic Symptoms (LAMS) Study, a large pool of candidate outcomes is generated by flexible LV modeling. It is demonstrated that this exploratory situation can be used as an opportunity to tailor desirable prediction targets taking advantage of contemporary science and clinical insights.

尽管有潜在的好处,但使用基于潜变量(LV)建模生成的预测目标在监督学习中并不常见,而监督学习是开发预测模型的主要框架。在监督学习中,通常假设要预测的结果是清晰且容易获得的,因此在预测结果之前验证结果是一个陌生的概念,也是一个不必要的步骤。LV建模的通常目标是推理,因此在监督学习和预测上下文中使用它需要一个重大的概念转变。本研究提出了将LV模型整合到监督学习中所必需的方法调整和概念转变。研究表明,通过结合传统的LV建模、心理测量学和监督学习,这种整合是可能的。在这个跨学科的学习框架中,使用LV建模和基于临床验证器系统地验证产生实际结果是两个主要策略。在使用躁狂症状纵向评估(LAMS)研究数据的示例中,通过灵活的LV建模生成了大量候选结果。这表明,这种探索性的情况可以作为一个机会,利用当代科学和临床见解来定制理想的预测目标。
{"title":"Reorienting Latent Variable Modeling for Supervised Learning.","authors":"Booil Jo, Trevor J Hastie, Zetan Li, Eric A Youngstrom, Robert L Findling, Sarah McCue Horwitz","doi":"10.1080/00273171.2023.2182753","DOIUrl":"10.1080/00273171.2023.2182753","url":null,"abstract":"<p><p>Despite its potentials benefits, using prediction targets generated based on latent variable (LV) modeling is not a common practice in supervised learning, a dominating framework for developing prediction models. In supervised learning, it is typically assumed that the outcome to be predicted is clear and readily available, and therefore validating outcomes before predicting them is a foreign concept and an unnecessary step. The usual goal of LV modeling is inference, and therefore using it in supervised learning and in the prediction context requires a major conceptual shift. This study lays out methodological adjustments and conceptual shifts necessary for integrating LV modeling into supervised learning. It is shown that such integration is possible by combining the traditions of LV modeling, psychometrics, and supervised learning. In this interdisciplinary learning framework, generating practical outcomes using LV modeling and systematically validating them based on clinical validators are the two main strategies. In the example using the data from the Longitudinal Assessment of Manic Symptoms (LAMS) Study, a large pool of candidate outcomes is generated by flexible LV modeling. It is demonstrated that this exploratory situation can be used as an opportunity to tailor desirable prediction targets taking advantage of contemporary science and clinical insights.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":5.3,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10674034/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9524122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
betaDelta and betaSandwich: Confidence Intervals for Standardized Regression Coefficients in R. betaDelta 和 betaSandwich:R 中标准化回归系数的置信区间。
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-11-01 Epub Date: 2023-04-25 DOI: 10.1080/00273171.2023.2201277
Ivan Jacob Agaloos Pesigan, Rong Wei Sun, Shu Fai Cheung

The multivariate delta method was used by Yuan and Chan to estimate standard errors and confidence intervals for standardized regression coefficients. Jones and Waller extended the earlier work to situations where data are nonnormal by utilizing Browne's asymptotic distribution-free (ADF) theory. Furthermore, Dudgeon developed standard errors and confidence intervals, employing heteroskedasticity-consistent (HC) estimators, that are robust to nonnormality with better performance in smaller sample sizes compared to Jones and Waller's ADF technique. Despite these advancements, empirical research has been slow to adopt these methodologies. This can be a result of the dearth of user-friendly software programs to put these techniques to use. We present the betaDelta and the betaSandwich packages in the R statistical software environment in this manuscript. Both the normal-theory approach and the ADF approach put forth by Yuan and Chan and Jones and Waller are implemented by the betaDelta package. The HC approach proposed by Dudgeon is implemented by the betaSandwich package. The use of the packages is demonstrated with an empirical example. We think the packages will enable applied researchers to accurately assess the sampling variability of standardized regression coefficients.

Yuan 和 Chan 使用多元三角法估算标准化回归系数的标准误差和置信区间。Jones 和 Waller 利用 Browne 的无渐近分布 (ADF) 理论,将先前的工作扩展到了数据非正态分布的情况。此外,Dudgeon 利用异方差一致(HC)估计器开发了标准误差和置信区间,与 Jones 和 Waller 的 ADF 技术相比,这些估计器对非正态性具有稳健性,在样本量较小的情况下性能更好。尽管取得了这些进步,但实证研究在采用这些方法方面进展缓慢。这可能是由于缺乏用户友好的软件程序来使用这些技术。我们在本手稿中介绍了 R 统计软件环境中的 betaDelta 和 betaSandwich 软件包。Yuan和Chan以及Jones和Waller提出的正态理论方法和ADF方法都由betaDelta软件包实现。Dudgeon 提出的 HC 方法由 betaSandwich 软件包实现。我们通过一个实证例子演示了软件包的使用。我们认为这些软件包可以帮助应用研究人员准确评估标准化回归系数的抽样变异性。
{"title":"betaDelta and betaSandwich: Confidence Intervals for Standardized Regression Coefficients in R.","authors":"Ivan Jacob Agaloos Pesigan, Rong Wei Sun, Shu Fai Cheung","doi":"10.1080/00273171.2023.2201277","DOIUrl":"10.1080/00273171.2023.2201277","url":null,"abstract":"<p><p>The multivariate delta method was used by Yuan and Chan to estimate standard errors and confidence intervals for standardized regression coefficients. Jones and Waller extended the earlier work to situations where data are nonnormal by utilizing Browne's asymptotic distribution-free (ADF) theory. Furthermore, Dudgeon developed standard errors and confidence intervals, employing heteroskedasticity-consistent (HC) estimators, that are robust to nonnormality with better performance in smaller sample sizes compared to Jones and Waller's ADF technique. Despite these advancements, empirical research has been slow to adopt these methodologies. This can be a result of the dearth of user-friendly software programs to put these techniques to use. We present the betaDelta and the betaSandwich packages in the R statistical software environment in this manuscript. Both the normal-theory approach and the ADF approach put forth by Yuan and Chan and Jones and Waller are implemented by the betaDelta package. The HC approach proposed by Dudgeon is implemented by the betaSandwich package. The use of the packages is demonstrated with an empirical example. We think the packages will enable applied researchers to accurately assess the sampling variability of standardized regression coefficients.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9986115","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
On the Common but Problematic Specification of Conflated Random Slopes in Multilevel Models. 论多层次模型中常见但有问题的串联随机斜率规范。
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-11-01 Epub Date: 2023-04-10 DOI: 10.1080/00273171.2023.2174490
Jason D Rights, Sonya K Sterba

For multilevel models (MLMs) with fixed slopes, it has been widely recognized that a level-1 variable can have distinct between-cluster and within-cluster fixed effects, and that failing to disaggregate these effects yields a conflated, uninterpretable fixed effect. For MLMs with random slopes, however, we clarify that two different types of slope conflation can occur: that of the fixed component (termed fixed conflation) and that of the random component (termed random conflation). The latter is rarely recognized and not well understood. Here we explain that a model commonly used to disaggregate the fixed component-the contextual effect model with random slopes-troublingly still yields a conflated random component. Negative consequences of such random conflation have not been demonstrated. Here we show that they include erroneous interpretation and inferences about the substantively important extent of between-cluster differences in slopes, including either underestimating or overestimating such slope heterogeneity. Furthermore, we show that this random conflation can yield inappropriate standard errors for fixed effects. To aid researchers in practice, we delineate which types of random slope specifications yield an unconflated random component. We demonstrate the advantages of these unconflated models in terms of estimating and testing random slope variance (i.e., improved power, Type I error, and bias) and in terms of standard error estimation for fixed effects (i.e., more accurate standard errors), and make recommendations for which specifications to use for particular research purposes.

对于具有固定斜率的多层次模型(MLMs)来说,人们普遍认为第一层次变量可能具有明显的群间固定效应和群内固定效应,如果不对这些效应进行分解,就会产生混淆的、难以解释的固定效应。然而,对于具有随机斜率的多变量模型,我们要澄清的是,可能会出现两种不同类型的斜率混淆:固定分量的斜率混淆(称为固定混淆)和随机分量的斜率混淆(称为随机混淆)。后者很少被认识到,也没有得到很好的理解。在这里,我们解释了一个常用于分解固定成分的模型--具有随机斜率的背景效应模型--令人不安的是,它仍然会产生混淆的随机成分。这种随机混淆的负面影响尚未得到证实。在这里,我们将证明这些负面影响包括对群组间斜率差异的重要程度的错误解释和推断,包括低估或高估这种斜率异质性。此外,我们还表明,这种随机混淆会产生不恰当的固定效应标准误差。为了在实践中帮助研究人员,我们划分了哪些类型的随机斜率规范会产生非膨胀随机成分。我们展示了这些非膨胀模型在估计和检验随机斜率方差(即改进的功率、I 类误差和偏差)以及固定效应的标准误差估计(即更准确的标准误差)方面的优势,并就特定研究目的使用哪种规范提出了建议。
{"title":"On the Common but Problematic Specification of Conflated Random Slopes in Multilevel Models.","authors":"Jason D Rights, Sonya K Sterba","doi":"10.1080/00273171.2023.2174490","DOIUrl":"10.1080/00273171.2023.2174490","url":null,"abstract":"<p><p>For multilevel models (MLMs) with fixed slopes, it has been widely recognized that a level-1 variable can have distinct between-cluster and within-cluster fixed effects, and that failing to disaggregate these effects yields a conflated, uninterpretable fixed effect. For MLMs with random slopes, however, we clarify that two different types of slope conflation can occur: that of the fixed component (termed fixed conflation) and that of the random component (termed random conflation). The latter is rarely recognized and not well understood. Here we explain that a model commonly used to disaggregate the fixed component-the contextual effect model with random slopes-troublingly still yields a conflated random component. Negative consequences of such random conflation have not been demonstrated. Here we show that they include erroneous interpretation and inferences about the substantively important extent of between-cluster differences in slopes, including either underestimating or overestimating such slope heterogeneity. Furthermore, we show that this random conflation can yield inappropriate standard errors for fixed effects. To aid researchers in practice, we delineate which types of random slope specifications yield an unconflated random component. We demonstrate the advantages of these unconflated models in terms of estimating and testing random slope variance (i.e., improved power, Type I error, and bias) and in terms of standard error estimation for fixed effects (i.e., more accurate standard errors), and make recommendations for which specifications to use for particular research purposes.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9642680","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Pay Attention to the Ignorable Missing Data Mechanisms! An Exploration of Their Impact on the Efficiency of Regression Coefficients. 关注不可忽略的缺失数据机制!探讨其对回归系数效率的影响
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-11-01 Epub Date: 2023-04-11 DOI: 10.1080/00273171.2023.2193600
Lihan Chen, Victoria Savalei, Mijke Rhemtulla

The use of modern missing data techniques has become more prevalent with their increasing accessibility in statistical software. These techniques focus on handling data that are missing at random (MAR). Although all MAR mechanisms are routinely treated as the same, they are not equal. The impact of missing data on the efficiency of parameter estimates can differ for different MAR variations, even when the amount of missing data is held constant; yet, in current practice, only the rate of missing data is reported. The impact of MAR on the loss of efficiency can instead be more directly measured by the fraction of missing information (FMI). In this article, we explore this impact using FMIs in regression models with one and two predictors. With the help of a Shiny application, we demonstrate that efficiency loss due to missing data can be highly complex and is not always intuitive. We recommend substantive researchers who work with missing data report estimates of FMIs in addition to the rate of missingness. We also encourage methodologists to examine FMIs when designing simulation studies with missing data, and to explore the behavior of efficiency loss under MAR using FMIs in more complex models.

随着统计软件的日益普及,现代缺失数据技术的使用也越来越普遍。这些技术侧重于处理随机缺失数据(MAR)。虽然所有的随机缺失机制通常都被视为相同的,但它们并不相同。即使在缺失数据量保持不变的情况下,缺失数据对参数估计效率的影响也会因不同的 MAR 变化而不同;但在目前的实践中,只报告缺失数据率。MAR 对效率损失的影响可以通过缺失信息的比例 (FMI) 更直接地衡量。在本文中,我们将利用单预测因子和双预测因子回归模型中的 FMIs 来探讨这种影响。在 Shiny 应用程序的帮助下,我们证明了数据缺失导致的效率损失可能非常复杂,而且并不总是直观的。我们建议研究人员在处理缺失数据时,除了报告缺失率外,还要报告 FMIs 的估计值。我们还鼓励方法论专家在设计有缺失数据的模拟研究时检查 FMIs,并在更复杂的模型中使用 FMIs 探索 MAR 下的效率损失行为。
{"title":"Pay Attention to the Ignorable Missing Data Mechanisms! An Exploration of Their Impact on the Efficiency of Regression Coefficients.","authors":"Lihan Chen, Victoria Savalei, Mijke Rhemtulla","doi":"10.1080/00273171.2023.2193600","DOIUrl":"10.1080/00273171.2023.2193600","url":null,"abstract":"<p><p>The use of modern missing data techniques has become more prevalent with their increasing accessibility in statistical software. These techniques focus on handling data that are <i>missing at random</i> (MAR). Although all MAR mechanisms are routinely treated as the same, they are not equal. The impact of missing data on the efficiency of parameter estimates can differ for different MAR variations, even when the amount of missing data is held constant; yet, in current practice, only the rate of missing data is reported. The impact of MAR on the loss of efficiency can instead be more directly measured by the <i>fraction of missing information</i> (FMI). In this article, we explore this impact using FMIs in regression models with one and two predictors. With the help of a <i>Shiny</i> application, we demonstrate that efficiency loss due to missing data can be highly complex and is not always intuitive. We recommend substantive researchers who work with missing data report estimates of FMIs in addition to the rate of missingness. We also encourage methodologists to examine FMIs when designing simulation studies with missing data, and to explore the behavior of efficiency loss under MAR using FMIs in more complex models.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9273348","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploratory Bi-factor Analysis with Multiple General Factors. 具有多个一般因素的探索性双因素分析。
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-11-01 Epub Date: 2023-04-10 DOI: 10.1080/00273171.2023.2189571
Marcos Jiménez, Francisco J Abad, Eduardo Garcia-Garzon, Luis Eduardo Garrido

Exploratory bi-factor analysis (EBFA) is a very popular approach to estimate models where specific factors are concomitant to a single, general dimension. However, the models typically encountered in fields like personality, intelligence, and psychopathology involve more than one general factor. To address this circumstance, we developed an algorithm (GSLiD) based on partially specified targets to perform exploratory bi-factor analysis with multiple general factors (EBFA-MGF). In EBFA-MGF, researchers do not need to conduct independent bi-factor analyses anymore because several bi-factor models are estimated simultaneously in an exploratory manner, guarding against biased estimates and model misspecification errors due to unexpected cross-loadings and factor correlations. The results from an exhaustive Monte Carlo simulation manipulating nine variables of interest suggested that GSLiD outperforms the Schmid-Leiman approximation and is robust to challenging conditions involving cross-loadings and pure items of the general factors. Thereby, we supply an R package (bifactor) to make EBFA-MGF readily available for substantive research. Finally, we use GSLiD to assess the hierarchical structure of a reduced version of the Personality Inventory for DSM-5 Short Form (PID-5-SF).

探索性双因素分析(EBFA)是一种非常流行的估算模型的方法,在这种模型中,特定的因素与单一的一般维度相关联。然而,在人格、智力和精神病理学等领域通常会遇到涉及不止一个一般因素的模型。针对这种情况,我们开发了一种基于部分指定目标的算法(GSLiD),用于执行具有多个一般因素的探索性双因素分析(EBFA-MGF)。在 EBFA-MGF 中,研究人员不需要再进行独立的双因素分析,因为多个双因素模型会以探索性的方式同时进行估计,从而避免了由于意外的交叉负荷和因素相关性而导致的估计偏差和模型指定错误。对九个相关变量进行的详尽蒙特卡罗模拟结果表明,GSLiD 的性能优于 Schmid-Leiman 近似方法,并且对涉及交叉负荷和一般因子纯项的挑战性条件具有鲁棒性。因此,我们提供了一个 R 软件包(bifactor),使 EBFA-MGF 可随时用于实质性研究。最后,我们使用 GSLiD 评估了简化版 DSM-5 人格问卷简表(PID-5-SF)的层次结构。
{"title":"Exploratory Bi-factor Analysis with Multiple General Factors.","authors":"Marcos Jiménez, Francisco J Abad, Eduardo Garcia-Garzon, Luis Eduardo Garrido","doi":"10.1080/00273171.2023.2189571","DOIUrl":"10.1080/00273171.2023.2189571","url":null,"abstract":"<p><p>Exploratory bi-factor analysis (EBFA) is a very popular approach to estimate models where specific factors are concomitant to a single, general dimension. However, the models typically encountered in fields like personality, intelligence, and psychopathology involve more than one general factor. To address this circumstance, we developed an algorithm (GSLiD) based on partially specified targets to perform exploratory bi-factor analysis with multiple general factors (EBFA-MGF). In EBFA-MGF, researchers do not need to conduct independent bi-factor analyses anymore because several bi-factor models are estimated simultaneously in an exploratory manner, guarding against biased estimates and model misspecification errors due to unexpected cross-loadings and factor correlations. The results from an exhaustive Monte Carlo simulation manipulating nine variables of interest suggested that GSLiD outperforms the Schmid-Leiman approximation and is robust to challenging conditions involving cross-loadings and pure items of the general factors. Thereby, we supply an R package (bifactor) to make EBFA-MGF readily available for substantive research. Finally, we use GSLiD to assess the hierarchical structure of a reduced version of the Personality Inventory for DSM-5 Short Form (PID-5-SF).</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9642682","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Which is Better for Individual Participant Data Meta-Analysis of Zero-Inflated Count Outcomes, One-Step or Two-Step Analysis? A Simulation Study. 零膨胀计数结果的个体参与者数据元分析、一步分析和两步分析哪个更好?模拟研究。
IF 5.3 3区 心理学 Q1 MATHEMATICS, INTERDISCIPLINARY APPLICATIONS Pub Date : 2023-11-01 Epub Date: 2023-03-23 DOI: 10.1080/00273171.2023.2173135
David Huh, Scott A Baldwin, Zhengyang Zhou, Joonsuk Park, Eun-Young Mun

Meta-analysis using individual participant data (IPD) is an important methodology in intervention research because it (a) increases accuracy and precision of estimates, (b) allows researchers to investigate mediators and moderators of treatment effects, and (c) makes use of extant data. IPD meta-analysis can be conducted either via a one-step approach that uses data from all studies simultaneously, or a two-step approach, which aggregates data for each study and then combines them in a traditional meta-analysis model. Unfortunately, there are no evidence-based guidelines for how best to approach IPD meta-analysis for count outcomes with many zeroes, such as alcohol use. We used simulation to compare the performance of four hurdle models (3 one-step and 1 two-step models) for zero-inflated count IPD, under realistic data conditions. Overall, all models yielded adequate coverage and bias for the treatment effect in the count portion of the model, across all data conditions. However, in the zero portion, the treatment effect was underestimated in most models and data conditions, especially when there were fewer studies. The performance of both one- and two-step approaches depended on the formulation of the treatment effects, suggesting a need to carefully consider model assumptions and specifications when using IPD.

使用个体参与者数据的荟萃分析(IPD)是干预研究中的一种重要方法,因为它(a)提高了估计的准确性和准确性,(b)允许研究人员调查治疗效果的中介和调节因素,以及(c)利用现有数据。IPD荟萃分析可以通过同时使用所有研究数据的一步方法进行,也可以通过汇总每个研究的数据,然后将其组合到传统的荟萃分析模型中的两步方法进行。不幸的是,对于如何最好地进行IPD荟萃分析来计算多个零的结果(如饮酒),目前还没有循证指南。在实际数据条件下,我们使用模拟来比较零膨胀计数IPD的四个栏模型(3个一步和1个两步模型)的性能。总体而言,在所有数据条件下,所有模型的计数部分都对治疗效果产生了足够的覆盖率和偏差。然而,在零部分,在大多数模型和数据条件下,治疗效果被低估了,尤其是在研究较少的情况下。一步和两步方法的性能取决于治疗效果的公式,这表明在使用IPD时需要仔细考虑模型假设和规范。
{"title":"Which is Better for Individual Participant Data Meta-Analysis of Zero-Inflated Count Outcomes, One-Step or Two-Step Analysis? A Simulation Study.","authors":"David Huh, Scott A Baldwin, Zhengyang Zhou, Joonsuk Park, Eun-Young Mun","doi":"10.1080/00273171.2023.2173135","DOIUrl":"10.1080/00273171.2023.2173135","url":null,"abstract":"<p><p>Meta-analysis using individual participant data (IPD) is an important methodology in intervention research because it (a) increases accuracy and precision of estimates, (b) allows researchers to investigate mediators and moderators of treatment effects, and (c) makes use of extant data. IPD meta-analysis can be conducted either via a one-step approach that uses data from all studies simultaneously, or a two-step approach, which aggregates data for each study and then combines them in a traditional meta-analysis model. Unfortunately, there are no evidence-based guidelines for how best to approach IPD meta-analysis for count outcomes with many zeroes, such as alcohol use. We used simulation to compare the performance of four hurdle models (3 one-step and 1 two-step models) for zero-inflated count IPD, under realistic data conditions. Overall, all models yielded adequate coverage and bias for the treatment effect in the count portion of the model, across all data conditions. However, in the zero portion, the treatment effect was underestimated in most models and data conditions, especially when there were fewer studies. The performance of both one- and two-step approaches depended on the formulation of the treatment effects, suggesting a need to carefully consider model assumptions and specifications when using IPD.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":5.3,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10517064/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9535145","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
CLC Estimator: A Tool for Latent Construct Estimation via Congeneric Approaches in Survey Research. CLC 估算器:调查研究中通过同源方法进行潜在结构估计的工具。
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-11-01 Epub Date: 2023-04-10 DOI: 10.1080/00273171.2023.2193718
Giacomo Marzi, Marco Balzano, Leonardo Egidi, Alessandro Magrini

This article proposes the Shiny app 'CLC Estimator' -Congeneric Latent Construct Estimator- to address the problem of estimating latent unidimensional constructs via congeneric approaches. While congeneric approaches provide more rigorous results than suboptimal parallel-based scoring methods, most statistical packages do not provide easy access to congeneric approaches. To address this issue, the CLC Estimator allows social scientists to use congeneric approaches to estimate latent unidimensional constructs smoothly. The present app provides a novel solution to the challenge of limited access to congeneric estimation methods in survey research.

本文提出了 Shiny 应用程序 "CLC 估算器"(CLC Estimator)--同源潜在结构估算器(Congeneric Latent Construct Estimator)-- 来解决通过同源方法估算潜在单维结构的问题。虽然同源方法比次优的平行计分方法能提供更严谨的结果,但大多数统计软件包都无法轻松访问同源方法。为了解决这个问题,CLC Estimator 允许社会科学家使用同源方法来顺利估计潜在的单维构造。本应用程序提供了一种新颖的解决方案,解决了调查研究中同源估算方法使用受限的难题。
{"title":"CLC Estimator: A Tool for Latent Construct Estimation via Congeneric Approaches in Survey Research.","authors":"Giacomo Marzi, Marco Balzano, Leonardo Egidi, Alessandro Magrini","doi":"10.1080/00273171.2023.2193718","DOIUrl":"10.1080/00273171.2023.2193718","url":null,"abstract":"<p><p>This article proposes the Shiny app 'CLC Estimator' -Congeneric Latent Construct Estimator- to address the problem of estimating latent unidimensional constructs via congeneric approaches. While congeneric approaches provide more rigorous results than suboptimal parallel-based scoring methods, most statistical packages do not provide easy access to congeneric approaches. To address this issue, the CLC Estimator allows social scientists to use congeneric approaches to estimate latent unidimensional constructs smoothly. The present app provides a novel solution to the challenge of limited access to congeneric estimation methods in survey research.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"9279519","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
State Space Mixture Modeling: Finding People with Similar Patterns of Change. 状态空间混合建模:寻找具有相似变化模式的人。
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-10-10 DOI: 10.1080/00273171.2023.2261224
Michael D Hunter

Increasingly, behavioral scientists encounter data where several individuals were measured on multiple variables over numerous occasions. Many current methods combine these data, assuming all individuals are randomly equivalent. An extreme alternative assumes no one is randomly equivalent. We propose state space mixture modeling as one possible compromise. State space mixture modeling assumes that unknown groups of people exist who share the same parameters of a state space model, and simultaneously estimates both the state space parameters and group membership. The goal is to find people that are undergoing similar change processes over time. The present work demonstrates state space mixture modeling on a simulated data set, and summarizes the results from a large simulation study. The illustration shows how the analysis is conducted, whereas the simulation provides evidence of its general validity and applicability. In the simulation study, sample size had the greatest influence on parameter estimation and the dimension of the change process had the greatest impact on correctly grouping people together, likely due to the distinctiveness of their patterns of change. State space mixture modeling offers one of the best-performing methods for simultaneously drawing conclusions about individual change processes while also analyzing multiple people.

行为科学家越来越多地遇到这样的数据,即在许多场合对几个个体进行多变量测量。目前的许多方法都将这些数据结合起来,假设所有个体都是随机等价的。一个极端的替代方案假设没有一个是随机等价的。我们提出状态空间混合建模作为一种可能的折衷方案。状态空间混合建模假设存在共享状态空间模型的相同参数的未知人群,并同时估计状态空间参数和群体成员关系。目标是找到随着时间推移正在经历类似变化过程的人。目前的工作演示了在模拟数据集上的状态空间混合建模,并总结了大型模拟研究的结果。图示显示了分析是如何进行的,而模拟提供了其总体有效性和适用性的证据。在模拟研究中,样本量对参数估计的影响最大,而变化过程的维度对正确地将人们分组的影响最大。这可能是由于他们的变化模式的独特性。状态空间混合建模提供了一种性能最好的方法,既可以得出关于个体变化过程的结论,又可以分析多个人。
{"title":"State Space Mixture Modeling: Finding People with Similar Patterns of Change.","authors":"Michael D Hunter","doi":"10.1080/00273171.2023.2261224","DOIUrl":"https://doi.org/10.1080/00273171.2023.2261224","url":null,"abstract":"<p><p>Increasingly, behavioral scientists encounter data where several individuals were measured on multiple variables over numerous occasions. Many current methods combine these data, assuming all individuals are randomly equivalent. An extreme alternative assumes no one is randomly equivalent. We propose state space mixture modeling as one possible compromise. State space mixture modeling assumes that unknown groups of people exist who share the same parameters of a state space model, and simultaneously estimates both the state space parameters and group membership. The goal is to find people that are undergoing similar change processes over time. The present work demonstrates state space mixture modeling on a simulated data set, and summarizes the results from a large simulation study. The illustration shows how the analysis is conducted, whereas the simulation provides evidence of its general validity and applicability. In the simulation study, sample size had the greatest influence on parameter estimation and the dimension of the change process had the greatest impact on correctly grouping people together, likely due to the distinctiveness of their patterns of change. State space mixture modeling offers one of the best-performing methods for simultaneously drawing conclusions about individual change processes while also analyzing multiple people.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"41184181","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Gradient Tree Boosting for Hierarchical Data. 层次数据的梯度树提升。
IF 3.8 3区 心理学 Q1 Mathematics Pub Date : 2023-09-01 Epub Date: 2023-01-05 DOI: 10.1080/00273171.2022.2146638
Marie Salditt, Sarah Humberg, Steffen Nestler

Gradient tree boosting is a powerful machine learning technique that has shown good performance in predicting a variety of outcomes. However, when applied to hierarchical (e.g., longitudinal or clustered) data, the predictive performance of gradient tree boosting may be harmed by ignoring the hierarchical structure, and may be improved by accounting for it. Tree-based methods such as regression trees and random forests have already been extended to hierarchical data settings by combining them with the linear mixed effects model (MEM). In the present article, we add to this literature by proposing two algorithms to estimate a combination of the MEM and gradient tree boosting. We report on two simulation studies that (i) investigate the predictive performance of the two MEM boosting algorithms and (ii) compare them to standard gradient tree boosting, standard random forest, and other existing methods for hierarchical data (MEM, MEM random forests, model-based boosting, Bayesian additive regression trees [BART]). We found substantial improvements in the predictive performance of our MEM boosting algorithms over standard boosting when the random effects were non-negligible. MEM boosting as well as BART showed a predictive performance similar to the correctly specified MEM (i.e., the benchmark model), and overall outperformed the model-based boosting and random forest approaches.

梯度树提升是一种强大的机器学习技术,在预测各种结果方面表现出良好的性能。然而,当应用于分层(例如,纵向或聚类)数据时,梯度树提升的预测性能可能会因忽略分层结构而受到损害,并且可能会因考虑分层结构而得到改善。回归树和随机森林等基于树的方法已经通过将其与线性混合效应模型(MEM)相结合而扩展到分层数据设置中。在本文中,我们通过提出两种算法来估计MEM和梯度树提升的组合,从而对文献进行了补充。我们报告了两项模拟研究,(i)研究了两种MEM增强算法的预测性能,(ii)将它们与标准梯度树增强、标准随机森林和其他现有的分层数据方法(MEM、MEM随机森林、基于模型的增强、贝叶斯加性回归树[BART])进行比较。当随机效应不可忽略时,我们发现我们的MEM增强算法的预测性能比标准增强算法有了实质性的改进。MEM增强和BART显示出与正确指定的MEM(即基准模型)相似的预测性能,总体上优于基于模型的增强和随机森林方法。
{"title":"Gradient Tree Boosting for Hierarchical Data.","authors":"Marie Salditt,&nbsp;Sarah Humberg,&nbsp;Steffen Nestler","doi":"10.1080/00273171.2022.2146638","DOIUrl":"10.1080/00273171.2022.2146638","url":null,"abstract":"<p><p>Gradient tree boosting is a powerful machine learning technique that has shown good performance in predicting a variety of outcomes. However, when applied to hierarchical (e.g., longitudinal or clustered) data, the predictive performance of gradient tree boosting may be harmed by ignoring the hierarchical structure, and may be improved by accounting for it. Tree-based methods such as regression trees and random forests have already been extended to hierarchical data settings by combining them with the linear mixed effects model (MEM). In the present article, we add to this literature by proposing two algorithms to estimate a combination of the MEM and gradient tree boosting. We report on two simulation studies that (i) investigate the predictive performance of the two MEM boosting algorithms and (ii) compare them to standard gradient tree boosting, standard random forest, and other existing methods for hierarchical data (MEM, MEM random forests, model-based boosting, Bayesian additive regression trees [BART]). We found substantial improvements in the predictive performance of our MEM boosting algorithms over standard boosting when the random effects were non-negligible. MEM boosting as well as BART showed a predictive performance similar to the correctly specified MEM (i.e., the benchmark model), and overall outperformed the model-based boosting and random forest approaches.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":null,"pages":null},"PeriodicalIF":3.8,"publicationDate":"2023-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"10480691","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Multivariate Behavioral Research
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1