Pub Date : 2025-10-01Epub Date: 2023-11-13DOI: 10.1037/met0000613
Cheng-Hsien Li
The fulfillment of measurement invariance/equivalence is considered a prerequisite for meaningfully proceeding with substantive cross-group comparisons. In the multiple-group confirmatory factor analysis approach, one model identification issue has unfortunately received little attention: the specification of a referent variable in the test of measurement invariance. A multiple-indicator multiple-cause (MIMIC) model with moderated effects (i.e., MIMIC-interaction modeling; Woods & Grimm, 2011) and a moderated nonlinear factor analysis (MNLFA; Bauer, 2017) model for detecting uniform and nonuniform measurement inequivalences in tandem were proposed to identify credible referent variables. The performance of two search strategies, constrained and free baseline models, and MIMIC-interaction and MNLFA methodologies were evaluated in a Monte Carlo simulation. Effects of different configurations of the number of inequivalent variables, type and magnitude of inequivalence, magnitude of group differences in factor means and variances, and sample size in combination with each search strategy were determined. Results showed that the constrained baseline model strategy generally outperformed the free baseline model strategy for identifying credible referent variables, functioning well when up to one-third of the observed variables were noninvariant. Moreover, MNLFA performed better than MIMIC-interaction modeling for the selection of referent variables across nearly all conditions investigated in the study. The superiority of MNLFA over MIMIC-interaction modeling was specifically evident in the models with relatively small samples, large between-group latent variance differences, or a combination of both. An empirical example was presented to demonstrate the applicability of MNLFA with the constrained baseline model strategy for referent variable selection. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
{"title":"Empirical selection of referent variables: Comparing multiple-indicator multiple-cause-interaction modeling and moderated nonlinear factor analysis.","authors":"Cheng-Hsien Li","doi":"10.1037/met0000613","DOIUrl":"10.1037/met0000613","url":null,"abstract":"<p><p>The fulfillment of measurement invariance/equivalence is considered a prerequisite for meaningfully proceeding with substantive cross-group comparisons. In the multiple-group confirmatory factor analysis approach, one model identification issue has unfortunately received little attention: the specification of a referent variable in the test of measurement invariance. A multiple-indicator multiple-cause (MIMIC) model with moderated effects (i.e., MIMIC-interaction modeling; Woods & Grimm, 2011) and a moderated nonlinear factor analysis (MNLFA; Bauer, 2017) model for detecting uniform and nonuniform measurement inequivalences in tandem were proposed to identify credible referent variables. The performance of two search strategies, constrained and free baseline models, and MIMIC-interaction and MNLFA methodologies were evaluated in a Monte Carlo simulation. Effects of different configurations of the number of inequivalent variables, type and magnitude of inequivalence, magnitude of group differences in factor means and variances, and sample size in combination with each search strategy were determined. Results showed that the constrained baseline model strategy generally outperformed the free baseline model strategy for identifying credible referent variables, functioning well when up to one-third of the observed variables were noninvariant. Moreover, MNLFA performed better than MIMIC-interaction modeling for the selection of referent variables across nearly all conditions investigated in the study. The superiority of MNLFA over MIMIC-interaction modeling was specifically evident in the models with relatively small samples, large between-group latent variance differences, or a combination of both. An empirical example was presented to demonstrate the applicability of MNLFA with the constrained baseline model strategy for referent variable selection. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":"1056-1078"},"PeriodicalIF":7.8,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"92156265","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hannelies de Jonge, Kees-Jan Kan, Frans J Oort, Suzanne Jak
Meta-analytic structural equation modeling (MASEM) allows a researcher to simultaneously examine multiple relations among variables by fitting a structural equation model to summary statistics from multiple studies. Consider, for example, a mediation model with a predictor (X), mediator (M), and outcome variable (Y). In such a model, X can be a dichotomous variable, allowing researchers to examine the direct and indirect effects of an intervention as in randomized controlled trials (RCTs). However, the natural choice of a meta-analysis of RCTs would involve standardized mean differences as effect sizes, whereas MASEM requires correlation matrices as input. This can be solved by converting standardized mean differences (Cohen's d or Hedges' g) to point-biserial correlations (rpb). Possible conversion formulas vary across publications and conversion tools, and it is unclear which one is most appropriate for use in MASEM. The aim of this article is to describe and evaluate several conversions of standardized mean differences to point-biserial correlations in the context of RCTs. We investigate the impact of the usage of various conversions on MASEM parameter estimation using the R package metaSEM in a simulation study, varying the ratio of group sample sizes, number of primary studies, sample sizes, and missingness. The results show that a relatively unknown d-to-rpb conversion generally performs best. However, this conversion formula is not implemented in the mainstream conversion tools. We developed a user-friendly web application entitled Effect Size Calculator and Converter (https://hdejonge.shinyapps.io/ESCACO) that converts the user's primary study statistics into an effect size suitable for use in MASEM. (PsycInfo Database Record (c) 2026 APA, all rights reserved).
元分析结构方程模型(MASEM)允许研究人员通过拟合结构方程模型来汇总多项研究的统计数据,从而同时检查变量之间的多种关系。例如,考虑一个具有预测因子(X)、中介因子(M)和结果变量(Y)的中介模型。在这样的模型中,X可以是一个二分类变量,允许研究人员检查干预的直接和间接影响,就像随机对照试验(rct)一样。然而,随机对照试验的荟萃分析的自然选择将涉及标准化的平均差异作为效应大小,而MASEM需要相关矩阵作为输入。这可以通过将标准化平均差异(Cohen's d或Hedges' s g)转换为点双列相关性(rpb)来解决。可能的转换公式因出版物和转换工具而异,不清楚哪一种最适合在MASEM中使用。本文的目的是描述和评估在随机对照试验背景下标准化平均差异到点双列相关性的几种转换。我们在模拟研究中研究了使用R包metaSEM对MASEM参数估计的各种转换的影响,改变了组样本量的比例、主要研究的数量、样本量和缺失。结果表明,相对未知的d-to-rpb转换通常表现最好。然而,这种转换公式并没有在主流的转换工具中实现。我们开发了一个用户友好的网络应用程序,名为效应大小计算器和转换器(https://hdejonge.shinyapps.io/ESCACO),将用户的主要研究统计数据转换为适合在MASEM中使用的效应大小。(PsycInfo Database Record (c) 2025 APA,版权所有)。
{"title":"How to synthesize randomized controlled trial data with meta-analytic structural equation modeling: A comparison of various d-to-rpb conversions.","authors":"Hannelies de Jonge, Kees-Jan Kan, Frans J Oort, Suzanne Jak","doi":"10.1037/met0000790","DOIUrl":"10.1037/met0000790","url":null,"abstract":"<p><p>Meta-analytic structural equation modeling (MASEM) allows a researcher to simultaneously examine multiple relations among variables by fitting a structural equation model to summary statistics from multiple studies. Consider, for example, a mediation model with a predictor (<i>X</i>), mediator (<i>M</i>), and outcome variable (<i>Y</i>). In such a model, <i>X</i> can be a dichotomous variable, allowing researchers to examine the direct and indirect effects of an intervention as in randomized controlled trials (RCTs). However, the natural choice of a meta-analysis of RCTs would involve standardized mean differences as effect sizes, whereas MASEM requires correlation matrices as input. This can be solved by converting standardized mean differences (Cohen's <i>d</i> or Hedges' <i>g</i>) to point-biserial correlations (<i>r</i><sub>pb</sub>). Possible conversion formulas vary across publications and conversion tools, and it is unclear which one is most appropriate for use in MASEM. The aim of this article is to describe and evaluate several conversions of standardized mean differences to point-biserial correlations in the context of RCTs. We investigate the impact of the usage of various conversions on MASEM parameter estimation using the R package metaSEM in a simulation study, varying the ratio of group sample sizes, number of primary studies, sample sizes, and missingness. The results show that a relatively unknown <i>d</i>-to-<i>r</i><sub>pb</sub> conversion generally performs best. However, this conversion formula is not implemented in the mainstream conversion tools. We developed a user-friendly web application entitled Effect Size Calculator and Converter (https://hdejonge.shinyapps.io/ESCACO) that converts the user's primary study statistics into an effect size suitable for use in MASEM. (PsycInfo Database Record (c) 2026 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.8,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145192491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Supplemental Material for Inferences and Effect Sizes for Direct, Indirect, and Total Effects in Continuous-Time Mediation Models","authors":"","doi":"10.1037/met0000779.supp","DOIUrl":"https://doi.org/10.1037/met0000779.supp","url":null,"abstract":"","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"24 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145254871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Supplemental Material for How to Synthesize Randomized Controlled Trial Data With Meta-Analytic Structural Equation Modeling: A Comparison of Various d-to-rpb Conversions","authors":"","doi":"10.1037/met0000790.supp","DOIUrl":"https://doi.org/10.1037/met0000790.supp","url":null,"abstract":"","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"255 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145254883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Tom Heyman,Ekaterina Pronizius,Savannah C Lewis,Oguz A Acar,Matúš Adamkovič,Ettore Ambrosini,Jan Antfolk,Krystian Barzykowski,Ernest Baskin,Carlota Batres,Leanne Boucher,Jordane Boudesseul,Eduard Brandstätter,W Matthew Collins,Dušica Filipović Ðurđević,Ciara Egan,Vanessa Era,Paulo Ferreira,Chiara Fini,Patricia Garrido-Vásquez,Hendrik Godbersen,Pablo Gomez,Aurelien Graton,Necdet Gurkan,Zhiran He,Dave C Johnson,Pavol Kačmár,Chris Koch,Marta Kowal,Tomas Kratochvil,Marco Marelli,Fernando Marmolejo-Ramos,Martín Martínez,Alan Mattiassi,Nicholas P Maxwell,Maria Montefinese,Coby Morvinski,Maital Neta,Yngwie A Nielsen,Sebastian Ocklenburg,Jaš Onič,Marietta Papadatou-Pastou,Adam J Parker,Mariola Paruzel-Czachura,Yuri G Pavlov,Manuel Perea,Gerit Pfuhl,Tanja C Roembke,Jan P Röer,Timo B Roettger,Susana Ruiz-Fernandez,Kathleen Schmidt,Cynthia S Q Siew,Christian K Tamnes,Jack E Taylor,Rémi Thériault,José L Ulloa,Miguel A Vadillo,Michael E W Varnum,Martin R Vasilev,Steven Verheyen,Giada Viviani,Sebastian Wallot,Yuki Yamada,Yueyuan Zheng,Erin M Buchanan
When processing and analyzing empirical data, researchers regularly face choices that may appear arbitrary (e.g., how to define and handle outliers). If one chooses to exclusively focus on a particular option and conduct a single analysis, its outcome might be of limited utility. That is, one remains agnostic regarding the generalizability of the results, because plausible alternative paths remain unexplored. A multiverse analysis offers a solution to this issue by exploring the various choices pertaining to data-processing and/or model building, and examining their impact on the conclusion of a study. However, even though multiverse analyses are arguably less susceptible to biases compared to the typical single-pathway approach, it is still possible to selectively add or omit pathways. To address this issue, we outline a novel, more principled approach to conducting multiverse analyses through crowdsourcing. The approach is detailed in a step-by-step tutorial to facilitate its implementation. We also provide a worked-out illustration featuring the Semantic Priming Across Many Languages project, thereby demonstrating its feasibility and its ability to increase objectivity and transparency. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
在处理和分析经验数据时,研究人员经常面临可能显得武断的选择(例如,如何定义和处理异常值)。如果一个人选择专门关注一个特定的选项,并进行单一的分析,其结果可能是有限的效用。也就是说,对于结果的普遍性,人们仍然是不可知论者,因为合理的替代途径仍然没有被探索。多元宇宙分析通过探索与数据处理和/或模型构建相关的各种选择,并检查它们对研究结论的影响,为这个问题提供了解决方案。然而,尽管与典型的单路径方法相比,多元宇宙分析可以说不太容易受到偏差的影响,但仍然有可能选择性地添加或省略路径。为了解决这个问题,我们概述了一种新颖的、更有原则的方法,通过众包来进行多元宇宙分析。该方法将在一个循序渐进的教程中详细介绍,以促进其实现。我们还提供了一个针对跨多种语言语义启动项目的详细说明,从而展示了其可行性及其增加客观性和透明度的能力。(PsycInfo Database Record (c) 2025 APA,版权所有)。
{"title":"Crowdsourcing multiverse analyses to explore the impact of different data-processing and analysis decisions: A tutorial.","authors":"Tom Heyman,Ekaterina Pronizius,Savannah C Lewis,Oguz A Acar,Matúš Adamkovič,Ettore Ambrosini,Jan Antfolk,Krystian Barzykowski,Ernest Baskin,Carlota Batres,Leanne Boucher,Jordane Boudesseul,Eduard Brandstätter,W Matthew Collins,Dušica Filipović Ðurđević,Ciara Egan,Vanessa Era,Paulo Ferreira,Chiara Fini,Patricia Garrido-Vásquez,Hendrik Godbersen,Pablo Gomez,Aurelien Graton,Necdet Gurkan,Zhiran He,Dave C Johnson,Pavol Kačmár,Chris Koch,Marta Kowal,Tomas Kratochvil,Marco Marelli,Fernando Marmolejo-Ramos,Martín Martínez,Alan Mattiassi,Nicholas P Maxwell,Maria Montefinese,Coby Morvinski,Maital Neta,Yngwie A Nielsen,Sebastian Ocklenburg,Jaš Onič,Marietta Papadatou-Pastou,Adam J Parker,Mariola Paruzel-Czachura,Yuri G Pavlov,Manuel Perea,Gerit Pfuhl,Tanja C Roembke,Jan P Röer,Timo B Roettger,Susana Ruiz-Fernandez,Kathleen Schmidt,Cynthia S Q Siew,Christian K Tamnes,Jack E Taylor,Rémi Thériault,José L Ulloa,Miguel A Vadillo,Michael E W Varnum,Martin R Vasilev,Steven Verheyen,Giada Viviani,Sebastian Wallot,Yuki Yamada,Yueyuan Zheng,Erin M Buchanan","doi":"10.1037/met0000770","DOIUrl":"https://doi.org/10.1037/met0000770","url":null,"abstract":"When processing and analyzing empirical data, researchers regularly face choices that may appear arbitrary (e.g., how to define and handle outliers). If one chooses to exclusively focus on a particular option and conduct a single analysis, its outcome might be of limited utility. That is, one remains agnostic regarding the generalizability of the results, because plausible alternative paths remain unexplored. A multiverse analysis offers a solution to this issue by exploring the various choices pertaining to data-processing and/or model building, and examining their impact on the conclusion of a study. However, even though multiverse analyses are arguably less susceptible to biases compared to the typical single-pathway approach, it is still possible to selectively add or omit pathways. To address this issue, we outline a novel, more principled approach to conducting multiverse analyses through crowdsourcing. The approach is detailed in a step-by-step tutorial to facilitate its implementation. We also provide a worked-out illustration featuring the Semantic Priming Across Many Languages project, thereby demonstrating its feasibility and its ability to increase objectivity and transparency. (PsycInfo Database Record (c) 2025 APA, all rights reserved).","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"1 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145078142","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
When comparing relationships between latent variables across groups, measurement invariance (MI) needs to be established to ensure that the test results are valid and meaningful conclusions can be drawn. Common tests of MI are not ideal for investigating many groups and are of limited value during the development of measurement models. In addition, popular network-based alternatives to latent variable modeling lack established methods for MI testing. Therefore, we propose exploratory graph analysis trees (EGA trees) that apply the idea of model-based recursive partitioning to correlation matrices and combine it with EGA-which can be used instead of exploratory factor analysis. In a simulation study, we test the approach regarding its ability to detect configural or metric noninvariance in common factor models given numerous covariates and illustrate its usefulness in conditions with severe violations of configural invaraince based on the diverging number of factors. The results demonstrate that EGA trees can be a valuable tool for the exploration of MI when constructing scales and working on measurement models. We provide R functions within the R package EFAtree to easily implement EGA trees. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
在比较各组间潜在变量之间的关系时,需要建立测量不变性(MI),以确保检验结果有效,并能得出有意义的结论。MI的普通测试对于调查许多群体并不理想,并且在开发测量模型期间价值有限。此外,流行的基于网络的潜在变量建模替代方法缺乏MI测试的既定方法。因此,我们提出探索性图分析树(EGA树),将基于模型的递归划分思想应用于相关矩阵,并将其与EGA相结合,可以代替探索性因子分析。在模拟研究中,我们测试了该方法在给定大量协变量的公共因素模型中检测组态或度量非不变的能力,并说明了其在基于分散的因素数量严重违反组态不变性的条件下的有用性。结果表明,EGA树在构建尺度和处理测量模型时可以成为探索MI的一个有价值的工具。我们在R包EFAtree中提供R函数来轻松实现EGA树。(PsycInfo Database Record (c) 2025 APA,版权所有)。
{"title":"Exploratory graph analysis trees-A network-based approach to investigate measurement invariance with numerous covariates.","authors":"David Goretzko,Philipp Sterner","doi":"10.1037/met0000796","DOIUrl":"https://doi.org/10.1037/met0000796","url":null,"abstract":"When comparing relationships between latent variables across groups, measurement invariance (MI) needs to be established to ensure that the test results are valid and meaningful conclusions can be drawn. Common tests of MI are not ideal for investigating many groups and are of limited value during the development of measurement models. In addition, popular network-based alternatives to latent variable modeling lack established methods for MI testing. Therefore, we propose exploratory graph analysis trees (EGA trees) that apply the idea of model-based recursive partitioning to correlation matrices and combine it with EGA-which can be used instead of exploratory factor analysis. In a simulation study, we test the approach regarding its ability to detect configural or metric noninvariance in common factor models given numerous covariates and illustrate its usefulness in conditions with severe violations of configural invaraince based on the diverging number of factors. The results demonstrate that EGA trees can be a valuable tool for the exploration of MI when constructing scales and working on measurement models. We provide R functions within the R package EFAtree to easily implement EGA trees. (PsycInfo Database Record (c) 2025 APA, all rights reserved).","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"67 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145058900","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Oscar L Olvera Astivia,Xijuan Zhang,Edward Kroc,Bruno D Zumbo
Moderated multiple regression is one of the most established, popular methods to model nonlinear associations in social sciences. A mostly unacknowledged fact is that a particular type of nonnormality can make the coefficient capturing this association nonzero. To further understand this connection, a theoretical investigation was conducted. A generalization of Isserlis' theorem from multivariate normal densities to all elliptical densities is presented. Through this generalization, it was found that the family of elliptical densities (which includes the multivariate normal) cannot generate a product-interaction term. Moreover, asymmetry in lower and/or higher dimensions can induce a product-interaction term. Special case studies are presented where the variables are unidimensional symmetric, but jointly nonsymmetric, resulting in a moderated multiple regression model. A call is made for researchers to think carefully and decide when they have a true interaction term, theorized a priori, and when nonnormality is mimicking an interaction effect. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
适度多元回归是社会科学中最成熟、最流行的非线性关联建模方法之一。一个大多数未被承认的事实是,一种特殊类型的非正态性可以使捕获这种关联的系数非零。为了进一步了解这种联系,进行了理论研究。将Isserlis定理从多元正态密度推广到所有椭圆密度。通过这种推广,发现椭圆密度族(包括多元正态)不能产生积相互作用项。此外,低维和/或高维的不对称可以诱导产物相互作用项。提出了特殊的案例研究,其中变量是一维对称的,但联合不对称,导致一个有调节的多元回归模型。研究人员需要仔细思考,并决定什么时候他们有一个真正的相互作用条件,理论化的先验,什么时候非正常是模仿相互作用的效果。(PsycInfo Database Record (c) 2025 APA,版权所有)。
{"title":"On the uncanny relationship between nonnormality and moderated multiple regression.","authors":"Oscar L Olvera Astivia,Xijuan Zhang,Edward Kroc,Bruno D Zumbo","doi":"10.1037/met0000797","DOIUrl":"https://doi.org/10.1037/met0000797","url":null,"abstract":"Moderated multiple regression is one of the most established, popular methods to model nonlinear associations in social sciences. A mostly unacknowledged fact is that a particular type of nonnormality can make the coefficient capturing this association nonzero. To further understand this connection, a theoretical investigation was conducted. A generalization of Isserlis' theorem from multivariate normal densities to all elliptical densities is presented. Through this generalization, it was found that the family of elliptical densities (which includes the multivariate normal) cannot generate a product-interaction term. Moreover, asymmetry in lower and/or higher dimensions can induce a product-interaction term. Special case studies are presented where the variables are unidimensional symmetric, but jointly nonsymmetric, resulting in a moderated multiple regression model. A call is made for researchers to think carefully and decide when they have a true interaction term, theorized a priori, and when nonnormality is mimicking an interaction effect. (PsycInfo Database Record (c) 2025 APA, all rights reserved).","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"14 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145018258","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Henrik R. Godmann, František Bartoš, Eric-Jan Wagenmakers
{"title":"Truncating the likelihood allows outlier exclusion without overestimating the evidence in the Bayes factor t test.","authors":"Henrik R. Godmann, František Bartoš, Eric-Jan Wagenmakers","doi":"10.1037/met0000782","DOIUrl":"https://doi.org/10.1037/met0000782","url":null,"abstract":"","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"23 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144910621","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Coefficient of agreement between two raters corrected for category prevalence: Alternative to kappa.","authors":"Rashid Saif Almehrizi","doi":"10.1037/met0000732","DOIUrl":"https://doi.org/10.1037/met0000732","url":null,"abstract":"","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":"40 1","pages":""},"PeriodicalIF":7.0,"publicationDate":"2025-08-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144899765","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The statistical and pragmatic tension between explanation and prediction is well recognized in psychology. Yarkoni and Westfall (2017) suggested focusing more on predictions, which will ultimately produce better calibrated interpretations. Variable selection methods, such as regularization, are strongly recommended because it will help construct interpretable models while optimizing prediction accuracy. However, when the data contain a nonignorable proportion of missingness, variable selection and model building via penalized regression methods are not straightforward. What further complicates the analysis protocol is when the model performance is evaluated on both prediction accuracy and fairness, the latter is of increasing attention when the predictive outcome has societal implications. This study explored two methods for variable selection with incomplete data: the bootstrap imputation-stability selection (BI-SS) method and the stacked elastic net (SENET) method. Both methods work with multiply imputed data sets but in different ways. BI-SS implements variable selection separately on each imputed bootstrap data set and aggregates the results via stability selection, while SENET stacks all imputed data sets and fits a single pooled model. We thoroughly evaluated their performance using a suite of metrics (including area under the curve, F1 score, and fairness criteria) via three increasingly complex simulation studies. Results reveal that while BI-SS and SENET methods perform almost equally well in settings with generalized linear models, only BI-SS fares well with nested data design because of high computation demand in fitting the regularized generalized linear mixed effects models. Finally, we demonstrated both methods with an example using rich electronic health data. (PsycInfo Database Record (c) 2025 APA, all rights reserved).
在心理学中,解释和预测之间的统计学和语用学张力是公认的。Yarkoni和Westfall(2017)建议更多地关注预测,这最终将产生更好的校准解释。变量选择方法,如正则化,是强烈推荐的,因为它将有助于构建可解释的模型,同时优化预测精度。然而,当数据包含不可忽略的缺失比例时,通过惩罚回归方法进行变量选择和模型构建并不简单。使分析方案进一步复杂化的是,当模型性能同时评估预测准确性和公平性时,后者在预测结果具有社会影响时越来越受到关注。本文探讨了两种不完全数据下的变量选择方法:自举法(BI-SS)和叠弹性网法(SENET)。这两种方法都适用于多输入数据集,但方式不同。BI-SS分别对每个输入的自举数据集进行变量选择,并通过稳定性选择汇总结果,而SENET将所有输入的数据集叠加并拟合单个池模型。我们通过三个日益复杂的模拟研究,使用一系列指标(包括曲线下面积、F1分数和公平性标准)彻底评估了他们的表现。结果表明,虽然BI-SS和SENET方法在广义线性模型设置中表现几乎相同,但只有BI-SS方法在嵌套数据设计中表现良好,因为在拟合正则化广义线性混合效应模型时需要大量的计算量。最后,我们通过一个使用丰富电子健康数据的示例演示了这两种方法。(PsycInfo Database Record (c) 2025 APA,版权所有)。
{"title":"Constructing a binary prediction model with incomplete data: Variable selection to balance fairness and precision.","authors":"He Ren, Chun Wang, Gongjun Xu, David J Weiss","doi":"10.1037/met0000786","DOIUrl":"10.1037/met0000786","url":null,"abstract":"<p><p>The statistical and pragmatic tension between explanation and prediction is well recognized in psychology. Yarkoni and Westfall (2017) suggested focusing more on predictions, which will ultimately produce better calibrated interpretations. Variable selection methods, such as regularization, are strongly recommended because it will help construct interpretable models while optimizing prediction accuracy. However, when the data contain a nonignorable proportion of missingness, variable selection and model building via penalized regression methods are not straightforward. What further complicates the analysis protocol is when the model performance is evaluated on both prediction accuracy and fairness, the latter is of increasing attention when the predictive outcome has societal implications. This study explored two methods for variable selection with incomplete data: the bootstrap imputation-stability selection (BI-SS) method and the stacked elastic net (SENET) method. Both methods work with multiply imputed data sets but in different ways. BI-SS implements variable selection separately on each imputed bootstrap data set and aggregates the results via stability selection, while SENET stacks all imputed data sets and fits a single pooled model. We thoroughly evaluated their performance using a suite of metrics (including area under the curve, F1 score, and fairness criteria) via three increasingly complex simulation studies. Results reveal that while BI-SS and SENET methods perform almost equally well in settings with generalized linear models, only BI-SS fares well with nested data design because of high computation demand in fitting the regularized generalized linear mixed effects models. Finally, we demonstrated both methods with an example using rich electronic health data. (PsycInfo Database Record (c) 2025 APA, all rights reserved).</p>","PeriodicalId":20782,"journal":{"name":"Psychological methods","volume":" ","pages":""},"PeriodicalIF":7.8,"publicationDate":"2025-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12356495/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144856196","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}