Pub Date : 2026-01-14DOI: 10.1080/00273171.2026.2613311
Ethan M McCormick
There has been a growing interest in using earlier change to predict downstream distal outcomes in development; however, prior work has mostly focused on estimating the unique effect of the different growth parameters (e.g., intercept and slope) rather than focusing on the trajectory as a whole. Here I lay out a distal outcome latent curve model with latent interactions which attempts to model the joint effect of growth parameters on these later outcomes. I show again that these models require us to contend with unintuitive time coding effects which can impact the direction and significance of effects and that plotting and probing are necessary for disambiguating these joint effects. These graphical approaches emphasize practical steps for applied researchers in understanding these effects. I then outline how future research can help clarify optimal approaches for using the trajectory as a whole rather than the unique effects of its individual sub-components.
{"title":"Moderating the Consequences of Longitudinal Change for Distal Outcomes.","authors":"Ethan M McCormick","doi":"10.1080/00273171.2026.2613311","DOIUrl":"https://doi.org/10.1080/00273171.2026.2613311","url":null,"abstract":"<p><p>There has been a growing interest in using earlier change to predict downstream distal outcomes in development; however, prior work has mostly focused on estimating the unique effect of the different growth parameters (e.g., intercept and slope) rather than focusing on the trajectory as a whole. Here I lay out a distal outcome latent curve model with latent interactions which attempts to model the <i>joint</i> effect of growth parameters on these later outcomes. I show again that these models require us to contend with unintuitive time coding effects which can impact the direction and significance of effects and that plotting and probing are necessary for disambiguating these joint effects. These graphical approaches emphasize practical steps for applied researchers in understanding these effects. I then outline how future research can help clarify optimal approaches for using the trajectory as a whole rather than the unique effects of its individual sub-components.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-19"},"PeriodicalIF":3.5,"publicationDate":"2026-01-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145967948","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-30DOI: 10.1080/00273171.2025.2605678
Ludovica De Carolis, Inhan Kang, Minjeong Jeon
In this study, we introduce a novel modeling approach for ordinal response data, extending the one-parameter graded response model. The proposed model incorporates unobserved interactions between respondents and items, represented as distances in a two-dimensional Euclidean space, referred to as an interaction map. This latent space graded response model (LSGRM) addresses potential violations of the conditional independence assumption shared by traditional main-effect-only psychometric models and offers a visualization tool for exploring conditional dependence in ordinal item response data. Through simulation and empirical studies, we illustrate the utility of the proposed approach in analyzing Likert-scale psychological assessment data. Also, by comparing the results with those from other models of different data modalities, we examined the impact of dichotomization and treating ordinal responses as continuous on conditional dependence.
{"title":"A Latent Space Graded Response Model for Likert-Scale Psychological Assessments.","authors":"Ludovica De Carolis, Inhan Kang, Minjeong Jeon","doi":"10.1080/00273171.2025.2605678","DOIUrl":"https://doi.org/10.1080/00273171.2025.2605678","url":null,"abstract":"<p><p>In this study, we introduce a novel modeling approach for ordinal response data, extending the one-parameter graded response model. The proposed model incorporates unobserved interactions between respondents and items, represented as distances in a two-dimensional Euclidean space, referred to as an interaction map. This latent space graded response model (LSGRM) addresses potential violations of the conditional independence assumption shared by traditional main-effect-only psychometric models and offers a visualization tool for exploring conditional dependence in ordinal item response data. Through simulation and empirical studies, we illustrate the utility of the proposed approach in analyzing Likert-scale psychological assessment data. Also, by comparing the results with those from other models of different data modalities, we examined the impact of dichotomization and treating ordinal responses as continuous on conditional dependence.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-26"},"PeriodicalIF":3.5,"publicationDate":"2025-12-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145859036","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-27DOI: 10.1080/00273171.2025.2601271
Satoshi Usami
Psychological researchers have shown an interest in disaggregating within-person variability from between-person differences. This paper provides a tutorial, simulation, and illustrative example of a new approach proposed by Usami (2023). This approach consists of a two-step procedure: within-person variability scores (WPVS) for each person, which are disaggregated from the stable traits of that person, are predicted using structural equation modeling, and causal parameters are then estimated via a potential outcome approach, such as by using structural nested mean models (SNMMs). This method has several advantages: (i) the flexible inclusion of curvilinear and interaction effects for WPVS as latent variables in treatment and outcome models, (ii) more accurate estimates of causal parameters for reciprocal relations can be obtained under certain conditions owing to them being doubly robust, even if unobserved time-varying confounders and model misspecifications exist, (iii) no models for (the distributions of) observed time-varying confounders are needed for estimation, and (iv) the risk of obtaining improper solutions is reduced. Estimation performances are investigated through large-scale simulations and it shows that the proposed approach works well in many conditions if longitudinal data with are available. An analytic example using data from the Tokyo Teen Cohort (TTC) study is also provided.
{"title":"A Two-Step Robust Estimation Approach for Inferring Within-Person Relations in Longitudinal Design: Tutorial and Simulations.","authors":"Satoshi Usami","doi":"10.1080/00273171.2025.2601271","DOIUrl":"https://doi.org/10.1080/00273171.2025.2601271","url":null,"abstract":"<p><p>Psychological researchers have shown an interest in disaggregating within-person variability from between-person differences. This paper provides a tutorial, simulation, and illustrative example of a new approach proposed by Usami (2023). This approach consists of a two-step procedure: <i>within-person variability scores</i> (WPVS) for each person, which are disaggregated from the stable traits of that person, are predicted using structural equation modeling, and causal parameters are then estimated <i>via</i> a potential outcome approach, such as by using structural nested mean models (SNMMs). This method has several advantages: (i) the flexible inclusion of curvilinear and interaction effects for WPVS as latent variables in treatment and outcome models, (ii) more accurate estimates of causal parameters for reciprocal relations can be obtained under certain conditions owing to them being doubly robust, even if unobserved time-varying confounders and model misspecifications exist, (iii) no models for (the distributions of) observed time-varying confounders are needed for estimation, and (iv) the risk of obtaining improper solutions is reduced. Estimation performances are investigated through large-scale simulations and it shows that the proposed approach works well in many conditions if longitudinal data with <math><mrow><mi>T</mi><mo>≥</mo><mn>4</mn></mrow></math> are available. An analytic example using data from the Tokyo Teen Cohort (TTC) study is also provided.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-22"},"PeriodicalIF":3.5,"publicationDate":"2025-12-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145844588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-03DOI: 10.1080/00273171.2025.2587379
Lingbo Tong, Zhiyong Zhang
Artificial neural networks (ANN) have attracted increasing attention in the field of psychology. With the availability of software programs, the wide application of ANN becomes possible. However, without a firm understanding of the basics of the ANN, issues can easily arise. This article presents a step-by-step guide for implementing a feed-forward neural network (FNN) on a psychological data set to illustrate the critical steps in building, estimating, and interpreting a neural network model. We start with a concrete example of a basic 3-layer FNN, illustrating the core concepts, the matrix representation, and the whole optimization process. By adjusting parameters and changing the model structure, we examine their effects on model performance. Then, we introduce accessible methods for interpreting model results and making inferences. Through the guide, we hope to help researchers avoid common problems in applying neural network models and machine learning methods in general.
{"title":"Neural Network Analysis of Psychological Data: A Step-by-Step Guide.","authors":"Lingbo Tong, Zhiyong Zhang","doi":"10.1080/00273171.2025.2587379","DOIUrl":"https://doi.org/10.1080/00273171.2025.2587379","url":null,"abstract":"<p><p>Artificial neural networks (ANN) have attracted increasing attention in the field of psychology. With the availability of software programs, the wide application of ANN becomes possible. However, without a firm understanding of the basics of the ANN, issues can easily arise. This article presents a step-by-step guide for implementing a feed-forward neural network (FNN) on a psychological data set to illustrate the critical steps in building, estimating, and interpreting a neural network model. We start with a concrete example of a basic 3-layer FNN, illustrating the core concepts, the matrix representation, and the whole optimization process. By adjusting parameters and changing the model structure, we examine their effects on model performance. Then, we introduce accessible methods for interpreting model results and making inferences. Through the guide, we hope to help researchers avoid common problems in applying neural network models and machine learning methods in general.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-21"},"PeriodicalIF":3.5,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145662669","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-12-02DOI: 10.1080/00273171.2025.2592361
Yuan Fang, Lijuan Wang
Intraindividual variability (IIV) characterizes the amplitude and temporal dependency of short-term fluctuations of a variable and is often used to predict outcomes in psychological studies. However, how to properly model IIV is understudied. In particular, intraindividual standard deviation (or variance), which quantifies the amplitude of fluctuation of a variable around its mean level, can be challenging to model directly in popular latent variable frameworks, such as dynamic structural equation modeling (DSEM). In this study, we introduced three novel modeling methods, including two two-step hybrid-Bayesian methods using DSEM and a one-step full Bayesian method, to model IIV as predictors. We conducted a simulation study to evaluate the performance of the three methods and compared their performance to that of the conventional regression approach under various data conditions. Simulation results showed that the hybrid-Bayesian approach with multiple draws (HBM) and the one-step full Bayesian (FB) approach performed well in recovering the parameters when sufficient sample size and time points were available. The data requirement of using FB was lower than HBM. However, the conventional approach and hybrid-Bayesian approach with a single draw failed to recover parameters, even with large samples. We provided a simulated data example with code online to illustrate the use of the methods.
{"title":"Novel Full-Bayesian and Hybrid-Bayesian Approaches for Modeling Intraindividual Variability.","authors":"Yuan Fang, Lijuan Wang","doi":"10.1080/00273171.2025.2592361","DOIUrl":"https://doi.org/10.1080/00273171.2025.2592361","url":null,"abstract":"<p><p>Intraindividual variability (IIV) characterizes the amplitude and temporal dependency of short-term fluctuations of a variable and is often used to predict outcomes in psychological studies. However, how to properly model IIV is understudied. In particular, intraindividual standard deviation (or variance), which quantifies the amplitude of fluctuation of a variable around its mean level, can be challenging to model directly in popular latent variable frameworks, such as dynamic structural equation modeling (DSEM). In this study, we introduced three novel modeling methods, including two two-step hybrid-Bayesian methods using DSEM and a one-step full Bayesian method, to model IIV as predictors. We conducted a simulation study to evaluate the performance of the three methods and compared their performance to that of the conventional regression approach under various data conditions. Simulation results showed that the hybrid-Bayesian approach with multiple draws (HBM) and the one-step full Bayesian (FB) approach performed well in recovering the parameters when sufficient sample size and time points were available. The data requirement of using FB was lower than HBM. However, the conventional approach and hybrid-Bayesian approach with a single draw failed to recover parameters, even with large samples. We provided a simulated data example with code online to illustrate the use of the methods.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-16"},"PeriodicalIF":3.5,"publicationDate":"2025-12-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145656347","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-22DOI: 10.1080/00273171.2025.2587286
Xiaoyue Xiong, Yanling Li, Michael D Hunter, Sy-Miin Chow
Trends represent systematic intra-individual variations that occur over slower time scales that, if unaccounted, are known to yield biases in estimation of momentary change patterns captured by time series models. The applicability of detrending methods has rarely been assessed in the context of multi-level longitudinal panel data, namely, nested data structures with relatively few measurements. This paper evaluated the efficacy of a series of two-stage detrending methods against a single-stage Bayesian approach in fitting multi-level nonlinear growth curve models with autoregressive residuals (ml-GAR) with random effects in both the growth and autoregressive processes. Monte Carlo simulation studies revealed that the single-stage Bayesian approach, in contrast to two-stage approaches, exhibited satisfactory properties with as few as five time points when the number of individuals was large (e.g., 500 individuals). It still outperformed alternative two-stage approaches when correlated random effects between the trend and autoregressive processes were misspecified as a diagonal random effect structure. Empirical results from the Early Childhood Longitudinal Study-Kindergarten Class (ECLS-K) data suggested substantial deviations in conclusions regarding children's reading ability using two-stage in comparison to single-stage approaches, thus highlighting the importance of simultaneous modeling of trends and intraindividual variability whenever feasible.
{"title":"Integrated Trend and Lagged Modeling of Multi-Subject, Multilevel, and Short Time Series.","authors":"Xiaoyue Xiong, Yanling Li, Michael D Hunter, Sy-Miin Chow","doi":"10.1080/00273171.2025.2587286","DOIUrl":"https://doi.org/10.1080/00273171.2025.2587286","url":null,"abstract":"<p><p>Trends represent systematic intra-individual variations that occur over slower time scales that, if unaccounted, are known to yield biases in estimation of momentary change patterns captured by time series models. The applicability of detrending methods has rarely been assessed in the context of multi-level longitudinal panel data, namely, nested data structures with relatively few measurements. This paper evaluated the efficacy of a series of two-stage detrending methods against a single-stage Bayesian approach in fitting <u>m</u>ulti-<u>l</u>evel nonlinear <u>g</u>rowth curve models with <u>a</u>uto<u>r</u>egressive residuals (ml-GAR) with random effects in both the growth and autoregressive processes. Monte Carlo simulation studies revealed that the single-stage Bayesian approach, in contrast to two-stage approaches, exhibited satisfactory properties with as few as five time points when the number of individuals was large (e.g., 500 individuals). It still outperformed alternative two-stage approaches when correlated random effects between the trend and autoregressive processes were misspecified as a diagonal random effect structure. Empirical results from the Early Childhood Longitudinal Study-Kindergarten Class (ECLS-K) data suggested substantial deviations in conclusions regarding children's reading ability using two-stage in comparison to single-stage approaches, thus highlighting the importance of simultaneous modeling of trends and intraindividual variability whenever feasible.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-24"},"PeriodicalIF":3.5,"publicationDate":"2025-11-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145574722","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-21DOI: 10.1080/00273171.2025.2570250
Han Du, Fang Liu, Zhiyong Zhang, Craig Enders
Bayesian statistics have gained significant traction across various fields over the past few decades. Bayesian statistics textbooks often provide both code and the analytical forms of parameters for simple models. However, they often omit the process of deriving posterior distributions or limit it to basic univariate examples focused on the mean and variance. Additionally, these resources frequently assume a strong background in linear algebra and probability theory, which can present barriers for researchers without extensive mathematical training. This tutorial aims to fill that gap by offering a step-by-step guide to deriving posterior distributions. We aim to make concepts typically reserved for advanced statistics courses more accessible and practical. This tutorial will cover two models: the univariate normal model and the multilevel model. The concepts and properties demonstrated in the two examples can be generalized to other models and distributions.
{"title":"Demystifying Posterior Distributions: A Tutorial on Their Derivation.","authors":"Han Du, Fang Liu, Zhiyong Zhang, Craig Enders","doi":"10.1080/00273171.2025.2570250","DOIUrl":"https://doi.org/10.1080/00273171.2025.2570250","url":null,"abstract":"<p><p>Bayesian statistics have gained significant traction across various fields over the past few decades. Bayesian statistics textbooks often provide both code and the analytical forms of parameters for simple models. However, they often omit the process of deriving posterior distributions or limit it to basic univariate examples focused on the mean and variance. Additionally, these resources frequently assume a strong background in linear algebra and probability theory, which can present barriers for researchers without extensive mathematical training. This tutorial aims to fill that gap by offering a step-by-step guide to deriving posterior distributions. We aim to make concepts typically reserved for advanced statistics courses more accessible and practical. This tutorial will cover two models: the univariate normal model and the multilevel model. The concepts and properties demonstrated in the two examples can be generalized to other models and distributions.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-15"},"PeriodicalIF":3.5,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145574762","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-21DOI: 10.1080/00273171.2025.2565591
Monica Morell, Muwon Kwon, Youngjin Han, Youjin Sung, Yang Liu, Ji Seung Yang
A regression discontinuity (RD) design is often employed to provide causal evidence when the randomization of the treatment assignment is infeasible. When variables of interest are latent constructs measured by observed indicators, the conventional RD analysis using observed variable scores does not allow researchers to examine heterogeneity in the estimated local average treatment effect (ATE) and to generalize the ATE to participants away from the cutoff. We propose a novel methodological augmentation to the conventional RD analysis, which assumes the availability of multiple indicator variables (i.e., raw item responses) that measure the latent construct underlying the running variable. By specifying an explicit measurement model based on those indicator variables, our latent RD framework allows 1) defining the local ATE conditional on the latent construct, 2) disentangling the heterogeneity of the local ATE, and 3) generalizing the local ATE to running variable scores away from the cutoff. In a proof-of-concept simulation we illustrate the proposed augmentation recovers parameters of interest well under practical test length and sample size conditions.
{"title":"Regression Discontinuity Analysis with Latent Variables.","authors":"Monica Morell, Muwon Kwon, Youngjin Han, Youjin Sung, Yang Liu, Ji Seung Yang","doi":"10.1080/00273171.2025.2565591","DOIUrl":"https://doi.org/10.1080/00273171.2025.2565591","url":null,"abstract":"<p><p>A regression discontinuity (RD) design is often employed to provide causal evidence when the randomization of the treatment assignment is infeasible. When variables of interest are latent constructs measured by observed indicators, the conventional RD analysis using observed variable scores does not allow researchers to examine heterogeneity in the estimated local average treatment effect (ATE) and to generalize the ATE to participants away from the cutoff. We propose a novel methodological augmentation to the conventional RD analysis, which assumes the availability of multiple indicator variables (i.e., raw item responses) that measure the latent construct underlying the running variable. By specifying an explicit measurement model based on those indicator variables, our latent RD framework allows 1) defining the local ATE conditional on the latent construct, 2) disentangling the heterogeneity of the local ATE, and 3) generalizing the local ATE to running variable scores away from the cutoff. In a proof-of-concept simulation we illustrate the proposed augmentation recovers parameters of interest well under practical test length and sample size conditions.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-14"},"PeriodicalIF":3.5,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145574752","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-20DOI: 10.1080/00273171.2025.2583158
Dayeon Lee, Gregory R Hancock
In a linear latent growth model parameterized by intercept (α) and slope (β) factors, those factors' relation is often of interest. The model typically captures this through their covariance parameter, which inherently assumes linearity in their relation. However, this assumption may not always hold. For instance, α and β might be unrelated below a certain threshold along the α-axis but show a meaningful relation above it. That is, even though individual growth trajectories may follow a linear pattern over time, the relation between α and β can be nonlinear, potentially featuring distinct segments separated by a transition point. To address such relations, we propose a semiparametric approach that combines Bayesian P-splines for flexible nonlinear modeling of the α-β relation along with a segmented regression-based transition point detection method. This two-stage analytic approach provides for a more nuanced understanding of the α-β relation, including estimation of a potential transition point where the α-β relation structure fundamentally changes. Simulation results and an empirical data illustration support this approach's effectiveness with single transition point scenarios, offering deeper insights into aspects of the growth process.
{"title":"Detecting Transition Points in the Slope-Intercept Relation in Linear Latent Growth Models.","authors":"Dayeon Lee, Gregory R Hancock","doi":"10.1080/00273171.2025.2583158","DOIUrl":"https://doi.org/10.1080/00273171.2025.2583158","url":null,"abstract":"<p><p>In a linear latent growth model parameterized by intercept (α) and slope (β) factors, those factors' relation is often of interest. The model typically captures this through their covariance parameter, which inherently assumes linearity in their relation. However, this assumption may not always hold. For instance, α and β might be unrelated below a certain threshold along the α-axis but show a meaningful relation above it. That is, even though individual growth trajectories may follow a linear pattern over time, the relation between α and β can be nonlinear, potentially featuring distinct segments separated by a transition point. To address such relations, we propose a semiparametric approach that combines Bayesian P-splines for flexible nonlinear modeling of the α-β relation along with a segmented regression-based transition point detection method. This two-stage analytic approach provides for a more nuanced understanding of the α-β relation, including estimation of a potential transition point where the α-β relation structure fundamentally changes. Simulation results and an empirical data illustration support this approach's effectiveness with single transition point scenarios, offering deeper insights into aspects of the growth process.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-20"},"PeriodicalIF":3.5,"publicationDate":"2025-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566273","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Pub Date : 2025-11-20DOI: 10.1080/00273171.2025.2561942
Christoph Jindra, Karoline A Sachse
State-of-the-art causal inference methods for observational data promise to relax assumptions threatening valid causal inference. Targeted maximum likelihood estimation (TMLE), for example, is a template for constructing doubly robust, semiparametric, efficient substitution estimators, providing consistent estimates if the outcome or treatment model is correctly specified. Compared to standard approaches, it reduces the risk of misspecification bias by allowing (nonparametric) machine-learning techniques, including super learning, to estimate the relevant components of the data distribution. We briefly introduce TMLE and demonstrate its use by estimating the effects of private tutoring in mathematics during Year 7 on mathematics proficiency and grades using observational data from starting cohort 3 of the National Education Panel Study ( 4,167). We contrast TMLE estimates to those from ordinary least squares, the parametric G-formula, and the augmented inverse-probability weighted estimator. Our findings reveal close agreement between methods for end-of-year grades. However, variations emerge when examining mathematics proficiency as the outcome, highlighting that substantive conclusions may depend on the analytical approach. The results underscore the significance of employing advanced causal inference methods, such as TMLE, when navigating the complexities of observational data and highlight the nuanced impact of methodological choices on the interpretation of study outcomes.
{"title":"Targeted Maximum Likelihood Estimation for Causal Inference With Observational Data-The Example of Private Tutoring.","authors":"Christoph Jindra, Karoline A Sachse","doi":"10.1080/00273171.2025.2561942","DOIUrl":"https://doi.org/10.1080/00273171.2025.2561942","url":null,"abstract":"<p><p>State-of-the-art causal inference methods for observational data promise to relax assumptions threatening valid causal inference. Targeted maximum likelihood estimation (TMLE), for example, is a template for constructing doubly robust, semiparametric, efficient substitution estimators, providing consistent estimates if the outcome or treatment model is correctly specified. Compared to standard approaches, it reduces the risk of misspecification bias by allowing (nonparametric) machine-learning techniques, including super learning, to estimate the relevant components of the data distribution. We briefly introduce TMLE and demonstrate its use by estimating the effects of private tutoring in mathematics during Year 7 on mathematics proficiency and grades using observational data from starting cohort 3 of the National Education Panel Study (<math><mrow><mi>N</mi><mo>=</mo></mrow></math> 4,167). We contrast TMLE estimates to those from ordinary least squares, the parametric G-formula, and the augmented inverse-probability weighted estimator. Our findings reveal close agreement between methods for end-of-year grades. However, variations emerge when examining mathematics proficiency as the outcome, highlighting that substantive conclusions may depend on the analytical approach. The results underscore the significance of employing advanced causal inference methods, such as TMLE, when navigating the complexities of observational data and highlight the nuanced impact of methodological choices on the interpretation of study outcomes.</p>","PeriodicalId":53155,"journal":{"name":"Multivariate Behavioral Research","volume":" ","pages":"1-20"},"PeriodicalIF":3.5,"publicationDate":"2025-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145566256","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}