{"title":"Prediction of postpartum depression in women: development and validation of multiple machine learning models.","authors":"Weijing Qi, Yongjian Wang, Yipeng Wang, Sha Huang, Cong Li, Haoyu Jin, Jinfan Zuo, Xuefei Cui, Ziqi Wei, Qing Guo, Jie Hu","doi":"10.1186/s12967-025-06289-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Postpartum depression (PPD) is a significant public health issue. This study aimed to develop and validate machine learning (ML) models using biopsychosocial predictors to predict the risk of PPD for perinatal women and to provide several risk assessment tools for the early detection of PPD.</p><p><strong>Methods: </strong>Candidate predictors, including history of mental illness and demographic, psychosocial, and physiological factors, were obtained from 1138 perinatal women between August 2021 and August 2022. The primary outcome of PPD was measured with the Edinburgh Postnatal Depression Scale at 6 weeks postpartum. Seven feature selection methods and six ML algorithms were employed to develop models, and their prediction performances were compared.</p><p><strong>Results: </strong>A total of 11 potential predictive factors associated with PPD were identified and subsequently used to construct prenatal and postpartum predictive models for PPD. The cross-validation results showed that the models built on logistic regression (LR) [area under the curve (AUC): 0.801, 0.858] and artificial neural network (ANN) (AUC: 0.787, 0.844) algorithms exhibited the best prediction performance. In contrast to the prenatal models, the addition of postpartum predictors (primary caregiver and mother-in-law's care) remarkably improved the predictive performance of the postpartum models. The risk-stratification score, the nomogram, and the Shapley additive explanation were used to visualize and interpret the risk prediction model for predicting PPD in the early stage.</p><p><strong>Conclusions: </strong>The LR and ANN models achieved the best predictive performances. Applying these models and risk assessment tools to early predict and screen PPD has several implications for public health.</p>","PeriodicalId":17458,"journal":{"name":"Journal of Translational Medicine","volume":"23 1","pages":"291"},"PeriodicalIF":6.1000,"publicationDate":"2025-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Translational Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12967-025-06289-6","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Postpartum depression (PPD) is a significant public health issue. This study aimed to develop and validate machine learning (ML) models using biopsychosocial predictors to predict the risk of PPD for perinatal women and to provide several risk assessment tools for the early detection of PPD.
Methods: Candidate predictors, including history of mental illness and demographic, psychosocial, and physiological factors, were obtained from 1138 perinatal women between August 2021 and August 2022. The primary outcome of PPD was measured with the Edinburgh Postnatal Depression Scale at 6 weeks postpartum. Seven feature selection methods and six ML algorithms were employed to develop models, and their prediction performances were compared.
Results: A total of 11 potential predictive factors associated with PPD were identified and subsequently used to construct prenatal and postpartum predictive models for PPD. The cross-validation results showed that the models built on logistic regression (LR) [area under the curve (AUC): 0.801, 0.858] and artificial neural network (ANN) (AUC: 0.787, 0.844) algorithms exhibited the best prediction performance. In contrast to the prenatal models, the addition of postpartum predictors (primary caregiver and mother-in-law's care) remarkably improved the predictive performance of the postpartum models. The risk-stratification score, the nomogram, and the Shapley additive explanation were used to visualize and interpret the risk prediction model for predicting PPD in the early stage.
Conclusions: The LR and ANN models achieved the best predictive performances. Applying these models and risk assessment tools to early predict and screen PPD has several implications for public health.
期刊介绍:
The Journal of Translational Medicine is an open-access journal that publishes articles focusing on information derived from human experimentation to enhance communication between basic and clinical science. It covers all areas of translational medicine.