Prediction model of preeclampsia using machine learning based methods: a population based cohort study in China

Taishun Li, Mingyang Xu, Yuan Wang, Ya Wang, H. Tang, Honglei Duan, Guangfeng Zhao, M. Zheng, Yali Hu
{"title":"Prediction model of preeclampsia using machine learning based methods: a population based cohort study in China","authors":"Taishun Li, Mingyang Xu, Yuan Wang, Ya Wang, H. Tang, Honglei Duan, Guangfeng Zhao, M. Zheng, Yali Hu","doi":"10.3389/fendo.2024.1345573","DOIUrl":null,"url":null,"abstract":"Preeclampsia is a disease with an unknown pathogenesis and is one of the leading causes of maternal and perinatal morbidity. At present, early identification of high-risk groups for preeclampsia and timely intervention with aspirin is an effective preventive method against preeclampsia. This study aims to develop a robust and effective preeclampsia prediction model with good performance by machine learning algorithms based on maternal characteristics, biophysical and biochemical markers at 11–13 + 6 weeks’ gestation, providing an effective tool for early screening and prediction of preeclampsia.This study included 5116 singleton pregnant women who underwent PE screening and fetal aneuploidy from a prospective cohort longitudinal study in China. Maternal characteristics (such as maternal age, height, pre-pregnancy weight), past medical history, mean arterial pressure, uterine artery pulsatility index, pregnancy-associated plasma protein A, and placental growth factor were collected as the covariates for the preeclampsia prediction model. Five classification algorithms including Logistic Regression, Extra Trees Classifier, Voting Classifier, Gaussian Process Classifier and Stacking Classifier were applied for the prediction model development. Five-fold cross-validation with an 8:2 train-test split was applied for model validation.We ultimately included 49 cases of preterm preeclampsia and 161 cases of term preeclampsia from the 4644 pregnant women data in the final analysis. Compared with other prediction algorithms, the AUC and detection rate at 10% FPR of the Voting Classifier algorithm showed better performance in the prediction of preterm preeclampsia (AUC=0.884, DR at 10%FPR=0.625) under all covariates included. However, its performance was similar to that of other model algorithms in all PE and term PE prediction. In the prediction of all preeclampsia, the contribution of PLGF was higher than PAPP-A (11.9% VS 8.7%), while the situation was opposite in the prediction of preterm preeclampsia (7.2% VS 16.5%). The performance for preeclampsia or preterm preeclampsia using machine learning algorithms was similar to that achieved by the fetal medicine foundation competing risk model under the same predictive factors (AUCs of 0.797 and 0.856 for PE and preterm PE, respectively).Our models provide an accessible tool for large-scale population screening and prediction of preeclampsia, which helps reduce the disease burden and improve maternal and fetal outcomes.","PeriodicalId":505784,"journal":{"name":"Frontiers in Endocrinology","volume":"56 22","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Endocrinology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/fendo.2024.1345573","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Preeclampsia is a disease with an unknown pathogenesis and is one of the leading causes of maternal and perinatal morbidity. At present, early identification of high-risk groups for preeclampsia and timely intervention with aspirin is an effective preventive method against preeclampsia. This study aims to develop a robust and effective preeclampsia prediction model with good performance by machine learning algorithms based on maternal characteristics, biophysical and biochemical markers at 11–13 + 6 weeks’ gestation, providing an effective tool for early screening and prediction of preeclampsia.This study included 5116 singleton pregnant women who underwent PE screening and fetal aneuploidy from a prospective cohort longitudinal study in China. Maternal characteristics (such as maternal age, height, pre-pregnancy weight), past medical history, mean arterial pressure, uterine artery pulsatility index, pregnancy-associated plasma protein A, and placental growth factor were collected as the covariates for the preeclampsia prediction model. Five classification algorithms including Logistic Regression, Extra Trees Classifier, Voting Classifier, Gaussian Process Classifier and Stacking Classifier were applied for the prediction model development. Five-fold cross-validation with an 8:2 train-test split was applied for model validation.We ultimately included 49 cases of preterm preeclampsia and 161 cases of term preeclampsia from the 4644 pregnant women data in the final analysis. Compared with other prediction algorithms, the AUC and detection rate at 10% FPR of the Voting Classifier algorithm showed better performance in the prediction of preterm preeclampsia (AUC=0.884, DR at 10%FPR=0.625) under all covariates included. However, its performance was similar to that of other model algorithms in all PE and term PE prediction. In the prediction of all preeclampsia, the contribution of PLGF was higher than PAPP-A (11.9% VS 8.7%), while the situation was opposite in the prediction of preterm preeclampsia (7.2% VS 16.5%). The performance for preeclampsia or preterm preeclampsia using machine learning algorithms was similar to that achieved by the fetal medicine foundation competing risk model under the same predictive factors (AUCs of 0.797 and 0.856 for PE and preterm PE, respectively).Our models provide an accessible tool for large-scale population screening and prediction of preeclampsia, which helps reduce the disease burden and improve maternal and fetal outcomes.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于机器学习方法的子痫前期预测模型:一项基于中国人群的队列研究
子痫前期是一种发病机制不明的疾病,也是孕产妇和围产期发病率的主要原因之一。目前,早期识别子痫前期的高危人群并及时使用阿司匹林进行干预是预防子痫前期的有效方法。本研究旨在通过机器学习算法,基于孕 11-13+6 周时的母体特征、生物物理和生物化学标志物,建立一个稳健有效、性能良好的子痫前期预测模型,为子痫前期的早期筛查和预测提供有效工具。研究收集了母体特征(如母体年龄、身高、孕前体重)、既往病史、平均动脉压、子宫动脉搏动指数、妊娠相关血浆蛋白 A 和胎盘生长因子作为子痫前期预测模型的协变量。预测模型的开发采用了五种分类算法,包括逻辑回归、额外树分类器、投票分类器、高斯过程分类器和堆叠分类器。最终,我们从 4644 名孕妇的数据中筛选出 49 例子痫前期和 161 例子痫前期。与其他预测算法相比,投票分类器算法的AUC和10%FPR时的检出率在预测早产子痫前期时表现更好(AUC=0.884,10%FPR时的DR=0.625)。然而,在所有子痫前期和子痫期预测中,其表现与其他模型算法相似。在预测所有子痫前期时,PLGF 的贡献率高于 PAPP-A(11.9% VS 8.7%),而在预测早产子痫前期时,情况正好相反(7.2% VS 16.5%)。在相同的预测因素下,使用机器学习算法预测子痫前期或先兆子痫的效果与胎儿医学基金会竞争风险模型的效果相似(PE和先兆子痫的AUC分别为0.797和0.856)。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
T1 mapping combined with arterial spin labeling MRI to identify renal injury in patients with liver cirrhosis Adjunctive benefits of low-frequency transcutaneous electrical nerve stimulation for obesity frequent chronic conditions: a systematic review Cadmium activation of wild-type and constitutively active estrogen receptor alpha Editorial: Preventing cardiovascular complications of type 2 diabetes The causal relationship between antihypertensive drugs and depression: a Mendelian randomization study of drug targets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1