Consecutive prediction of adverse maternal outcomes of preeclampsia, using the PIERS-ML and fullPIERS models: A multicountry prospective observational study.

IF 9.9 1区医学 Q1 Medicine PLoS Medicine Pub Date : 2025-02-04 eCollection Date: 2025-02-01 DOI:10.1371/journal.pmed.1004509

Guiyou Yang, Tünde Montgomery-Csobán, Wessel Ganzevoort, Sanne J Gordijn, Kimberley Kavanagh, Paul Murray, Laura A Magee, Henk Groen, Peter von Dadelszen

{"title":"Consecutive prediction of adverse maternal outcomes of preeclampsia, using the PIERS-ML and fullPIERS models: A multicountry prospective observational study.","authors":"Guiyou Yang, Tünde Montgomery-Csobán, Wessel Ganzevoort, Sanne J Gordijn, Kimberley Kavanagh, Paul Murray, Laura A Magee, Henk Groen, Peter von Dadelszen","doi":"10.1371/journal.pmed.1004509","DOIUrl":null,"url":null,"abstract":"Background: Preeclampsia is a potentially life-threatening pregnancy complication. Among women whose pregnancies are complicated by preeclampsia, the Preeclampsia Integrated Estimate of RiSk (PIERS) models (i.e., the PIERS Machine Learning [PIERS-ML] model, and the logistic regression-based fullPIERS model) accurately identify individuals at greatest or least risk of adverse maternal outcomes within 48 h following admission. Both models were developed and validated to be used as part of initial assessment. In the United Kingdom, the National Institute for Health and Care Excellence (NICE) recommends repeated use of such static models for ongoing assessment beyond the first 48 h. This study evaluated the models' performance during such consecutive prediction.Methods and findings: This multicountry prospective study used data of 8,843 women (32% white, 30% black, and 26% Asian) with a median age of 31 years. These women, admitted to maternity units in the Americas, sub-Saharan Africa, South Asia, Europe, and Oceania, were diagnosed with preeclampsia at a median gestational age of 35.79 weeks between year 2003 and 2016. The risk differentiation performance of the PIERS-ML and fullPIERS models were assessed for each day within a 2-week post-admission window. The PIERS adverse maternal outcome includes one or more of: death, end-organ complication (cardiorespiratory, renal, hepatic, etc.), or uteroplacental dysfunction (e.g., placental abruption). The main outcome measures were: trajectories of mean risk of each of the uncomplicated course and adverse outcome groups; daily area under the precision-recall curve (AUC-PRC); potential clinical impact (i.e., net benefit in decision curve analysis); dynamic shifts of multiple risk groups; and daily likelihood ratios. In the 2 weeks window, the number of daily outcome events decreased from over 200 to around 10. For both PIERS-ML and fullPIERS models, we observed consistently higher mean risk in the adverse outcome (versus uncomplicated course) group. The AUC-PRC values (0.2-0.4) of the fullPIERS model remained low (i.e., close to the daily fraction of adverse outcomes, indicating low discriminative capacity). The PIERS-ML model's AUC-PRC peaked on day 0 (0.65), and notably decreased thereafter. When categorizing women into multiple risk groups, the PIERS-ML model generally showed good rule-in capacity for the \"very high\" risk group, with positive likelihood ratio values ranging from 70.99 to infinity, and good rule-out capacity for the \"very low\" risk group where most negative likelihood ratio values were 0. However, performance declined notably for other risk groups beyond 48 h. Decision curve analysis revealed a diminishing advantage for treatment guided by both models over time. The main limitation of this study is that the baseline performance of the PIERS-ML model was assessed on its development data; however, its baseline performance has also undergone external evaluation.Conclusions: In this study, we have evaluated the performance of the fullPIERS and PIERS-ML models for consecutive prediction. We observed deteriorating performance of both models over time. We recommend using the models for consecutive prediction with greater caution and interpreting predictions with increasing uncertainty as the pregnancy progresses. For clinical practice, models should be adapted to retain accuracy when deployed serially. The performance of future models can be compared with the results of this study to quantify their added value.","PeriodicalId":49008,"journal":{"name":"PLoS Medicine","volume":"22 2","pages":"e1004509"},"PeriodicalIF":9.9000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11793762/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PLoS Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1371/journal.pmed.1004509","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Preeclampsia is a potentially life-threatening pregnancy complication. Among women whose pregnancies are complicated by preeclampsia, the Preeclampsia Integrated Estimate of RiSk (PIERS) models (i.e., the PIERS Machine Learning [PIERS-ML] model, and the logistic regression-based fullPIERS model) accurately identify individuals at greatest or least risk of adverse maternal outcomes within 48 h following admission. Both models were developed and validated to be used as part of initial assessment. In the United Kingdom, the National Institute for Health and Care Excellence (NICE) recommends repeated use of such static models for ongoing assessment beyond the first 48 h. This study evaluated the models' performance during such consecutive prediction.

Methods and findings: This multicountry prospective study used data of 8,843 women (32% white, 30% black, and 26% Asian) with a median age of 31 years. These women, admitted to maternity units in the Americas, sub-Saharan Africa, South Asia, Europe, and Oceania, were diagnosed with preeclampsia at a median gestational age of 35.79 weeks between year 2003 and 2016. The risk differentiation performance of the PIERS-ML and fullPIERS models were assessed for each day within a 2-week post-admission window. The PIERS adverse maternal outcome includes one or more of: death, end-organ complication (cardiorespiratory, renal, hepatic, etc.), or uteroplacental dysfunction (e.g., placental abruption). The main outcome measures were: trajectories of mean risk of each of the uncomplicated course and adverse outcome groups; daily area under the precision-recall curve (AUC-PRC); potential clinical impact (i.e., net benefit in decision curve analysis); dynamic shifts of multiple risk groups; and daily likelihood ratios. In the 2 weeks window, the number of daily outcome events decreased from over 200 to around 10. For both PIERS-ML and fullPIERS models, we observed consistently higher mean risk in the adverse outcome (versus uncomplicated course) group. The AUC-PRC values (0.2-0.4) of the fullPIERS model remained low (i.e., close to the daily fraction of adverse outcomes, indicating low discriminative capacity). The PIERS-ML model's AUC-PRC peaked on day 0 (0.65), and notably decreased thereafter. When categorizing women into multiple risk groups, the PIERS-ML model generally showed good rule-in capacity for the "very high" risk group, with positive likelihood ratio values ranging from 70.99 to infinity, and good rule-out capacity for the "very low" risk group where most negative likelihood ratio values were 0. However, performance declined notably for other risk groups beyond 48 h. Decision curve analysis revealed a diminishing advantage for treatment guided by both models over time. The main limitation of this study is that the baseline performance of the PIERS-ML model was assessed on its development data; however, its baseline performance has also undergone external evaluation.

Conclusions: In this study, we have evaluated the performance of the fullPIERS and PIERS-ML models for consecutive prediction. We observed deteriorating performance of both models over time. We recommend using the models for consecutive prediction with greater caution and interpreting predictions with increasing uncertainty as the pregnancy progresses. For clinical practice, models should be adapted to retain accuracy when deployed serially. The performance of future models can be compared with the results of this study to quantify their added value.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

使用 PIERS-ML 和 fullPIERS 模型连续预测子痫前期的不良孕产结局：一项多国前瞻性观察研究。

背景：子痫前期是一种潜在的危及生命的妊娠并发症。在妊娠合并子痫前期的妇女中，子痫前期综合风险估计（PIERS）模型（即PIERS机器学习[PIERS- ml]模型和基于logistic回归的fullPIERS模型）在入院后48小时内准确识别出不良产妇结局风险最高或最低的个体。这两个模型都被开发和验证，作为初始评估的一部分。在英国，国家健康与护理卓越研究所（NICE）建议在最初的48小时之后重复使用这种静态模型进行持续评估。本研究评估了模型在这种连续预测期间的性能。方法和结果：这项多国前瞻性研究使用了8,843名女性（32%白人，30%黑人，26%亚洲人）的数据，中位年龄为31岁。这些妇女在美洲、撒哈拉以南非洲、南亚、欧洲和大洋洲的产科病房就诊，在2003年至2016年期间，中位胎龄为35.79周时被诊断为先兆子痫。在入院后2周内每天评估PIERS-ML和fullPIERS模型的风险分化表现。PIERS不良产妇结局包括以下一种或多种：死亡、终末器官并发症（心肺、肾、肝等）或子宫胎盘功能障碍（如胎盘早剥）。主要结果测量指标为：各简单病程组和不良结局组的平均风险轨迹；精密度-召回曲线下的日面积；潜在的临床影响（即决策曲线分析中的净收益）；多风险群体的动态转移；以及每日似然比。在两周的窗口期，每日结果事件的数量从200多个减少到10个左右。对于PIERS-ML和fullPIERS模型，我们观察到不良结果组（与无复杂病程组相比）的平均风险始终较高。fullPIERS模型的AUC-PRC值（0.2-0.4）仍然很低（即接近不良结果的每日分数，表明判别能力低）。PIERS-ML模型的AUC-PRC在第0天达到峰值（0.65），此后显著下降。在将女性划分为多个风险组时，PIERS-ML模型一般对“极高”风险组表现出良好的规则适应能力，其正似然比值范围从70.99到无穷大，对“极低”风险组表现出良好的排除能力，其中大多数负似然比值为0。然而，超过48小时后，其他风险组的表现明显下降。决策曲线分析显示，随着时间的推移，两种模型指导的治疗优势逐渐减弱。本研究的主要局限性在于，PIERS-ML模型的基线性能是根据其开发数据进行评估的；然而，其基线绩效也经历了外部评价。结论：在本研究中，我们评估了fullPIERS和PIERS-ML模型连续预测的性能。我们观察到，随着时间的推移，两种模型的性能都在恶化。我们建议使用该模型进行更谨慎的连续预测，并随着妊娠的进展解释不确定性增加的预测。在临床实践中，模型应适应连续部署时保持准确性。未来模型的性能可以与本研究的结果进行比较，以量化其附加价值。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

PLoS Medicine MEDICINE, GENERAL & INTERNAL-

CiteScore

17.60

自引率

0.60%

发文量

227

审稿时长

4-8 weeks

期刊介绍： PLOS Medicine is a prominent platform for discussing and researching global health challenges. The journal covers a wide range of topics, including biomedical, environmental, social, and political factors affecting health. It prioritizes articles that contribute to clinical practice, health policy, or a better understanding of pathophysiology, ultimately aiming to improve health outcomes across different settings. The journal is unwavering in its commitment to uphold the highest ethical standards in medical publishing. This includes actively managing and disclosing any conflicts of interest related to reporting, reviewing, and publishing. PLOS Medicine promotes transparency in the entire review and publication process. The journal also encourages data sharing and encourages the reuse of published work. Additionally, authors retain copyright for their work, and the publication is made accessible through Open Access with no restrictions on availability and dissemination. PLOS Medicine takes measures to avoid conflicts of interest associated with advertising drugs and medical devices or engaging in the exclusive sale of reprints.