{"title":"Unsupervised learning-derived phenotypes for personalized fluid management in critically ill patients with heart failure: A multicenter study","authors":"Chengjian Guan, Angwei Gong, Yan Zhao, Hangtian Yu, Shuaidan Zhang, Zhiyi Xie, Yehui Jin, Xiuchun Yang, Jingchao Lu, Bing Xiao","doi":"10.1002/ctm2.70081","DOIUrl":null,"url":null,"abstract":"<p>Dear Editor,</p><p>Fluid balance management in critically ill heart failure (HF) patients remains a formidable clinical challenge. While clinicians typically aim for net negative fluid balance to alleviate symptoms, recent studies employing fixed strategies have yielded inconsistent results.<span><sup>1, 2</sup></span> The 2024 Heart Failure Association guidelines of the European Society of Cardiology emphasized the importance of individualized fluid balance strategies, particularly for critically ill patients.<span><sup>3</sup></span> Our study introduces a novel approach using unsupervised learning to identify four distinct phenotypes of critically ill HF patients, each with unique clinical characteristics and fluid balance requirements. To facilitate clinical application, we have developed a user-friendly interface that enables rapid phenotype identification and customized fluid management.</p><p>We utilized two non-overlapping databases: III-CareVue subset and IV versions of the Intensive Care Medical Information Marketplace (MIMIC)<span><sup>4</sup></span> for training cohorts and the eICU Collaborative Research Database (eICU)<span><sup>5</sup></span> for external validation (Method S1). The MIMIC cohort comprised 5998 patients, while the eICU cohort included 2549 patients (Figure S1). We initially extracted 56 variables from the first day of ICU admission. After eliminating variables with more than 30% missing data, 47 variables remained, encompassing demographics, comorbidities, laboratory values, vital signs, interventions, and severity scores. To ensure a balanced contribution of characteristics, all data underwent cleaning and normalization (Method S2, Figure S2). In-hospital mortality served as our primary outcome, with ICU length of stay and total hospital length of stay as secondary outcomes.</p><p>Uniform Manifold Approximation and Projection (UMAP) was used to determine that there were no differences in clinical characteristics between the two training databases (Figure S3). To classify patients, we applied the K-prototypes clustering algorithm, which effectively accommodates mixed numerical and categorical attributes while preserving the characteristics of factorial variables (Method S3). The optimal number of clusters was determined using standard tests, considering both statistical metrics and clinical relevance. This approach ultimately identified four distinct phenotypes (Figure S4).</p><p>Comparative analysis of these phenotypes revealed distinct clinical profiles (Figure 1, Table 1, Table S1). Phenotype A was characterized by aggressive interventions and inflammation, including high rates of vasoactive drug use, antibiotic use, and mechanical ventilation. This group also exhibited the highest white blood cell count and chloride levels, coupled with the lowest platelet count. Phenotype B represented the mildest form with the most favourable prognosis. Phenotype C was distinguished by the highest mean age, lowest body weight, higher comorbidity burden, and second-highest mortality rate, despite having the lowest Sequential Organ Failure Assessment (SOFA) score. Phenotype D presented the most severe clinical profile with a poor prognosis. Short-and long-term survival outcomes differed significantly among these phenotypes, with Phenotype D showing the worst prognosis and Phenotype B the best prognosis (Figure 2A,B). To validate our findings, we conducted correlation analysis among continuous variables and excluded highly correlated factors before re-clustering. The resulting phenotypes retained consistent characteristics, confirming the stability of our clustering method (Figures S5–S7).</p><p>Additionally, we investigated phenotype-specific fluid management strategies using net daily fluid balance data, adjusted for demographic factors, laboratory parameters, vital signs, and interventions. The impact of these strategies on in-hospital mortality was analyzed using the parametric g-formula (Method S4), focusing on the first seven days of ICU admission. Figure 2C illustrates how different fluid management strategies influence in-hospital mortality across the four patient phenotypes over 7 days: Phenotype A presented with severe respiratory failure, shock, and inflammation, and benefited from fluid balances between −1000 and 500 mL daily. This aligned with recent studies on acute respiratory distress syndrome (ARDS) and ventilator-related events, confirming the adverse effects of positive fluid balance on mechanical ventilation duration and mortality.<span><sup>6, 7</sup></span> Phenotype C, despite milder clinical parameters, exhibited significantly higher mortality rates compared to Phenotype B, likely due to older age and multiple comorbidities. We recommended a daily net fluid balance ranging from −1500 and 500 mL for this group, underscoring the impact of age and frailty on HF prognosis.<span><sup>8</sup></span> Phenotype D, characterized by severe metabolic derangements including acidosis, renal dysfunction, and sepsis, had the poorest prognosis. Our results suggested a more restrictive fluid strategy (−2000 to −500 mL daily), consistent with studies demonstrating adverse effects of positive fluid balance in populations with kidney disease and sepsis.<span><sup>9, 10</sup></span> Phenotype B showed no clear benefit from specific fluid management strategies, warranting further investigation to determine whether this reflected the relative mildness of their condition or methodological limitations.</p><p>To facilitate the efficient classification of HF phenotypes across different cohorts, we developed a machine learning-based classification model. The Joint Mutual Information Maximiza (JMIM) method identified nine variables with a feature importance score >.8, including age, blood urea nitrogen (BUN), hematocrit, vasoactive drugs, renal disease, creatinine, diastolic blood pressure (DBP), mechanical ventilation, and anion gap (Figure 2D). Based on benchmark tests (Table S3), we selected the Extreme Gradient Boosting (XGBoost) model for phenotype classification. The model achieved high predictive performance in MIMIC (AUC:.918–.943) (Figure 2E) and satisfactory performance in the eICU cohort used for external validation (AUC:.802–.907) (Figure S9). We evaluated the performance metrics and decision curve analysis of the model, and the results showed that the XGBoost model had good performance and clinical net benefit (Table S4, Figure S8). Then, we conducted an interpretability analysis to visualize the model's decision-making process (Figures S10–S12). To support clinical application, we developed a web-based tool (https://7kdtqk-guanchengcheng.shinyapps.io/hf_phenotype/).</p><p>While our study provided valuable insights, several limitations warranted acknowledgement. First, the retrospective nature of the study precluded clear causal inference. Additionally, the absence of some important variables (such as ejection fraction and natriuretic peptides) might have led to the omission of potentially significant factors. Future randomized controlled trials were necessary to confirm the efficacy of phenotype-specific fluid management strategies. Second, our analysis primarily focused on net fluid intake and its impact on prognosis. Further research was needed to explore the effects of infusion rates and individual fluid responsiveness on patient outcomes.</p><p>In summary, we have identified four distinct phenotypes of critically ill heart failure patients, each with unique clinical characteristics and fluid management needs. Our novel classification model and user interface facilitated rapid phenotype identification, enabling personalized fluid management strategies.</p><p>Bing Xiao contributed to the research design. Chengjian Guan, Angwei Gong, and Yan Zhao contributed to data collection, data processing, and graphing. Chengjian Guan, Hangtian Yu, and Shuaidan Zhang conducted model construction and deployment. Zhiyi Xie, Yehui Jin, Xiuchun Yang, and Jingchao Lu contributed to data proofreading and formal analysis. Chengjian Guan and Angwei Gong contributed to the writing of the manuscript. Xiao Bing contributed to the review and editing. All authors have read and approved the final manuscript.</p><p>The authors declare no conflict of interest.</p><p>This study was conducted in accordance with the Declaration of Helsinki. Since the public databases used in this study all use de-identified data, individual informed consent is not required, so we informed the Ethics Committee of this without a written report.</p>","PeriodicalId":10189,"journal":{"name":"Clinical and Translational Medicine","volume":"14 11","pages":""},"PeriodicalIF":7.9000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11546239/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Translational Medicine","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ctm2.70081","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Dear Editor,
Fluid balance management in critically ill heart failure (HF) patients remains a formidable clinical challenge. While clinicians typically aim for net negative fluid balance to alleviate symptoms, recent studies employing fixed strategies have yielded inconsistent results.1, 2 The 2024 Heart Failure Association guidelines of the European Society of Cardiology emphasized the importance of individualized fluid balance strategies, particularly for critically ill patients.3 Our study introduces a novel approach using unsupervised learning to identify four distinct phenotypes of critically ill HF patients, each with unique clinical characteristics and fluid balance requirements. To facilitate clinical application, we have developed a user-friendly interface that enables rapid phenotype identification and customized fluid management.
We utilized two non-overlapping databases: III-CareVue subset and IV versions of the Intensive Care Medical Information Marketplace (MIMIC)4 for training cohorts and the eICU Collaborative Research Database (eICU)5 for external validation (Method S1). The MIMIC cohort comprised 5998 patients, while the eICU cohort included 2549 patients (Figure S1). We initially extracted 56 variables from the first day of ICU admission. After eliminating variables with more than 30% missing data, 47 variables remained, encompassing demographics, comorbidities, laboratory values, vital signs, interventions, and severity scores. To ensure a balanced contribution of characteristics, all data underwent cleaning and normalization (Method S2, Figure S2). In-hospital mortality served as our primary outcome, with ICU length of stay and total hospital length of stay as secondary outcomes.
Uniform Manifold Approximation and Projection (UMAP) was used to determine that there were no differences in clinical characteristics between the two training databases (Figure S3). To classify patients, we applied the K-prototypes clustering algorithm, which effectively accommodates mixed numerical and categorical attributes while preserving the characteristics of factorial variables (Method S3). The optimal number of clusters was determined using standard tests, considering both statistical metrics and clinical relevance. This approach ultimately identified four distinct phenotypes (Figure S4).
Comparative analysis of these phenotypes revealed distinct clinical profiles (Figure 1, Table 1, Table S1). Phenotype A was characterized by aggressive interventions and inflammation, including high rates of vasoactive drug use, antibiotic use, and mechanical ventilation. This group also exhibited the highest white blood cell count and chloride levels, coupled with the lowest platelet count. Phenotype B represented the mildest form with the most favourable prognosis. Phenotype C was distinguished by the highest mean age, lowest body weight, higher comorbidity burden, and second-highest mortality rate, despite having the lowest Sequential Organ Failure Assessment (SOFA) score. Phenotype D presented the most severe clinical profile with a poor prognosis. Short-and long-term survival outcomes differed significantly among these phenotypes, with Phenotype D showing the worst prognosis and Phenotype B the best prognosis (Figure 2A,B). To validate our findings, we conducted correlation analysis among continuous variables and excluded highly correlated factors before re-clustering. The resulting phenotypes retained consistent characteristics, confirming the stability of our clustering method (Figures S5–S7).
Additionally, we investigated phenotype-specific fluid management strategies using net daily fluid balance data, adjusted for demographic factors, laboratory parameters, vital signs, and interventions. The impact of these strategies on in-hospital mortality was analyzed using the parametric g-formula (Method S4), focusing on the first seven days of ICU admission. Figure 2C illustrates how different fluid management strategies influence in-hospital mortality across the four patient phenotypes over 7 days: Phenotype A presented with severe respiratory failure, shock, and inflammation, and benefited from fluid balances between −1000 and 500 mL daily. This aligned with recent studies on acute respiratory distress syndrome (ARDS) and ventilator-related events, confirming the adverse effects of positive fluid balance on mechanical ventilation duration and mortality.6, 7 Phenotype C, despite milder clinical parameters, exhibited significantly higher mortality rates compared to Phenotype B, likely due to older age and multiple comorbidities. We recommended a daily net fluid balance ranging from −1500 and 500 mL for this group, underscoring the impact of age and frailty on HF prognosis.8 Phenotype D, characterized by severe metabolic derangements including acidosis, renal dysfunction, and sepsis, had the poorest prognosis. Our results suggested a more restrictive fluid strategy (−2000 to −500 mL daily), consistent with studies demonstrating adverse effects of positive fluid balance in populations with kidney disease and sepsis.9, 10 Phenotype B showed no clear benefit from specific fluid management strategies, warranting further investigation to determine whether this reflected the relative mildness of their condition or methodological limitations.
To facilitate the efficient classification of HF phenotypes across different cohorts, we developed a machine learning-based classification model. The Joint Mutual Information Maximiza (JMIM) method identified nine variables with a feature importance score >.8, including age, blood urea nitrogen (BUN), hematocrit, vasoactive drugs, renal disease, creatinine, diastolic blood pressure (DBP), mechanical ventilation, and anion gap (Figure 2D). Based on benchmark tests (Table S3), we selected the Extreme Gradient Boosting (XGBoost) model for phenotype classification. The model achieved high predictive performance in MIMIC (AUC:.918–.943) (Figure 2E) and satisfactory performance in the eICU cohort used for external validation (AUC:.802–.907) (Figure S9). We evaluated the performance metrics and decision curve analysis of the model, and the results showed that the XGBoost model had good performance and clinical net benefit (Table S4, Figure S8). Then, we conducted an interpretability analysis to visualize the model's decision-making process (Figures S10–S12). To support clinical application, we developed a web-based tool (https://7kdtqk-guanchengcheng.shinyapps.io/hf_phenotype/).
While our study provided valuable insights, several limitations warranted acknowledgement. First, the retrospective nature of the study precluded clear causal inference. Additionally, the absence of some important variables (such as ejection fraction and natriuretic peptides) might have led to the omission of potentially significant factors. Future randomized controlled trials were necessary to confirm the efficacy of phenotype-specific fluid management strategies. Second, our analysis primarily focused on net fluid intake and its impact on prognosis. Further research was needed to explore the effects of infusion rates and individual fluid responsiveness on patient outcomes.
In summary, we have identified four distinct phenotypes of critically ill heart failure patients, each with unique clinical characteristics and fluid management needs. Our novel classification model and user interface facilitated rapid phenotype identification, enabling personalized fluid management strategies.
Bing Xiao contributed to the research design. Chengjian Guan, Angwei Gong, and Yan Zhao contributed to data collection, data processing, and graphing. Chengjian Guan, Hangtian Yu, and Shuaidan Zhang conducted model construction and deployment. Zhiyi Xie, Yehui Jin, Xiuchun Yang, and Jingchao Lu contributed to data proofreading and formal analysis. Chengjian Guan and Angwei Gong contributed to the writing of the manuscript. Xiao Bing contributed to the review and editing. All authors have read and approved the final manuscript.
The authors declare no conflict of interest.
This study was conducted in accordance with the Declaration of Helsinki. Since the public databases used in this study all use de-identified data, individual informed consent is not required, so we informed the Ethics Committee of this without a written report.
期刊介绍:
Clinical and Translational Medicine (CTM) is an international, peer-reviewed, open-access journal dedicated to accelerating the translation of preclinical research into clinical applications and fostering communication between basic and clinical scientists. It highlights the clinical potential and application of various fields including biotechnologies, biomaterials, bioengineering, biomarkers, molecular medicine, omics science, bioinformatics, immunology, molecular imaging, drug discovery, regulation, and health policy. With a focus on the bench-to-bedside approach, CTM prioritizes studies and clinical observations that generate hypotheses relevant to patients and diseases, guiding investigations in cellular and molecular medicine. The journal encourages submissions from clinicians, researchers, policymakers, and industry professionals.