基于自编码器的血液透析患者死亡率预测。

IF 3.7 2区 医学 Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS International Journal of Medical Informatics Pub Date : 2024-12-01 DOI:10.1016/j.ijmedinf.2024.105744
Shuzhi Su , Jisheng Gao , Jingjing Dong , Qi Guo , Hualin Ma , Shaodong Luan , Xuejia Zheng , Huihui Tao , Lingling Zhou , Yong Dai
{"title":"基于自编码器的血液透析患者死亡率预测。","authors":"Shuzhi Su ,&nbsp;Jisheng Gao ,&nbsp;Jingjing Dong ,&nbsp;Qi Guo ,&nbsp;Hualin Ma ,&nbsp;Shaodong Luan ,&nbsp;Xuejia Zheng ,&nbsp;Huihui Tao ,&nbsp;Lingling Zhou ,&nbsp;Yong Dai","doi":"10.1016/j.ijmedinf.2024.105744","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Patients with end-stage renal disease (ESRD) undergoing hemodialysis (HD) exhibit a high mortality risk, particularly at the onset of treatment. Conventional risk assessment models, dependent on extensive temporal data accumulation, frequently encounter issues of data incompleteness and lengthy collection periods.</div></div><div><h3>Objective</h3><div>This study addresses the imbalance in short-term HD data and the issue of missing data features, achieving a robust assessment of mortality risk for HD patients over the subsequent 30 to 450 days.</div></div><div><h3>Methods</h3><div>An autoencoder-based mortality prediction model for HD patients is proposed. Leveraging the manifold structure of the non-missing features and the intrinsic relationship between the features in the high-dimensional data space, the model infers the values of the missing features. Noise and redundant information typically distort the manifold structure, impacting the accuracy of inferences about missing features. Consequently, we generate feature dropping masks to simulate the missing data distribution in the deep learning framework and design an autoencoder, forming an adaptive feature extraction module. The module utilizes readily available short-term data for unsupervised learning, enabling the encoder to reconstruct missing features and derive latent representations. Finally, a classifier based on the latent representations achieves the mortality prediction.</div></div><div><h3>Results</h3><div>Over a 30-day observation window, the model demonstrated superior mortality prediction performance compared to other models across all prediction windows. Feature importance analysis showed that creatinine and age are consistently the most critical features across all prediction windows. Glucose (fasting) and platelet count also remain significant, with their correlation with mortality risk increasing over time. Serum albumin, international standard ratio, and phosphate are particularly important for short-term predictions, while conjugated bilirubin and prothrombin time gain prominence in mid- and long-term predictions.</div></div><div><h3>Conclusion</h3><div>The proposed model proficiently leverages short-term HD data to provide precise mortality risk evaluations in HD patients, with particular efficacy in the short-term. Its application holds considerable value for clinical decision-making and risk management in this patient population.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"195 ","pages":"Article 105744"},"PeriodicalIF":3.7000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction of mortality in hemodialysis patients based on autoencoders\",\"authors\":\"Shuzhi Su ,&nbsp;Jisheng Gao ,&nbsp;Jingjing Dong ,&nbsp;Qi Guo ,&nbsp;Hualin Ma ,&nbsp;Shaodong Luan ,&nbsp;Xuejia Zheng ,&nbsp;Huihui Tao ,&nbsp;Lingling Zhou ,&nbsp;Yong Dai\",\"doi\":\"10.1016/j.ijmedinf.2024.105744\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>Patients with end-stage renal disease (ESRD) undergoing hemodialysis (HD) exhibit a high mortality risk, particularly at the onset of treatment. Conventional risk assessment models, dependent on extensive temporal data accumulation, frequently encounter issues of data incompleteness and lengthy collection periods.</div></div><div><h3>Objective</h3><div>This study addresses the imbalance in short-term HD data and the issue of missing data features, achieving a robust assessment of mortality risk for HD patients over the subsequent 30 to 450 days.</div></div><div><h3>Methods</h3><div>An autoencoder-based mortality prediction model for HD patients is proposed. Leveraging the manifold structure of the non-missing features and the intrinsic relationship between the features in the high-dimensional data space, the model infers the values of the missing features. Noise and redundant information typically distort the manifold structure, impacting the accuracy of inferences about missing features. Consequently, we generate feature dropping masks to simulate the missing data distribution in the deep learning framework and design an autoencoder, forming an adaptive feature extraction module. The module utilizes readily available short-term data for unsupervised learning, enabling the encoder to reconstruct missing features and derive latent representations. Finally, a classifier based on the latent representations achieves the mortality prediction.</div></div><div><h3>Results</h3><div>Over a 30-day observation window, the model demonstrated superior mortality prediction performance compared to other models across all prediction windows. Feature importance analysis showed that creatinine and age are consistently the most critical features across all prediction windows. Glucose (fasting) and platelet count also remain significant, with their correlation with mortality risk increasing over time. Serum albumin, international standard ratio, and phosphate are particularly important for short-term predictions, while conjugated bilirubin and prothrombin time gain prominence in mid- and long-term predictions.</div></div><div><h3>Conclusion</h3><div>The proposed model proficiently leverages short-term HD data to provide precise mortality risk evaluations in HD patients, with particular efficacy in the short-term. Its application holds considerable value for clinical decision-making and risk management in this patient population.</div></div>\",\"PeriodicalId\":54950,\"journal\":{\"name\":\"International Journal of Medical Informatics\",\"volume\":\"195 \",\"pages\":\"Article 105744\"},\"PeriodicalIF\":3.7000,\"publicationDate\":\"2024-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1386505624004076\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505624004076","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

背景:接受血液透析(HD)的终末期肾病(ESRD)患者表现出高死亡率,特别是在治疗开始时。传统的风险评估模型依赖于大量的时间数据积累,经常遇到数据不完整和收集周期长的问题。目的:本研究解决了HD短期数据的不平衡和数据特征缺失的问题,实现了HD患者随后30至450天死亡风险的可靠评估。方法:提出一种基于自编码器的HD患者死亡率预测模型。该模型利用高维数据空间中非缺失特征的流形结构和特征之间的内在关系,推断出缺失特征的值。噪声和冗余信息通常会扭曲流形结构,影响对缺失特征的推断的准确性。因此,我们生成特征删除掩码来模拟深度学习框架中的缺失数据分布,并设计一个自编码器,形成自适应特征提取模块。该模块利用易于获得的短期数据进行无监督学习,使编码器能够重建缺失的特征并获得潜在表征。最后,基于潜在表征的分类器实现死亡率预测。结果:在30天的观察窗口中,与其他模型相比,该模型在所有预测窗口中都表现出优越的死亡率预测性能。特征重要性分析表明,肌酐和年龄始终是所有预测窗口中最关键的特征。血糖(空腹)和血小板计数也很重要,它们与死亡风险的相关性随着时间的推移而增加。血清白蛋白、国际标准比值和磷酸盐对于短期预测尤为重要,而结合胆红素和凝血酶原时间在中长期预测中尤为重要。结论:所提出的模型熟练地利用HD短期数据,为HD患者提供精确的死亡风险评估,在短期内具有特别的疗效。其应用对该患者群体的临床决策和风险管理具有相当大的价值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Prediction of mortality in hemodialysis patients based on autoencoders

Background

Patients with end-stage renal disease (ESRD) undergoing hemodialysis (HD) exhibit a high mortality risk, particularly at the onset of treatment. Conventional risk assessment models, dependent on extensive temporal data accumulation, frequently encounter issues of data incompleteness and lengthy collection periods.

Objective

This study addresses the imbalance in short-term HD data and the issue of missing data features, achieving a robust assessment of mortality risk for HD patients over the subsequent 30 to 450 days.

Methods

An autoencoder-based mortality prediction model for HD patients is proposed. Leveraging the manifold structure of the non-missing features and the intrinsic relationship between the features in the high-dimensional data space, the model infers the values of the missing features. Noise and redundant information typically distort the manifold structure, impacting the accuracy of inferences about missing features. Consequently, we generate feature dropping masks to simulate the missing data distribution in the deep learning framework and design an autoencoder, forming an adaptive feature extraction module. The module utilizes readily available short-term data for unsupervised learning, enabling the encoder to reconstruct missing features and derive latent representations. Finally, a classifier based on the latent representations achieves the mortality prediction.

Results

Over a 30-day observation window, the model demonstrated superior mortality prediction performance compared to other models across all prediction windows. Feature importance analysis showed that creatinine and age are consistently the most critical features across all prediction windows. Glucose (fasting) and platelet count also remain significant, with their correlation with mortality risk increasing over time. Serum albumin, international standard ratio, and phosphate are particularly important for short-term predictions, while conjugated bilirubin and prothrombin time gain prominence in mid- and long-term predictions.

Conclusion

The proposed model proficiently leverages short-term HD data to provide precise mortality risk evaluations in HD patients, with particular efficacy in the short-term. Its application holds considerable value for clinical decision-making and risk management in this patient population.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
International Journal of Medical Informatics
International Journal of Medical Informatics 医学-计算机:信息系统
CiteScore
8.90
自引率
4.10%
发文量
217
审稿时长
42 days
期刊介绍: International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings. The scope of journal covers: Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.; Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc. Educational computer based programs pertaining to medical informatics or medicine in general; Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.
期刊最新文献
Machine learning for predicting outcomes of transcatheter aortic valve implantation: A systematic review AI-driven triage in emergency departments: A review of benefits, challenges, and future directions Predicting cancer survival at different stages: Insights from fair and explainable machine learning approaches The fading structural prominence of explanations in clinical studies Utilization, challenges, and training needs of digital health technologies: Perspectives from healthcare professionals
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1