Health data science最新文献

英文中文

Artificial Intelligence-Enabled 8-Channel ECG Diagnosing of Abnormalities with Wide QRS Complexes. 人工智能支持的8通道心电图诊断宽QRS复合物异常。

Health data science

Pub Date : 2026-02-05 eCollection Date: 2026-01-01 DOI: 10.34133/hds.0265

Hongling Zhu, Qiushi Luo, Yao Wang, Mairihaba Maimaiti, Heng Zhang, Yulong Xiong, Chen Ruan, Jingyi Wang, Yedan Liu, Mengqiao Zhou, Yinan Sun, Wei Chen, E Jin, Jin Li, Xia Chen, Tao Zhu, Xiaoyun Yang

Background: There is a substantial number of research exploring the application of artificial intelligence (AI) in identifying electrocardiogram (ECG) abnormalities related to heart rhythm or conduction with the 12-channel format. However, there is a scarcity of studies focusing on refined differentiation of serials of ECG abnormalities with wide QRS complexes in a simplified channel format. Methods: We constructed an ECG dataset (standard 10-s, 12-channel format) from adult patients from Tongji Hospital of Huazhong University of Science and Technology, Wuhan, China. This dataset was consisted of 5 kinds of ECG abnormalities with wide QRS complexes in the normal heartbeat (60 to 100 beats per minute) and the normal ECGs. Convolutional neural network was developed to classify these abnormalities. Four-channel (I, II, V1, and V5) and 8-channel (I, II, and V1 to V6) formats, compared to the standard 12-channel format (I, II, III, aVR, aVL, aVF, and V1 to V6), were chosen as the input channel format of the model. Other unreplicated ECGs from Tongji Hospital (TJ-Test set), annotated by a committee of board-certified cardiologists, served as the test dataset. The F1 score, area under the receiver operating characteristic curve (AUROC), and accuracy were calculated to assess the performance of the model, which were further compared with diagnoses of 6 ECG cardiologists who were informed that the final objective was classifying among 6 classes with the 12-channel format. In addition, a dataset of 291 ECGs from The First People's Hospital of Jiangxia District (JX-Test set) and a public dataset of 64 ECGs were used to assess model generalizability Results: The dataset consisted of 11,808 ECGs from 8,542 patients from 2012 January 1 to 2020 November 30 and divided into training and validation datasets in the ratio of 9:1. The test dataset included unreplicated 480 ECGs from 480 new adult patients recorded from 2014 January 1 to 2017 November 30. The model shows a superior performance in the 8-channel format compared to that of 4- and 12-channel formats. As for the 8-channel format, the model obtained an accuracy of 95.0%, a mean F1 score of 0.969 (0.943 to 0.997), and a mean AUROC score of 0.997 (0.975 to 1.00) compared to an accuracy of 89.9%, an F1 score of 0.898 (0.863 to 0.932), and an AUROC score of 0.941 (0.918 to 0.963) of physicians assessing the same datasets. The model exhibited a mean F1 score of 0.917 (0.943 to 0.997) and a mean AUROC score of 0.994 (0.975 to 1.00) on the JX-Test set, and mean F1 scores of 0.708 for the left bundle branch block and 0.828 for the right bundle branch block for the external published validation data both with the 8-channel format. Conclusion: Our model distinguishes a range of distinct abnormalities focusing on abnormal morphology on QRS complexes in normal heartbeats with high accuracy, providing a foundation for AI-aided clinical decision-support systems in

背景：有大量的研究探索人工智能（AI）在识别与心律或传导相关的12通道格式心电图（ECG）异常中的应用。然而，在简化通道格式下对具有宽QRS复合物的ECG异常序列进行精细区分的研究却很少。方法：构建中国武汉华中科技大学同济医院成人患者心电图数据集（标准10-s， 12通道格式）。该数据集由正常心跳（60 ~ 100次/分钟）和正常心电图中5种具有宽QRS复合物的ECG异常组成。利用卷积神经网络对这些异常进行分类。与标准的12通道格式（I、II、III、aVR、aVL、aVF、V1至V6）相比，选择4通道（I、II、V1、V5）和8通道（I、II、V1至V6）作为模型的输入通道格式。来自同济医院的其他未复制的心电图（tj测试集），由董事会认证的心脏病专家委员会注释，作为测试数据集。计算F1评分、受试者工作特征曲线下面积（AUROC）和准确率来评估模型的性能，并进一步与6名心内科医生的诊断进行比较，这些医生被告知最终目标是在12通道格式的6个类别中进行分类。此外，使用江夏区第一人民医院291张心电图数据集（JX-Test集）和64张公开数据集来评估模型的可泛化性。结果：该数据集由2012年1月1日至2020年11月30日8,542例患者的11,808张心电图组成，并按9:1的比例分为训练和验证数据集。测试数据集包括2014年1月1日至2017年11月30日记录的480名新成年患者的480张未复制的心电图。与4通道和12通道格式相比，该模型在8通道格式中表现出优越的性能。对于8通道格式，模型的准确率为95.0%，平均F1评分为0.969(0.943 ~ 0.997)，平均AUROC评分为0.997(0.975 ~ 1.00)，而评估相同数据集的医生的准确率为89.9%，F1评分为0.898 (0.863 ~ 0.932)，AUROC评分为0.941（0.918 ~ 0.963）。该模型在JX-Test集上的平均F1得分为0.917(0.943 ~ 0.997)，平均AUROC得分为0.994(0.975 ~ 1.00)，在8通道格式的外部发布验证数据中，左束分支块的平均F1得分为0.708，右束分支块的平均F1得分为0.828。结论：我们的模型以正常心跳中QRS复合物的异常形态为重点，区分了一系列明显的异常，准确率高，为人工智能辅助临床决策支持系统在ECG鉴别诊断中的应用提供了基础。

{"title":"Artificial Intelligence-Enabled 8-Channel ECG Diagnosing of Abnormalities with Wide QRS Complexes.","authors":"Hongling Zhu, Qiushi Luo, Yao Wang, Mairihaba Maimaiti, Heng Zhang, Yulong Xiong, Chen Ruan, Jingyi Wang, Yedan Liu, Mengqiao Zhou, Yinan Sun, Wei Chen, E Jin, Jin Li, Xia Chen, Tao Zhu, Xiaoyun Yang","doi":"10.34133/hds.0265","DOIUrl":"https://doi.org/10.34133/hds.0265","url":null,"abstract":"Background: There is a substantial number of research exploring the application of artificial intelligence (AI) in identifying electrocardiogram (ECG) abnormalities related to heart rhythm or conduction with the 12-channel format. However, there is a scarcity of studies focusing on refined differentiation of serials of ECG abnormalities with wide QRS complexes in a simplified channel format. Methods: We constructed an ECG dataset (standard 10-s, 12-channel format) from adult patients from Tongji Hospital of Huazhong University of Science and Technology, Wuhan, China. This dataset was consisted of 5 kinds of ECG abnormalities with wide QRS complexes in the normal heartbeat (60 to 100 beats per minute) and the normal ECGs. Convolutional neural network was developed to classify these abnormalities. Four-channel (I, II, V1, and V5) and 8-channel (I, II, and V1 to V6) formats, compared to the standard 12-channel format (I, II, III, aVR, aVL, aVF, and V1 to V6), were chosen as the input channel format of the model. Other unreplicated ECGs from Tongji Hospital (TJ-Test set), annotated by a committee of board-certified cardiologists, served as the test dataset. The F1 score, area under the receiver operating characteristic curve (AUROC), and accuracy were calculated to assess the performance of the model, which were further compared with diagnoses of 6 ECG cardiologists who were informed that the final objective was classifying among 6 classes with the 12-channel format. In addition, a dataset of 291 ECGs from The First People's Hospital of Jiangxia District (JX-Test set) and a public dataset of 64 ECGs were used to assess model generalizability Results: The dataset consisted of 11,808 ECGs from 8,542 patients from 2012 January 1 to 2020 November 30 and divided into training and validation datasets in the ratio of 9:1. The test dataset included unreplicated 480 ECGs from 480 new adult patients recorded from 2014 January 1 to 2017 November 30. The model shows a superior performance in the 8-channel format compared to that of 4- and 12-channel formats. As for the 8-channel format, the model obtained an accuracy of 95.0%, a mean F1 score of 0.969 (0.943 to 0.997), and a mean AUROC score of 0.997 (0.975 to 1.00) compared to an accuracy of 89.9%, an F1 score of 0.898 (0.863 to 0.932), and an AUROC score of 0.941 (0.918 to 0.963) of physicians assessing the same datasets. The model exhibited a mean F1 score of 0.917 (0.943 to 0.997) and a mean AUROC score of 0.994 (0.975 to 1.00) on the JX-Test set, and mean F1 scores of 0.708 for the left bundle branch block and 0.828 for the right bundle branch block for the external published validation data both with the 8-channel format. Conclusion: Our model distinguishes a range of distinct abnormalities focusing on abnormal morphology on QRS complexes in normal heartbeats with high accuracy, providing a foundation for AI-aided clinical decision-support systems in ","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"6 ","pages":"0265"},"PeriodicalIF":0.0,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12873062/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146144532","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Glycosylation Profiles in Cardiovascular Diseases: A Bibliometric Analysis. 心血管疾病的糖基化特征：文献计量学分析。

Health data science

Pub Date : 2026-02-03 eCollection Date: 2026-01-01 DOI: 10.34133/hds.0409

Tianqi Chang, Jingyu Wang, Chenyu Fan, Yuzhou Xue, Jiaxing Wang, Ming Xu

Background: Cardiovascular diseases (CVDs) continue to be the leading cause of morbidity and mortality globally, indicating a major global health burden. Glycosylation, one of the key posttranslational modifications of proteins, plays an important role in the onset and progression of CVDs. This study employed bibliometric analysis to examine the research on glycosylation and CVDs, aiming to identify the evolution and hotspots in this field. Methods: A total of 1,441 publications published from 2010 January 1 to 2024 December 31 were extracted from the Web of Science Core Collection. The analysis included a visual and descriptive examination of publication trends, countries/regions, institutions, keywords, and references. Results: The United States is the most productive country/region in this field, followed closely by China. The University of Alabama at Birmingham has made the most important contribution to this area. Key research hotspots include "O-GlcNAcylation", "biomarkers", "angiogenesis", "α-dystroglycan", "potassium channel", "heart failure", "gene expression", "glycosylation", and "cardiac glycosides". Conclusion: Research on glycosylation in CVDs has shown a steady increase in recent years. Among these studies, O-GlcNAcylation plays a pivotal role in this field. This comprehensive bibliometric analysis of glycosylation and CVDs provides researchers with valuable, objective insights to support future investigations.

背景：心血管疾病（cvd）仍然是全球发病率和死亡率的主要原因，表明一个主要的全球健康负担。糖基化是蛋白质翻译后修饰的关键之一，在心血管疾病的发生和发展中起着重要作用。本研究采用文献计量学分析的方法对糖基化与心血管疾病的研究进行了梳理，旨在了解该领域的发展历程和研究热点。方法：从Web of Science核心馆藏中提取2010年1月1日至2024年12月31日发表的1441篇论文。分析包括对出版趋势、国家/地区、机构、关键词和参考文献的视觉和描述性检查。结果：美国是该领域产量最高的国家/地区，中国紧随其后。阿拉巴马大学伯明翰分校在这方面做出了最重要的贡献。重点研究热点包括“o - glcn酰化”、“生物标志物”、“血管生成”、“α-糖醛酸失调”、“钾通道”、“心力衰竭”、“基因表达”、“糖基化”、“心脏糖苷”等。结论：近年来对心血管疾病糖基化的研究呈稳步增长趋势。在这些研究中，o - glcnac酰化在这一领域起着举足轻重的作用。糖基化和心血管疾病的综合文献计量学分析为研究人员提供了有价值的、客观的见解，以支持未来的研究。

{"title":"Glycosylation Profiles in Cardiovascular Diseases: A Bibliometric Analysis.","authors":"Tianqi Chang, Jingyu Wang, Chenyu Fan, Yuzhou Xue, Jiaxing Wang, Ming Xu","doi":"10.34133/hds.0409","DOIUrl":"10.34133/hds.0409","url":null,"abstract":"Background: Cardiovascular diseases (CVDs) continue to be the leading cause of morbidity and mortality globally, indicating a major global health burden. Glycosylation, one of the key posttranslational modifications of proteins, plays an important role in the onset and progression of CVDs. This study employed bibliometric analysis to examine the research on glycosylation and CVDs, aiming to identify the evolution and hotspots in this field. Methods: A total of 1,441 publications published from 2010 January 1 to 2024 December 31 were extracted from the Web of Science Core Collection. The analysis included a visual and descriptive examination of publication trends, countries/regions, institutions, keywords, and references. Results: The United States is the most productive country/region in this field, followed closely by China. The University of Alabama at Birmingham has made the most important contribution to this area. Key research hotspots include \"O-GlcNAcylation\", \"biomarkers\", \"angiogenesis\", \"α-dystroglycan\", \"potassium channel\", \"heart failure\", \"gene expression\", \"glycosylation\", and \"cardiac glycosides\". Conclusion: Research on glycosylation in CVDs has shown a steady increase in recent years. Among these studies, O-GlcNAcylation plays a pivotal role in this field. This comprehensive bibliometric analysis of glycosylation and CVDs provides researchers with valuable, objective insights to support future investigations.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"6 ","pages":"0409"},"PeriodicalIF":0.0,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12864656/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146121406","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Role of Limited Emotion Regulation Strategies on Nonsuicidal Self-injury and Suicide Attempts among Chinese Adolescents: A Network Analysis Based on Jiangxi Province. 有限情绪调节策略对中国青少年非自杀性自伤和自杀企图的影响：基于江西省的网络分析

Health data science

Pub Date : 2026-02-03 eCollection Date: 2026-01-01 DOI: 10.34133/hds.0195

Hao Xu, Xuejing Xu, Yan Chen, Wei Qu, Yunlong Tan, Zhiren Wang, Yanli Zhao, Shuping Tan, Dianying Liu

Background: Nonsuicidal self-injuries (NSSIs) are an important contributing factor to adolescent suicide, and various shared factors influence the risk of both NSSIs and suicide attempts (SAs). Both are important predictors of suicide and are part of a continuum of suicidal behaviors. Further exploration of the relationship between adolescent NSSI and SA may facilitate suicide prevention efforts. Methods: An online survey was conducted among 9,140 participants. Network analysis methods were used to explore expected influence (EI), bridge expected influence (BEI), edge weights, and differences between adolescents that have and have not attempted suicide (NSSI-SA and NSSI-NoSA, respectively). Results: Of the 9,140 participants, 7,030 completed the questionnaire, yielding a participation rate of 76.91%. Participants with at least one NSSI were retained, with 2,496 (35.50%) included in the network analysis. The strongest EI node for both networks was "emotion regulation strategies" (E = 1.389 and 1.393), and that for BEI was "personal distress" (Interpersonal Reactivity Index-personal distress; E = 0.497 and 0.492). Network comparisons revealed significant differences in NSSI 4 ("intentionally hitting walls, tables, and other hard objects"; E _(Δ) = -0.384, P < 0.001), significant differences in BEI with regard to "perspective taking" (Interpersonal Reactivity Index-perspective taking; E _(Δ) = -0.215, P < 0.001), and significant differences in edge weights between NSSI 4 and NSSI 5 ("intentionally hurting oneself by hitting with a fist, palm, or hard object"; E _(Δr) = -0.173, P < 0.001). Conclusions: Our study suggests that interventions in the form of emotion regulation strategies can alleviate symptoms throughout the entire network. Attention should be paid to instances when NSSI 4 and NSSI 5 behaviors co-occur frequently.

背景：非自杀性自伤（nssi）是青少年自杀的重要因素，多种共同因素影响着自伤和自杀企图的发生。两者都是自杀的重要预测因素，也是自杀行为连续体的一部分。进一步探讨青少年自伤和SA之间的关系可能有助于预防自杀的努力。方法：对9140人进行在线调查。本研究采用网络分析方法探讨自杀未遂和未自杀未遂青少年（分别为nsi - sa和nsi - nosa）之间的预期影响（EI）、桥式预期影响（BEI）、边缘权重和差异。结果：9140名参与者中，有7030人完成了问卷调查，参与率为76.91%。至少有一次自伤的参与者被保留，2496人（35.50%）被纳入网络分析。两种网络的EI节点均以“情绪调节策略”节点最强（E = 1.389和1.393），BEI的EI节点均以“个人苦恼”节点最强（E = 0.497和0.492）。网络比较NSSI 4中显示显著差异(“故意撞击墙壁、表和其他对象”;E(Δ)= -0.384,P < 0.001),显著差异在贝关于“视角”(人际反应Index-perspective服用;E(Δ)= -0.215,P < 0.001),显著差异在NSSI 4和NSSI 5之间的边的权值(“故意伤害自己用拳头打,棕榈,或硬物体”;E(Δr) = -0.173, P < 0.001)。结论：我们的研究表明情绪调节策略的干预可以缓解整个网络的症状。应注意自伤4和自伤5行为经常同时发生的情况。

{"title":"The Role of Limited Emotion Regulation Strategies on Nonsuicidal Self-injury and Suicide Attempts among Chinese Adolescents: A Network Analysis Based on Jiangxi Province.","authors":"Hao Xu, Xuejing Xu, Yan Chen, Wei Qu, Yunlong Tan, Zhiren Wang, Yanli Zhao, Shuping Tan, Dianying Liu","doi":"10.34133/hds.0195","DOIUrl":"10.34133/hds.0195","url":null,"abstract":"Background: Nonsuicidal self-injuries (NSSIs) are an important contributing factor to adolescent suicide, and various shared factors influence the risk of both NSSIs and suicide attempts (SAs). Both are important predictors of suicide and are part of a continuum of suicidal behaviors. Further exploration of the relationship between adolescent NSSI and SA may facilitate suicide prevention efforts. Methods: An online survey was conducted among 9,140 participants. Network analysis methods were used to explore expected influence (EI), bridge expected influence (BEI), edge weights, and differences between adolescents that have and have not attempted suicide (NSSI-SA and NSSI-NoSA, respectively). Results: Of the 9,140 participants, 7,030 completed the questionnaire, yielding a participation rate of 76.91%. Participants with at least one NSSI were retained, with 2,496 (35.50%) included in the network analysis. The strongest EI node for both networks was \"emotion regulation strategies\" (E = 1.389 and 1.393), and that for BEI was \"personal distress\" (Interpersonal Reactivity Index-personal distress; E = 0.497 and 0.492). Network comparisons revealed significant differences in NSSI 4 (\"intentionally hitting walls, tables, and other hard objects\"; E (Δ) = -0.384, P < 0.001), significant differences in BEI with regard to \"perspective taking\" (Interpersonal Reactivity Index-perspective taking; E (Δ) = -0.215, P < 0.001), and significant differences in edge weights between NSSI 4 and NSSI 5 (\"intentionally hurting oneself by hitting with a fist, palm, or hard object\"; E (Δr) = -0.173, P < 0.001). Conclusions: Our study suggests that interventions in the form of emotion regulation strategies can alleviate symptoms throughout the entire network. Attention should be paid to instances when NSSI 4 and NSSI 5 behaviors co-occur frequently.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"6 ","pages":"0195"},"PeriodicalIF":0.0,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12864651/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146121346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Response to "Toward Audit-Ready Standards for Synthetic Cohorts in Health Decision Science". 对“面向卫生决策科学合成队列的可审计标准”的回应。

Health data science

Pub Date : 2026-01-23 eCollection Date: 2026-01-01 DOI: 10.34133/hds.0415

Gregorio Ferreira, Jacopo Amidei, Ruben Nieto, Andreas Kaltenbrunner

引用次数: 0

Toward Audit-Ready Standards for Synthetic Cohorts in Health Decision Science. 健康决策科学合成队列可审计标准的探讨。

Health data science

Pub Date : 2026-01-23 eCollection Date: 2026-01-01 DOI: 10.34133/hds.0414

Mulavagili Vijayasimha

引用次数: 0

Explainable Mortality Prediction for Liver Transplant Candidates with Hepatocellular Carcinoma: A Supervised Clustering Approach. 肝细胞癌肝移植候选人可解释的死亡率预测：一种监督聚类方法。

Health data science

Pub Date : 2026-01-13 eCollection Date: 2026-01-01 DOI: 10.34133/hds.0295

Abdelghani Halimi, Nesma Houmani, Sonia Garcia-Salicetti, Ilias Kounis, Audrey Coilly

Background: Accurate mortality prediction for liver transplant candidates with hepatocellular carcinoma (HCC) remains a critical challenge. Traditional scoring systems, including Child-Pugh, Albumin-Bilirubin, Model for End-Stage Liver Disease (MELD), MELD-Na, MELD 3.0, and Alpha-fetoprotein scores, are widely used but often fail to provide precise risk assessments. This limitation arises from the dual burden of liver dysfunction and tumor progression, which complicates prognosis. Consequently, there is a need for a comprehensive approach addressing both considerations to better manage HCC patients. Methods: We propose an advanced machine learning-based scoring system exploiting Ensemble Learning and SHapley Additive exPlanations (SHAP) for a better understanding of key mortality risk factors. SHAP offers valuable insights into the decision-making process by providing both global and local explanations. By embedding SHAP values in the Uniform Manifold Approximation and Projection space, we perform supervised clustering to infer latent subgroups, providing a higher granularity on the contribution of key variables for mortality risk assessment. Results: Our system based on LightGBM outperforms conventional scores leveraging only 8 relevant variables selected by SHAP analysis. These variables respond to the challenging dual risk problem set in this work. With supervised clustering, we uncover 7 subgroups showing an increasing mortality risk level and a fine assessment of risk factors' contribution. Conclusion: By contrast to existing studies, our approach offers an integrative data-driven framework for handling the dual risk challenge set by HCC patients with liver dysfunction. Also, it provides a valuable tool for a more precise risk evaluation that may guide treatment decisions and help monitoring patient progression.

背景：准确预测肝移植候选者肝细胞癌（HCC）的死亡率仍然是一个关键的挑战。传统的评分系统，包括Child-Pugh、白蛋白-胆红素、终末期肝病模型（MELD）、MELD- na、MELD 3.0和甲胎蛋白评分，被广泛使用，但往往不能提供精确的风险评估。这种限制来自肝功能障碍和肿瘤进展的双重负担，这使预后复杂化。因此，需要一种综合的方法来解决这两个问题，以更好地管理HCC患者。方法：我们提出了一个先进的基于机器学习的评分系统，利用集成学习和SHapley加性解释（SHAP）来更好地理解关键的死亡风险因素。SHAP通过提供全球和当地的解释，为决策过程提供了有价值的见解。通过在均匀流形逼近和投影空间中嵌入SHAP值，我们执行监督聚类来推断潜在子群，为死亡率风险评估提供更高粒度的关键变量贡献。结果：基于LightGBM的系统优于传统评分，仅利用SHAP分析选择的8个相关变量。这些变量响应本工作中具有挑战性的双重风险问题集。通过监督聚类，我们发现了7个显示死亡风险水平增加的亚组，并对风险因素的贡献进行了精细评估。结论：与现有研究相比，我们的方法提供了一个综合数据驱动的框架，用于处理肝细胞癌合并肝功能障碍患者的双重风险挑战。此外，它为更精确的风险评估提供了有价值的工具，可以指导治疗决策并帮助监测患者的进展。

{"title":"Explainable Mortality Prediction for Liver Transplant Candidates with Hepatocellular Carcinoma: A Supervised Clustering Approach.","authors":"Abdelghani Halimi, Nesma Houmani, Sonia Garcia-Salicetti, Ilias Kounis, Audrey Coilly","doi":"10.34133/hds.0295","DOIUrl":"10.34133/hds.0295","url":null,"abstract":"Background: Accurate mortality prediction for liver transplant candidates with hepatocellular carcinoma (HCC) remains a critical challenge. Traditional scoring systems, including Child-Pugh, Albumin-Bilirubin, Model for End-Stage Liver Disease (MELD), MELD-Na, MELD 3.0, and Alpha-fetoprotein scores, are widely used but often fail to provide precise risk assessments. This limitation arises from the dual burden of liver dysfunction and tumor progression, which complicates prognosis. Consequently, there is a need for a comprehensive approach addressing both considerations to better manage HCC patients. Methods: We propose an advanced machine learning-based scoring system exploiting Ensemble Learning and SHapley Additive exPlanations (SHAP) for a better understanding of key mortality risk factors. SHAP offers valuable insights into the decision-making process by providing both global and local explanations. By embedding SHAP values in the Uniform Manifold Approximation and Projection space, we perform supervised clustering to infer latent subgroups, providing a higher granularity on the contribution of key variables for mortality risk assessment. Results: Our system based on LightGBM outperforms conventional scores leveraging only 8 relevant variables selected by SHAP analysis. These variables respond to the challenging dual risk problem set in this work. With supervised clustering, we uncover 7 subgroups showing an increasing mortality risk level and a fine assessment of risk factors' contribution. Conclusion: By contrast to existing studies, our approach offers an integrative data-driven framework for handling the dual risk challenge set by HCC patients with liver dysfunction. Also, it provides a valuable tool for a more precise risk evaluation that may guide treatment decisions and help monitoring patient progression.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"6 ","pages":"0295"},"PeriodicalIF":0.0,"publicationDate":"2026-01-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12796095/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145971644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Response to "Enhancing the XGBoost Mortality Prediction Model for ICU Patients with Acute Ischemic Stroke". 对“加强ICU急性缺血性脑卒中患者XGBoost死亡率预测模型”的回应。

Health data science

Pub Date : 2025-12-09 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0394

Jack A Cummins, Ben S Gerber, Mayuko Ito Fukunaga, Nils Henninger, Catarina I Kiefe, Feifan Liu

引用次数: 0

Enhancing the XGBoost Mortality Prediction Model for ICU Patients with Acute Ischemic Stroke. 重症监护病房急性缺血性脑卒中患者XGBoost死亡率预测模型的改进

Health data science

Pub Date : 2025-12-09 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0371

Jing Sun, Chang Meng, Guobin Miao

引用次数: 0

Projected Prevalence and Economic Burden of Alzheimer's Disease and Related Dementias in China: Regional Disparities and Policy Implications. 中国阿尔茨海默病和相关痴呆的预测患病率和经济负担：地区差异和政策启示

Health data science

Pub Date : 2025-11-21 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0377

Xinyi Liu, Simiao Chen, Donglan Zhang, Yian Gu, Gang Li, Bei Wu, José A Pagán, Luxia Zhang, Yan Li

Background: China has the largest population with Alzheimer's disease and related dementias (ADRDs) globally, and rapid population aging is expected to drive a substantial increase in cases. This study projects ADRD prevalence and associated economic burdens across provinces in China from 2025 to 2060. Methods: Using data from the China Health and Retirement Longitudinal Study (CHARLS) supplemented by national demographic and provincial statistics, we projected the prevalence and care costs of ADRD for each of the 31 provinces in China from 2025 to 2060. Cost projections included formal care expenses and informal caregiving valued through replacement cost methods. We conducted uncertainty analysis to provide robust estimates for ADRD prevalence and costs. Results: By 2060, ADRD cases in China are projected to reach approximately 49.89 million, with the highest prevalence and economic burden concentrated in provinces such as Shandong, Sichuan, Jiangsu, Henan, and Guangdong. Formal care costs alone are expected to exceed $1 trillion annually, while the total economic value-including informal caregiving-could surpass $5 trillion. Geographic disparities highlight that Eastern and Central regions, with a higher proportions of older adults, will bear disproportionate costs. Informal caregiving is projected to constitute 60% to 80% of total ADRD-related costs. Conclusion: China faces an unprecedented rise in ADRD-related economic burden over the next 4 decades, with substantial regional disparities. Strengthening long-term care infrastructure, expanding financial and social support for caregivers, and implementing regionally tailored healthy aging policies are essential to ensuring equitable and sustainable ADRD care across China.

背景：中国是全球阿尔茨海默病及相关痴呆（adrd）患者最多的国家，人口快速老龄化预计将推动病例大幅增加。本研究预测了2025年至2060年中国各省ADRD患病率和相关经济负担。方法：利用中国健康与退休纵向研究（CHARLS）的数据，辅以国家人口统计和省级统计数据，我们预测了2025年至2060年中国31个省份ADRD的患病率和护理成本。成本预测包括通过重置成本法评估的正式护理费用和非正式护理费用。我们进行了不确定性分析，以提供对ADRD患病率和成本的可靠估计。结果：到2060年，中国ADRD病例预计将达到约4989万例，发病率和经济负担最高的省份集中在山东、四川、江苏、河南和广东等省份。预计每年仅正规护理的费用就将超过1万亿美元，而包括非正规护理在内的总经济价值可能超过5万亿美元。地域差异突出表明，老年人比例较高的东部和中部地区将承担不成比例的费用。非正式护理预计将占与adrd相关的总费用的60%至80%。结论：未来40年，中国将面临与adrd相关的前所未有的经济负担，且存在显著的地区差异。加强长期护理基础设施，扩大对护理人员的财政和社会支持，并实施适合地区的健康老龄化政策，对于确保中国各地公平和可持续的ADRD护理至关重要。

{"title":"Projected Prevalence and Economic Burden of Alzheimer's Disease and Related Dementias in China: Regional Disparities and Policy Implications.","authors":"Xinyi Liu, Simiao Chen, Donglan Zhang, Yian Gu, Gang Li, Bei Wu, José A Pagán, Luxia Zhang, Yan Li","doi":"10.34133/hds.0377","DOIUrl":"10.34133/hds.0377","url":null,"abstract":"Background: China has the largest population with Alzheimer's disease and related dementias (ADRDs) globally, and rapid population aging is expected to drive a substantial increase in cases. This study projects ADRD prevalence and associated economic burdens across provinces in China from 2025 to 2060. Methods: Using data from the China Health and Retirement Longitudinal Study (CHARLS) supplemented by national demographic and provincial statistics, we projected the prevalence and care costs of ADRD for each of the 31 provinces in China from 2025 to 2060. Cost projections included formal care expenses and informal caregiving valued through replacement cost methods. We conducted uncertainty analysis to provide robust estimates for ADRD prevalence and costs. Results: By 2060, ADRD cases in China are projected to reach approximately 49.89 million, with the highest prevalence and economic burden concentrated in provinces such as Shandong, Sichuan, Jiangsu, Henan, and Guangdong. Formal care costs alone are expected to exceed $1 trillion annually, while the total economic value-including informal caregiving-could surpass $5 trillion. Geographic disparities highlight that Eastern and Central regions, with a higher proportions of older adults, will bear disproportionate costs. Informal caregiving is projected to constitute 60% to 80% of total ADRD-related costs. Conclusion: China faces an unprecedented rise in ADRD-related economic burden over the next 4 decades, with substantial regional disparities. Strengthening long-term care infrastructure, expanding financial and social support for caregivers, and implementing regionally tailored healthy aging policies are essential to ensuring equitable and sustainable ADRD care across China.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0377"},"PeriodicalIF":0.0,"publicationDate":"2025-11-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12635023/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145590038","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Response to "Anticancer Drug Approval in China: From Acceleration of Access to Certainty of Benefits". 对“中国抗癌药物审批：从加速获得到确定获益”的回应。

Health data science

Pub Date : 2025-10-28 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0337

Lixia Fu, Yimin Cui

引用次数: 0

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Health data science

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀