Health data science最新文献_第4页

Prevalence and Risk Factors of Type 2 Diabetes Mellitus among Depression Inpatients from 2005 to 2018 in Beijing, China. 2005 - 2018年北京市抑郁症住院患者2型糖尿病患病率及危险因素分析

Health data science

Pub Date : 2025-03-05 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0111

Peng Gao, Fude Yang, Qiuyue Ma, Botao Ma, Wenzhan Jing, Jue Liu, Moning Guo, Juan Li, Zhiren Wang, Min Liu

Background: There are few data on the comorbidity of diabetes in Chinese patients with depression. We aimed to calculate the prevalence and explore risk factors of type 2 diabetes mellitus (T2DM) among depression inpatients from 2005 to 2018 in Beijing. Methods: This study is a cross-sectional study. The data collected from 19 specialized psychiatric hospitals in Beijing were analyzed. The prevalence of T2DM and its distribution were analyzed. The multivariable logistic regression was performed to explore the risk factors of T2DM. Results: A total of 20,899 depression inpatients were included. The prevalence of T2DM was 9.13% [95% confidence interval (CI), 8.74% to 9.52%]. The prevalence of T2DM showed an upward trend with year (P for trend < 0.001) and age (P for trend < 0.001). The prevalence of T2DM was higher among readmitted patients (12.97%) and patients with comorbid hypertension (26.16%), hyperlipidemia (21.28%), and nonalcoholic fatty liver disease (NAFLD) (18.85%). The prevalence of T2DM in females was lower than in males among patients aged 18 to 59 years, while the prevalence of T2DM in females was higher than in males among patients aged ≥60 years. T2DM was associated with older age [adjusted odds ratios (aORs) ranged from 3.68 to 29.95, P < 0.001], hypertension (aOR, 3.01; 95% CI, 2.70 to 3.35; P < 0.001), hyperlipidemia (aOR, 1.69; 95% CI, 1.50 to 1.91; P < 0.001), and NAFLD (aOR, 1.58; 95% CI, 1.37 to 1.82; P < 0.001). Conclusions: The prevalence of T2DM among depression inpatients from 2005 to 2018 in Beijing was high and increased with the year. Depression inpatients who were older and with hypertension, hyperlipidemia, and NAFLD had a higher prevalence and risk of T2DM.

背景：关于中国抑郁症患者糖尿病合并症的资料很少。我们的目的是计算2005 - 2018年北京市抑郁症住院患者中2型糖尿病（T2DM）的患病率并探讨其危险因素。方法：本研究为横断面研究。对北京市19家精神病专科医院的数据进行分析。分析2型糖尿病的患病率及分布。采用多变量logistic回归分析T2DM的危险因素。结果：共纳入抑郁症住院患者20899例。T2DM患病率为9.13%[95%可信区间（CI）， 8.74% ~ 9.52%]。T2DM患病率随年龄（P < 0.001）和年龄（P < 0.001）呈上升趋势。T2DM的患病率在再入院患者（12.97%）和合并高血压（26.16%）、高脂血症（21.28%）和非酒精性脂肪性肝病（NAFLD）（18.85%）的患者中较高。在18 ~ 59岁的患者中，女性T2DM患病率低于男性，而在≥60岁的患者中，女性T2DM患病率高于男性。T2DM与老年、高血压(aOR, 3.01；95% CI, 2.70 ~ 3.35；P < 0.001)，高脂血症(aOR, 1.69；95% CI, 1.50 ~ 1.91；P < 0.001), NAFLD (aOR, 1.58；95% CI, 1.37 ~ 1.82；P < 0.001)。结论：2005 - 2018年北京市抑郁症住院患者中T2DM患病率较高，且呈逐年上升趋势。年龄较大且伴有高血压、高脂血症和NAFLD的抑郁症住院患者有较高的T2DM患病率和风险。

{"title":"Prevalence and Risk Factors of Type 2 Diabetes Mellitus among Depression Inpatients from 2005 to 2018 in Beijing, China.","authors":"Peng Gao, Fude Yang, Qiuyue Ma, Botao Ma, Wenzhan Jing, Jue Liu, Moning Guo, Juan Li, Zhiren Wang, Min Liu","doi":"10.34133/hds.0111","DOIUrl":"10.34133/hds.0111","url":null,"abstract":"Background: There are few data on the comorbidity of diabetes in Chinese patients with depression. We aimed to calculate the prevalence and explore risk factors of type 2 diabetes mellitus (T2DM) among depression inpatients from 2005 to 2018 in Beijing. Methods: This study is a cross-sectional study. The data collected from 19 specialized psychiatric hospitals in Beijing were analyzed. The prevalence of T2DM and its distribution were analyzed. The multivariable logistic regression was performed to explore the risk factors of T2DM. Results: A total of 20,899 depression inpatients were included. The prevalence of T2DM was 9.13% [95% confidence interval (CI), 8.74% to 9.52%]. The prevalence of T2DM showed an upward trend with year (P for trend < 0.001) and age (P for trend < 0.001). The prevalence of T2DM was higher among readmitted patients (12.97%) and patients with comorbid hypertension (26.16%), hyperlipidemia (21.28%), and nonalcoholic fatty liver disease (NAFLD) (18.85%). The prevalence of T2DM in females was lower than in males among patients aged 18 to 59 years, while the prevalence of T2DM in females was higher than in males among patients aged ≥60 years. T2DM was associated with older age [adjusted odds ratios (aORs) ranged from 3.68 to 29.95, P < 0.001], hypertension (aOR, 3.01; 95% CI, 2.70 to 3.35; P < 0.001), hyperlipidemia (aOR, 1.69; 95% CI, 1.50 to 1.91; P < 0.001), and NAFLD (aOR, 1.58; 95% CI, 1.37 to 1.82; P < 0.001). Conclusions: The prevalence of T2DM among depression inpatients from 2005 to 2018 in Beijing was high and increased with the year. Depression inpatients who were older and with hypertension, hyperlipidemia, and NAFLD had a higher prevalence and risk of T2DM.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0111"},"PeriodicalIF":0.0,"publicationDate":"2025-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11880573/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143569083","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Caring for the "Osteo-Cardiovascular Faller": Associations between Multimorbidity and Fall Transitions among Middle-Aged and Older Chinese. 照顾“骨-心血管患者”：中国中老年人群多病与跌倒过渡之间的关系。

Health data science

Pub Date : 2025-02-19 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0151

Mingzhi Yu, Longbing Ren, Rui Yang, Yuling Jiang, Shijie Cui, Jingjing Wang, Shaojie Li, Yang Hu, Zhouwei Liu, Yifei Wu, Gongzi Zhang, Ye Peng, Lihai Zhang, Yao Yao

Background: It is still uncertain how multimorbidity patterns affect transitions between fall states among middle-aged and older Chinese. Methods: Data were obtained from China Health and Retirement Longitudinal Study (CHARLS) 2011-2018. We utilized latent class analysis to categorize baseline multimorbidity patterns, Markov multi-state model to explore the impact of multimorbidity characterized by condition counts and multimorbidity patterns on subsequent fall transitions, and Cox proportional hazard models to assess hazard ratios of each transition. Results: A total of 14,244 participants aged 45 years and older were enrolled at baseline. Among these participants, 11,956 (83.9%) did not have a fall history in the last 2 years, 1,054 (7.4%) had mild falls, and 1,234 (8.7%) had severe falls. Using a multi-state model, 10,967 transitions were observed during a total follow-up of 57,094 person-times, 6,527 of which had worsening transitions and 4,440 had improving transitions. Among 6,711 multimorbid participants, osteo-cardiovascular (20.5%), pulmonary-digestive-rheumatic (30.5%), metabolic-cardiovascular (22.9%), and neuropsychiatric-sensory (26.1%) patterns were classified. Multimorbid participants had significantly higher risks of transitions compared with other participants. Among 4 multimorbidity patterns, osteo-cardiovascular pattern had higher transition risks than other 3 patterns. Conclusions: Multimorbidity, especially the "osteo-cardiovascular pattern" identified in this study, was associated with higher risks of fall transitions among middle-aged and older Chinese. Generally, the effect of multimorbidity is more significant in older adults than in middle-aged adults. Findings from this study provide facts and evidence for fall prevention, and offer implications for clinicians to target on vulnerable population, and for public health policymakers to allocate healthcare resources.

背景：多病模式如何影响中国中老年人在跌倒状态之间的转换，目前尚不清楚。研究方法数据来自 2011-2018 年中国健康与退休纵向研究（CHARLS）。我们利用潜类分析对基线多病模式进行分类，利用马尔可夫多状态模型探讨以病情计数和多病模式为特征的多病对随后跌倒转换的影响，并利用 Cox 比例危险模型评估每种转换的危险比。研究结果共有 14244 名 45 岁及以上的参与者参与了基线研究。在这些参与者中，11956 人（83.9%）在过去两年中没有跌倒史，1054 人（7.4%）有轻微跌倒，1234 人（8.7%）有严重跌倒。使用多状态模型，在总计 57094 人次的随访过程中观察到 10967 次转变，其中 6527 次恶化转变，4440 次改善转变。在 6711 名多病参与者中，分为骨-心血管（20.5%）、肺-消化-风湿（30.5%）、代谢-心血管（22.9%）和神经-精神-感官（26.1%）模式。与其他参与者相比，多病参与者的转院风险明显更高。在 4 种多病模式中，骨-心血管模式的过渡风险高于其他 3 种模式。结论多病，尤其是本研究中发现的 "骨-心血管模式"，与中老年中国人较高的跌倒转归风险相关。一般来说，多病对老年人的影响比对中年人的影响更大。本研究的结果为预防跌倒提供了事实和证据，并为临床医生针对弱势人群和公共卫生决策者分配医疗资源提供了启示。

{"title":"Caring for the \"Osteo-Cardiovascular Faller\": Associations between Multimorbidity and Fall Transitions among Middle-Aged and Older Chinese.","authors":"Mingzhi Yu, Longbing Ren, Rui Yang, Yuling Jiang, Shijie Cui, Jingjing Wang, Shaojie Li, Yang Hu, Zhouwei Liu, Yifei Wu, Gongzi Zhang, Ye Peng, Lihai Zhang, Yao Yao","doi":"10.34133/hds.0151","DOIUrl":"10.34133/hds.0151","url":null,"abstract":"Background: It is still uncertain how multimorbidity patterns affect transitions between fall states among middle-aged and older Chinese. Methods: Data were obtained from China Health and Retirement Longitudinal Study (CHARLS) 2011-2018. We utilized latent class analysis to categorize baseline multimorbidity patterns, Markov multi-state model to explore the impact of multimorbidity characterized by condition counts and multimorbidity patterns on subsequent fall transitions, and Cox proportional hazard models to assess hazard ratios of each transition. Results: A total of 14,244 participants aged 45 years and older were enrolled at baseline. Among these participants, 11,956 (83.9%) did not have a fall history in the last 2 years, 1,054 (7.4%) had mild falls, and 1,234 (8.7%) had severe falls. Using a multi-state model, 10,967 transitions were observed during a total follow-up of 57,094 person-times, 6,527 of which had worsening transitions and 4,440 had improving transitions. Among 6,711 multimorbid participants, osteo-cardiovascular (20.5%), pulmonary-digestive-rheumatic (30.5%), metabolic-cardiovascular (22.9%), and neuropsychiatric-sensory (26.1%) patterns were classified. Multimorbid participants had significantly higher risks of transitions compared with other participants. Among 4 multimorbidity patterns, osteo-cardiovascular pattern had higher transition risks than other 3 patterns. Conclusions: Multimorbidity, especially the \"osteo-cardiovascular pattern\" identified in this study, was associated with higher risks of fall transitions among middle-aged and older Chinese. Generally, the effect of multimorbidity is more significant in older adults than in middle-aged adults. Findings from this study provide facts and evidence for fall prevention, and offer implications for clinicians to target on vulnerable population, and for public health policymakers to allocate healthcare resources.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0151"},"PeriodicalIF":0.0,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11836196/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143460972","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

ECG-LM: Understanding Electrocardiogram with a Large Language Model. ECG-LM：用大语言模型理解心电图。

Health data science

Pub Date : 2025-02-04 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0221

Kai Yang, Massimo Hong, Jiahuan Zhang, Yizhen Luo, Suyuan Zhao, Ou Zhang, Xiaomao Yu, Jiawen Zhou, Liuqing Yang, Ping Zhang, Mu Qiao, Zaiqing Nie

Background: The electrocardiogram (ECG) is a valuable, noninvasive tool for monitoring heart-related conditions, providing critical insights. However, the interpretation of ECG data alongside patient information demands substantial medical expertise and resources. While deep learning methods help streamline this process, they often fall short in integrating patient data with ECG readings and do not provide the nuanced clinical suggestions and insights necessary for accurate diagnosis. Methods: Although recent advancements in multi-modal large language modeling have propelled their application scope beyond the natural language processing domain, their applicability to ECG processing remains largely unexplored, partly due to the lack of text-ECG data. To this end, we develop ECG-Language Model (ECG-LM), the first multi-modal large language model able to process natural language and understand ECG signals. The model employs a specialized ECG encoder that transforms raw ECG signals into a high-dimensional feature space, which is then aligned with the textual feature space derived from the large language model. To address the scarcity of text-ECG data, we generated text-ECG pairs by leveraging detailed ECG pattern descriptions from medical guidelines, creating a robust dataset for pre-training ECG-LM. Additionally, we fine-tune ECG-LM with public clinical conversation datasets and build an additional supervised fine-tuning dataset based on real clinical data from the hospital, aiming to provide a more comprehensive and customized user experience. Results: ECG-LM outperforms existing few-shot and zero-shot solutions in cardiovascular disease detection across all 3 tasks (diagnostic, rhythm, and form) while also demonstrating strong potential in ECG-related question answering. Conclusions: The results across various tasks demonstrate that ECG-LM effectively captures the intricate features of ECGs, showcasing its versatility in applications such as disease prediction and advanced question answering.

背景：心电图（ECG）是一种有价值的、无创的监测心脏相关疾病的工具，提供了重要的见解。然而，心电图数据和患者信息的解释需要大量的医学专业知识和资源。虽然深度学习方法有助于简化这一过程，但它们在将患者数据与ECG读数整合方面往往存在不足，并且不能提供准确诊断所需的细致入微的临床建议和见解。方法：尽管近年来多模态大语言建模的进展使其应用范围超出了自然语言处理领域，但由于缺乏文本-心电数据，其在心电处理中的适用性在很大程度上仍未得到探索。为此，我们开发了ECG-语言模型（ECG- lm），这是第一个能够处理自然语言并理解心电信号的多模态大型语言模型。该模型采用专门的心电编码器，将原始心电信号转换为高维特征空间，然后与大语言模型导出的文本特征空间对齐。为了解决文本-ECG数据的稀缺性，我们利用医疗指南中详细的ECG模式描述生成了文本-ECG对，创建了一个用于预训练ECG- lm的鲁棒数据集。此外，我们使用公开的临床会话数据集对ECG-LM进行微调，并基于医院的真实临床数据构建额外的监督微调数据集，旨在提供更全面和定制的用户体验。结果：ECG-LM在心血管疾病检测的所有3个任务（诊断、节律和形式）中都优于现有的少射和零射解决方案，同时在ecg相关的问题回答中也显示出强大的潜力。结论：各种任务的结果表明，ECG-LM有效地捕获了心电图的复杂特征，展示了其在疾病预测和高级问题回答等应用中的多功能性。

{"title":"ECG-LM: Understanding Electrocardiogram with a Large Language Model.","authors":"Kai Yang, Massimo Hong, Jiahuan Zhang, Yizhen Luo, Suyuan Zhao, Ou Zhang, Xiaomao Yu, Jiawen Zhou, Liuqing Yang, Ping Zhang, Mu Qiao, Zaiqing Nie","doi":"10.34133/hds.0221","DOIUrl":"10.34133/hds.0221","url":null,"abstract":"Background: The electrocardiogram (ECG) is a valuable, noninvasive tool for monitoring heart-related conditions, providing critical insights. However, the interpretation of ECG data alongside patient information demands substantial medical expertise and resources. While deep learning methods help streamline this process, they often fall short in integrating patient data with ECG readings and do not provide the nuanced clinical suggestions and insights necessary for accurate diagnosis. Methods: Although recent advancements in multi-modal large language modeling have propelled their application scope beyond the natural language processing domain, their applicability to ECG processing remains largely unexplored, partly due to the lack of text-ECG data. To this end, we develop ECG-Language Model (ECG-LM), the first multi-modal large language model able to process natural language and understand ECG signals. The model employs a specialized ECG encoder that transforms raw ECG signals into a high-dimensional feature space, which is then aligned with the textual feature space derived from the large language model. To address the scarcity of text-ECG data, we generated text-ECG pairs by leveraging detailed ECG pattern descriptions from medical guidelines, creating a robust dataset for pre-training ECG-LM. Additionally, we fine-tune ECG-LM with public clinical conversation datasets and build an additional supervised fine-tuning dataset based on real clinical data from the hospital, aiming to provide a more comprehensive and customized user experience. Results: ECG-LM outperforms existing few-shot and zero-shot solutions in cardiovascular disease detection across all 3 tasks (diagnostic, rhythm, and form) while also demonstrating strong potential in ECG-related question answering. Conclusions: The results across various tasks demonstrate that ECG-LM effectively captures the intricate features of ECGs, showcasing its versatility in applications such as disease prediction and advanced question answering.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0221"},"PeriodicalIF":0.0,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11791404/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"143191464","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

VP-SFDA: Visual Prompt Source-Free Domain Adaptation for Cross-Modal Medical Image. 跨模态医学图像的视觉提示无源域自适应。

Health data science

Pub Date : 2025-01-07 eCollection Date: 2025-01-01 DOI: 10.34133/hds.0143

Yixin Chen, Yan Wang, Zhaoheng Xie

Background: Source-free unsupervised domain adaptation (SFUDA) methods aim to address the challenge of domain shift while preserving data privacy. Existing SFUDA approaches construct reliable and confident pseudo-labels for target-domain data through denoising methods, thereby guiding the training of the target-domain model. The effectiveness of denoising approaches is influenced by the degree of domain gap between the source and target domains. A marked shift can cause the pseudo-labels to be unreliable, even after applying denoising. Methods: We propose a novel 2-stage framework for SFUDA called visual prompt source-free domain adaptation (VP-SFDA). We propose input-specific visual prompt in the first stage, prompting process, which bridges the target-domain data to source-domain distribution. Our method utilizes visual prompts and batch normalization constraint to enable the alignment model to learn domain-specific knowledge and align the target-domain data with the source-domain contribution. The second stage is the adaptation process, which aims at optimizing the segmentation model from the source domain to the target domain. This is accomplished through the denoising techniques, ultimately enhancing the performance. Results: Our study presents a comparative analysis of several SFUDA techniques in the VP-SFDA framework across 4 tasks: abdominal magnetic resonance imaging (MRI) to computed tomography (CT), abdominal CT to MRI, cardiac MRI to CT, and cardiac CT to MRI. Notably, in the abdominal MRI to CT adaptation task, the VP-OS method achieved a remarkable improvement, increasing the average DICE score from 0.658 to 0.773 (P $<$ 0.01) and reducing the average surface distance (ASD) from 3.489 to 2.961 (P $<$ 0.01). Similarly, the VP-LD and VP-DPL methods also showed significant improvements over their base algorithms in both abdominal and cardiac MRI to CT tasks. Conclusions: This paper proposes VP-SFDA, a novel 2-stage framework for SFUDA in medical imaging, which achieves superior performance through input-specific visual prompts and batch normalization constraint for domain adaptation, coupled with denoising methods for enhanced results. Comparative experiments on 4 medical SFUDA tasks demonstrate that VO-SFDA surpasses existing methods, with ablation studies confirming the benefits of domain-specific patterns.

背景：无源无监督域自适应（SFUDA）方法旨在解决域转移的挑战，同时保护数据隐私。现有的SFUDA方法通过去噪方法为目标域数据构建可靠、置信的伪标签，从而指导目标域模型的训练。降噪方法的有效性受源域和目标域之间的域间隙程度的影响。明显的偏移会导致伪标签不可靠，即使在应用去噪之后也是如此。方法：提出了一种新的两阶段SFUDA框架，称为视觉提示无源域自适应（VP-SFDA）。我们在第一阶段提出了特定于输入的视觉提示，即提示过程，它将目标域的数据与源域的分布联系起来。我们的方法利用可视化提示和批处理规范化约束，使对齐模型能够学习特定于领域的知识，并将目标领域的数据与源领域的贡献进行对齐。第二阶段是自适应过程，目的是从源域到目标域对分割模型进行优化。这是通过去噪技术来实现的，最终提高了性能。结果：我们的研究对VP-SFDA框架下的几种SFUDA技术进行了4项任务的比较分析：腹部磁共振成像（MRI）到计算机断层扫描（CT）、腹部CT到MRI、心脏MRI到CT和心脏CT到MRI。值得注意的是，在腹部MRI到CT的适应任务中，VP-OS方法取得了显著的改善，将DICE平均评分从0.658提高到0.773 (P 0.01)，将平均表面距离（ASD）从3.489降低到2.961 （P 0.01）。同样，VP-LD和VP-DPL方法在腹部和心脏MRI到CT任务中也比它们的基本算法有了显著的改进。结论：本文提出了一种新的两阶段医学成像SFUDA框架VP-SFDA，该框架通过输入特定的视觉提示和批量归一化约束进行领域自适应，并结合去噪方法增强结果，取得了优异的性能。4种医学SFUDA任务的对比实验表明，VO-SFDA优于现有方法，消融研究证实了特定领域模式的好处。

{"title":"VP-SFDA: Visual Prompt Source-Free Domain Adaptation for Cross-Modal Medical Image.","authors":"Yixin Chen, Yan Wang, Zhaoheng Xie","doi":"10.34133/hds.0143","DOIUrl":"10.34133/hds.0143","url":null,"abstract":"Background: Source-free unsupervised domain adaptation (SFUDA) methods aim to address the challenge of domain shift while preserving data privacy. Existing SFUDA approaches construct reliable and confident pseudo-labels for target-domain data through denoising methods, thereby guiding the training of the target-domain model. The effectiveness of denoising approaches is influenced by the degree of domain gap between the source and target domains. A marked shift can cause the pseudo-labels to be unreliable, even after applying denoising. Methods: We propose a novel 2-stage framework for SFUDA called visual prompt source-free domain adaptation (VP-SFDA). We propose input-specific visual prompt in the first stage, prompting process, which bridges the target-domain data to source-domain distribution. Our method utilizes visual prompts and batch normalization constraint to enable the alignment model to learn domain-specific knowledge and align the target-domain data with the source-domain contribution. The second stage is the adaptation process, which aims at optimizing the segmentation model from the source domain to the target domain. This is accomplished through the denoising techniques, ultimately enhancing the performance. Results: Our study presents a comparative analysis of several SFUDA techniques in the VP-SFDA framework across 4 tasks: abdominal magnetic resonance imaging (MRI) to computed tomography (CT), abdominal CT to MRI, cardiac MRI to CT, and cardiac CT to MRI. Notably, in the abdominal MRI to CT adaptation task, the VP-OS method achieved a remarkable improvement, increasing the average DICE score from 0.658 to 0.773 (P <math><mo><</mo></math> 0.01) and reducing the average surface distance (ASD) from 3.489 to 2.961 (P <math><mo><</mo></math> 0.01). Similarly, the VP-LD and VP-DPL methods also showed significant improvements over their base algorithms in both abdominal and cardiac MRI to CT tasks. Conclusions: This paper proposes VP-SFDA, a novel 2-stage framework for SFUDA in medical imaging, which achieves superior performance through input-specific visual prompts and batch normalization constraint for domain adaptation, coupled with denoising methods for enhanced results. Comparative experiments on 4 medical SFUDA tasks demonstrate that VO-SFDA surpasses existing methods, with ablation studies confirming the benefits of domain-specific patterns.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"5 ","pages":"0143"},"PeriodicalIF":0.0,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12063703/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144032095","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Multi-Modal CLIP-Informed Protein Editing. 多模态剪辑通知蛋白质编辑。

Health data science

Pub Date : 2024-12-19 eCollection Date: 2024-01-01 DOI: 10.34133/hds.0211

Mingze Yin, Hanjing Zhou, Yiheng Zhu, Miao Lin, Yixuan Wu, Jialu Wu, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou, Jintai Chen, Jian Wu

Background: Proteins govern most biological functions essential for life, and achieving controllable protein editing has made great advances in probing natural systems, creating therapeutic conjugates, and generating novel protein constructs. Recently, machine learning-assisted protein editing (MLPE) has shown promise in accelerating optimization cycles and reducing experimental workloads. However, current methods struggle with the vast combinatorial space of potential protein edits and cannot explicitly conduct protein editing using biotext instructions, limiting their interactivity with human feedback. Methods: To fill these gaps, we propose a novel method called ProtET for efficient CLIP-informed protein editing through multi-modality learning. Our approach comprises 2 stages: In the pretraining stage, contrastive learning aligns protein-biotext representations encoded by 2 large language models (LLMs). Subsequently, during the protein editing stage, the fused features from editing instruction texts and original protein sequences serve as the final editing condition for generating target protein sequences. Results: Comprehensive experiments demonstrated the superiority of ProtET in editing proteins to enhance human-expected functionality across multiple attribute domains, including enzyme catalytic activity, protein stability, and antibody-specific binding ability. ProtET improves the state-of-the-art results by a large margin, leading to substantial stability improvements of 16.67% and 16.90%. Conclusions: This capability positions ProtET to advance real-world artificial protein editing, potentially addressing unmet academic, industrial, and clinical needs.

背景：蛋白质控制着生命所必需的大多数生物功能，实现可控的蛋白质编辑在探测自然系统、创造治疗偶联物和产生新的蛋白质结构方面取得了巨大进展。最近，机器学习辅助蛋白质编辑（MLPE）在加速优化周期和减少实验工作量方面显示出了希望。然而，目前的方法与潜在的蛋白质编辑的巨大组合空间作斗争，并且不能使用生物文本指令明确地进行蛋白质编辑，限制了它们与人类反馈的交互性。方法：为了填补这些空白，我们提出了一种名为ProtET的新方法，通过多模态学习对clip进行有效的蛋白质编辑。我们的方法包括两个阶段：在预训练阶段，对比学习对齐由两个大型语言模型（llm）编码的蛋白质-生物文本表示。随后，在蛋白质编辑阶段，编辑指令文本与原始蛋白质序列的融合特征作为生成目标蛋白质序列的最终编辑条件。结果：综合实验证明了ProtET在编辑蛋白质方面的优势，可以增强人类期望的跨多个属性域的功能，包括酶催化活性、蛋白质稳定性和抗体特异性结合能力。ProtET在很大程度上提高了最先进的结果，导致稳定性提高了16.67%和16.90%。结论：这种能力使ProtET能够推进现实世界的人工蛋白质编辑，潜在地解决未满足的学术、工业和临床需求。

{"title":"Multi-Modal CLIP-Informed Protein Editing.","authors":"Mingze Yin, Hanjing Zhou, Yiheng Zhu, Miao Lin, Yixuan Wu, Jialu Wu, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou, Jintai Chen, Jian Wu","doi":"10.34133/hds.0211","DOIUrl":"10.34133/hds.0211","url":null,"abstract":"Background: Proteins govern most biological functions essential for life, and achieving controllable protein editing has made great advances in probing natural systems, creating therapeutic conjugates, and generating novel protein constructs. Recently, machine learning-assisted protein editing (MLPE) has shown promise in accelerating optimization cycles and reducing experimental workloads. However, current methods struggle with the vast combinatorial space of potential protein edits and cannot explicitly conduct protein editing using biotext instructions, limiting their interactivity with human feedback. Methods: To fill these gaps, we propose a novel method called ProtET for efficient CLIP-informed protein editing through multi-modality learning. Our approach comprises 2 stages: In the pretraining stage, contrastive learning aligns protein-biotext representations encoded by 2 large language models (LLMs). Subsequently, during the protein editing stage, the fused features from editing instruction texts and original protein sequences serve as the final editing condition for generating target protein sequences. Results: Comprehensive experiments demonstrated the superiority of ProtET in editing proteins to enhance human-expected functionality across multiple attribute domains, including enzyme catalytic activity, protein stability, and antibody-specific binding ability. ProtET improves the state-of-the-art results by a large margin, leading to substantial stability improvements of 16.67% and 16.90%. Conclusions: This capability positions ProtET to advance real-world artificial protein editing, potentially addressing unmet academic, industrial, and clinical needs.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"4 ","pages":"0211"},"PeriodicalIF":0.0,"publicationDate":"2024-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11658819/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142866372","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The Burden of Type 2 Diabetes in Adolescents and Young Adults in China: A Secondary Analysis from the Global Burden of Disease Study 2021. 中国青少年 2 型糖尿病的负担：2021 年全球疾病负担研究的二次分析》。

Health data science

Pub Date : 2024-12-17 eCollection Date: 2024-01-01 DOI: 10.34133/hds.0210

Junting Yang, Siwei Deng, Houyu Zhao, Feng Sun, Xiaotong Zou, Linong Ji, Siyan Zhan

Background: Early-onset type 2 diabetes (T2D) is an increasingly serious public health issue, particularly in China. This study aimed to analyze the characteristics of disease burden, secular trend, and attributable risk factors of early-onset T2D in China. Methods: Using data from the Global Burden of Disease (GBD) 2021, we analyzed the age-standardized rate (ASR) of incidence, disability-adjusted life years (DALYs), and mortality rates of T2D among individuals aged 15 to 39 years in China from 1990 to 2021. Joinpoint regression analysis was employed to analyze secular trend, calculating the average annual percent change (AAPC). We also examined changes in the proportion of early-onset T2D within the total T2D burden and its attributable risk factors. Results: From 1990 to 2021, the ASR of incidence of early-onset T2D in China increased from 140.20 [95% uncertainty interval (UI): 89.14 to 204.74] to 315.97 (95% UI: 226.75 to 417.55) per 100,000, with an AAPC of 2.67% (95% CI: 2.60% to 2.75%, P < 0.001). DALYs rose from 116.29 (95% UI: 78.51 to 167.05) to 267.47 (95% UI: 171.08 to 387.38) per 100,000, with an AAPC of 2.75% (95% CI: 2.64% to 2.87%, P < 0.001). Mortality rates slightly decreased from 0.30 (95% UI: 0.24 to 0.38) to 0.28 (95% UI: 0.23 to 0.34) per 100,000, with an AAPC of -0.22% (95% CI: -0.33% to -0.11%, P < 0.001). The 15 to 19 years age group showed the fastest increase in incidence (AAPC: 4.08%, 95% CI: 3.93% to 4.29%, P < 0.001). The burden was consistently higher and increased more rapidly among males compared to females. The proportion of early-onset T2D within the total T2D burden fluctuated but remained higher than global levels. In 2021, high body mass index (BMI) was the primary attributable risk factor for DALYs of early-onset T2D (59.85%, 95% UI: 33.54% to 76.65%), and its contribution increased substantially from 40.08% (95% UI: 20.71% to 55.79%) in 1990, followed by ambient particulate matter pollution (14.77%, 95% UI: 8.24% to 21.24%) and diet high in red meat (9.33%, 95% UI: -1.42% to 20.06%). Conclusion: The disease burden of early-onset T2D in China is rapidly increasing, particularly among younger populations and males. Despite a slight decrease in mortality rates, the continued rapid increase in incidence and DALYs indicates a need for strengthened prevention and management strategies, especially interventions targeting younger age groups. High BMI and environmental pollution emerge as primary risk factors and should be prioritized in future interventions.

背景：早发性2型糖尿病（T2D）是一个日益严重的公共卫生问题，尤其是在中国。本研究旨在分析中国早发性T2D的疾病负担特征、长期趋势及归因危险因素。方法：利用全球疾病负担（GBD） 2021的数据，我们分析了1990年至2021年中国15至39岁人群中T2D发病率的年龄标准化率（ASR）、残疾调整生命年（DALYs）和死亡率。采用连接点回归分析分析长期趋势，计算年均变化百分数（AAPC）。我们还研究了早发性T2D在总T2D负担中所占比例的变化及其归因风险因素。结果：1990 - 2021年，中国早发性T2D发病率ASR从140.20 / 10万（95%不确定区间（UI）： 89.14 ~ 204.74）上升至315.97 / 10万（95% UI: 226.75 ~ 417.55）， AAPC为2.67% （95% CI: 2.60% ~ 2.75%, P < 0.001）。DALYs从每10万人116.29例（95% UI: 78.51 ~ 167.05）上升到267.47例（95% UI: 171.08 ~ 387.38）， AAPC为2.75% （95% CI: 2.64% ~ 2.87%, P < 0.001）。死亡率从每10万人0.30 （95% UI: 0.24至0.38）略微下降至0.28 (95% UI: 0.23至0.34)，AAPC为-0.22% （95% CI: -0.33%至-0.11%,P < 0.001）。15 ~ 19岁年龄组发病率增长最快（AAPC: 4.08%, 95% CI: 3.93% ~ 4.29%, P < 0.001）。与女性相比，男性的负担一直更高，而且增加得更快。早发性T2D在总T2D负担中的比例有所波动，但仍高于全球水平。2021年，高体重指数（BMI）是早发性T2D DALYs的主要归因危险因素（59.85%,95% UI: 33.54% ~ 76.65%），其贡献率从1990年的40.08% （95% UI: 20.71% ~ 55.79%）大幅增加，其次是环境颗粒物污染（14.77%,95% UI: 8.24% ~ 21.24%）和高红肉饮食（9.33%,95% UI: -1.42% ~ 20.06%）。结论：中国早发性T2D的疾病负担正在迅速增加，尤其是在年轻人群和男性中。尽管死亡率略有下降，但发病率和伤残调整生命年继续迅速增加表明需要加强预防和管理战略，特别是针对较年轻年龄组的干预措施。高BMI和环境污染是主要的危险因素，应在未来的干预措施中优先考虑。

{"title":"The Burden of Type 2 Diabetes in Adolescents and Young Adults in China: A Secondary Analysis from the Global Burden of Disease Study 2021.","authors":"Junting Yang, Siwei Deng, Houyu Zhao, Feng Sun, Xiaotong Zou, Linong Ji, Siyan Zhan","doi":"10.34133/hds.0210","DOIUrl":"10.34133/hds.0210","url":null,"abstract":"Background: Early-onset type 2 diabetes (T2D) is an increasingly serious public health issue, particularly in China. This study aimed to analyze the characteristics of disease burden, secular trend, and attributable risk factors of early-onset T2D in China. Methods: Using data from the Global Burden of Disease (GBD) 2021, we analyzed the age-standardized rate (ASR) of incidence, disability-adjusted life years (DALYs), and mortality rates of T2D among individuals aged 15 to 39 years in China from 1990 to 2021. Joinpoint regression analysis was employed to analyze secular trend, calculating the average annual percent change (AAPC). We also examined changes in the proportion of early-onset T2D within the total T2D burden and its attributable risk factors. Results: From 1990 to 2021, the ASR of incidence of early-onset T2D in China increased from 140.20 [95% uncertainty interval (UI): 89.14 to 204.74] to 315.97 (95% UI: 226.75 to 417.55) per 100,000, with an AAPC of 2.67% (95% CI: 2.60% to 2.75%, P < 0.001). DALYs rose from 116.29 (95% UI: 78.51 to 167.05) to 267.47 (95% UI: 171.08 to 387.38) per 100,000, with an AAPC of 2.75% (95% CI: 2.64% to 2.87%, P < 0.001). Mortality rates slightly decreased from 0.30 (95% UI: 0.24 to 0.38) to 0.28 (95% UI: 0.23 to 0.34) per 100,000, with an AAPC of -0.22% (95% CI: -0.33% to -0.11%, P < 0.001). The 15 to 19 years age group showed the fastest increase in incidence (AAPC: 4.08%, 95% CI: 3.93% to 4.29%, P < 0.001). The burden was consistently higher and increased more rapidly among males compared to females. The proportion of early-onset T2D within the total T2D burden fluctuated but remained higher than global levels. In 2021, high body mass index (BMI) was the primary attributable risk factor for DALYs of early-onset T2D (59.85%, 95% UI: 33.54% to 76.65%), and its contribution increased substantially from 40.08% (95% UI: 20.71% to 55.79%) in 1990, followed by ambient particulate matter pollution (14.77%, 95% UI: 8.24% to 21.24%) and diet high in red meat (9.33%, 95% UI: -1.42% to 20.06%). Conclusion: The disease burden of early-onset T2D in China is rapidly increasing, particularly among younger populations and males. Despite a slight decrease in mortality rates, the continued rapid increase in incidence and DALYs indicates a need for strengthened prevention and management strategies, especially interventions targeting younger age groups. High BMI and environmental pollution emerge as primary risk factors and should be prioritized in future interventions.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"4 ","pages":"0210"},"PeriodicalIF":0.0,"publicationDate":"2024-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11651706/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142848616","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Federated Learning in Healthcare: A Benchmark Comparison of Engineering and Statistical Approaches for Structured Data Analysis. 医疗保健中的联邦学习：结构化数据分析的工程和统计方法的基准比较。

Health data science

Pub Date : 2024-12-04 eCollection Date: 2024-01-01 DOI: 10.34133/hds.0196

Siqi Li, Di Miao, Qiming Wu, Chuan Hong, Danny D'Agostino, Xin Li, Yilin Ning, Yuqing Shang, Ziwen Wang, Molei Liu, Huazhu Fu, Marcus Eng Hock Ong, Hamed Haddadi, Nan Liu

Background: Federated learning (FL) holds promise for safeguarding data privacy in healthcare collaborations. While the term "FL" was originally coined by the engineering community, the statistical field has also developed privacy-preserving algorithms, though these are less recognized. Our goal was to bridge this gap with the first comprehensive comparison of FL frameworks from both domains. Methods: We assessed 7 FL frameworks, encompassing both engineering-based and statistical FL algorithms, and compared them against local and centralized modeling of logistic regression and least absolute shrinkage and selection operator (Lasso). Our evaluation utilized both simulated data and real-world emergency department data, focusing on comparing both estimated model coefficients and the performance of model predictions. Results: The findings reveal that statistical FL algorithms produce much less biased estimates of model coefficients. Conversely, engineering-based methods can yield models with slightly better prediction performance, occasionally outperforming both centralized and statistical FL models. Conclusion: This study underscores the relative strengths and weaknesses of both types of methods, providing recommendations for their selection based on distinct study characteristics. Furthermore, we emphasize the critical need to raise awareness of and integrate these methods into future applications of FL within the healthcare domain.

背景：联邦学习（FL）有望在医疗保健协作中保护数据隐私。虽然术语“FL”最初是由工程界创造的，但统计领域也开发了隐私保护算法，尽管这些算法不太为人所知。我们的目标是通过首次全面比较两个领域的FL框架来弥合这一差距。方法：我们评估了7个FL框架，包括基于工程和统计的FL算法，并将它们与逻辑回归的局部和集中建模以及最小绝对收缩和选择算子（Lasso）进行了比较。我们的评估利用了模拟数据和现实世界的急诊科数据，重点比较了估计的模型系数和模型预测的性能。结果：研究结果表明，统计FL算法产生的模型系数的偏差估计要小得多。相反，基于工程的方法可以产生稍微更好的预测性能的模型，偶尔优于集中式和统计FL模型。结论：本研究强调了这两种方法的相对优势和劣势，并根据不同的研究特征为其选择提供了建议。此外，我们强调迫切需要提高对这些方法的认识，并将这些方法集成到医疗保健领域FL的未来应用中。

{"title":"Federated Learning in Healthcare: A Benchmark Comparison of Engineering and Statistical Approaches for Structured Data Analysis.","authors":"Siqi Li, Di Miao, Qiming Wu, Chuan Hong, Danny D'Agostino, Xin Li, Yilin Ning, Yuqing Shang, Ziwen Wang, Molei Liu, Huazhu Fu, Marcus Eng Hock Ong, Hamed Haddadi, Nan Liu","doi":"10.34133/hds.0196","DOIUrl":"10.34133/hds.0196","url":null,"abstract":"Background: Federated learning (FL) holds promise for safeguarding data privacy in healthcare collaborations. While the term \"FL\" was originally coined by the engineering community, the statistical field has also developed privacy-preserving algorithms, though these are less recognized. Our goal was to bridge this gap with the first comprehensive comparison of FL frameworks from both domains. Methods: We assessed 7 FL frameworks, encompassing both engineering-based and statistical FL algorithms, and compared them against local and centralized modeling of logistic regression and least absolute shrinkage and selection operator (Lasso). Our evaluation utilized both simulated data and real-world emergency department data, focusing on comparing both estimated model coefficients and the performance of model predictions. Results: The findings reveal that statistical FL algorithms produce much less biased estimates of model coefficients. Conversely, engineering-based methods can yield models with slightly better prediction performance, occasionally outperforming both centralized and statistical FL models. Conclusion: This study underscores the relative strengths and weaknesses of both types of methods, providing recommendations for their selection based on distinct study characteristics. Furthermore, we emphasize the critical need to raise awareness of and integrate these methods into future applications of FL within the healthcare domain.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"4 ","pages":"0196"},"PeriodicalIF":0.0,"publicationDate":"2024-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11615161/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142782014","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Robust Meta-Model for Predicting the Likelihood of Receiving Blood Transfusion in Non-traumatic Intensive Care Unit Patients. 预测非创伤性重症监护室患者接受输血可能性的稳健元模型。

Health data science

Pub Date : 2024-11-06 eCollection Date: 2024-01-01 DOI: 10.34133/hds.0197

Alireza Rafiei, Ronald Moore, Tilendra Choudhary, Curtis Marshall, Geoffrey Smith, John D Roback, Ravi M Patel, Cassandra D Josephson, Rishikesan Kamaleswaran

Background: Blood transfusions, crucial in managing anemia and coagulopathy in intensive care unit (ICU) settings, require accurate prediction for effective resource allocation and patient risk assessment. However, existing clinical decision support systems have primarily targeted a particular patient demographic with unique medical conditions and focused on a single type of blood transfusion. This study aims to develop an advanced machine learning-based model to predict the probability of transfusion necessity over the next 24 h for a diverse range of non-traumatic ICU patients. Methods: We conducted a retrospective cohort study on 72,072 non-traumatic adult ICU patients admitted to a high-volume US metropolitan academic hospital between 2016 and 2020. We developed a meta-learner and various machine learning models to serve as predictors, training them annually with 4-year data and evaluating on the fifth, unseen year, iteratively over 5 years. Results: The experimental results revealed that the meta-model surpasses the other models in different development scenarios. It achieved notable performance metrics, including an area under the receiver operating characteristic curve of 0.97, an accuracy rate of 0.93, and an F1 score of 0.89 in the best scenario. Conclusion: This study pioneers the use of machine learning models for predicting the likelihood of blood transfusion receipt in a diverse cohort of critically ill patients. The findings of this evaluation confirm that our model not only effectively predicts transfusion reception but also identifies key biomarkers for making transfusion decisions.

背景：输血是重症监护病房（ICU）中治疗贫血和凝血功能障碍的关键，需要准确的预测才能进行有效的资源分配和患者风险评估。然而，现有的临床决策支持系统主要针对具有独特医疗条件的特定患者人群，并侧重于单一类型的输血。本研究旨在开发一种先进的基于机器学习的模型，以预测各种非创伤性重症监护病房患者在未来 24 小时内输血的必要性概率。研究方法我们对 2016 年至 2020 年间入住美国一家大城市学术医院的 72,072 名非创伤性成人 ICU 患者进行了回顾性队列研究。我们开发了元学习器和各种机器学习模型作为预测指标，每年用 4 年的数据对其进行训练，并在 5 年内对未见过的第五年进行评估。结果实验结果表明，元模型在不同的开发场景中都超越了其他模型。它取得了显著的性能指标，包括接收器工作特征曲线下面积为 0.97，准确率为 0.93，在最佳情况下的 F1 分数为 0.89。结论这项研究开创性地使用机器学习模型来预测不同危重病人接受输血的可能性。评估结果证实，我们的模型不仅能有效预测输血接收情况，还能识别关键生物标志物，从而做出输血决定。

{"title":"Robust Meta-Model for Predicting the Likelihood of Receiving Blood Transfusion in Non-traumatic Intensive Care Unit Patients.","authors":"Alireza Rafiei, Ronald Moore, Tilendra Choudhary, Curtis Marshall, Geoffrey Smith, John D Roback, Ravi M Patel, Cassandra D Josephson, Rishikesan Kamaleswaran","doi":"10.34133/hds.0197","DOIUrl":"10.34133/hds.0197","url":null,"abstract":"Background: Blood transfusions, crucial in managing anemia and coagulopathy in intensive care unit (ICU) settings, require accurate prediction for effective resource allocation and patient risk assessment. However, existing clinical decision support systems have primarily targeted a particular patient demographic with unique medical conditions and focused on a single type of blood transfusion. This study aims to develop an advanced machine learning-based model to predict the probability of transfusion necessity over the next 24 h for a diverse range of non-traumatic ICU patients. Methods: We conducted a retrospective cohort study on 72,072 non-traumatic adult ICU patients admitted to a high-volume US metropolitan academic hospital between 2016 and 2020. We developed a meta-learner and various machine learning models to serve as predictors, training them annually with 4-year data and evaluating on the fifth, unseen year, iteratively over 5 years. Results: The experimental results revealed that the meta-model surpasses the other models in different development scenarios. It achieved notable performance metrics, including an area under the receiver operating characteristic curve of 0.97, an accuracy rate of 0.93, and an F1 score of 0.89 in the best scenario. Conclusion: This study pioneers the use of machine learning models for predicting the likelihood of blood transfusion receipt in a diverse cohort of critically ill patients. The findings of this evaluation confirm that our model not only effectively predicts transfusion reception but also identifies key biomarkers for making transfusion decisions.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"4 ","pages":"0197"},"PeriodicalIF":0.0,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11538953/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142592448","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Survival Disparities among Cancer Patients Based on Mobility Patterns: A Population-Based Study. 基于流动模式的癌症患者生存差异：基于人口的研究

Health data science

Pub Date : 2024-11-05 eCollection Date: 2024-01-01 DOI: 10.34133/hds.0198

Fengyu Wen, Yike Zhang, Chao Yang, Pengfei Li, Qing Wang, Luxia Zhang

Background: Cancer is a major health problem worldwide. A growing number of cancer patients travel to hospitals outside their residential cities due to unbalanced medical resources. We aimed to evaluate the association between patterns of patient mobility and survival among patients with cancer. Methods: Data of patients hospitalized for cancer between January 2015 and December 2017 were collected from the regional data platform of an eastern coastal province of China. According to the cities of hospitalization and residency, 3 mobility patterns including intra-city, local center, and national center pattern were defined. Patients with intra-city pattern were sequentially matched to patients with the other 2 patterns on demographics, marital status, cancer type, comorbidity, and hospitalization frequency, using propensity score matching. We estimated 5-year survival and the associations between all-cause mortality and patient mobility. Results: Among 20,602 cancer patients, there were 17,035 (82.7%) patients with intra-city pattern, 2,974 (14.4%) patients with local center pattern, and 593 (2.9%) patients with national center pattern. Compared to patients with intra-city pattern, higher survival rates were observed in patients with local center pattern [5-year survival rate, 69.3% versus 65.4%; hazard ratio (HR), 0.85; 95% confidence interval (CI), 0.77 to 0.95] and in patients with national center pattern (5-year survival rate, 69.3% versus 64.5%; HR, 0.80; 95% CI, 0.67 to 0.97). Conclusions: We found significant survival disparities among different mobility patterns of patients with cancer. Improving the quality of cancer care is crucial, especially for cities with below-average healthcare resources.

背景：癌症是世界范围内的主要健康问题。由于医疗资源不均衡，越来越多的癌症患者前往居住城市以外的医院就诊。我们旨在评估癌症患者的流动模式与生存率之间的关系。研究方法我们从中国东部沿海省份的区域数据平台收集了2015年1月至2017年12月期间因癌症住院的患者数据。根据住院和居住城市，定义了3种流动模式，包括市内模式、地方中心模式和国家中心模式。采用倾向得分匹配法，将市内模式的患者与其他两种模式的患者在人口统计学、婚姻状况、癌症类型、合并症和住院频率等方面进行依次匹配。我们估算了患者的 5 年生存率以及全因死亡率与患者流动性之间的关系。结果如下在 20,602 名癌症患者中，有 17,035 人（82.7%）属于城市内模式，2,974 人（14.4%）属于地方中心模式，593 人（2.9%）属于国家中心模式。与市内模式患者相比，当地中心模式患者的存活率更高（5 年存活率，69.3% 对 65.4%；危险比 (HR)，0.85；95% 置信区间 (CI)，0.77 至 0.95），国家中心模式患者的存活率更高（5 年存活率，69.3% 对 64.5%；HR，0.80；95% 置信区间 (CI)，0.67 至 0.97）。结论我们发现不同流动模式的癌症患者之间存在着明显的生存差异。提高癌症治疗质量至关重要，尤其是对于医疗资源低于平均水平的城市。

{"title":"Survival Disparities among Cancer Patients Based on Mobility Patterns: A Population-Based Study.","authors":"Fengyu Wen, Yike Zhang, Chao Yang, Pengfei Li, Qing Wang, Luxia Zhang","doi":"10.34133/hds.0198","DOIUrl":"10.34133/hds.0198","url":null,"abstract":"Background: Cancer is a major health problem worldwide. A growing number of cancer patients travel to hospitals outside their residential cities due to unbalanced medical resources. We aimed to evaluate the association between patterns of patient mobility and survival among patients with cancer. Methods: Data of patients hospitalized for cancer between January 2015 and December 2017 were collected from the regional data platform of an eastern coastal province of China. According to the cities of hospitalization and residency, 3 mobility patterns including intra-city, local center, and national center pattern were defined. Patients with intra-city pattern were sequentially matched to patients with the other 2 patterns on demographics, marital status, cancer type, comorbidity, and hospitalization frequency, using propensity score matching. We estimated 5-year survival and the associations between all-cause mortality and patient mobility. Results: Among 20,602 cancer patients, there were 17,035 (82.7%) patients with intra-city pattern, 2,974 (14.4%) patients with local center pattern, and 593 (2.9%) patients with national center pattern. Compared to patients with intra-city pattern, higher survival rates were observed in patients with local center pattern [5-year survival rate, 69.3% versus 65.4%; hazard ratio (HR), 0.85; 95% confidence interval (CI), 0.77 to 0.95] and in patients with national center pattern (5-year survival rate, 69.3% versus 64.5%; HR, 0.80; 95% CI, 0.67 to 0.97). Conclusions: We found significant survival disparities among different mobility patterns of patients with cancer. Improving the quality of cancer care is crucial, especially for cities with below-average healthcare resources.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"10 ","pages":"0198"},"PeriodicalIF":0.0,"publicationDate":"2024-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11535395/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142585159","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Association of Smoking with Chronic Kidney Disease Stages 3 to 5: A Mendelian Randomization Study. 吸烟与慢性肾脏病 3 至 5 期的关系：孟德尔随机研究。

Health data science

Pub Date : 2024-11-04 eCollection Date: 2024-01-01 DOI: 10.34133/hds.0199

Zhilong Zhang, Feifei Zhang, Xiaomeng Zhang, Lanlan Lu, Luxia Zhang

Background: Previous studies suggested that smoking behavior (e.g., smoking status) was associated with an elevated risk of chronic kidney disease (CKD), yet whether this association is causal remains uncertain. Methods: We used data for half million participants aged 40 to 69 years from the UK Biobank cohort. In the traditional observational study, we used Cox proportional hazards models to calculate the associations between 2 smoking indices-smoking status and lifetime smoking index and incident CKD stages 3 to 5. Mendelian randomization (MR) approaches were used to estimate a potential causal effect. In one-sample MR, genetic variants associated with lifetime smoking index were used as instrument variables to examine the causal associations with CKD stages 3 to 5, among 344,255 UK Biobank participants with white British ancestry. We further validated our findings by a two-sample MR analysis using information from the Chronic Kidney Disease Genetics Consortium genome-wide association study. Results: In the traditional observational study, both smoking status [hazard ratio (HR): 1.26, 95% confidence interval (CI): 1.22 to 1.30] and lifetime smoking index (HR: 1.22, 95% CI: 1.20 to 1.24) were positively associated with a higher risk of incident CKD. However, both our one-sample and two-sample MR analyses showed no causal association between lifetime smoking index and CKD (all P > 0.05). The genetic instruments were validated by several statistical tests, and all sensitivity analyses showed similar results with the main model. Conclusion: Evidence from our analyses does not suggest a causal effect of smoking behavior on CKD risk. The positive association presented in the traditional observational study is possibly a result of confounding.

背景：以前的研究表明，吸烟行为（如吸烟状态）与慢性肾脏病（CKD）风险升高有关，但这种关联是否是因果关系仍不确定。研究方法我们使用了英国生物库队列中 50 万名 40 至 69 岁参与者的数据。在传统的观察性研究中，我们使用 Cox 比例危险模型来计算两个吸烟指数--吸烟状态和终生吸烟指数--与 CKD 3 至 5 期事件之间的关系。孟德尔随机化（MR）方法用于估计潜在的因果效应。在单样本 MR 中，我们将与终生吸烟指数相关的基因变异作为工具变量，在 344,255 名英国生物库参与者（英国白人血统）中检验与 CKD 3 至 5 期的因果关系。我们利用慢性肾脏病遗传学联盟全基因组关联研究的信息，通过双样本 MR 分析进一步验证了我们的研究结果。研究结果在传统的观察性研究中，吸烟状况[危险比（HR）：1.26，95% 置信区间（CI）：1.22 至 1.30]和终生吸烟指数（HR：1.22，95% CI：1.20 至 1.24）均与较高的慢性肾脏病发病风险呈正相关。然而，我们的单样本和双样本 MR 分析表明，终生吸烟指数与 CKD 之间没有因果关系（所有 P > 0.05）。遗传工具已通过多项统计检验得到验证，所有敏感性分析均显示出与主模型相似的结果。结论我们分析的证据并不表明吸烟行为对 CKD 风险有因果效应。传统观察研究中出现的正相关可能是混杂因素造成的。

{"title":"Association of Smoking with Chronic Kidney Disease Stages 3 to 5: A Mendelian Randomization Study.","authors":"Zhilong Zhang, Feifei Zhang, Xiaomeng Zhang, Lanlan Lu, Luxia Zhang","doi":"10.34133/hds.0199","DOIUrl":"10.34133/hds.0199","url":null,"abstract":"Background: Previous studies suggested that smoking behavior (e.g., smoking status) was associated with an elevated risk of chronic kidney disease (CKD), yet whether this association is causal remains uncertain. Methods: We used data for half million participants aged 40 to 69 years from the UK Biobank cohort. In the traditional observational study, we used Cox proportional hazards models to calculate the associations between 2 smoking indices-smoking status and lifetime smoking index and incident CKD stages 3 to 5. Mendelian randomization (MR) approaches were used to estimate a potential causal effect. In one-sample MR, genetic variants associated with lifetime smoking index were used as instrument variables to examine the causal associations with CKD stages 3 to 5, among 344,255 UK Biobank participants with white British ancestry. We further validated our findings by a two-sample MR analysis using information from the Chronic Kidney Disease Genetics Consortium genome-wide association study. Results: In the traditional observational study, both smoking status [hazard ratio (HR): 1.26, 95% confidence interval (CI): 1.22 to 1.30] and lifetime smoking index (HR: 1.22, 95% CI: 1.20 to 1.24) were positively associated with a higher risk of incident CKD. However, both our one-sample and two-sample MR analyses showed no causal association between lifetime smoking index and CKD (all P > 0.05). The genetic instruments were validated by several statistical tests, and all sensitivity analyses showed similar results with the main model. Conclusion: Evidence from our analyses does not suggest a causal effect of smoking behavior on CKD risk. The positive association presented in the traditional observational study is possibly a result of confounding.","PeriodicalId":73207,"journal":{"name":"Health data science","volume":"4 ","pages":"0199"},"PeriodicalIF":0.0,"publicationDate":"2024-11-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11532587/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"142577414","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0