首页 > 最新文献

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science最新文献

英文 中文
A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports. 面向PubMed病例报告相对时间线提取的大语言模型框架。
Jing Wang, Jeremy C Weiss

Timing of clinical events is central to characterization of patient trajectories, enabling analyses such as process tracing, forecasting, and causal reasoning. However, structured electronic health records capture few data elements critical to these tasks, while clinical reports lack temporal localization of events in structured form. We present a system that transforms case reports into textual time series-structured pairs of textual events and timestamps. We contrast manual and large language model (LLM) annotations (n=320 and n=390 respectively) of ten randomly-sampled PubMed open-access (PMOA) case reports (N=152,974) and assess inter-LLM agreement (n=3,103 N=93). We find that the LLM models have moderate event recall (O1-preview: 0.80) but high temporal concordance among identified events (O1-preview: 0.95). By establishing the task, annotation, and assessment systems, and by demonstrating high concordance, this work may serve as a benchmark for leveraging the PMOA corpus for temporal analytics. Code is available at:https://github.com/jcweiss2/LLM-Timeline-PMOA/.

临床事件的时间是表征患者轨迹的核心,可以进行过程跟踪、预测和因果推理等分析。然而,结构化的电子健康记录捕获的对这些任务至关重要的数据元素很少,而临床报告缺乏结构化形式的事件时间定位。我们提出了一个将案例报告转换为文本时间序列的系统——文本事件和时间戳的结构化对。我们对比了10个随机抽样的PubMed开放获取(PMOA)病例报告(n= 152,974)的手动和大型语言模型(LLM)注释(n=320和n=390),并评估了LLM间的一致性(n=3,103 n= 93)。我们发现LLM模型具有中等的事件回忆率(0 - 1预览:0.80),但识别事件之间的时间一致性较高(0 - 1预览:0.95)。通过建立任务、注释和评估系统,并通过展示高度的一致性,这项工作可以作为利用PMOA语料库进行时间分析的基准。代码可从https://github.com/jcweiss2/LLM-Timeline-PMOA/获得。
{"title":"A Large-Language Model Framework for Relative Timeline Extraction from PubMed Case Reports.","authors":"Jing Wang, Jeremy C Weiss","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Timing of clinical events is central to characterization of patient trajectories, enabling analyses such as process tracing, forecasting, and causal reasoning. However, structured electronic health records capture few data elements critical to these tasks, while clinical reports lack temporal localization of events in structured form. We present a system that transforms case reports into textual time series-structured pairs of textual events and timestamps. We contrast manual and large language model (LLM) annotations (n=320 and n=390 respectively) of ten randomly-sampled PubMed open-access (PMOA) case reports (N=152,974) and assess inter-LLM agreement (n=3,103 N=93). We find that the LLM models have moderate event recall (O1-preview: 0.80) but high temporal concordance among identified events (O1-preview: 0.95). By establishing the task, annotation, and assessment systems, and by demonstrating high concordance, this work may serve as a benchmark for leveraging the PMOA corpus for temporal analytics. Code is available at:https://github.com/jcweiss2/LLM-Timeline-PMOA/.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"598-606"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150726/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276832","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Generalized Tool to Assess Algorithmic Fairness in Disease Phenotype Definitions. 一种评估疾病表型定义算法公平性的通用工具。
Jacob S Zelko, Justin Manjourides

For evidence from observational studies to be reliable, researchers must ensure that the patient populations of interest are accurately defined. However, disease definitions can be extremely difficult to standardize and implement accurately across different datasets and study requirements. Furthermore, in this context, they must also ensure that populations are represented fairly to accurately reflect populations' various demographic dynamics and to not overgeneralize across non-applicable populations. In this work, we present a generalized tool to assess the fairness of disease definitions by evaluating their implementation across common fairness metrics. Our approach calculates fairness metrics and provides a robust method to examine coarse and strongly intersecting populations across many characteristics. We highlight workflows when working with disease definitions, provide an example analysis using an OMOP CDM patient database, and discuss potential directions for future improvement and research.

为了使观察性研究的证据可靠,研究人员必须确保所关注的患者群体得到准确定义。然而,在不同的数据集和研究要求中,疾病定义可能非常难以标准化和准确实施。此外,在这方面,它们还必须确保公平地代表人口,以准确地反映人口的各种动态,而不是对不适用的人口进行过度概括。在这项工作中,我们提出了一个通用的工具来评估疾病定义的公平性,通过评估它们在常见公平性指标上的实施情况。我们的方法计算公平指标,并提供了一种健壮的方法来检查粗糙和强交叉的人口在许多特征上。我们强调了处理疾病定义时的工作流程,提供了使用OMOP CDM患者数据库的示例分析,并讨论了未来改进和研究的潜在方向。
{"title":"A Generalized Tool to Assess Algorithmic Fairness in Disease Phenotype Definitions.","authors":"Jacob S Zelko, Justin Manjourides","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>For evidence from observational studies to be reliable, researchers must ensure that the patient populations of interest are accurately defined. However, disease definitions can be extremely difficult to standardize and implement accurately across different datasets and study requirements. Furthermore, in this context, they must also ensure that populations are represented fairly to accurately reflect populations' various demographic dynamics and to not overgeneralize across non-applicable populations. In this work, we present a generalized tool to assess the fairness of disease definitions by evaluating their implementation across common fairness metrics. Our approach calculates fairness metrics and provides a robust method to examine coarse and strongly intersecting populations across many characteristics. We highlight workflows when working with disease definitions, provide an example analysis using an OMOP CDM patient database, and discuss potential directions for future improvement and research.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"624-633"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150753/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276831","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Generative AI Is Not Ready for Clinical Use in Patient Education for Lower Back Pain Patients, Even With Retrieval-Augmented Generation. 生成式人工智能尚未准备好用于下背部疼痛患者的临床患者教育,即使有检索增强生成。
Yi-Fei Zhao, Allyn Bove, David Thompson, James Hill, Yi Xu, Yufan Ren, Andrea Hassman, Leming Zhou, Yanshan Wang

Low back pain (LBP) is a leading cause of disability globally. Following the onset of LBP and subsequent treatment, adequate patient education is crucial for improving functionality and long-term outcomes. Despite advancements in patient education strategies, significant gaps persist in delivering personalized, evidence-based information to patients with LBP. Recent advancements in large language models (LLMs) and generative artificial intelligence (GenAI) have demonstrated the potential to enhance patient education. However, their application and efficacy in delivering educational content to patients with LBP remain underexplored and warrant further investigation. In this study, we introduce a novel approach utilizing LLMs with Retrieval-Augmented Generation (RAG) and few-shot learning to generate tailored educational materials for patients with LBP. Physical therapists manually evaluated our model responses for redundancy, accuracy, and completeness using a Likert scale. In addition, the readability of the generated education materials is assessed using the Flesch Reading Ease score. The findings demonstrate that RAG-based LLMs outperform traditional LLMs, providing more accurate, complete, and readable patient education materials with less redundancy. Having said that, our analysis reveals that the generated materials are not yet ready for use in clinical practice. This study underscores the potential of AI-driven models utilizing RAG to improve patient education for LBP; however, significant challenges remain in ensuring the clinical relevance and granularity of content generated by these models.

下腰痛(LBP)是全球致残的主要原因。在腰痛发病和后续治疗后,充分的患者教育对于改善功能和长期预后至关重要。尽管患者教育策略取得了进步,但在向LBP患者提供个性化、循证信息方面仍存在显著差距。大型语言模型(llm)和生成式人工智能(GenAI)的最新进展已经证明了增强患者教育的潜力。然而,它们在为LBP患者提供教育内容方面的应用和疗效仍有待进一步研究。在这项研究中,我们引入了一种新的方法,利用llm与检索增强生成(RAG)和少量学习来为LBP患者生成定制的教育材料。物理治疗师使用李克特量表手动评估我们的模型反应的冗余、准确性和完整性。此外,使用Flesch Reading Ease评分来评估生成的教育材料的可读性。研究结果表明,基于rag的法学硕士优于传统法学硕士,提供更准确、完整、可读的患者教育材料,冗余更少。话虽如此,我们的分析表明,生成的材料尚未准备好用于临床实践。这项研究强调了利用RAG的人工智能驱动模型在改善LBP患者教育方面的潜力;然而,在确保这些模型生成的内容的临床相关性和粒度方面仍然存在重大挑战。
{"title":"Generative AI Is Not Ready for Clinical Use in Patient Education for Lower Back Pain Patients, Even With Retrieval-Augmented Generation.","authors":"Yi-Fei Zhao, Allyn Bove, David Thompson, James Hill, Yi Xu, Yufan Ren, Andrea Hassman, Leming Zhou, Yanshan Wang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Low back pain (LBP) is a leading cause of disability globally. Following the onset of LBP and subsequent treatment, adequate patient education is crucial for improving functionality and long-term outcomes. Despite advancements in patient education strategies, significant gaps persist in delivering personalized, evidence-based information to patients with LBP. Recent advancements in large language models (LLMs) and generative artificial intelligence (GenAI) have demonstrated the potential to enhance patient education. However, their application and efficacy in delivering educational content to patients with LBP remain underexplored and warrant further investigation. In this study, we introduce a novel approach utilizing LLMs with Retrieval-Augmented Generation (RAG) and few-shot learning to generate tailored educational materials for patients with LBP. Physical therapists manually evaluated our model responses for redundancy, accuracy, and completeness using a Likert scale. In addition, the readability of the generated education materials is assessed using the Flesch Reading Ease score. The findings demonstrate that RAG-based LLMs outperform traditional LLMs, providing more accurate, complete, and readable patient education materials with less redundancy. Having said that, our analysis reveals that the generated materials are not yet ready for use in clinical practice. This study underscores the potential of AI-driven models utilizing RAG to improve patient education for LBP; however, significant challenges remain in ensuring the clinical relevance and granularity of content generated by these models.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"644-653"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150711/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276848","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare. 将健康的社会决定因素整合到知识图谱中:评估医疗保健中的预测偏差和公平性。
Tianqi Shang, Weiqing He, Tianlong Chen, Ying Ding, Huanmei Wu, Kaixiong Zhou, Li Shen

Social determinants of health (SDoH) play a crucial role in patient health outcomes, yet their integration into biomedical knowledge graphs remains underexplored. This study addresses this gap by constructing an SDoH-enriched knowledge graph using the MIMIC-III dataset and PrimeKG. We introduce a novel fairness formulation for graph embeddings, focusing on invariance with respect to sensitive SDoH information. Via employing a heterogeneous-GCN model for drug-disease link prediction, we detect biases related to various SDoH factors. To mitigate these biases, we propose a post-processing method that strategically reweights edges connected to SDoHs, balancing their influence on graph representations. This approach represents one of the first comprehensive investigations into fairness issues within biomedical knowledge graphs incorporating SDoH. Our work not only highlights the importance of considering SDoH in medical informatics but also provides a concrete method for reducing SDoH-related biases in link prediction tasks, paving the way for more equitable healthcare recommendations. Our code is available at https://github.com/hwq0726/SDoH-KG.

健康的社会决定因素(SDoH)在患者健康结果中起着至关重要的作用,但将其整合到生物医学知识图谱中仍未得到充分探索。本研究通过使用MIMIC-III数据集和PrimeKG构建一个sdoh丰富的知识图来解决这一差距。我们引入了一种新的图嵌入公平性公式,重点关注敏感SDoH信息的不变性。通过采用异构gcn模型进行药物-疾病联系预测,我们检测到与各种SDoH因素相关的偏差。为了减轻这些偏差,我们提出了一种后处理方法,策略性地重新加权与sdoh相连的边,平衡它们对图表示的影响。这种方法代表了对生物医学知识图谱中包含SDoH的公平性问题的首次全面调查之一。我们的工作不仅强调了在医学信息学中考虑SDoH的重要性,而且还提供了一种具体的方法来减少链接预测任务中与SDoH相关的偏差,为更公平的医疗建议铺平了道路。我们的代码可在https://github.com/hwq0726/SDoH-KG上获得。
{"title":"Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare.","authors":"Tianqi Shang, Weiqing He, Tianlong Chen, Ying Ding, Huanmei Wu, Kaixiong Zhou, Li Shen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Social determinants of health (SDoH) play a crucial role in patient health outcomes, yet their integration into biomedical knowledge graphs remains underexplored. This study addresses this gap by constructing an SDoH-enriched knowledge graph using the MIMIC-III dataset and PrimeKG. We introduce a novel fairness formulation for graph embeddings, focusing on invariance with respect to sensitive SDoH information. Via employing a heterogeneous-GCN model for drug-disease link prediction, we detect biases related to various SDoH factors. To mitigate these biases, we propose a post-processing method that strategically reweights edges connected to SDoHs, balancing their influence on graph representations. This approach represents one of the first comprehensive investigations into fairness issues within biomedical knowledge graphs incorporating SDoH. Our work not only highlights the importance of considering SDoH in medical informatics but also provides a concrete method for reducing SDoH-related biases in link prediction tasks, paving the way for more equitable healthcare recommendations. Our code is available at https://github.com/hwq0726/SDoH-KG.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"481-490"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150739/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276853","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of Machine Learning Models in Predicting Mental Health Sequelae Following Concussion in Youth. 预测青少年脑震荡后心理健康后遗症的机器学习模型比较
Jin Peng, Jiayuan Chen, Changchang Yin, Ping Zhang, Jingzhen Yang

Youth who experience concussions may be at greater risk for subsequent mental health challenges, making early detection crucial for timely intervention. This study utilized Bidirectional Long Short-Term Memory (BiLSTM) networks to predict mental health outcomes following concussion in youth and compared its performance to traditional models. We also examined whether incorporating social determinants of health (SDoH) improved predictive power, given the disproportionate impact of concussions and mental health issues on disadvantaged populations. We evaluated the models using accuracy, area under the curve (4UC) of the receiver operating characteristic (ROC), and other performance metrics. Our BiLSTM model with SDoH data achieved the highest accuracy (0.883) and 4UC-ROC score (0.892). Unlike traditional models, our approach provided real-time predictions at each visit within 12 months of the index concussion, aiding clinicians in making timely, visit-specific referrals for further treatment and interventions.

经历过脑震荡的青年随后可能面临更大的心理健康挑战,因此及早发现对及时干预至关重要。本研究利用双向长短期记忆(BiLSTM)网络预测青少年脑震荡后的心理健康结果,并将其与传统模型的表现进行比较。考虑到脑震荡和心理健康问题对弱势群体的不成比例的影响,我们还研究了纳入健康的社会决定因素(SDoH)是否提高了预测能力。我们使用准确性、受试者工作特征(ROC)曲线下面积(4UC)和其他性能指标来评估模型。采用SDoH数据的BiLSTM模型准确率最高(0.883),4UC-ROC评分最高(0.892)。与传统模型不同的是,我们的方法可以在脑震荡后12个月内的每次就诊中提供实时预测,帮助临床医生及时、具体地进行进一步的治疗和干预。
{"title":"Comparison of Machine Learning Models in Predicting Mental Health Sequelae Following Concussion in Youth.","authors":"Jin Peng, Jiayuan Chen, Changchang Yin, Ping Zhang, Jingzhen Yang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Youth who experience concussions may be at greater risk for subsequent mental health challenges, making early detection crucial for timely intervention. This study utilized Bidirectional Long Short-Term Memory (BiLSTM) networks to predict mental health outcomes following concussion in youth and compared its performance to traditional models. We also examined whether incorporating social determinants of health (SDoH) improved predictive power, given the disproportionate impact of concussions and mental health issues on disadvantaged populations. We evaluated the models using accuracy, area under the curve (4UC) of the receiver operating characteristic (ROC), and other performance metrics. Our BiLSTM model with SDoH data achieved the highest accuracy (0.883) and 4UC-ROC score (0.892). Unlike traditional models, our approach provided real-time predictions at each visit within 12 months of the index concussion, aiding clinicians in making timely, visit-specific referrals for further treatment and interventions.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"422-431"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150734/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276860","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Disparate Model Performance and Stability in Machine Learning Clinical Support for Diabetes and Heart Diseases. 机器学习在糖尿病和心脏病临床支持中的不同模型性能和稳定性。
Ioannis Bilionis, Ricardo C Berrios, Luis Fernandez-Luque, Carlos Castillo

Machine Learning (ML) algorithms are vital for supporting clinical decision-making in biomedical informatics. However, their predictive performance can vary across demographic groups, often due to the underrepresentation of historically marginalized populations in training datasets. The investigation reveals widespread sex- and age-related inequities in chronic disease datasets and their derived ML models. Thus, a novel analytical framework is introduced, combining systematic arbitrariness with traditional metrics like accuracy and data complexity. The analysis of data from over 25,000 individuals with chronic diseases revealed mild sex-related disparities, favoring predictive accuracy for males, and significant age-related differences, with better accuracy for younger patients. Notably, older patients showed inconsistent predictive accuracy across seven datasets, linked to higher data complexity and lower model performance. This highlights that representativeness in training data alone does not guarantee equitable outcomes, and model arbitrariness must be addressed before deploying models in clinical settings.

机器学习(ML)算法对于支持生物医学信息学中的临床决策至关重要。然而,它们的预测性能在不同的人口群体中可能会有所不同,这通常是由于训练数据集中历史上边缘化人群的代表性不足。调查揭示了慢性疾病数据集及其衍生的ML模型中普遍存在与性别和年龄相关的不平等。因此,引入了一种新的分析框架,将系统的任意性与传统的度量(如准确性和数据复杂性)相结合。对25000多名慢性疾病患者的数据进行分析,发现与性别有关的轻微差异有利于男性的预测准确性,与年龄有关的显著差异有利于年轻患者的预测准确性。值得注意的是,老年患者在七个数据集中表现出不一致的预测准确性,这与较高的数据复杂性和较低的模型性能有关。这突出表明,仅训练数据的代表性并不能保证公平的结果,在将模型部署到临床环境之前,必须解决模型的随意性。
{"title":"Disparate Model Performance and Stability in Machine Learning Clinical Support for Diabetes and Heart Diseases.","authors":"Ioannis Bilionis, Ricardo C Berrios, Luis Fernandez-Luque, Carlos Castillo","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Machine Learning (ML) algorithms are vital for supporting clinical decision-making in biomedical informatics. However, their predictive performance can vary across demographic groups, often due to the underrepresentation of historically marginalized populations in training datasets. The investigation reveals widespread sex- and age-related inequities in chronic disease datasets and their derived ML models. Thus, a novel analytical framework is introduced, combining systematic arbitrariness with traditional metrics like accuracy and data complexity. The analysis of data from over 25,000 individuals with chronic diseases revealed mild sex-related disparities, favoring predictive accuracy for males, and significant age-related differences, with better accuracy for younger patients. Notably, older patients showed inconsistent predictive accuracy across seven datasets, linked to higher data complexity and lower model performance. This highlights that representativeness in training data alone does not guarantee equitable outcomes, and model arbitrariness must be addressed before deploying models in clinical settings.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"95-104"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150696/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
EntroLLM: Leveraging Entropy and Large Language Model Embeddings for Enhanced Risk Prediction with Wearable Device Data. 利用熵和大语言模型嵌入增强可穿戴设备数据的风险预测。
Xueqing Huang, Tian Gu

Wearable devices collect complex structured data with high-dimensional and time-series features that are challenging for traditional models to handle efficiently. We propose EntroLLM, a new method that combines entropy measures and the low-dimensional representation (embedding) generated from large language models (LLMs) to enhance risk prediction using wearable device data. In EntroLLM, the entropy quantifies the variability of a subject's physical activity patterns, while the LLM embedding approximates the latent temporal structure. We evaluate the feasibility and performance of EntroLLM using NHANES data to predict overweight status using demographics and physical activity collected from wearable devices. Results show that combining entropy with GPT-based embedding improves model performance compared to baseline models and other embedding techniques, leading to an average increase in AUC from 0.56 to 0.64. EntroLLM showcases the potential of combining entropy and LLM-based embedding and offers a promising approach to wearable device data analysis for predicting health outcomes.

可穿戴设备收集具有高维和时间序列特征的复杂结构化数据,传统模型难以有效处理这些数据。我们提出了一种新的方法EntroLLM,它结合了熵度量和由大型语言模型(llm)生成的低维表示(嵌入),以增强使用可穿戴设备数据的风险预测。在EntroLLM中,熵量化了受试者身体活动模式的可变性,而LLM嵌入近似于潜在的时间结构。我们使用NHANES数据来评估EntroLLM的可行性和性能,通过可穿戴设备收集的人口统计数据和身体活动来预测超重状态。结果表明,与基线模型和其他嵌入技术相比,将熵与基于gpt的嵌入相结合可以提高模型的性能,使AUC平均从0.56提高到0.64。EntroLLM展示了熵和基于llm的嵌入相结合的潜力,并为可穿戴设备的数据分析提供了一种有前途的方法,用于预测健康结果。
{"title":"EntroLLM: Leveraging Entropy and Large Language Model Embeddings for Enhanced Risk Prediction with Wearable Device Data.","authors":"Xueqing Huang, Tian Gu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Wearable devices collect complex structured data with high-dimensional and time-series features that are challenging for traditional models to handle efficiently. We propose EntroLLM, a new method that combines entropy measures and the low-dimensional representation (embedding) generated from large language models (LLMs) to enhance risk prediction using wearable device data. In EntroLLM, the entropy quantifies the variability of a subject's physical activity patterns, while the LLM embedding approximates the latent temporal structure. We evaluate the feasibility and performance of EntroLLM using NHANES data to predict overweight status using demographics and physical activity collected from wearable devices. Results show that combining entropy with GPT-based embedding improves model performance compared to baseline models and other embedding techniques, leading to an average increase in AUC from 0.56 to 0.64. EntroLLM showcases the potential of combining entropy and LLM-based embedding and offers a promising approach to wearable device data analysis for predicting health outcomes.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"225-234"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150754/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276870","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Predicting survival time for critically ill patients with heart failure using conformalized survival analysis. 应用符合化生存分析预测危重心衰患者的生存时间。
Xiaomeng Wang, Zhimei Ren, Jiancheng Ye

Heart failure (HF) is a significant public health challenge, especially among critically ill patients in intensive care units (ICUs). Predicting survival outcomes for these patients with calibrated uncertainty is both challenging and essential for guiding subsequent treatments. This study introduces conformalized survival analysis (CSA) as a novel method for predicting survival times in critically ill HF patients. CSA enhances each predicted survival time with a statistically rigorous lower bound, providing valuable uncertainty quantification. Using the MIMIC-IV dataset, we demonstrate that CSA effectively delivers calibrated uncertainty quantification for survival predictions, in contrast to parametric models like the Cox or Accelerated Failure Time models. Through the application of CSA to a large, real-world dataset, this study underscores its potential to improve decision-making in critical care, offering a more precise and reliable tool for prognosis in a setting where accurate predictions and calibrated uncertainty can profoundly impact patient outcomes.

心力衰竭(HF)是一项重大的公共卫生挑战,特别是在重症监护病房(icu)的重症患者中。根据校准的不确定性预测这些患者的生存结果既具有挑战性,又对指导后续治疗至关重要。本研究将符合化生存分析(CSA)作为预测危重心衰患者生存时间的新方法。CSA提高了每个预测生存时间与统计严格的下界,提供有价值的不确定性量化。使用MIMIC-IV数据集,我们证明了与Cox或加速失效时间模型等参数模型相比,CSA有效地为生存预测提供了校准的不确定性量化。通过将CSA应用于大型真实数据集,本研究强调了其改善重症监护决策的潜力,在准确预测和校准不确定性可能深刻影响患者预后的情况下,为预后提供了更精确和可靠的工具。
{"title":"Predicting survival time for critically ill patients with heart failure using conformalized survival analysis.","authors":"Xiaomeng Wang, Zhimei Ren, Jiancheng Ye","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Heart failure (HF) is a significant public health challenge, especially among critically ill patients in intensive care units (ICUs). Predicting survival outcomes for these patients with calibrated uncertainty is both challenging and essential for guiding subsequent treatments. This study introduces conformalized survival analysis (CSA) as a novel method for predicting survival times in critically ill HF patients. CSA enhances each predicted survival time with a statistically rigorous lower bound, providing valuable uncertainty quantification. Using the MIMIC-IV dataset, we demonstrate that CSA effectively delivers calibrated uncertainty quantification for survival predictions, in contrast to parametric models like the Cox or Accelerated Failure Time models. Through the application of CSA to a large, real-world dataset, this study underscores its potential to improve decision-making in critical care, offering a more precise and reliable tool for prognosis in a setting where accurate predictions and calibrated uncertainty can profoundly impact patient outcomes.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"576-597"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150701/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276880","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Comparative Analysis of Patient Similarity Measures for Outcome Prediction. 预后预测患者相似度的比较分析。
Deyi Li, Alan S L Yu, Mei Liu

Personalized medicine aims to improve clinical outcomes by tailoring treatments to individual patients based on genetic, phenotypic, or psychosocial characteristics, leveraging insights from similar patients. This is particularly necessary for managing diseases with significant variability in their causes, progressions and prognoses. Accurate measurement of patient similarity is crucial in this context, as it enables the identification of a high-quality cohort of similar patients, thereby enhancing clinical decision making with better evidence. However, previous studies have not comprehensively compared different patient similarity measures in large-scale retrospective analyses of electronic health records (EHRs). In this study, we conducted a comparative analysis of four patient similarity measures focusing on feature weighting mechanisms, using EHR data from 46,968 hospitalized patients. For evaluation, we assessed the patient similarity measures for predicting acute kidney injury, readmission, and mortality. Our results showed that using grid-searched weights to combine features based by their types outperformed all other methods.

个性化医疗旨在根据遗传、表型或社会心理特征,利用类似患者的见解,为个体患者量身定制治疗方案,从而改善临床结果。这对于管理在病因、进展和预后方面具有显著差异的疾病尤其必要。在这种情况下,准确测量患者的相似性是至关重要的,因为它可以确定一个高质量的相似患者队列,从而在更好的证据基础上加强临床决策。然而,以往的研究并没有全面比较电子病历(EHRs)大规模回顾性分析中不同的患者相似性度量。在这项研究中,我们使用来自46,968名住院患者的电子病历数据,对四种患者相似度度量进行了比较分析,重点关注特征加权机制。为了评估,我们评估了预测急性肾损伤、再入院和死亡率的患者相似性指标。我们的结果表明,使用网格搜索的权重来组合基于类型的特征优于所有其他方法。
{"title":"A Comparative Analysis of Patient Similarity Measures for Outcome Prediction.","authors":"Deyi Li, Alan S L Yu, Mei Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Personalized medicine aims to improve clinical outcomes by tailoring treatments to individual patients based on genetic, phenotypic, or psychosocial characteristics, leveraging insights from similar patients. This is particularly necessary for managing diseases with significant variability in their causes, progressions and prognoses. Accurate measurement of patient similarity is crucial in this context, as it enables the identification of a high-quality cohort of similar patients, thereby enhancing clinical decision making with better evidence. However, previous studies have not comprehensively compared different patient similarity measures in large-scale retrospective analyses of electronic health records (EHRs). In this study, we conducted a comparative analysis of four patient similarity measures focusing on feature weighting mechanisms, using EHR data from 46,968 hospitalized patients. For evaluation, we assessed the patient similarity measures for predicting acute kidney injury, readmission, and mortality. Our results showed that using grid-searched weights to combine features based by their types outperformed all other methods.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"270-279"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150746/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276830","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
An Implemented Real-World-Data Pipeline for Standardization of Electronic Health Records in Precision Oncology. 精确肿瘤学电子健康记录标准化的实时数据管道实现。
Kory Kreimeyer, Durrant Barasa, Mohamed Sherief, Xiaorui Shi, Marvin Borja, Srinivasan Yegnasubramanian, Valsamo Anagnostou, Joseph C Murray, Taxiarchis Botsis

Several use cases in precision oncology require accurately extracting and standardizing Real-World Data from Electronic Health Records (EHRs). We developed the infrastructure and a toolset incorporating data mining and natural language processing scripts to automatically retrieve selected descriptive and common endpoint variables from EHRs. This toolset was evaluated against a reference dataset of 106 lung cancer and 45 sarcoma patient cases pulled from two databases complying with the Precision Oncology Core Data Model (Precision-DM) and maintained by the Johns Hopkins Molecular Tumor Board and a research team. We accurately retrieved most descriptive EHR fields but less efficiently extracted the Date of Diagnosis and Treatment Start Date that supported calculating the Age at Diagnosis, Overall Survival, and Time to First Treatment (accuracy range 50%-86%). Our infrastructure and Precision-DM-based standardization could inspire similar efforts in other cancer centers, however, the toolset should be enhanced to improve accuracy in certain variables.

精确肿瘤学中的几个用例需要从电子健康记录(EHRs)中准确提取和标准化真实世界数据。我们开发了包含数据挖掘和自然语言处理脚本的基础设施和工具集,以便从电子病历中自动检索选定的描述性和公共端点变量。该工具集与来自两个数据库的106例肺癌和45例肉瘤患者的参考数据集进行了评估,这些数据库符合精确肿瘤学核心数据模型(Precision- dm),并由约翰霍普金斯分子肿瘤委员会和一个研究小组维护。我们准确地检索了大多数描述性EHR字段,但提取诊断日期和治疗开始日期的效率较低,这些数据支持计算诊断年龄、总生存期和首次治疗时间(准确率范围为50%-86%)。我们的基础设施和基于precision dm的标准化可以在其他癌症中心激发类似的努力,然而,应该加强工具集以提高某些变量的准确性。
{"title":"An Implemented Real-World-Data Pipeline for Standardization of Electronic Health Records in Precision Oncology.","authors":"Kory Kreimeyer, Durrant Barasa, Mohamed Sherief, Xiaorui Shi, Marvin Borja, Srinivasan Yegnasubramanian, Valsamo Anagnostou, Joseph C Murray, Taxiarchis Botsis","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Several use cases in precision oncology require accurately extracting and standardizing Real-World Data from Electronic Health Records (EHRs). We developed the infrastructure and a toolset incorporating data mining and natural language processing scripts to automatically retrieve selected descriptive and common endpoint variables from EHRs. This toolset was evaluated against a reference dataset of 106 lung cancer and 45 sarcoma patient cases pulled from two databases complying with the Precision Oncology Core Data Model (Precision-DM) and maintained by the Johns Hopkins Molecular Tumor Board and a research team. We accurately retrieved most descriptive EHR fields but less efficiently extracted the Date of Diagnosis and Treatment Start Date that supported calculating the Age at Diagnosis, Overall Survival, and Time to First Treatment (accuracy range 50%-86%). Our infrastructure and Precision-DM-based standardization could inspire similar efforts in other cancer centers, however, the toolset should be enhanced to improve accuracy in certain variables.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"242-249"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150718/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276837","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1