首页 > 最新文献

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science最新文献

英文 中文
Feasibility of Automated Precharting using GPT-4 in New Specialty Referrals. 在新的专科转诊中使用GPT-4自动预诊的可行性。
April S Liang, Juan M Banda, Thomas Savage, Abby Pandya, Rebecca Carey, Uchechukwu C Megwalu, Michael T Chang, Dev Dash, Conor K Corbin, Aditya Sharma, Rahul Thapa, Nikesh Kotecha, Nigam H Shah, Jennifer Y Lee, Jonathan H Chen

This study evaluates the feasibility of using GPT-4 to automate precharting for specialty referrals, focusing on new patients referred to an otolaryngology clinic for nasal congestion. We describe the design decisions and strategies tested in creating this precharting utility, including methods for prompt design and token limit handling. Through iterative testing and building, our tool achieved 95.0% agreement with physician consensus in a small retrospective test sample. Results from a small prospective pilot showed favorable feedback of summaries in a real-world clinical setting, though there was a discrepancy between high intention to use the summary but lower perception of time savings. Our results demonstrate that automated pre-charting with accuracy and clinical relevance can be feasible with large language models such as GPT-4. Our design features can inform the development of vendor chart summarization solutions.

本研究评估了使用GPT-4自动预诊专科转诊的可行性,重点关注耳鼻喉科诊所因鼻塞而转诊的新患者。我们描述了在创建此预绘制实用程序时测试的设计决策和策略,包括用于提示设计和令牌限制处理的方法。通过反复测试和构建,我们的工具在一个小的回顾性测试样本中与医生的共识达成了95.0%的一致性。从一个小型的前瞻性试点的结果显示,在现实世界的临床环境中,总结的反馈是有利的,尽管在使用总结的高意愿和较低的时间节省的感知之间存在差异。我们的研究结果表明,在GPT-4等大型语言模型中,具有准确性和临床相关性的自动预表是可行的。我们的设计特性可以为供应商图表总结解决方案的开发提供信息。
{"title":"Feasibility of Automated Precharting using GPT-4 in New Specialty Referrals.","authors":"April S Liang, Juan M Banda, Thomas Savage, Abby Pandya, Rebecca Carey, Uchechukwu C Megwalu, Michael T Chang, Dev Dash, Conor K Corbin, Aditya Sharma, Rahul Thapa, Nikesh Kotecha, Nigam H Shah, Jennifer Y Lee, Jonathan H Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study evaluates the feasibility of using GPT-4 to automate precharting for specialty referrals, focusing on new patients referred to an otolaryngology clinic for nasal congestion. We describe the design decisions and strategies tested in creating this precharting utility, including methods for prompt design and token limit handling. Through iterative testing and building, our tool achieved 95.0% agreement with physician consensus in a small retrospective test sample. Results from a small prospective pilot showed favorable feedback of summaries in a real-world clinical setting, though there was a discrepancy between high intention to use the summary but lower perception of time savings. Our results demonstrate that automated pre-charting with accuracy and clinical relevance can be feasible with large language models such as GPT-4. Our design features can inform the development of vendor chart summarization solutions.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"312-321"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150724/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276846","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
powerROC: An Interactive Web Tool for Sample Size Calculation in Assessing Models' Discriminative Abilities. powerROC:一个用于评估模型判别能力的样本量计算的交互式网络工具。
François Grolleau, Robert Tibshirani, Jonathan H Chen

Rigorous external validation is crucial for assessing the generalizability of prediction models, particularly by evaluating their discrimination (AUROC) on new data. This often involves comparing a new model's AUROC to that of an established reference model. However, many studies rely on arbitrary rules of thumb for sample size calculations, often resulting in underpowered analyses and unreliable conclusions. This paper reviews crucial concepts for accurate sample size determination in AUROC-based external validation studies, making the theory and practice more accessible to researchers and clinicians. We introduce powerROC, an open-source web tool designed to simplify these calculations, enabling both the evaluation of a single model and the comparison of two models. The tool offers guidance on selecting target precision levels and employs flexible approaches, leveraging either pilot data or user-defined probability distributions. We illustrate powerROC's utility through a case study on hospital mortality prediction using the MIMIC database.

严格的外部验证对于评估预测模型的泛化性至关重要,特别是通过评估它们对新数据的辨别能力(AUROC)。这通常涉及将新模型的AUROC与已建立的参考模型的AUROC进行比较。然而,许多研究依赖于任意的经验法则来计算样本大小,往往导致分析不足和结论不可靠。本文回顾了在基于auroc的外部验证研究中准确确定样本量的关键概念,使研究人员和临床医生更容易获得理论和实践。我们介绍了powerROC,这是一个开源的web工具,旨在简化这些计算,既可以对单个模型进行评估,也可以对两个模型进行比较。该工具提供了选择目标精度水平的指导,并采用灵活的方法,利用试验数据或用户定义的概率分布。我们通过一个使用MIMIC数据库进行医院死亡率预测的案例研究来说明powerROC的实用性。
{"title":"powerROC: An Interactive Web Tool for Sample Size Calculation in Assessing Models' Discriminative Abilities.","authors":"François Grolleau, Robert Tibshirani, Jonathan H Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Rigorous external validation is crucial for assessing the generalizability of prediction models, particularly by evaluating their discrimination (AUROC) on new data. This often involves comparing a new model's AUROC to that of an established reference model. However, many studies rely on arbitrary rules of thumb for sample size calculations, often resulting in underpowered analyses and unreliable conclusions. This paper reviews crucial concepts for accurate sample size determination in AUROC-based external validation studies, making the theory and practice more accessible to researchers and clinicians. We introduce powerROC, an open-source web tool designed to simplify these calculations, enabling both the evaluation of a single model and the comparison of two models. The tool offers guidance on selecting target precision levels and employs flexible approaches, leveraging either pilot data or user-defined probability distributions. We illustrate powerROC's utility through a case study on hospital mortality prediction using the MIMIC database.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"196-204"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150715/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276878","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sex-Based Differences in the Association of Epigenetic Age Acceleration with Alzheimer's Disease Biomarkers and Cognitive Measures. 表观遗传年龄加速与阿尔茨海默病生物标志物和认知测量之间的性别差异
Travyse A Edwards, Tianhua Zhai, Kwangsik Nho, Andrew J Saykin, Qi Long, Li Shen

Alzheimer's Disease (AD) is a neurodegenerative disorder marked by cognitive and functional decline. Biological sex has been linked to differences in lifetime AD risk, AD-related neuropathology, and the rate of cognitive decline, although the underlying biological mechanisms driving these disparities remain unclear. Epigenetic Age Acceleration-a metric derived from epigenetic aging clocks-has been associated with numerous aging-related conditions such as AD. Although there is promise in using Epigenetic age acceleration as a biomarker for several aging related diseases, the underlying mechanism that aging clocks are actually predicting is not well understood. In this study, we used data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to examine how sex influences the relationship between age acceleration and cognitive performance as well as brain volume. Our findings suggest that, although epigenetic age acceleration can predict changes in brain structure, these changes don't appear to be different across sexes. Future research should focus on validating these findings in an external cohort and exploring them longitudinally.

阿尔茨海默病(AD)是一种以认知和功能下降为特征的神经退行性疾病。生物性别与阿尔茨海默病终生风险、阿尔茨海默病相关神经病理和认知能力下降率的差异有关,尽管导致这些差异的潜在生物学机制尚不清楚。表观遗传衰老加速——一种源自表观遗传衰老时钟的指标——与许多与衰老相关的疾病(如阿尔茨海默病)有关。尽管表观遗传衰老加速有望作为几种衰老相关疾病的生物标志物,但衰老时钟实际预测的潜在机制尚未得到很好的理解。在这项研究中,我们使用来自阿尔茨海默病神经影像学倡议(ADNI)的数据来研究性别如何影响年龄加速与认知能力以及脑容量之间的关系。我们的研究结果表明,尽管表观遗传年龄加速可以预测大脑结构的变化,但这些变化似乎在性别之间没有差异。未来的研究应侧重于在外部队列中验证这些发现,并对其进行纵向探索。
{"title":"Sex-Based Differences in the Association of Epigenetic Age Acceleration with Alzheimer's Disease Biomarkers and Cognitive Measures.","authors":"Travyse A Edwards, Tianhua Zhai, Kwangsik Nho, Andrew J Saykin, Qi Long, Li Shen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Alzheimer's Disease (AD) is a neurodegenerative disorder marked by cognitive and functional decline. Biological sex has been linked to differences in lifetime AD risk, AD-related neuropathology, and the rate of cognitive decline, although the underlying biological mechanisms driving these disparities remain unclear. Epigenetic Age Acceleration-a metric derived from epigenetic aging clocks-has been associated with numerous aging-related conditions such as AD. Although there is promise in using Epigenetic age acceleration as a biomarker for several aging related diseases, the underlying mechanism that aging clocks are actually predicting is not well understood. In this study, we used data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to examine how sex influences the relationship between age acceleration and cognitive performance as well as brain volume. Our findings suggest that, although epigenetic age acceleration can predict changes in brain structure, these changes don't appear to be different across sexes. Future research should focus on validating these findings in an external cohort and exploring them longitudinally.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"141-148"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150745/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276884","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Studying Veteran food insecurity longitudinally using electronic health record data and natural language processing. 利用电子健康记录数据和自然语言处理技术对退伍军人食品安全状况进行纵向研究。
Alec B Chapman, Talia Panadero, Rachel Dalrymple, Alicia Cohen, Nipa Kamdar, Farhana Pethani, Andrea Kalvesmaki, Richard E Nelson, Jorie Butler

Food insecurity is an important social risk factor that is directly linked to patient health and well-being. The Department of Veterans Affairs (VA) aims to identify and resolve food insecurity through social and clinical interventions. However, evaluating the impact of such interventions is made challenging by the lack of follow-up data on Veteran food insecurity status. One potential solution is to leverage documentation of food insecurity in electronic health records (EHRs). In this paper, we developed and validated a natural language processing system to identify food insecurity status from clinical notes and applied it to study longitudinal trajectories of food insecurity among a large cohort of food insecure Veterans. Our analyses provide insight into the timing and persistence of Veteran food insecurity; in the future, our methods will be used to evaluate food insecurity interventions and evaluate VA policy.

粮食不安全是一个重要的社会风险因素,与患者的健康和福祉直接相关。退伍军人事务部(VA)旨在通过社会和临床干预来确定和解决粮食不安全问题。然而,由于缺乏关于退伍军人粮食不安全状况的后续数据,评估此类干预措施的影响具有挑战性。一个潜在的解决方案是利用电子健康记录(EHRs)中的食品不安全记录。在本文中,我们开发并验证了一种自然语言处理系统,用于从临床记录中识别粮食不安全状况,并将其应用于研究大量粮食不安全退伍军人的粮食不安全纵向轨迹。我们的分析提供了洞察老兵粮食不安全的时间和持久性;未来,我们的方法将用于评估粮食不安全干预措施和评估VA政策。
{"title":"Studying Veteran food insecurity longitudinally using electronic health record data and natural language processing.","authors":"Alec B Chapman, Talia Panadero, Rachel Dalrymple, Alicia Cohen, Nipa Kamdar, Farhana Pethani, Andrea Kalvesmaki, Richard E Nelson, Jorie Butler","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Food insecurity is an important social risk factor that is directly linked to patient health and well-being. The Department of Veterans Affairs (VA) aims to identify and resolve food insecurity through social and clinical interventions. However, evaluating the impact of such interventions is made challenging by the lack of follow-up data on Veteran food insecurity status. One potential solution is to leverage documentation of food insecurity in electronic health records (EHRs). In this paper, we developed and validated a natural language processing system to identify food insecurity status from clinical notes and applied it to study longitudinal trajectories of food insecurity among a large cohort of food insecure Veterans. Our analyses provide insight into the timing and persistence of Veteran food insecurity; in the future, our methods will be used to evaluate food insecurity interventions and evaluate VA policy.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"124-133"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150752/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276887","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic Exploration of Hospital Cost Variability: A Conformal Prediction-Based Outlier Detection Method for Electronic Health Records. 医院成本变异性的系统探索:一种基于适形预测的电子健康记录异常检测方法。
François Grolleau, Ethan Goh, Stephen P Ma, Jonathan Masterson, Ted Ross, Arnold Milstein, Jonathan H Chen

Marked variability in inpatient hospitalization costs poses significant challenges to healthcare quality, resource allocation, and patient outcomes. Traditional methods like Diagnosis-Related Groups (DRGs) aid in cost management but lack practical solutions for enhancing hospital care value. We introduce a novel methodology for outlier detection in Electronic Health Records (EHRs) using Conformal Prediction. This approach identifies and prioritizes areas for optimizing high-value care processes. Unlike conventional predictive models that neglect uncertainty, our method employs Conformal Quantile Regression (CQR) to generate robust prediction intervals, offering a comprehensive view of cost variability. By integrating Conformal Prediction with machine learning models, healthcare professionals can more accurately pinpoint opportunities for quality and efficiency improvements. Our framework systematically evaluates unexplained hospital cost variations and generates interpretable hypotheses for refining clinical practices associated with atypical costs. This data-driven approach offers a systematic method to generate clinically sound hypotheses that may inform processes to enhance care quality and optimize resource utilization.

住院费用的显著差异对医疗质量、资源分配和患者预后构成了重大挑战。诊断相关分组(DRGs)等传统方法有助于成本管理,但缺乏提高医院护理价值的实际解决方案。我们介绍了一种利用适形预测来检测电子健康记录(EHRs)中的异常值的新方法。该方法确定并优先考虑优化高价值护理流程的领域。与忽略不确定性的传统预测模型不同,我们的方法采用保形分位数回归(CQR)来生成稳健的预测区间,提供了成本可变性的全面视图。通过将适形预测与机器学习模型集成,医疗保健专业人员可以更准确地找到提高质量和效率的机会。我们的框架系统地评估了不明原因的医院成本变化,并为改进与非典型成本相关的临床实践产生了可解释的假设。这种数据驱动的方法提供了一种系统的方法来产生临床合理的假设,这些假设可以为提高护理质量和优化资源利用的过程提供信息。
{"title":"Systematic Exploration of Hospital Cost Variability: A Conformal Prediction-Based Outlier Detection Method for Electronic Health Records.","authors":"François Grolleau, Ethan Goh, Stephen P Ma, Jonathan Masterson, Ted Ross, Arnold Milstein, Jonathan H Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Marked variability in inpatient hospitalization costs poses significant challenges to healthcare quality, resource allocation, and patient outcomes. Traditional methods like Diagnosis-Related Groups (DRGs) aid in cost management but lack practical solutions for enhancing hospital care value. We introduce a novel methodology for outlier detection in Electronic Health Records (EHRs) using Conformal Prediction. This approach identifies and prioritizes areas for optimizing high-value care processes. Unlike conventional predictive models that neglect uncertainty, our method employs Conformal Quantile Regression (CQR) to generate robust prediction intervals, offering a comprehensive view of cost variability. By integrating Conformal Prediction with machine learning models, healthcare professionals can more accurately pinpoint opportunities for quality and efficiency improvements. Our framework systematically evaluates unexplained hospital cost variations and generates interpretable hypotheses for refining clinical practices associated with atypical costs. This data-driven approach offers a systematic method to generate clinically sound hypotheses that may inform processes to enhance care quality and optimize resource utilization.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"187-195"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150741/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276889","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Exploring ChatGPT 3.5 for structured data extraction from oncological notes. 探索ChatGPT 3.5从肿瘤笔记中提取结构化数据。
Ty J Skyles, Isaac J Freeman, Georgewilliam Kalibbala, David Davila-Garcia, Kendall Kiser, Silpa Raju, Adam Wilcox

In large-scale clinical informatics, there is a need to maximize the amount of usable data from electronic health records. With the adoption of large language models in medical research, there is potential to use them to extract structured data from unstructured clinical notes. We explored how ChatGPT could be used to improve data availability in cancer research. We assessed how GPT used clinical notes to answer six relevant clinical questions. Four prompt engineering strategies were used: zero-shot, zero-shot with context, few-shot, and few-shot with context. Few-shot prompting often decreased the accuracy of GPT outputs and context did not consistently improve accuracy. GPT extracted patients' Gleason scores and ages with an F1 score of 0.99 and it identified if patients received palliative care with and if patients were in pain with an F1 score of 0.86. Effective use of LLMs has potential to increase interoperability between healthcare and clinical research.

在大规模临床信息学中,需要最大限度地利用电子健康记录中的可用数据。随着在医学研究中采用大型语言模型,有可能使用它们从非结构化临床记录中提取结构化数据。我们探索了如何使用ChatGPT来提高癌症研究中的数据可用性。我们评估了GPT如何使用临床记录来回答六个相关的临床问题。使用了四种提示工程策略:零射击、零射击结合情境、少射击和少射击结合情境。少数镜头提示通常会降低GPT输出的准确性,并且上下文并没有始终提高准确性。GPT提取患者Gleason评分和年龄,F1评分为0.99,识别患者是否接受姑息治疗和患者是否有疼痛,F1评分为0.86。llm的有效使用有可能增加医疗保健和临床研究之间的互操作性。
{"title":"Exploring ChatGPT 3.5 for structured data extraction from oncological notes.","authors":"Ty J Skyles, Isaac J Freeman, Georgewilliam Kalibbala, David Davila-Garcia, Kendall Kiser, Silpa Raju, Adam Wilcox","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>In large-scale clinical informatics, there is a need to maximize the amount of usable data from electronic health records. With the adoption of large language models in medical research, there is potential to use them to extract structured data from unstructured clinical notes. We explored how ChatGPT could be used to improve data availability in cancer research. We assessed how GPT used clinical notes to answer six relevant clinical questions. Four prompt engineering strategies were used: zero-shot, zero-shot with context, few-shot, and few-shot with context. Few-shot prompting often decreased the accuracy of GPT outputs and context did not consistently improve accuracy. GPT extracted patients' Gleason scores and ages with an F1 score of 0.99 and it identified if patients received palliative care with and if patients were in pain with an F1 score of 0.86. Effective use of LLMs has potential to increase interoperability between healthcare and clinical research.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"518-526"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150697/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276874","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Temporal Rule Mining for Enhanced Risk Pattern Extraction: A Case Study with Acute Kidney Injury. 增强风险模式提取的时间规则挖掘:以急性肾损伤为例。
Ho Yin Chan, Alan S Yu, Mei Liu

Association rule mining is a widely used data mining technique to uncover knowledge from large datasets. In healthcare, it can reveal meaningful patterns within electronic health records (EHR) that inform clinical decision-making and treatment strategies. However, many studies neglect the temporal aspects of EHR data, potentially overlooking patterns linked to specific time periods or sequence of clinical events. Recent advancements have introduced methods for mining temporal association rules, offering enhanced predictive and descriptive insights. We propose a multi-step framework that utilizes temporal pattern mining algorithm to extract actionable and temporal risk patterns for acute kidney injury (AKI) from EHR data. Our algorithm identified approximately 3,313 rules with 10 actionable features, characterized by low support and high confidence. These rules have a median support of 0.055 and a median confidence of 0.58. We highlight key rules, explore their potential clinical implications, and present a network-based view to provide actionable insights.

关联规则挖掘是一种广泛使用的数据挖掘技术,用于从大型数据集中发现知识。在医疗保健领域,它可以揭示电子健康记录(EHR)中有意义的模式,为临床决策和治疗策略提供信息。然而,许多研究忽视了电子病历数据的时间方面,可能忽略了与特定时间段或临床事件序列相关的模式。最近的进展引入了挖掘时间关联规则的方法,提供了增强的预测性和描述性见解。我们提出了一个多步骤框架,利用时间模式挖掘算法从电子病历数据中提取急性肾损伤(AKI)的可操作和时间风险模式。我们的算法识别了大约3313条规则和10个可操作的特征,具有低支持度和高置信度的特点。这些规则的中位数支持度为0.055,中位数置信度为0.58。我们强调关键规则,探索其潜在的临床意义,并提出基于网络的观点,以提供可操作的见解。
{"title":"Temporal Rule Mining for Enhanced Risk Pattern Extraction: A Case Study with Acute Kidney Injury.","authors":"Ho Yin Chan, Alan S Yu, Mei Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Association rule mining is a widely used data mining technique to uncover knowledge from large datasets. In healthcare, it can reveal meaningful patterns within electronic health records (EHR) that inform clinical decision-making and treatment strategies. However, many studies neglect the temporal aspects of EHR data, potentially overlooking patterns linked to specific time periods or sequence of clinical events. Recent advancements have introduced methods for mining temporal association rules, offering enhanced predictive and descriptive insights. We propose a multi-step framework that utilizes temporal pattern mining algorithm to extract actionable and temporal risk patterns for acute kidney injury (AKI) from EHR data. Our algorithm identified approximately 3,313 rules with 10 actionable features, characterized by low support and high confidence. These rules have a median support of 0.055 and a median confidence of 0.58. We highlight key rules, explore their potential clinical implications, and present a network-based view to provide actionable insights.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"115-123"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150717/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276890","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Transfer Learning with Clinical Concept Embeddings from Large Language Models. 基于大型语言临床概念嵌入的迁移学习。
Yuhe Gao, Runxue Bao, Yuelyu Ji, Yiming Sun, Chenxi Song, Jeffrey P Ferraro, Ye Ye

Knowledge exchange is crucial in healthcare, particularly when leveraging data from multiple clinical sites to address data scarcity, reduce costs, and enable timely interventions. Transfer learning can facilitate cross-site knowledge transfer, yet a significant challenge is the heterogeneity in clinical concepts across different sites. Recently, Large Language Models (LLMs) have shown significant potential in capturing the semantic meanings of clinical concepts and mitigating heterogeneity in biomedicine. This study analyzed electronic health records from two large healthcare systems to assess the impact of semantic embeddings from LLMs on local models, shared models, and transfer learning models. The results indicate that domain-specific LLMs, such as Med-BERT, consistently outperform in local and direct transfer scenarios, whereas generic models like OpenAI embeddings may need fine-tuning for optimal performance. This study emphasizes the importance of domain-specific embeddings and meticulous model tuning for effective knowledge transfer in healthcare. It remains essential to investigate the balance the balance between the complexity of downstream tasks, the size of training samples, and the extent of model tuning.

知识交换在医疗保健中至关重要,特别是在利用来自多个临床站点的数据来解决数据短缺、降低成本和实现及时干预时。迁移学习可以促进跨站点的知识转移,但一个重要的挑战是不同站点的临床概念的异质性。近年来,大型语言模型(llm)在捕获临床概念的语义和减轻生物医学的异质性方面显示出巨大的潜力。本研究分析了来自两个大型医疗保健系统的电子健康记录,以评估llm语义嵌入对本地模型、共享模型和迁移学习模型的影响。结果表明,特定领域的llm,如Med-BERT,在本地和直接传输场景中始终表现出色,而像OpenAI嵌入这样的通用模型可能需要微调以获得最佳性能。本研究强调了领域特定嵌入和细致模型调优对于医疗保健中有效知识转移的重要性。研究下游任务的复杂性、训练样本的大小和模型调整的程度之间的平衡仍然是必要的。
{"title":"Transfer Learning with Clinical Concept Embeddings from Large Language Models.","authors":"Yuhe Gao, Runxue Bao, Yuelyu Ji, Yiming Sun, Chenxi Song, Jeffrey P Ferraro, Ye Ye","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Knowledge exchange is crucial in healthcare, particularly when leveraging data from multiple clinical sites to address data scarcity, reduce costs, and enable timely interventions. Transfer learning can facilitate cross-site knowledge transfer, yet a significant challenge is the heterogeneity in clinical concepts across different sites. Recently, Large Language Models (LLMs) have shown significant potential in capturing the semantic meanings of clinical concepts and mitigating heterogeneity in biomedicine. This study analyzed electronic health records from two large healthcare systems to assess the impact of semantic embeddings from LLMs on local models, shared models, and transfer learning models. The results indicate that domain-specific LLMs, such as Med-BERT, consistently outperform in local and direct transfer scenarios, whereas generic models like OpenAI embeddings may need fine-tuning for optimal performance. This study emphasizes the importance of domain-specific embeddings and meticulous model tuning for effective knowledge transfer in healthcare. It remains essential to investigate the balance the balance between the complexity of downstream tasks, the size of training samples, and the extent of model tuning.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"167-176"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150738/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276904","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RD-LIVES: A Living Evidence Synthesis System for Rare Disease Treatment Efficacy and Safety. RD-LIVES:罕见病治疗疗效和安全性的活证据合成系统。
Jinlian Wang, Hui Li, Hongfang Liu

Although rare diseases (RD) are gaining priority in healthcare worldwide, developing research policies for studying them in public settings remains challenging due to the limited evidence available. Evidence generation is crucial for rare diseases, requiring systematic assessment of study quality across multiple sources. Given the scarcity of patients, literature and clinical trial data for orphan drugs, we developed RD-LIVES-a tool designed to automatically accelerate evidence collection from literature and clinical trials for systematic reviews and meta-analyses. This tool enhances our understanding of treatment outcomes, determines appropriate follow-up durations, and informs the required treatment impact size for new drugs. Using Idiopathic Pulmonary Fibrosis (IPF) as an example, we demonstrate how RD-LIVES automates evidence collection and element extraction. The results indicate that RD-LIVES plays a vital role in designing costly prospective trials and has the potential to increase the likelihood of successful trial outcomes.

尽管罕见病(RD)在全球卫生保健领域日益受到重视,但由于现有证据有限,制定在公共环境中研究罕见病的研究政策仍然具有挑战性。证据生成对罕见病至关重要,需要对多个来源的研究质量进行系统评估。鉴于孤儿药患者、文献和临床试验数据的稀缺性,我们开发了rd - livess工具,旨在自动加速从文献和临床试验中收集证据,用于系统评价和荟萃分析。该工具增强了我们对治疗结果的理解,确定了适当的随访时间,并告知了新药所需的治疗影响大小。以特发性肺纤维化(IPF)为例,我们演示了RD-LIVES如何自动化证据收集和元素提取。结果表明,RD-LIVES在设计昂贵的前瞻性试验中起着至关重要的作用,并有可能增加试验结果成功的可能性。
{"title":"RD-LIVES: A Living Evidence Synthesis System for Rare Disease Treatment Efficacy and Safety.","authors":"Jinlian Wang, Hui Li, Hongfang Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Although rare diseases (RD) are gaining priority in healthcare worldwide, developing research policies for studying them in public settings remains challenging due to the limited evidence available. Evidence generation is crucial for rare diseases, requiring systematic assessment of study quality across multiple sources. Given the scarcity of patients, literature and clinical trial data for orphan drugs, we developed RD-LIVES-a tool designed to automatically accelerate evidence collection from literature and clinical trials for systematic reviews and meta-analyses. This tool enhances our understanding of treatment outcomes, determines appropriate follow-up durations, and informs the required treatment impact size for new drugs. Using Idiopathic Pulmonary Fibrosis (IPF) as an example, we demonstrate how RD-LIVES automates evidence collection and element extraction. The results indicate that RD-LIVES plays a vital role in designing costly prospective trials and has the potential to increase the likelihood of successful trial outcomes.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"607-613"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150729/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276881","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Disease Detection in Radiology Reports Through Fine-tuning Lightweight LLM on Weak Labels. 通过微调弱标签上的轻量级LLM来增强放射学报告中的疾病检测。
Yishu Wei, Xindi Wang, Hanley Ong, Yiliang Zhou, Adam Flanders, George Shih, Yifan Peng

Despite significant progress in applying large language models (LLMs) to the medical domain, several limitations still prevent them from practical applications. Among these are the constraints on model size and the lack of cohort-specific labeled datasets. In this work, we investigated the potential of improving a lightweight LLM, such as Llama 3.1-8B, through fine-tuning with datasets using synthetic labels. Two tasks are jointly trained by combining their respective instruction datasets. When the quality of the task-specific synthetic labels is relatively high (e.g., generated by GPT4-o), Llama 3.1-8B achieves satisfactory performance on the open-ended disease detection task, with a micro F1 score of 0.91. Conversely, when the quality of the task-relevant synthetic labels is relatively low (e.g., from the MIMIC-CXR dataset), fine-tuned Llama 3.1-8B is able to surpass its noisy teacher labels (micro F1 score of 0.67 v.s. 0.63) when calibrated against curated labels, indicating the strong inherent underlying capability of the model. These findings demonstrate the potential offine-tuning LLMs with synthetic labels, offering a promising direction for future research on LLM specialization in the medical domain.

尽管在将大型语言模型(llm)应用于医学领域方面取得了重大进展,但一些限制仍然阻碍了它们的实际应用。其中包括模型大小的限制和缺乏特定队列标记数据集。在这项工作中,我们研究了通过使用合成标签对数据集进行微调来改进轻量级LLM(如Llama 3.1-8B)的潜力。两个任务通过结合各自的指令数据集进行联合训练。当任务特异性合成标签质量较高时(例如由GPT4-o生成),Llama 3.1-8B在开放式疾病检测任务上取得了令人满意的表现,微F1得分为0.91。相反,当与任务相关的合成标签的质量相对较低时(例如,来自MIMIC-CXR数据集),经过微调的Llama 3.1-8B能够超过其嘈杂的教师标签(微F1分数为0.67 vs . 0.63),当与精选标签进行校准时,表明该模型具有强大的内在潜在能力。这些发现证明了具有合成标签的脱机调优LLM的潜力,为未来医学领域LLM专业化的研究提供了一个有希望的方向。
{"title":"Enhancing Disease Detection in Radiology Reports Through Fine-tuning Lightweight LLM on Weak Labels.","authors":"Yishu Wei, Xindi Wang, Hanley Ong, Yiliang Zhou, Adam Flanders, George Shih, Yifan Peng","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Despite significant progress in applying large language models (LLMs) to the medical domain, several limitations still prevent them from practical applications. Among these are the constraints on model size and the lack of cohort-specific labeled datasets. In this work, we investigated the potential of improving a lightweight LLM, such as Llama 3.1-8B, through fine-tuning with datasets using synthetic labels. Two tasks are jointly trained by combining their respective instruction datasets. When the quality of the task-specific synthetic labels is relatively high (e.g., generated by GPT4-o), Llama 3.1-8B achieves satisfactory performance on the open-ended disease detection task, with a micro F1 score of 0.91. Conversely, when the quality of the task-relevant synthetic labels is relatively low (e.g., from the MIMIC-CXR dataset), fine-tuned Llama 3.1-8B is able to surpass its noisy teacher labels (micro F1 score of 0.67 v.s. 0.63) when calibrated against curated labels, indicating the strong inherent underlying capability of the model. These findings demonstrate the potential offine-tuning LLMs with synthetic labels, offering a promising direction for future research on LLM specialization in the medical domain.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"614-623"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150749/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276868","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1