首页 > 最新文献

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science最新文献

英文 中文
SLR: A Modified Logistic Regression Model with Sinkhorn Divergence for Alzheimer's Disease Classification. 具有Sinkhorn散度的修正Logistic回归模型用于阿尔茨海默病分类。
Qipeng Zhan, Zhuoping Zhou, Zixuan Wen, Zexuan Wang, Boning Tong, Heng Huang, Andrew J Saykin, Paul M Thompson, Christos Davatzikos, Li Shen

Logistic regression is a widely used model in machine learning, particularly as a baseline for binary classification tasks due to its simplicity, effectiveness, and interpretability. It is especially powerful when dealing with categorical features. Despite its advantages, standard logistic regression fails to capture the distributional and geometric structure of data, especially when features are derived from structured spaces like brain imaging. For instance, in Voxel-Based Morphometry (VBM), measurements from distinct brain regions follow a clear spatial organization, which standard logistic regression cannot fully leverage. In this paper, we propose Sinkhorn Logistic Regression (SLR), a variant of logistic regression that incorporates the Sinkhorn divergence as a loss function. This adaptation enables the model to leverage geometric information about the data distribution, enhancing its performance on structured datasets.

逻辑回归是机器学习中广泛使用的模型,由于其简单,有效和可解释性,特别是作为二元分类任务的基线。它在处理分类特征时尤其强大。尽管标准逻辑回归具有优势,但它无法捕捉数据的分布和几何结构,特别是当特征来自结构化空间(如脑成像)时。例如,在基于体素的形态测量(VBM)中,来自不同大脑区域的测量遵循一个清晰的空间组织,这是标准逻辑回归无法充分利用的。在本文中,我们提出了Sinkhorn逻辑回归(SLR),这是逻辑回归的一种变体,它将Sinkhorn散度作为损失函数。这种适应使模型能够利用有关数据分布的几何信息,增强其在结构化数据集上的性能。
{"title":"SLR: A Modified Logistic Regression Model with Sinkhorn Divergence for Alzheimer's Disease Classification.","authors":"Qipeng Zhan, Zhuoping Zhou, Zixuan Wen, Zexuan Wang, Boning Tong, Heng Huang, Andrew J Saykin, Paul M Thompson, Christos Davatzikos, Li Shen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Logistic regression is a widely used model in machine learning, particularly as a baseline for binary classification tasks due to its simplicity, effectiveness, and interpretability. It is especially powerful when dealing with categorical features. Despite its advantages, standard logistic regression fails to capture the distributional and geometric structure of data, especially when features are derived from structured spaces like brain imaging. For instance, in Voxel-Based Morphometry (VBM), measurements from distinct brain regions follow a clear spatial organization, which standard logistic regression cannot fully leverage. In this paper, we propose Sinkhorn Logistic Regression (SLR), a variant of logistic regression that incorporates the Sinkhorn divergence as a loss function. This adaptation enables the model to leverage geometric information about the data distribution, enhancing its performance on structured datasets.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"634-643"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150743/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276885","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Institutional Platform for Secure Self-Service Large Language Model Exploration. 安全自助服务大型语言模式探索的机构平台。
V K Cody Bumgardner, Mitchell A Klusty, W Vaiden Logan, Samuel E Armstrong, Caroline N Leach, Caylin Hickey, Jeff Talbert

This paper introduces a user-friendly platform developed by the University of Kentucky Center for Applied AI, designed to make customized large language models (LLMs) more accessible. By capitalizing on recent advancements in multi-LoRA inference, the system efficiently accommodates custom adapters for a diverse range of users and projects. The paper outlines the system's architecture and key features, encompassing dataset curation, model training, secure inference, and text-based feature extraction. We illustrate the establishment of a tenant-aware computational network using agent-based methods, securely utilizing islands of isolated resources as a unified system. The platform strives to deliver secure, affordable LLM services, emphasizing process and data isolation, end-to-end encryption, and role-based resource authentication. This contribution aligns with the overarching goal of enabling simplified access to cutting-edge AI models and technology in support of scientific discovery and the development of biomedical informatics.

本文介绍了由肯塔基大学应用人工智能中心开发的一个用户友好平台,旨在使定制的大型语言模型(llm)更容易访问。通过利用多lora推理的最新进展,该系统有效地为各种用户和项目提供定制适配器。本文概述了系统的架构和关键特征,包括数据集管理、模型训练、安全推理和基于文本的特征提取。我们说明了使用基于代理的方法建立一个租户感知计算网络,安全地利用孤立的资源孤岛作为一个统一的系统。该平台致力于提供安全、经济的LLM服务,强调流程和数据隔离、端到端加密以及基于角色的资源身份验证。这一贡献符合简化获取尖端人工智能模型和技术以支持科学发现和生物医学信息学发展的总体目标。
{"title":"Institutional Platform for Secure Self-Service Large Language Model Exploration.","authors":"V K Cody Bumgardner, Mitchell A Klusty, W Vaiden Logan, Samuel E Armstrong, Caroline N Leach, Caylin Hickey, Jeff Talbert","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This paper introduces a user-friendly platform developed by the University of Kentucky Center for Applied AI, designed to make customized large language models (LLMs) more accessible. By capitalizing on recent advancements in multi-LoRA inference, the system efficiently accommodates custom adapters for a diverse range of users and projects. The paper outlines the system's architecture and key features, encompassing dataset curation, model training, secure inference, and text-based feature extraction. We illustrate the establishment of a tenant-aware computational network using agent-based methods, securely utilizing islands of isolated resources as a unified system. The platform strives to deliver secure, affordable LLM services, emphasizing process and data isolation, end-to-end encryption, and role-based resource authentication. This contribution aligns with the overarching goal of enabling simplified access to cutting-edge AI models and technology in support of scientific discovery and the development of biomedical informatics.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"105-114"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150735/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276851","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancing Cross-Domain Generalizability in Social Determinants of Health Extraction with Prompt-Tuning Large Language Models. 利用快速调优的大语言模型增强健康提取社会决定因素的跨领域泛化性。
Cheng Peng, Zehao Yu, Kaleb E Smith, Wei-Hsuan Lo-Ciganic, Jiang Bian, Yonghui Wu

The progress in natural language processing (NLP) using large language models (LLMs) has greatly improved patient information extraction from clinical narratives. However, most methods based on the fine-tuning strategy have limited transfer learning ability for cross-domain applications. This study proposed a novel approach that employs a soft prompt-based learning architecture, which introduces trainable prompts to guide LLMs toward desired outputs. We examined two types of LLM architectures, including encoder-only GatorTron and decoder-only GatorTronGPT, and evaluated their performance for the extraction of social determinants of health (SDoH) using a cross-institution dataset from the 2022 n2c2 challenge and a cross-disease dataset from the University of Florida (UF) Health. The results show that decoder-only LLMs with prompt tuning achieved better performance in cross-domain applications. GatorTronGPT achieved the best F1 scores for both datasets, outperforming traditional fine-tuned GatorTron by 8.9% and 21.8% in a cross-institution setting, and 5.5% and 14.5% in a cross-disease setting.

利用大型语言模型(llm)的自然语言处理(NLP)的进展极大地改善了从临床叙述中提取患者信息。然而,大多数基于微调策略的方法在跨领域应用中的迁移学习能力有限。本研究提出了一种新颖的方法,采用基于软提示的学习架构,引入可训练的提示来指导法学硕士获得期望的输出。我们研究了两种类型的LLM架构,包括仅编码器的GatorTron和仅解码器的GatorTronGPT,并使用来自2022年n2c2挑战的跨机构数据集和来自佛罗里达大学(UF)健康的跨疾病数据集评估了它们在提取健康社会决定因素(SDoH)方面的性能。结果表明,具有快速调优的纯解码器llm在跨域应用中获得了更好的性能。GatorTronGPT在两个数据集上都取得了最好的F1分数,在跨机构设置中比传统的微调GatorTron高8.9%和21.8%,在跨疾病设置中比传统的GatorTron高5.5%和14.5%。
{"title":"Enhancing Cross-Domain Generalizability in Social Determinants of Health Extraction with Prompt-Tuning Large Language Models.","authors":"Cheng Peng, Zehao Yu, Kaleb E Smith, Wei-Hsuan Lo-Ciganic, Jiang Bian, Yonghui Wu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The progress in natural language processing (NLP) using large language models (LLMs) has greatly improved patient information extraction from clinical narratives. However, most methods based on the fine-tuning strategy have limited transfer learning ability for cross-domain applications. This study proposed a novel approach that employs a soft prompt-based learning architecture, which introduces trainable prompts to guide LLMs toward desired outputs. We examined two types of LLM architectures, including encoder-only GatorTron and decoder-only GatorTronGPT, and evaluated their performance for the extraction of social determinants of health (SDoH) using a cross-institution dataset from the 2022 n2c2 challenge and a cross-disease dataset from the University of Florida (UF) Health. The results show that decoder-only LLMs with prompt tuning achieved better performance in cross-domain applications. GatorTronGPT achieved the best F1 scores for both datasets, outperforming traditional fine-tuned GatorTron by 8.9% and 21.8% in a cross-institution setting, and 5.5% and 14.5% in a cross-disease setting.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"432-440"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150740/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276867","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Method for Enabling Digital Health Technologies in Clinical and Translational Research at an Academic Medical Center. 在学术医疗中心的临床和转化研究中启用数字健康技术的方法。
Cindy Chen, Laura R Bradford, Melissa A Epstein, J Travis Gossey, Mohammad N Mansour, Christy M O'Connor, Brian J Tschinkel, Thomas R Campion

Federal- and state-level governance as well as local institutional oversight are changing rapidly to address the accelerated growth in the usage of digital health technologies (DHT) -such as apps, wearables, and websites-to enable clinical and translational research. While studies have described frameworks for assessing and/or implementing individual DHTs, to our knowledge there are none describing how to implement and support multiple DHTs at an academic medical center (AMC). A multi-disciplinary team including information technology, institutional review board, legal, and privacy professionals identified 33 items to evaluate as part of onboarding studies using DHTs. In a one-year period at one AMC, we applied the novel instrument to review 98 requests for research (93) and non-research (5) use of DHTs. The 33-item instrument may be valuable to researchers and practitioners in other settings seeking to scale institutional support for DHTs.

联邦和州一级的管理以及地方机构的监督正在迅速改变,以应对数字医疗技术(DHT)使用的加速增长,例如应用程序、可穿戴设备和网站,以实现临床和转化研究。虽然已有研究描述了评估和/或实施单个dht的框架,但据我们所知,没有研究描述如何在学术医疗中心(AMC)实施和支持多个dht。一个由信息技术、机构审查委员会、法律和隐私专家组成的多学科团队确定了33个项目进行评估,作为使用dht进行入职研究的一部分。在一个AMC的一年时间里,我们应用新仪器审查了98个研究(93)和非研究(5)使用dht的请求。这个包含33个项目的工具对于寻求扩大对dht的机构支持的其他环境中的研究人员和从业人员可能很有价值。
{"title":"A Method for Enabling Digital Health Technologies in Clinical and Translational Research at an Academic Medical Center.","authors":"Cindy Chen, Laura R Bradford, Melissa A Epstein, J Travis Gossey, Mohammad N Mansour, Christy M O'Connor, Brian J Tschinkel, Thomas R Campion","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Federal- and state-level governance as well as local institutional oversight are changing rapidly to address the accelerated growth in the usage of digital health technologies (DHT) -such as apps, wearables, and websites-to enable clinical and translational research. While studies have described frameworks for assessing and/or implementing individual DHTs, to our knowledge there are none describing how to implement and support multiple DHTs at an academic medical center (AMC). A multi-disciplinary team including information technology, institutional review board, legal, and privacy professionals identified 33 items to evaluate as part of onboarding studies using DHTs. In a one-year period at one AMC, we applied the novel instrument to review 98 requests for research (93) and non-research (5) use of DHTs. The 33-item instrument may be valuable to researchers and practitioners in other settings seeking to scale institutional support for DHTs.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"134-140"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150704/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276833","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automatic Summarization of Doctor-Patient Encounter Dialogues Using Large Language Model through Prompt Tuning. 基于提示调优的大型语言模型的医患对话自动摘要。
Mengxian Lyu, Cheng Peng, Xiaohan Li, Patrick Balian, Jiang Bian, Yonghui Wu

Automatic text summarization (ATS) is an emerging technology to assist clinicians in providing continuous and coordinated care. This study presents an approach to summarize doctor-patient dialogues using generative large language models (LLMs). We developed prompt-tuning algorithms to instruct generative LLMs to summarize clinical text. We examined the prompt-tuning strategies, the size of soft prompts, and the few-short learning ability of GatorTronGPT, a generative clinical LLM developed using 277 billion clinical and general English words with up to 20 billion parameters. We compared GatorTronGPT with a previous solution based on fine-tuning of a widely used T5 model, using a clinical benchmark dataset MTS-DIALOG. The experimental results show that the GatorTronGPT-20B model achieved the best performance on all evaluation metrics. The proposed solution has a low computing cost as the LLM parameters are not updated during prompt-tuning. This study demonstrates the efficiency of generative clinical LLMs for clinical ATS through prompt tuning.

自动文本摘要(ATS)是一项新兴的技术,以协助临床医生提供持续和协调的护理。本研究提出了一种使用生成式大语言模型(llm)来总结医患对话的方法。我们开发了提示调整算法来指导生成法学硕士总结临床文本。我们研究了GatorTronGPT的提示调整策略、软提示的大小和短时间学习能力。GatorTronGPT是一个生成式临床法学硕士,使用2770亿个临床和通用英语单词和多达200亿个参数开发而成。我们使用临床基准数据集MTS-DIALOG,将GatorTronGPT与先前基于广泛使用的T5模型微调的解决方案进行了比较。实验结果表明,GatorTronGPT-20B模型在所有评估指标上都取得了最好的性能。该方法在快速调优过程中不需要更新LLM参数,计算成本较低。本研究通过快速调整证明了生成型临床llm对临床ATS的有效性。
{"title":"Automatic Summarization of Doctor-Patient Encounter Dialogues Using Large Language Model through Prompt Tuning.","authors":"Mengxian Lyu, Cheng Peng, Xiaohan Li, Patrick Balian, Jiang Bian, Yonghui Wu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Automatic text summarization (ATS) is an emerging technology to assist clinicians in providing continuous and coordinated care. This study presents an approach to summarize doctor-patient dialogues using generative large language models (LLMs). We developed prompt-tuning algorithms to instruct generative LLMs to summarize clinical text. We examined the prompt-tuning strategies, the size of soft prompts, and the few-short learning ability of GatorTronGPT, a generative clinical LLM developed using 277 billion clinical and general English words with up to 20 billion parameters. We compared GatorTronGPT with a previous solution based on fine-tuning of a widely used T5 model, using a clinical benchmark dataset MTS-DIALOG. The experimental results show that the GatorTronGPT-20B model achieved the best performance on all evaluation metrics. The proposed solution has a low computing cost as the LLM parameters are not updated during prompt-tuning. This study demonstrates the efficiency of generative clinical LLMs for clinical ATS through prompt tuning.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"342-349"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150732/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Determining the Importance of Clinical Modalities for NeuroDegenerative Disorders and Risk of Patient Injury Using Machine Learning and Survival Analysis. 使用机器学习和生存分析确定神经退行性疾病和患者损伤风险的临床模式的重要性。
Kazi Noshin, Mary Regina Boland, Bojian Hou, Weiqing He, Victoria Lu, Carol Manning, Li Shen, Aidong Zhang

Falls among the elderly and especially those with NeuroDegenerative Disorders (NDD) reduces life expectancy. The purpose of this study is to explore the role of Machine Learning on Electronic Health Records (EHR) data for time-to-event survival analysis prediction of injuries, and role of sensitive attributes, e.g., Race, Ethnicity, Sex, in these models. We used multiple survival analysis methods on a cohort of 29,045 patients 65 years and older treated at PennMedicine for either NDD, Mild Cognitive Impairment (MCI), or another disease. We compare the algorithms and explore the role of multiple modalities on improving prediction of injuries among NDD patients, specifically medications and laboratory tests. Overall, we found that medication features resulted in either increased Hazard Ratios (HR) or reduced HR depending on the NDD type. We found that being of Black race significantly increased the risk offall/injury in the models that included only medication and sensitive attribute features. The combined model that used both modalities (medications and laboratory information) removed this relationship between being of Black race and increases in fall/injury. Therefore, we found that combining modalities in these survival models in the prediction offall/injury risk among NDD and MCI individuals results in findings that are robust to different Racial and Ethnic groups with no biases apparent in our final combined modality results. Furthermore, combining modalities (both medications and laboratory values) improved the survival analysis performance across multiple survival analysis methods, when compared using the C-index.

老年人尤其是神经退行性疾病(NDD)患者的跌倒会降低预期寿命。本研究的目的是探索机器学习在电子健康记录(EHR)数据中的作用,以进行损伤的时间到事件生存分析预测,以及敏感属性(如种族、民族、性别)在这些模型中的作用。我们对29,045名65岁及以上在宾夕法尼亚大学医学中心接受NDD、轻度认知障碍(MCI)或其他疾病治疗的患者进行了多种生存分析方法。我们比较了算法,并探讨了多种模式在改善NDD患者损伤预测方面的作用,特别是药物和实验室检查。总体而言,我们发现药物特征导致NDD类型的风险比(HR)增加或降低。我们发现,在只包含药物和敏感属性特征的模型中,黑人显著增加了跌倒/受伤的风险。使用两种模式(药物和实验室信息)的组合模型消除了黑人种族与跌倒/受伤增加之间的关系。因此,我们发现,在预测NDD和MCI个体的跌倒/受伤风险时,这些生存模型中的组合模式导致的结果对不同种族和民族群体都是稳健的,在我们的最终组合模式结果中没有明显的偏差。此外,当使用c指数进行比较时,组合模式(药物和实验室值)提高了多种生存分析方法的生存分析性能。
{"title":"Determining the Importance of Clinical Modalities for NeuroDegenerative Disorders and Risk of Patient Injury Using Machine Learning and Survival Analysis.","authors":"Kazi Noshin, Mary Regina Boland, Bojian Hou, Weiqing He, Victoria Lu, Carol Manning, Li Shen, Aidong Zhang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Falls among the elderly and especially those with NeuroDegenerative Disorders (NDD) reduces life expectancy. The purpose of this study is to explore the role of Machine Learning on Electronic Health Records (EHR) data for time-to-event survival analysis prediction of injuries, and role of sensitive attributes, e.g., Race, Ethnicity, Sex, in these models. We used multiple survival analysis methods on a cohort of 29,045 patients 65 years and older treated at PennMedicine for either NDD, Mild Cognitive Impairment (MCI), or another disease. We compare the algorithms and explore the role of multiple modalities on improving prediction of injuries among NDD patients, specifically medications and laboratory tests. Overall, we found that medication features resulted in either increased Hazard Ratios (HR) or reduced HR depending on the NDD type. We found that being of Black race significantly increased the risk offall/injury in the models that included only medication and sensitive attribute features. The combined model that used both modalities (medications and laboratory information) removed this relationship between being of Black race and increases in fall/injury. Therefore, we found that combining modalities in these survival models in the prediction offall/injury risk among NDD and MCI individuals results in findings that are robust to different Racial and Ethnic groups with no biases apparent in our final combined modality results. Furthermore, combining modalities (both medications and laboratory values) improved the survival analysis performance across multiple survival analysis methods, when compared using the C-index.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"385-394"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150751/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276862","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Early Alzheimer's Detection Through Voice Analysis: Harnessing Locally Deployable LLMs via ADetectoLocum, a privacy-preserving diagnostic system. 通过语音分析早期检测阿尔茨海默氏症:利用本地可部署的llm通过ADetectoLocum,一个隐私保护诊断系统。
Genevieve A Mortensen, Rui Zhu

Diagnosing Alzheimer's Disease (AD) early and cost-effectively is crucial. Recent advancements in Large Language Models (LLMs) like ChatGPT have made accurate, affordable AD detection feasible. Yet, HIPAA compliance and the challenge of integrating these models into hospital systems limit their use. Addressing these constraints, we introduce ADetectoLocum, an open-source LLM equipped model designed for AD risk detection within hospital environments. This model evaluates AD risk through spontaneous patient speech, enhancing diagnostic processes without external data exchange. Our approach secures local deployment and significantly surpasses previous models in predictive accuracy for AD detection, especially in early-stage identification. ADetectoLocum therefore offers a reliable solution for AD diagnostics in healthcare institutions.

早期和经济有效地诊断阿尔茨海默病(AD)至关重要。像ChatGPT这样的大型语言模型(llm)的最新进展使得准确、负担得起的AD检测成为可能。然而,HIPAA合规和将这些模型集成到医院系统中的挑战限制了它们的使用。为了解决这些限制,我们引入了ADetectoLocum,这是一个配备法学硕士的开源模型,专为医院环境中的AD风险检测而设计。该模型通过患者自发的言语来评估AD风险,增强了无需外部数据交换的诊断过程。我们的方法确保了本地部署,并且在AD检测的预测准确性方面显著超过了以前的模型,特别是在早期识别方面。因此,ADetectoLocum为医疗机构的AD诊断提供了可靠的解决方案。
{"title":"Early Alzheimer's Detection Through Voice Analysis: Harnessing Locally Deployable LLMs via <i>ADetectoLocum</i>, a privacy-preserving diagnostic system.","authors":"Genevieve A Mortensen, Rui Zhu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Diagnosing Alzheimer's Disease (AD) early and cost-effectively is crucial. Recent advancements in Large Language Models (LLMs) like ChatGPT have made accurate, affordable AD detection feasible. Yet, HIPAA compliance and the challenge of integrating these models into hospital systems limit their use. Addressing these constraints, we introduce ADetectoLocum, an open-source LLM equipped model designed for AD risk detection within hospital environments. This model evaluates AD risk through spontaneous patient speech, enhancing diagnostic processes without external data exchange. Our approach secures local deployment and significantly surpasses previous models in predictive accuracy for AD detection, especially in early-stage identification. ADetectoLocum therefore offers a reliable solution for AD diagnostics in healthcare institutions.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"365-374"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150716/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276865","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explainable AI for Clinical Outcome Prediction: A Survey of Clinician Perceptions and Preferences. 用于临床结果预测的可解释人工智能:临床医生感知和偏好的调查。
Jun Hou, Lucy Lu Wang

Explainable AI (XAI) techniques are necessary to help clinicians make sense of AI predictions and integrate predictions into their decision-making workflow. In this work, we conduct a survey study to understand clinician preference among different XAI techniques when they are used to interpret model predictions over text-based EHR data. We implement four XAI techniques (LIME, Attention-based span highlights, exemplar patient retrieval, and free-text rationales generated by LLMs) on an outcome prediction model that uses ICU admission notes to predict a patient's likelihood of experiencing in-hospital mortality. Using these XAI implementations, we design and conduct a survey study of 32 practicing clinicians, collecting their feedback and preferences on the four techniques. We synthesize our findings into a set of recommendations describing when each of the XAI techniques may be more appropriate, their potential limitations, as well as recommendations for improvement.

可解释的人工智能(XAI)技术对于帮助临床医生理解人工智能预测并将预测整合到他们的决策流程中是必要的。在这项工作中,我们进行了一项调查研究,以了解临床医生在使用不同的XAI技术来解释基于文本的EHR数据的模型预测时的偏好。我们在结果预测模型上实施了四种XAI技术(LIME、基于注意力的广度突出、典型患者检索和llm生成的自由文本基本原理),该模型使用ICU住院记录来预测患者在医院死亡的可能性。使用这些XAI实现,我们设计并对32名执业临床医生进行了调查研究,收集他们对四种技术的反馈和偏好。我们将我们的发现综合成一组建议,描述每种XAI技术何时可能更合适、它们的潜在限制以及改进建议。
{"title":"Explainable AI for Clinical Outcome Prediction: A Survey of Clinician Perceptions and Preferences.","authors":"Jun Hou, Lucy Lu Wang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Explainable AI (XAI) techniques are necessary to help clinicians make sense of AI predictions and integrate predictions into their decision-making workflow. In this work, we conduct a survey study to understand clinician preference among different XAI techniques when they are used to interpret model predictions over text-based EHR data. We implement four XAI techniques (LIME, Attention-based span highlights, exemplar patient retrieval, and free-text rationales generated by LLMs) on an outcome prediction model that uses ICU admission notes to predict a patient's likelihood of experiencing in-hospital mortality. Using these XAI implementations, we design and conduct a survey study of 32 practicing clinicians, collecting their feedback and preferences on the four techniques. We synthesize our findings into a set of recommendations describing when each of the XAI techniques may be more appropriate, their potential limitations, as well as recommendations for improvement.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"215-224"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150750/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276871","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Outpatient Portal Use and Blood Pressure Management during Pregnancy. 门诊门静脉使用与妊娠期血压管理。
Athena Stamos, Naleef Fareed

We investigated the association between systole and diastole, and outpatient portal use during pregnancy. We used electronic and administrative data from our academic medical center. We categorized patients into two groups: (<140 mm Hg; <90 mm Hg), and out-of-range (≥140 mm Hg, ≥ 90 mm Hg). Random effects linear regression models examined the association between mean trimester blood pressure (BP) levels and portal use, adjusting for covariates. As portal use increased, both systole and diastole levels decreased for the out-of-range group. These differences were statistically significant for patients who were initially out-of-range. For the in-range group, systole and diastole levels were stable as portal use increased. Results provide evidence to support a relationship between outpatient portal use and BP outcomes during pregnancy. More research is needed to expand on our findings, especially those focused on the implementation and design of outpatient portals for pregnancy.

我们调查了收缩期和舒张期之间的关系,以及怀孕期间门诊门静脉的使用。我们使用了学术医疗中心的电子和管理数据。我们将患者分为两组:(
{"title":"Outpatient Portal Use and Blood Pressure Management during Pregnancy.","authors":"Athena Stamos, Naleef Fareed","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>We investigated the association between systole and diastole, and outpatient portal use during pregnancy. We used electronic and administrative data from our academic medical center. We categorized patients into two groups: (<140 mm Hg; <90 mm Hg), and out-of-range (≥140 mm Hg, ≥ 90 mm Hg). Random effects linear regression models examined the association between mean trimester blood pressure (BP) levels and portal use, adjusting for covariates. As portal use increased, both systole and diastole levels decreased for the out-of-range group. These differences were statistically significant for patients who were initially out-of-range. For the in-range group, systole and diastole levels were stable as portal use increased. Results provide evidence to support a relationship between outpatient portal use and BP outcomes during pregnancy. More research is needed to expand on our findings, especially those focused on the implementation and design of outpatient portals for pregnancy.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"537-545"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150748/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276877","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Safeguarding Privacy in Genome Research: A Comprehensive Framework for Authors. 基因组研究中的隐私保护:作者的综合框架。
Maryam Ghasemian, Lynette Hammond Gerido, Erman Ayday

As genomic research continues to advance, sharing of genomic data and research outcomes has become increasingly important for fostering collaboration and accelerating scientific discovery. However, such data sharing must be balanced with the need to protect the privacy of individuals whose genetic information is being utilized. This paper presents a bidirectional framework for evaluating privacy risks associated with data shared (both in terms of summary statistics and research datasets) in genomic research papers, particularly focusing on re-identification risks such as membership inference attacks (MIA). The framework consists of a structured workflow that begins with a questionnaire designed to capture researchers' (authors') self-reported data sharing practices and privacy protection measures. Responses are used to calculate the risk of re-identification for their study (paper) when compared with the National Institutes of Health (NIH) genomic data sharing policy. Any gaps in compliance help us to identify potential vulnerabilities and encourage the researchers to enhance their privacy measures before submitting their research for publication. The paper also demonstrates the application of this framework, using published genomic research as case study scenarios to emphasize the importance of implementing bidirectional frameworks to support trustworthy open science and genomic data sharing practices.

随着基因组研究的不断推进,共享基因组数据和研究成果对于促进合作和加速科学发现变得越来越重要。然而,这种数据共享必须与保护正在使用其遗传信息的个人隐私的需要相平衡。本文提出了一个双向框架,用于评估基因组研究论文中与共享数据相关的隐私风险(包括汇总统计和研究数据集),特别关注重新识别风险,如成员推理攻击(MIA)。该框架由一个结构化的工作流组成,该工作流从一个问卷开始,旨在捕获研究人员(作者)自我报告的数据共享实践和隐私保护措施。当与美国国立卫生研究院(NIH)基因组数据共享政策进行比较时,应答被用来计算他们的研究(论文)被重新识别的风险。合规方面的任何漏洞都有助于我们识别潜在的漏洞,并鼓励研究人员在提交研究报告发表之前加强他们的隐私措施。本文还演示了该框架的应用,使用已发表的基因组研究作为案例研究场景,强调实施双向框架以支持可信赖的开放科学和基因组数据共享实践的重要性。
{"title":"Safeguarding Privacy in Genome Research: A Comprehensive Framework for Authors.","authors":"Maryam Ghasemian, Lynette Hammond Gerido, Erman Ayday","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>As genomic research continues to advance, sharing of genomic data and research outcomes has become increasingly important for fostering collaboration and accelerating scientific discovery. However, such data sharing must be balanced with the need to protect the privacy of individuals whose genetic information is being utilized. This paper presents a bidirectional framework for evaluating privacy risks associated with data shared (both in terms of summary statistics and research datasets) in genomic research papers, particularly focusing on re-identification risks such as membership inference attacks (MIA). The framework consists of a structured workflow that begins with a questionnaire designed to capture researchers' (authors') self-reported data sharing practices and privacy protection measures. Responses are used to calculate the risk of re-identification for their study (paper) when compared with the National Institutes of Health (NIH) genomic data sharing policy. Any gaps in compliance help us to identify potential vulnerabilities and encourage the researchers to enhance their privacy measures before submitting their research for publication. The paper also demonstrates the application of this framework, using published genomic research as case study scenarios to emphasize the importance of implementing bidirectional frameworks to support trustworthy open science and genomic data sharing practices.</p>","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"177-186"},"PeriodicalIF":0.0,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150713/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144276883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1