首页 > 最新文献

AMIA ... Annual Symposium proceedings. AMIA Symposium最新文献

英文 中文
Analyzing Dementia Caregivers' Experiences on Twitter: A Term-Weighted Topic Modeling Approach. 分析痴呆症护理者在Twitter上的经历:一种术语加权主题建模方法。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Yanbo Feng, Bojian Hou, Ari Klein, Karen O'Connor, Jiong Chen, Andrées Mondragóon, Shu Yang, Graciela Gonzalez-Hernandez, Li Shen

Dementia profoundly impacts patients and their families, making it essential to understand the experiences and concerns offamily caregivers for enhanced support and care. This study introduces a novel approach to analyzing tweets from individuals whose family members suffer from dementia. We preprocessed our collected Twitter (now X) data using advanced natural language processing techniques and enhanced conventional topic model-Gibbs Sampling Dirichlet Multinomial Mixture Model (GSDMM)-with term-weighting strategies to improve topic clarity. This enhanced approach enabled the identification of key topics among dementia-affected families, offering semantically rich and contextually coherent topics, demonstrating that our method outperforms the state-of-the-art BERTopic model in clarity and consistency. Leveraging ChatGPT 4 alongside two human experts, we uncovered the multifaceted challenges faced by family caregivers. This work aims to provide healthcare professionals, researchers, and support organizations with a valuable tool to better understand and address the needs offamily caregivers.

痴呆症对患者及其家庭产生深远影响,因此了解家庭照护者的经历和关切,以加强支持和照护至关重要。这项研究引入了一种新的方法来分析家庭成员患有痴呆症的个人的推文。我们使用先进的自然语言处理技术和增强的传统主题模型(gibbs Sampling Dirichlet多项式混合模型(GSDMM))预处理收集到的Twitter(现在是X)数据,并使用术语加权策略来提高主题清晰度。这种增强的方法能够识别痴呆症影响家庭中的关键主题,提供语义丰富和上下文连贯的主题,表明我们的方法在清晰度和一致性方面优于最先进的BERTopic模型。利用ChatGPT 4和两位人类专家,我们发现了家庭护理人员面临的多方面挑战。这项工作旨在为医疗保健专业人员、研究人员和支持组织提供一个有价值的工具,以更好地了解和解决家庭照顾者的需求。
{"title":"Analyzing Dementia Caregivers' Experiences on Twitter: A Term-Weighted Topic Modeling Approach.","authors":"Yanbo Feng, Bojian Hou, Ari Klein, Karen O'Connor, Jiong Chen, Andrées Mondragóon, Shu Yang, Graciela Gonzalez-Hernandez, Li Shen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Dementia profoundly impacts patients and their families, making it essential to understand the experiences and concerns offamily caregivers for enhanced support and care. This study introduces a novel approach to analyzing tweets from individuals whose family members suffer from dementia. We preprocessed our collected Twitter (now X) data using advanced natural language processing techniques and enhanced conventional topic model-Gibbs Sampling Dirichlet Multinomial Mixture Model (GSDMM)-with term-weighting strategies to improve topic clarity. This enhanced approach enabled the identification of key topics among dementia-affected families, offering semantically rich and contextually coherent topics, demonstrating that our method outperforms the state-of-the-art BERTopic model in clarity and consistency. Leveraging ChatGPT 4 alongside two human experts, we uncovered the multifaceted challenges faced by family caregivers. This work aims to provide healthcare professionals, researchers, and support organizations with a valuable tool to better understand and address the needs offamily caregivers.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"407-416"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Optimizing LLM Use in Healthcare: Identifying Patient Questions in MyChart Messages. 优化LLM在医疗保健中的应用:在MyChart消息中识别患者问题。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Akhila Chekuri, Armaan S Johal, Matthew R Allen, John W Ayers, Michael Hogarth, Emilia Farcas

The volume of patient-provider messages is on the rise, and Large Language Models (LLMs) can potentially streamline the clinical messaging process, but their success hinges on triaging messages they can optimally address. In this study, we analyzed Electronic Health Records with over 4 million messages exchanged between patients and providers to characterize the utility of using LLMs for messages containing knowledge questions. We implemented a rule-based Syntactic Question Detector as a triage tool, and we evaluated it on 500 messages. The interrater reliability metrics and comparison with LLMs show the difficulty of detecting questions due to the informal text and implicit requests. Our results show that 25% of MyChart messages with questions do not have a response from the clinical team. This paper provides insights into the challenges of real-world data, highlights the importance and non-triviality of detecting questions, and suggests a pipeline for LLM use in healthcare.

患者-提供者信息的数量正在增加,大型语言模型(llm)可以潜在地简化临床信息传递过程,但它们的成功取决于它们能够最佳地处理的分类信息。在本研究中,我们分析了患者和提供者之间交换的超过400万条消息的电子健康记录,以表征使用llm处理包含知识问题的消息的效用。我们实现了一个基于规则的句法问题检测器作为分类工具,并对500条消息进行了评估。解释器可靠性指标和与llm的比较表明,由于非正式文本和隐含请求,问题检测困难。我们的结果显示,有25%的MyChart问题信息没有得到临床团队的回应。本文提供了对现实世界数据挑战的见解,强调了检测问题的重要性和非琐细性,并建议在医疗保健中使用法学硕士的管道。
{"title":"Towards Optimizing LLM Use in Healthcare: Identifying Patient Questions in MyChart Messages.","authors":"Akhila Chekuri, Armaan S Johal, Matthew R Allen, John W Ayers, Michael Hogarth, Emilia Farcas","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The volume of patient-provider messages is on the rise, and Large Language Models (LLMs) can potentially streamline the clinical messaging process, but their success hinges on triaging messages they can optimally address. In this study, we analyzed Electronic Health Records with over 4 million messages exchanged between patients and providers to characterize the utility of using LLMs for messages containing knowledge questions. We implemented a rule-based Syntactic Question Detector as a triage tool, and we evaluated it on 500 messages. The interrater reliability metrics and comparison with LLMs show the difficulty of detecting questions due to the informal text and implicit requests. Our results show that 25% of MyChart messages with questions do not have a response from the clinical team. This paper provides insights into the challenges of real-world data, highlights the importance and non-triviality of detecting questions, and suggests a pipeline for LLM use in healthcare.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"232-241"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099336/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144824","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Structured Knowledge Base Enhances Effective Use of Large Language Models for Metadata Curation. 结构化知识库增强了元数据管理中大型语言模型的有效使用。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Sowmya S Sundaram, Benjamin Solomon, Avani Khatri, Anisha Laumas, Purvesh Khatri, Mark A Musen

Metadata play a crucial role in ensuring the findability, accessibility, interoperability, and reusability of datasets. This paper investigates the potential of large language models (LLMs), specifically GPT-4, to improve adherence to metadata standards in existing datasets. We conducted experiments on 200 random data records describing human samples relating to lung cancer from the NCBI BioSample repository, evaluating GPT-4's ability to suggest edits for adherence to metadata standards. We computed the adherence accuracy of field name-field value pairs through a peer review process, and we observed a marginal average improvement in adherence to the standard data dictionary from 79% to 80% when using GPT-4. We then prompted GPT-4 with domain information in the form of the textual descriptions of CEDAR metadata templates and recorded a statistically significant improvement to 97% from 79% (p<0.01). These results indicate that LLMs show promise for use in automated metadata curation when integrated with a structured knowledge base, though they may struggle when unaided.

元数据在确保数据集的可查找性、可访问性、互操作性和可重用性方面起着至关重要的作用。本文研究了大型语言模型(llm)的潜力,特别是GPT-4,以提高对现有数据集中元数据标准的遵守。我们对来自NCBI生物样本库的200个随机数据记录进行了实验,这些数据记录描述了与肺癌相关的人类样本,评估了GPT-4根据元数据标准建议编辑的能力。我们通过同行评审过程计算了字段名称-字段值对的一致性准确性,我们观察到使用GPT-4时,对标准数据字典的一致性从79%提高到80%。然后,我们用CEDAR元数据模板文本描述形式的域信息提示GPT-4,并记录了统计上显着的改进,从79%提高到97% (p
{"title":"Structured Knowledge Base Enhances Effective Use of Large Language Models for Metadata Curation.","authors":"Sowmya S Sundaram, Benjamin Solomon, Avani Khatri, Anisha Laumas, Purvesh Khatri, Mark A Musen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p><i>Metadata play a crucial role in ensuring the findability, accessibility, interoperability, and reusability of datasets. This paper investigates the potential of large language models (LLMs), specifically GPT-4, to improve adherence to metadata standards in existing datasets. We conducted experiments on 200 random data records describing human samples relating to lung cancer from the NCBI BioSample repository, evaluating GPT-4's ability to suggest edits for adherence to metadata standards. We computed the adherence accuracy of</i> field name-field value <i>pairs through a peer review process, and we observed a marginal average improvement in adherence to the standard data dictionary from 79% to 80% when using GPT-4. We then prompted GPT-4 with domain information in the form of the textual descriptions of CEDAR metadata templates and recorded a statistically significant improvement to 97% from 79% (p<0.01). These results indicate that LLMs show promise for use in automated metadata curation when integrated with a structured knowledge base, though they may struggle when unaided.</i></p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1050-1058"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099408/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144804","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating the Performance of Large Language Models for Named Entity Recognition in Ophthalmology Clinical Free-Text Notes. 评估大型语言模型在眼科临床自由文本笔记中命名实体识别的性能。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Iyad Majid, Vaibhav Mishra, Rohith Ravindranath, Sophia Y Wang

This study compared large language models (LLMs) and Bidirectional Encoder Representations from Transformers (BERT) models in identifying medication names, routes, and frequencies from publicly available free-text ophthalmology progress notes of 480 patients. 5,520 lines of annotated text were divided into train (N=3,864), validation (N=1,104), and test sets (N=552). We evaluated ChatGPT-3.5, ChatGPT-4, PaLM 2, and Gemini to identify these medication entities. We fine-tuned BERT, BioBERT, ClinicalBERT, DistilBERT, and RoBERTa for the same task using the training set. On the test set, GPT-4 achieved the best performance (macro-averaged F1 0.962). Among the BERT models, BioBERT achieved the best performance (macro-averaged F1 0.875). Modern LLMs outperformed BERT models even in the highly domain-specific task of identifying ophthalmic medication information from progress notes, showcasing the potential of LLMs for medical named entity recognition to enhance patient care.

本研究比较了大型语言模型(llm)和变形金刚的双向编码器表示(BERT)模型在从480名患者的公开自由文本眼科进展记录中识别药物名称、路线和频率方面的效果。5520行注释文本被分为训练集(N= 3864)、验证集(N= 1104)和测试集(N=552)。我们评估了ChatGPT-3.5、ChatGPT-4、PaLM 2和Gemini来识别这些药物实体。我们使用训练集对BERT、BioBERT、ClinicalBERT、DistilBERT和RoBERTa进行了微调,以完成相同的任务。在测试集上,GPT-4的性能最佳(宏观平均F1为0.962)。在BERT模型中,BioBERT模型表现最佳(宏观平均F1为0.875)。现代llm甚至在从进度记录中识别眼科药物信息的高度特定领域任务中也优于BERT模型,这显示了llm在医学命名实体识别方面的潜力,以增强患者护理。
{"title":"Evaluating the Performance of Large Language Models for Named Entity Recognition in Ophthalmology Clinical Free-Text Notes.","authors":"Iyad Majid, Vaibhav Mishra, Rohith Ravindranath, Sophia Y Wang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study compared large language models (LLMs) and Bidirectional Encoder Representations from Transformers (BERT) models in identifying medication names, routes, and frequencies from publicly available free-text ophthalmology progress notes of 480 patients. 5,520 lines of annotated text were divided into train (N=3,864), validation (N=1,104), and test sets (N=552). We evaluated ChatGPT-3.5, ChatGPT-4, PaLM 2, and Gemini to identify these medication entities. We fine-tuned BERT, BioBERT, ClinicalBERT, DistilBERT, and RoBERTa for the same task using the training set. On the test set, GPT-4 achieved the best performance (macro-averaged F1 0.962). Among the BERT models, BioBERT achieved the best performance (macro-averaged F1 0.875). Modern LLMs outperformed BERT models even in the highly domain-specific task of identifying ophthalmic medication information from progress notes, showcasing the potential of LLMs for medical named entity recognition to enhance patient care.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"778-787"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099357/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Leveraging Cluster Causal Diagrams for Determining Causal Effects in Medicine. 利用聚类因果图确定医学中的因果效应。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Tara V Anand, George Hripcsak

Causal inference, or the task of estimating the causal effect of an exposure or interventional variable on an outcome from an observational dataset, requires precise and rigorous methods, based on assumptions about the system under study. Such assumptions can be articulated as a causal diagram, however use of this technique in medicine is uncommon due to challenges in causal diagram construction in high-dimensional settings. Recent introduction of cluster causal diagrams or C-DAGs promise to ease the task of diagram construction by allowing for the representation of some unknown or partially defined relationships. We evaluate the practical application of C-DAGs in simulated medical contexts. We estimate causal effects under varying sets of assumptions, determined by both causal diagrams and C-DAGs and compare our results. Our findings show empirically similar results, with little discrepancy between causal effect sizes or variance across experimental runs, although estimation and efficiency challenges remain to be explored.

因果推断,或估计暴露或干预变量对观测数据集结果的因果效应的任务,需要基于对所研究系统的假设,采用精确和严格的方法。这样的假设可以被表述为因果图,然而,由于在高维环境中构建因果图的挑战,在医学中使用这种技术并不常见。最近引入的聚类因果图(c - dag)承诺通过允许表示一些未知或部分定义的关系来简化图构建的任务。我们评估了c - dag在模拟医学环境中的实际应用。我们在不同的假设下估计因果效应,由因果图和c - dag决定,并比较我们的结果。我们的研究结果显示了经验上相似的结果,在实验运行中因果效应大小或方差之间几乎没有差异,尽管估计和效率挑战仍有待探索。
{"title":"Leveraging Cluster Causal Diagrams for Determining Causal Effects in Medicine.","authors":"Tara V Anand, George Hripcsak","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Causal inference, or the task of estimating the causal effect of an exposure or interventional variable on an outcome from an observational dataset, requires precise and rigorous methods, based on assumptions about the system under study. Such assumptions can be articulated as a causal diagram, however use of this technique in medicine is uncommon due to challenges in causal diagram construction in high-dimensional settings. Recent introduction of cluster causal diagrams or C-DAGs promise to ease the task of diagram construction by allowing for the representation of some unknown or partially defined relationships. We evaluate the practical application of C-DAGs in simulated medical contexts. We estimate causal effects under varying sets of assumptions, determined by both causal diagrams and C-DAGs and compare our results. Our findings show empirically similar results, with little discrepancy between causal effect sizes or variance across experimental runs, although estimation and efficiency challenges remain to be explored.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"134-141"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099438/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144581","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Modeling Precision Feedback Knowledge for Healthcare Professional Learning and Quality Improvement. 为医疗保健专业人员学习和质量改进建模精确反馈知识。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Zach Landis-Lewis, Yidan Cao, Hana Chung, Peter Boisvert, Anjana Deep Renji, Patrick Galante, Ayshwarya Jagadeesan, Farid Seifi, Allison Janda, Nirav Shah, Andrew Krumm, Allen Flynn

Healthcare providers learn continuously, but better support for provider learning is needed as new biomedical knowledge is produced at an increasing rate alongside widespread use of EHR data for clinical performance measurement. Precision feedback is an approach to improve support for provider learning by prioritizing coaching and appreciation messages based on each message's motivational potential for a specific recipient. We developed a Precision Feedback Knowledge Base as an open resource to support precision feedback systems, containing knowledge models that hold potential as key infrastructure for learning health systems. We describe the design and development of the Precision Feedback Knowledge Base, as well as its key components, including quality measures, feedback message templates, causal pathway models, signal detectors, and prioritization algorithms. Presently, the knowledge base is implemented in a national-scale quality improvement consortium for anesthesia care, to enhance provider feedback email messages.

医疗保健提供者不断学习,但随着新的生物医学知识以越来越快的速度产生,以及用于临床绩效衡量的电子病历数据的广泛使用,需要更好地支持提供者学习。精确反馈是一种改进对提供者学习的支持的方法,通过根据每个信息对特定接收者的激励潜力来优先处理指导和赞赏信息。我们开发了一个精确反馈知识库,作为支持精确反馈系统的开放资源,其中包含的知识模型具有作为学习型卫生系统关键基础设施的潜力。我们描述了精确反馈知识库的设计和开发,以及它的关键组件,包括质量度量、反馈消息模板、因果路径模型、信号检测器和优先级算法。目前,该知识库在全国范围内的麻醉护理质量改进联盟中实施,以增强提供者反馈的电子邮件信息。
{"title":"Modeling Precision Feedback Knowledge for Healthcare Professional Learning and Quality Improvement.","authors":"Zach Landis-Lewis, Yidan Cao, Hana Chung, Peter Boisvert, Anjana Deep Renji, Patrick Galante, Ayshwarya Jagadeesan, Farid Seifi, Allison Janda, Nirav Shah, Andrew Krumm, Allen Flynn","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Healthcare providers learn continuously, but better support for provider learning is needed as new biomedical knowledge is produced at an increasing rate alongside widespread use of EHR data for clinical performance measurement. Precision feedback is an approach to improve support for provider learning by prioritizing coaching and appreciation messages based on each message's motivational potential for a specific recipient. We developed a Precision Feedback Knowledge Base as an open resource to support precision feedback systems, containing knowledge models that hold potential as key infrastructure for learning health systems. We describe the design and development of the Precision Feedback Knowledge Base, as well as its key components, including quality measures, feedback message templates, causal pathway models, signal detectors, and prioritization algorithms. Presently, the knowledge base is implemented in a national-scale quality improvement consortium for anesthesia care, to enhance provider feedback email messages.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"628-637"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099443/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144613","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Federated Diabetes Prediction in Canadian Adults Using Real-world Cross-Province Primary Care Data. 使用真实世界跨省初级保健数据的加拿大成人联合糖尿病预测
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Guojun Tang, Jason E Black, Tyler S Williamson, Steve H Drew

Integrating Electronic Health Records (EHR) and the application of machine learning present opportunities for enhancing the accuracy and accessibility of data-driven diabetes prediction. In particular, developing data-driven machine learning models can provide early identification of patients with high risk for diabetes, potentially leading to more effective therapeutic strategies and reduced healthcare costs. However, regulation restrictions create barriers to developing centralized predictive models. This paper addresses the challenges by introducing a federated learning approach, which amalgamates predictive models without centralized data storage and processing, thus avoiding privacy issues. This marks the first application of federated learning to predict diabetes using real clinical datasets in Canada extracted from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) without cross-province patient data sharing. We address class-imbalance issues through downsampling techniques and compare federated learning performance against province-based and centralized models. Experimental results show that the federated MLP model presents a similar or higher performance compared to the model trained with the centralized approach. However, the federated logistic regression model showed inferior performance compared to its centralized peer.

整合电子健康记录(EHR)和机器学习的应用为提高数据驱动的糖尿病预测的准确性和可及性提供了机会。特别是,开发数据驱动的机器学习模型可以提供糖尿病高风险患者的早期识别,可能导致更有效的治疗策略并降低医疗保健成本。然而,监管限制为开发集中式预测模型创造了障碍。本文通过引入联邦学习方法来解决这些挑战,该方法合并了预测模型,而无需集中数据存储和处理,从而避免了隐私问题。这标志着联邦学习首次应用于预测糖尿病,使用加拿大初级保健哨点监测网络(cpcsn)中提取的真实临床数据集,而无需跨省患者数据共享。我们通过降采样技术解决了类不平衡问题,并将联邦学习性能与基于省份和集中式模型进行了比较。实验结果表明,与集中式方法训练的模型相比,联邦MLP模型具有相似或更高的性能。然而,与集中式逻辑回归模型相比,联邦逻辑回归模型表现出较差的性能。
{"title":"Federated Diabetes Prediction in Canadian Adults Using Real-world Cross-Province Primary Care Data.","authors":"Guojun Tang, Jason E Black, Tyler S Williamson, Steve H Drew","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Integrating Electronic Health Records (EHR) and the application of machine learning present opportunities for enhancing the accuracy and accessibility of data-driven diabetes prediction. In particular, developing data-driven machine learning models can provide early identification of patients with high risk for diabetes, potentially leading to more effective therapeutic strategies and reduced healthcare costs. However, regulation restrictions create barriers to developing centralized predictive models. This paper addresses the challenges by introducing a federated learning approach, which amalgamates predictive models without centralized data storage and processing, thus avoiding privacy issues. This marks the first application of federated learning to predict diabetes using real clinical datasets in Canada extracted from the Canadian Primary Care Sentinel Surveillance Network (CPCSSN) without cross-province patient data sharing. We address class-imbalance issues through downsampling techniques and compare federated learning performance against province-based and centralized models. Experimental results show that the federated MLP model presents a similar or higher performance compared to the model trained with the centralized approach. However, the federated logistic regression model showed inferior performance compared to its centralized peer.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1099-1108"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099350/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying acute kidney injury subtypes based on serum electrolyte data in ICU via K-medoids clustering. 基于ICU患者血清电解质数据的K-medoids聚类识别急性肾损伤亚型。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Wentie Liu, Tongyue Shi, Haowei Xu, Huiying Zhao, Jianguo Hao, Guilan Kong

This study proposes to use the K-medoids clustering method to identify subtypes of Intensive Care Unit (ICU)-acquired acute kidney injury (AKI) patients based on serum electrolyte data. Three distinct AKI subtypes with different serum electrolyte characteristics were identified by clustering analysis. Further, descriptive analysis was employed to characterize in-hospital mortality and renal replacement therapy, diuretic and vasopressor usage in the three subtypes, and Chi-square tests were conducted to check the differences of prognosis and treatments among the identified subtypes. This study enables the subclassification of AKI patients in the ICU, facilitating ICU physicians to make timely clinical decisions about AKI, and ultimately may contribute to patient outcome improvement.

本研究提出基于血清电解质数据,采用K-medoids聚类方法识别重症监护病房(ICU)获得性急性肾损伤(AKI)患者的亚型。聚类分析发现3种不同的AKI亚型具有不同的血清电解质特征。进一步,采用描述性分析对三种亚型患者的住院死亡率和肾脏替代治疗、利尿剂和血管加压剂的使用情况进行表征,并采用卡方检验检验所确定亚型患者的预后和治疗差异。本研究实现了AKI患者在ICU的亚分类,有助于ICU医生对AKI做出及时的临床决策,最终可能有助于患者预后的改善。
{"title":"Identifying acute kidney injury subtypes based on serum electrolyte data in ICU via <i>K</i>-medoids clustering.","authors":"Wentie Liu, Tongyue Shi, Haowei Xu, Huiying Zhao, Jianguo Hao, Guilan Kong","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study proposes to use the K-medoids clustering method to identify subtypes of Intensive Care Unit (ICU)-acquired acute kidney injury (AKI) patients based on serum electrolyte data. Three distinct AKI subtypes with different serum electrolyte characteristics were identified by clustering analysis. Further, descriptive analysis was employed to characterize in-hospital mortality and renal replacement therapy, diuretic and vasopressor usage in the three subtypes, and Chi-square tests were conducted to check the differences of prognosis and treatments among the identified subtypes. This study enables the subclassification of AKI patients in the ICU, facilitating ICU physicians to make timely clinical decisions about AKI, and ultimately may contribute to patient outcome improvement.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"733-737"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099402/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144651","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Impact of Automated Transfer of Semi-Automated Segmentation and Structured Report Rule Requirements on Cardiac MRI Report Quality, Standardization, and Efficiency. 半自动分割和结构化报告规则要求的自动传递对心脏MRI报告质量、标准化和效率的影响。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Diane Rizkallah, Neil L Greenberg, Rishabh Khurana, Vadivelan Palanisamy, Ben Alencherry, Carl Ammoury, Yezan Salam, Lisa Lamovsky, Haitham Fares, Robert Geschke, Richard Grimm, Christopher Nguyen, David Chen, Deborah H Kwon

Clinical reporting of cardiac magnetic resonance (CMR) imaging exams is commonly performed with a dictation approach which requires great care to capture both consistent and comprehensive data. We sought to transform the reporting process by utilizing a structured report framework for reporting standardization, by incorporating automated transfer of data semi-automated segmentation tools for efficiency, and rule-based reporting requirements to improve quality and standardization. Interfaces between the applications used to schedule and protocol exams and to analyze the acquired images were created to bring the source information directly into the structured reporting environment. The physicians reporting CMR were surveyed to determine satisfaction and improved efficiency with the new process through self-reported reporting time. Quality improvement was assessed by examining the consistency of reported parameters with the inclusion of rule-based requirements. The designed structured reporting process with automated measurements and rule-based requirements resulted in significant improvement in report efficiency and quality.

心脏磁共振(CMR)成像检查的临床报告通常采用听写方法,需要非常小心地捕获一致和全面的数据。我们试图通过利用结构化报告框架来实现报告标准化,通过合并自动数据传输、半自动分割工具来提高效率,以及基于规则的报告要求来提高质量和标准化,从而改变报告过程。用于安排和协议考试以及分析获取的图像的应用程序之间的接口被创建,以便将源信息直接引入结构化报告环境。对报告CMR的医生进行调查,通过自我报告的报告时间来确定新流程的满意度和提高的效率。质量改进是通过检查报告参数的一致性来评估的,包括基于规则的要求。采用自动测量和基于规则的需求设计的结构化报告过程显著提高了报告的效率和质量。
{"title":"Impact of Automated Transfer of Semi-Automated Segmentation and Structured Report Rule Requirements on Cardiac MRI Report Quality, Standardization, and Efficiency.","authors":"Diane Rizkallah, Neil L Greenberg, Rishabh Khurana, Vadivelan Palanisamy, Ben Alencherry, Carl Ammoury, Yezan Salam, Lisa Lamovsky, Haitham Fares, Robert Geschke, Richard Grimm, Christopher Nguyen, David Chen, Deborah H Kwon","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Clinical reporting of cardiac magnetic resonance (CMR) imaging exams is commonly performed with a dictation approach which requires great care to capture both consistent and comprehensive data. We sought to transform the reporting process by utilizing a structured report framework for reporting standardization, by incorporating automated transfer of data semi-automated segmentation tools for efficiency, and rule-based reporting requirements to improve quality and standardization. Interfaces between the applications used to schedule and protocol exams and to analyze the acquired images were created to bring the source information directly into the structured reporting environment. The physicians reporting CMR were surveyed to determine satisfaction and improved efficiency with the new process through self-reported reporting time. Quality improvement was assessed by examining the consistency of reported parameters with the inclusion of rule-based requirements. The designed structured reporting process with automated measurements and rule-based requirements resulted in significant improvement in report efficiency and quality.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"950-959"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099410/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144657","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
PathSAM: Enhancing Oral Cancer Detection with Advanced Segmentation and Explainability. PathSAM:通过先进的分割和可解释性增强口腔癌的检测。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Suraj Sood, Jawad S Shah, Saeed Alqarn, Yugyung Lee

Building on the success of the Segment Anything Model (SAM) in image segmentation, "PathSAM: SAM for Pathological Images in Oral Cancer Detection" addresses the unique challenges associated with diagnosing oral cancer. Although SAM is versatile, its application to pathological images is hindered by its inherent complexity and variability. PathSAM advances beyond traditional deep-learning methods by delivering superior accuracy and detail in segmenting critical datasets like ORCA and OCDC, as demonstrated through both quantitative and qualitative evaluations. The integration of Large Language Models (LLMs) further enhances PathSAM by providing clear, interpretable segmentation results, facilitating accurate tumor identification, and improving communication between patients and healthcare providers. This innovation positions PathSAM as a valuable tool in medical diagnostics.

基于SAM在图像分割方面的成功,“PathSAM:用于口腔癌检测的病理图像SAM”解决了与口腔癌诊断相关的独特挑战。虽然SAM是通用的,但其在病理图像中的应用受到其固有的复杂性和可变性的阻碍。如定量和定性评估所示,PathSAM超越了传统的深度学习方法,在分割ORCA和OCDC等关键数据集方面提供了卓越的准确性和细节。大型语言模型(llm)的集成通过提供清晰、可解释的分割结果、促进准确的肿瘤识别以及改善患者与医疗保健提供者之间的沟通,进一步增强了PathSAM。这一创新使PathSAM成为医学诊断领域的一种有价值的工具。
{"title":"PathSAM: Enhancing Oral Cancer Detection with Advanced Segmentation and Explainability.","authors":"Suraj Sood, Jawad S Shah, Saeed Alqarn, Yugyung Lee","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Building on the success of the Segment Anything Model (SAM) in image segmentation, \"PathSAM: SAM for Pathological Images in Oral Cancer Detection\" addresses the unique challenges associated with diagnosing oral cancer. Although SAM is versatile, its application to pathological images is hindered by its inherent complexity and variability. PathSAM advances beyond traditional deep-learning methods by delivering superior accuracy and detail in segmenting critical datasets like ORCA and OCDC, as demonstrated through both quantitative and qualitative evaluations. The integration of Large Language Models (LLMs) further enhances PathSAM by providing clear, interpretable segmentation results, facilitating accurate tumor identification, and improving communication between patients and healthcare providers. This innovation positions PathSAM as a valuable tool in medical diagnostics.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1069-1078"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099372/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144664","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA ... Annual Symposium proceedings. AMIA Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1