首页 > 最新文献

AMIA ... Annual Symposium proceedings. AMIA Symposium最新文献

英文 中文
Web-based Interventions for Substance Use Disorders and Mental Health: Preliminary findings from a Scoping Review. 基于网络的物质使用障碍和精神健康干预:范围审查的初步发现。
Pub Date : 2025-12-19 eCollection Date: 2024-01-01
Yuri Quintana, Amanda L Joseph, Gyana Srivastava

This scoping review evaluated the efficacy and potential of web-based interventions for substance use disorders and mental health conditions. The studies comprise randomized controlled trials, pilot trials, and effectiveness trials. Web-based interventions consistently demonstrated significant reductions in substance use, improvements in mental health outcomes (e.g., PTSD, depression, anxiety), and enhancements in emotion regulation, help-seeking, and quality of life. Several studies found web-based interventions to be non-inferior or superior to traditional face-to-face treatments. Despite limitations in the current evidence base, such as methodological issues and lack of long-term follow-up, the findings highlight the promise of web-based interventions in expanding access to evidence-based care, particularly for underserved populations. Future research should focus on refining interventions, exploring novel technologies, and evaluating long-term effectiveness and cost-effectiveness. The integration of web-based interventions into healthcare systems has the potential to significantly impact public health by increasing treatment accessibility and improving outcomes for individuals with substance use disorders and mental health conditions.

本范围审查评估了基于网络的药物使用障碍和精神健康状况干预措施的有效性和潜力。研究包括随机对照试验、试点试验和有效性试验。基于网络的干预措施一致表明,药物使用显著减少,精神健康结果(例如,创伤后应激障碍、抑郁、焦虑)得到改善,情绪调节、寻求帮助和生活质量得到增强。几项研究发现,基于网络的干预措施并不逊于或优于传统的面对面治疗。尽管目前的证据基础存在局限性,例如方法问题和缺乏长期随访,但研究结果强调了基于网络的干预措施在扩大获得循证护理方面的前景,特别是对于服务不足的人群。未来的研究应侧重于改进干预措施,探索新技术,评估长期有效性和成本效益。将基于网络的干预措施整合到卫生保健系统中,有可能通过增加治疗可及性和改善物质使用障碍和精神健康状况患者的预后,对公共卫生产生重大影响。
{"title":"Web-based Interventions for Substance Use Disorders and Mental Health: Preliminary findings from a Scoping Review.","authors":"Yuri Quintana, Amanda L Joseph, Gyana Srivastava","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This scoping review evaluated the efficacy and potential of web-based interventions for substance use disorders and mental health conditions. The studies comprise randomized controlled trials, pilot trials, and effectiveness trials. Web-based interventions consistently demonstrated significant reductions in substance use, improvements in mental health outcomes (e.g., PTSD, depression, anxiety), and enhancements in emotion regulation, help-seeking, and quality of life. Several studies found web-based interventions to be non-inferior or superior to traditional face-to-face treatments. Despite limitations in the current evidence base, such as methodological issues and lack of long-term follow-up, the findings highlight the promise of web-based interventions in expanding access to evidence-based care, particularly for underserved populations. Future research should focus on refining interventions, exploring novel technologies, and evaluating long-term effectiveness and cost-effectiveness. The integration of web-based interventions into healthcare systems has the potential to significantly impact public health by increasing treatment accessibility and improving outcomes for individuals with substance use disorders and mental health conditions.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"940-949"},"PeriodicalIF":0.0,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12778349/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145936476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lessons Learned from OpenEMR Implementation in Graduate Health Informatics Curriculum. 研究生健康信息学课程实施openenemr的经验教训。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Keerthika Sunchu, Megha M Moncy, Saptarshi Purkayastha, Cathy R Fulton

This study examines the integration of OpenEMR, a Meaningful Use-certified open-source electronic health record (EHR) system, into a Health Informatics curriculum. The primary objective was to address the disparity between theoretical knowledge and practical application in health informatics education. The implementation process revealed several significant challenges, including unintended system modifications that compromised functionality, data entry errors that impacted usability, and technical issues that impeded accessibility. To mitigate these challenges, a series of interventions were implemented. These included backend modifications to enhance data entry accuracy, usability improvements such as limiting open tabs to facilitate navigation, and the implementation ofproactive measures to expedite the resolution of technical issues. The experiences gained from this integration process highlight three critical aspects of health informatics education: the significance of practical proficiency in EHR systems, the necessity for user-centric interface design, and the importance of adaptability and problem-solving skills. The study proposes several future directions for research and practice. These include fostering global collaboration, developing standardized curricula for EHR education, and establishing robust mechanisms for continuous assessment and improvement. The findings underscore the pivotal role of integrating hands-on EHR experience into health informatics education, emphasizing its potential to equip students with the essential competencies required to navigate the complex and dynamic healthcare landscape.

本研究考察了开放式健康档案系统(一个有意义使用认证的开源电子健康档案系统)与健康信息学课程的整合。主要目的是解决卫生信息学教育中理论知识与实际应用之间的差距。实现过程揭示了几个重要的挑战,包括破坏功能的意外系统修改,影响可用性的数据输入错误,以及阻碍可访问性的技术问题。为了缓解这些挑战,实施了一系列干预措施。其中包括后端修改以提高数据输入的准确性,可用性改进(如限制打开选项卡以方便导航),以及实施主动措施以加快技术问题的解决。从这一整合过程中获得的经验突出了健康信息学教育的三个关键方面:电子病历系统实践熟练程度的重要性,以用户为中心的界面设计的必要性,以及适应性和解决问题技能的重要性。本研究提出了未来研究和实践的几个方向。这些措施包括促进全球合作,制定电子健康档案教育的标准化课程,以及建立持续评估和改进的健全机制。研究结果强调了将电子病历实践经验整合到健康信息学教育中的关键作用,强调了其潜力,使学生具备驾驭复杂和动态的医疗保健环境所需的基本能力。
{"title":"Lessons Learned from OpenEMR Implementation in Graduate Health Informatics Curriculum.","authors":"Keerthika Sunchu, Megha M Moncy, Saptarshi Purkayastha, Cathy R Fulton","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study examines the integration of OpenEMR, a Meaningful Use-certified open-source electronic health record (EHR) system, into a Health Informatics curriculum. The primary objective was to address the disparity between theoretical knowledge and practical application in health informatics education. The implementation process revealed several significant challenges, including unintended system modifications that compromised functionality, data entry errors that impacted usability, and technical issues that impeded accessibility. To mitigate these challenges, a series of interventions were implemented. These included backend modifications to enhance data entry accuracy, usability improvements such as limiting open tabs to facilitate navigation, and the implementation ofproactive measures to expedite the resolution of technical issues. The experiences gained from this integration process highlight three critical aspects of health informatics education: the significance of practical proficiency in EHR systems, the necessity for user-centric interface design, and the importance of adaptability and problem-solving skills. The study proposes several future directions for research and practice. These include fostering global collaboration, developing standardized curricula for EHR education, and establishing robust mechanisms for continuous assessment and improvement. The findings underscore the pivotal role of integrating hands-on EHR experience into health informatics education, emphasizing its potential to equip students with the essential competencies required to navigate the complex and dynamic healthcare landscape.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1079-1088"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RealMedQA: A pilot biomedical question answering dataset containing realistic clinical questions. RealMedQA:一个试验性的生物医学问题回答数据集,包含现实的临床问题。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Gregory Kell, Angus Roberts, Serge Umansky, Yuti Khare, Najma Ahmed, Nikhil Patel, Chloe Simela, Jack Coumbe, Julian Rozario, Ryan-Rhys Griffiths, Iain J Marshall

Clinical question answering systems have the potential to provide clinicians with relevant and timely answers to their questions. Nonetheless, despite the advances that have been made, adoption of these systems in clinical settings has been slow. One issue is a lack of question-answering datasets which reflect the real-world needs of health professionals. In this work, we present RealMedQA, a dataset of realistic clinical questions generated by humans and an LLM. We describe the process for generating and verifying the QA pairs and assess several QA models on BioASQ and RealMedQA to assess the relative difficulty of matching answers to questions. We show that the LLM is more cost-efficient for generating "ideal" QA pairs. Additionally, we achieve a lower lexical similarity between questions and answers than BioASQ which provides an additional challenge to the top two QA models, as per the results. We release our code and our dataset publicly to encourage further research.

临床问答系统有可能为临床医生提供相关和及时的问题答案。然而,尽管取得了进展,但在临床环境中采用这些系统的速度很慢。一个问题是缺乏反映现实世界卫生专业人员需求的问答数据集。在这项工作中,我们提出了RealMedQA,这是一个由人类和法学硕士生成的现实临床问题的数据集。我们描述了生成和验证QA对的过程,并在BioASQ和RealMedQA上评估了几个QA模型,以评估匹配问题答案的相对难度。我们证明了LLM在生成“理想”QA对方面更具成本效益。此外,根据结果,我们实现了比BioASQ更低的问题和答案之间的词汇相似性,这为前两个QA模型提供了额外的挑战。我们公开发布代码和数据集,以鼓励进一步的研究。
{"title":"RealMedQA: A pilot biomedical question answering dataset containing realistic clinical questions.","authors":"Gregory Kell, Angus Roberts, Serge Umansky, Yuti Khare, Najma Ahmed, Nikhil Patel, Chloe Simela, Jack Coumbe, Julian Rozario, Ryan-Rhys Griffiths, Iain J Marshall","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Clinical question answering systems have the potential to provide clinicians with relevant and timely answers to their questions. Nonetheless, despite the advances that have been made, adoption of these systems in clinical settings has been slow. One issue is a lack of question-answering datasets which reflect the real-world needs of health professionals. In this work, we present RealMedQA, a dataset of realistic clinical questions generated by humans and an LLM. We describe the process for generating and verifying the QA pairs and assess several QA models on BioASQ and RealMedQA to assess the relative difficulty of matching answers to questions. We show that the LLM is more cost-efficient for generating \"ideal\" QA pairs. Additionally, we achieve a lower lexical similarity between questions and answers than BioASQ which provides an additional challenge to the top two QA models, as per the results. We release our code and our dataset publicly to encourage further research.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"590-599"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099375/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Towards Interpretable End-Stage Renal Disease (ESRD) Prediction: Utilizing Administrative Claims Data with Explainable AI Techniques. 迈向可解释的终末期肾病(ESRD)预测:利用行政索赔数据和可解释的人工智能技术。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Yubo Li, Saba Al-Sayouri, Rema Padman

This study explores the potential of utilizing administrative claims data, combined with advanced machine learning and deep learning techniques, to predict the progression of Chronic Kidney Disease (CKD) to End-Stage Renal Disease (ESRD). We analyze a comprehensive, 10-year dataset provided by a major health insurance organization to develop prediction models for multiple observation windows using traditional machine learning methods such as Random Forest and XGBoost as well as deep learning approaches such as Long Short-Term Memory (LSTM) networks. Our findings demonstrate that the LSTM model, particularly with a 24-month observation window, exhibits superior performance in predicting ESRD progression, outperforming existing models in the literature. We further apply SHap-ley Additive exPlanations (SHAP) analysis to enhance interpretability, providing insights into the impact of individual features on predictions at the individual patient level. This study underscores the value of leveraging administrative claims data for CKD management and predicting ESRD progression.

本研究探讨了利用行政索赔数据,结合先进的机器学习和深度学习技术,预测慢性肾脏疾病(CKD)到终末期肾脏疾病(ESRD)进展的潜力。我们分析了由一家大型健康保险组织提供的全面的10年数据集,使用传统的机器学习方法(如Random Forest和XGBoost)以及深度学习方法(如长短期记忆(LSTM)网络)开发多个观测窗口的预测模型。我们的研究结果表明,LSTM模型,特别是具有24个月观察窗口的LSTM模型,在预测ESRD进展方面表现优异,优于文献中的现有模型。我们进一步应用SHAP -ley加性解释(SHAP)分析来提高可解释性,从而深入了解个体特征对个体患者水平预测的影响。本研究强调了利用行政索赔数据对CKD管理和预测ESRD进展的价值。
{"title":"Towards Interpretable End-Stage Renal Disease (ESRD) Prediction: Utilizing Administrative Claims Data with Explainable AI Techniques.","authors":"Yubo Li, Saba Al-Sayouri, Rema Padman","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study explores the potential of utilizing administrative claims data, combined with advanced machine learning and deep learning techniques, to predict the progression of Chronic Kidney Disease (CKD) to End-Stage Renal Disease (ESRD). We analyze a comprehensive, 10-year dataset provided by a major health insurance organization to develop prediction models for multiple observation windows using traditional machine learning methods such as Random Forest and XGBoost as well as deep learning approaches such as Long Short-Term Memory (LSTM) networks. Our findings demonstrate that the LSTM model, particularly with a 24-month observation window, exhibits superior performance in predicting ESRD progression, outperforming existing models in the literature. We further apply SHap-ley Additive exPlanations (SHAP) analysis to enhance interpretability, providing insights into the impact of individual features on predictions at the individual patient level. This study underscores the value of leveraging administrative claims data for CKD management and predicting ESRD progression.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"664-673"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099416/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development of a Human Evaluation Framework and Correlation with Automated Metrics for Natural Language Generation of Medical Diagnoses. 医学诊断自然语言生成的人类评估框架及其与自动度量的关联。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Emma Croxford, Yanjun Gao, Brian Patterson, Daniel To, Samuel Tesch, Dmitriy Dligach, Anoop Mayampurath, Matthew M Churpek, Majid Afshar

In the evolving landscape of clinical Natural Language Generation (NLG), assessing abstractive text quality remains challenging, as existing methods often overlook generative task complexities. This work aimed to examine the current state of automated evaluation metrics in NLG in healthcare. To have a robust and well-validated baseline with which to examine the alignment of these metrics, we created a comprehensive human evaluation framework. Employing ChatGPT-3.5-turbo generative output, we correlated human judgments with each metric. None of the metrics demonstrated high alignment; however, the SapBERT score-a Unified Medical Language System (UMLS)- showed the best results. This underscores the importance of incorporating domain-specific knowledge into evaluation efforts. Our work reveals the deficiency in quality evaluations for generated text and introduces our comprehensive human evaluation framework as a baseline. Future efforts should prioritize integrating medical knowledge databases to enhance the alignment of automated metrics, particularly focusing on refining the SapBERT score for improved assessments.

在临床自然语言生成(NLG)不断发展的环境中,评估抽象文本质量仍然具有挑战性,因为现有方法经常忽略生成任务的复杂性。这项工作旨在检查医疗保健中NLG自动评估指标的现状。为了有一个健壮的和经过良好验证的基线来检查这些度量的一致性,我们创建了一个全面的人类评估框架。使用chatgpt -3.5涡轮生成输出,我们将人类判断与每个指标关联起来。没有一个指标显示出高度的一致性;然而,统一医学语言系统(UMLS)的SapBERT评分显示出最好的结果。这强调了将特定领域的知识纳入评估工作的重要性。我们的工作揭示了生成文本质量评估的不足,并介绍了我们的综合人类评估框架作为基线。未来的工作应优先考虑整合医学知识数据库,以增强自动化度量标准的一致性,特别是侧重于改进SapBERT评分以改进评估。
{"title":"Development of a Human Evaluation Framework and Correlation with Automated Metrics for Natural Language Generation of Medical Diagnoses.","authors":"Emma Croxford, Yanjun Gao, Brian Patterson, Daniel To, Samuel Tesch, Dmitriy Dligach, Anoop Mayampurath, Matthew M Churpek, Majid Afshar","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>In the evolving landscape of clinical Natural Language Generation (NLG), assessing abstractive text quality remains challenging, as existing methods often overlook generative task complexities. This work aimed to examine the current state of automated evaluation metrics in NLG in healthcare. To have a robust and well-validated baseline with which to examine the alignment of these metrics, we created a comprehensive human evaluation framework. Employing ChatGPT-3.5-turbo generative output, we correlated human judgments with each metric. None of the metrics demonstrated high alignment; however, the SapBERT score-a Unified Medical Language System (UMLS)- showed the best results. This underscores the importance of incorporating domain-specific knowledge into evaluation efforts. Our work reveals the deficiency in quality evaluations for generated text and introduces our comprehensive human evaluation framework as a baseline. Future efforts should prioritize integrating medical knowledge databases to enhance the alignment of automated metrics, particularly focusing on refining the SapBERT score for improved assessments.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"309-318"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099413/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Factors Driving Patient Decisions to Access Electronic Health Records via a Breast Cancer Online Decision Aid linked to the Patient Portal. 通过与患者门户网站链接的乳腺癌在线决策辅助,驱动患者决定访问电子健康记录的因素。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Anna Vaynrub, Subiksha Umakanth, Harry West, Alissa Michel, Jill Dimond, Stephan Constante, Katherine D Crew, Rita Kukafka

A critical strategy in limiting breast cancer (BC) mortality is the early identification of high-risk patients and implementation of risk-reducing measures. RealRisks, an online decision aid constructed by our team to provide education on BC risk and personalized risk assessment, allows users the option to connect to their electronic health record (EHR) to extract requisite data to calculate BC risk via Fast Healthcare Interoperability Resources (FHIR). Using data from RealRisks user profiles, baseline questionnaires, and interview transcripts, we sought to understand the differences between the groups of patients who opted to download their data via the EHR vs. those who did not. A higher percentage of those who downloaded data (53.8% vs. 42.3%) identified as Hispanic/Latino compared to those who did not download. Thematic analysis suggested that while data security and privacy concerns may lead to hesitation, it is perhaps technological barriers that most significantly limit EHR download.

限制乳腺癌(BC)死亡率的一个关键策略是早期识别高危患者并实施降低风险的措施。RealRisks是由我们的团队构建的在线决策辅助工具,旨在提供BC风险教育和个性化风险评估,允许用户选择连接到他们的电子健康记录(EHR),以提取必要的数据,通过快速医疗保健互操作性资源(FHIR)计算BC风险。使用来自RealRisks用户档案、基线问卷和访谈记录的数据,我们试图了解选择通过电子病历下载数据的患者组与不选择通过电子病历下载数据的患者组之间的差异。与没有下载数据的人相比,下载数据的人(53.8% vs. 42.3%)中西班牙裔/拉丁裔人的比例更高。专题分析表明,虽然数据安全和隐私问题可能导致犹豫,但技术障碍可能是限制电子病历下载的最重要因素。
{"title":"Factors Driving Patient Decisions to Access Electronic Health Records via a Breast Cancer Online Decision Aid linked to the Patient Portal.","authors":"Anna Vaynrub, Subiksha Umakanth, Harry West, Alissa Michel, Jill Dimond, Stephan Constante, Katherine D Crew, Rita Kukafka","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A critical strategy in limiting breast cancer (BC) mortality is the early identification of high-risk patients and implementation of risk-reducing measures. <i>RealRisks</i>, an online decision aid constructed by our team to provide education on BC risk and personalized risk assessment, allows users the option to connect to their electronic health record (EHR) to extract requisite data to calculate BC risk via Fast Healthcare Interoperability Resources (FHIR). Using data from <i>RealRisks</i> user profiles, baseline questionnaires, and interview transcripts, we sought to understand the differences between the groups of patients who opted to download their data via the EHR vs. those who did not. A higher percentage of those who downloaded data (53.8% vs. 42.3%) identified as Hispanic/Latino compared to those who did not download. Thematic analysis suggested that while data security and privacy concerns may lead to hesitation, it is perhaps technological barriers that most significantly limit EHR download.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1159-1168"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099358/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large Language Models Struggle in Token-Level Clinical Named Entity Recognition. 大型语言模型在符号级临床命名实体识别中的挣扎。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Qiuhao Lu, Rui Li, Andrew Wen, Jinlian Wang, Liwei Wang, Hongfang Liu

Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity, complexity, and specificity pose considerable challenges. In the clinical domain, Named Entity Recognition (NER) stands out as an essential task and it plays a crucial role in extracting relevant information from clinical texts. Despite the promise of LLMs, current research mostly concentrates on document-level NER, identifying entities in a more general context across entire documents, without extracting their precise location. Additionally, efforts have been directed towards adapting ChatGPTfor token-level NER. However, there is a significant research gap when it comes to employing token-level NER for clinical texts, especially with the use of local open-source LLMs. This study aims to bridge this gap by investigating the effectiveness of both proprietary and local LLMs in token-level clinical NER. Essentially, we delve into the capabilities of these models through a series of experiments involving zero-shot prompting, few-shot prompting, retrieval-augmented generation (RAG), and instruction-fine-tuning. Our exploration reveals the inherent challenges LLMs face in token-level NER, particularly in the context of rare diseases, and suggests possible improvements for their application in healthcare. This research contributes to narrowing a significant gap in healthcare informatics and offers insights that could lead to a more refined application of LLMs in the healthcare sector.

大型语言模型(llm)已经彻底改变了各个领域,包括医疗保健领域,其中它们被用于各种应用程序。在罕见疾病的背景下,它们的效用尤其重要,因为数据的稀缺性、复杂性和特异性构成了相当大的挑战。在临床领域,命名实体识别(NER)是一项重要的任务,它在从临床文本中提取相关信息方面起着至关重要的作用。尽管llm很有前途,但目前的研究主要集中在文档级NER上,即在整个文档中更一般的上下文中识别实体,而不是提取它们的精确位置。此外,还在努力使chatgpt适应令牌级NER。然而,当涉及到为临床文本使用令牌级NER时,特别是使用本地开源法学硕士时,存在显着的研究差距。本研究旨在通过调查专有和本地法学硕士在令牌级临床NER中的有效性来弥合这一差距。从本质上讲,我们通过一系列涉及零提示、少提示、检索增强生成(RAG)和指令微调的实验来深入研究这些模型的功能。我们的探索揭示了llm在代币级NER中面临的固有挑战,特别是在罕见疾病的背景下,并建议了它们在医疗保健领域应用的可能改进。这项研究有助于缩小医疗保健信息学方面的重大差距,并提供了可能导致法学硕士在医疗保健领域更精细应用的见解。
{"title":"Large Language Models Struggle in Token-Level Clinical Named Entity Recognition.","authors":"Qiuhao Lu, Rui Li, Andrew Wen, Jinlian Wang, Liwei Wang, Hongfang Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity, complexity, and specificity pose considerable challenges. In the clinical domain, Named Entity Recognition (NER) stands out as an essential task and it plays a crucial role in extracting relevant information from clinical texts. Despite the promise of LLMs, current research mostly concentrates on document-level NER, identifying entities in a more general context across entire documents, without extracting their precise location. Additionally, efforts have been directed towards adapting ChatGPTfor token-level NER. However, there is a significant research gap when it comes to employing token-level NER for clinical texts, especially with the use of local open-source LLMs. This study aims to bridge this gap by investigating the effectiveness of both proprietary and local LLMs in token-level clinical NER. Essentially, we delve into the capabilities of these models through a series of experiments involving zero-shot prompting, few-shot prompting, retrieval-augmented generation (RAG), and instruction-fine-tuning. Our exploration reveals the inherent challenges LLMs face in token-level NER, particularly in the context of rare diseases, and suggests possible improvements for their application in healthcare. This research contributes to narrowing a significant gap in healthcare informatics and offers insights that could lead to a more refined application of LLMs in the healthcare sector.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"748-757"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099373/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Enhancement of Fairness in AI for Chest X-ray Classification. 增强胸部x线分类人工智能公平性
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Nicholas J Jackson, Chao Yan, Bradley A Malin

The use of artificial intelligence (AI) in medicine has shown promise to improve the quality of healthcare decisions. However, AI can be biased in a manner that produces unfair predictions for certain demographic subgroups. In MIMIC-CXR, a publicly available dataset of over 300,000 chest X-ray images, diagnostic AI has been shown to have a higher false negative rate for racial minorities. We evaluated the capacity of synthetic data augmentation, oversampling, and demographic-based corrections to enhance the fairness of AI predictions. We show that adjusting unfair predictions for demographic attributes, such as race, is ineffective at improving fairness or predictive performance. However, using oversampling and synthetic data augmentation to modify disease prevalence reduced such disparities by 74.7% and 10.6%, respectively. Moreover, such fairness gains were accomplished without reduction in performance (95% CI AUC: [0.816, 0.820] versus [0.810, 0.819] versus [0.817, 0.821] for baseline, oversampling, and augmentation, respectively).

人工智能(AI)在医学领域的应用有望提高医疗保健决策的质量。然而,人工智能可能会以某种方式产生对某些人口统计子群体的不公平预测。MIMIC-CXR是一个公开的超过30万张胸部x射线图像数据集,在该数据集中,人工智能诊断对少数种族的假阴性率更高。我们评估了合成数据增强、过采样和基于人口统计的修正的能力,以提高人工智能预测的公平性。我们表明,调整人口统计属性(如种族)的不公平预测在提高公平性或预测性能方面是无效的。然而,使用过采样和合成数据增强来修改患病率,分别将这种差异缩小了74.7%和10.6%。此外,这种公平性的提高在不降低性能的情况下实现(95% CI AUC分别为基线、过采样和增强的[0.816,0.820]、[0.810,0.819]和[0.817,0.821])。
{"title":"Enhancement of Fairness in AI for Chest X-ray Classification.","authors":"Nicholas J Jackson, Chao Yan, Bradley A Malin","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The use of artificial intelligence (AI) in medicine has shown promise to improve the quality of healthcare decisions. However, AI can be biased in a manner that produces unfair predictions for certain demographic subgroups. In MIMIC-CXR, a publicly available dataset of over 300,000 chest X-ray images, diagnostic AI has been shown to have a higher false negative rate for racial minorities. We evaluated the capacity of synthetic data augmentation, oversampling, and demographic-based corrections to enhance the fairness of AI predictions. We show that adjusting unfair predictions for demographic attributes, such as race, is ineffective at improving fairness or predictive performance. However, using oversampling and synthetic data augmentation to modify disease prevalence reduced such disparities by 74.7% and 10.6%, respectively. Moreover, such fairness gains were accomplished without reduction in performance (95% CI AUC: [0.816, 0.820] versus [0.810, 0.819] versus [0.817, 0.821] for baseline, oversampling, and augmentation, respectively).</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"551-560"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099404/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure. 叙事特征还是结构特征?大型语言模型识别心脏衰竭风险癌症患者的研究。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Ziyi Chen, Mengyuan Zhang, Mustafa Mohammed Ahmed, Yi Guo, Thomas J George, Jiang Bian, Yonghui Wu

Cancer treatments are known to introduce cardiotoxicity, negatively impacting outcomes and survivorship. Identifying cancer patients at risk of heart failure (HF) is critical to improving cancer treatment outcomes and safety. This study examined machine learning (ML) models to identify cancer patients at risk of HF using electronic health records (EHRs), including traditional ML, Time-Aware long short-term memory (T-LSTM), and large language models (LLMs) using novel narrative features derived from the structured medical codes. We identified a cancer cohort of 12,806 patients from the University of Florida Health, diagnosed with lung, breast, and colorectal cancers, among which 1,602 individuals developed HF after cancer. The LLM, GatorTron-3.9B, achieved the best F1 scores, outperforming the traditional support vector machines by 39%, the T-LSTM deep learning model by 7%, and a widely used transformer model, BERT, by 5.6%. The analysis shows that the proposed narrative features remarkably increased feature density and improved performance.

众所周知,癌症治疗会引入心脏毒性,对预后和生存率产生负面影响。识别有心力衰竭(HF)风险的癌症患者对于改善癌症治疗结果和安全性至关重要。本研究检查了机器学习(ML)模型,使用电子健康记录(EHRs)识别有HF风险的癌症患者,包括传统的ML,时间感知长短期记忆(T-LSTM),以及使用源自结构化医疗代码的新颖叙事特征的大型语言模型(llm)。我们确定了来自佛罗里达健康大学的12806例癌症患者,诊断为肺癌、乳腺癌和结直肠癌,其中1602例癌症后发生心衰。LLM GatorTron-3.9B取得了最好的F1分数,比传统的支持向量机高出39%,比T-LSTM深度学习模型高出7%,比广泛使用的变压器模型BERT高出5.6%。分析表明,所提出的叙事特征显著增加了特征密度,提高了性能。
{"title":"Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure.","authors":"Ziyi Chen, Mengyuan Zhang, Mustafa Mohammed Ahmed, Yi Guo, Thomas J George, Jiang Bian, Yonghui Wu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Cancer treatments are known to introduce cardiotoxicity, negatively impacting outcomes and survivorship. Identifying cancer patients at risk of heart failure (HF) is critical to improving cancer treatment outcomes and safety. This study examined machine learning (ML) models to identify cancer patients at risk of HF using electronic health records (EHRs), including traditional ML, Time-Aware long short-term memory (T-LSTM), and large language models (LLMs) using novel narrative features derived from the structured medical codes. We identified a cancer cohort of 12,806 patients from the University of Florida Health, diagnosed with lung, breast, and colorectal cancers, among which 1,602 individuals developed HF after cancer. The LLM, GatorTron-3.9B, achieved the best F1 scores, outperforming the traditional support vector machines by 39%, the T-LSTM deep learning model by 7%, and a widely used transformer model, BERT, by 5.6%. The analysis shows that the proposed narrative features remarkably increased feature density and improved performance.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"242-251"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099403/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Acceptability of pictographs as a novel patient identifier to improve patient safety in the neonatal intensive care unit. 可接受的象形文字作为一种新的患者标识符,以提高新生儿重症监护病房的患者安全。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Hojjat Salmasian, Carmina Erdei, Joanne R Applebaum, Danielle Sharon, Katie Hannon, Deborah Cuddyer, Mary Sawyer, Tina Steele, Yvonne Sheldon, I-Fong S Lehman, Joseph E Schwartz, Allen Chen, Jason Adelman

As part of a randomized controlled trial on the use of pictographs (images used in lieu of a patient photo) embedded in the electronic health record to reduce wrong-patient errors in the neonatal intensive care unit (NICU), we conducted a series of surveys of parents, providers and nurses in the NICU. Data from survey responses were thematically analyzed and categorized. We found that in all groups, there was very high awareness of the intended purpose of the pictographs; however, the perception of providers and nurses about the effectiveness of pictographs was not as strong. While several providers and nurses acknowledged that pictographs can or have helped them avoid wrong-patient errors when caring for multiple birth infants (such as twins), many nurses believed that their current practice of the use of two patient identifiers was sufficient, and pictographs were not useful. Parents reported that pictographs improved their experience of care.

作为一项随机对照试验的一部分,我们对新生儿重症监护病房(NICU)的父母、医护人员和护士进行了一系列调查,研究了在电子健康记录中使用象形文字(用图像代替患者照片)来减少错误的患者。从调查反馈的数据进行主题分析和分类。我们发现,在所有的小组中,人们对象形文字的预期目的都有很高的认识;然而,提供者和护士对象形文字的有效性的看法并不强烈。虽然一些提供者和护士承认,在照顾多胞胎(如双胞胎)时,象形文字可以或已经帮助他们避免了错误的患者错误,但许多护士认为,他们目前使用两个患者标识符的做法已经足够了,象形文字没有用处。家长们报告说,象形文字改善了他们的照顾体验。
{"title":"Acceptability of pictographs as a novel patient identifier to improve patient safety in the neonatal intensive care unit.","authors":"Hojjat Salmasian, Carmina Erdei, Joanne R Applebaum, Danielle Sharon, Katie Hannon, Deborah Cuddyer, Mary Sawyer, Tina Steele, Yvonne Sheldon, I-Fong S Lehman, Joseph E Schwartz, Allen Chen, Jason Adelman","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>As part of a randomized controlled trial on the use of pictographs (images used in lieu of a patient photo) embedded in the electronic health record to reduce wrong-patient errors in the neonatal intensive care unit (NICU), we conducted a series of surveys of parents, providers and nurses in the NICU. Data from survey responses were thematically analyzed and categorized. We found that in all groups, there was very high awareness of the intended purpose of the pictographs; however, the perception of providers and nurses about the effectiveness of pictographs was not as strong. While several providers and nurses acknowledged that pictographs can or have helped them avoid wrong-patient errors when caring for multiple birth infants (such as twins), many nurses believed that their current practice of the use of two patient identifiers was sufficient, and pictographs were not useful. Parents reported that pictographs improved their experience of care.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"980-986"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099398/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA ... Annual Symposium proceedings. AMIA Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1