This scoping review evaluated the efficacy and potential of web-based interventions for substance use disorders and mental health conditions. The studies comprise randomized controlled trials, pilot trials, and effectiveness trials. Web-based interventions consistently demonstrated significant reductions in substance use, improvements in mental health outcomes (e.g., PTSD, depression, anxiety), and enhancements in emotion regulation, help-seeking, and quality of life. Several studies found web-based interventions to be non-inferior or superior to traditional face-to-face treatments. Despite limitations in the current evidence base, such as methodological issues and lack of long-term follow-up, the findings highlight the promise of web-based interventions in expanding access to evidence-based care, particularly for underserved populations. Future research should focus on refining interventions, exploring novel technologies, and evaluating long-term effectiveness and cost-effectiveness. The integration of web-based interventions into healthcare systems has the potential to significantly impact public health by increasing treatment accessibility and improving outcomes for individuals with substance use disorders and mental health conditions.
{"title":"Web-based Interventions for Substance Use Disorders and Mental Health: Preliminary findings from a Scoping Review.","authors":"Yuri Quintana, Amanda L Joseph, Gyana Srivastava","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This scoping review evaluated the efficacy and potential of web-based interventions for substance use disorders and mental health conditions. The studies comprise randomized controlled trials, pilot trials, and effectiveness trials. Web-based interventions consistently demonstrated significant reductions in substance use, improvements in mental health outcomes (e.g., PTSD, depression, anxiety), and enhancements in emotion regulation, help-seeking, and quality of life. Several studies found web-based interventions to be non-inferior or superior to traditional face-to-face treatments. Despite limitations in the current evidence base, such as methodological issues and lack of long-term follow-up, the findings highlight the promise of web-based interventions in expanding access to evidence-based care, particularly for underserved populations. Future research should focus on refining interventions, exploring novel technologies, and evaluating long-term effectiveness and cost-effectiveness. The integration of web-based interventions into healthcare systems has the potential to significantly impact public health by increasing treatment accessibility and improving outcomes for individuals with substance use disorders and mental health conditions.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"940-949"},"PeriodicalIF":0.0,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12778349/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145936476","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Keerthika Sunchu, Megha M Moncy, Saptarshi Purkayastha, Cathy R Fulton
This study examines the integration of OpenEMR, a Meaningful Use-certified open-source electronic health record (EHR) system, into a Health Informatics curriculum. The primary objective was to address the disparity between theoretical knowledge and practical application in health informatics education. The implementation process revealed several significant challenges, including unintended system modifications that compromised functionality, data entry errors that impacted usability, and technical issues that impeded accessibility. To mitigate these challenges, a series of interventions were implemented. These included backend modifications to enhance data entry accuracy, usability improvements such as limiting open tabs to facilitate navigation, and the implementation ofproactive measures to expedite the resolution of technical issues. The experiences gained from this integration process highlight three critical aspects of health informatics education: the significance of practical proficiency in EHR systems, the necessity for user-centric interface design, and the importance of adaptability and problem-solving skills. The study proposes several future directions for research and practice. These include fostering global collaboration, developing standardized curricula for EHR education, and establishing robust mechanisms for continuous assessment and improvement. The findings underscore the pivotal role of integrating hands-on EHR experience into health informatics education, emphasizing its potential to equip students with the essential competencies required to navigate the complex and dynamic healthcare landscape.
{"title":"Lessons Learned from OpenEMR Implementation in Graduate Health Informatics Curriculum.","authors":"Keerthika Sunchu, Megha M Moncy, Saptarshi Purkayastha, Cathy R Fulton","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study examines the integration of OpenEMR, a Meaningful Use-certified open-source electronic health record (EHR) system, into a Health Informatics curriculum. The primary objective was to address the disparity between theoretical knowledge and practical application in health informatics education. The implementation process revealed several significant challenges, including unintended system modifications that compromised functionality, data entry errors that impacted usability, and technical issues that impeded accessibility. To mitigate these challenges, a series of interventions were implemented. These included backend modifications to enhance data entry accuracy, usability improvements such as limiting open tabs to facilitate navigation, and the implementation ofproactive measures to expedite the resolution of technical issues. The experiences gained from this integration process highlight three critical aspects of health informatics education: the significance of practical proficiency in EHR systems, the necessity for user-centric interface design, and the importance of adaptability and problem-solving skills. The study proposes several future directions for research and practice. These include fostering global collaboration, developing standardized curricula for EHR education, and establishing robust mechanisms for continuous assessment and improvement. The findings underscore the pivotal role of integrating hands-on EHR experience into health informatics education, emphasizing its potential to equip students with the essential competencies required to navigate the complex and dynamic healthcare landscape.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1079-1088"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099383/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144577","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Clinical question answering systems have the potential to provide clinicians with relevant and timely answers to their questions. Nonetheless, despite the advances that have been made, adoption of these systems in clinical settings has been slow. One issue is a lack of question-answering datasets which reflect the real-world needs of health professionals. In this work, we present RealMedQA, a dataset of realistic clinical questions generated by humans and an LLM. We describe the process for generating and verifying the QA pairs and assess several QA models on BioASQ and RealMedQA to assess the relative difficulty of matching answers to questions. We show that the LLM is more cost-efficient for generating "ideal" QA pairs. Additionally, we achieve a lower lexical similarity between questions and answers than BioASQ which provides an additional challenge to the top two QA models, as per the results. We release our code and our dataset publicly to encourage further research.
{"title":"RealMedQA: A pilot biomedical question answering dataset containing realistic clinical questions.","authors":"Gregory Kell, Angus Roberts, Serge Umansky, Yuti Khare, Najma Ahmed, Nikhil Patel, Chloe Simela, Jack Coumbe, Julian Rozario, Ryan-Rhys Griffiths, Iain J Marshall","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Clinical question answering systems have the potential to provide clinicians with relevant and timely answers to their questions. Nonetheless, despite the advances that have been made, adoption of these systems in clinical settings has been slow. One issue is a lack of question-answering datasets which reflect the real-world needs of health professionals. In this work, we present RealMedQA, a dataset of realistic clinical questions generated by humans and an LLM. We describe the process for generating and verifying the QA pairs and assess several QA models on BioASQ and RealMedQA to assess the relative difficulty of matching answers to questions. We show that the LLM is more cost-efficient for generating \"ideal\" QA pairs. Additionally, we achieve a lower lexical similarity between questions and answers than BioASQ which provides an additional challenge to the top two QA models, as per the results. We release our code and our dataset publicly to encourage further research.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"590-599"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099375/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144715","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
This study explores the potential of utilizing administrative claims data, combined with advanced machine learning and deep learning techniques, to predict the progression of Chronic Kidney Disease (CKD) to End-Stage Renal Disease (ESRD). We analyze a comprehensive, 10-year dataset provided by a major health insurance organization to develop prediction models for multiple observation windows using traditional machine learning methods such as Random Forest and XGBoost as well as deep learning approaches such as Long Short-Term Memory (LSTM) networks. Our findings demonstrate that the LSTM model, particularly with a 24-month observation window, exhibits superior performance in predicting ESRD progression, outperforming existing models in the literature. We further apply SHap-ley Additive exPlanations (SHAP) analysis to enhance interpretability, providing insights into the impact of individual features on predictions at the individual patient level. This study underscores the value of leveraging administrative claims data for CKD management and predicting ESRD progression.
{"title":"Towards Interpretable End-Stage Renal Disease (ESRD) Prediction: Utilizing Administrative Claims Data with Explainable AI Techniques.","authors":"Yubo Li, Saba Al-Sayouri, Rema Padman","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>This study explores the potential of utilizing administrative claims data, combined with advanced machine learning and deep learning techniques, to predict the progression of Chronic Kidney Disease (CKD) to End-Stage Renal Disease (ESRD). We analyze a comprehensive, 10-year dataset provided by a major health insurance organization to develop prediction models for multiple observation windows using traditional machine learning methods such as Random Forest and XGBoost as well as deep learning approaches such as Long Short-Term Memory (LSTM) networks. Our findings demonstrate that the LSTM model, particularly with a 24-month observation window, exhibits superior performance in predicting ESRD progression, outperforming existing models in the literature. We further apply SHap-ley Additive exPlanations (SHAP) analysis to enhance interpretability, providing insights into the impact of individual features on predictions at the individual patient level. This study underscores the value of leveraging administrative claims data for CKD management and predicting ESRD progression.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"664-673"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099416/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144822","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Emma Croxford, Yanjun Gao, Brian Patterson, Daniel To, Samuel Tesch, Dmitriy Dligach, Anoop Mayampurath, Matthew M Churpek, Majid Afshar
In the evolving landscape of clinical Natural Language Generation (NLG), assessing abstractive text quality remains challenging, as existing methods often overlook generative task complexities. This work aimed to examine the current state of automated evaluation metrics in NLG in healthcare. To have a robust and well-validated baseline with which to examine the alignment of these metrics, we created a comprehensive human evaluation framework. Employing ChatGPT-3.5-turbo generative output, we correlated human judgments with each metric. None of the metrics demonstrated high alignment; however, the SapBERT score-a Unified Medical Language System (UMLS)- showed the best results. This underscores the importance of incorporating domain-specific knowledge into evaluation efforts. Our work reveals the deficiency in quality evaluations for generated text and introduces our comprehensive human evaluation framework as a baseline. Future efforts should prioritize integrating medical knowledge databases to enhance the alignment of automated metrics, particularly focusing on refining the SapBERT score for improved assessments.
{"title":"Development of a Human Evaluation Framework and Correlation with Automated Metrics for Natural Language Generation of Medical Diagnoses.","authors":"Emma Croxford, Yanjun Gao, Brian Patterson, Daniel To, Samuel Tesch, Dmitriy Dligach, Anoop Mayampurath, Matthew M Churpek, Majid Afshar","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>In the evolving landscape of clinical Natural Language Generation (NLG), assessing abstractive text quality remains challenging, as existing methods often overlook generative task complexities. This work aimed to examine the current state of automated evaluation metrics in NLG in healthcare. To have a robust and well-validated baseline with which to examine the alignment of these metrics, we created a comprehensive human evaluation framework. Employing ChatGPT-3.5-turbo generative output, we correlated human judgments with each metric. None of the metrics demonstrated high alignment; however, the SapBERT score-a Unified Medical Language System (UMLS)- showed the best results. This underscores the importance of incorporating domain-specific knowledge into evaluation efforts. Our work reveals the deficiency in quality evaluations for generated text and introduces our comprehensive human evaluation framework as a baseline. Future efforts should prioritize integrating medical knowledge databases to enhance the alignment of automated metrics, particularly focusing on refining the SapBERT score for improved assessments.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"309-318"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099413/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144496","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Anna Vaynrub, Subiksha Umakanth, Harry West, Alissa Michel, Jill Dimond, Stephan Constante, Katherine D Crew, Rita Kukafka
A critical strategy in limiting breast cancer (BC) mortality is the early identification of high-risk patients and implementation of risk-reducing measures. RealRisks, an online decision aid constructed by our team to provide education on BC risk and personalized risk assessment, allows users the option to connect to their electronic health record (EHR) to extract requisite data to calculate BC risk via Fast Healthcare Interoperability Resources (FHIR). Using data from RealRisks user profiles, baseline questionnaires, and interview transcripts, we sought to understand the differences between the groups of patients who opted to download their data via the EHR vs. those who did not. A higher percentage of those who downloaded data (53.8% vs. 42.3%) identified as Hispanic/Latino compared to those who did not download. Thematic analysis suggested that while data security and privacy concerns may lead to hesitation, it is perhaps technological barriers that most significantly limit EHR download.
限制乳腺癌(BC)死亡率的一个关键策略是早期识别高危患者并实施降低风险的措施。RealRisks是由我们的团队构建的在线决策辅助工具,旨在提供BC风险教育和个性化风险评估,允许用户选择连接到他们的电子健康记录(EHR),以提取必要的数据,通过快速医疗保健互操作性资源(FHIR)计算BC风险。使用来自RealRisks用户档案、基线问卷和访谈记录的数据,我们试图了解选择通过电子病历下载数据的患者组与不选择通过电子病历下载数据的患者组之间的差异。与没有下载数据的人相比,下载数据的人(53.8% vs. 42.3%)中西班牙裔/拉丁裔人的比例更高。专题分析表明,虽然数据安全和隐私问题可能导致犹豫,但技术障碍可能是限制电子病历下载的最重要因素。
{"title":"Factors Driving Patient Decisions to Access Electronic Health Records via a Breast Cancer Online Decision Aid linked to the Patient Portal.","authors":"Anna Vaynrub, Subiksha Umakanth, Harry West, Alissa Michel, Jill Dimond, Stephan Constante, Katherine D Crew, Rita Kukafka","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>A critical strategy in limiting breast cancer (BC) mortality is the early identification of high-risk patients and implementation of risk-reducing measures. <i>RealRisks</i>, an online decision aid constructed by our team to provide education on BC risk and personalized risk assessment, allows users the option to connect to their electronic health record (EHR) to extract requisite data to calculate BC risk via Fast Healthcare Interoperability Resources (FHIR). Using data from <i>RealRisks</i> user profiles, baseline questionnaires, and interview transcripts, we sought to understand the differences between the groups of patients who opted to download their data via the EHR vs. those who did not. A higher percentage of those who downloaded data (53.8% vs. 42.3%) identified as Hispanic/Latino compared to those who did not download. Thematic analysis suggested that while data security and privacy concerns may lead to hesitation, it is perhaps technological barriers that most significantly limit EHR download.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1159-1168"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099358/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144644","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Qiuhao Lu, Rui Li, Andrew Wen, Jinlian Wang, Liwei Wang, Hongfang Liu
Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity, complexity, and specificity pose considerable challenges. In the clinical domain, Named Entity Recognition (NER) stands out as an essential task and it plays a crucial role in extracting relevant information from clinical texts. Despite the promise of LLMs, current research mostly concentrates on document-level NER, identifying entities in a more general context across entire documents, without extracting their precise location. Additionally, efforts have been directed towards adapting ChatGPTfor token-level NER. However, there is a significant research gap when it comes to employing token-level NER for clinical texts, especially with the use of local open-source LLMs. This study aims to bridge this gap by investigating the effectiveness of both proprietary and local LLMs in token-level clinical NER. Essentially, we delve into the capabilities of these models through a series of experiments involving zero-shot prompting, few-shot prompting, retrieval-augmented generation (RAG), and instruction-fine-tuning. Our exploration reveals the inherent challenges LLMs face in token-level NER, particularly in the context of rare diseases, and suggests possible improvements for their application in healthcare. This research contributes to narrowing a significant gap in healthcare informatics and offers insights that could lead to a more refined application of LLMs in the healthcare sector.
{"title":"Large Language Models Struggle in Token-Level Clinical Named Entity Recognition.","authors":"Qiuhao Lu, Rui Li, Andrew Wen, Jinlian Wang, Liwei Wang, Hongfang Liu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Large Language Models (LLMs) have revolutionized various sectors, including healthcare where they are employed in diverse applications. Their utility is particularly significant in the context of rare diseases, where data scarcity, complexity, and specificity pose considerable challenges. In the clinical domain, Named Entity Recognition (NER) stands out as an essential task and it plays a crucial role in extracting relevant information from clinical texts. Despite the promise of LLMs, current research mostly concentrates on document-level NER, identifying entities in a more general context across entire documents, without extracting their precise location. Additionally, efforts have been directed towards adapting ChatGPTfor token-level NER. However, there is a significant research gap when it comes to employing token-level NER for clinical texts, especially with the use of local open-source LLMs. This study aims to bridge this gap by investigating the effectiveness of both proprietary and local LLMs in token-level clinical NER. Essentially, we delve into the capabilities of these models through a series of experiments involving zero-shot prompting, few-shot prompting, retrieval-augmented generation (RAG), and instruction-fine-tuning. Our exploration reveals the inherent challenges LLMs face in token-level NER, particularly in the context of rare diseases, and suggests possible improvements for their application in healthcare. This research contributes to narrowing a significant gap in healthcare informatics and offers insights that could lead to a more refined application of LLMs in the healthcare sector.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"748-757"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099373/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144361","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
The use of artificial intelligence (AI) in medicine has shown promise to improve the quality of healthcare decisions. However, AI can be biased in a manner that produces unfair predictions for certain demographic subgroups. In MIMIC-CXR, a publicly available dataset of over 300,000 chest X-ray images, diagnostic AI has been shown to have a higher false negative rate for racial minorities. We evaluated the capacity of synthetic data augmentation, oversampling, and demographic-based corrections to enhance the fairness of AI predictions. We show that adjusting unfair predictions for demographic attributes, such as race, is ineffective at improving fairness or predictive performance. However, using oversampling and synthetic data augmentation to modify disease prevalence reduced such disparities by 74.7% and 10.6%, respectively. Moreover, such fairness gains were accomplished without reduction in performance (95% CI AUC: [0.816, 0.820] versus [0.810, 0.819] versus [0.817, 0.821] for baseline, oversampling, and augmentation, respectively).
人工智能(AI)在医学领域的应用有望提高医疗保健决策的质量。然而,人工智能可能会以某种方式产生对某些人口统计子群体的不公平预测。MIMIC-CXR是一个公开的超过30万张胸部x射线图像数据集,在该数据集中,人工智能诊断对少数种族的假阴性率更高。我们评估了合成数据增强、过采样和基于人口统计的修正的能力,以提高人工智能预测的公平性。我们表明,调整人口统计属性(如种族)的不公平预测在提高公平性或预测性能方面是无效的。然而,使用过采样和合成数据增强来修改患病率,分别将这种差异缩小了74.7%和10.6%。此外,这种公平性的提高在不降低性能的情况下实现(95% CI AUC分别为基线、过采样和增强的[0.816,0.820]、[0.810,0.819]和[0.817,0.821])。
{"title":"Enhancement of Fairness in AI for Chest X-ray Classification.","authors":"Nicholas J Jackson, Chao Yan, Bradley A Malin","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>The use of artificial intelligence (AI) in medicine has shown promise to improve the quality of healthcare decisions. However, AI can be biased in a manner that produces unfair predictions for certain demographic subgroups. In MIMIC-CXR, a publicly available dataset of over 300,000 chest X-ray images, diagnostic AI has been shown to have a higher false negative rate for racial minorities. We evaluated the capacity of synthetic data augmentation, oversampling, and demographic-based corrections to enhance the fairness of AI predictions. We show that adjusting unfair predictions for demographic attributes, such as race, is ineffective at improving fairness or predictive performance. However, using oversampling and synthetic data augmentation to modify disease prevalence reduced such disparities by 74.7% and 10.6%, respectively. Moreover, such fairness gains were accomplished without reduction in performance (95% CI AUC: [0.816, 0.820] versus [0.810, 0.819] versus [0.817, 0.821] for baseline, oversampling, and augmentation, respectively).</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"551-560"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099404/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144579","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Ziyi Chen, Mengyuan Zhang, Mustafa Mohammed Ahmed, Yi Guo, Thomas J George, Jiang Bian, Yonghui Wu
Cancer treatments are known to introduce cardiotoxicity, negatively impacting outcomes and survivorship. Identifying cancer patients at risk of heart failure (HF) is critical to improving cancer treatment outcomes and safety. This study examined machine learning (ML) models to identify cancer patients at risk of HF using electronic health records (EHRs), including traditional ML, Time-Aware long short-term memory (T-LSTM), and large language models (LLMs) using novel narrative features derived from the structured medical codes. We identified a cancer cohort of 12,806 patients from the University of Florida Health, diagnosed with lung, breast, and colorectal cancers, among which 1,602 individuals developed HF after cancer. The LLM, GatorTron-3.9B, achieved the best F1 scores, outperforming the traditional support vector machines by 39%, the T-LSTM deep learning model by 7%, and a widely used transformer model, BERT, by 5.6%. The analysis shows that the proposed narrative features remarkably increased feature density and improved performance.
{"title":"Narrative Feature or Structured Feature? A Study of Large Language Models to Identify Cancer Patients at Risk of Heart Failure.","authors":"Ziyi Chen, Mengyuan Zhang, Mustafa Mohammed Ahmed, Yi Guo, Thomas J George, Jiang Bian, Yonghui Wu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Cancer treatments are known to introduce cardiotoxicity, negatively impacting outcomes and survivorship. Identifying cancer patients at risk of heart failure (HF) is critical to improving cancer treatment outcomes and safety. This study examined machine learning (ML) models to identify cancer patients at risk of HF using electronic health records (EHRs), including traditional ML, Time-Aware long short-term memory (T-LSTM), and large language models (LLMs) using novel narrative features derived from the structured medical codes. We identified a cancer cohort of 12,806 patients from the University of Florida Health, diagnosed with lung, breast, and colorectal cancers, among which 1,602 individuals developed HF after cancer. The LLM, GatorTron-3.9B, achieved the best F1 scores, outperforming the traditional support vector machines by 39%, the T-LSTM deep learning model by 7%, and a widely used transformer model, BERT, by 5.6%. The analysis shows that the proposed narrative features remarkably increased feature density and improved performance.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"242-251"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099403/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Hojjat Salmasian, Carmina Erdei, Joanne R Applebaum, Danielle Sharon, Katie Hannon, Deborah Cuddyer, Mary Sawyer, Tina Steele, Yvonne Sheldon, I-Fong S Lehman, Joseph E Schwartz, Allen Chen, Jason Adelman
As part of a randomized controlled trial on the use of pictographs (images used in lieu of a patient photo) embedded in the electronic health record to reduce wrong-patient errors in the neonatal intensive care unit (NICU), we conducted a series of surveys of parents, providers and nurses in the NICU. Data from survey responses were thematically analyzed and categorized. We found that in all groups, there was very high awareness of the intended purpose of the pictographs; however, the perception of providers and nurses about the effectiveness of pictographs was not as strong. While several providers and nurses acknowledged that pictographs can or have helped them avoid wrong-patient errors when caring for multiple birth infants (such as twins), many nurses believed that their current practice of the use of two patient identifiers was sufficient, and pictographs were not useful. Parents reported that pictographs improved their experience of care.
{"title":"Acceptability of pictographs as a novel patient identifier to improve patient safety in the neonatal intensive care unit.","authors":"Hojjat Salmasian, Carmina Erdei, Joanne R Applebaum, Danielle Sharon, Katie Hannon, Deborah Cuddyer, Mary Sawyer, Tina Steele, Yvonne Sheldon, I-Fong S Lehman, Joseph E Schwartz, Allen Chen, Jason Adelman","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>As part of a randomized controlled trial on the use of pictographs (images used in lieu of a patient photo) embedded in the electronic health record to reduce wrong-patient errors in the neonatal intensive care unit (NICU), we conducted a series of surveys of parents, providers and nurses in the NICU. Data from survey responses were thematically analyzed and categorized. We found that in all groups, there was very high awareness of the intended purpose of the pictographs; however, the perception of providers and nurses about the effectiveness of pictographs was not as strong. While several providers and nurses acknowledged that pictographs can or have helped them avoid wrong-patient errors when caring for multiple birth infants (such as twins), many nurses believed that their current practice of the use of two patient identifiers was sufficient, and pictographs were not useful. Parents reported that pictographs improved their experience of care.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"980-986"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099398/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144692","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}