AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science最新文献_第8页

Multimodal Data Hybrid Fusion and Natural Language Processing for Clinical Prediction Models. 用于临床预测模型的多模态数据混合融合与自然语言处理。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Jiancheng Ye, Jiarui Hai, Jiacheng Song, Zidan Wang

This study aims to propose a novel approach for enhancing clinical prediction models by combining structured and unstructured data with multimodal data fusion. We presented a comprehensive framework that integrated multimodal data sources, including textual clinical notes, structured electronic health records (EHRs), and relevant clinical data from National Electronic Injury Surveillance System (NEISS) datasets. We proposed a novel hybrid fusion method, which incorporated state-of-the-art pre-trained language model, to integrate unstructured clinical text with structured EHR data and other multimodal sources, thereby capturing a more comprehensive representation of patient information. The experimental results demonstrated that the hybrid fusion approach significantly improved the performance of clinical prediction models compared to traditional fusion frameworks and unimodal models that rely solely on structured data or text information alone. The proposed hybrid fusion system with RoBERTa language encoder achieved the best prediction of the Top 1 injury with an accuracy of 75.00% and Top 3 injuries with an accuracy of 93.54%. Our study highlights the potential of integrating natural language processing (NLP) techniques with multimodal data fusion for enhancing clinical prediction models' performances. By leveraging the rich information present in clinical text and combining it with structured EHR data, the proposed approach can improve the accuracy and robustness of predictive models. The approach has the potential to advance clinical decision support systems, enable personalized medicine, and facilitate evidence-based health care practices. Future research can further explore the application of this hybrid fusion approach in real-world clinical settings and investigate its impact on improving patient outcomes.

本研究旨在提出一种新方法，通过多模态数据融合将结构化和非结构化数据结合起来，从而增强临床预测模型。我们提出了一个综合框架，该框架整合了多模态数据源，包括文本临床笔记、结构化电子健康记录（EHR）以及来自国家电子伤害监测系统（NEISS）数据集的相关临床数据。我们提出了一种新颖的混合融合方法，该方法结合了最先进的预训练语言模型，将非结构化临床文本与结构化电子病历数据和其他多模态数据源整合在一起，从而更全面地呈现患者信息。实验结果表明，与传统的融合框架和仅依赖结构化数据或文本信息的单模态模型相比，混合融合方法显著提高了临床预测模型的性能。使用 RoBERTa 语言编码器的混合融合系统对前 1 名损伤的预测准确率达到 75.00%，对前 3 名损伤的预测准确率达到 93.54%。我们的研究强调了自然语言处理（NLP）技术与多模态数据融合在提高临床预测模型性能方面的潜力。通过利用临床文本中的丰富信息并将其与结构化电子病历数据相结合，所提出的方法可以提高预测模型的准确性和稳健性。该方法有望推动临床决策支持系统的发展，实现个性化医疗，促进循证医疗实践。未来的研究可以进一步探索这种混合融合方法在实际临床环境中的应用，并研究其对改善患者预后的影响。

{"title":"Multimodal Data Hybrid Fusion and Natural Language Processing for Clinical Prediction Models.","authors":"Jiancheng Ye, Jiarui Hai, Jiacheng Song, Zidan Wang","doi":"","DOIUrl":"","url":null,"abstract":"This study aims to propose a novel approach for enhancing clinical prediction models by combining structured and unstructured data with multimodal data fusion. We presented a comprehensive framework that integrated multimodal data sources, including textual clinical notes, structured electronic health records (EHRs), and relevant clinical data from National Electronic Injury Surveillance System (NEISS) datasets. We proposed a novel hybrid fusion method, which incorporated state-of-the-art pre-trained language model, to integrate unstructured clinical text with structured EHR data and other multimodal sources, thereby capturing a more comprehensive representation of patient information. The experimental results demonstrated that the hybrid fusion approach significantly improved the performance of clinical prediction models compared to traditional fusion frameworks and unimodal models that rely solely on structured data or text information alone. The proposed hybrid fusion system with RoBERTa language encoder achieved the best prediction of the Top 1 injury with an accuracy of 75.00% and Top 3 injuries with an accuracy of 93.54%. Our study highlights the potential of integrating natural language processing (NLP) techniques with multimodal data fusion for enhancing clinical prediction models' performances. By leveraging the rich information present in clinical text and combining it with structured EHR data, the proposed approach can improve the accuracy and robustness of predictive models. The approach has the potential to advance clinical decision support systems, enable personalized medicine, and facilitate evidence-based health care practices. Future research can further explore the application of this hybrid fusion approach in real-world clinical settings and investigate its impact on improving patient outcomes.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"191-200"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141806/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201192","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Prioritizing Clinically Significant Lung Cancer Somatic Mutations for Targeted Therapy Through Efficient NGS Data Filtering System. 通过高效的 NGS 数据过滤系统优先选择具有临床意义的肺癌体细胞突变进行靶向治疗

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Jinlian Wang, Hui Li, Hongfang Liu

In the realm of lung cancer treatment, where genetic heterogeneity presents formidable challenges, precision oncology demands an exacting approach to identify and hierarchically sort clinically significant somatic mutations. Current Next-Generation Sequencing (NGS) data filtering pipelines, while utilizing various external databases for mutation screening, often fall short in comprehensive integration and flexibility needed to keep pace with the evolving landscape of clinical data. Our study introduces a sophisticated NGS data filtering system, which not only aggregates but effectively synergizes diverse data sources, encompassing genetic variants, gene functions, clinical evidence, and an extensive body of literature. This system is distinguished by a unique algorithm that facilitates a rigorous, multi-tiered filtration process. This allows for the efficient prioritization of 420 genes and 1,193 variants from large datasets, with a particular focus on 80 variants demonstrating high clinical actionability. These variants have been aligned with FDA approvals, NCCN guidelines, and thoroughly reviewed literature, thereby equipping oncologists with a refined arsenal for targeted therapy decisions. The innovation of our system lies in its dynamic integration framework and its algorithm, tailored to emphasize clinical utility and actionability-a nuanced approach often lacking in existing methodologies. Our validation on real-world lung adenocarcinoma NGS datasets has shown not only an enhanced efficiency in identifying genetic targets but also the potential to streamline clinical workflows, thus propelling the advancement of precision oncology. Planned future enhancements include expanding the range of integrated data types and developing a user-friendly interface, aiming to facilitate easier access to data and promote collaborative efforts in tailoring cancer treatments.

在肺癌治疗领域，遗传异质性带来了严峻的挑战，精准肿瘤学需要一种精确的方法来识别和分层分类具有临床意义的体细胞突变。目前的下一代测序（NGS）数据筛选管道虽然利用了各种外部数据库进行突变筛选，但往往缺乏所需的全面整合性和灵活性，无法跟上临床数据不断发展的步伐。我们的研究介绍了一种先进的 NGS 数据筛选系统，它不仅能聚合不同的数据源，还能有效协同，包括基因变异、基因功能、临床证据和大量文献。该系统的独特之处在于它采用了一种独特的算法，可进行严格的多层过滤。这样就能从大型数据集中有效地优先筛选出 420 个基因和 1,193 个变异体，尤其是 80 个显示出高度临床可操作性的变异体。这些变异与 FDA 批准、NCCN 指南和经过全面审查的文献相一致，从而为肿瘤学家的靶向治疗决策提供了精良的武器。我们系统的创新之处在于其动态整合框架及其算法，该算法量身定制，强调临床实用性和可操作性--这是现有方法通常缺乏的一种细致入微的方法。我们在真实世界肺腺癌 NGS 数据集上的验证结果表明，该系统不仅提高了识别基因靶点的效率，还具有简化临床工作流程的潜力，从而推动了精准肿瘤学的发展。未来的改进计划包括扩大集成数据类型的范围和开发用户友好型界面，旨在方便数据访问，促进癌症治疗的定制化合作。

{"title":"Prioritizing Clinically Significant Lung Cancer Somatic Mutations for Targeted Therapy Through Efficient NGS Data Filtering System.","authors":"Jinlian Wang, Hui Li, Hongfang Liu","doi":"","DOIUrl":"","url":null,"abstract":"In the realm of lung cancer treatment, where genetic heterogeneity presents formidable challenges, precision oncology demands an exacting approach to identify and hierarchically sort clinically significant somatic mutations. Current Next-Generation Sequencing (NGS) data filtering pipelines, while utilizing various external databases for mutation screening, often fall short in comprehensive integration and flexibility needed to keep pace with the evolving landscape of clinical data. Our study introduces a sophisticated NGS data filtering system, which not only aggregates but effectively synergizes diverse data sources, encompassing genetic variants, gene functions, clinical evidence, and an extensive body of literature. This system is distinguished by a unique algorithm that facilitates a rigorous, multi-tiered filtration process. This allows for the efficient prioritization of 420 genes and 1,193 variants from large datasets, with a particular focus on 80 variants demonstrating high clinical actionability. These variants have been aligned with FDA approvals, NCCN guidelines, and thoroughly reviewed literature, thereby equipping oncologists with a refined arsenal for targeted therapy decisions. The innovation of our system lies in its dynamic integration framework and its algorithm, tailored to emphasize clinical utility and actionability-a nuanced approach often lacking in existing methodologies. Our validation on real-world lung adenocarcinoma NGS datasets has shown not only an enhanced efficiency in identifying genetic targets but also the potential to streamline clinical workflows, thus propelling the advancement of precision oncology. Planned future enhancements include expanding the range of integrated data types and developing a user-friendly interface, aiming to facilitate easier access to data and promote collaborative efforts in tailoring cancer treatments.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"305-313"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141846/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201206","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Association of Diagnostic Discrepancy with Length of Stay and Mortality in Congestive Heart Failure Patients Admitted to the Emergency Department. 急诊科收治的充血性心力衰竭患者的诊断不一致与住院时间和死亡率的关系。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Joseph Finkelstein, Wanting Cui, Jeffrey P Ferraro, Kensaku Kawamoto

The goal of this study was to analyze diagnostic discrepancies between emergency department (ED) and hospital discharge diagnoses in patients with congestive heart failure admitted to the ED. Using a synthetic dataset from the Department of Veterans Affairs, the patients' primary diagnoses were compared at two levels: diagnostic category and body system. With 12,621 patients and 24,235 admission cases, the study found a 58% mismatch rate at the category level, which was reduced to 30% at the body system level. Diagnostic categories associated with higher levels of mismatch included aplastic anemia, pneumonia, and bacterial infections. In contrast, diagnostic categories associated with lower levels of mismatch included alcohol-related disorders, COVID-19, cardiac dysrhythmias, and gastrointestinal hemorrhage. Further investigation revealed that diagnostic mismatches are associated with longer hospital stays and higher mortality rates. These findings highlight the importance of reducing diagnostic uncertainty, particularly in specific diagnostic categories and body systems, to improve patient care following ED admission.

本研究的目的是分析急诊科（ED）和医院对急诊科收治的充血性心力衰竭患者的出院诊断之间的差异。利用退伍军人事务部的合成数据集，从诊断类别和身体系统两个层面对患者的主要诊断进行了比较。在 12,621 名患者和 24,235 个入院病例中，研究发现类别层面的不匹配率为 58%，而身体系统层面的不匹配率则降至 30%。不匹配程度较高的诊断类别包括再生障碍性贫血、肺炎和细菌感染。与此相反，不匹配程度较低的诊断类别包括酒精相关疾病、COVID-19、心律失常和消化道出血。进一步调查显示，诊断不匹配与住院时间延长和死亡率升高有关。这些发现强调了减少诊断不确定性的重要性，尤其是在特定诊断类别和身体系统方面，以改善急诊室入院后的病人护理。

{"title":"Association of Diagnostic Discrepancy with Length of Stay and Mortality in Congestive Heart Failure Patients Admitted to the Emergency Department.","authors":"Joseph Finkelstein, Wanting Cui, Jeffrey P Ferraro, Kensaku Kawamoto","doi":"","DOIUrl":"","url":null,"abstract":"The goal of this study was to analyze diagnostic discrepancies between emergency department (ED) and hospital discharge diagnoses in patients with congestive heart failure admitted to the ED. Using a synthetic dataset from the Department of Veterans Affairs, the patients' primary diagnoses were compared at two levels: diagnostic category and body system. With 12,621 patients and 24,235 admission cases, the study found a 58% mismatch rate at the category level, which was reduced to 30% at the body system level. Diagnostic categories associated with higher levels of mismatch included aplastic anemia, pneumonia, and bacterial infections. In contrast, diagnostic categories associated with lower levels of mismatch included alcohol-related disorders, COVID-19, cardiac dysrhythmias, and gastrointestinal hemorrhage. Further investigation revealed that diagnostic mismatches are associated with longer hospital stays and higher mortality rates. These findings highlight the importance of reducing diagnostic uncertainty, particularly in specific diagnostic categories and body systems, to improve patient care following ED admission.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"155-161"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141848/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201478","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

FAIR privacy-preserving operation of large genomic variant calling format (VCF) data without download or installation. 无需下载或安装，即可对大型基因组变异调用格式 (VCF) 数据进行 FAIR 隐私保护操作。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Yasmmin C Martins, Praphulla Ms Bhawsar, Jeya B Balasubramanian, Daniel Russ, Wendy Sw Wong, Wolfgang Maass, Jonas S Almeida

Motivation: The proliferation of genetic testing and consumer genomics represents a logistic challenge to the personalized use of GWAS data in VCF format. Specifically, the challenge of retrieving target genetic variation from large compressed files filled with unrelated variation information. Compounding the data traversal challenge, privacy-sensitive VCF files are typically managed as large stand-alone single files (no companion index file) composed of variable-sized compressed chunks, hosted in consumer-facing environments with no native support for hosted execution. Results: A portable JavaScript module was developed to support in-browser fetching of partial content using byte-range requests. This includes on-the-fly decompressing irregularly positioned compressed chunks, coupled with a binary search algorithm iteratively identifying chromosome-position ranges. The in-browser zero-footprint solution (no downloads, no installations) enables the interoperability, reusability, and user-facing governance advanced by the FAIR principles for stewardship of scientific data. Availability - https://episphere.github.io/vcf, including supplementary material.

动机基因检测和消费者基因组学的普及给个性化使用 VCF 格式的 GWAS 数据带来了物流方面的挑战。具体来说，就是从充满无关变异信息的大型压缩文件中检索目标遗传变异的挑战而言。使数据遍历难题更加复杂的是，隐私敏感的 VCF 文件通常是作为独立的大型单个文件（没有配套的索引文件）来管理的，这些文件由大小不一的压缩块组成，托管在面向消费者的环境中，不支持本地托管执行。结果开发了一个可移植的 JavaScript 模块，支持使用字节范围请求在浏览器中获取部分内容。这包括对位置不规则的压缩块进行即时解压缩，并采用二进制搜索算法迭代识别染色体位置范围。浏览器内的零足迹解决方案（无需下载、无需安装）实现了互操作性、可重用性和面向用户的管理，这些都是科学数据管理的 FAIR 原则所倡导的。可用性 - https://episphere.github.io/vcf，包括补充材料。

{"title":"FAIR privacy-preserving operation of large genomic variant calling format (VCF) data without download or installation.","authors":"Yasmmin C Martins, Praphulla Ms Bhawsar, Jeya B Balasubramanian, Daniel Russ, Wendy Sw Wong, Wolfgang Maass, Jonas S Almeida","doi":"","DOIUrl":"","url":null,"abstract":"Motivation: The proliferation of genetic testing and consumer genomics represents a logistic challenge to the personalized use of GWAS data in VCF format. Specifically, the challenge of retrieving target genetic variation from large compressed files filled with unrelated variation information. Compounding the data traversal challenge, privacy-sensitive VCF files are typically managed as large stand-alone single files (no companion index file) composed of variable-sized compressed chunks, hosted in consumer-facing environments with no native support for hosted execution. Results: A portable JavaScript module was developed to support in-browser fetching of partial content using byte-range requests. This includes on-the-fly decompressing irregularly positioned compressed chunks, coupled with a binary search algorithm iteratively identifying chromosome-position ranges. The in-browser zero-footprint solution (no downloads, no installations) enables the interoperability, reusability, and user-facing governance advanced by the FAIR principles for stewardship of scientific data. Availability - https://episphere.github.io/vcf, including supplementary material.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"65-74"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141832/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141200821","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Improving Automating Quality Control in Radiology: Leveraging Large Language Models to Extract Correlative Findings in Radiology and Operative Reports. 改进放射学质量控制自动化：利用大型语言模型提取放射学和手术报告中的相关结果。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Niloufar Eghbali, Chad Klochko, Perra Razoky, Prateek Chintalapati, Efan Jawad, Zaid Mahdi, Joseph Craig, Mohammad M Ghassemi

Radiology Imaging plays a pivotal role in medical diagnostics, providing clinicians with insights into patient health and guiding the next steps in treatment. The true value of a radiological image lies in the accuracy of its accompanying report. To ensure the reliability of these reports, they are often cross-referenced with operative findings. The conventional method of manually comparing radiology and operative reports is labor-intensive and demands specialized knowledge. This study explores the potential of a Large Language Model (LLM) to simplify the radiology evaluation process by automatically extracting pertinent details from these reports, focusing especially on the shoulder's primary anatomical structures. A fine-tuned LLM identifies mentions of the supraspinatus tendon, infraspinatus tendon, subscapularis tendon, biceps tendon, and glenoid labrum in lengthy radiology and operative documents. Initial findings emphasize the model's capability to pinpoint relevant data, suggesting a transformative approach to the typical evaluation methods in radiology.

放射成像在医疗诊断中起着举足轻重的作用，它能让临床医生深入了解病人的健康状况，并指导下一步的治疗。放射影像的真正价值在于其随附报告的准确性。为确保这些报告的可靠性，通常需要与手术结果进行交叉对比。人工对比放射报告和手术报告的传统方法需要大量人力和专业知识。本研究探索了大语言模型（LLM）的潜力，通过自动提取这些报告中的相关细节，特别是肩部的主要解剖结构，来简化放射学评估过程。经过微调的 LLM 可以识别冗长的放射学和手术文件中提到的冈上肌腱、冈下肌腱、肩胛下肌腱、肱二头肌肌腱和盂唇。初步发现强调了该模型精确定位相关数据的能力，并建议对放射学中的典型评估方法进行改革。

{"title":"Improving Automating Quality Control in Radiology: Leveraging Large Language Models to Extract Correlative Findings in Radiology and Operative Reports.","authors":"Niloufar Eghbali, Chad Klochko, Perra Razoky, Prateek Chintalapati, Efan Jawad, Zaid Mahdi, Joseph Craig, Mohammad M Ghassemi","doi":"","DOIUrl":"","url":null,"abstract":"Radiology Imaging plays a pivotal role in medical diagnostics, providing clinicians with insights into patient health and guiding the next steps in treatment. The true value of a radiological image lies in the accuracy of its accompanying report. To ensure the reliability of these reports, they are often cross-referenced with operative findings. The conventional method of manually comparing radiology and operative reports is labor-intensive and demands specialized knowledge. This study explores the potential of a Large Language Model (LLM) to simplify the radiology evaluation process by automatically extracting pertinent details from these reports, focusing especially on the shoulder's primary anatomical structures. A fine-tuned LLM identifies mentions of the supraspinatus tendon, infraspinatus tendon, subscapularis tendon, biceps tendon, and glenoid labrum in lengthy radiology and operative documents. Initial findings emphasize the model's capability to pinpoint relevant data, suggesting a transformative approach to the typical evaluation methods in radiology.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"135-144"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141845/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

The role of artificial intelligence for the application of integrating electronic health records and patient-generated data in clinical decision support. 人工智能在临床决策支持中整合电子健康记录和患者生成数据的应用中的作用。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Jiancheng Ye, Donna Woods, Neil Jordan, Justin Starren

This narrative review aims to identify and understand the role of artificial intelligence in the application of integrated electronic health records (EHRs) and patient-generated health data (PGHD) in clinical decision support. We focused on integrated data that combined PGHD and EHR data, and we investigated the role of artificial intelligence (AI) in the application. We used the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines to search articles in six databases: PubMed, Embase, Web of Science, Scopus, ACM Digital Library, and IEEE Computer Society Digital Library. In addition, we also synthesized seminal sources, including other systematic reviews, reports, and white papers, to inform the context, history, and development of this field. Twenty-six publications met the review criteria after screening. The EHR-integrated PGHD introduces benefits to health care, including empowering patients and families to engage via shared decision-making, improving the patient-provider relationship, and reducing the time and cost of clinical visits. AI's roles include cleaning and management of heterogeneous datasets, assisting in identifying dynamic patterns to improve clinical care processes, and providing more sophisticated algorithms to better predict outcomes and propose precise recommendations based on the integrated data. Challenges mainly stem from the large volume of integrated data, data standards, data exchange and interoperability, security and privacy, interpretation, and meaningful use. The use of PGHD in health care is at a promising stage but needs further work for widespread adoption and seamless integration into health care systems. AI-driven, EHR-integrated PGHD systems can greatly improve clinicians' abilities to diagnose patients' health issues, classify risks at the patient level by drawing on the power of integrated data, and provide much-needed support to clinics and hospitals. With EHR-integrated PGHD, AI can help transform health care by improving diagnosis, treatment, and the delivery of clinical care, thus improving clinical decision support.

本综述旨在确定和了解人工智能在临床决策支持中应用集成电子健康记录（EHR）和患者生成的健康数据（PGHD）方面的作用。我们将重点放在结合了 PGHD 和 EHR 数据的集成数据上，并研究了人工智能 (AI) 在应用中的作用。我们采用系统综述和荟萃分析首选报告项目（PRISMA）指南在六个数据库中搜索文章：PubMed、Embase、Web of Science、Scopus、ACM 数字图书馆和 IEEE 计算机协会数字图书馆。此外，我们还综合了其他系统综述、报告和白皮书等开创性资料，以了解该领域的背景、历史和发展。经过筛选，26 篇出版物符合审查标准。整合了电子病历的 PGHD 为医疗保健带来了诸多益处，包括通过共同决策增强患者和家属的参与能力，改善患者与医疗服务提供者之间的关系，以及减少临床就诊的时间和成本。人工智能的作用包括清理和管理异构数据集，协助识别动态模式以改进临床护理流程，以及提供更复杂的算法以更好地预测结果并根据集成数据提出精确建议。所面临的挑战主要来自大量的集成数据、数据标准、数据交换和互操作性、安全性和隐私性、解释和有意义的使用。PGHD 在医疗保健领域的应用正处于大有可为的阶段，但还需要进一步努力才能得到广泛应用并无缝集成到医疗保健系统中。人工智能驱动的、整合了电子病历的 PGHD 系统可以大大提高临床医生诊断病人健康问题的能力，通过利用整合数据的力量对病人层面的风险进行分类，并为诊所和医院提供急需的支持。通过与电子病历集成的 PGHD，人工智能可以改善诊断、治疗和临床护理的提供，从而改善临床决策支持，从而帮助改变医疗保健。

{"title":"The role of artificial intelligence for the application of integrating electronic health records and patient-generated data in clinical decision support.","authors":"Jiancheng Ye, Donna Woods, Neil Jordan, Justin Starren","doi":"","DOIUrl":"","url":null,"abstract":"This narrative review aims to identify and understand the role of artificial intelligence in the application of integrated electronic health records (EHRs) and patient-generated health data (PGHD) in clinical decision support. We focused on integrated data that combined PGHD and EHR data, and we investigated the role of artificial intelligence (AI) in the application. We used the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines to search articles in six databases: PubMed, Embase, Web of Science, Scopus, ACM Digital Library, and IEEE Computer Society Digital Library. In addition, we also synthesized seminal sources, including other systematic reviews, reports, and white papers, to inform the context, history, and development of this field. Twenty-six publications met the review criteria after screening. The EHR-integrated PGHD introduces benefits to health care, including empowering patients and families to engage via shared decision-making, improving the patient-provider relationship, and reducing the time and cost of clinical visits. AI's roles include cleaning and management of heterogeneous datasets, assisting in identifying dynamic patterns to improve clinical care processes, and providing more sophisticated algorithms to better predict outcomes and propose precise recommendations based on the integrated data. Challenges mainly stem from the large volume of integrated data, data standards, data exchange and interoperability, security and privacy, interpretation, and meaningful use. The use of PGHD in health care is at a promising stage but needs further work for widespread adoption and seamless integration into health care systems. AI-driven, EHR-integrated PGHD systems can greatly improve clinicians' abilities to diagnose patients' health issues, classify risks at the patient level by drawing on the power of integrated data, and provide much-needed support to clinics and hospitals. With EHR-integrated PGHD, AI can help transform health care by improving diagnosis, treatment, and the delivery of clinical care, thus improving clinical decision support.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"459-467"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141850/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201285","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Using Discrete Event Simulation to Design and Assess an AI-aided Workflow for Same-day Diagnostic Testing of Women Undergoing Breast Screening. 利用离散事件模拟设计和评估人工智能辅助工作流程，为接受乳腺筛查的妇女提供当天诊断测试。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Yannan Lin, Anne C Hoyt, Vladimir G Manuel, Moira Inkelas, William Hsu

The process of patients waiting for diagnostic examinations after an abnormal screening mammogram is inefficient and anxiety-inducing. Artificial intelligence (AI)-aided interpretation of screening mammography could reduce the number of recalls after screening. We proposed a same-day diagnostic workup to alleviate patient anxiety by employing an AI-aided interpretation to reduce unnecessary diagnostic testing after an abnormal screening mammogram. However, the potential unintended consequences of introducing this workflow in a high-volume breast imaging center are unknown. Using discrete event simulation, we observed that implementing the AI-aided screening mammogram interpretation and same-day diagnostic workflow would reduce daily patient volume by 4%, increase the time a patient would be at the clinic by 24%, and increase waiting times by 13-31%. We discuss how changing the hours of operation and introducing new imaging equipment and personnel may alleviate these negative impacts.

患者在乳房X光筛查异常后等待诊断检查的过程既低效又令人焦虑。人工智能（AI）辅助筛查乳腺 X 射线摄影的解读可以减少筛查后的召回次数。我们提出了一种当天诊断工作法，通过采用人工智能辅助判读来减轻患者的焦虑，从而减少乳房X光筛查异常后不必要的诊断检测。然而，在高容量乳腺成像中心引入这一工作流程可能产生的意外后果尚不清楚。通过离散事件模拟，我们观察到，实施人工智能辅助筛查乳腺 X 光片解读和当天诊断工作流程会使每天的患者量减少 4%，患者在诊所的停留时间增加 24%，等候时间增加 13-31%。我们将讨论如何通过改变营业时间、引进新的成像设备和人员来减轻这些负面影响。

{"title":"Using Discrete Event Simulation to Design and Assess an AI-aided Workflow for Same-day Diagnostic Testing of Women Undergoing Breast Screening.","authors":"Yannan Lin, Anne C Hoyt, Vladimir G Manuel, Moira Inkelas, William Hsu","doi":"","DOIUrl":"","url":null,"abstract":"The process of patients waiting for diagnostic examinations after an abnormal screening mammogram is inefficient and anxiety-inducing. Artificial intelligence (AI)-aided interpretation of screening mammography could reduce the number of recalls after screening. We proposed a same-day diagnostic workup to alleviate patient anxiety by employing an AI-aided interpretation to reduce unnecessary diagnostic testing after an abnormal screening mammogram. However, the potential unintended consequences of introducing this workflow in a high-volume breast imaging center are unknown. Using discrete event simulation, we observed that implementing the AI-aided screening mammogram interpretation and same-day diagnostic workflow would reduce daily patient volume by 4%, increase the time a patient would be at the clinic by 24%, and increase waiting times by 13-31%. We discuss how changing the hours of operation and introducing new imaging equipment and personnel may alleviate these negative impacts.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"314-323"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141813/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141201267","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Deep Learning Approaches to Predict Exercise Exertion Levels Using Wearable Physiological Data. 利用可穿戴生理数据预测运动消耗水平的深度学习方法。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Aref Smiley, Joseph Finkelstein

Using physiological data from wearable devices, the study aimed to predict exercise exertion levels by building deep learning classification and regression models. Physiological data were obtained using an unobtrusive chest-worn ECG sensor and portable pulse oximeter from healthy individuals who performed 16-minute cycling exercise sessions. During each session, real-time ECG, pulse rate, oxygen saturation, and revolutions per minute (RPM) data were collected at three intensity levels. Subjects' ratings of perceived exertion (RPE) were collected once per minute. Each 16-minute exercise session was divided into eight 2-minute windows. The self-reported RPEs, heart rate, RPMs, and oxygen saturation levels were averaged for each window to form the predictive features. In addition, heart rate variability (HRV) features were extracted from the ECG for each window. Different feature selection algorithms were used to choose top-ranked predictors. The best predictors were then used to train and test deep learning models for regression and classification analysis. Our results showed the highest accuracy and F1 score of 98.2% and 98%, respectively in training the models. For testing the models, the highest accuracy and F1 score were 80%.

这项研究旨在利用可穿戴设备提供的生理数据，通过建立深度学习分类和回归模型来预测运动消耗水平。研究人员使用非侵入性胸戴式心电图传感器和便携式脉搏血氧仪获取了健康人的生理数据，这些健康人进行了 16 分钟的自行车运动。在每次运动过程中，以三种强度水平收集实时心电图、脉搏率、血氧饱和度和每分钟转数（RPM）数据。受试者的体力感知评分（RPE）每分钟收集一次。每个 16 分钟的运动时段被分为 8 个 2 分钟的窗口。对每个窗口的自我报告的 RPE、心率、转速和血氧饱和度水平进行平均，以形成预测特征。此外，还从每个窗口的心电图中提取了心率变异性（HRV）特征。使用不同的特征选择算法来选择排名靠前的预测因子。最佳预测因子随后被用于训练和测试深度学习模型，以进行回归和分类分析。我们的结果显示，在训练模型时，最高准确率和 F1 分数分别为 98.2% 和 98%。在测试模型时，最高准确率和 F1 分数均为 80%。

{"title":"Deep Learning Approaches to Predict Exercise Exertion Levels Using Wearable Physiological Data.","authors":"Aref Smiley, Joseph Finkelstein","doi":"","DOIUrl":"","url":null,"abstract":"Using physiological data from wearable devices, the study aimed to predict exercise exertion levels by building deep learning classification and regression models. Physiological data were obtained using an unobtrusive chest-worn ECG sensor and portable pulse oximeter from healthy individuals who performed 16-minute cycling exercise sessions. During each session, real-time ECG, pulse rate, oxygen saturation, and revolutions per minute (RPM) data were collected at three intensity levels. Subjects' ratings of perceived exertion (RPE) were collected once per minute. Each 16-minute exercise session was divided into eight 2-minute windows. The self-reported RPEs, heart rate, RPMs, and oxygen saturation levels were averaged for each window to form the predictive features. In addition, heart rate variability (HRV) features were extracted from the ECG for each window. Different feature selection algorithms were used to choose top-ranked predictors. The best predictors were then used to train and test deep learning models for regression and classification analysis. Our results showed the highest accuracy and F1 score of 98.2% and 98%, respectively in training the models. For testing the models, the highest accuracy and F1 score were 80%.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2024 ","pages":"419-428"},"PeriodicalIF":0.0,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141804/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"141200160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Identifying and Characterizing the Transgender and Nonbinary Population Presenting to Pediatric Psychiatry Emergency Services. 识别和描述向儿科精神科急诊服务求诊的变性和非二元人群。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Wyatt Kim, Kathleen R Donise, Katherine A Brown, Mary Kathryn Cancilliere, Elizabeth S Chen

Transgender and nonbinary (TGNB) individuals have an increased risk of certain mental health outcomes, such as depression and suicide attempts. This population skews younger in the United States and prior studies have not included TGNB patients for the entire pediatric age range in an emergency department (ED) setting. The present study aimed to examine gender identity documentation in the electronic health record and then use that information to identify and further characterize the pediatric TGNB population presenting to a psychiatric emergency service. Preliminary findings include a greater percentage of TGNB patients compared to non-TGNB individuals who had repeat visits to the ED for high acuity psychiatric concerns. A larger portion of TGNB patients also had at least one evaluation that included suicidal ideation. These results call for increased attention on the quality of mental healthcare for TGNB youth both inside and outside of the ED.

变性人和非二元性（TGNB）人患抑郁症和自杀未遂等某些心理健康后果的风险会增加。在美国，变性人和非二元性（TGNB）人群偏向年轻化，之前的研究并未将整个儿科年龄段的变性人和非二元性（TGNB）患者纳入急诊科（ED）的研究范围。本研究旨在检查电子健康记录中的性别认同文件，然后利用这些信息来识别和进一步描述在精神科急诊就诊的儿科 TGNB 患者。初步研究结果表明，与非 TGNB 患者相比，TGNB 患者中因高度精神问题重复到急诊室就诊的比例更高。更多的 TGNB 患者还至少接受过一次包括自杀意念在内的评估。这些结果呼吁人们更加关注急诊室内外为非华裔黑人青少年提供的心理保健服务的质量。

引用次数: 0

Interpretability Study for Long Interview Transcripts from Behavior Intervention Sessions for Family Caregivers of Dementia Patients. 痴呆症患者家庭护理人员行为干预课程长访谈记录的可解读性研究。

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

Pub Date : 2024-05-31 eCollection Date: 2024-01-01

Weiqing He, Bojian Hou, George Demiris, Li Shen

Mental health challenges are significant global public health concerns, affecting millions of people and impacting individuals, families, and communities alike. Therapists play a crucial role in supporting those with mental health issues by providing emotional, practical, and financial assistance, as well as facilitating access to treatment and services. Utilizing one-to-one interviews is an effective approach that yields valuable transcripts for further study. In this paper, we focus on interview transcripts between therapists and caregivers with family members suffering from dementia. We propose a method to efficiently handle long interview transcripts for classification. Then we employ the Shapley-value based interpretability technique to identify important contents that significantly contribute to classification results and build a corpus containing sentences potentially beneficial to the therapy. This approach offers valuable insights for enhancing the treatment of mental health issues.

心理健康挑战是重大的全球公共卫生问题，影响着数百万人，对个人、家庭和社区都有影响。治疗师在支持有心理健康问题的人方面发挥着至关重要的作用，他们提供情感、实际和经济上的帮助，并为获得治疗和服务提供便利。利用一对一访谈是一种有效的方法，它能为进一步研究提供有价值的记录誊本。在本文中，我们将重点关注治疗师与痴呆症患者家属护理人员之间的访谈记录。我们提出了一种有效处理长篇访谈记录的方法，以便进行分类。然后，我们采用基于 Shapley 值的可解释性技术来识别对分类结果有显著贡献的重要内容，并建立一个包含可能对治疗有益的句子的语料库。这种方法为加强心理健康问题的治疗提供了宝贵的见解。

引用次数: 0