首页 > 最新文献

AMIA ... Annual Symposium proceedings. AMIA Symposium最新文献

英文 中文
Improving Emergency Department Visit Risk Prediction: Exploring the Operational Utility of Applied Patient Portal Messages. 改进急诊科就诊风险预测:探索应用患者门户信息的操作效用。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Hanna Kiani, Sohaib Hassan, Julian Z Genkins, Jasmine Bilir, Julia Kadie, Tran Le, Jo-Anne Suffoletto, Jonathan H Chen

Patient portal messages represent a unique source of clinical data due to how they represent the voice of the patient, provide a glimpse into care delivery between episodic synchronous appointments, and capture variations in patient behavior and health literacy. There is little understanding of how to best apply modern natural language processing (NLP) approaches, such as large, pre-trained language models (LLMs), to patient messages. In this study, we aim to explore different approaches in incorporating patient messages into an existing Emergency Departments (ED) visit risk prediction model currently deployed at Stanford Health Care. With the addition of patient message frequencies to the baseline we were able to achieve an improved AUC of .77 and a jump in the F1 score. In future work, we aim to build upon these findings and further test combination models to incorporate features around patient message content, in addition to message frequencies.

患者门户消息代表了临床数据的独特来源,因为它们代表了患者的声音,提供了对偶发性同步预约之间的护理提供的一瞥,并捕获了患者行为和健康素养的变化。对于如何最好地将现代自然语言处理(NLP)方法(如大型预训练语言模型(llm))应用于患者信息,人们知之甚少。在本研究中,我们旨在探索将患者信息纳入斯坦福医疗中心现有急诊科(ED)就诊风险预测模型的不同方法。通过将患者信息频率添加到基线,我们能够将AUC提高到0.77,并在F1评分中实现跳跃。在未来的工作中,我们的目标是在这些发现的基础上进一步测试组合模型,以结合患者信息内容的特征,以及信息频率。
{"title":"Improving Emergency Department Visit Risk Prediction: Exploring the Operational Utility of Applied Patient Portal Messages.","authors":"Hanna Kiani, Sohaib Hassan, Julian Z Genkins, Jasmine Bilir, Julia Kadie, Tran Le, Jo-Anne Suffoletto, Jonathan H Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Patient portal messages represent a unique source of clinical data due to how they represent the voice of the patient, provide a glimpse into care delivery between episodic synchronous appointments, and capture variations in patient behavior and health literacy. There is little understanding of how to best apply modern natural language processing (NLP) approaches, such as large, pre-trained language models (LLMs), to patient messages. In this study, we aim to explore different approaches in incorporating patient messages into an existing Emergency Departments (ED) visit risk prediction model currently deployed at Stanford Health Care. With the addition of patient message frequencies to the baseline we were able to achieve an improved AUC of .77 and a jump in the F1 score. In future work, we aim to build upon these findings and further test combination models to incorporate features around patient message content, in addition to message frequencies.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"610-619"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099376/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Novel Sentence Transformer-based Natural Language Processing Approach for Schema Mapping of Electronic Health Records to the OMOP Common Data Model. 基于句子转换器的电子健康记录到OMOP公共数据模型模式映射的自然语言处理方法。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Xinyu Zhou, Lovedeep Singh Dhingra, Arya Aminorroaya, Philip Adejumo, Rohan Khera

Mapping electronic health records (EHR) data to common data models (CDMs) enables the standardization of clinical records, enhancing interoperability and enabling large-scale, multi-centered clinical investigations. Using 2 large publicly available datasets, we developed transformer-based natural language processing models to map medication-related concepts from the EHR at a large and diverse healthcare system to standard concepts in OMOP CDM. We validated the model outputs against standard concepts manually mapped by clinicians. Our best model reached out-of-box accuracies of 96.5% in mapping the 200 most common drugs and 83.0% in mapping 200 random drugs in the EHR. For these tasks, this model outperformed a state-of-the-art large language model (SFR-Embedding-Mistral, 89.5% and 66.5% in accuracy for the two tasks), a widely used software for schema mapping (Usagi, 90.0% and 70.0% in accuracy), and direct string match (7.5% and 7.5% accuracy). Transformer-based deep learning models outperform existing approaches in the standardized mapping of EHR elements and can facilitate an end-to-end automated EHR transformation pipeline.

将电子健康记录(EHR)数据映射到公共数据模型(cdm)可以实现临床记录的标准化,增强互操作性并实现大规模、多中心的临床调查。使用2个大型公开可用的数据集,我们开发了基于转换器的自然语言处理模型,将大型多样化医疗保健系统中的EHR中与药物相关的概念映射到OMOP CDM中的标准概念。我们根据临床医生手动映射的标准概念验证了模型输出。我们的最佳模型在绘制200种最常见药物的图谱时达到了96.5%的开箱外准确率,在绘制200种随机药物的图谱时达到了83.0%。对于这些任务,该模型优于最先进的大型语言模型(sr - embedging - mistral,两个任务的准确率分别为89.5%和66.5%)、广泛使用的模式映射软件(Usagi,准确率分别为90.0%和70.0%)和直接字符串匹配(准确率分别为7.5%和7.5%)。基于转换器的深度学习模型在EHR元素的标准化映射方面优于现有方法,并且可以促进端到端的自动化EHR转换管道。
{"title":"A Novel Sentence Transformer-based Natural Language Processing Approach for Schema Mapping of Electronic Health Records to the OMOP Common Data Model.","authors":"Xinyu Zhou, Lovedeep Singh Dhingra, Arya Aminorroaya, Philip Adejumo, Rohan Khera","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Mapping electronic health records (EHR) data to common data models (CDMs) enables the standardization of clinical records, enhancing interoperability and enabling large-scale, multi-centered clinical investigations. Using 2 large publicly available datasets, we developed transformer-based natural language processing models to map medication-related concepts from the EHR at a large and diverse healthcare system to standard concepts in OMOP CDM. We validated the model outputs against standard concepts manually mapped by clinicians. Our best model reached out-of-box accuracies of 96.5% in mapping the 200 most common drugs and 83.0% in mapping 200 random drugs in the EHR. For these tasks, this model outperformed a state-of-the-art large language model (SFR-Embedding-Mistral, 89.5% and 66.5% in accuracy for the two tasks), a widely used software for schema mapping (Usagi, 90.0% and 70.0% in accuracy), and direct string match (7.5% and 7.5% accuracy). Transformer-based deep learning models outperform existing approaches in the standardized mapping of EHR elements and can facilitate an end-to-end automated EHR transformation pipeline.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1332-1339"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099400/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Secondary Use of Clinical Problem List Descriptions for Bi-Encoder Based ICD-10 Classification. 基于双编码器的ICD-10分类中临床问题清单描述的二次使用。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Markus Kreuzthaler, Bastian Pfeifer, Stefan Schulz

Annotated language resources are essential for supervised machine learning methods. In the clinical domain, such data sets can boost use-case specific natural language processing services. In this work, we have analyzed a clinical problem list table consisting of millions of ICD-10 codes assigned to short problem list descriptions in German. We have investigated whether the given data forms a valuable resource within a secondary use case scenario for coding support. Our proposed methodology exploits an embedding-based k-NN classifier, which was evaluated based on its coding performance, leveraging the multilingual BERT based language model SapBERT-UMLS in comparison with medBERT.de, which is specifically tailored to medical and clinical language resources in German. Our approach reached a weighted F1-measure of 0.87 using SapBERT-UMLS and an F1-measure of 0.86 for medBERT.de. The approach revealed promising coding results when reusing annotated language resources out of clinical routine documentation.

注释语言资源对于监督式机器学习方法至关重要。在临床领域,这样的数据集可以促进用例特定的自然语言处理服务。在这项工作中,我们分析了由数百万个ICD-10代码组成的临床问题列表表,这些代码分配给德语的短问题列表描述。我们已经调查了给定的数据是否在编码支持的次要用例场景中形成了有价值的资源。我们提出的方法利用基于嵌入的k-NN分类器,该分类器根据其编码性能进行评估,利用基于多语言BERT的语言模型SapBERT-UMLS与专门针对德国医学和临床语言资源的medBERT.de进行比较。我们的方法使用SapBERT-UMLS的加权f1测量值为0.87,使用medBERT.de的加权f1测量值为0.86。当重用临床常规文档之外的注释语言资源时,该方法显示了有希望的编码结果。
{"title":"Secondary Use of Clinical Problem List Descriptions for Bi-Encoder Based ICD-10 Classification.","authors":"Markus Kreuzthaler, Bastian Pfeifer, Stefan Schulz","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Annotated language resources are essential for supervised machine learning methods. In the clinical domain, such data sets can boost use-case specific natural language processing services. In this work, we have analyzed a clinical problem list table consisting of millions of ICD-10 codes assigned to short problem list descriptions in German. We have investigated whether the given data forms a valuable resource within a secondary use case scenario for coding support. Our proposed methodology exploits an embedding-based k-NN classifier, which was evaluated based on its coding performance, leveraging the multilingual BERT based language model SapBERT-UMLS in comparison with medBERT.de, which is specifically tailored to medical and clinical language resources in German. Our approach reached a weighted F1-measure of 0.87 using SapBERT-UMLS and an F1-measure of 0.86 for medBERT.de. The approach revealed promising coding results when reusing annotated language resources out of clinical routine documentation.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"620-627"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099355/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144786","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Global Relevance of Online Health Information Sources: A Case Study of Experiences and Perceptions of Nigerians. 在线卫生信息源的全球相关性:尼日利亚人的经验和看法的案例研究。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Ommo Clark, Karuna P Joshi, Tera L Reynolds

Online health information sources (OHIS) offer potential for improving access to health information especially in areas with limited healthcare infrastructure. However, OHIS predominantly originates from Western societies potentially ignoring the specific needs and cultural contexts of diverse populations. There is limited research on the global suitability of OHIS content. This study explores the global relevance of OHIS for diverse populations through a case study examining user experiences of Nigerians living in multiple countries. Findings reveal OHIS usage patterns are influenced by the country of residence and local health services availability. The study highlights the need for culturally inclusive OHIS content to ensure equitable health information access globally. Ultimately, for OHIS to serve a global audience effectively, there needs to be reliable information sources that acknowledge and cater to different users' cultural backgrounds, including prevalent health issues, medical practices, beliefs, languages, and healthcare expectations.

在线卫生信息源(OHIS)提供了改善获取卫生信息的潜力,特别是在卫生保健基础设施有限的地区。然而,OHIS主要起源于西方社会,可能忽视了不同人群的具体需求和文化背景。关于职业健康信息系统内容的全球适用性的研究有限。本研究通过对生活在多个国家的尼日利亚人的用户体验进行案例研究,探讨了OHIS对不同人群的全球相关性。调查结果显示,职业健康信息系统的使用模式受到居住国和当地卫生服务可得性的影响。该研究强调需要具有文化包容性的OHIS内容,以确保在全球公平获取卫生信息。最终,为了使OHIS有效地为全球受众服务,需要有可靠的信息来源,承认并满足不同用户的文化背景,包括流行的健康问题、医疗实践、信仰、语言和医疗保健期望。
{"title":"Global Relevance of Online Health Information Sources: A Case Study of Experiences and Perceptions of Nigerians.","authors":"Ommo Clark, Karuna P Joshi, Tera L Reynolds","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Online health information sources (OHIS) offer potential for improving access to health information especially in areas with limited healthcare infrastructure. However, OHIS predominantly originates from Western societies potentially ignoring the specific needs and cultural contexts of diverse populations. There is limited research on the global suitability of OHIS content. This study explores the global relevance of OHIS for diverse populations through a case study examining user experiences of Nigerians living in multiple countries. Findings reveal OHIS usage patterns are influenced by the country of residence and local health services availability. The study highlights the need for culturally inclusive OHIS content to ensure equitable health information access globally. Ultimately, for OHIS to serve a global audience effectively, there needs to be reliable information sources that acknowledge and cater to different users' cultural backgrounds, including prevalent health issues, medical practices, beliefs, languages, and healthcare expectations.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"300-308"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099433/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144648","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Medication Querying Using Ontology-Driven Approach with OMOP: with an application to a large-scale COVID-19 EHR dataset. 基于OMOP的本体驱动方法优化药物查询:基于大规模COVID-19电子病历数据集的应用
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Xiaojin Li, Yan Huang, Licong Cui, Shiqiang Tao, Guo-Qiang Zhang

Efficient querying for medication information in Electronic Health Record (EHR) datasets is crucial for effective patient care and clinical research. To address the complexity and data volume challenges involved in efficient medication information retrieval, we propose an ontology-driven medication query (ODMQ) optimization approach, leveraging the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). Integrating semantic ontology structures from the OMOP CDM can help enhance query accuracy and efficiency by broadening the scope of relevant medication terms like drug names, National Drug Codes, and generics, resulting in more comprehensive query outcomes than traditional methods. ODMQ significantly reduces manual search time and enhances query capabilities. We validate ODMQ's efficacy using real-world COVID-19 EHR data, demonstrating improved query performance. Through a comprehensive manual review, ODMQ ensures that expanded search terms are relevant to user inputs. It also includes an intuitive query interface and visualizes patient history for result validation and exploration.

在电子健康记录(EHR)数据集中高效查询药物信息对于有效的患者护理和临床研究至关重要。为了解决高效药物信息检索所涉及的复杂性和数据量挑战,我们提出了一种利用观察性医疗结果合作伙伴关系(OMOP)公共数据模型(CDM)的本体驱动药物查询(ODMQ)优化方法。集成来自OMOP CDM的语义本体结构可以通过扩大相关药物术语(如药品名称、国家药品代码和仿制药)的范围来帮助提高查询的准确性和效率,从而产生比传统方法更全面的查询结果。ODMQ显著减少了手动搜索时间并增强了查询功能。我们使用真实的COVID-19 EHR数据验证了ODMQ的有效性,展示了改进的查询性能。通过全面的人工审查,ODMQ确保扩展的搜索词与用户输入相关。它还包括一个直观的查询界面和可视化的病人的历史,结果验证和探索。
{"title":"Optimizing Medication Querying Using Ontology-Driven Approach with OMOP: with an application to a large-scale COVID-19 EHR dataset.","authors":"Xiaojin Li, Yan Huang, Licong Cui, Shiqiang Tao, Guo-Qiang Zhang","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Efficient querying for medication information in Electronic Health Record (EHR) datasets is crucial for effective patient care and clinical research. To address the complexity and data volume challenges involved in efficient medication information retrieval, we propose an ontology-driven medication query (ODMQ) optimization approach, leveraging the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). Integrating semantic ontology structures from the OMOP CDM can help enhance query accuracy and efficiency by broadening the scope of relevant medication terms like drug names, National Drug Codes, and generics, resulting in more comprehensive query outcomes than traditional methods. ODMQ significantly reduces manual search time and enhances query capabilities. We validate ODMQ's efficacy using real-world COVID-19 EHR data, demonstrating improved query performance. Through a comprehensive manual review, ODMQ ensures that expanded search terms are relevant to user inputs. It also includes an intuitive query interface and visualizes patient history for result validation and exploration.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"693-702"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099415/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144660","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessing the Seasonality of Lab Tests Among Patients with Alzheimer's Disease and Related Dementias in OneFlorida Data Trust. 在佛罗里达数据信托中评估阿尔茨海默病和相关痴呆患者实验室测试的季节性
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Wenshan Han, Balu Bhasuran, Victorine Patricia Muse, Søren Brunak, Lifeng Lin, Karim Hanna, Yu Huang, Jiang Bian, Zhe He

About 1 in 9 older adults over 65 has Alzheimer's disease (AD), many of whom also have multiple other chronic conditions such as hypertension and diabetes, necessitating careful monitoring through laboratory tests. Understanding the patterns of laboratory tests in this population aids our understanding and management of these chronic conditions along with AD. In this study, we used an unimodal cosinor model to assess the seasonality of lab tests using electronic health record (EHR) data from 34,303 AD patients from the OneFlorida+ Clinical Research Consortium. We observed significant seasonal fluctuations-higher in winter in lab tests such as glucose, neutrophils per 100 white blood cells (WBC), and WBC. Notably, certain leukocyte types like eosinophils, lymphocytes, and monocytes are elevated during summer, likely reflecting seasonal respiratory diseases and allergens. Seasonality is more pronounced in older patients and varies by gender. Our findings suggest that recognizing these patterns and adjusting reference intervals for seasonality would allow healthcare providers to enhance diagnostic precision, tailor care, and potentially improve patient outcomes.

65岁以上的老年人中约有九分之一患有阿尔茨海默病(AD),其中许多人还患有高血压和糖尿病等多种其他慢性疾病,需要通过实验室测试进行仔细监测。了解这一人群的实验室测试模式有助于我们理解和管理这些慢性疾病以及AD。在这项研究中,我们使用单峰余数模型来评估实验室测试的季节性,使用来自OneFlorida+临床研究联盟的34,303名AD患者的电子健康记录(EHR)数据。我们观察到显著的季节波动——在冬季,实验室测试如葡萄糖、每100个白细胞(WBC)中性粒细胞和白细胞(WBC)较高。值得注意的是,某些白细胞类型,如嗜酸性粒细胞、淋巴细胞和单核细胞在夏季升高,可能反映了季节性呼吸道疾病和过敏原。季节性在老年患者中更为明显,且因性别而异。我们的研究结果表明,认识到这些模式并根据季节性调整参考区间将使医疗保健提供者能够提高诊断精度,定制护理,并可能改善患者的预后。
{"title":"Assessing the Seasonality of Lab Tests Among Patients with Alzheimer's Disease and Related Dementias in OneFlorida Data Trust.","authors":"Wenshan Han, Balu Bhasuran, Victorine Patricia Muse, Søren Brunak, Lifeng Lin, Karim Hanna, Yu Huang, Jiang Bian, Zhe He","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>About 1 in 9 older adults over 65 has Alzheimer's disease (AD), many of whom also have multiple other chronic conditions such as hypertension and diabetes, necessitating careful monitoring through laboratory tests. Understanding the patterns of laboratory tests in this population aids our understanding and management of these chronic conditions along with AD. In this study, we used an unimodal cosinor model to assess the seasonality of lab tests using electronic health record (EHR) data from 34,303 AD patients from the OneFlorida+ Clinical Research Consortium. We observed significant seasonal fluctuations-higher in winter in lab tests such as glucose, neutrophils per 100 white blood cells (WBC), and WBC. Notably, certain leukocyte types like eosinophils, lymphocytes, and monocytes are elevated during summer, likely reflecting seasonal respiratory diseases and allergens. Seasonality is more pronounced in older patients and varies by gender. Our findings suggest that recognizing these patterns and adjusting reference intervals for seasonality would allow healthcare providers to enhance diagnostic precision, tailor care, and potentially improve patient outcomes.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"483-492"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099435/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144699","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Does Transparency Promote Engagement? Cancer Patients' Access of Electronic Medical Records Before and After the Information Blocking Rule. 透明度能促进参与吗?信息封锁规则实施前后癌症患者对电子病历的访问
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Fernanda C G Polubriaginof, Susan Chimonas, Allison Lipitz-Snyderman, Zoe Spiegelhoff, Kenneth Seier, Charlie White, Joshua Jorvina, Gilad Kuperman

Access to clinical information is critical to support patient engagement. The 21st Century Cures Act grants patients immediate electronic access to their full medical records. To assess the potential impact of this transparency provision, we conducted a retrospective study in a large cancer center in New York City, focusing on clinically active patients' accessing health information shared via the patient portal. We identified a significant increase (14%) in the number of pathology and radiology reports read by patients after the implementation ofimmediate release of reports. No changes were found in the rates of account creation or logins. Our results suggest that oncology patients show strong, consistent interest in their clinical data, with many taking advantage of the full electronic access granted by Cures. These findings shed new light on this legislation's impact on patient engagement and access to clinical data.

获取临床信息对于支持患者参与至关重要。《21世纪治愈法案》(21st Century Cures Act)允许患者立即以电子方式查阅自己的全部医疗记录。为了评估这种透明度规定的潜在影响,我们在纽约市的一家大型癌症中心进行了一项回顾性研究,重点关注临床活跃患者访问通过患者门户共享的健康信息。我们发现,实施即时发布报告后,患者阅读的病理和放射学报告数量显著增加(14%)。在账户创建和登录率方面没有发现任何变化。我们的研究结果表明,肿瘤患者对他们的临床数据表现出强烈的、持续的兴趣,许多人利用了Cures授予的完全电子访问。这些发现揭示了这项立法对患者参与和获取临床数据的影响。
{"title":"Does Transparency Promote Engagement? Cancer Patients' Access of Electronic Medical Records Before and After the Information Blocking Rule.","authors":"Fernanda C G Polubriaginof, Susan Chimonas, Allison Lipitz-Snyderman, Zoe Spiegelhoff, Kenneth Seier, Charlie White, Joshua Jorvina, Gilad Kuperman","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Access to clinical information is critical to support patient engagement. The 21<sup>st</sup> Century Cures Act grants patients immediate electronic access to their full medical records. To assess the potential impact of this transparency provision, we conducted a retrospective study in a large cancer center in New York City, focusing on clinically active patients' accessing health information shared via the patient portal. We identified a significant increase (14%) in the number of pathology and radiology reports read by patients after the implementation ofimmediate release of reports. No changes were found in the rates of account creation or logins. Our results suggest that oncology patients show strong, consistent interest in their clinical data, with many taking advantage of the full electronic access granted by Cures. These findings shed new light on this legislation's impact on patient engagement and access to clinical data.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"900-909"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099384/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144516","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying Genomic Data Sources from Biomedical Literature. 从生物医学文献中识别基因组数据来源。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Xu Zuo, Ashley Gilliam, Yan Hu, Kalpana Raja, Kirk Roberts, Hua Xu

Genomic research is becoming increasingly data-intensive, yet the proper reference of data remains a persistent challenge. Despite various efforts to establish and standardize data citation practices, scientists frequently fall short of accurately referencing data in their papers. This deficiency complicates the attribution of contributions to data providers and impedes the reproducibility of findings in genomic research. This study addresses this gap by introducing a gold standard corpus designed to identify mentions of genomic data sources and associated attributes, thereby offering insights into data source availability and accessibility. Within this corpus, we categorize entities into six classes, encompassing three primary entities (Dataset, Repository, and Contributor) and three attributes (Accession Number, URL, and DOI). We also define and annotate the relations between these main entities and attributes. We perform a comprehensive analysis of the corpus, by assessing inter-annotator agreements and implementing an information extraction pipeline using BERT-based models. Our BERT-based models achieve a best F1 score of 0.94 in recognizing mentions of genomic data sources and 0.76 in extracting relationships between these mentions and associated attributes. By introducing this genomic data source mention corpus, we aim to propel the progress of data sharing and reuse in forthcoming genomic research.

基因组研究正变得越来越数据密集,但数据的适当参考仍然是一个持续的挑战。尽管建立和规范数据引用实践的各种努力,科学家们经常不能准确地引用他们论文中的数据。这一缺陷使数据提供者的贡献归属变得复杂,并阻碍了基因组研究结果的可重复性。本研究通过引入金标准语料库来解决这一差距,该语料库旨在识别提及的基因组数据源和相关属性,从而提供对数据源可用性和可访问性的见解。在这个语料库中,我们将实体分为六类,包括三个主要实体(Dataset、Repository和Contributor)和三个属性(Accession Number、URL和DOI)。我们还定义和注释了这些主要实体和属性之间的关系。我们通过评估注释者之间的协议和使用基于bert的模型实现信息提取管道,对语料库进行了全面的分析。我们基于bert的模型在识别基因组数据源的提及方面获得了0.94的最佳F1分数,在提取这些提及与相关属性之间的关系方面获得了0.76的最佳F1分数。通过引入该基因组数据源提及语料库,我们旨在推动未来基因组研究中数据共享和重用的进展。
{"title":"Identifying Genomic Data Sources from Biomedical Literature.","authors":"Xu Zuo, Ashley Gilliam, Yan Hu, Kalpana Raja, Kirk Roberts, Hua Xu","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Genomic research is becoming increasingly data-intensive, yet the proper reference of data remains a persistent challenge. Despite various efforts to establish and standardize data citation practices, scientists frequently fall short of accurately referencing data in their papers. This deficiency complicates the attribution of contributions to data providers and impedes the reproducibility of findings in genomic research. This study addresses this gap by introducing a gold standard corpus designed to identify mentions of genomic data sources and associated attributes, thereby offering insights into data source availability and accessibility. Within this corpus, we categorize entities into six classes, encompassing three primary entities (Dataset, Repository, and Contributor) and three attributes (Accession Number, URL, and DOI). We also define and annotate the relations between these main entities and attributes. We perform a comprehensive analysis of the corpus, by assessing inter-annotator agreements and implementing an information extraction pipeline using BERT-based models. Our BERT-based models achieve a best F1 score of 0.94 in recognizing mentions of genomic data sources and 0.76 in extracting relationships between these mentions and associated attributes. By introducing this genomic data source mention corpus, we aim to propel the progress of data sharing and reuse in forthcoming genomic research.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1350-1359"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099396/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144652","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs. 优化用于出院预测的大型语言模型:利用电子健康记录审计日志的最佳实践。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Xinmeng Zhang, Chao Yan, Yuyang Yang, Zhuohang Li, Yubo Feng, Bradley A Malin, You Chen

Electronic Health Record (EHR) audit log data are increasingly utilized for clinical tasks, from workflow modeling to predictive analyses of discharge events, adverse kidney outcomes, and hospital readmissions. These data encapsulate user-EHR interactions, reflecting both healthcare professionals' behavior and patients' health statuses. To harness this temporal information effectively, this study explores the application of Large Language Models (LLMs) in leveraging audit log data for clinical prediction tasks, specifically focusing on discharge predictions. Utilizing a year's worth of EHR data from Vanderbilt University Medical Center, we fine-tuned LLMs with randomly selected 10,000 training examples. Our findings reveal that LLaMA-2 70B, with an AUROC of 0.80 [0.77-0.82], outperforms both GPT-4 128K in a zero-shot, with an AUROC of 0.68 [0.65-0.71], and DeBERTa, with an AUROC of 0.78 [0.75-0.82]. Among various serialization methods, the first-occurrence approach-wherein only the initial appearance of each event in a sequence is retained-shows superior performance. Furthermore, for the fine-tuned LLaMA-2 70B, logit outputs yield a higher AUROC of 0.80 [0.77-0.82] compared to text outputs, with an AUROC of 0.69 [0.67-0.72]. This study underscores the potential of fine-tuned LLMs, particularly when combined with strategic sequence serialization, in advancing clinical prediction tasks.

电子健康记录(EHR)审计日志数据越来越多地用于临床任务,从工作流建模到出院事件、不良肾脏结果和医院再入院的预测分析。这些数据封装了用户- ehr交互,反映了医疗保健专业人员的行为和患者的健康状况。为了有效地利用这些时间信息,本研究探索了大型语言模型(LLMs)在利用审计日志数据进行临床预测任务中的应用,特别是专注于出院预测。利用范德比尔特大学医学中心(Vanderbilt University Medical Center)一年的电子病历数据,我们用随机选择的1万个训练样本对法学硕士进行了微调。我们的研究结果显示,LLaMA-2 70B的AUROC为0.80[0.77-0.82],在零射击中优于GPT-4 128K,其AUROC为0.68[0.65-0.71],而DeBERTa的AUROC为0.78[0.75-0.82]。在各种序列化方法中,首次出现的方法(其中只保留序列中每个事件的初始出现)表现出优越的性能。此外,对于微调后的LLaMA-2 70B, logit输出的AUROC为0.80[0.77-0.82],而文本输出的AUROC为0.69[0.67-0.72]。这项研究强调了微调llm在推进临床预测任务方面的潜力,特别是当与战略序列序列化相结合时。
{"title":"Optimizing Large Language Models for Discharge Prediction: Best Practices in Leveraging Electronic Health Record Audit Logs.","authors":"Xinmeng Zhang, Chao Yan, Yuyang Yang, Zhuohang Li, Yubo Feng, Bradley A Malin, You Chen","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Electronic Health Record (EHR) audit log data are increasingly utilized for clinical tasks, from workflow modeling to predictive analyses of discharge events, adverse kidney outcomes, and hospital readmissions. These data encapsulate user-EHR interactions, reflecting both healthcare professionals' behavior and patients' health statuses. To harness this temporal information effectively, this study explores the application of Large Language Models (LLMs) in leveraging audit log data for clinical prediction tasks, specifically focusing on discharge predictions. Utilizing a year's worth of EHR data from Vanderbilt University Medical Center, we fine-tuned LLMs with randomly selected 10,000 training examples. Our findings reveal that LLaMA-2 70B, with an AUROC of 0.80 [0.77-0.82], outperforms both GPT-4 128K in a zero-shot, with an AUROC of 0.68 [0.65-0.71], and DeBERTa, with an AUROC of 0.78 [0.75-0.82]. Among various serialization methods, the first-occurrence approach-wherein only the initial appearance of each event in a sequence is retained-shows superior performance. Furthermore, for the fine-tuned LLaMA-2 70B, logit outputs yield a higher AUROC of 0.80 [0.77-0.82] compared to text outputs, with an AUROC of 0.69 [0.67-0.72]. This study underscores the potential of fine-tuned LLMs, particularly when combined with strategic sequence serialization, in advancing clinical prediction tasks.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1323-1331"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099422/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Sepsis Prediction Models are Trained on Labels that Diverge from Clinician-Recommended Treatment Times. 脓毒症预测模型是在偏离临床医生推荐治疗时间的标签上训练的。
Pub Date : 2025-05-22 eCollection Date: 2024-01-01
Gary E Weissman, Rebecca A Hubbard, Blanca E Himes, Kelly L Goodman-O'Leary, Michael O Harhay, Jennifer C Ginestra, Rachel Kohn, Andrew J Admon, Stephanie Parks Taylor, Scott D Halpern

Many sepsis prediction models use the Sepsis-3 definition or its variants as a training label. However, among the few sepsis models ever deployed in practice, there is scant evidence that they offer clinically meaningful decision support at the bedside. As a potential mechanism to explain this limitation, we hypothesized that clinician-recommended treatment times for sepsis would diverge from onset time defined by Sepsis-3. We conducted an electronic survey that was completed by 153 clinicians at three large and geographically diverse medical centers using vignettes derived from eight real cases of sepsis. After reviewing these vignettes, participants suggested antibiotic treatment to start an average of 7.0 hours (95% confidence interval 5.3 to 8.8) before the Sepsis-3 definition onset. Thus, predicting Sepsis-3 onset as a treatment prompt could lead to inappropriate and delayed treatment recommendations. Building predictive decision support systems that identify outcomes aligned with bedside decisions would increase their clinical utility.

许多脓毒症预测模型使用脓毒症-3定义或其变体作为训练标签。然而,在实践中部署的少数脓毒症模型中,很少有证据表明它们在床边提供临床有意义的决策支持。作为解释这一限制的潜在机制,我们假设临床医生推荐的脓毒症治疗时间与脓毒症-3定义的发病时间不同。我们进行了一项电子调查,由三个地理位置不同的大型医疗中心的153名临床医生完成,使用来自八个真实败血症病例的小片段。在回顾这些小插曲后,参与者建议在脓毒症-3定义开始前平均7.0小时(95%置信区间5.3至8.8)开始抗生素治疗。因此,预测脓毒症-3发病作为治疗提示可能导致不适当和延迟的治疗建议。建立预测决策支持系统,识别与床边决定一致的结果,将增加其临床效用。
{"title":"Sepsis Prediction Models are Trained on Labels that Diverge from Clinician-Recommended Treatment Times.","authors":"Gary E Weissman, Rebecca A Hubbard, Blanca E Himes, Kelly L Goodman-O'Leary, Michael O Harhay, Jennifer C Ginestra, Rachel Kohn, Andrew J Admon, Stephanie Parks Taylor, Scott D Halpern","doi":"","DOIUrl":"","url":null,"abstract":"<p><p>Many sepsis prediction models use the Sepsis-3 definition or its variants as a training label. However, among the few sepsis models ever deployed in practice, there is scant evidence that they offer clinically meaningful decision support at the bedside. As a potential mechanism to explain this limitation, we hypothesized that clinician-recommended treatment times for sepsis would diverge from onset time defined by Sepsis-3. We conducted an electronic survey that was completed by 153 clinicians at three large and geographically diverse medical centers using vignettes derived from eight real cases of sepsis. After reviewing these vignettes, participants suggested antibiotic treatment to start an average of 7.0 hours (95% confidence interval 5.3 to 8.8) before the Sepsis-3 definition onset. Thus, predicting Sepsis-3 onset as a treatment prompt could lead to inappropriate and delayed treatment recommendations. Building predictive decision support systems that identify outcomes aligned with bedside decisions would increase their clinical utility.</p>","PeriodicalId":72180,"journal":{"name":"AMIA ... Annual Symposium proceedings. AMIA Symposium","volume":"2024 ","pages":"1215-1224"},"PeriodicalIF":0.0,"publicationDate":"2025-05-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099352/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144144733","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
AMIA ... Annual Symposium proceedings. AMIA Symposium
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1