首页 > 最新文献

JAMIA Open最新文献

英文 中文
Measurement, drivers, and outcomes of patient-initiated secure messaging use and intensity: a scoping review. 患者发起的安全消息传递使用和强度的度量、驱动因素和结果:范围审查。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-10 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf087
Aleksandra Wec, Kelly T Gleason, Danielle Peereboom, Mary Jo Gamper, Sharmini Rathakrishnan, Jennifer L Wolff

Objective: Use of secure messaging through the patient portal has increased in recent years. We compile evidence of how patient-initiated secure messaging has been measured, factors associated with use, and effects on individual and organization-level outcomes.

Materials and methods: We conducted a scoping review of articles published through October 2023 by systematically searching PubMed, Embase, and CINAHL. The search identified 2574 articles; 220 were selected for full text review and 78 met eligibility criteria. Factors and outcomes associated with messaging were organized according to the System Engineering Initiative for Patient Safety (SEIPS) 2.0 framework.

Results: Of 78 included studies, 70 (90%) specified the measurement approach: measuring any messaging use versus none (binary measure) (27/70; 39%), intensity of use (34/70; 49%), or both (9/70; 13%). Studies predominantly examined patient (vs clinician) characteristics (48/78; 62%), findings that patients of female sex, White race, higher socioeconomic status, and greater comorbidity were more likely to message and with greater intensity. Factors in other domains of the SEIPS framework such as tasks (7/78; 9%), tools/technology (5/78; 6%), organizational (7/78; 9%), and environmental (11/78; 14%) factors were examined less frequently, with mixed findings. Outcomes of secure messaging (23/78; 30%) were generally favorable with respect to clinical outcomes (10/23; 43%), efficiency (5/23; 22%), and care experience (5/23; 22%) and mixed with respect to health services use.

Discussion: Patient-initiated messaging use has been variably measured, with notable gaps in our understanding of the role of organization-level factors.

Conclusion: Evidence is needed to inform approaches implemented by healthcare systems to manage the growing volume of patient-initiated messages.

目的:近年来,通过患者门户网站使用安全消息的情况有所增加。我们汇编了如何测量患者发起的安全消息传递、与使用相关的因素以及对个人和组织级别结果的影响的证据。材料和方法:我们通过系统地检索PubMed、Embase和CINAHL,对截至2023年10月发表的文章进行了范围综述。检索确定了2574篇文章;选择220份进行全文审查,78份符合资格标准。根据患者安全系统工程倡议(SEIPS) 2.0框架组织与消息传递相关的因素和结果。结果:在纳入的78项研究中,70项(90%)指定了测量方法:测量任何消息使用与不测量(二元测量)(27/70;39%)、使用强度(34/70;49%),或者两者兼而有之(9/70;13%)。研究主要检查患者(vs .临床医生)的特征(48/78;62%),发现女性、白人、较高的社会经济地位和更大的合并症患者更有可能发送信息,并且强度更大。SEIPS框架其他领域的因素,如任务(7/78;9%),工具/技术(5/78;6%),组织(7/78;9%),环境(11/78;14%)因素的检查频率较低,结果好坏参半。安全消息传递的结果(23/78;30%)在临床结果方面总体有利(10/23;43%),效率(5/23;22%),护理经验(5/23;22%),在卫生服务使用方面则参差不齐。讨论:对患者发起的消息传递使用进行了不同的测量,在我们对组织级因素的作用的理解上存在明显的差距。结论:需要证据来告知卫生保健系统实施的方法,以管理越来越多的患者发起的信息。
{"title":"Measurement, drivers, and outcomes of patient-initiated secure messaging use and intensity: a scoping review.","authors":"Aleksandra Wec, Kelly T Gleason, Danielle Peereboom, Mary Jo Gamper, Sharmini Rathakrishnan, Jennifer L Wolff","doi":"10.1093/jamiaopen/ooaf087","DOIUrl":"10.1093/jamiaopen/ooaf087","url":null,"abstract":"<p><strong>Objective: </strong>Use of secure messaging through the patient portal has increased in recent years. We compile evidence of how patient-initiated secure messaging has been measured, factors associated with use, and effects on individual and organization-level outcomes.</p><p><strong>Materials and methods: </strong>We conducted a scoping review of articles published through October 2023 by systematically searching PubMed, Embase, and CINAHL. The search identified 2574 articles; 220 were selected for full text review and 78 met eligibility criteria. Factors and outcomes associated with messaging were organized according to the System Engineering Initiative for Patient Safety (SEIPS) 2.0 framework.</p><p><strong>Results: </strong>Of 78 included studies, 70 (90%) specified the measurement approach: measuring any messaging use versus none (binary measure) (27/70; 39%), intensity of use (34/70; 49%), or both (9/70; 13%). Studies predominantly examined patient (vs clinician) characteristics (48/78; 62%), findings that patients of female sex, White race, higher socioeconomic status, and greater comorbidity were more likely to message and with greater intensity. Factors in other domains of the SEIPS framework such as tasks (7/78; 9%), tools/technology (5/78; 6%), organizational (7/78; 9%), and environmental (11/78; 14%) factors were examined less frequently, with mixed findings. Outcomes of secure messaging (23/78; 30%) were generally favorable with respect to clinical outcomes (10/23; 43%), efficiency (5/23; 22%), and care experience (5/23; 22%) and mixed with respect to health services use.</p><p><strong>Discussion: </strong>Patient-initiated messaging use has been variably measured, with notable gaps in our understanding of the role of organization-level factors.</p><p><strong>Conclusion: </strong>Evidence is needed to inform approaches implemented by healthcare systems to manage the growing volume of patient-initiated messages.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf087"},"PeriodicalIF":3.4,"publicationDate":"2025-08-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12342875/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144838088","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A community-based approach to ethical decision-making in artificial intelligence for health care. 基于社区的医疗保健人工智能伦理决策方法。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-07 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf076
Abdou S Senghor, Tiffani J Bright, Saya Kakim, Keith C Norris, Henry A Antwi, Jasmine K Cooper, C Daniel Mullins, Claudia Baquet

Objectives: Artificial Intelligence (AI) is transforming healthcare by improving diagnostics, treatment recommendations, and resource allocation. However, its implementation also raises ethical concerns, particularly regarding biases in AI algorithms trained on inequitable data, which may reinforce health disparities. This article introduces the AI COmmunity-based Ethical Dialogue and DEcision-making (CODE) framework to embed ethical deliberation into AI development, focusing on Electronic Health Records (EHRs).

Materials and methods: We propose the AI CODE framework as a structured approach to addressing ethical challenges in AI-driven healthcare and ensuring its implementation supports health equity.

Results: The framework outlines 5 steps to advance health equity: (1) Contextual diversity and priority: Ensuring inclusive datasets and that AI reflects the community needs; (2) Sharing ethical propositions: Structured discussions on privacy, bias, and fairness; (3) Dialogic decision-making: Collaboratively with stakeholders to develop AI solutions; (4) Integrating ethical solutions: Applying solutions into AI design to enhance fairness; and (5) Evaluating effectiveness: Continuously monitoring AI to address emerging biases.

Discussion: We examine the framework's role in mitigating AI biases through structured community engagement and its relevance within evolving healthcare policies. While the framework promotes ethical AI integration in healthcare, it also faces challenges in implementation.

Conclusion: The framework provides practical guidance to ensure AI systems are ethical, community-driven, and aligned with health equity goals.

目标:人工智能(AI)正在通过改进诊断、治疗建议和资源分配来改变医疗保健。然而,它的实施也引起了伦理问题,特别是关于在不公平数据上训练的人工智能算法的偏见,这可能会加剧健康差距。本文介绍了人工智能社区伦理对话和决策(CODE)框架,将伦理审议嵌入到人工智能开发中,重点是电子健康记录(EHRs)。材料和方法:我们提出AI CODE框架作为解决人工智能驱动的医疗保健中的道德挑战的结构化方法,并确保其实施支持健康公平。结果:该框架概述了促进卫生公平的5个步骤:(1)背景多样性和重点:确保包容性数据集和人工智能反映社区需求;(2)分享伦理命题:关于隐私、偏见和公平的结构化讨论;(3)对话式决策:与利益相关方共同制定人工智能解决方案;(4)整合伦理解决方案:将解决方案应用到AI设计中,增强公平性;(5)评估有效性:持续监测人工智能以解决新出现的偏见。讨论:我们研究了该框架在通过结构化社区参与减轻人工智能偏见方面的作用,以及它在不断发展的医疗保健政策中的相关性。虽然该框架促进了人工智能在医疗保健领域的道德整合,但它在实施方面也面临挑战。结论:该框架为确保人工智能系统符合道德、社区驱动并与卫生公平目标保持一致提供了实用指导。
{"title":"A community-based approach to ethical decision-making in artificial intelligence for health care.","authors":"Abdou S Senghor, Tiffani J Bright, Saya Kakim, Keith C Norris, Henry A Antwi, Jasmine K Cooper, C Daniel Mullins, Claudia Baquet","doi":"10.1093/jamiaopen/ooaf076","DOIUrl":"10.1093/jamiaopen/ooaf076","url":null,"abstract":"<p><strong>Objectives: </strong>Artificial Intelligence (AI) is transforming healthcare by improving diagnostics, treatment recommendations, and resource allocation. However, its implementation also raises ethical concerns, particularly regarding biases in AI algorithms trained on inequitable data, which may reinforce health disparities. This article introduces the AI COmmunity-based Ethical Dialogue and DEcision-making (CODE) framework to embed ethical deliberation into AI development, focusing on Electronic Health Records (EHRs).</p><p><strong>Materials and methods: </strong>We propose the AI CODE framework as a structured approach to addressing ethical challenges in AI-driven healthcare and ensuring its implementation supports health equity.</p><p><strong>Results: </strong>The framework outlines 5 steps to advance health equity: (1) Contextual diversity and priority: Ensuring inclusive datasets and that AI reflects the community needs; (2) Sharing ethical propositions: Structured discussions on privacy, bias, and fairness; (3) Dialogic decision-making: Collaboratively with stakeholders to develop AI solutions; (4) Integrating ethical solutions: Applying solutions into AI design to enhance fairness; and (5) Evaluating effectiveness: Continuously monitoring AI to address emerging biases.</p><p><strong>Discussion: </strong>We examine the framework's role in mitigating AI biases through structured community engagement and its relevance within evolving healthcare policies. While the framework promotes ethical AI integration in healthcare, it also faces challenges in implementation.</p><p><strong>Conclusion: </strong>The framework provides practical guidance to ensure AI systems are ethical, community-driven, and aligned with health equity goals.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf076"},"PeriodicalIF":3.4,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12342142/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144838085","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Subpopulation-specific synthetic electronic health records can increase mortality prediction performance. 针对特定亚群的合成电子健康记录可提高死亡率预测性能。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-07 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf091
Oriel Perets, Nadav Rappoport

Objective: To address biased representation in Electronic Health Records (EHRs) across subpopulations (SPs), which leads to predictive models underperforming for underrepresented groups, we propose a framework to enhance equitable predictive performance.

Materials and methods: We developed a framework using generative adversarial networks (GANs) to create SP-specific synthetic data, which augments the original training datasets. Subsequently, we employed an ensemble approach, training distinct prediction models tailored to each SP.

Results: The proposed framework was evaluated on two datasets derived from the MIMIC database, achieving a performance improvement in Receiver Operating Characteristics Area Under Curve (ROCAUC) ranging from 8% to 31% for underrepresented SPs.

Discussion: The results indicate that targeted synthetic data augmentation and SP-specific model training significantly mitigate the performance disparities observed in conventional predictive models trained on imbalanced EHR data.

Conclusion: Our novel GAN-based framework, combined with an ensemble prediction approach, effectively enhances predictive equity across SPs. The code and ensemble models developed in this study are publicly available, supporting further research and practical adoption of equitable predictive analytics in healthcare.

目的:为了解决电子健康记录(EHRs)在亚人群(SPs)中的偏代表性问题,这导致预测模型在代表性不足的群体中表现不佳,我们提出了一个框架来提高公平的预测性能。材料和方法:我们开发了一个框架,使用生成对抗网络(gan)来创建特定于sp的合成数据,这增加了原始训练数据集。随后,我们采用了一种集成方法,为每个sp量身定制了不同的预测模型。结果:所提出的框架在来自MIMIC数据库的两个数据集上进行了评估,对于代表性不足的sp,接收器操作特征曲线下面积(ROCAUC)的性能提高了8%至31%。讨论:结果表明,有针对性的合成数据增强和特定sp的模型训练显著减轻了在不平衡EHR数据上训练的传统预测模型所观察到的性能差异。结论:我们新颖的基于gan的框架,结合集成预测方法,有效地增强了跨sp的预测公平性。本研究中开发的代码和集成模型是公开的,支持进一步的研究和在医疗保健中公平的预测分析的实际采用。
{"title":"Subpopulation-specific synthetic electronic health records can increase mortality prediction performance.","authors":"Oriel Perets, Nadav Rappoport","doi":"10.1093/jamiaopen/ooaf091","DOIUrl":"10.1093/jamiaopen/ooaf091","url":null,"abstract":"<p><strong>Objective: </strong>To address biased representation in Electronic Health Records (EHRs) across subpopulations (SPs), which leads to predictive models underperforming for underrepresented groups, we propose a framework to enhance equitable predictive performance.</p><p><strong>Materials and methods: </strong>We developed a framework using generative adversarial networks (GANs) to create SP-specific synthetic data, which augments the original training datasets. Subsequently, we employed an ensemble approach, training distinct prediction models tailored to each SP.</p><p><strong>Results: </strong>The proposed framework was evaluated on two datasets derived from the MIMIC database, achieving a performance improvement in Receiver Operating Characteristics Area Under Curve (ROCAUC) ranging from 8% to 31% for underrepresented SPs.</p><p><strong>Discussion: </strong>The results indicate that targeted synthetic data augmentation and SP-specific model training significantly mitigate the performance disparities observed in conventional predictive models trained on imbalanced EHR data.</p><p><strong>Conclusion: </strong>Our novel GAN-based framework, combined with an ensemble prediction approach, effectively enhances predictive equity across SPs. The code and ensemble models developed in this study are publicly available, supporting further research and practical adoption of equitable predictive analytics in healthcare.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf091"},"PeriodicalIF":3.4,"publicationDate":"2025-08-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12342355/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144838089","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Chronic obstructive pulmonary disease screening using time-frequency features of self-recorded respiratory sounds. 利用自录呼吸音的时频特征筛查慢性阻塞性肺疾病。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-04 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf083
Alberto Tena, Ivan Juez-Garcia, Iván D Benítez, Francesc Clariá, Jessica González, Jordi de Batlle, Francesc Solsona

Objectives: Chronic obstructive pulmonary disease (COPD) is the third leading cause of death worldwide, with up to 70% of cases remaining undiagnosed. This paper proposes a COPD screening tool based on time-frequency representation features of self-recorded respiratory sounds.

Materials and methods: Respiratory sound samples (breath and cough sounds) were extracted from COPD and asymptomatic non-COPD volunteers using a large, scientific-purpose database. We analyzed 39 time-frequency representation features of breath and cough sounds, combined with age, sex, and smoking status, using Autoencoder neural networks and random forest (RF) algorithms. We compared the performance of different breath and cough RF models built to detect COPD: one based exclusively on sound features, one based exclusively on sociodemographic characteristics, and one based on sound features and sociodemographic characteristics.

Results: Models including breathing features outperformed models exclusively based on sociodemographic characteristics. Specifically, the model combining sociodemographic characteristics and breathing features achieved an area under the curve (AUC), accuracy, sensitivity, and specificity of 0.901, 0.836, 0.871, and 0.761, respectively, in the test set, representing a substantial increase in AUC when compared to the model based exclusively on sociodemographic characteristics (0.901 vs 0.818).

Discussion: Our results suggest that a lightweight collection of the time-frequency representation features of self-recorded beathing sounds could effectively improve the predictive performance of COPD screening or case-finding questionnaires.

Conclusion: COPD screening through self-recorded breathing sounds could be easily integrated as a low-cost first step in case-finding programs, potentially contributing to mitigate COPD underdiagnosis.

慢性阻塞性肺疾病(COPD)是全球第三大死亡原因,高达70%的病例仍未得到诊断。本文提出了一种基于自录呼吸音时频表征特征的慢阻肺筛查工具。材料和方法:使用大型科学数据库从COPD和无症状非COPD志愿者中提取呼吸声音样本(呼吸声和咳嗽声)。我们使用Autoencoder神经网络和随机森林(RF)算法,结合年龄、性别和吸烟状况,分析了呼吸和咳嗽声音的39个时频表示特征。我们比较了用于检测COPD的不同呼吸和咳嗽RF模型的性能:一个完全基于声音特征,一个完全基于社会人口特征,一个基于声音特征和社会人口特征。结果:包括呼吸特征的模型优于完全基于社会人口特征的模型。具体而言,结合社会人口特征和呼吸特征的模型在测试集中分别获得了0.901、0.836、0.871和0.761的曲线下面积(AUC)、准确性、灵敏度和特异性,与仅基于社会人口特征的模型相比,AUC大幅增加(0.901 vs 0.818)。讨论:我们的研究结果表明,自录的呼吸声的时间频率表征特征的轻量级集合可以有效地提高COPD筛查或病例查找问卷的预测性能。结论:通过自录呼吸音进行COPD筛查可以很容易地整合为病例发现项目的低成本第一步,可能有助于减轻COPD的漏诊。
{"title":"Chronic obstructive pulmonary disease screening using time-frequency features of self-recorded respiratory sounds.","authors":"Alberto Tena, Ivan Juez-Garcia, Iván D Benítez, Francesc Clariá, Jessica González, Jordi de Batlle, Francesc Solsona","doi":"10.1093/jamiaopen/ooaf083","DOIUrl":"10.1093/jamiaopen/ooaf083","url":null,"abstract":"<p><strong>Objectives: </strong>Chronic obstructive pulmonary disease (COPD) is the third leading cause of death worldwide, with up to 70% of cases remaining undiagnosed. This paper proposes a COPD screening tool based on time-frequency representation features of self-recorded respiratory sounds.</p><p><strong>Materials and methods: </strong>Respiratory sound samples (breath and cough sounds) were extracted from COPD and asymptomatic non-COPD volunteers using a large, scientific-purpose database. We analyzed 39 time-frequency representation features of breath and cough sounds, combined with age, sex, and smoking status, using Autoencoder neural networks and random forest (RF) algorithms. We compared the performance of different breath and cough RF models built to detect COPD: one based exclusively on sound features, one based exclusively on sociodemographic characteristics, and one based on sound features and sociodemographic characteristics.</p><p><strong>Results: </strong>Models including breathing features outperformed models exclusively based on sociodemographic characteristics. Specifically, the model combining sociodemographic characteristics and breathing features achieved an area under the curve (AUC), accuracy, sensitivity, and specificity of 0.901, 0.836, 0.871, and 0.761, respectively, in the test set, representing a substantial increase in AUC when compared to the model based exclusively on sociodemographic characteristics (0.901 vs 0.818).</p><p><strong>Discussion: </strong>Our results suggest that a lightweight collection of the time-frequency representation features of self-recorded beathing sounds could effectively improve the predictive performance of COPD screening or case-finding questionnaires.</p><p><strong>Conclusion: </strong>COPD screening through self-recorded breathing sounds could be easily integrated as a low-cost first step in case-finding programs, potentially contributing to mitigate COPD underdiagnosis.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf083"},"PeriodicalIF":3.4,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12322310/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144790230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VetDash: a clinical dashboard for enhancing measurement-based care in veteran health. VetDash:一个临床仪表板,用于加强退伍军人健康方面的基于测量的护理。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-31 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf075
Santiago Allende, Hayley S Sullivan, Peter J Bayley

Objectives: Measurement-based care (MBC) improves clinical decision-making but remains underutilized in the Veterans Health Administration due to barriers such as provider awareness, time constraints, and user-experience limitations. This study describes the development of the War Related Illness and Injury Study Center Veteran Dashboard (VetDash), a patient-level clinical dashboard designed to integrate the VA's Collect, Share, Act model into the provider workflow.

Materials and methods: VetDash was developed using R Shiny, utilizing data from the WRIISC Clinical Intake Packet Database. It integrates patient-reported health data and military history into a dashboard hosted on a Linux-based Shiny Server within the VA's intranet.

Results: VetDash includes four modules: Patient Characteristics, Patient Health Symptoms, Patient Exposures, and Patient Self-Report Measures. Providers can visualize patient-reported symptoms, military exposures, and self-report measures, and compare patients to provider-defined cohorts.

Discussion and conclusion: VetDash facilitates MBC integration into the clinical workflow, potentially improving personalized patient care. Future research should evaluate its impact on clinical decisions, outcomes, and explore further enhancements.

目的:基于测量的护理(MBC)改善了临床决策,但由于提供者意识、时间限制和用户体验限制等障碍,在退伍军人健康管理中仍未得到充分利用。本研究描述了与战争有关的疾病和伤害研究中心退伍军人仪表板(VetDash)的发展,这是一个病人级的临床仪表板,旨在将VA的收集、分享、行动模型集成到提供者工作流程中。材料和方法:VetDash是使用R Shiny开发的,利用了来自WRIISC临床摄入包数据库的数据。它将病人报告的健康数据和军事历史整合到一个仪表盘中,该仪表盘托管在VA内部网内基于linux的Shiny服务器上。结果:VetDash包括四个模块:患者特征、患者健康症状、患者暴露和患者自我报告测量。提供者可以可视化患者报告的症状、军事暴露和自我报告措施,并将患者与提供者定义的队列进行比较。讨论与结论:VetDash促进了MBC与临床工作流程的整合,有可能改善个性化的患者护理。未来的研究应评估其对临床决策和结果的影响,并探索进一步的增强。
{"title":"VetDash: a clinical dashboard for enhancing measurement-based care in veteran health.","authors":"Santiago Allende, Hayley S Sullivan, Peter J Bayley","doi":"10.1093/jamiaopen/ooaf075","DOIUrl":"10.1093/jamiaopen/ooaf075","url":null,"abstract":"<p><strong>Objectives: </strong>Measurement-based care (MBC) improves clinical decision-making but remains underutilized in the Veterans Health Administration due to barriers such as provider awareness, time constraints, and user-experience limitations. This study describes the development of the War Related Illness and Injury Study Center Veteran Dashboard (VetDash), a patient-level clinical dashboard designed to integrate the VA's <i>Collect, Share, Act</i> model into the provider workflow.</p><p><strong>Materials and methods: </strong>VetDash was developed using R Shiny, utilizing data from the WRIISC Clinical Intake Packet Database. It integrates patient-reported health data and military history into a dashboard hosted on a Linux-based Shiny Server within the VA's intranet.</p><p><strong>Results: </strong>VetDash includes four modules: Patient Characteristics, Patient Health Symptoms, Patient Exposures, and Patient Self-Report Measures. Providers can visualize patient-reported symptoms, military exposures, and self-report measures, and compare patients to provider-defined cohorts.</p><p><strong>Discussion and conclusion: </strong>VetDash facilitates MBC integration into the clinical workflow, potentially improving personalized patient care. Future research should evaluate its impact on clinical decisions, outcomes, and explore further enhancements.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf075"},"PeriodicalIF":3.4,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313089/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144761659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting school violence risk with incomplete interview data: an automated assessment approach. 用不完全访谈数据预测校园暴力风险:一种自动评估方法。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-31 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf084
Lara J Kanbar, Alexander Osborn, Andrew Cifuentes, Jennifer Combs, Michael Sorter, Drew Barzman, Judith W Dexheimer

Objectives: School violence risk prevention in the United States relies on manual assessments that are time-consuming and subjective. We developed a machine learning algorithm named Automated RIsk Assessment (ARIA), using natural language processing (NLP) to find linguistic patterns in standardized interview questions that can predict risk of aggression. Our goal was to evaluate the incremental change in performance with the addition of each question to simulate situations where interviews cannot be completed.

Materials and methods: Students were interviewed with 2 14-question risk assessments, the Brief Rating of Aggression by Children and Adolescents (BRACHA) and the School Safety Scale (SSS), that encouraged open-ended answers to the interview questions. The reference standard was defined as the subject's likeliness to display aggression in the future as determined by a forensic psychiatrist. Feature sets were extracted to represent the addition of 1 question at a time in a typical interview, up to and including the 28 total main questions along with other sub-questions that arose. The ARIA NLP pipeline tokenized each feature set, then extracted n-gram features (n 5) that captured contextual and semantic information. The features were evaluated using an L2-regularized logistic regression classifier and L2-regularized support vector machine (L2-SVM) classifier.

Results: Between May 1, 2015 and February 6, 2021, 412 assessment interviews were conducted. When compared to clinical judgement, ARIA performed with an area under the Receiver Operating Characteristic curve of 0.9 after 10 BRACHA questions, suggesting that it remains powerful even with truncated interviews. The full BRACHA had similar performance to the BRACHA + SSS assessment.

Discussion and conclusion: ARIA could use incomplete risk assessment interviews to provide modest recommendations even if interview completion is not possible. This could help to reduce the burden for the social worker or school counselor who may be using ARIA in less-than-ideal conditions.

目标:美国的校园暴力风险预防依赖于耗时且主观的人工评估。我们开发了一种名为自动风险评估(ARIA)的机器学习算法,使用自然语言处理(NLP)在标准化面试问题中找到可以预测攻击风险的语言模式。我们的目标是通过增加每个问题来模拟面试无法完成的情况来评估绩效的增量变化。材料与方法:采用2份14题风险评估问卷对学生进行访谈,问卷采用BRACHA(儿童青少年攻击行为简易评定量表)和SSS(学校安全量表),鼓励开放式回答访谈问题。参照标准定义为被试在未来表现出攻击性的可能性,由法医精神病学家确定。特征集被提取出来,代表在一次典型的采访中每次增加一个问题,最多包括28个主要问题以及其他出现的子问题。ARIA NLP管道对每个特征集进行标记,然后提取捕获上下文和语义信息的n个图特征(n≤5)。使用l2正则化逻辑回归分类器和l2正则化支持向量机(L2-SVM)分类器对特征进行评估。结果:2015年5月1日至2021年2月6日,共进行了412次评估访谈。与临床判断相比,在回答10个BRACHA问题后,ARIA在接受者工作特征曲线下的面积为0.9,这表明即使在截短的访谈中,ARIA仍然很强大。完整的BRACHA与BRACHA + SSS评估的表现相似。讨论和结论:即使不可能完成访谈,ARIA也可以使用不完整的风险评估访谈来提供适度的建议。这可能有助于减轻社会工作者或学校辅导员的负担,他们可能在不太理想的条件下使用ARIA。
{"title":"Forecasting school violence risk with incomplete interview data: an automated assessment approach.","authors":"Lara J Kanbar, Alexander Osborn, Andrew Cifuentes, Jennifer Combs, Michael Sorter, Drew Barzman, Judith W Dexheimer","doi":"10.1093/jamiaopen/ooaf084","DOIUrl":"10.1093/jamiaopen/ooaf084","url":null,"abstract":"<p><strong>Objectives: </strong>School violence risk prevention in the United States relies on manual assessments that are time-consuming and subjective. We developed a machine learning algorithm named Automated RIsk Assessment (ARIA), using natural language processing (NLP) to find linguistic patterns in standardized interview questions that can predict risk of aggression. Our goal was to evaluate the incremental change in performance with the addition of each question to simulate situations where interviews cannot be completed.</p><p><strong>Materials and methods: </strong>Students were interviewed with 2 14-question risk assessments, the Brief Rating of Aggression by Children and Adolescents (BRACHA) and the School Safety Scale (SSS), that encouraged open-ended answers to the interview questions. The reference standard was defined as the subject's likeliness to display aggression in the future as determined by a forensic psychiatrist. Feature sets were extracted to represent the addition of 1 question at a time in a typical interview, up to and including the 28 total main questions along with other sub-questions that arose. The ARIA NLP pipeline tokenized each feature set, then extracted n-gram features (<i>n</i> <b>≤</b> 5) that captured contextual and semantic information. The features were evaluated using an L2-regularized logistic regression classifier and L2-regularized support vector machine (L2-SVM) classifier.</p><p><strong>Results: </strong>Between May 1, 2015 and February 6, 2021, 412 assessment interviews were conducted. When compared to clinical judgement, ARIA performed with an area under the Receiver Operating Characteristic curve of 0.9 after 10 BRACHA questions, suggesting that it remains powerful even with truncated interviews. The full BRACHA had similar performance to the BRACHA + SSS assessment.</p><p><strong>Discussion and conclusion: </strong>ARIA could use incomplete risk assessment interviews to provide modest recommendations even if interview completion is not possible. This could help to reduce the burden for the social worker or school counselor who may be using ARIA in less-than-ideal conditions.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf084"},"PeriodicalIF":3.4,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313018/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144761658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing artificial intelligence- vs clinician-authored summaries of simulated primary care electronic health records. 比较人工智能与临床医生撰写的模拟初级保健电子健康记录摘要。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-30 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf082
Lara Shemtob, Abdullah Nouri, Adam Harvey-Sullivan, Connor S Qiu, Jonathan Martin, Martha Martin, Sara Noden, Tanveer Rob, Ana L Neves, Azeem Majeed, Jonathan Clarke, Thomas Beaney

Objective: To compare clinical summaries generated from simulated patient primary care electronic health records (EHRs) by GPT-4, to summaries generated by clinicians on multiple domains of quality including utility, concision, accuracy, and bias.

Materials and methods: Seven primary care physicians generated 70 simulated patient EHR notes, each representing 10 patient contacts with the practice over at least 2 years. Each record was summarized by a different clinician and by GPT-4. artificial intelligence (AI)- and clinician-authored summaries were rated blind by clinicians according to 8 domains of quality and an overall rating.

Results: The median time taken for a clinician to read through and assimilate the information in the EHRs before summarizing, was 7 minutes. Clinicians rated clinician-authored summaries higher than AI-authored summaries overall (7.39 vs 7.00 out of 10; P = .02), but with greater variability in clinician-authored summary ratings. AI and clinician-authored summaries had similar accuracy and AI-authored summaries were less likely to omit important information and more likely to use patient-friendly language.

Discussion: Although AI-authored summaries were rated slightly lower overall compared with clinician-authored summaries, they demonstrated similar accuracy and greater consistency. This demonstrates potential applications for generating summaries in primary care, particularly given the substantial time taken for clinicians to undertake this work.

Conclusion: The results suggest the feasibility, utility and acceptability of using AI-authored summaries to integrate into EHRs to support clinicians in primary care. AI summarization tools have the potential to improve healthcare productivity, including by enabling clinicians to spend more time on direct patient care.

目的:比较GPT-4从模拟患者初级保健电子健康记录(EHRs)生成的临床摘要与临床医生在多个质量领域生成的摘要,包括实用性、简洁性、准确性和偏倚。材料和方法:7名初级保健医生生成70个模拟患者电子病历记录,每个记录代表10名患者在至少2年内与该诊所接触。每个记录由不同的临床医生和GPT-4进行总结。临床医生根据8个质量领域和总体评级,将人工智能(AI)和临床医生撰写的摘要评为盲摘要。结果:临床医生在总结之前通读和吸收电子病历信息的中位时间为7分钟。临床医生认为临床医生撰写的摘要总体上高于人工智能撰写的摘要(7.39 vs 7.00;P = .02),但临床撰写的总结评分差异较大。人工智能和临床医生撰写的摘要具有相似的准确性,人工智能撰写的摘要不太可能遗漏重要信息,更有可能使用对患者友好的语言。讨论:尽管与临床医生撰写的摘要相比,人工智能撰写的摘要的总体评分略低,但它们表现出相似的准确性和更高的一致性。这证明了在初级保健中生成摘要的潜在应用,特别是考虑到临床医生需要花费大量时间来进行这项工作。结论:研究结果表明,将人工智能撰写的摘要整合到电子病历中,以支持临床医生进行初级保健,具有可行性、实用性和可接受性。人工智能总结工具有可能提高医疗保健工作效率,包括使临床医生能够将更多时间用于直接护理患者。
{"title":"Comparing artificial intelligence- vs clinician-authored summaries of simulated primary care electronic health records.","authors":"Lara Shemtob, Abdullah Nouri, Adam Harvey-Sullivan, Connor S Qiu, Jonathan Martin, Martha Martin, Sara Noden, Tanveer Rob, Ana L Neves, Azeem Majeed, Jonathan Clarke, Thomas Beaney","doi":"10.1093/jamiaopen/ooaf082","DOIUrl":"10.1093/jamiaopen/ooaf082","url":null,"abstract":"<p><strong>Objective: </strong>To compare clinical summaries generated from simulated patient primary care electronic health records (EHRs) by GPT-4, to summaries generated by clinicians on multiple domains of quality including utility, concision, accuracy, and bias.</p><p><strong>Materials and methods: </strong>Seven primary care physicians generated 70 simulated patient EHR notes, each representing 10 patient contacts with the practice over at least 2 years. Each record was summarized by a different clinician and by GPT-4. artificial intelligence (AI)- and clinician-authored summaries were rated blind by clinicians according to 8 domains of quality and an overall rating.</p><p><strong>Results: </strong>The median time taken for a clinician to read through and assimilate the information in the EHRs before summarizing, was 7 minutes. Clinicians rated clinician-authored summaries higher than AI-authored summaries overall (7.39 vs 7.00 out of 10; <i>P</i> = .02), but with greater variability in clinician-authored summary ratings. AI and clinician-authored summaries had similar accuracy and AI-authored summaries were less likely to omit important information and more likely to use patient-friendly language.</p><p><strong>Discussion: </strong>Although AI-authored summaries were rated slightly lower overall compared with clinician-authored summaries, they demonstrated similar accuracy and greater consistency. This demonstrates potential applications for generating summaries in primary care, particularly given the substantial time taken for clinicians to undertake this work.</p><p><strong>Conclusion: </strong>The results suggest the feasibility, utility and acceptability of using AI-authored summaries to integrate into EHRs to support clinicians in primary care. AI summarization tools have the potential to improve healthcare productivity, including by enabling clinicians to spend more time on direct patient care.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf082"},"PeriodicalIF":3.4,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12309840/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144754635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of 3 standards-based clinical decision support (CDS) tools in an academic electronic health record using Clinical Quality Language, CDS Hooks, and Fast Healthcare Interoperability Resources: a retrospective evaluation. 使用临床质量语言、CDS Hooks和快速医疗保健互操作性资源评估学术电子健康记录中3种基于标准的临床决策支持(CDS)工具:回顾性评估。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-30 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf085
Mark Isabelle, Ivan K Ip, Michael Bakhtin, Louise Schneider, Ali S Raja, Sayon Dutta, Adam Landman, Ronilda Lacson

Objectives: To evaluate clinical decision support (CDS) of varying complexities and care settings represented using Health Information Technology (HIT) standards-Clinical Quality Language (CQL) for representing clinical logic and Fast Healthcare Interoperability Resources (FHIR) for health information exchange.

Materials and methods: This Institutional Review Board-approved, retrospective study was performed at an academic medical center (January 1, 2023-December 31, 2023). Recommendations extracted from patient-centered outcomes guidelines were translated into standardized syntax (SNOMED CT) and representations (CQL, FHIR). Clinical decision support Hooks applications were developed for: CDS1-provides education for emergency department (ED) patients with venous thromboembolism; CDS2-recommends CT pulmonary angiogram in ED patients with suspected pulmonary embolism (PE) and uses FHIR Questionnaire resources for representing interactive content; CDS3-recommends mammography/breast magnetic resonance imaging surveillance in outpatients with breast cancer history. We randomly selected 50 ED patients with suspected PE and 50 outpatients undergoing breast imaging surveillance. We compared outcomes of false-positive alerts and the accuracy of CDS1, the more complex CDS2, and CDS3 for outpatients.

Results: Clinical decision support Hooks applications used CQL logic for trigger expressions and logic files and provided recommendations to ED and outpatient providers. CDS1 had a false-positive alert and accuracy of 11.1% and 98%, respectively, not significantly different from CDS2 (0.0% false-positive alerts, P = .33 and 96% accuracy, P = .56) or from CDS3 (0.0% false-positive alerts, P = .15 and 100% accuracy, P = .31).

Discussion: Health Information Technology standards can represent recommendations of varying complexities in various care settings.

Conclusion: The potential to represent CDS using standardized syntax and formats can help facilitate the dissemination of CDS-consumable artifacts.

目的:评估使用健康信息技术(HIT)标准——临床质量语言(CQL)代表临床逻辑和快速医疗互操作性资源(FHIR)代表健康信息交换的不同复杂性和护理环境的临床决策支持(CDS)。材料和方法:本研究经机构审查委员会批准,在一家学术医学中心进行回顾性研究(2023年1月1日至2023年12月31日)。从以患者为中心的结局指南中提取的建议被翻译成标准化语法(SNOMED CT)和表示(CQL, FHIR)。临床决策支持Hooks应用程序开发用于:cds1 -为急诊科(ED)静脉血栓栓塞患者提供教育;cds2 -推荐对疑似肺栓塞(PE)的ED患者进行CT肺血管造影,并使用FHIR问卷资源代表互动内容;cds3建议对有乳腺癌病史的门诊患者进行乳房x光检查/乳房磁共振成像监测。我们随机选择50例疑似PE的ED患者和50例接受乳腺成像监测的门诊患者。我们比较了假阳性预警的结果和门诊患者CDS1、更复杂的CDS2和CDS3的准确性。结果:临床决策支持Hooks应用程序将CQL逻辑用于触发器表达式和逻辑文件,并向急诊科和门诊提供者提供建议。CDS1的假阳性检出率和准确率分别为11.1%和98%,与CDS2无显著差异(0.0%假阳性检出率,P =。33和96%的准确率,P = .56)或来自CDS3(0.0%的假阳性警报,P = .56)。15和100%的准确率,P = .31)。讨论:卫生信息技术标准可以代表各种护理环境中不同复杂性的建议。结论:使用标准化语法和格式表示cd的潜力可以帮助促进cd可消费工件的传播。
{"title":"Assessment of 3 standards-based clinical decision support (CDS) tools in an academic electronic health record using Clinical Quality Language, CDS Hooks, and Fast Healthcare Interoperability Resources: a retrospective evaluation.","authors":"Mark Isabelle, Ivan K Ip, Michael Bakhtin, Louise Schneider, Ali S Raja, Sayon Dutta, Adam Landman, Ronilda Lacson","doi":"10.1093/jamiaopen/ooaf085","DOIUrl":"10.1093/jamiaopen/ooaf085","url":null,"abstract":"<p><strong>Objectives: </strong>To evaluate clinical decision support (CDS) of varying complexities and care settings represented using Health Information Technology (HIT) standards-Clinical Quality Language (CQL) for representing clinical logic and Fast Healthcare Interoperability Resources (FHIR) for health information exchange.</p><p><strong>Materials and methods: </strong>This Institutional Review Board-approved, retrospective study was performed at an academic medical center (January 1, 2023-December 31, 2023). Recommendations extracted from patient-centered outcomes guidelines were translated into standardized syntax (SNOMED CT) and representations (CQL, FHIR). Clinical decision support Hooks applications were developed for: CDS1-provides education for emergency department (ED) patients with venous thromboembolism; CDS2-recommends CT pulmonary angiogram in ED patients with suspected pulmonary embolism (PE) and uses FHIR Questionnaire resources for representing interactive content; CDS3-recommends mammography/breast magnetic resonance imaging surveillance in outpatients with breast cancer history. We randomly selected 50 ED patients with suspected PE and 50 outpatients undergoing breast imaging surveillance. We compared outcomes of false-positive alerts and the accuracy of CDS1, the more complex CDS2, and CDS3 for outpatients.</p><p><strong>Results: </strong>Clinical decision support Hooks applications used CQL logic for trigger expressions and logic files and provided recommendations to ED and outpatient providers. CDS1 had a false-positive alert and accuracy of 11.1% and 98%, respectively, not significantly different from CDS2 (0.0% false-positive alerts, <i>P</i> = .33 and 96% accuracy, <i>P</i> = .56) or from CDS3 (0.0% false-positive alerts, <i>P</i> = .15 and 100% accuracy, <i>P</i> = .31).</p><p><strong>Discussion: </strong>Health Information Technology standards can represent recommendations of varying complexities in various care settings.</p><p><strong>Conclusion: </strong>The potential to represent CDS using standardized syntax and formats can help facilitate the dissemination of CDS-consumable artifacts.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf085"},"PeriodicalIF":3.4,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12309839/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144754634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explaining care need assessment surveys: qualitative and quantitative evaluation of state-of-the-art local and global explainable artificial intelligence methods. 解释护理需求评估调查:对本地和全球最先进的可解释人工智能方法进行定性和定量评估。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-29 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf064
Necip Oğuz Şerbetci, Stefan Blüher, Paul Gellert, Ulf Leser

Objective: With extended life expectancy, the number of people in need of care has been growing. To optimally support them, it is important to know the patterns and conditions of their daily life that influence the need for support, and thus, the classification of the care need. In this study, we aim to utilize a large corpus consisting of care benefits applications to do an explorative analysis of factors affecting care need to support the tedious work of experts gathering reliable criteria for a care need assessment.

Materials and methods: We compare state-of-the-art methods from explainable artificial intelligence (XAI) as means to extract such patterns from over 72 000 German care benefits applications. We train transformer models to predict assessment results as decided by a Medical Service Unit from accompanying text notes. To understand the key factors for care need assessment and its constituent modules (such as mobility and self-therapy), we apply feature attribution methods to extract the key phrases for each prediction. These local explanations are then aggregated into global insights to derive key phrases for different modules and severity of care need over the dataset.

Results: Our experiments show that transformers-based models perform slightly better than traditional bag-of-words baselines in predicting care need. We find that the bag-of-words baseline also provides useful care-relevant phrases, whereas phrases obtained through transformer explanations better balance rare and common phrases, such as diagnoses mentioned only once, and are better in assigning the correct assessment module.

Discussion: Even though XAI results can become unwieldy, they let us get an understanding of thousands of documents with no extra annotations other than existing assessment outcomes.

Conclusion: This work provides a systematic application and comparison of both traditional and state-of-the-art deep learning based XAI approaches to extract insights from a large corpus of text. Both traditional and deep learning approaches provide useful phrases, and we recommend using both to explore and understand large text corpora better. We will make our code available at https://github.com/oguzserbetci/explainer.

目的:随着预期寿命的延长,需要护理的人数不断增加。为了最佳地支持他们,重要的是要了解影响支持需求的日常生活模式和条件,从而了解护理需求的分类。在这项研究中,我们的目标是利用一个由护理福利申请组成的大型语料库,对影响护理需求的因素进行探索性分析,以支持专家收集可靠的护理需求评估标准的繁琐工作。材料和方法:我们比较了来自可解释人工智能(XAI)的最先进的方法,作为从超过72000个德国护理福利申请中提取此类模式的手段。我们训练变压器模型来预测由医疗服务单位根据随附的文本注释决定的评估结果。为了了解护理需求评估的关键因素及其组成模块(如流动性和自我治疗),我们应用特征归因方法提取每个预测的关键短语。然后,这些局部解释被汇总成全局见解,从而得出数据集中不同模块和护理需求严重程度的关键短语。结果:我们的实验表明,基于变压器的模型在预测护理需求方面比传统的词袋基线稍好。我们发现词袋基线也提供了有用的护理相关短语,而通过转换解释获得的短语更好地平衡了罕见和常见短语,例如只提到一次的诊断,并且更好地分配正确的评估模块。讨论:尽管XAI结果可能会变得笨拙,但它们使我们能够理解数千个文档,除了现有的评估结果之外,不需要额外的注释。结论:这项工作提供了传统和最先进的基于深度学习的XAI方法的系统应用和比较,以从大量文本语料库中提取见解。传统和深度学习方法都提供了有用的短语,我们建议使用这两种方法来更好地探索和理解大型文本语料库。我们将在https://github.com/oguzserbetci/explainer上提供我们的代码。
{"title":"Explaining care need assessment surveys: qualitative and quantitative evaluation of state-of-the-art local and global explainable artificial intelligence methods.","authors":"Necip Oğuz Şerbetci, Stefan Blüher, Paul Gellert, Ulf Leser","doi":"10.1093/jamiaopen/ooaf064","DOIUrl":"10.1093/jamiaopen/ooaf064","url":null,"abstract":"<p><strong>Objective: </strong>With extended life expectancy, the number of people in need of care has been growing. To optimally support them, it is important to know the patterns and conditions of their daily life that influence the need for support, and thus, the classification of the care need. In this study, we aim to utilize a large corpus consisting of care benefits applications to do an explorative analysis of factors affecting care need to support the tedious work of experts gathering reliable criteria for a care need assessment.</p><p><strong>Materials and methods: </strong>We compare state-of-the-art methods from explainable artificial intelligence (XAI) as means to extract such patterns from over 72 000 German care benefits applications. We train transformer models to predict assessment results as decided by a Medical Service Unit from accompanying text notes. To understand the key factors for care need assessment and its constituent modules (such as mobility and self-therapy), we apply feature attribution methods to extract the key phrases for each prediction. These local explanations are then aggregated into global insights to derive key phrases for different modules and severity of care need over the dataset.</p><p><strong>Results: </strong>Our experiments show that transformers-based models perform slightly better than traditional bag-of-words baselines in predicting care need. We find that the bag-of-words baseline also provides useful care-relevant phrases, whereas phrases obtained through transformer explanations better balance rare and common phrases, such as diagnoses mentioned only once, and are better in assigning the correct assessment module.</p><p><strong>Discussion: </strong>Even though XAI results can become unwieldy, they let us get an understanding of thousands of documents with no extra annotations other than existing assessment outcomes.</p><p><strong>Conclusion: </strong>This work provides a systematic application and comparison of both traditional and state-of-the-art deep learning based XAI approaches to extract insights from a large corpus of text. Both traditional and deep learning approaches provide useful phrases, and we recommend using both to explore and understand large text corpora better. We will make our code available at https://github.com/oguzserbetci/explainer.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf064"},"PeriodicalIF":3.4,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12307913/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144754636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of 2 electronic health record data extraction methods for laboratory tests used in the Veterans Aging Cohort Study Index. 两种电子健康记录数据提取方法在退伍军人老龄化队列研究索引中的实验室检验比较。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-28 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf074
Christine Ramsey-Hardy, Melissa Skanderson, Janet P Tate, Amy C Justice, Vincent C Marconi, Charles Alcorn, Ronald G Hauser, Amy Anderson-Mellies, Kathleen A McGinnis

Objective: To compare Observational Medical Outcomes Partnership (OMOP) Logical Observation Identifiers Names and Codes (LOINC) and Veterans Aging Cohort Study (VACS) methods for extracting laboratory chemistry data from Veterans Health Administration (VA) electronic health records (EHR).

Materials and methods: Laboratory chemistry test results for 16 laboratory tests commonly assess in Veterans in VACS HIV (N = 143 830) followed in the VA 2015-2019 were extracted from the EHR and compared using 2 different data extraction approaches.

Results: The LOINC approach captured laboratory results from all 130 VA stations for all 16 labs. The VACS approach captured laboratory results from 128 to130 stations. Both approaches yielded laboratory results for a patient on a given date for 97% or more of the observations for 10 of the tests, 94%-97% for 5 of the tests, and 89% for 1 test. The percentage of exact matches on the value of the test result exceeded 99% for 10 of the laboratory tests and 92% for all other laboratory tests.

Discussion: Both approaches resulted in extraction of similar amounts of data in terms of individual patients, VA stations and total observations for all 16 tests. Both approaches yielded high agreement on test results in terms of identical values and correlation of test results for all tests.

Conclusion: The high level of agreement between VACS and LOINC approaches in this study demonstrate the favorable use of the LOINC approach for extracting laboratory results for most tests due to the ease and efficiency of this approach without compromising validity.

目的:比较观察性医疗结局伙伴关系(OMOP)、逻辑观察标识名称和代码(LOINC)和退伍军人老龄化队列研究(VACS)两种方法对退伍军人健康管理局(VA)电子健康档案(EHR)实验室化学数据的提取效果。材料与方法:从电子病历中提取2015-2019年退伍军人VACS HIV感染者常用的16项实验室检测结果(N = 143830),并采用两种不同的数据提取方法进行比较。结果:LOINC方法捕获了所有16个实验室的所有130个VA站的实验室结果。VACS方法收集了128至130个站点的实验室结果。两种方法在给定日期为患者提供了97%或更多的实验室结果,其中10项测试为94%-97%,5项测试为94%-97%,1项测试为89%。在10项实验室测试中,测试结果值的精确匹配百分比超过99%,在所有其他实验室测试中,该百分比超过92%。讨论:两种方法在患者个体、VA站和所有16项试验的总观察值方面提取的数据量相似。就所有测试的相同值和测试结果的相关性而言,这两种方法对测试结果产生了高度一致。结论:本研究中VACS和LOINC方法之间的高度一致性表明,LOINC方法对于提取大多数测试的实验室结果是有利的,因为这种方法易于使用,效率高,而且不影响有效性。
{"title":"Comparison of 2 electronic health record data extraction methods for laboratory tests used in the Veterans Aging Cohort Study Index.","authors":"Christine Ramsey-Hardy, Melissa Skanderson, Janet P Tate, Amy C Justice, Vincent C Marconi, Charles Alcorn, Ronald G Hauser, Amy Anderson-Mellies, Kathleen A McGinnis","doi":"10.1093/jamiaopen/ooaf074","DOIUrl":"10.1093/jamiaopen/ooaf074","url":null,"abstract":"<p><strong>Objective: </strong>To compare Observational Medical Outcomes Partnership (OMOP) Logical Observation Identifiers Names and Codes (LOINC) and Veterans Aging Cohort Study (VACS) methods for extracting laboratory chemistry data from Veterans Health Administration (VA) electronic health records (EHR).</p><p><strong>Materials and methods: </strong>Laboratory chemistry test results for 16 laboratory tests commonly assess in Veterans in VACS HIV (<i>N</i> = 143 830) followed in the VA 2015-2019 were extracted from the EHR and compared using 2 different data extraction approaches.</p><p><strong>Results: </strong>The LOINC approach captured laboratory results from all 130 VA stations for all 16 labs. The VACS approach captured laboratory results from 128 to130 stations. Both approaches yielded laboratory results for a patient on a given date for 97% or more of the observations for 10 of the tests, 94%-97% for 5 of the tests, and 89% for 1 test. The percentage of exact matches on the value of the test result exceeded 99% for 10 of the laboratory tests and 92% for all other laboratory tests.</p><p><strong>Discussion: </strong>Both approaches resulted in extraction of similar amounts of data in terms of individual patients, VA stations and total observations for all 16 tests. Both approaches yielded high agreement on test results in terms of identical values and correlation of test results for all tests.</p><p><strong>Conclusion: </strong>The high level of agreement between VACS and LOINC approaches in this study demonstrate the favorable use of the LOINC approach for extracting laboratory results for most tests due to the ease and efficiency of this approach without compromising validity.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf074"},"PeriodicalIF":3.4,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12303373/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144733689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JAMIA Open
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1