首页 > 最新文献

JAMIA Open最新文献

英文 中文
Chronic obstructive pulmonary disease screening using time-frequency features of self-recorded respiratory sounds. 利用自录呼吸音的时频特征筛查慢性阻塞性肺疾病。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-08-04 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf083
Alberto Tena, Ivan Juez-Garcia, Iván D Benítez, Francesc Clariá, Jessica González, Jordi de Batlle, Francesc Solsona

Objectives: Chronic obstructive pulmonary disease (COPD) is the third leading cause of death worldwide, with up to 70% of cases remaining undiagnosed. This paper proposes a COPD screening tool based on time-frequency representation features of self-recorded respiratory sounds.

Materials and methods: Respiratory sound samples (breath and cough sounds) were extracted from COPD and asymptomatic non-COPD volunteers using a large, scientific-purpose database. We analyzed 39 time-frequency representation features of breath and cough sounds, combined with age, sex, and smoking status, using Autoencoder neural networks and random forest (RF) algorithms. We compared the performance of different breath and cough RF models built to detect COPD: one based exclusively on sound features, one based exclusively on sociodemographic characteristics, and one based on sound features and sociodemographic characteristics.

Results: Models including breathing features outperformed models exclusively based on sociodemographic characteristics. Specifically, the model combining sociodemographic characteristics and breathing features achieved an area under the curve (AUC), accuracy, sensitivity, and specificity of 0.901, 0.836, 0.871, and 0.761, respectively, in the test set, representing a substantial increase in AUC when compared to the model based exclusively on sociodemographic characteristics (0.901 vs 0.818).

Discussion: Our results suggest that a lightweight collection of the time-frequency representation features of self-recorded beathing sounds could effectively improve the predictive performance of COPD screening or case-finding questionnaires.

Conclusion: COPD screening through self-recorded breathing sounds could be easily integrated as a low-cost first step in case-finding programs, potentially contributing to mitigate COPD underdiagnosis.

慢性阻塞性肺疾病(COPD)是全球第三大死亡原因,高达70%的病例仍未得到诊断。本文提出了一种基于自录呼吸音时频表征特征的慢阻肺筛查工具。材料和方法:使用大型科学数据库从COPD和无症状非COPD志愿者中提取呼吸声音样本(呼吸声和咳嗽声)。我们使用Autoencoder神经网络和随机森林(RF)算法,结合年龄、性别和吸烟状况,分析了呼吸和咳嗽声音的39个时频表示特征。我们比较了用于检测COPD的不同呼吸和咳嗽RF模型的性能:一个完全基于声音特征,一个完全基于社会人口特征,一个基于声音特征和社会人口特征。结果:包括呼吸特征的模型优于完全基于社会人口特征的模型。具体而言,结合社会人口特征和呼吸特征的模型在测试集中分别获得了0.901、0.836、0.871和0.761的曲线下面积(AUC)、准确性、灵敏度和特异性,与仅基于社会人口特征的模型相比,AUC大幅增加(0.901 vs 0.818)。讨论:我们的研究结果表明,自录的呼吸声的时间频率表征特征的轻量级集合可以有效地提高COPD筛查或病例查找问卷的预测性能。结论:通过自录呼吸音进行COPD筛查可以很容易地整合为病例发现项目的低成本第一步,可能有助于减轻COPD的漏诊。
{"title":"Chronic obstructive pulmonary disease screening using time-frequency features of self-recorded respiratory sounds.","authors":"Alberto Tena, Ivan Juez-Garcia, Iván D Benítez, Francesc Clariá, Jessica González, Jordi de Batlle, Francesc Solsona","doi":"10.1093/jamiaopen/ooaf083","DOIUrl":"10.1093/jamiaopen/ooaf083","url":null,"abstract":"<p><strong>Objectives: </strong>Chronic obstructive pulmonary disease (COPD) is the third leading cause of death worldwide, with up to 70% of cases remaining undiagnosed. This paper proposes a COPD screening tool based on time-frequency representation features of self-recorded respiratory sounds.</p><p><strong>Materials and methods: </strong>Respiratory sound samples (breath and cough sounds) were extracted from COPD and asymptomatic non-COPD volunteers using a large, scientific-purpose database. We analyzed 39 time-frequency representation features of breath and cough sounds, combined with age, sex, and smoking status, using Autoencoder neural networks and random forest (RF) algorithms. We compared the performance of different breath and cough RF models built to detect COPD: one based exclusively on sound features, one based exclusively on sociodemographic characteristics, and one based on sound features and sociodemographic characteristics.</p><p><strong>Results: </strong>Models including breathing features outperformed models exclusively based on sociodemographic characteristics. Specifically, the model combining sociodemographic characteristics and breathing features achieved an area under the curve (AUC), accuracy, sensitivity, and specificity of 0.901, 0.836, 0.871, and 0.761, respectively, in the test set, representing a substantial increase in AUC when compared to the model based exclusively on sociodemographic characteristics (0.901 vs 0.818).</p><p><strong>Discussion: </strong>Our results suggest that a lightweight collection of the time-frequency representation features of self-recorded beathing sounds could effectively improve the predictive performance of COPD screening or case-finding questionnaires.</p><p><strong>Conclusion: </strong>COPD screening through self-recorded breathing sounds could be easily integrated as a low-cost first step in case-finding programs, potentially contributing to mitigate COPD underdiagnosis.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf083"},"PeriodicalIF":3.4,"publicationDate":"2025-08-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12322310/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144790230","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
VetDash: a clinical dashboard for enhancing measurement-based care in veteran health. VetDash:一个临床仪表板,用于加强退伍军人健康方面的基于测量的护理。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-31 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf075
Santiago Allende, Hayley S Sullivan, Peter J Bayley

Objectives: Measurement-based care (MBC) improves clinical decision-making but remains underutilized in the Veterans Health Administration due to barriers such as provider awareness, time constraints, and user-experience limitations. This study describes the development of the War Related Illness and Injury Study Center Veteran Dashboard (VetDash), a patient-level clinical dashboard designed to integrate the VA's Collect, Share, Act model into the provider workflow.

Materials and methods: VetDash was developed using R Shiny, utilizing data from the WRIISC Clinical Intake Packet Database. It integrates patient-reported health data and military history into a dashboard hosted on a Linux-based Shiny Server within the VA's intranet.

Results: VetDash includes four modules: Patient Characteristics, Patient Health Symptoms, Patient Exposures, and Patient Self-Report Measures. Providers can visualize patient-reported symptoms, military exposures, and self-report measures, and compare patients to provider-defined cohorts.

Discussion and conclusion: VetDash facilitates MBC integration into the clinical workflow, potentially improving personalized patient care. Future research should evaluate its impact on clinical decisions, outcomes, and explore further enhancements.

目的:基于测量的护理(MBC)改善了临床决策,但由于提供者意识、时间限制和用户体验限制等障碍,在退伍军人健康管理中仍未得到充分利用。本研究描述了与战争有关的疾病和伤害研究中心退伍军人仪表板(VetDash)的发展,这是一个病人级的临床仪表板,旨在将VA的收集、分享、行动模型集成到提供者工作流程中。材料和方法:VetDash是使用R Shiny开发的,利用了来自WRIISC临床摄入包数据库的数据。它将病人报告的健康数据和军事历史整合到一个仪表盘中,该仪表盘托管在VA内部网内基于linux的Shiny服务器上。结果:VetDash包括四个模块:患者特征、患者健康症状、患者暴露和患者自我报告测量。提供者可以可视化患者报告的症状、军事暴露和自我报告措施,并将患者与提供者定义的队列进行比较。讨论与结论:VetDash促进了MBC与临床工作流程的整合,有可能改善个性化的患者护理。未来的研究应评估其对临床决策和结果的影响,并探索进一步的增强。
{"title":"VetDash: a clinical dashboard for enhancing measurement-based care in veteran health.","authors":"Santiago Allende, Hayley S Sullivan, Peter J Bayley","doi":"10.1093/jamiaopen/ooaf075","DOIUrl":"10.1093/jamiaopen/ooaf075","url":null,"abstract":"<p><strong>Objectives: </strong>Measurement-based care (MBC) improves clinical decision-making but remains underutilized in the Veterans Health Administration due to barriers such as provider awareness, time constraints, and user-experience limitations. This study describes the development of the War Related Illness and Injury Study Center Veteran Dashboard (VetDash), a patient-level clinical dashboard designed to integrate the VA's <i>Collect, Share, Act</i> model into the provider workflow.</p><p><strong>Materials and methods: </strong>VetDash was developed using R Shiny, utilizing data from the WRIISC Clinical Intake Packet Database. It integrates patient-reported health data and military history into a dashboard hosted on a Linux-based Shiny Server within the VA's intranet.</p><p><strong>Results: </strong>VetDash includes four modules: Patient Characteristics, Patient Health Symptoms, Patient Exposures, and Patient Self-Report Measures. Providers can visualize patient-reported symptoms, military exposures, and self-report measures, and compare patients to provider-defined cohorts.</p><p><strong>Discussion and conclusion: </strong>VetDash facilitates MBC integration into the clinical workflow, potentially improving personalized patient care. Future research should evaluate its impact on clinical decisions, outcomes, and explore further enhancements.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf075"},"PeriodicalIF":3.4,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313089/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144761659","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Forecasting school violence risk with incomplete interview data: an automated assessment approach. 用不完全访谈数据预测校园暴力风险:一种自动评估方法。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-31 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf084
Lara J Kanbar, Alexander Osborn, Andrew Cifuentes, Jennifer Combs, Michael Sorter, Drew Barzman, Judith W Dexheimer

Objectives: School violence risk prevention in the United States relies on manual assessments that are time-consuming and subjective. We developed a machine learning algorithm named Automated RIsk Assessment (ARIA), using natural language processing (NLP) to find linguistic patterns in standardized interview questions that can predict risk of aggression. Our goal was to evaluate the incremental change in performance with the addition of each question to simulate situations where interviews cannot be completed.

Materials and methods: Students were interviewed with 2 14-question risk assessments, the Brief Rating of Aggression by Children and Adolescents (BRACHA) and the School Safety Scale (SSS), that encouraged open-ended answers to the interview questions. The reference standard was defined as the subject's likeliness to display aggression in the future as determined by a forensic psychiatrist. Feature sets were extracted to represent the addition of 1 question at a time in a typical interview, up to and including the 28 total main questions along with other sub-questions that arose. The ARIA NLP pipeline tokenized each feature set, then extracted n-gram features (n 5) that captured contextual and semantic information. The features were evaluated using an L2-regularized logistic regression classifier and L2-regularized support vector machine (L2-SVM) classifier.

Results: Between May 1, 2015 and February 6, 2021, 412 assessment interviews were conducted. When compared to clinical judgement, ARIA performed with an area under the Receiver Operating Characteristic curve of 0.9 after 10 BRACHA questions, suggesting that it remains powerful even with truncated interviews. The full BRACHA had similar performance to the BRACHA + SSS assessment.

Discussion and conclusion: ARIA could use incomplete risk assessment interviews to provide modest recommendations even if interview completion is not possible. This could help to reduce the burden for the social worker or school counselor who may be using ARIA in less-than-ideal conditions.

目标:美国的校园暴力风险预防依赖于耗时且主观的人工评估。我们开发了一种名为自动风险评估(ARIA)的机器学习算法,使用自然语言处理(NLP)在标准化面试问题中找到可以预测攻击风险的语言模式。我们的目标是通过增加每个问题来模拟面试无法完成的情况来评估绩效的增量变化。材料与方法:采用2份14题风险评估问卷对学生进行访谈,问卷采用BRACHA(儿童青少年攻击行为简易评定量表)和SSS(学校安全量表),鼓励开放式回答访谈问题。参照标准定义为被试在未来表现出攻击性的可能性,由法医精神病学家确定。特征集被提取出来,代表在一次典型的采访中每次增加一个问题,最多包括28个主要问题以及其他出现的子问题。ARIA NLP管道对每个特征集进行标记,然后提取捕获上下文和语义信息的n个图特征(n≤5)。使用l2正则化逻辑回归分类器和l2正则化支持向量机(L2-SVM)分类器对特征进行评估。结果:2015年5月1日至2021年2月6日,共进行了412次评估访谈。与临床判断相比,在回答10个BRACHA问题后,ARIA在接受者工作特征曲线下的面积为0.9,这表明即使在截短的访谈中,ARIA仍然很强大。完整的BRACHA与BRACHA + SSS评估的表现相似。讨论和结论:即使不可能完成访谈,ARIA也可以使用不完整的风险评估访谈来提供适度的建议。这可能有助于减轻社会工作者或学校辅导员的负担,他们可能在不太理想的条件下使用ARIA。
{"title":"Forecasting school violence risk with incomplete interview data: an automated assessment approach.","authors":"Lara J Kanbar, Alexander Osborn, Andrew Cifuentes, Jennifer Combs, Michael Sorter, Drew Barzman, Judith W Dexheimer","doi":"10.1093/jamiaopen/ooaf084","DOIUrl":"10.1093/jamiaopen/ooaf084","url":null,"abstract":"<p><strong>Objectives: </strong>School violence risk prevention in the United States relies on manual assessments that are time-consuming and subjective. We developed a machine learning algorithm named Automated RIsk Assessment (ARIA), using natural language processing (NLP) to find linguistic patterns in standardized interview questions that can predict risk of aggression. Our goal was to evaluate the incremental change in performance with the addition of each question to simulate situations where interviews cannot be completed.</p><p><strong>Materials and methods: </strong>Students were interviewed with 2 14-question risk assessments, the Brief Rating of Aggression by Children and Adolescents (BRACHA) and the School Safety Scale (SSS), that encouraged open-ended answers to the interview questions. The reference standard was defined as the subject's likeliness to display aggression in the future as determined by a forensic psychiatrist. Feature sets were extracted to represent the addition of 1 question at a time in a typical interview, up to and including the 28 total main questions along with other sub-questions that arose. The ARIA NLP pipeline tokenized each feature set, then extracted n-gram features (<i>n</i> <b>≤</b> 5) that captured contextual and semantic information. The features were evaluated using an L2-regularized logistic regression classifier and L2-regularized support vector machine (L2-SVM) classifier.</p><p><strong>Results: </strong>Between May 1, 2015 and February 6, 2021, 412 assessment interviews were conducted. When compared to clinical judgement, ARIA performed with an area under the Receiver Operating Characteristic curve of 0.9 after 10 BRACHA questions, suggesting that it remains powerful even with truncated interviews. The full BRACHA had similar performance to the BRACHA + SSS assessment.</p><p><strong>Discussion and conclusion: </strong>ARIA could use incomplete risk assessment interviews to provide modest recommendations even if interview completion is not possible. This could help to reduce the burden for the social worker or school counselor who may be using ARIA in less-than-ideal conditions.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf084"},"PeriodicalIF":3.4,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12313018/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144761658","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing artificial intelligence- vs clinician-authored summaries of simulated primary care electronic health records. 比较人工智能与临床医生撰写的模拟初级保健电子健康记录摘要。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-30 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf082
Lara Shemtob, Abdullah Nouri, Adam Harvey-Sullivan, Connor S Qiu, Jonathan Martin, Martha Martin, Sara Noden, Tanveer Rob, Ana L Neves, Azeem Majeed, Jonathan Clarke, Thomas Beaney

Objective: To compare clinical summaries generated from simulated patient primary care electronic health records (EHRs) by GPT-4, to summaries generated by clinicians on multiple domains of quality including utility, concision, accuracy, and bias.

Materials and methods: Seven primary care physicians generated 70 simulated patient EHR notes, each representing 10 patient contacts with the practice over at least 2 years. Each record was summarized by a different clinician and by GPT-4. artificial intelligence (AI)- and clinician-authored summaries were rated blind by clinicians according to 8 domains of quality and an overall rating.

Results: The median time taken for a clinician to read through and assimilate the information in the EHRs before summarizing, was 7 minutes. Clinicians rated clinician-authored summaries higher than AI-authored summaries overall (7.39 vs 7.00 out of 10; P = .02), but with greater variability in clinician-authored summary ratings. AI and clinician-authored summaries had similar accuracy and AI-authored summaries were less likely to omit important information and more likely to use patient-friendly language.

Discussion: Although AI-authored summaries were rated slightly lower overall compared with clinician-authored summaries, they demonstrated similar accuracy and greater consistency. This demonstrates potential applications for generating summaries in primary care, particularly given the substantial time taken for clinicians to undertake this work.

Conclusion: The results suggest the feasibility, utility and acceptability of using AI-authored summaries to integrate into EHRs to support clinicians in primary care. AI summarization tools have the potential to improve healthcare productivity, including by enabling clinicians to spend more time on direct patient care.

目的:比较GPT-4从模拟患者初级保健电子健康记录(EHRs)生成的临床摘要与临床医生在多个质量领域生成的摘要,包括实用性、简洁性、准确性和偏倚。材料和方法:7名初级保健医生生成70个模拟患者电子病历记录,每个记录代表10名患者在至少2年内与该诊所接触。每个记录由不同的临床医生和GPT-4进行总结。临床医生根据8个质量领域和总体评级,将人工智能(AI)和临床医生撰写的摘要评为盲摘要。结果:临床医生在总结之前通读和吸收电子病历信息的中位时间为7分钟。临床医生认为临床医生撰写的摘要总体上高于人工智能撰写的摘要(7.39 vs 7.00;P = .02),但临床撰写的总结评分差异较大。人工智能和临床医生撰写的摘要具有相似的准确性,人工智能撰写的摘要不太可能遗漏重要信息,更有可能使用对患者友好的语言。讨论:尽管与临床医生撰写的摘要相比,人工智能撰写的摘要的总体评分略低,但它们表现出相似的准确性和更高的一致性。这证明了在初级保健中生成摘要的潜在应用,特别是考虑到临床医生需要花费大量时间来进行这项工作。结论:研究结果表明,将人工智能撰写的摘要整合到电子病历中,以支持临床医生进行初级保健,具有可行性、实用性和可接受性。人工智能总结工具有可能提高医疗保健工作效率,包括使临床医生能够将更多时间用于直接护理患者。
{"title":"Comparing artificial intelligence- vs clinician-authored summaries of simulated primary care electronic health records.","authors":"Lara Shemtob, Abdullah Nouri, Adam Harvey-Sullivan, Connor S Qiu, Jonathan Martin, Martha Martin, Sara Noden, Tanveer Rob, Ana L Neves, Azeem Majeed, Jonathan Clarke, Thomas Beaney","doi":"10.1093/jamiaopen/ooaf082","DOIUrl":"10.1093/jamiaopen/ooaf082","url":null,"abstract":"<p><strong>Objective: </strong>To compare clinical summaries generated from simulated patient primary care electronic health records (EHRs) by GPT-4, to summaries generated by clinicians on multiple domains of quality including utility, concision, accuracy, and bias.</p><p><strong>Materials and methods: </strong>Seven primary care physicians generated 70 simulated patient EHR notes, each representing 10 patient contacts with the practice over at least 2 years. Each record was summarized by a different clinician and by GPT-4. artificial intelligence (AI)- and clinician-authored summaries were rated blind by clinicians according to 8 domains of quality and an overall rating.</p><p><strong>Results: </strong>The median time taken for a clinician to read through and assimilate the information in the EHRs before summarizing, was 7 minutes. Clinicians rated clinician-authored summaries higher than AI-authored summaries overall (7.39 vs 7.00 out of 10; <i>P</i> = .02), but with greater variability in clinician-authored summary ratings. AI and clinician-authored summaries had similar accuracy and AI-authored summaries were less likely to omit important information and more likely to use patient-friendly language.</p><p><strong>Discussion: </strong>Although AI-authored summaries were rated slightly lower overall compared with clinician-authored summaries, they demonstrated similar accuracy and greater consistency. This demonstrates potential applications for generating summaries in primary care, particularly given the substantial time taken for clinicians to undertake this work.</p><p><strong>Conclusion: </strong>The results suggest the feasibility, utility and acceptability of using AI-authored summaries to integrate into EHRs to support clinicians in primary care. AI summarization tools have the potential to improve healthcare productivity, including by enabling clinicians to spend more time on direct patient care.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf082"},"PeriodicalIF":3.4,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12309840/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144754635","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of 3 standards-based clinical decision support (CDS) tools in an academic electronic health record using Clinical Quality Language, CDS Hooks, and Fast Healthcare Interoperability Resources: a retrospective evaluation. 使用临床质量语言、CDS Hooks和快速医疗保健互操作性资源评估学术电子健康记录中3种基于标准的临床决策支持(CDS)工具:回顾性评估。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-30 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf085
Mark Isabelle, Ivan K Ip, Michael Bakhtin, Louise Schneider, Ali S Raja, Sayon Dutta, Adam Landman, Ronilda Lacson

Objectives: To evaluate clinical decision support (CDS) of varying complexities and care settings represented using Health Information Technology (HIT) standards-Clinical Quality Language (CQL) for representing clinical logic and Fast Healthcare Interoperability Resources (FHIR) for health information exchange.

Materials and methods: This Institutional Review Board-approved, retrospective study was performed at an academic medical center (January 1, 2023-December 31, 2023). Recommendations extracted from patient-centered outcomes guidelines were translated into standardized syntax (SNOMED CT) and representations (CQL, FHIR). Clinical decision support Hooks applications were developed for: CDS1-provides education for emergency department (ED) patients with venous thromboembolism; CDS2-recommends CT pulmonary angiogram in ED patients with suspected pulmonary embolism (PE) and uses FHIR Questionnaire resources for representing interactive content; CDS3-recommends mammography/breast magnetic resonance imaging surveillance in outpatients with breast cancer history. We randomly selected 50 ED patients with suspected PE and 50 outpatients undergoing breast imaging surveillance. We compared outcomes of false-positive alerts and the accuracy of CDS1, the more complex CDS2, and CDS3 for outpatients.

Results: Clinical decision support Hooks applications used CQL logic for trigger expressions and logic files and provided recommendations to ED and outpatient providers. CDS1 had a false-positive alert and accuracy of 11.1% and 98%, respectively, not significantly different from CDS2 (0.0% false-positive alerts, P = .33 and 96% accuracy, P = .56) or from CDS3 (0.0% false-positive alerts, P = .15 and 100% accuracy, P = .31).

Discussion: Health Information Technology standards can represent recommendations of varying complexities in various care settings.

Conclusion: The potential to represent CDS using standardized syntax and formats can help facilitate the dissemination of CDS-consumable artifacts.

目的:评估使用健康信息技术(HIT)标准——临床质量语言(CQL)代表临床逻辑和快速医疗互操作性资源(FHIR)代表健康信息交换的不同复杂性和护理环境的临床决策支持(CDS)。材料和方法:本研究经机构审查委员会批准,在一家学术医学中心进行回顾性研究(2023年1月1日至2023年12月31日)。从以患者为中心的结局指南中提取的建议被翻译成标准化语法(SNOMED CT)和表示(CQL, FHIR)。临床决策支持Hooks应用程序开发用于:cds1 -为急诊科(ED)静脉血栓栓塞患者提供教育;cds2 -推荐对疑似肺栓塞(PE)的ED患者进行CT肺血管造影,并使用FHIR问卷资源代表互动内容;cds3建议对有乳腺癌病史的门诊患者进行乳房x光检查/乳房磁共振成像监测。我们随机选择50例疑似PE的ED患者和50例接受乳腺成像监测的门诊患者。我们比较了假阳性预警的结果和门诊患者CDS1、更复杂的CDS2和CDS3的准确性。结果:临床决策支持Hooks应用程序将CQL逻辑用于触发器表达式和逻辑文件,并向急诊科和门诊提供者提供建议。CDS1的假阳性检出率和准确率分别为11.1%和98%,与CDS2无显著差异(0.0%假阳性检出率,P =。33和96%的准确率,P = .56)或来自CDS3(0.0%的假阳性警报,P = .56)。15和100%的准确率,P = .31)。讨论:卫生信息技术标准可以代表各种护理环境中不同复杂性的建议。结论:使用标准化语法和格式表示cd的潜力可以帮助促进cd可消费工件的传播。
{"title":"Assessment of 3 standards-based clinical decision support (CDS) tools in an academic electronic health record using Clinical Quality Language, CDS Hooks, and Fast Healthcare Interoperability Resources: a retrospective evaluation.","authors":"Mark Isabelle, Ivan K Ip, Michael Bakhtin, Louise Schneider, Ali S Raja, Sayon Dutta, Adam Landman, Ronilda Lacson","doi":"10.1093/jamiaopen/ooaf085","DOIUrl":"10.1093/jamiaopen/ooaf085","url":null,"abstract":"<p><strong>Objectives: </strong>To evaluate clinical decision support (CDS) of varying complexities and care settings represented using Health Information Technology (HIT) standards-Clinical Quality Language (CQL) for representing clinical logic and Fast Healthcare Interoperability Resources (FHIR) for health information exchange.</p><p><strong>Materials and methods: </strong>This Institutional Review Board-approved, retrospective study was performed at an academic medical center (January 1, 2023-December 31, 2023). Recommendations extracted from patient-centered outcomes guidelines were translated into standardized syntax (SNOMED CT) and representations (CQL, FHIR). Clinical decision support Hooks applications were developed for: CDS1-provides education for emergency department (ED) patients with venous thromboembolism; CDS2-recommends CT pulmonary angiogram in ED patients with suspected pulmonary embolism (PE) and uses FHIR Questionnaire resources for representing interactive content; CDS3-recommends mammography/breast magnetic resonance imaging surveillance in outpatients with breast cancer history. We randomly selected 50 ED patients with suspected PE and 50 outpatients undergoing breast imaging surveillance. We compared outcomes of false-positive alerts and the accuracy of CDS1, the more complex CDS2, and CDS3 for outpatients.</p><p><strong>Results: </strong>Clinical decision support Hooks applications used CQL logic for trigger expressions and logic files and provided recommendations to ED and outpatient providers. CDS1 had a false-positive alert and accuracy of 11.1% and 98%, respectively, not significantly different from CDS2 (0.0% false-positive alerts, <i>P</i> = .33 and 96% accuracy, <i>P</i> = .56) or from CDS3 (0.0% false-positive alerts, <i>P</i> = .15 and 100% accuracy, <i>P</i> = .31).</p><p><strong>Discussion: </strong>Health Information Technology standards can represent recommendations of varying complexities in various care settings.</p><p><strong>Conclusion: </strong>The potential to represent CDS using standardized syntax and formats can help facilitate the dissemination of CDS-consumable artifacts.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf085"},"PeriodicalIF":3.4,"publicationDate":"2025-07-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12309839/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144754634","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Explaining care need assessment surveys: qualitative and quantitative evaluation of state-of-the-art local and global explainable artificial intelligence methods. 解释护理需求评估调查:对本地和全球最先进的可解释人工智能方法进行定性和定量评估。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-29 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf064
Necip Oğuz Şerbetci, Stefan Blüher, Paul Gellert, Ulf Leser

Objective: With extended life expectancy, the number of people in need of care has been growing. To optimally support them, it is important to know the patterns and conditions of their daily life that influence the need for support, and thus, the classification of the care need. In this study, we aim to utilize a large corpus consisting of care benefits applications to do an explorative analysis of factors affecting care need to support the tedious work of experts gathering reliable criteria for a care need assessment.

Materials and methods: We compare state-of-the-art methods from explainable artificial intelligence (XAI) as means to extract such patterns from over 72 000 German care benefits applications. We train transformer models to predict assessment results as decided by a Medical Service Unit from accompanying text notes. To understand the key factors for care need assessment and its constituent modules (such as mobility and self-therapy), we apply feature attribution methods to extract the key phrases for each prediction. These local explanations are then aggregated into global insights to derive key phrases for different modules and severity of care need over the dataset.

Results: Our experiments show that transformers-based models perform slightly better than traditional bag-of-words baselines in predicting care need. We find that the bag-of-words baseline also provides useful care-relevant phrases, whereas phrases obtained through transformer explanations better balance rare and common phrases, such as diagnoses mentioned only once, and are better in assigning the correct assessment module.

Discussion: Even though XAI results can become unwieldy, they let us get an understanding of thousands of documents with no extra annotations other than existing assessment outcomes.

Conclusion: This work provides a systematic application and comparison of both traditional and state-of-the-art deep learning based XAI approaches to extract insights from a large corpus of text. Both traditional and deep learning approaches provide useful phrases, and we recommend using both to explore and understand large text corpora better. We will make our code available at https://github.com/oguzserbetci/explainer.

目的:随着预期寿命的延长,需要护理的人数不断增加。为了最佳地支持他们,重要的是要了解影响支持需求的日常生活模式和条件,从而了解护理需求的分类。在这项研究中,我们的目标是利用一个由护理福利申请组成的大型语料库,对影响护理需求的因素进行探索性分析,以支持专家收集可靠的护理需求评估标准的繁琐工作。材料和方法:我们比较了来自可解释人工智能(XAI)的最先进的方法,作为从超过72000个德国护理福利申请中提取此类模式的手段。我们训练变压器模型来预测由医疗服务单位根据随附的文本注释决定的评估结果。为了了解护理需求评估的关键因素及其组成模块(如流动性和自我治疗),我们应用特征归因方法提取每个预测的关键短语。然后,这些局部解释被汇总成全局见解,从而得出数据集中不同模块和护理需求严重程度的关键短语。结果:我们的实验表明,基于变压器的模型在预测护理需求方面比传统的词袋基线稍好。我们发现词袋基线也提供了有用的护理相关短语,而通过转换解释获得的短语更好地平衡了罕见和常见短语,例如只提到一次的诊断,并且更好地分配正确的评估模块。讨论:尽管XAI结果可能会变得笨拙,但它们使我们能够理解数千个文档,除了现有的评估结果之外,不需要额外的注释。结论:这项工作提供了传统和最先进的基于深度学习的XAI方法的系统应用和比较,以从大量文本语料库中提取见解。传统和深度学习方法都提供了有用的短语,我们建议使用这两种方法来更好地探索和理解大型文本语料库。我们将在https://github.com/oguzserbetci/explainer上提供我们的代码。
{"title":"Explaining care need assessment surveys: qualitative and quantitative evaluation of state-of-the-art local and global explainable artificial intelligence methods.","authors":"Necip Oğuz Şerbetci, Stefan Blüher, Paul Gellert, Ulf Leser","doi":"10.1093/jamiaopen/ooaf064","DOIUrl":"10.1093/jamiaopen/ooaf064","url":null,"abstract":"<p><strong>Objective: </strong>With extended life expectancy, the number of people in need of care has been growing. To optimally support them, it is important to know the patterns and conditions of their daily life that influence the need for support, and thus, the classification of the care need. In this study, we aim to utilize a large corpus consisting of care benefits applications to do an explorative analysis of factors affecting care need to support the tedious work of experts gathering reliable criteria for a care need assessment.</p><p><strong>Materials and methods: </strong>We compare state-of-the-art methods from explainable artificial intelligence (XAI) as means to extract such patterns from over 72 000 German care benefits applications. We train transformer models to predict assessment results as decided by a Medical Service Unit from accompanying text notes. To understand the key factors for care need assessment and its constituent modules (such as mobility and self-therapy), we apply feature attribution methods to extract the key phrases for each prediction. These local explanations are then aggregated into global insights to derive key phrases for different modules and severity of care need over the dataset.</p><p><strong>Results: </strong>Our experiments show that transformers-based models perform slightly better than traditional bag-of-words baselines in predicting care need. We find that the bag-of-words baseline also provides useful care-relevant phrases, whereas phrases obtained through transformer explanations better balance rare and common phrases, such as diagnoses mentioned only once, and are better in assigning the correct assessment module.</p><p><strong>Discussion: </strong>Even though XAI results can become unwieldy, they let us get an understanding of thousands of documents with no extra annotations other than existing assessment outcomes.</p><p><strong>Conclusion: </strong>This work provides a systematic application and comparison of both traditional and state-of-the-art deep learning based XAI approaches to extract insights from a large corpus of text. Both traditional and deep learning approaches provide useful phrases, and we recommend using both to explore and understand large text corpora better. We will make our code available at https://github.com/oguzserbetci/explainer.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf064"},"PeriodicalIF":3.4,"publicationDate":"2025-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12307913/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144754636","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparison of 2 electronic health record data extraction methods for laboratory tests used in the Veterans Aging Cohort Study Index. 两种电子健康记录数据提取方法在退伍军人老龄化队列研究索引中的实验室检验比较。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-28 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf074
Christine Ramsey-Hardy, Melissa Skanderson, Janet P Tate, Amy C Justice, Vincent C Marconi, Charles Alcorn, Ronald G Hauser, Amy Anderson-Mellies, Kathleen A McGinnis

Objective: To compare Observational Medical Outcomes Partnership (OMOP) Logical Observation Identifiers Names and Codes (LOINC) and Veterans Aging Cohort Study (VACS) methods for extracting laboratory chemistry data from Veterans Health Administration (VA) electronic health records (EHR).

Materials and methods: Laboratory chemistry test results for 16 laboratory tests commonly assess in Veterans in VACS HIV (N = 143 830) followed in the VA 2015-2019 were extracted from the EHR and compared using 2 different data extraction approaches.

Results: The LOINC approach captured laboratory results from all 130 VA stations for all 16 labs. The VACS approach captured laboratory results from 128 to130 stations. Both approaches yielded laboratory results for a patient on a given date for 97% or more of the observations for 10 of the tests, 94%-97% for 5 of the tests, and 89% for 1 test. The percentage of exact matches on the value of the test result exceeded 99% for 10 of the laboratory tests and 92% for all other laboratory tests.

Discussion: Both approaches resulted in extraction of similar amounts of data in terms of individual patients, VA stations and total observations for all 16 tests. Both approaches yielded high agreement on test results in terms of identical values and correlation of test results for all tests.

Conclusion: The high level of agreement between VACS and LOINC approaches in this study demonstrate the favorable use of the LOINC approach for extracting laboratory results for most tests due to the ease and efficiency of this approach without compromising validity.

目的:比较观察性医疗结局伙伴关系(OMOP)、逻辑观察标识名称和代码(LOINC)和退伍军人老龄化队列研究(VACS)两种方法对退伍军人健康管理局(VA)电子健康档案(EHR)实验室化学数据的提取效果。材料与方法:从电子病历中提取2015-2019年退伍军人VACS HIV感染者常用的16项实验室检测结果(N = 143830),并采用两种不同的数据提取方法进行比较。结果:LOINC方法捕获了所有16个实验室的所有130个VA站的实验室结果。VACS方法收集了128至130个站点的实验室结果。两种方法在给定日期为患者提供了97%或更多的实验室结果,其中10项测试为94%-97%,5项测试为94%-97%,1项测试为89%。在10项实验室测试中,测试结果值的精确匹配百分比超过99%,在所有其他实验室测试中,该百分比超过92%。讨论:两种方法在患者个体、VA站和所有16项试验的总观察值方面提取的数据量相似。就所有测试的相同值和测试结果的相关性而言,这两种方法对测试结果产生了高度一致。结论:本研究中VACS和LOINC方法之间的高度一致性表明,LOINC方法对于提取大多数测试的实验室结果是有利的,因为这种方法易于使用,效率高,而且不影响有效性。
{"title":"Comparison of 2 electronic health record data extraction methods for laboratory tests used in the Veterans Aging Cohort Study Index.","authors":"Christine Ramsey-Hardy, Melissa Skanderson, Janet P Tate, Amy C Justice, Vincent C Marconi, Charles Alcorn, Ronald G Hauser, Amy Anderson-Mellies, Kathleen A McGinnis","doi":"10.1093/jamiaopen/ooaf074","DOIUrl":"10.1093/jamiaopen/ooaf074","url":null,"abstract":"<p><strong>Objective: </strong>To compare Observational Medical Outcomes Partnership (OMOP) Logical Observation Identifiers Names and Codes (LOINC) and Veterans Aging Cohort Study (VACS) methods for extracting laboratory chemistry data from Veterans Health Administration (VA) electronic health records (EHR).</p><p><strong>Materials and methods: </strong>Laboratory chemistry test results for 16 laboratory tests commonly assess in Veterans in VACS HIV (<i>N</i> = 143 830) followed in the VA 2015-2019 were extracted from the EHR and compared using 2 different data extraction approaches.</p><p><strong>Results: </strong>The LOINC approach captured laboratory results from all 130 VA stations for all 16 labs. The VACS approach captured laboratory results from 128 to130 stations. Both approaches yielded laboratory results for a patient on a given date for 97% or more of the observations for 10 of the tests, 94%-97% for 5 of the tests, and 89% for 1 test. The percentage of exact matches on the value of the test result exceeded 99% for 10 of the laboratory tests and 92% for all other laboratory tests.</p><p><strong>Discussion: </strong>Both approaches resulted in extraction of similar amounts of data in terms of individual patients, VA stations and total observations for all 16 tests. Both approaches yielded high agreement on test results in terms of identical values and correlation of test results for all tests.</p><p><strong>Conclusion: </strong>The high level of agreement between VACS and LOINC approaches in this study demonstrate the favorable use of the LOINC approach for extracting laboratory results for most tests due to the ease and efficiency of this approach without compromising validity.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf074"},"PeriodicalIF":3.4,"publicationDate":"2025-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12303373/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144733689","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Advancing methodological development of artificial intelligence in patient-centered comparative clinical effectiveness research: Patient-Centered Outcomes Research Institute's unique contribution to research done differently. 推进人工智能在以患者为中心的比较临床疗效研究中的方法学发展:以患者为中心的结果研究所对不同研究的独特贡献。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-26 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf081
Jinghua Ou, Erin Holve

Background: Recent advancements of Artificial Intelligence (AI) are rapidly transforming clinical research. While this technology offers exciting opportunities, it amplifies existing concerns regarding the need for transparent methodology that fosters patient engagement, and introduces new challenges. PCORI's Improving Methods portfolio has invested in methodological research to enhance rigor and transparency via patient-centered approaches in AI.

Objective: This commentary outlines PCORI's approach to funding and promoting a portfolio of methodological research that aims to improve the conduct of patient-centered comparative clinical effectiveness research (CER), with a focus on AI methods. The paper highlights a growing portfolio of over 40 AI related projects, including a recent cohort leveraging large language models to augment research processes in CER.

Discussion: PCORI's current portfolio of methods projects in AI illustrate timely opportunities for the clinical research informatics community to develop and assess AI applications that will further advance a robust, interoperable and ethical infrastructure for patient-centered CER. PCORI's requirement for ongoing, meaningful engagement of patients throughout the research lifecycle provides a blueprint for patient-centered AI by developing and applying models and methods designed to create value for patients and other healthcare partners.

背景:人工智能(AI)的最新进展正在迅速改变临床研究。虽然这项技术提供了令人兴奋的机会,但它放大了现有的担忧,即需要透明的方法来促进患者的参与,并引入了新的挑战。PCORI的改进方法组合投资于方法研究,通过以患者为中心的方法提高人工智能的严谨性和透明度。目的:本评论概述了PCORI资助和促进一系列方法学研究的方法,旨在改善以患者为中心的比较临床有效性研究(CER)的实施,重点是人工智能方法。该论文强调了40多个人工智能相关项目的不断增长的投资组合,包括最近利用大型语言模型来增强CER研究过程的队列。讨论:PCORI目前在人工智能方面的方法项目组合为临床研究信息学社区开发和评估人工智能应用提供了及时的机会,这些应用将进一步推进以患者为中心的CER的强大,可互操作和道德基础设施。PCORI对患者在整个研究生命周期中持续、有意义的参与的要求,通过开发和应用旨在为患者和其他医疗保健合作伙伴创造价值的模型和方法,为以患者为中心的人工智能提供了蓝图。
{"title":"Advancing methodological development of artificial intelligence in patient-centered comparative clinical effectiveness research: Patient-Centered Outcomes Research Institute's unique contribution to research done differently.","authors":"Jinghua Ou, Erin Holve","doi":"10.1093/jamiaopen/ooaf081","DOIUrl":"10.1093/jamiaopen/ooaf081","url":null,"abstract":"<p><strong>Background: </strong>Recent advancements of Artificial Intelligence (AI) are rapidly transforming clinical research. While this technology offers exciting opportunities, it amplifies existing concerns regarding the need for transparent methodology that fosters patient engagement, and introduces new challenges. PCORI's Improving Methods portfolio has invested in methodological research to enhance rigor and transparency via patient-centered approaches in AI.</p><p><strong>Objective: </strong>This commentary outlines PCORI's approach to funding and promoting a portfolio of methodological research that aims to improve the conduct of patient-centered comparative clinical effectiveness research (CER), with a focus on AI methods. The paper highlights a growing portfolio of over 40 AI related projects, including a recent cohort leveraging large language models to augment research processes in CER.</p><p><strong>Discussion: </strong>PCORI's current portfolio of methods projects in AI illustrate timely opportunities for the clinical research informatics community to develop and assess AI applications that will further advance a robust, interoperable and ethical infrastructure for patient-centered CER. PCORI's requirement for ongoing, meaningful engagement of patients throughout the research lifecycle provides a blueprint for patient-centered AI by developing and applying models and methods designed to create value for patients and other healthcare partners.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf081"},"PeriodicalIF":3.4,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12296393/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144733674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A fair machine learning model to predict flares of systemic lupus erythematosus. 一个公平的机器学习模型来预测系统性红斑狼疮的耀斑。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-26 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf072
Yongqiu Li, Lixia Yao, Yao An Lee, Yu Huang, Peter A Merkel, Ernest Vina, Ya-Yun Yeh, Yujia Li, John M Allen, Jiang Bian, Jingchuan Guo

Objective: Systemic lupus erythematosus (SLE) is a chronic autoimmune disease that disproportionately affects women and racial/ethnic minority groups. Predicting disease flares is essential for improving patient outcomes, yet few studies integrate both clinical and social determinants of health (SDoH). We therefore developed FLAME (FLAre Machine learning prediction of SLE), a machine learning pipeline that uses electronic health records (EHRs) and contextual-level SDoH to predict 3-month flare risk, emphasizing explainability and fairness.

Materials and methods: We conducted a retrospective cohort study of 28 433 patients with SLE from the University of Florida Health (2011-2022), linked to 675 contextual-level SDoH variables. We used XGBoost and logistic regression models to predict 3-month flare risk, evaluating model performance using the area under the receiver operating characteristic (AUROC). We applied SHapley Additive exPlanations (SHAP) values and causal structure learning to identify key predictors. Fairness was assessed using the equality of opportunity metric, measured by the false-negative rate across racial/ethnic groups.

Results: The FLAME model, incorporating clinical and contextual-level SDoH, achieved an AUROC of 0.66. The clinical-only model performed slightly better (AUROC of 0.67), while the SDoH-only model had lower performance (AUROC of 0.54). SHAP analysis identified headache, organic brain syndrome, and pyuria as key predictors. Causal learning revealed interactions between clinical factors and contextual-level SDoH. Fairness assessments showed no significant biases across groups.

Discussion: FLAME offers a fair and interpretable approach to predicting SLE flares, providing meaningful insights that may guide future clinical interventions.

Conclusions: FLAME shows promise as an EHR-based tool to support personalized, equitable, and holistic SLE care.

目的:系统性红斑狼疮(SLE)是一种慢性自身免疫性疾病,多发于女性和少数族裔群体。预测疾病爆发对于改善患者预后至关重要,但很少有研究将健康的临床和社会决定因素(SDoH)结合起来。因此,我们开发了FLAME (SLE的FLAre机器学习预测),这是一种机器学习管道,使用电子健康记录(EHRs)和情境级SDoH来预测3个月的耀斑风险,强调可解释性和公平性。材料和方法:我们对来自佛罗里达健康大学(2011-2022)的28433例SLE患者进行了一项回顾性队列研究,与675个背景水平的SDoH变量相关。我们使用XGBoost和逻辑回归模型来预测3个月的耀斑风险,并使用接收器工作特征下的面积(AUROC)来评估模型的性能。我们应用SHapley加性解释(SHAP)值和因果结构学习来识别关键预测因子。公平是用机会均等指标来评估的,通过不同种族/民族群体的假阴性率来衡量。结果:纳入临床和情境水平SDoH的FLAME模型的AUROC为0.66。单纯临床模型的AUROC略好(0.67),单纯sdoh模型的AUROC较低(0.54)。SHAP分析发现头痛、器质性脑综合征和脓尿是主要的预测因素。因果学习揭示了临床因素与情境水平SDoH之间的相互作用。公平评估显示各组之间没有明显的偏见。讨论:FLAME为预测SLE耀斑提供了一种公平且可解释的方法,为指导未来的临床干预提供了有意义的见解。结论:FLAME有望作为一种基于电子病历的工具,支持个性化、公平和全面的SLE护理。
{"title":"A fair machine learning model to predict flares of systemic lupus erythematosus.","authors":"Yongqiu Li, Lixia Yao, Yao An Lee, Yu Huang, Peter A Merkel, Ernest Vina, Ya-Yun Yeh, Yujia Li, John M Allen, Jiang Bian, Jingchuan Guo","doi":"10.1093/jamiaopen/ooaf072","DOIUrl":"10.1093/jamiaopen/ooaf072","url":null,"abstract":"<p><strong>Objective: </strong>Systemic lupus erythematosus (SLE) is a chronic autoimmune disease that disproportionately affects women and racial/ethnic minority groups. Predicting disease flares is essential for improving patient outcomes, yet few studies integrate both clinical and social determinants of health (SDoH). We therefore developed FLAME (<b>FLA</b>re <b>M</b>achine learning prediction of SL<b>E</b>), a machine learning pipeline that uses electronic health records (EHRs) and contextual-level SDoH to predict 3-month flare risk, emphasizing explainability and fairness.</p><p><strong>Materials and methods: </strong>We conducted a retrospective cohort study of 28 433 patients with SLE from the University of Florida Health (2011-2022), linked to 675 contextual-level SDoH variables. We used XGBoost and logistic regression models to predict 3-month flare risk, evaluating model performance using the area under the receiver operating characteristic (AUROC). We applied SHapley Additive exPlanations (SHAP) values and causal structure learning to identify key predictors. Fairness was assessed using the equality of opportunity metric, measured by the false-negative rate across racial/ethnic groups.</p><p><strong>Results: </strong>The FLAME model, incorporating clinical and contextual-level SDoH, achieved an AUROC of 0.66. The clinical-only model performed slightly better (AUROC of 0.67), while the SDoH-only model had lower performance (AUROC of 0.54). SHAP analysis identified headache, organic brain syndrome, and pyuria as key predictors. Causal learning revealed interactions between clinical factors and contextual-level SDoH. Fairness assessments showed no significant biases across groups.</p><p><strong>Discussion: </strong>FLAME offers a fair and interpretable approach to predicting SLE flares, providing meaningful insights that may guide future clinical interventions.</p><p><strong>Conclusions: </strong>FLAME shows promise as an EHR-based tool to support personalized, equitable, and holistic SLE care.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf072"},"PeriodicalIF":3.4,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12296391/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144733673","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Social vulnerability, lower broadband internet access, and rurality associated with lower telemedicine use in U.S. Counties. 在美国各县,社会脆弱性、较低的宽带互联网接入和乡村性与较低的远程医疗使用有关。
IF 3.4 Q2 HEALTH CARE SCIENCES & SERVICES Pub Date : 2025-07-26 eCollection Date: 2025-08-01 DOI: 10.1093/jamiaopen/ooaf056
Mollie R Cummins, Bob Wong, Neng Wan, Jiuying Han, Sukrut D Shishupal, Ramkiran Gouripeddi, Julia Ivanova, Asiyah Franklin, Jace Johnny, Triton Ong, Brandon M Welch, Brian E Bunnell

Objective: Our objective was to determine how social vulnerabilities, broadband access, and rurality relate to telemedicine use across the United States through large-scale analysis of real-world telemedicine data.

Materials and methods: We conducted a retrospective, observational study of dyadic U.S. telemedicine sessions that occurred January 1, 2022 to December 31, 2022, linked to the 2020 Centers for Disease Control and Prevention Social Vulnerability Index (SVI) and the National Center for Health Statistics Urban-Rural Classification Scheme for Counties. We examined county-level telemedicine use rates (sessions per 1000 population) in relation to SVI indexes, broadband internet access, and rurality classifications using polynomial regression and data visualization.

Results: We found a negative, nonlinear association between overall social and socioeconomic status vulnerabilities and telemedicine use. Telemedicine rates in urban counties exceeded that of rural counties. There was more variability in telemedicine use for the urban counties according to social vulnerability and broadband access.

Discussion: Rurality and broadband access demonstrated a greater effect on telemedicine use than social vulnerability, and the relationship between social vulnerability, broadband access, and telemedicine use differed for rural versus urban areas.

Conclusion: This observational study of nearly 8 million U.S. telemedicine sessions showed that rurality and broadband access are key drivers of telemedicine use and may be more important than many social vulnerabilities in determining community-level telemedicine use. We also found nuanced differences in the relationship between social vulnerability and telemedicine use between rural and urban counties, and at different levels of broadband access.

目的:我们的目标是通过对真实世界远程医疗数据的大规模分析,确定美国各地的社会脆弱性、宽带接入和乡村性与远程医疗使用之间的关系。材料和方法:我们对发生在2022年1月1日至2022年12月31日的美国双元远程医疗会议进行了回顾性观察研究,这些会议与2020年疾病控制和预防中心的社会脆弱性指数(SVI)和国家卫生统计中心的城乡分类方案有关。我们使用多项式回归和数据可视化研究了县级远程医疗使用率(每1000人的会话)与SVI指数、宽带互联网接入和农村分类的关系。结果:我们发现整体社会和社会经济地位脆弱性与远程医疗使用之间存在负的非线性关联。城市县的远程医疗率高于农村县。根据社会脆弱性和宽带接入情况,城市县的远程医疗使用存在较大差异。讨论:农村和宽带接入对远程医疗使用的影响大于社会脆弱性,社会脆弱性、宽带接入和远程医疗使用之间的关系在农村和城市地区有所不同。结论:这项对近800万美国远程医疗会议的观察性研究表明,乡村性和宽带接入是远程医疗使用的关键驱动因素,在决定社区级远程医疗使用方面,可能比许多社会脆弱性更重要。我们还发现,在农村和城市县之间,以及在不同的宽带接入水平上,社会脆弱性与远程医疗使用之间的关系存在细微差异。
{"title":"Social vulnerability, lower broadband internet access, and rurality associated with lower telemedicine use in U.S. Counties.","authors":"Mollie R Cummins, Bob Wong, Neng Wan, Jiuying Han, Sukrut D Shishupal, Ramkiran Gouripeddi, Julia Ivanova, Asiyah Franklin, Jace Johnny, Triton Ong, Brandon M Welch, Brian E Bunnell","doi":"10.1093/jamiaopen/ooaf056","DOIUrl":"10.1093/jamiaopen/ooaf056","url":null,"abstract":"<p><strong>Objective: </strong>Our objective was to determine how social vulnerabilities, broadband access, and rurality relate to telemedicine use across the United States through large-scale analysis of real-world telemedicine data.</p><p><strong>Materials and methods: </strong>We conducted a retrospective, observational study of dyadic U.S. telemedicine sessions that occurred January 1, 2022 to December 31, 2022, linked to the 2020 Centers for Disease Control and Prevention Social Vulnerability Index (SVI) and the National Center for Health Statistics Urban-Rural Classification Scheme for Counties. We examined county-level telemedicine use rates (sessions per 1000 population) in relation to SVI indexes, broadband internet access, and rurality classifications using polynomial regression and data visualization.</p><p><strong>Results: </strong>We found a negative, nonlinear association between overall social and socioeconomic status vulnerabilities and telemedicine use. Telemedicine rates in urban counties exceeded that of rural counties. There was more variability in telemedicine use for the urban counties according to social vulnerability and broadband access.</p><p><strong>Discussion: </strong>Rurality and broadband access demonstrated a greater effect on telemedicine use than social vulnerability, and the relationship between social vulnerability, broadband access, and telemedicine use differed for rural versus urban areas.</p><p><strong>Conclusion: </strong>This observational study of nearly 8 million U.S. telemedicine sessions showed that rurality and broadband access are key drivers of telemedicine use and may be more important than many social vulnerabilities in determining community-level telemedicine use. We also found nuanced differences in the relationship between social vulnerability and telemedicine use between rural and urban counties, and at different levels of broadband access.</p>","PeriodicalId":36278,"journal":{"name":"JAMIA Open","volume":"8 4","pages":"ooaf056"},"PeriodicalIF":3.4,"publicationDate":"2025-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12296349/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144733690","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
JAMIA Open
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1