首页 > 最新文献

Journal of the American Medical Informatics Association最新文献

英文 中文
Multi-site analysis of COVID-19 and new-onset diabetes reveals need for improved sensitivity of EHR-based COVID-19 phenotypes-a DiCAYA network analysis. 对COVID-19和新发糖尿病的多位点分析表明,需要提高基于ehr的COVID-19表型的敏感性——DiCAYA网络分析。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-24 DOI: 10.1093/jamia/ocaf229
Sarah Conderino, H Lester Kirchner, Lorna E Thorpe, Jasmin Divers, Annemarie G Hirsch, Cara M Nordberg, Brian S Schwartz, Lu Zhang, Bo Cai, Caroline Rudisill, Jihad S Obeid, Angela Liese, Katie S Allen, Brian E Dixon, Tessa Crume, Dana Dabelea, Shawna Burgett, Anna Bellatorre, Hui Shao, Jiang Bian, Yi Guo, Sarah Bost, Tianchen Lyu, Kristi Reynolds, Matthew T Mefford, Hui Zhou, Matt Zhou, Eva Lustigova, Levon H Utidjian, Mitchell Maltenfort, Manmohan Kamboj, Eneida A Mendonca, Patrick Hanley, Ibrahim Zaganjor, Meda E Pavkov, Marc Rosenman, Andrea R Titus

Objective: We discuss implications of potential ascertainment biases for studies examining diabetes risk following SARS-CoV-2 infection using electronic health records (EHRs). We quantitatively explore sensitivity of results to misclassification of COVID-19 status using data from the U.S.-based Diabetes in Children, Adolescents and Young Adults (DiCAYA) Network on children (≤17 years) and young adults (18-44 years).

Materials and methods: In our retrospective case study from the DiCAYA Network, SARS-CoV-2 was identified using labs and diagnoses from 6/1/2020-12/31/2021. Patients were followed through 12/31/2022 for new diabetes diagnoses. Sites examined incident diabetes by COVID-19 status using Cox proportional hazards models. Results were pooled in meta-analyses. A bias analysis examined potential impact of COVID-19 misclassification scenarios on results, guided by hypotheses that sensitivity would be < 50% and would be higher among those who developed diabetes.

Results: Prevalence of documented COVID-19 was low overall and variable across sites (children: 4.4%-7.7%, young adults: 6.2%-22.7%). Individuals with documented COVID-19 were at higher risk of incident diabetes compared to those with no documented infection, but results were heterogeneous across sites. Findings were highly sensitive to COVID-19 misclassification assumptions. Observed results could be biased away from the null under several differential misclassification scenarios.

Discussion: Although EHR-based documentation of COVID-19 was associated with incident diabetes, COVID-19 phenotypes likely had low sensitivity, with considerable variation across sites. Misclassification assumptions strongly impacted interpretation of results.

Conclusion: Given the potential for low phenotype sensitivity and misclassification, caution is warranted when interpreting analyses of COVID-19 and incident diabetes using clinical or administrative databases.

目的:我们讨论使用电子健康记录(EHRs)检查SARS-CoV-2感染后糖尿病风险的研究中潜在的确定偏差的含义。我们利用美国儿童、青少年和年轻人糖尿病(DiCAYA)网络对儿童(≤17岁)和年轻人(18-44岁)的数据,定量探讨了结果对COVID-19状态错误分类的敏感性。材料和方法:在我们来自DiCAYA网络的回顾性病例研究中,使用实验室和诊断从2020年6月1日至2021年12月31日确定了SARS-CoV-2。随访患者至2022年12月31日,以获得新的糖尿病诊断。站点使用Cox比例风险模型对COVID-19状态下的糖尿病事件进行了检查。结果汇总在荟萃分析中。一项偏倚分析考察了COVID-19错误分类情景对结果的潜在影响,假设敏感性为:结果:记录的COVID-19患病率总体较低,各部位差异较大(儿童:4.4%-7.7%,年轻人:6.2%-22.7%)。与未记录感染的个体相比,记录感染COVID-19的个体发生糖尿病的风险更高,但不同部位的结果不同。研究结果对COVID-19错误分类假设高度敏感。在几种不同的误分类情况下,观察到的结果可能偏离零值。讨论:尽管基于电子病历的COVID-19记录与糖尿病事件相关,但COVID-19表型可能具有低敏感性,且各部位差异很大。错误的分类假设严重影响了对结果的解释。结论:考虑到潜在的低表型敏感性和错误分类,在使用临床或管理数据库解释COVID-19和偶发糖尿病的分析时需要谨慎。
{"title":"Multi-site analysis of COVID-19 and new-onset diabetes reveals need for improved sensitivity of EHR-based COVID-19 phenotypes-a DiCAYA network analysis.","authors":"Sarah Conderino, H Lester Kirchner, Lorna E Thorpe, Jasmin Divers, Annemarie G Hirsch, Cara M Nordberg, Brian S Schwartz, Lu Zhang, Bo Cai, Caroline Rudisill, Jihad S Obeid, Angela Liese, Katie S Allen, Brian E Dixon, Tessa Crume, Dana Dabelea, Shawna Burgett, Anna Bellatorre, Hui Shao, Jiang Bian, Yi Guo, Sarah Bost, Tianchen Lyu, Kristi Reynolds, Matthew T Mefford, Hui Zhou, Matt Zhou, Eva Lustigova, Levon H Utidjian, Mitchell Maltenfort, Manmohan Kamboj, Eneida A Mendonca, Patrick Hanley, Ibrahim Zaganjor, Meda E Pavkov, Marc Rosenman, Andrea R Titus","doi":"10.1093/jamia/ocaf229","DOIUrl":"10.1093/jamia/ocaf229","url":null,"abstract":"<p><strong>Objective: </strong>We discuss implications of potential ascertainment biases for studies examining diabetes risk following SARS-CoV-2 infection using electronic health records (EHRs). We quantitatively explore sensitivity of results to misclassification of COVID-19 status using data from the U.S.-based Diabetes in Children, Adolescents and Young Adults (DiCAYA) Network on children (≤17 years) and young adults (18-44 years).</p><p><strong>Materials and methods: </strong>In our retrospective case study from the DiCAYA Network, SARS-CoV-2 was identified using labs and diagnoses from 6/1/2020-12/31/2021. Patients were followed through 12/31/2022 for new diabetes diagnoses. Sites examined incident diabetes by COVID-19 status using Cox proportional hazards models. Results were pooled in meta-analyses. A bias analysis examined potential impact of COVID-19 misclassification scenarios on results, guided by hypotheses that sensitivity would be < 50% and would be higher among those who developed diabetes.</p><p><strong>Results: </strong>Prevalence of documented COVID-19 was low overall and variable across sites (children: 4.4%-7.7%, young adults: 6.2%-22.7%). Individuals with documented COVID-19 were at higher risk of incident diabetes compared to those with no documented infection, but results were heterogeneous across sites. Findings were highly sensitive to COVID-19 misclassification assumptions. Observed results could be biased away from the null under several differential misclassification scenarios.</p><p><strong>Discussion: </strong>Although EHR-based documentation of COVID-19 was associated with incident diabetes, COVID-19 phenotypes likely had low sensitivity, with considerable variation across sites. Misclassification assumptions strongly impacted interpretation of results.</p><p><strong>Conclusion: </strong>Given the potential for low phenotype sensitivity and misclassification, caution is warranted when interpreting analyses of COVID-19 and incident diabetes using clinical or administrative databases.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12884381/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145829065","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Response to "toward semantic interoperability of imaging and clinical data: reflections on the DICOM-OMOP integration framework". 对“成像和临床数据的语义互操作性:对DICOM-OMOP集成框架的思考”的回应。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-24 DOI: 10.1093/jamia/ocaf216
Woo Yeon Park, Teri Sippel Schmidt, Gabriel Salvador, Kevin O'Donnell, Brad Genereaux, Kyulee Jeon, Seng Chan You, Blake E Dewey, Paul Nagy
{"title":"Response to \"toward semantic interoperability of imaging and clinical data: reflections on the DICOM-OMOP integration framework\".","authors":"Woo Yeon Park, Teri Sippel Schmidt, Gabriel Salvador, Kevin O'Donnell, Brad Genereaux, Kyulee Jeon, Seng Chan You, Blake E Dewey, Paul Nagy","doi":"10.1093/jamia/ocaf216","DOIUrl":"https://doi.org/10.1093/jamia/ocaf216","url":null,"abstract":"","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145844527","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
AutoReporter: development of an artificial intelligence tool for automated assessment of research reporting guideline adherence. AutoReporter:开发用于自动评估研究报告指南遵守情况的人工智能工具。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-23 DOI: 10.1093/jamia/ocaf223
David Chen, Patrick Li, Ealia Khoshkish, Seungmin Lee, Tony Ning, Umair Tahir, Henry C Y Wong, Michael S F Lee, Srinivas Raman

Objectives: To develop AutoReporter, a large language model (LLM) system that automates evaluation of adherence to research reporting guidelines.

Materials and methods: Eight prompt-engineering and retrieval strategies coupled with reasoning and general-purpose LLMs were benchmarked on the SPIRIT-CONSORT-TM corpus. The top-performing approach, AutoReporter, was validated on BenchReport, a novel benchmark dataset of expert-rated reporting guideline assessments from 10 systematic reviews.

Results: AutoReporter, a zero-shot, no-retrieval prompt coupled with the o3-mini reasoning LLM, demonstrated strong accuracy (CONSORT 90.09%; SPIRIT: 92.07%), substantial agreement with humans (CONSORT Cohen's κ = 0.70, SPIRIT Cohen's κ = 0.77), runtime (CONSORT: 617.26 s; SPIRIT: 544.51 s), and cost (CONSORT: 0.68 USD; SPIRIT: 0.65 USD). AutoReporter achieved a mean accuracy of 91.8% and substantial agreement (Cohen's κ > 0.6) with expert ratings from the BenchReport benchmark.

Discussion: Structured prompting alone can match or exceed fine-tuned domain models while forgoing manually annotated corpora and computationally intensive training.

Conclusion: Large language models can feasibly automate reporting guideline adherence assessments for scalable quality control in scientific research reporting. AutoReporter is publicly accessible at https://autoreporter.streamlit.app.

目的:开发AutoReporter,一个大型语言模型(LLM)系统,自动评估遵守研究报告指南。材料和方法:在spirit - consortium - tm语料库上对8种与推理和通用llm相结合的提示工程和检索策略进行了基准测试。表现最好的方法AutoReporter在BenchReport上得到了验证,BenchReport是一个新的基准数据集,由10个系统评论的专家评级报告指南评估组成。结果:AutoReporter,零采样,无检索提示与o3-mini推理LLM相结合,显示出很强的准确性(CONSORT 90.09%; SPIRIT: 92.07%),与人类基本一致(CONSORT Cohen's κ = 0.70, SPIRIT Cohen's κ = 0.77),运行时间(CONSORT: 617.26 s; SPIRIT: 544.51 s),成本(CONSORT: 0.68美元;SPIRIT: 0.65美元)。AutoReporter的平均准确率为91.8%,与BenchReport基准的专家评级基本一致(Cohen’s κ > 0.6)。讨论:单独的结构化提示可以匹配或超过微调的领域模型,同时放弃手动注释的语料库和计算密集型训练。结论:大型语言模型可实现科研报告质量控制中报告准则依从性评估的自动化。AutoReporter可在https://autoreporter.streamlit.app公开访问。
{"title":"AutoReporter: development of an artificial intelligence tool for automated assessment of research reporting guideline adherence.","authors":"David Chen, Patrick Li, Ealia Khoshkish, Seungmin Lee, Tony Ning, Umair Tahir, Henry C Y Wong, Michael S F Lee, Srinivas Raman","doi":"10.1093/jamia/ocaf223","DOIUrl":"https://doi.org/10.1093/jamia/ocaf223","url":null,"abstract":"<p><strong>Objectives: </strong>To develop AutoReporter, a large language model (LLM) system that automates evaluation of adherence to research reporting guidelines.</p><p><strong>Materials and methods: </strong>Eight prompt-engineering and retrieval strategies coupled with reasoning and general-purpose LLMs were benchmarked on the SPIRIT-CONSORT-TM corpus. The top-performing approach, AutoReporter, was validated on BenchReport, a novel benchmark dataset of expert-rated reporting guideline assessments from 10 systematic reviews.</p><p><strong>Results: </strong>AutoReporter, a zero-shot, no-retrieval prompt coupled with the o3-mini reasoning LLM, demonstrated strong accuracy (CONSORT 90.09%; SPIRIT: 92.07%), substantial agreement with humans (CONSORT Cohen's κ = 0.70, SPIRIT Cohen's κ = 0.77), runtime (CONSORT: 617.26 s; SPIRIT: 544.51 s), and cost (CONSORT: 0.68 USD; SPIRIT: 0.65 USD). AutoReporter achieved a mean accuracy of 91.8% and substantial agreement (Cohen's κ > 0.6) with expert ratings from the BenchReport benchmark.</p><p><strong>Discussion: </strong>Structured prompting alone can match or exceed fine-tuned domain models while forgoing manually annotated corpora and computationally intensive training.</p><p><strong>Conclusion: </strong>Large language models can feasibly automate reporting guideline adherence assessments for scalable quality control in scientific research reporting. AutoReporter is publicly accessible at https://autoreporter.streamlit.app.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145821799","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Identifying and supporting trafficked individuals: provider and community organization perspectives on existing sociotechnical approaches. 识别和支持被贩运的个人:提供者和社区组织对现有社会技术方法的看法。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-22 DOI: 10.1093/jamia/ocaf220
Michelle Gomez, Ellen W Clayton, Colin G Walsh, Kim M Unertl

Objectives: Trafficked persons experience adverse health consequences and seek help, but many go unrecognized by health-care professionals. This study explored professionals' perspectives on current approaches toward identifying and supporting trafficked persons in health-care settings, highlighting current technology roles, gaps, and future directions.

Materials and methods: We developed an interview guide to investigate current human trafficking (HT) approaches, safety procedures, and HT education. Semistructured interviews were conducted via Zoom, iteratively coded in Dedoose, and analyzed using a thematic analysis approach.

Results: We interviewed 19 health-care and community group professionals and identified 3 themes: (1) participants described a responsibility to build trust with patients through compassionate communication, rapport, and trauma-informed approaches across different stages of care. (2) Technology played a dual role, as professionals navigated both benefits and challenges of tools such as Zoom, virtual interpreters, and cameras in trust building. (3) Safety and privacy concerns guided how participants documented patient encounters and shared community resources, ensuring confidentiality while supporting patient and community well-being.

Discussion: Technology can both support and hinder trust in health care, directly affecting trafficked patients and their safety. Informatics can improve care for trafficked persons, but further research is needed on technology-based interventions. We provide recommendations to strengthen trust, enhance safety, support trauma-informed care, and promote safe documentation practices.

Conclusion: Effective sociotechnical approaches rely on trust, safety, and mindful documentation to support trafficked patients. Future research directions include refining the role of informatics in trauma-informed care to strengthen trust and mitigate unintended consequences.

目标:被贩运者经历了不利的健康后果并寻求帮助,但许多人没有得到保健专业人员的认识。本研究探讨了专业人员对目前在卫生保健机构中识别和支持被贩运者的方法的看法,强调了目前的技术作用、差距和未来方向。材料和方法:我们制定了一份访谈指南来调查当前的人口贩运(HT)方法、安全程序和HT教育。通过Zoom进行半结构化访谈,在Dedoose中迭代编码,并使用主题分析方法进行分析。结果:我们采访了19名医疗保健和社区团体专业人员,并确定了3个主题:(1)参与者描述了在不同护理阶段通过富有同情心的沟通、融洽关系和创伤知情方法与患者建立信任的责任。(2)技术发挥了双重作用,因为专业人员在建立信任方面既能驾驭Zoom、虚拟口译员和摄像机等工具的优势,也能应对它们带来的挑战。(3)安全和隐私问题指导参与者如何记录患者遭遇和共享社区资源,在支持患者和社区福祉的同时确保保密性。讨论:技术既可以支持也可以阻碍对卫生保健的信任,直接影响到被贩运的病人及其安全。信息学可以改善对被贩运者的护理,但需要进一步研究基于技术的干预措施。我们提供建议,以加强信任,提高安全性,支持创伤知情护理,并促进安全的文件实践。结论:有效的社会技术手段依赖于信任、安全和谨慎的文件来支持被拐卖的患者。未来的研究方向包括完善信息学在创伤知情护理中的作用,以加强信任和减轻意外后果。
{"title":"Identifying and supporting trafficked individuals: provider and community organization perspectives on existing sociotechnical approaches.","authors":"Michelle Gomez, Ellen W Clayton, Colin G Walsh, Kim M Unertl","doi":"10.1093/jamia/ocaf220","DOIUrl":"https://doi.org/10.1093/jamia/ocaf220","url":null,"abstract":"<p><strong>Objectives: </strong>Trafficked persons experience adverse health consequences and seek help, but many go unrecognized by health-care professionals. This study explored professionals' perspectives on current approaches toward identifying and supporting trafficked persons in health-care settings, highlighting current technology roles, gaps, and future directions.</p><p><strong>Materials and methods: </strong>We developed an interview guide to investigate current human trafficking (HT) approaches, safety procedures, and HT education. Semistructured interviews were conducted via Zoom, iteratively coded in Dedoose, and analyzed using a thematic analysis approach.</p><p><strong>Results: </strong>We interviewed 19 health-care and community group professionals and identified 3 themes: (1) participants described a responsibility to build trust with patients through compassionate communication, rapport, and trauma-informed approaches across different stages of care. (2) Technology played a dual role, as professionals navigated both benefits and challenges of tools such as Zoom, virtual interpreters, and cameras in trust building. (3) Safety and privacy concerns guided how participants documented patient encounters and shared community resources, ensuring confidentiality while supporting patient and community well-being.</p><p><strong>Discussion: </strong>Technology can both support and hinder trust in health care, directly affecting trafficked patients and their safety. Informatics can improve care for trafficked persons, but further research is needed on technology-based interventions. We provide recommendations to strengthen trust, enhance safety, support trauma-informed care, and promote safe documentation practices.</p><p><strong>Conclusion: </strong>Effective sociotechnical approaches rely on trust, safety, and mindful documentation to support trafficked patients. Future research directions include refining the role of informatics in trauma-informed care to strengthen trust and mitigate unintended consequences.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145806132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clinical decision support for population health management: development and validation of integrated acuity and intervention prediction models. 人口健康管理的临床决策支持:综合敏锐度和干预预测模型的开发和验证。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-19 DOI: 10.1093/jamia/ocaf225
Sanjay Basu, Sadiq Y Patel, Parth Sheth, Bhairavi Muralidharan, Namrata Elamaran, Aakriti Kinra, Rajaie Batniji

Objective: Population health management programs coordinate care for over 80 million Medicaid beneficiaries but lack systematic clinical decision support for determining when to intervene and which interventions to select for patients with complex conditions. Our objective was to develop and validate a clinical decision support system integrating acuity prediction and intervention selection models for population health management programs.

Materials and methods: We conducted a retrospective cohort study of 155 631 Medicaid patients enrolled in population health programs across Washington, Virginia, and Ohio (January 2023-July 2025). We developed integrated informatics workflows combining time-to-event prediction models for acute care events with heterogeneous treatment effect estimators for intervention selection. Models used structured electronic health record data, claims, and care management records. Performance was evaluated through clinical validation with 3 blinded physicians reviewing 200 cases.

Results: The integrated decision support system achieved 81.3% sensitivity (95% CI, 79.8%-82.8%) and 82.1% specificity (95% CI, 80.6%-83.6%) for 30-day acute care prediction. The intervention selection component demonstrated 1.59 percentage points absolute risk reduction compared with standard care (95% CI, 0.21-3.04), translating to preventing one acute event for every 63 patients receiving model-guided rather than standard care. Clinical validation revealed systematic differences: physicians relied on recent utilization patterns (explaining 75.8% of decision variance) while models integrated broader clinical signals, identifying intervention opportunities earlier in disease trajectories. Both approaches recommended similar intervention types, suggesting complementary rather than replacement roles.

Discussion: An integrated clinical decision support system can enhance population health management by providing actionable guidance on intervention timing and selection.

Conclusion: An integrated decision support system's ability to identify opportunities before high utilization manifests offers potential for shifting from reactive to preventive care delivery for vulnerable populations.

目的:人口健康管理项目协调了超过8000万医疗补助受益人的护理,但缺乏系统的临床决策支持,以确定何时干预以及为复杂病情的患者选择哪种干预措施。我们的目标是开发和验证一个临床决策支持系统,该系统集成了人口健康管理项目的敏锐度预测和干预选择模型。材料和方法:我们在2023年1月至2025年7月期间,对华盛顿州、弗吉尼亚州和俄亥俄州人口健康计划中登记的155631名医疗补助患者进行了回顾性队列研究。我们开发了集成信息学工作流程,将急性护理事件的时间到事件预测模型与用于干预选择的异构治疗效果估计器结合起来。模型使用结构化电子健康记录数据、索赔和护理管理记录。通过3名盲法医生对200例病例的临床验证来评估其表现。结果:综合决策支持系统对30天急性护理预测的敏感性为81.3% (95% CI, 79.8% ~ 82.8%),特异性为82.1% (95% CI, 80.6% ~ 83.6%)。干预选择部分显示,与标准治疗相比,绝对风险降低1.59个百分点(95% CI, 0.21-3.04),转化为每63名接受模型指导而不是标准治疗的患者预防1例急性事件。临床验证揭示了系统差异:医生依赖于最近的使用模式(解释了75.8%的决策差异),而模型整合了更广泛的临床信号,在疾病轨迹的早期识别干预机会。两种方法都推荐类似的干预类型,建议互补而不是替代角色。讨论:综合临床决策支持系统可以通过对干预时机和选择提供可操作的指导来加强人群健康管理。结论:综合决策支持系统能够在高使用率出现之前识别机会,为弱势群体提供从被动到预防性保健服务的转变提供了潜力。
{"title":"Clinical decision support for population health management: development and validation of integrated acuity and intervention prediction models.","authors":"Sanjay Basu, Sadiq Y Patel, Parth Sheth, Bhairavi Muralidharan, Namrata Elamaran, Aakriti Kinra, Rajaie Batniji","doi":"10.1093/jamia/ocaf225","DOIUrl":"https://doi.org/10.1093/jamia/ocaf225","url":null,"abstract":"<p><strong>Objective: </strong>Population health management programs coordinate care for over 80 million Medicaid beneficiaries but lack systematic clinical decision support for determining when to intervene and which interventions to select for patients with complex conditions. Our objective was to develop and validate a clinical decision support system integrating acuity prediction and intervention selection models for population health management programs.</p><p><strong>Materials and methods: </strong>We conducted a retrospective cohort study of 155 631 Medicaid patients enrolled in population health programs across Washington, Virginia, and Ohio (January 2023-July 2025). We developed integrated informatics workflows combining time-to-event prediction models for acute care events with heterogeneous treatment effect estimators for intervention selection. Models used structured electronic health record data, claims, and care management records. Performance was evaluated through clinical validation with 3 blinded physicians reviewing 200 cases.</p><p><strong>Results: </strong>The integrated decision support system achieved 81.3% sensitivity (95% CI, 79.8%-82.8%) and 82.1% specificity (95% CI, 80.6%-83.6%) for 30-day acute care prediction. The intervention selection component demonstrated 1.59 percentage points absolute risk reduction compared with standard care (95% CI, 0.21-3.04), translating to preventing one acute event for every 63 patients receiving model-guided rather than standard care. Clinical validation revealed systematic differences: physicians relied on recent utilization patterns (explaining 75.8% of decision variance) while models integrated broader clinical signals, identifying intervention opportunities earlier in disease trajectories. Both approaches recommended similar intervention types, suggesting complementary rather than replacement roles.</p><p><strong>Discussion: </strong>An integrated clinical decision support system can enhance population health management by providing actionable guidance on intervention timing and selection.</p><p><strong>Conclusion: </strong>An integrated decision support system's ability to identify opportunities before high utilization manifests offers potential for shifting from reactive to preventive care delivery for vulnerable populations.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145851489","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Knowledge graph-augmented large language models for reconstructing life course risk pathways: a gestational diabetes mellitus-to-dementia case study. 重建生命历程风险路径的知识图谱增强大语言模型:一个妊娠期糖尿病到痴呆的案例研究。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-16 DOI: 10.1093/jamia/ocaf219
Shuang Wang, Yang Zhang, Ying Gao, Xin He, Guanghui Deng, Jian Du

Objectives: To develop and evaluate a knowledge graph-augmented large language model (LLM) framework that synthesizes epidemiological evidence to infer life-course exposure-outcome pathways, using gestational diabetes mellitus (GDM) and dementia as a case study.

Materials and methods: We constructed a causal knowledge graph by extracting empirical epidemiological associations from scientific literature, excluding hypothetical assertions. The graph was integrated with GPT-4 through four graph retrieval-augmented generation (GRAG) strategies to infer bridging variables between early-life exposure (GDM) and later-life outcome (dementia). Semantic triples served as structured inputs to support LLM reasoning. Each GRAG strategy was evaluated by human clinical experts and three LLM-based reviewers (GPT-4o, Llama 3-70B, and Gemini Advanced), assessing scientific reliability, novelty, and clinical relevance.

Results: The GRAG strategy using a minimal set of abstracts specifically related to GDM-dementia bridging variables performed comparably to the strategy using broader sub-community abstracts, and both significantly outperformed approaches using the full GDM- or dementia-related corpus or baseline GPT-4 without external augmentation. The knowledge graph-augmented LLM identified 108 maternal candidate mediators, including validated risk factors such as chronic kidney disease and physical inactivity. The structured approach improved accuracy and reduced confabulation compared to standard LLM outputs.

Discussion: Our findings suggest that augmenting LLMs with epidemiological knowledge graphs enables effective reasoning over fragmented literature and supports the reconstruction of progressive risk pathways. Expert assessments revealed that LLMs may overestimate clinical relevance, highlighting the need for human-AI collaboration in interpretation and application.

Conclusion: Integrating semantic epidemiological knowledge with LLMs via GRAG strategies provides a promising framework for life-course epidemiology, enabling early detection of modifiable risk factors and guiding variable selection in cohort study design.

目的:以妊娠期糖尿病(GDM)和痴呆为例,开发和评估一个知识图增强大语言模型(LLM)框架,该框架综合流行病学证据来推断生命过程暴露-结局途径。材料和方法:我们通过从科学文献中提取经验流行病学关联来构建因果知识图,排除假设断言。该图表通过四种图表检索增强生成(GRAG)策略与GPT-4整合,以推断早期生活暴露(GDM)和后期生活结果(痴呆)之间的桥接变量。语义三元组作为结构化输入来支持LLM推理。每个GRAG策略由人类临床专家和三位基于法学硕士的评审员(gpt - 40、Llama 3-70B和Gemini Advanced)评估,评估科学可靠性、新颖性和临床相关性。结果:使用与GDM-痴呆桥接变量相关的最小摘要集的GRAG策略与使用更广泛的子社区摘要的策略表现相当,并且两者都明显优于使用完整GDM或痴呆相关语料库或基线GPT-4的方法,没有外部增强。知识图谱增强的LLM确定了108个母体候选介质,包括经过验证的风险因素,如慢性肾脏疾病和缺乏身体活动。与标准LLM输出相比,结构化方法提高了精度并减少了虚构。讨论:我们的研究结果表明,用流行病学知识图谱增加llm可以对碎片化的文献进行有效的推理,并支持渐进风险路径的重建。专家评估显示,法学硕士可能高估了临床相关性,强调了在解释和应用方面需要人类与人工智能合作。结论:通过GRAG策略将语义流行病学知识与llm整合为生命过程流行病学提供了一个有希望的框架,可以早期发现可改变的危险因素,并指导队列研究设计中的变量选择。
{"title":"Knowledge graph-augmented large language models for reconstructing life course risk pathways: a gestational diabetes mellitus-to-dementia case study.","authors":"Shuang Wang, Yang Zhang, Ying Gao, Xin He, Guanghui Deng, Jian Du","doi":"10.1093/jamia/ocaf219","DOIUrl":"https://doi.org/10.1093/jamia/ocaf219","url":null,"abstract":"<p><strong>Objectives: </strong>To develop and evaluate a knowledge graph-augmented large language model (LLM) framework that synthesizes epidemiological evidence to infer life-course exposure-outcome pathways, using gestational diabetes mellitus (GDM) and dementia as a case study.</p><p><strong>Materials and methods: </strong>We constructed a causal knowledge graph by extracting empirical epidemiological associations from scientific literature, excluding hypothetical assertions. The graph was integrated with GPT-4 through four graph retrieval-augmented generation (GRAG) strategies to infer bridging variables between early-life exposure (GDM) and later-life outcome (dementia). Semantic triples served as structured inputs to support LLM reasoning. Each GRAG strategy was evaluated by human clinical experts and three LLM-based reviewers (GPT-4o, Llama 3-70B, and Gemini Advanced), assessing scientific reliability, novelty, and clinical relevance.</p><p><strong>Results: </strong>The GRAG strategy using a minimal set of abstracts specifically related to GDM-dementia bridging variables performed comparably to the strategy using broader sub-community abstracts, and both significantly outperformed approaches using the full GDM- or dementia-related corpus or baseline GPT-4 without external augmentation. The knowledge graph-augmented LLM identified 108 maternal candidate mediators, including validated risk factors such as chronic kidney disease and physical inactivity. The structured approach improved accuracy and reduced confabulation compared to standard LLM outputs.</p><p><strong>Discussion: </strong>Our findings suggest that augmenting LLMs with epidemiological knowledge graphs enables effective reasoning over fragmented literature and supports the reconstruction of progressive risk pathways. Expert assessments revealed that LLMs may overestimate clinical relevance, highlighting the need for human-AI collaboration in interpretation and application.</p><p><strong>Conclusion: </strong>Integrating semantic epidemiological knowledge with LLMs via GRAG strategies provides a promising framework for life-course epidemiology, enabling early detection of modifiable risk factors and guiding variable selection in cohort study design.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145776344","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Patient perspectives on gender identity and anatomy data collection in electronic health records: a qualitative study. 患者对电子健康记录中性别认同和解剖数据收集的看法:一项定性研究。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-11 DOI: 10.1093/jamia/ocaf205
Samuel Dubin, Gabrielle Mayer, Nishant Pradhan, Madeline Xin, Richard Greene

Objectives: Documentation of gender identity (GI) and anatomy data in the electronic health record (EHR) is a proposed standard of care for transgender populations. However, there is limited research on implementation of proposed best practices, particularly anatomy data collection. This study aims to characterize factors that influence patient preferences and comfort around the collection and documentation of GI and anatomy in EHRs.

Materials and methods: From November 2023 to January 2024, 17 one-on-one, semi-structured virtual interviews were conducted with transgender adults residing in the Metropolitan New York area. Transcriptions were analyzed using inductive thematic analysis.

Results: Themes clustered around comfort and preferences for data collection processes and outcomes. Factors that influenced preferences and comfort around anatomy data were distinct from those impacting GI documentation preferences and comfort. The tension between the categories of GI and sex assigned at birth impacted anatomy data documentation preferences. Clinical context emerged as a consistent factor that impacts both preferences and comfort of GI and anatomy data documentation.

Discussion and conclusion: GI data collection efforts in clinical settings must consider the implication of anatomy data collection when determining data collection best practice methodologies. Anticipated and experienced stigma remain significant hurdles to patient comfort and willingness to collect GI and anatomy data, and their impact on actual data collection should be further elucidated among diverse gender identities. Clinical data collection methods, tools, and education warrant ongoing research investment to further elucidate best practices.

目的:在电子健康记录(EHR)中记录性别认同(GI)和解剖数据是跨性别人群的拟议护理标准。然而,关于实施建议的最佳实践的研究有限,特别是解剖数据收集。本研究旨在描述影响患者在电子病历中收集和记录GI和解剖的偏好和舒适度的因素。材料与方法:从2023年11月至2024年1月,对居住在纽约大都会地区的跨性别成年人进行了17次一对一、半结构化的虚拟访谈。采用归纳主题性分析对转录进行分析。结果:主题集中在舒适度和偏好的数据收集过程和结果。影响解剖学数据偏好和舒适度的因素与影响GI文档偏好和舒适度的因素不同。GI分类和出生时性别分配之间的紧张关系影响了解剖学数据记录的偏好。临床环境成为影响GI和解剖数据文档偏好和舒适度的一致因素。讨论和结论:在确定数据收集最佳实践方法时,临床环境中的GI数据收集工作必须考虑解剖学数据收集的含义。预期和经历的耻辱感仍然是患者舒适度和收集GI和解剖数据意愿的重大障碍,它们对实际数据收集的影响应在不同性别认同中进一步阐明。临床数据收集方法、工具和教育需要持续的研究投资,以进一步阐明最佳实践。
{"title":"Patient perspectives on gender identity and anatomy data collection in electronic health records: a qualitative study.","authors":"Samuel Dubin, Gabrielle Mayer, Nishant Pradhan, Madeline Xin, Richard Greene","doi":"10.1093/jamia/ocaf205","DOIUrl":"https://doi.org/10.1093/jamia/ocaf205","url":null,"abstract":"<p><strong>Objectives: </strong>Documentation of gender identity (GI) and anatomy data in the electronic health record (EHR) is a proposed standard of care for transgender populations. However, there is limited research on implementation of proposed best practices, particularly anatomy data collection. This study aims to characterize factors that influence patient preferences and comfort around the collection and documentation of GI and anatomy in EHRs.</p><p><strong>Materials and methods: </strong>From November 2023 to January 2024, 17 one-on-one, semi-structured virtual interviews were conducted with transgender adults residing in the Metropolitan New York area. Transcriptions were analyzed using inductive thematic analysis.</p><p><strong>Results: </strong>Themes clustered around comfort and preferences for data collection processes and outcomes. Factors that influenced preferences and comfort around anatomy data were distinct from those impacting GI documentation preferences and comfort. The tension between the categories of GI and sex assigned at birth impacted anatomy data documentation preferences. Clinical context emerged as a consistent factor that impacts both preferences and comfort of GI and anatomy data documentation.</p><p><strong>Discussion and conclusion: </strong>GI data collection efforts in clinical settings must consider the implication of anatomy data collection when determining data collection best practice methodologies. Anticipated and experienced stigma remain significant hurdles to patient comfort and willingness to collect GI and anatomy data, and their impact on actual data collection should be further elucidated among diverse gender identities. Clinical data collection methods, tools, and education warrant ongoing research investment to further elucidate best practices.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145726643","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Toward semantic interoperability of imaging and clinical data: reflections on the DICOM-OMOP integration framework. 影像和临床数据的语义互操作性:对DICOM-OMOP集成框架的思考。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-05 DOI: 10.1093/jamia/ocaf215
Weihao Cheng, Zekai Yu
{"title":"Toward semantic interoperability of imaging and clinical data: reflections on the DICOM-OMOP integration framework.","authors":"Weihao Cheng, Zekai Yu","doi":"10.1093/jamia/ocaf215","DOIUrl":"https://doi.org/10.1093/jamia/ocaf215","url":null,"abstract":"","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145844499","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Scalable confounding adjustment in real-world evidence: benchmarking data-adaptive and investigator-specified strategies in a large-scale trial emulation study. 真实世界证据中的可扩展混杂调整:在大规模试验模拟研究中对数据自适应和研究者指定策略进行基准测试。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-03 DOI: 10.1093/jamia/ocaf204
Andrew R Weckstein, Shirley V Wang, Richard Wyss, Sebastian Schneeweiss

Objectives: Real-world evidence (RWE) increasingly informs clinical decisions, yet manual adjustment for confounding limits scalability. Data-adaptive (DA) algorithms for high-dimensional proxy adjustment show promise but have not been systematically compared to investigator-specified (IS) approaches across diverse treatment scenarios. We evaluated whether DA strategies perform comparably to manually curated IS models using claims-based emulations of 15 randomized trials from the RCT-DUPLICATE initiative.

Materials and methods: We identified new-user cohorts for 15 trial emulations in Optum's de-identified Clinformatics Data Mart Database (2004-2023). Treatment effects were estimated using 3 adjustment strategies: (1) IS models with manually tailored covariates; (2) full-DA strategies using empirical features from semiautomated pipelines; and (3) hybrid-DA models incorporating both empirical and investigator-defined covariates. Agreement with RCT benchmarks was assessed via binary metrics and difference-in-differences.

Results: Outcome-adaptive LASSO achieved better RWE-RCT agreement than IS adjustment in 73% of full-DA and 87% of hybrid-DA emulations. Other DA methods considering feature associations with both treatment and outcome performed similarly well, while models tuned solely for treatment prediction performed poorly. Performance of IS vs DA strategies differed across emulated trials.

Discussion: Top DA algorithms matched manual IS models on average, but impact varied by emulation. Case studies illustrate the continued importance of subject-matter knowledge, particularly for complex treatment strategies.

Conclusion: Data-adaptive algorithms show promise for scalable confounding adjustment in large-scale evidence systems and as augmentation tools for investigator-specified designs. Hybrid strategies combining algorithmic methods with investigator expertise offer the most reliable approach for individual causal questions.

目的:真实世界证据(RWE)越来越多地为临床决策提供信息,但人为调整混杂因素限制了可扩展性。用于高维代理调整的数据自适应(DA)算法显示出前景,但尚未在不同治疗方案中与研究者指定(IS)方法进行系统比较。我们使用来自RCT-DUPLICATE计划的15个随机试验的基于索赔的模拟来评估DA策略是否与人工策划的IS模型表现相当。材料和方法:我们在Optum的去识别临床数据集市数据库(2004-2023)中确定了15个试验模拟的新用户队列。采用3种调整策略估计治疗效果:(1)人工定制协变量的IS模型;(2)基于半自动化管道经验特征的全数据分析策略;(3)结合实证变量和研究者定义协变量的混合数据分析模型。通过二元指标和差中差评估与RCT基准的一致性。结果:结果自适应LASSO在73%的全da和87%的混合da模拟中获得了比IS调整更好的RWE-RCT一致性。其他考虑到与治疗和结果的特征关联的数据分析方法同样表现良好,而仅针对治疗预测进行调整的模型表现不佳。在模拟试验中,IS和DA策略的性能存在差异。讨论:顶级数据处理算法平均匹配手动IS模型,但影响因仿真而异。案例研究说明了主题知识的持续重要性,特别是对于复杂的治疗策略。结论:数据自适应算法有望在大规模证据系统中进行可扩展的混杂调整,并作为调查员指定设计的增强工具。混合策略结合算法方法和调查员的专业知识,为个别因果问题提供了最可靠的方法。
{"title":"Scalable confounding adjustment in real-world evidence: benchmarking data-adaptive and investigator-specified strategies in a large-scale trial emulation study.","authors":"Andrew R Weckstein, Shirley V Wang, Richard Wyss, Sebastian Schneeweiss","doi":"10.1093/jamia/ocaf204","DOIUrl":"https://doi.org/10.1093/jamia/ocaf204","url":null,"abstract":"<p><strong>Objectives: </strong>Real-world evidence (RWE) increasingly informs clinical decisions, yet manual adjustment for confounding limits scalability. Data-adaptive (DA) algorithms for high-dimensional proxy adjustment show promise but have not been systematically compared to investigator-specified (IS) approaches across diverse treatment scenarios. We evaluated whether DA strategies perform comparably to manually curated IS models using claims-based emulations of 15 randomized trials from the RCT-DUPLICATE initiative.</p><p><strong>Materials and methods: </strong>We identified new-user cohorts for 15 trial emulations in Optum's de-identified Clinformatics Data Mart Database (2004-2023). Treatment effects were estimated using 3 adjustment strategies: (1) IS models with manually tailored covariates; (2) full-DA strategies using empirical features from semiautomated pipelines; and (3) hybrid-DA models incorporating both empirical and investigator-defined covariates. Agreement with RCT benchmarks was assessed via binary metrics and difference-in-differences.</p><p><strong>Results: </strong>Outcome-adaptive LASSO achieved better RWE-RCT agreement than IS adjustment in 73% of full-DA and 87% of hybrid-DA emulations. Other DA methods considering feature associations with both treatment and outcome performed similarly well, while models tuned solely for treatment prediction performed poorly. Performance of IS vs DA strategies differed across emulated trials.</p><p><strong>Discussion: </strong>Top DA algorithms matched manual IS models on average, but impact varied by emulation. Case studies illustrate the continued importance of subject-matter knowledge, particularly for complex treatment strategies.</p><p><strong>Conclusion: </strong>Data-adaptive algorithms show promise for scalable confounding adjustment in large-scale evidence systems and as augmentation tools for investigator-specified designs. Hybrid strategies combining algorithmic methods with investigator expertise offer the most reliable approach for individual causal questions.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145670308","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Development and application of desiderata for automated clinical ordering. 临床自动点单所需数据的开发与应用。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf152
Sameh N Saleh, Kevin B Johnson

Introduction: Automation of clinical orders in electronic health records (EHRs) has the potential to reduce clinician burden and enhance patient safety. However, determining which orders are appropriate for automation requires a structured framework to ensure clinical validity, transparency, and safety.

Objective: To develop and validate a framework of desiderata for assessing the appropriateness of automating clinical orders in EHRs and to demonstrate its operational value in a live health system dataset.

Materials and methods: The study comprised 4 phases to move from concept generation to real-world demonstration. First, we conducted focus group analyses using ground theory to identify themes and developed desiderata informed by these themes and existing literature. We validated the desiderata by surveying clinicians at a single institution, presenting 10 use cases to and assessing perceived appropriateness, cognitive support, and patient safety using a 4-point Likert scale. Survey results were compared to a priori appropriateness designations using t-tests. To evaluate operational impact, we analyzed one year of order-based alerts and orders (1.4 million firings alert and 44.1 million orders, respectively) using filtering rules and association rule mining to identify candidate orders for automation and their impact.

Results: We identified 8 desiderata for automated order appropriateness: logical consistency, data provenance, order transparency, context permanence, monitoring plans, trigger consistency, care team empowerment, and system accountability. Use cases deemed appropriate based on these criteria received significantly higher scores for appropriateness (3.13 ± 0.84 vs 2.30 ± 0.99), cognitive support (3.08 ± 0.82 vs 2.25 ± 0.94), and patient safety (3.08 ± 0.86 vs 2.21 ± 0.98) (all P < .001) compared to those considered inappropriate. Operational analysis revealed an alert firing 19 109 times annually, with a 96% signed order rate, where automation could save an estimated 26.5 provider hours per year. Additionally, an association rule with 16 628 occurrences (68.4% confidence) suggested automation could save 15.8 hours annually and yield 8000 additional appropriate orders.

Discussion: The desiderata align with clinician perceptions and provide a structured approach for evaluating automated orders. Our findings highlight the potential for automation of certain clinical orders to improve cognitive support while maintaining patient safety.

Conclusion: Healthcare systems should use these desiderata, coupled with data mining techniques, to systematically identify and govern appropriate automated orders. Further research is needed to validate operational scalability.

简介:电子健康记录(EHRs)中临床医嘱的自动化有可能减轻临床医生的负担,提高患者的安全。然而,确定哪些订单适合自动化需要一个结构化的框架,以确保临床有效性、透明度和安全性。目的:开发和验证一个理想的框架,用于评估在电子病历中自动化临床医嘱的适当性,并展示其在实时卫生系统数据集中的操作价值。材料和方法:研究分为四个阶段,从概念产生到现实世界的演示。首先,我们使用基础理论进行焦点小组分析,以确定主题,并根据这些主题和现有文献开发出所需的数据。我们通过调查单个机构的临床医生来验证期望,提出10个用例,并使用4点李克特量表评估感知适当性、认知支持和患者安全性。使用t检验将调查结果与先验适当性指定进行比较。为了评估运营影响,我们使用过滤规则和关联规则挖掘分析了一年的基于订单的警报和订单(分别为140万解雇警报和4410万订单),以确定自动化的候选订单及其影响。结果:我们确定了自动化订单适当性的8个需求:逻辑一致性、数据来源、订单透明度、上下文持久性、监控计划、触发一致性、护理团队授权和系统责任。基于这些标准认为合适的用例在适当性(3.13±0.84 vs 2.30±0.99)、认知支持(3.08±0.82 vs 2.25±0.94)和患者安全性(3.08±0.86 vs 2.21±0.98)方面获得了显着更高的分数(所有P)讨论:期望与临床医生的看法一致,并提供了评估自动化订单的结构化方法。我们的研究结果强调了某些临床医嘱自动化的潜力,以提高认知支持,同时维护患者安全。结论:医疗保健系统应该使用这些理想的数据,结合数据挖掘技术,系统地识别和管理适当的自动化订单。需要进一步的研究来验证操作的可扩展性。
{"title":"Development and application of desiderata for automated clinical ordering.","authors":"Sameh N Saleh, Kevin B Johnson","doi":"10.1093/jamia/ocaf152","DOIUrl":"10.1093/jamia/ocaf152","url":null,"abstract":"<p><strong>Introduction: </strong>Automation of clinical orders in electronic health records (EHRs) has the potential to reduce clinician burden and enhance patient safety. However, determining which orders are appropriate for automation requires a structured framework to ensure clinical validity, transparency, and safety.</p><p><strong>Objective: </strong>To develop and validate a framework of desiderata for assessing the appropriateness of automating clinical orders in EHRs and to demonstrate its operational value in a live health system dataset.</p><p><strong>Materials and methods: </strong>The study comprised 4 phases to move from concept generation to real-world demonstration. First, we conducted focus group analyses using ground theory to identify themes and developed desiderata informed by these themes and existing literature. We validated the desiderata by surveying clinicians at a single institution, presenting 10 use cases to and assessing perceived appropriateness, cognitive support, and patient safety using a 4-point Likert scale. Survey results were compared to a priori appropriateness designations using t-tests. To evaluate operational impact, we analyzed one year of order-based alerts and orders (1.4 million firings alert and 44.1 million orders, respectively) using filtering rules and association rule mining to identify candidate orders for automation and their impact.</p><p><strong>Results: </strong>We identified 8 desiderata for automated order appropriateness: logical consistency, data provenance, order transparency, context permanence, monitoring plans, trigger consistency, care team empowerment, and system accountability. Use cases deemed appropriate based on these criteria received significantly higher scores for appropriateness (3.13 ± 0.84 vs 2.30 ± 0.99), cognitive support (3.08 ± 0.82 vs 2.25 ± 0.94), and patient safety (3.08 ± 0.86 vs 2.21 ± 0.98) (all P < .001) compared to those considered inappropriate. Operational analysis revealed an alert firing 19 109 times annually, with a 96% signed order rate, where automation could save an estimated 26.5 provider hours per year. Additionally, an association rule with 16 628 occurrences (68.4% confidence) suggested automation could save 15.8 hours annually and yield 8000 additional appropriate orders.</p><p><strong>Discussion: </strong>The desiderata align with clinician perceptions and provide a structured approach for evaluating automated orders. Our findings highlight the potential for automation of certain clinical orders to improve cognitive support while maintaining patient safety.</p><p><strong>Conclusion: </strong>Healthcare systems should use these desiderata, coupled with data mining techniques, to systematically identify and govern appropriate automated orders. Further research is needed to validate operational scalability.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1899-1907"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646375/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145126405","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the American Medical Informatics Association
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1