首页 > 最新文献

Journal of the American Medical Informatics Association最新文献

英文 中文
Should we synthesize more than we need: impact of synthetic data generation for high-dimensional cross-sectional medical data. 我们是否应该合成比我们需要的更多:合成数据生成对高维横断面医疗数据的影响。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf169
Lisa Pilgram, Samer El Kababji, Dan Liu, Khaled El Emam

Objective: In medical research and education, generative artificial intelligence/machine learning (AI/ML) models to synthesize artificial medical data can enable the sharing of high-quality data while preserving the privacy of patients. Given that such data is often high-dimensional, a relevant consideration is whether to synthesize the entire dataset when only a task-relevant subset is needed. This study evaluates how the number of variables in training impacts fidelity, utility, and privacy of the synthetic data (SD).

Material and methods: We used 12 cross-sectional medical datasets, defined a downstream task with corresponding core variables, and derived 6354 variants by adding adjunct variables to the core. SD was generated using 7 different generative models and evaluated for fidelity, downstream utility, and privacy. Mixed-effect models were used to assess the effect of adjunct variables on the respective evaluation metric, accounting for the medical dataset as a random component.

Results: Fidelity was unaffected by the number of adjunct variables in 5/7 SDG models. Similarly, downstream utility remained stable in 6/7 (predictive task) and 5/7 (inferential task) SDG models. Where significant effects were observed, they were minimal, resulting, for example, in a 0.05 decrease in Area under the Receiver Operating Characteristic curve (AUROC) when adding 120 variables. Privacy was not impacted by the number of adjunct variables.

Discussion: Our findings show that fidelity, utility, and privacy are preserved when generating a more comprehensive medical dataset than the task-relevant subset.

Conclusion: Our findings support a cost-effective, utility, and privacy-preserving way of implementing SDG into medical research and education.

目的:在医学研究和教育中,利用生成式人工智能/机器学习(AI/ML)模型合成人工医疗数据,可以在保护患者隐私的同时实现高质量数据的共享。考虑到这些数据通常是高维的,一个相关的考虑是,当只需要一个任务相关的子集时,是否要合成整个数据集。本研究评估了训练中变量的数量如何影响合成数据(SD)的保真度、效用和隐私性。材料和方法:我们使用了12个横断面医学数据集,定义了具有相应核心变量的下游任务,并通过在核心中添加辅助变量衍生出6354个变体。使用7种不同的生成模型生成SD,并对保真度、下游效用和隐私性进行评估。混合效应模型用于评估辅助变量对各自评价指标的影响,将医疗数据集作为随机组成部分。结果:5/7个SDG模型中辅助变量的数量不影响保真度。同样,在6/7(预测任务)和5/7(推理任务)SDG模型中,下游效用保持稳定。当观察到显著的影响时,它们是最小的,例如,当增加120个变量时,受试者工作特征曲线下的面积(AUROC)减少0.05。隐私不受附加变量数量的影响。讨论:我们的研究结果表明,当生成比任务相关子集更全面的医疗数据集时,保真度、实用性和隐私性得到了保护。结论:我们的研究结果支持在医学研究和教育中实施可持续发展目标的一种具有成本效益、实用性和隐私保护的方式。
{"title":"Should we synthesize more than we need: impact of synthetic data generation for high-dimensional cross-sectional medical data.","authors":"Lisa Pilgram, Samer El Kababji, Dan Liu, Khaled El Emam","doi":"10.1093/jamia/ocaf169","DOIUrl":"10.1093/jamia/ocaf169","url":null,"abstract":"<p><strong>Objective: </strong>In medical research and education, generative artificial intelligence/machine learning (AI/ML) models to synthesize artificial medical data can enable the sharing of high-quality data while preserving the privacy of patients. Given that such data is often high-dimensional, a relevant consideration is whether to synthesize the entire dataset when only a task-relevant subset is needed. This study evaluates how the number of variables in training impacts fidelity, utility, and privacy of the synthetic data (SD).</p><p><strong>Material and methods: </strong>We used 12 cross-sectional medical datasets, defined a downstream task with corresponding core variables, and derived 6354 variants by adding adjunct variables to the core. SD was generated using 7 different generative models and evaluated for fidelity, downstream utility, and privacy. Mixed-effect models were used to assess the effect of adjunct variables on the respective evaluation metric, accounting for the medical dataset as a random component.</p><p><strong>Results: </strong>Fidelity was unaffected by the number of adjunct variables in 5/7 SDG models. Similarly, downstream utility remained stable in 6/7 (predictive task) and 5/7 (inferential task) SDG models. Where significant effects were observed, they were minimal, resulting, for example, in a 0.05 decrease in Area under the Receiver Operating Characteristic curve (AUROC) when adding 120 variables. Privacy was not impacted by the number of adjunct variables.</p><p><strong>Discussion: </strong>Our findings show that fidelity, utility, and privacy are preserved when generating a more comprehensive medical dataset than the task-relevant subset.</p><p><strong>Conclusion: </strong>Our findings support a cost-effective, utility, and privacy-preserving way of implementing SDG into medical research and education.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1843-1854"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646385/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145259724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Including AI in diffusion-weighted breast MRI has potential to increase reader confidence and reduce workload. 在弥散加权乳房MRI中加入人工智能有可能增加读者的信心并减少工作量。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf156
Dimitrios Bounias, Lina Simons, Michael Baumgartner, Chris Ehring, Peter Neher, Lorenz A Kapsner, Balint Kovacs, Ralf Floca, Paul F Jaeger, Jessica Eberle, Dominique Hadler, Frederik B Laun, Sabine Ohlmeyer, Lena Maier-Hein, Michael Uder, Evelyn Wenkel, Klaus H Maier-Hein, Sebastian Bickelhaupt

Objectives: Breast diffusion-weighted imaging (DWI) has shown potential as a standalone imaging technique for certain indications, eg, supplemental screening of women with dense breasts. This study evaluates an artificial intelligence (AI)-powered computer-aided diagnosis (CAD) system for clinical interpretation and workload reduction in breast DWI.

Materials and methods: This retrospective IRB-approved study included: n = 824 examinations for model development (2017-2020) and n = 235 for evaluation (01/2021-06/2021). Readings were performed by three readers using either the AI-CAD or manual readings. BI-RADS-like (Breast Imaging Reporting and Data System) classification was based on DWI. Histopathology served as ground truth. The model was nnDetection-based, trained using 5-fold cross-validation and ensembling. Statistical significance was determined using McNemar's test. Inter-rater agreement was calculated using Cohen's kappa. Model performance was calculated using the area under the receiver operating curve (AUC).

Results: The AI-augmented approach significantly reduced BI-RADS-like 3 calls in breast DWI by 29% (P =.019) and increased interrater agreement (0.57 ± 0.10 vs 0.49 ± 0.11), while preserving diagnostic accuracy. Two of the three readers detected more malignant lesions (63/69 vs 59/69 and 64/69 vs 62/69) with the AI-CAD. The AI model achieved an AUC of 0.78 (95% CI: [0.72, 0.85]; P <.001), which increased for women at screening age to 0.82 (95% CI: [0.73, 0.90]; P <.001), indicating a potential for workload reduction of 20.9% at 96% sensitivity.

Discussion and conclusion: Breast DWI might benefit from AI support. In our study, AI showed potential for reduction of BI-RADS-like 3 calls and increase of inter-rater agreement. However, given the limited study size, further research is needed.

目的:乳腺弥散加权成像(DWI)已经显示出作为一种独立成像技术在某些适应症中的潜力,例如,对致密乳房的女性进行补充筛查。本研究评估了一种人工智能(AI)驱动的计算机辅助诊断(CAD)系统,用于临床解释和减少乳腺DWI的工作量。材料和方法:这项经irb批准的回顾性研究包括:n = 824例模型开发检查(2017-2020)和n = 235例评估检查(2021年1月- 2021年6月)。读数由三名读者使用AI-CAD或手动读数进行。bi - rads类(乳腺成像报告和数据系统)分类基于DWI。组织病理学是最基本的事实。该模型基于nndetection,使用5倍交叉验证和集成进行训练。采用McNemar检验确定统计学显著性。评级机构间的协议是用科恩的kappa来计算的。模型性能计算使用面积下的接收者工作曲线(AUC)。结果:人工智能增强方法在保持诊断准确性的同时,显著减少了乳房DWI中bi - rads样3次呼叫29% (P = 0.019),提高了判据一致性(0.57±0.10 vs 0.49±0.11)。三名读卡器中有两名使用AI-CAD检测到更多的恶性病变(63/69 vs 59/69, 64/69 vs 62/69)。人工智能模型的AUC为0.78 (95% CI: [0.72, 0.85]; P讨论和结论:乳房DWI可能受益于人工智能的支持。在我们的研究中,人工智能显示出减少bi - rad -like 3呼叫和增加评级间协议的潜力。然而,由于研究规模有限,还需要进一步的研究。
{"title":"Including AI in diffusion-weighted breast MRI has potential to increase reader confidence and reduce workload.","authors":"Dimitrios Bounias, Lina Simons, Michael Baumgartner, Chris Ehring, Peter Neher, Lorenz A Kapsner, Balint Kovacs, Ralf Floca, Paul F Jaeger, Jessica Eberle, Dominique Hadler, Frederik B Laun, Sabine Ohlmeyer, Lena Maier-Hein, Michael Uder, Evelyn Wenkel, Klaus H Maier-Hein, Sebastian Bickelhaupt","doi":"10.1093/jamia/ocaf156","DOIUrl":"10.1093/jamia/ocaf156","url":null,"abstract":"<p><strong>Objectives: </strong>Breast diffusion-weighted imaging (DWI) has shown potential as a standalone imaging technique for certain indications, eg, supplemental screening of women with dense breasts. This study evaluates an artificial intelligence (AI)-powered computer-aided diagnosis (CAD) system for clinical interpretation and workload reduction in breast DWI.</p><p><strong>Materials and methods: </strong>This retrospective IRB-approved study included: n = 824 examinations for model development (2017-2020) and n = 235 for evaluation (01/2021-06/2021). Readings were performed by three readers using either the AI-CAD or manual readings. BI-RADS-like (Breast Imaging Reporting and Data System) classification was based on DWI. Histopathology served as ground truth. The model was nnDetection-based, trained using 5-fold cross-validation and ensembling. Statistical significance was determined using McNemar's test. Inter-rater agreement was calculated using Cohen's kappa. Model performance was calculated using the area under the receiver operating curve (AUC).</p><p><strong>Results: </strong>The AI-augmented approach significantly reduced BI-RADS-like 3 calls in breast DWI by 29% (P =.019) and increased interrater agreement (0.57 ± 0.10 vs 0.49 ± 0.11), while preserving diagnostic accuracy. Two of the three readers detected more malignant lesions (63/69 vs 59/69 and 64/69 vs 62/69) with the AI-CAD. The AI model achieved an AUC of 0.78 (95% CI: [0.72, 0.85]; P <.001), which increased for women at screening age to 0.82 (95% CI: [0.73, 0.90]; P <.001), indicating a potential for workload reduction of 20.9% at 96% sensitivity.</p><p><strong>Discussion and conclusion: </strong>Breast DWI might benefit from AI support. In our study, AI showed potential for reduction of BI-RADS-like 3 calls and increase of inter-rater agreement. However, given the limited study size, further research is needed.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1908-1915"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646386/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145126441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
FHIR-Former: enhancing clinical predictions through Fast Healthcare Interoperability Resources and large language models. FHIR-Former:通过快速医疗互操作性资源和大型语言模型增强临床预测。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf165
Merlin Engelke, Giulia Baldini, Jens Kleesiek, Felix Nensa, Amin Dada

Objective: To address the challenges of data heterogeneity and manual feature engineering in clinical predictive modeling, we introduce FHIR-Former, an open-source framework integrating Fast Healthcare Interoperability Resources (FHIR) with large language models (LLMs) to automate and standardize clinical prediction tasks.

Materials and methods: FHIR-Former dynamically processes structured (eg, lab results, medications) and unstructured (eg, clinical notes) data from FHIR resources. The pipeline supports multiple classification tasks, including 30-day readmission, imaging study prediction, and ICD code classification. Leveraging open-source LLMs (GeBERTa), we trained models on 1.1 million data points across ten FHIR resources using retrospective inpatient data (2018-2024). Hyperparameters were optimized via Bayesian methods, and outputs were mapped to FHIR RiskAssessment resources for interoperability.

Results: FHIR-Former achieved an F1-score of 70.7% and accuracy of 72.9% for 30-day readmission, 51.8% F1-score (88.1% accuracy) for mortality prediction, and 61% macro F1-score for imaging study classification. The ICD code prediction model attained 94% accuracy. Performance demonstrated promising performance for readmission and showed scalability across tasks without manual feature engineering.

Discussion: FHIR-Former eliminates institution-specific preprocessing by adapting to diverse FHIR implementations, enabling seamless integration of multimodal data. Its configurable architecture outperformed prior frameworks reliant on static inputs or limited to unstructured text. Real-time risk scores embedded in FHIR servers enhance clinical workflows without disrupting existing practices.

Conclusion: By harmonizing FHIR standardization with LLM flexibility, FHIR-Former advances scalable, interoperable predictive modeling in healthcare. The open-source framework facilitates automation, improves resource allocation, and supports personalized decision-making, bridging gaps between AI innovation and clinical practice.

目的:为了解决临床预测建模中数据异构和手动特征工程的挑战,我们引入了FHIR- former,这是一个将快速医疗互操作性资源(FHIR)与大型语言模型(llm)集成在一起的开源框架,用于自动化和标准化临床预测任务。材料和方法:FHIR- former动态处理来自FHIR资源的结构化(如实验室结果、药物)和非结构化(如临床记录)数据。该管道支持多种分类任务,包括30天再入院、成像研究预测和ICD代码分类。利用开源法学硕士(GeBERTa),我们使用回顾性住院患者数据(2018-2024)在10个FHIR资源中的110万个数据点上训练模型。通过贝叶斯方法优化超参数,并将输出映射到FHIR风险评估资源以实现互操作性。结果:FHIR-Former对30天再入院患者的f1评分为70.7%,准确率为72.9%,对死亡率预测的f1评分为51.8%,准确率为88.1%,对影像学研究分类的宏观f1评分为61%。ICD代码预测模型的准确率达到94%。性能展示了重入的良好性能,并展示了跨任务的可伸缩性,无需手动特征工程。讨论:FHIR- former通过适应不同的FHIR实现消除了机构特定的预处理,实现了多模式数据的无缝集成。其可配置架构优于依赖静态输入或仅限于非结构化文本的先前框架。嵌入在FHIR服务器中的实时风险评分可以在不破坏现有实践的情况下增强临床工作流程。结论:通过协调FHIR标准化和LLM灵活性,FHIR- former在医疗保健领域推进了可扩展、可互操作的预测建模。开源框架促进了自动化,改善了资源分配,并支持个性化决策,弥合了人工智能创新与临床实践之间的差距。
{"title":"FHIR-Former: enhancing clinical predictions through Fast Healthcare Interoperability Resources and large language models.","authors":"Merlin Engelke, Giulia Baldini, Jens Kleesiek, Felix Nensa, Amin Dada","doi":"10.1093/jamia/ocaf165","DOIUrl":"10.1093/jamia/ocaf165","url":null,"abstract":"<p><strong>Objective: </strong>To address the challenges of data heterogeneity and manual feature engineering in clinical predictive modeling, we introduce FHIR-Former, an open-source framework integrating Fast Healthcare Interoperability Resources (FHIR) with large language models (LLMs) to automate and standardize clinical prediction tasks.</p><p><strong>Materials and methods: </strong>FHIR-Former dynamically processes structured (eg, lab results, medications) and unstructured (eg, clinical notes) data from FHIR resources. The pipeline supports multiple classification tasks, including 30-day readmission, imaging study prediction, and ICD code classification. Leveraging open-source LLMs (GeBERTa), we trained models on 1.1 million data points across ten FHIR resources using retrospective inpatient data (2018-2024). Hyperparameters were optimized via Bayesian methods, and outputs were mapped to FHIR RiskAssessment resources for interoperability.</p><p><strong>Results: </strong>FHIR-Former achieved an F1-score of 70.7% and accuracy of 72.9% for 30-day readmission, 51.8% F1-score (88.1% accuracy) for mortality prediction, and 61% macro F1-score for imaging study classification. The ICD code prediction model attained 94% accuracy. Performance demonstrated promising performance for readmission and showed scalability across tasks without manual feature engineering.</p><p><strong>Discussion: </strong>FHIR-Former eliminates institution-specific preprocessing by adapting to diverse FHIR implementations, enabling seamless integration of multimodal data. Its configurable architecture outperformed prior frameworks reliant on static inputs or limited to unstructured text. Real-time risk scores embedded in FHIR servers enhance clinical workflows without disrupting existing practices.</p><p><strong>Conclusion: </strong>By harmonizing FHIR standardization with LLM flexibility, FHIR-Former advances scalable, interoperable predictive modeling in healthcare. The open-source framework facilitates automation, improves resource allocation, and supports personalized decision-making, bridging gaps between AI innovation and clinical practice.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1793-1801"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646377/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145287573","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Large language models accurately identify immunosuppression in intensive care unit patients. 大型语言模型准确识别重症监护病房患者的免疫抑制。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf141
Vijeeth Guggilla, Mengjia Kang, Melissa J Bak, Steven D Tran, Anna Pawlowski, Prasanth Nannapaneni, Luke V Rasmussen, Daniel Schneider, Helen K Donnelly, Ankit Agrawal, David Liebovitz, Alexander V Misharin, G R Scott Budinger, Richard G Wunderink, Theresa L Walunas, Catherine A Gao

Objective: Rule-based structured data algorithms and natural language processing (NLP) approaches applied to unstructured clinical notes have limited accuracy and poor generalizability for identifying immunosuppression. Large language models (LLMs) may effectively identify patients with heterogenous types of immunosuppression from unstructured clinical notes. We compared the performance of LLMs applied to unstructured notes for identifying patients with immunosuppressive conditions or immunosuppressive medication use against 2 baselines: (1) structured data algorithms using diagnosis codes and medication orders and (2) NLP approaches applied to unstructured notes.

Materials and methods: We used hospital admission notes from a primary cohort of 827 intensive care unit (ICU) patients at Northwestern Memorial Hospital and a validation cohort of 200 ICU patients at Beth Israel Deaconess Medical Center, along with diagnosis codes and medication orders from the primary cohort. We evaluated the performance of structured data algorithms, NLP approaches, and LLMs in identifying 7 immunosuppressive conditions and 6 immunosuppressive medications.

Results: In the primary cohort, structured data algorithms achieved peak F1 scores ranging from 0.30 to 0.97 for identifying immunosuppressive conditions and medications. NLP approaches achieved peak F1 scores ranging from 0 to 1. GPT-4o outperformed or matched structured data algorithms and NLP approaches across all conditions and medications, with F1 scores ranging from 0.51 to 1. GPT-4o also performed impressively in our validation cohort (F1 = 1 for 8/13 variables).

Discussion: LLMs, particularly GPT-4o, outperformed structured data algorithms and NLP approaches in identifying immunosuppressive conditions and medications with robust external validation.

Conclusion: LLMs can be applied for improved cohort identification for research purposes.

目的:基于规则的结构化数据算法和应用于非结构化临床记录的自然语言处理(NLP)方法在识别免疫抑制方面准确性有限,通用性差。大型语言模型(LLMs)可以从非结构化的临床记录中有效地识别异质型免疫抑制患者。我们比较了应用于非结构化笔记的llm的性能,用于识别免疫抑制状况或免疫抑制药物使用的患者,对比了两个基线:(1)使用诊断代码和药物订单的结构化数据算法,以及(2)应用于非结构化笔记的NLP方法。材料和方法:我们使用了来自西北纪念医院827名重症监护病房(ICU)患者的主要队列和来自贝斯以色列女执事医疗中心200名ICU患者的验证队列的住院记录,以及来自主要队列的诊断代码和用药单。我们评估了结构化数据算法、NLP方法和llm在识别7种免疫抑制条件和6种免疫抑制药物方面的性能。结果:在主要队列中,结构化数据算法在识别免疫抑制疾病和药物方面达到了0.30至0.97的F1评分峰值。NLP方法的F1得分峰值在0到1之间。gpt - 40在所有疾病和药物治疗中表现优于或匹配结构化数据算法和NLP方法,F1得分范围为0.51至1。gpt - 40在我们的验证队列中也表现令人印象深刻(8/13个变量F1 = 1)。讨论:llm,特别是gpt - 40,在识别免疫抑制条件和药物方面优于结构化数据算法和NLP方法,并具有强大的外部验证。结论:llm可用于改进队列识别,用于研究目的。
{"title":"Large language models accurately identify immunosuppression in intensive care unit patients.","authors":"Vijeeth Guggilla, Mengjia Kang, Melissa J Bak, Steven D Tran, Anna Pawlowski, Prasanth Nannapaneni, Luke V Rasmussen, Daniel Schneider, Helen K Donnelly, Ankit Agrawal, David Liebovitz, Alexander V Misharin, G R Scott Budinger, Richard G Wunderink, Theresa L Walunas, Catherine A Gao","doi":"10.1093/jamia/ocaf141","DOIUrl":"10.1093/jamia/ocaf141","url":null,"abstract":"<p><strong>Objective: </strong>Rule-based structured data algorithms and natural language processing (NLP) approaches applied to unstructured clinical notes have limited accuracy and poor generalizability for identifying immunosuppression. Large language models (LLMs) may effectively identify patients with heterogenous types of immunosuppression from unstructured clinical notes. We compared the performance of LLMs applied to unstructured notes for identifying patients with immunosuppressive conditions or immunosuppressive medication use against 2 baselines: (1) structured data algorithms using diagnosis codes and medication orders and (2) NLP approaches applied to unstructured notes.</p><p><strong>Materials and methods: </strong>We used hospital admission notes from a primary cohort of 827 intensive care unit (ICU) patients at Northwestern Memorial Hospital and a validation cohort of 200 ICU patients at Beth Israel Deaconess Medical Center, along with diagnosis codes and medication orders from the primary cohort. We evaluated the performance of structured data algorithms, NLP approaches, and LLMs in identifying 7 immunosuppressive conditions and 6 immunosuppressive medications.</p><p><strong>Results: </strong>In the primary cohort, structured data algorithms achieved peak F1 scores ranging from 0.30 to 0.97 for identifying immunosuppressive conditions and medications. NLP approaches achieved peak F1 scores ranging from 0 to 1. GPT-4o outperformed or matched structured data algorithms and NLP approaches across all conditions and medications, with F1 scores ranging from 0.51 to 1. GPT-4o also performed impressively in our validation cohort (F1 = 1 for 8/13 variables).</p><p><strong>Discussion: </strong>LLMs, particularly GPT-4o, outperformed structured data algorithms and NLP approaches in identifying immunosuppressive conditions and medications with robust external validation.</p><p><strong>Conclusion: </strong>LLMs can be applied for improved cohort identification for research purposes.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1888-1898"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12490808/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145114981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Efficient extraction of medication information from clinical notes: an evaluation in 2 languages. 从临床记录中有效提取药物信息:两种语言的评估。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf113
Thibaut Fabacher, Erik-André Sauleau, Emmanuelle Arcay, Bineta Faye, Maxime Alter, Archia Chahard, Nathan Miraillet, Adrien Coulet, Aurélie Névéol

Objective: To evaluate the accuracy, computational cost, and portability of a new natural language processing (NLP) method for extracting medication information from clinical narratives.

Materials and methods: We propose an original transformer-based architecture for the extraction of entities and their relations pertaining to patients' medication regimen. First, we used this approach to train and evaluate a model on French clinical notes, using a newly annotated corpus from Hôpitaux Universitaires de Strasbourg. Second, the portability of the approach was assessed by conducting an evaluation on clinical documents in English from the 2018 n2c2 shared task. Information extraction accuracy and computational cost were assessed by comparison with an available method using transformers.

Results: The proposed architecture achieves on the task of relation extraction itself performance that are competitive with the state-of-the-art on both French and English (F-measures 0.82 and 0.96 vs 0.81 and 0.95), but reduces the computational cost by 10. End-to-end (Named Entity recognition and Relation Extraction) F1 performance is 0.69 and 0.82 for French and English corpus.

Discussion: While an existing system developed for English notes was deployed in a French hospital setting with reasonable effort, we found that an alternative architecture offered end-to-end drug information extraction with comparable extraction performance and lower computational impact for both French and English clinical text processing, respectively.

Conclusion: The proposed architecture can be used to extract medication information from clinical text with high performance and low computational cost and consequently suits with usually limited hospital IT resources.

目的:评估一种新的自然语言处理(NLP)方法从临床叙述中提取药物信息的准确性、计算成本和可移植性。材料和方法:我们提出了一种基于变压器的原始架构,用于提取与患者用药方案相关的实体及其关系。首先,我们使用来自Hôpitaux Universitaires de Strasbourg的新注释语料库,使用这种方法来训练和评估法国临床笔记的模型。其次,通过对2018年n2c2共享任务中的英文临床文件进行评估,评估了该方法的可移植性。通过与现有的变压器信息提取方法的比较,评估了信息提取的准确性和计算成本。结果:所提出的架构在关系提取任务本身的性能上达到了与法语和英语的最先进技术相竞争的水平(f值为0.82和0.96 vs 0.81和0.95),但计算成本降低了10。法语和英语语料库的端到端(命名实体识别和关系提取)F1性能分别为0.69和0.82。讨论:虽然为英语笔记开发的现有系统在法国医院环境中部署了合理的努力,但我们发现另一种架构提供端到端的药物信息提取,其提取性能与法语和英语临床文本处理相当,并且计算影响更低。结论:所提出的体系结构能够高效、低计算成本地从临床文本中提取药物信息,适合医院通常有限的IT资源。
{"title":"Efficient extraction of medication information from clinical notes: an evaluation in 2 languages.","authors":"Thibaut Fabacher, Erik-André Sauleau, Emmanuelle Arcay, Bineta Faye, Maxime Alter, Archia Chahard, Nathan Miraillet, Adrien Coulet, Aurélie Névéol","doi":"10.1093/jamia/ocaf113","DOIUrl":"10.1093/jamia/ocaf113","url":null,"abstract":"<p><strong>Objective: </strong>To evaluate the accuracy, computational cost, and portability of a new natural language processing (NLP) method for extracting medication information from clinical narratives.</p><p><strong>Materials and methods: </strong>We propose an original transformer-based architecture for the extraction of entities and their relations pertaining to patients' medication regimen. First, we used this approach to train and evaluate a model on French clinical notes, using a newly annotated corpus from Hôpitaux Universitaires de Strasbourg. Second, the portability of the approach was assessed by conducting an evaluation on clinical documents in English from the 2018 n2c2 shared task. Information extraction accuracy and computational cost were assessed by comparison with an available method using transformers.</p><p><strong>Results: </strong>The proposed architecture achieves on the task of relation extraction itself performance that are competitive with the state-of-the-art on both French and English (F-measures 0.82 and 0.96 vs 0.81 and 0.95), but reduces the computational cost by 10. End-to-end (Named Entity recognition and Relation Extraction) F1 performance is 0.69 and 0.82 for French and English corpus.</p><p><strong>Discussion: </strong>While an existing system developed for English notes was deployed in a French hospital setting with reasonable effort, we found that an alternative architecture offered end-to-end drug information extraction with comparable extraction performance and lower computational impact for both French and English clinical text processing, respectively.</p><p><strong>Conclusion: </strong>The proposed architecture can be used to extract medication information from clinical text with high performance and low computational cost and consequently suits with usually limited hospital IT resources.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1855-1864"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646380/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145276320","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A communication-efficient federated learning algorithm to assess racial disparities in post-transplantation survival time. 一种有效沟通的联邦学习算法评估移植后生存时间的种族差异。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf138
Yudong Wang, Dazheng Zhang, Jiayi Tong, Xing He, Liang Li, Lichao Sun, Ashutosh M Shukla, Jiang Bian, David A Asch, Yong Chen

Objective: Patients of different race have different outcomes following renal transplantation. Patients of different race also undergo renal transplantation at different hospitals. We used a novel decentralized multisite approach to quantitatively assess the effect of site of care on racial disparities between non-Hispanic Black (NHB) and non-Hispanic White (NHW) patients in post-transplantation survival times.

Materials and methods: In this study, we develop a communication-efficient federated learning algorithm to assess site-of-care associated racial disparities based on decentralized time-to-event data, called Communication-Efficient Distributed Analysis for Racial Disparity in Time-to-event Data (CEDAR-t2e). The algorithm includes 2 modules. Module I is to estimate the site-specific proportional hazards model for time-to-event outcomes in a distributed manner, in which the Poissonization is used to simplify the estimation procedure. Based on the estimated results from Module I, Module II calculates how long the kidney failure time of NHB patients would be extended had they been admitted to transplant centers in the same distribution as NHW patients were admitted.

Results: With application to United States Renal Data System data covering 39 043 patients across 73 transplant centers, we found no evidence suggesting the presence of site-of-care associated racial disparities in post-transplantation survival times. In particular, restricting to one year after transplantation, the counterfactual graft failure time would have been extended by only 0.61 days on average if NHB had the same admission distribution to transplant centers as NHW patients.

Discussion: The proposed approach offers a quantitative measure to evaluate site-of-care associated racial disparities.

Conclusion: Our approach has the potential to be extended to investigate site-of-care related disparities in other time-to-event outcomes, thus promoting health equity and improving patient health in various fields.

目的:不同种族患者肾移植术后预后不同。不同种族的患者在不同的医院接受肾移植。我们采用了一种新颖的分散式多站点方法来定量评估非西班牙裔黑人(NHB)和非西班牙裔白人(NHW)患者移植后生存时间的护理地点对种族差异的影响。材料和方法:在本研究中,我们开发了一种通信高效的联邦学习算法,基于分散的时间到事件数据来评估与护理地点相关的种族差异,称为时间到事件数据中种族差异的通信高效分布式分析(CEDAR-t2e)。该算法包括2个模块。模块1以分布式的方式估计时间事件结果的地点特定比例风险模型,其中使用泊松化来简化估计过程。根据模块1的估计结果,模块2计算如果NHB患者与NHW患者被送入相同分布的移植中心,他们的肾衰竭时间会延长多久。结果:应用美国肾脏数据系统的数据,涵盖73个移植中心的39 043名患者,我们发现没有证据表明在移植后生存时间中存在与护理地点相关的种族差异。特别是,在移植后1年内,如果NHB患者与NHW患者在移植中心的入院分布相同,则反事实移植失败时间平均仅延长0.61天。讨论:提出的方法提供了一种定量的方法来评估与护理地点相关的种族差异。结论:我们的方法有可能被扩展到调查其他事件时间结局中与护理地点相关的差异,从而促进卫生公平并改善各个领域的患者健康。
{"title":"A communication-efficient federated learning algorithm to assess racial disparities in post-transplantation survival time.","authors":"Yudong Wang, Dazheng Zhang, Jiayi Tong, Xing He, Liang Li, Lichao Sun, Ashutosh M Shukla, Jiang Bian, David A Asch, Yong Chen","doi":"10.1093/jamia/ocaf138","DOIUrl":"10.1093/jamia/ocaf138","url":null,"abstract":"<p><strong>Objective: </strong>Patients of different race have different outcomes following renal transplantation. Patients of different race also undergo renal transplantation at different hospitals. We used a novel decentralized multisite approach to quantitatively assess the effect of site of care on racial disparities between non-Hispanic Black (NHB) and non-Hispanic White (NHW) patients in post-transplantation survival times.</p><p><strong>Materials and methods: </strong>In this study, we develop a communication-efficient federated learning algorithm to assess site-of-care associated racial disparities based on decentralized time-to-event data, called Communication-Efficient Distributed Analysis for Racial Disparity in Time-to-event Data (CEDAR-t2e). The algorithm includes 2 modules. Module I is to estimate the site-specific proportional hazards model for time-to-event outcomes in a distributed manner, in which the Poissonization is used to simplify the estimation procedure. Based on the estimated results from Module I, Module II calculates how long the kidney failure time of NHB patients would be extended had they been admitted to transplant centers in the same distribution as NHW patients were admitted.</p><p><strong>Results: </strong>With application to United States Renal Data System data covering 39 043 patients across 73 transplant centers, we found no evidence suggesting the presence of site-of-care associated racial disparities in post-transplantation survival times. In particular, restricting to one year after transplantation, the counterfactual graft failure time would have been extended by only 0.61 days on average if NHB had the same admission distribution to transplant centers as NHW patients.</p><p><strong>Discussion: </strong>The proposed approach offers a quantitative measure to evaluate site-of-care associated racial disparities.</p><p><strong>Conclusion: </strong>Our approach has the potential to be extended to investigate site-of-care related disparities in other time-to-event outcomes, thus promoting health equity and improving patient health in various fields.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1916-1926"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646379/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145132365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A standards-based approach to digital health research: implementing the people heart study. 基于标准的数字健康研究方法:实施人的心脏研究。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-12-01 DOI: 10.1093/jamia/ocaf163
Raheel Sayeed, David Kreda, Joshua C Mandel, Bryan Larson, William Gordon, Kenneth D Mandl, Isaac Kohane

Objective: To assess whether HL7 Fast Healthcare Interoperability Resources (FHIR) can underpin a fully standards-based, end-to-end digital research architecture, demonstrate it in a live study, and quantify its benefits for interoperability and development efficiency.

Materials and methods: We designed a generalizable standards-based architecture to accelerate digital health research relying on FHIR as the sole transactional model throughout a participant research lifecycle starting from API-based study discovery to results. It was instantiated for People Heart Study, a real-world digital health cardiovascular-risk assessment study with its protocol transformed into FHIR resources (eligibility, consent, tasks, and results). Evaluation examined workflow coverage, validator conformance across independent servers, and points requiring custom extensions or app logic.

Results: The architecture was implemented using cloud managed FHIR stores including an illustrative public research discovery API for first-/third-party apps. A participant-facing iOS app was published on the App Store. Our evaluation reveals that 6 of 10 research app workflows could be executed entirely from FHIR artifacts; 2 were partially standards-driven and 2 remained limited requiring custom development. All FHIR resources passed structural, semantic validation with minimal custom extension usage and terminology integrity issues.

Discussion: Our approach addresses persistent challenges in digital health research by enhancing data interoperability, minimizing redundant development, and supporting the full research lifecycle. The architecture aligns with national priorities and complements healthcare standardization efforts.

Conclusion: By leveraging FHIR, our architecture enables generalizability, interoperability, and reuse across diverse digital health research contexts, transforming study design into data modeling rather than software development, and fostering a more inclusive and agile digital health ecosystem.

目的:评估HL7快速医疗保健互操作性资源(FHIR)是否可以支持完全基于标准的端到端数字研究架构,在现场研究中进行演示,并量化其对互操作性和开发效率的好处。材料和方法:我们设计了一个基于通用标准的架构,以加速数字健康研究,依靠FHIR作为参与者研究生命周期中从基于api的研究发现到结果的唯一交易模型。这是一项现实世界的数字健康心血管风险评估研究,其方案转化为FHIR资源(资格、同意、任务和结果)。评估检查了工作流覆盖范围、跨独立服务器的验证器一致性,以及需要自定义扩展或应用程序逻辑的点。结果:该架构是使用云管理的FHIR商店实现的,包括用于第一个/第三方应用程序的说明性公共研究发现API。一款面向参与者的iOS应用在app Store上发布。我们的评估显示,10个研究应用程序工作流中有6个可以完全从FHIR工件执行;2个部分是标准驱动的,2个仍然需要定制开发。所有的FHIR资源都通过了结构的、语义的验证,并且最小化了自定义扩展的使用和术语完整性问题。讨论:我们的方法通过增强数据互操作性、最小化冗余开发和支持整个研究生命周期来解决数字健康研究中持续存在的挑战。该体系结构与国家优先事项保持一致,并补充了医疗保健标准化工作。结论:通过利用FHIR,我们的架构能够在不同的数字健康研究环境中实现通用性、互操作性和重用性,将研究设计转化为数据建模而不是软件开发,并促进更具包容性和敏捷性的数字健康生态系统。
{"title":"A standards-based approach to digital health research: implementing the people heart study.","authors":"Raheel Sayeed, David Kreda, Joshua C Mandel, Bryan Larson, William Gordon, Kenneth D Mandl, Isaac Kohane","doi":"10.1093/jamia/ocaf163","DOIUrl":"10.1093/jamia/ocaf163","url":null,"abstract":"<p><strong>Objective: </strong>To assess whether HL7 Fast Healthcare Interoperability Resources (FHIR) can underpin a fully standards-based, end-to-end digital research architecture, demonstrate it in a live study, and quantify its benefits for interoperability and development efficiency.</p><p><strong>Materials and methods: </strong>We designed a generalizable standards-based architecture to accelerate digital health research relying on FHIR as the sole transactional model throughout a participant research lifecycle starting from API-based study discovery to results. It was instantiated for People Heart Study, a real-world digital health cardiovascular-risk assessment study with its protocol transformed into FHIR resources (eligibility, consent, tasks, and results). Evaluation examined workflow coverage, validator conformance across independent servers, and points requiring custom extensions or app logic.</p><p><strong>Results: </strong>The architecture was implemented using cloud managed FHIR stores including an illustrative public research discovery API for first-/third-party apps. A participant-facing iOS app was published on the App Store. Our evaluation reveals that 6 of 10 research app workflows could be executed entirely from FHIR artifacts; 2 were partially standards-driven and 2 remained limited requiring custom development. All FHIR resources passed structural, semantic validation with minimal custom extension usage and terminology integrity issues.</p><p><strong>Discussion: </strong>Our approach addresses persistent challenges in digital health research by enhancing data interoperability, minimizing redundant development, and supporting the full research lifecycle. The architecture aligns with national priorities and complements healthcare standardization efforts.</p><p><strong>Conclusion: </strong>By leveraging FHIR, our architecture enables generalizability, interoperability, and reuse across diverse digital health research contexts, transforming study design into data modeling rather than software development, and fostering a more inclusive and agile digital health ecosystem.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":"1811-1821"},"PeriodicalIF":4.6,"publicationDate":"2025-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12646378/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145208434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Determining optimal strategies for personalized atrial fibrillation treatment in intensive care unit patients using a deep learning-based causal inference approach: rhythm and/or rate control. 使用基于深度学习的因果推理方法确定重症监护病房患者房颤个性化治疗的最佳策略:节奏和/或速率控制。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-29 DOI: 10.1093/jamia/ocaf203
Min Woo Kang, Shin Young Ahn, Yoonjin Kang

Objectives: Atrial fibrillation (AF) is common among intensive care unit (ICU) patients. Effective management of AF in this setting remains a subject of debate, with current guidelines often derived from outpatient studies. This study aims to evaluate the effectiveness of different AF management strategies-both, rhythm, rate, or no control-in reducing mortality in ICU patients using a deep learning-based causal inference model.

Materials and methods: Data from the Medical Information Mart for Intensive Care (MIMIC)-III and MIMIC-IV were utilized, encompassing ICU admissions with documented AF. Exposures included both rhythm and rate, only rhythm, and only rate, or no control. A deep learning-based causal inference model analyzed treatment effects. Additionally, the characteristics of patients who benefited more from rhythm control compared to rate control were identified using treatment effect sizes and multivariable logistic regression.

Results: The study population comprised 13 583 patients. Both rhythm and rate control, rhythm control-only, and rate control-only strategies significantly reduced in-hospital mortality compared to no control, with average treatment effects of -1.23% (-1.43% to -1.03%), -2.32% (-2.48% to -2.15%), and -9.11% (-9.29% to -8.93%), respectively. Rhythm control proved more effective than rate control in specific subgroups: older age, higher maximum heart rate, presence of new-onset AF, absence of hypertension, absence of diabetes, chronic liver disease, not having undergone heart surgery, and the use of vasopressor agents.

Conclusion: Using a deep learning-based causal inference model, we quantified mortality reduction for each treatment strategy and identified the patient characteristics associated with the most favorable outcomes for each strategy.

目的:房颤(AF)在重症监护病房(ICU)患者中很常见。在这种情况下,房颤的有效管理仍然是一个有争议的话题,目前的指导方针通常来自门诊研究。本研究旨在利用基于深度学习的因果推理模型,评估不同房颤管理策略(节律、频率或不控制)在降低ICU患者死亡率方面的有效性。材料和方法:使用重症监护医学信息市场(MIMIC)-III和MIMIC- iv的数据,包括有AF记录的ICU入院患者。暴露包括节律和速率,仅包括节律和速率,或无对照。基于深度学习的因果推理模型分析了治疗效果。此外,通过治疗效应大小和多变量逻辑回归,确定了从节律控制中获益更多的患者的特征。结果:研究人群包括13 583例患者。与无对照相比,节律和速率控制、仅节律控制和仅速率控制策略均显著降低了住院死亡率,平均治疗效果分别为-1.23%(-1.43%至-1.03%)、-2.32%(-2.48%至-2.15%)和-9.11%(-9.29%至-8.93%)。在特定的亚组中,心律控制比心率控制更有效:年龄较大、最大心率较高、新发房颤、无高血压、无糖尿病、慢性肝病、未接受心脏手术和使用血管加压药物。结论:使用基于深度学习的因果推理模型,我们量化了每种治疗策略的死亡率降低,并确定了与每种策略最有利结果相关的患者特征。
{"title":"Determining optimal strategies for personalized atrial fibrillation treatment in intensive care unit patients using a deep learning-based causal inference approach: rhythm and/or rate control.","authors":"Min Woo Kang, Shin Young Ahn, Yoonjin Kang","doi":"10.1093/jamia/ocaf203","DOIUrl":"https://doi.org/10.1093/jamia/ocaf203","url":null,"abstract":"<p><strong>Objectives: </strong>Atrial fibrillation (AF) is common among intensive care unit (ICU) patients. Effective management of AF in this setting remains a subject of debate, with current guidelines often derived from outpatient studies. This study aims to evaluate the effectiveness of different AF management strategies-both, rhythm, rate, or no control-in reducing mortality in ICU patients using a deep learning-based causal inference model.</p><p><strong>Materials and methods: </strong>Data from the Medical Information Mart for Intensive Care (MIMIC)-III and MIMIC-IV were utilized, encompassing ICU admissions with documented AF. Exposures included both rhythm and rate, only rhythm, and only rate, or no control. A deep learning-based causal inference model analyzed treatment effects. Additionally, the characteristics of patients who benefited more from rhythm control compared to rate control were identified using treatment effect sizes and multivariable logistic regression.</p><p><strong>Results: </strong>The study population comprised 13 583 patients. Both rhythm and rate control, rhythm control-only, and rate control-only strategies significantly reduced in-hospital mortality compared to no control, with average treatment effects of -1.23% (-1.43% to -1.03%), -2.32% (-2.48% to -2.15%), and -9.11% (-9.29% to -8.93%), respectively. Rhythm control proved more effective than rate control in specific subgroups: older age, higher maximum heart rate, presence of new-onset AF, absence of hypertension, absence of diabetes, chronic liver disease, not having undergone heart surgery, and the use of vasopressor agents.</p><p><strong>Conclusion: </strong>Using a deep learning-based causal inference model, we quantified mortality reduction for each treatment strategy and identified the patient characteristics associated with the most favorable outcomes for each strategy.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145642141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
A Lossless One-shot Distributed Algorithm for Addressing Heterogeneity in Multi-Site Generalized Linear Models. 多点广义线性模型非均匀性的一种无损单次分布算法。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-19 DOI: 10.1093/jamia/ocaf198
Bingyu Zhang, Qiong Wu, Jenna M Reps, Lu Li, Jiayi Tong, Yiwen Lu, Dazheng Zhang, Juan Manuel Ramirez-Anguita, Jiang Bian, Milou T Brand, Thomas Falconer, Miguel A Mayer, Ross D Williams, Yong Chen

Objective: We propose Heterogeneity-aware Collaborative One-shot Lossless Algorithm for Generalized Linear Model (COLA-GLM-H), a novel one-shot lossless distributed algorithm that enables the integration of heterogeneous multi-institutional data while preserving patient privacy by avoiding patient-level data sharing.

Materials and methods: Generalized Linear Models (GLMs) are widely used in medical research for analyzing diverse outcome types. In multi-institution settings, we demonstrated that the global likelihood can be reconstructed using only institution-level summary statistics, enabling lossless estimation without accessing individual records. We validated COLA-GLM-H in two real-world studies: (1) a U.S. pediatric centralized network (719,383 patients) evaluating long-term cardiovascular risks following COVID-19, and (2) an internationally decentralized network of 120,429 hospitalized patients from seven databases across three countries assessing risk factors for COVID-19 mortality.

Results: In the centralized network, COLA-GLM-H produced estimates identical to those from pooled analyses. In the decentralized setting, the algorithm effectively integrated heterogeneous data across multiple clinical institutions using a single communication round.

Conclusions: COLA-GLM-H is a privacy-preserving, lossless, and communication- and computation-efficient solution for multi-institutional research. It accounts for between-institution heterogeneity and supports all outcome types within the exponential family, enabling secure, scalable, and accurate analysis in collaborative clinical research.

目的:我们提出了异构感知的广义线性模型协作一次性无损算法(COLA-GLM-H),这是一种新颖的一次性无损分布式算法,可以集成异构多机构数据,同时通过避免患者级数据共享来保护患者隐私。材料和方法:广义线性模型(GLMs)广泛应用于医学研究,用于分析不同的结果类型。在多机构设置中,我们证明了全局似然可以仅使用机构级汇总统计来重建,从而在不访问单个记录的情况下实现无损估计。我们在两个现实世界的研究中验证了COLA-GLM-H:(1)美国儿科集中网络(719,383例患者)评估COVID-19后的长期心血管风险;(2)来自三个国家七个数据库的120,429名住院患者的国际分散网络评估COVID-19死亡率的危险因素。结果:在集中式网络中,COLA-GLM-H产生的估计值与汇集分析的估计值相同。在分散的设置中,该算法使用单个通信轮有效地集成了跨多个临床机构的异构数据。结论:COLA-GLM-H是一种隐私保护、无损、通信和计算效率高的多机构研究解决方案。它考虑了机构间的异质性,并支持指数族内的所有结果类型,从而在协作临床研究中实现安全、可扩展和准确的分析。
{"title":"A Lossless One-shot Distributed Algorithm for Addressing Heterogeneity in Multi-Site Generalized Linear Models.","authors":"Bingyu Zhang, Qiong Wu, Jenna M Reps, Lu Li, Jiayi Tong, Yiwen Lu, Dazheng Zhang, Juan Manuel Ramirez-Anguita, Jiang Bian, Milou T Brand, Thomas Falconer, Miguel A Mayer, Ross D Williams, Yong Chen","doi":"10.1093/jamia/ocaf198","DOIUrl":"https://doi.org/10.1093/jamia/ocaf198","url":null,"abstract":"<p><strong>Objective: </strong>We propose Heterogeneity-aware Collaborative One-shot Lossless Algorithm for Generalized Linear Model (COLA-GLM-H), a novel one-shot lossless distributed algorithm that enables the integration of heterogeneous multi-institutional data while preserving patient privacy by avoiding patient-level data sharing.</p><p><strong>Materials and methods: </strong>Generalized Linear Models (GLMs) are widely used in medical research for analyzing diverse outcome types. In multi-institution settings, we demonstrated that the global likelihood can be reconstructed using only institution-level summary statistics, enabling lossless estimation without accessing individual records. We validated COLA-GLM-H in two real-world studies: (1) a U.S. pediatric centralized network (719,383 patients) evaluating long-term cardiovascular risks following COVID-19, and (2) an internationally decentralized network of 120,429 hospitalized patients from seven databases across three countries assessing risk factors for COVID-19 mortality.</p><p><strong>Results: </strong>In the centralized network, COLA-GLM-H produced estimates identical to those from pooled analyses. In the decentralized setting, the algorithm effectively integrated heterogeneous data across multiple clinical institutions using a single communication round.</p><p><strong>Conclusions: </strong>COLA-GLM-H is a privacy-preserving, lossless, and communication- and computation-efficient solution for multi-institutional research. It accounts for between-institution heterogeneity and supports all outcome types within the exponential family, enabling secure, scalable, and accurate analysis in collaborative clinical research.</p>","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":" ","pages":""},"PeriodicalIF":4.6,"publicationDate":"2025-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145551583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Retraction and replacement of: Electronic connectivity between hospital pairs: impact on emergency department-related utilization. 撤销和更换:医院对之间的电子连接:对急诊科相关利用的影响。
IF 4.6 2区 医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS Pub Date : 2025-11-01 DOI: 10.1093/jamia/ocaf158
{"title":"Retraction and replacement of: Electronic connectivity between hospital pairs: impact on emergency department-related utilization.","authors":"","doi":"10.1093/jamia/ocaf158","DOIUrl":"10.1093/jamia/ocaf158","url":null,"abstract":"","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":"32 11","pages":"1789"},"PeriodicalIF":4.6,"publicationDate":"2025-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12626204/pdf/","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"145551556","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"OA","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Journal of the American Medical Informatics Association
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1