首页 > 最新文献

NPJ Digital Medicine最新文献

英文 中文
A novel digital twin strategy to examine the implications of randomized clinical trials for real-world populations 一种新的数字双胞胎策略来检查随机临床试验对现实世界人群的影响
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-07 DOI: 10.1038/s41746-026-02464-1
Phyllis M. Thangaraj, Sumukh Vasisht Shankar, Sicong Huang, Girish N. Nadkarni, Bobak J. Mortazavi, Evangelos K. Oikonomou, Rohan Khera
Randomized clinical trials (RCTs) guide medical practice; however, their generalizability across populations varies. We developed a statistically informed Generative Adversarial Network model, RCT-Twin-GAN, that leverages relationships between covariates and outcomes to generate a digital twin of an RCT conditioned on covariate distributions from a second patient population. We reproduced the disparate treatment effects of RCTs with similar interventions: the Systolic Blood Pressure Intervention Trial (SPRINT) and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Blood Pressure Trial. To demonstrate treatment effects of each RCT conditioned on the other RCT population, we evaluated the cardiovascular event-free survival of SPRINT-Twins conditioned on the ACCORD cohort and vice versa. The digital twins demonstrated balanced treatment arms (mean absolute standardized mean difference (MASMD)) of covariates 0.019 (SD 0.018), and the ACCORD-conditioned covariates of the SPRINT-Twins distributed more similarly to ACCORD than SPRINT (MASMD 0.0082 SD 0.016 vs. 0.46 SD 0.20). Notably, SPRINT-conditioned ACCORD-Twins reproduced the non-significant outcome seen in ACCORD (0.88 (0.73–1.06) vs. 0.87 (0.68–1.13)), while ACCORD-conditioned SPRINT-Twins reproduced the significant outcome seen in SPRINT (0.75 (0.64–0.89) vs. 0.79 (0.72–0.86)). Finally, we applied this approach to a real-world population in the electronic health record. RCT-Twin-GAN simulates the translation of RCT-derived treatment effects across patient populations.
随机临床试验(RCTs)指导医疗实践;然而,它们在不同人群中的普遍性有所不同。我们开发了一个统计信息生成对抗网络模型,RCT- twin - gan,该模型利用协变量和结果之间的关系,以来自第二个患者群体的协变量分布为条件,生成RCT的数字双胞胎。我们重现了具有类似干预措施的随机对照试验的不同治疗效果:收缩压干预试验(SPRINT)和控制糖尿病心血管风险的行动(ACCORD)血压试验。为了证明每个RCT在另一个RCT人群上的治疗效果,我们评估了SPRINT-Twins在ACCORD队列上的无心血管事件生存期,反之亦然。数字双胞胎表现出平衡的治疗臂(平均绝对标准化平均差(MASMD)),协变量为0.019 (SD 0.018), SPRINT-双胞胎的ACCORD条件协变量分布更类似于ACCORD,而不是SPRINT (MASMD 0.0082 SD 0.016 vs. 0.46 SD 0.20)。值得注意的是,SPRINT条件的ACCORD- twins重现了ACCORD的非显著结果(0.88 (0.73-1.06)vs. 0.87(0.68-1.13)),而ACCORD条件的SPRINT- twins重现了SPRINT的显著结果(0.75 (0.64-0.89)vs. 0.79(0.72-0.86))。最后,我们将此方法应用于电子健康记录中的现实世界人群。RCT-Twin-GAN模拟了rct衍生的治疗效果在患者群体中的转化。
{"title":"A novel digital twin strategy to examine the implications of randomized clinical trials for real-world populations","authors":"Phyllis M. Thangaraj, Sumukh Vasisht Shankar, Sicong Huang, Girish N. Nadkarni, Bobak J. Mortazavi, Evangelos K. Oikonomou, Rohan Khera","doi":"10.1038/s41746-026-02464-1","DOIUrl":"https://doi.org/10.1038/s41746-026-02464-1","url":null,"abstract":"Randomized clinical trials (RCTs) guide medical practice; however, their generalizability across populations varies. We developed a statistically informed Generative Adversarial Network model, RCT-Twin-GAN, that leverages relationships between covariates and outcomes to generate a digital twin of an RCT conditioned on covariate distributions from a second patient population. We reproduced the disparate treatment effects of RCTs with similar interventions: the Systolic Blood Pressure Intervention Trial (SPRINT) and the Action to Control Cardiovascular Risk in Diabetes (ACCORD) Blood Pressure Trial. To demonstrate treatment effects of each RCT conditioned on the other RCT population, we evaluated the cardiovascular event-free survival of SPRINT-Twins conditioned on the ACCORD cohort and vice versa. The digital twins demonstrated balanced treatment arms (mean absolute standardized mean difference (MASMD)) of covariates 0.019 (SD 0.018), and the ACCORD-conditioned covariates of the SPRINT-Twins distributed more similarly to ACCORD than SPRINT (MASMD 0.0082 SD 0.016 vs. 0.46 SD 0.20). Notably, SPRINT-conditioned ACCORD-Twins reproduced the non-significant outcome seen in ACCORD (0.88 (0.73–1.06) vs. 0.87 (0.68–1.13)), while ACCORD-conditioned SPRINT-Twins reproduced the significant outcome seen in SPRINT (0.75 (0.64–0.89) vs. 0.79 (0.72–0.86)). Finally, we applied this approach to a real-world population in the electronic health record. RCT-Twin-GAN simulates the translation of RCT-derived treatment effects across patient populations.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"75 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147371126","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Pricing models for diagnostic AI based on qualitative insights from healthcare decision makers. 基于医疗保健决策者定性见解的诊断人工智能定价模型。
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-06 DOI: 10.1038/s41746-026-02501-z
Jan Kirchhoff,Fabian Berns,Christian Schieder,Johannes Schobel
AI-enabled diagnostic decision support systems (DDSS) could improve diagnostic accuracy and efficiency, yet adoption is often impeded by pricing approaches that rely on opaque technical usage metrics. We examined how pricing can remain clinically legible and budgetable while accounting for AI-specific technical and organizational cost drivers. We conducted semi-structured interviews with healthcare decision makers (n = 17) across hospital, outpatient, laboratory, and industry settings and conducted a deductive-inductive thematic analysis. Ten themes emerged, including widespread resistance to purely usage-based pricing and strong preferences for transparency and predictability. Participants supported hybrid models combining a base fee with variable components defined in clinically meaningful units (per patient, per test, or per episode) and emphasized reimbursement alignment alongside integration, training, and support as integral value elements. Outcome-linked payment was viewed as ethically compelling but operationally difficult. We synthesize these findings into stakeholder-informed design principles and actionable recommendations for pricing models that facilitate procurement, reimbursement fit, and sustainable scaling of diagnostic AI.
支持人工智能的诊断决策支持系统(DDSS)可以提高诊断的准确性和效率,但其采用往往受到依赖于不透明技术使用指标的定价方法的阻碍。我们研究了定价如何在考虑人工智能特定技术和组织成本驱动因素的同时保持临床清晰和可预算。我们对医院、门诊、实验室和行业环境中的医疗保健决策者(n = 17)进行了半结构化访谈,并进行了演绎-归纳主题分析。出现了十个主题,包括对纯粹基于使用的定价的普遍抵制,以及对透明度和可预测性的强烈偏好。参与者支持混合模式,将基本费用与临床有意义的单位(每位患者、每次检查或每次发作)定义的可变组成部分结合起来,并强调报销与整合、培训和支持一起作为整体价值要素。与结果挂钩的支付被认为在道德上令人信服,但在操作上困难。我们将这些发现综合为利益相关者知情的设计原则和可操作的定价模型建议,以促进采购、报销和诊断人工智能的可持续扩展。
{"title":"Pricing models for diagnostic AI based on qualitative insights from healthcare decision makers.","authors":"Jan Kirchhoff,Fabian Berns,Christian Schieder,Johannes Schobel","doi":"10.1038/s41746-026-02501-z","DOIUrl":"https://doi.org/10.1038/s41746-026-02501-z","url":null,"abstract":"AI-enabled diagnostic decision support systems (DDSS) could improve diagnostic accuracy and efficiency, yet adoption is often impeded by pricing approaches that rely on opaque technical usage metrics. We examined how pricing can remain clinically legible and budgetable while accounting for AI-specific technical and organizational cost drivers. We conducted semi-structured interviews with healthcare decision makers (n = 17) across hospital, outpatient, laboratory, and industry settings and conducted a deductive-inductive thematic analysis. Ten themes emerged, including widespread resistance to purely usage-based pricing and strong preferences for transparency and predictability. Participants supported hybrid models combining a base fee with variable components defined in clinically meaningful units (per patient, per test, or per episode) and emphasized reimbursement alignment alongside integration, training, and support as integral value elements. Outcome-linked payment was viewed as ethically compelling but operationally difficult. We synthesize these findings into stakeholder-informed design principles and actionable recommendations for pricing models that facilitate procurement, reimbursement fit, and sustainable scaling of diagnostic AI.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"10 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147368377","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Comparing artificial intelligence and healthcare professional performance in surgical and interventional video analysis: a systematic review and meta-analysis 比较人工智能和医疗保健专业人员在外科和介入性视频分析中的表现:系统回顾和荟萃分析
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-06 DOI: 10.1038/s41746-026-02401-2
Amir Rafati Fard, Simon C. Williams, Kieran J. Smith, Jasneet K. Dhaliwal, Tomas Ferreira, Adrito Das, Joachim Starup-Hansen, John G. Hanrahan, Chan Hee Koh, Danyal Z. Khan, Danail Stoyanov, Hani J. Marcus
This systematic review and meta-analysis examines the design of studies comparing the performance of artificial intelligence (AI) with that of healthcare professionals in the analysis of videos from surgical and interventional procedures, and quantitatively evaluates the performance of AI, unassisted healthcare professionals, and AI-assisted healthcare professionals. From the 37,956 studies identified, 146 were included, with 76 providing sufficient information for inclusion in our exploratory meta-analysis. AI had significantly greater sensitivity and comparable specificity compared to unassisted healthcare professionals at their respective peak performance levels, with a relative risk of 1.12 (95% CI 1.07–1.19, p < 0.001) and 1.04 (95% CI 0.98–1.10, p = 0.224), respectively. AI-assisted healthcare professionals had significantly greater sensitivity and specificity compared to unassisted healthcare professionals across all levels of expertise, with a relative risk of 1.18 (95% CI 1.12–1.25, p < 0.001) and 1.05 (95% CI 1.02–1.08, p < 0.001), respectively. There was no significant difference in sensitivity and specificity of AI-assisted expert healthcare professionals versus AI, with a relative risk of 0.99 (95% CI 0.95–1.04, p = 0.787) and 1.03 (95% CI 0.97–1.08, p = 0.395), respectively. Whilst most studies to date have evaluated AI head-to-head against unassisted healthcare professionals, fewer studies examined AI as an assistive tool, despite the real-world integration of AI more likely to involve assistance than autonomy.
本系统综述和荟萃分析检验了比较人工智能(AI)与医疗保健专业人员在外科和介入手术视频分析中的表现的研究设计,并定量评估了人工智能、无辅助医疗保健专业人员和人工智能辅助医疗保健专业人员的表现。在37956项研究中,有146项被纳入,其中76项提供了足够的信息,可以纳入我们的探索性荟萃分析。在各自的最高表现水平上,人工智能的敏感性和可比特异性明显高于无辅助医疗保健专业人员,相对风险分别为1.12 (95% CI 1.07-1.19, p < 0.001)和1.04 (95% CI 0.98-1.10, p = 0.224)。人工智能辅助的医疗保健专业人员在所有专业水平上都比无辅助的医疗保健专业人员具有更高的敏感性和特异性,相对风险分别为1.18 (95% CI 1.12-1.25, p < 0.001)和1.05 (95% CI 1.02-1.08, p < 0.001)。人工智能辅助的专家医疗保健专业人员与人工智能的敏感性和特异性无显著差异,相对风险分别为0.99 (95% CI 0.95-1.04, p = 0.787)和1.03 (95% CI 0.97-1.08, p = 0.395)。虽然迄今为止大多数研究都是将人工智能与无辅助的医疗保健专业人员进行面对面的评估,但将人工智能作为辅助工具进行评估的研究较少,尽管人工智能在现实世界中的整合更有可能涉及辅助而不是自主。
{"title":"Comparing artificial intelligence and healthcare professional performance in surgical and interventional video analysis: a systematic review and meta-analysis","authors":"Amir Rafati Fard, Simon C. Williams, Kieran J. Smith, Jasneet K. Dhaliwal, Tomas Ferreira, Adrito Das, Joachim Starup-Hansen, John G. Hanrahan, Chan Hee Koh, Danyal Z. Khan, Danail Stoyanov, Hani J. Marcus","doi":"10.1038/s41746-026-02401-2","DOIUrl":"https://doi.org/10.1038/s41746-026-02401-2","url":null,"abstract":"This systematic review and meta-analysis examines the design of studies comparing the performance of artificial intelligence (AI) with that of healthcare professionals in the analysis of videos from surgical and interventional procedures, and quantitatively evaluates the performance of AI, unassisted healthcare professionals, and AI-assisted healthcare professionals. From the 37,956 studies identified, 146 were included, with 76 providing sufficient information for inclusion in our exploratory meta-analysis. AI had significantly greater sensitivity and comparable specificity compared to unassisted healthcare professionals at their respective peak performance levels, with a relative risk of 1.12 (95% CI 1.07–1.19, p < 0.001) and 1.04 (95% CI 0.98–1.10, p = 0.224), respectively. AI-assisted healthcare professionals had significantly greater sensitivity and specificity compared to unassisted healthcare professionals across all levels of expertise, with a relative risk of 1.18 (95% CI 1.12–1.25, p < 0.001) and 1.05 (95% CI 1.02–1.08, p < 0.001), respectively. There was no significant difference in sensitivity and specificity of AI-assisted expert healthcare professionals versus AI, with a relative risk of 0.99 (95% CI 0.95–1.04, p = 0.787) and 1.03 (95% CI 0.97–1.08, p = 0.395), respectively. Whilst most studies to date have evaluated AI head-to-head against unassisted healthcare professionals, fewer studies examined AI as an assistive tool, despite the real-world integration of AI more likely to involve assistance than autonomy.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"49 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147350614","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
RoentMod: a synthetic chest X-ray modification model to identify and correct image interpretation model shortcuts. RoentMod:一种合成胸部x线修改模型,用于识别和纠正图像解释模型的快捷方式。
IF 15.1 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-06 DOI: 10.1038/s41746-026-02497-6
Lauren H Cooke, Matthias Jung, Jan M Brendel, Nora M Kerkovits, Borek Foldyna, Michael T Lu, Vineet K Raghu

Chest radiographs (CXRs) are among the most common tests in medicine; automated interpretation may reduce radiologists' workload and expand access. Deep learning multi-task and foundation models have shown strong CXR interpretation performance but are vulnerable to shortcut learning, where spurious correlations drive decision-making. We introduce RoentMod, a counterfactual image editing framework that generates realistic CXRs with user-specified and synthetic pathology while maintaining the original anatomical features. RoentMod combines an open-source medical image generator (RoentGen) with an image-to-image modification model without retraining. In reader studies of RoentMod-produced images, 93% appeared realistic, 89-99% correctly incorporated the specified finding, and all preserved native anatomy comparable to real follow-up CXRs. Using RoentMod, we demonstrate that state-of-the-art multi-task and foundation models frequently exploit off-target pathology as shortcuts, limiting their specificity. Incorporating RoentMod-generated counterfactual images during training mitigated this vulnerability, improving model discrimination across multiple pathologies by 3-19% AUC in internal validation and by 1-11% for 5 out of 6 tested pathologies in external testing. These findings establish RoentMod as a tool to probe and correct shortcut learning in medical AI. By enabling controlled counterfactual interventions, RoentMod enhances the robustness and interpretability of CXR interpretation models and provides a strategy to improve medical imaging models.

胸部x光片(cxr)是医学中最常见的检查之一;自动解释可能会减少放射科医生的工作量并扩大访问范围。深度学习多任务和基础模型显示出强大的CXR解释性能,但容易受到捷径学习的影响,其中虚假相关性驱动决策。我们介绍了RoentMod,一个反事实图像编辑框架,生成具有用户指定和合成病理的逼真cxr,同时保持原始解剖特征。RoentMod将开源医学图像生成器(RoentGen)与无需重新训练的图像到图像修改模型相结合。在roentmod生成的图像的读者研究中,93%的图像是真实的,89-99%的图像正确地包含了指定的发现,所有保存的原始解剖结构与真实的后续cxr相当。使用RoentMod,我们证明了最先进的多任务和基础模型经常利用脱靶病理作为捷径,限制了它们的特异性。在训练过程中结合roentmod生成的反事实图像可以缓解这一漏洞,在内部验证中将多种病理的模型识别率提高3-19%,在外部测试中将6种被测试病理中的5种提高1-11%。这些发现奠定了RoentMod作为探索和纠正医疗人工智能中捷径学习的工具的地位。通过实现受控的反事实干预,RoentMod增强了CXR解释模型的鲁棒性和可解释性,并提供了改进医学成像模型的策略。
{"title":"RoentMod: a synthetic chest X-ray modification model to identify and correct image interpretation model shortcuts.","authors":"Lauren H Cooke, Matthias Jung, Jan M Brendel, Nora M Kerkovits, Borek Foldyna, Michael T Lu, Vineet K Raghu","doi":"10.1038/s41746-026-02497-6","DOIUrl":"https://doi.org/10.1038/s41746-026-02497-6","url":null,"abstract":"<p><p>Chest radiographs (CXRs) are among the most common tests in medicine; automated interpretation may reduce radiologists' workload and expand access. Deep learning multi-task and foundation models have shown strong CXR interpretation performance but are vulnerable to shortcut learning, where spurious correlations drive decision-making. We introduce RoentMod, a counterfactual image editing framework that generates realistic CXRs with user-specified and synthetic pathology while maintaining the original anatomical features. RoentMod combines an open-source medical image generator (RoentGen) with an image-to-image modification model without retraining. In reader studies of RoentMod-produced images, 93% appeared realistic, 89-99% correctly incorporated the specified finding, and all preserved native anatomy comparable to real follow-up CXRs. Using RoentMod, we demonstrate that state-of-the-art multi-task and foundation models frequently exploit off-target pathology as shortcuts, limiting their specificity. Incorporating RoentMod-generated counterfactual images during training mitigated this vulnerability, improving model discrimination across multiple pathologies by 3-19% AUC in internal validation and by 1-11% for 5 out of 6 tested pathologies in external testing. These findings establish RoentMod as a tool to probe and correct shortcut learning in medical AI. By enabling controlled counterfactual interventions, RoentMod enhances the robustness and interpretability of CXR interpretation models and provides a strategy to improve medical imaging models.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":" ","pages":""},"PeriodicalIF":15.1,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147369864","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
3D Spatiotemporal cardiac reconstruction for predicting MACE in acute myocardial infarction. 三维时空心脏重建预测急性心肌梗死MACE。
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-06 DOI: 10.1038/s41746-026-02449-0
Qiang Gao,Jingping Wu,Yingshuang Gao,Yongyong Ren,Xiaolei Wang,Guojun Zhu,Jinyi Xiang,Dongaolei An,Lei Xu,Yan Zhou,Jun Pu,Dan Mu,Lei Zhao,Hui Lu,Lian-Ming Wu
Artificial intelligence has made significant strides in predicting major adverse cardiovascular events (MACE) in patients with acute myocardial infarction (AMI) following percutaneous coronary intervention. However, most existing methods rely solely on tabular variables derived from clinical data and cardiac magnetic resonance (CMR), without fully leveraging the predictive potential of the CMR imaging modality itself. Moreover, these approaches often overlook the synergistic benefits of multimodal integration between imaging and tabular data. In addition, current models primarily focus on short-term MACE risk assessment (e.g., within 6 months or 1 year), limiting their applicability for long-term prognostication. To address these limitations, we first developed ReconSeg3D, a model that reconstructs short-axis cine CMR stacks into temporally-resolved 3D bi-ventricular volumes, capturing fine-grained cardiac anatomy and dynamic motion. These bi-ventricular sequences were then integrated with 45 clinical and CMR-derived variables using spatiotemporal decomposition and cross-attention mechanisms to construct a multimodal MACE prediction model-HeartTTable. HeartTTable achieved a 5-year time-dependent AUC of 0.934 (95% CI 0.907-0.959) and a Harrell's C-index of 0.897 for predicting MACE risk, significantly outperforming models based solely on clinical and CMR-derived tabular features, and demonstrated strong capabilities in postoperative risk stratification. Our study contributes to improved long-term postoperative management for AMI patients by offering clinicians an objective, data-driven decision-support tool.
人工智能在预测急性心肌梗死(AMI)患者经皮冠状动脉介入治疗后的主要不良心血管事件(MACE)方面取得了重大进展。然而,大多数现有方法仅依赖于从临床数据和心脏磁共振(CMR)得出的表格变量,而没有充分利用CMR成像模式本身的预测潜力。此外,这些方法往往忽略了影像和表格数据之间多模式整合的协同效益。此外,目前的模型主要侧重于短期MACE风险评估(例如6个月或1年内),限制了它们对长期预测的适用性。为了解决这些限制,我们首先开发了ReconSeg3D,这是一种将短轴CMR堆栈重建为临时分辨的3D双心室体积的模型,可以捕获细粒度的心脏解剖和动态运动。然后利用时空分解和交叉注意机制,将这些双心室序列与45个临床和cmr衍生变量整合,构建多模态MACE预测模型- hearttable。在预测MACE风险方面,hearttable的5年时间依赖AUC为0.934 (95% CI 0.907-0.959), Harrell's C-index为0.897,明显优于单纯基于临床和cmr衍生的表格特征的模型,在术后风险分层方面表现出强大的能力。我们的研究为临床医生提供了一个客观的、数据驱动的决策支持工具,有助于改善AMI患者的长期术后管理。
{"title":"3D Spatiotemporal cardiac reconstruction for predicting MACE in acute myocardial infarction.","authors":"Qiang Gao,Jingping Wu,Yingshuang Gao,Yongyong Ren,Xiaolei Wang,Guojun Zhu,Jinyi Xiang,Dongaolei An,Lei Xu,Yan Zhou,Jun Pu,Dan Mu,Lei Zhao,Hui Lu,Lian-Ming Wu","doi":"10.1038/s41746-026-02449-0","DOIUrl":"https://doi.org/10.1038/s41746-026-02449-0","url":null,"abstract":"Artificial intelligence has made significant strides in predicting major adverse cardiovascular events (MACE) in patients with acute myocardial infarction (AMI) following percutaneous coronary intervention. However, most existing methods rely solely on tabular variables derived from clinical data and cardiac magnetic resonance (CMR), without fully leveraging the predictive potential of the CMR imaging modality itself. Moreover, these approaches often overlook the synergistic benefits of multimodal integration between imaging and tabular data. In addition, current models primarily focus on short-term MACE risk assessment (e.g., within 6 months or 1 year), limiting their applicability for long-term prognostication. To address these limitations, we first developed ReconSeg3D, a model that reconstructs short-axis cine CMR stacks into temporally-resolved 3D bi-ventricular volumes, capturing fine-grained cardiac anatomy and dynamic motion. These bi-ventricular sequences were then integrated with 45 clinical and CMR-derived variables using spatiotemporal decomposition and cross-attention mechanisms to construct a multimodal MACE prediction model-HeartTTable. HeartTTable achieved a 5-year time-dependent AUC of 0.934 (95% CI 0.907-0.959) and a Harrell's C-index of 0.897 for predicting MACE risk, significantly outperforming models based solely on clinical and CMR-derived tabular features, and demonstrated strong capabilities in postoperative risk stratification. Our study contributes to improved long-term postoperative management for AMI patients by offering clinicians an objective, data-driven decision-support tool.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"19 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147368239","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
MS360°: a conceptual digital-first, data-driven hybrid care framework for personalised multiple sclerosis management. MS360°:一个概念上的数字优先,数据驱动的个性化多发性硬化症管理混合护理框架。
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-06 DOI: 10.1038/s41746-026-02461-4
Isabel Voigt,Lars Masanneck,Marc Pawlitzki,Hernan Inojosa,Sven G Meuth,Tjalf Ziemssen
This perspective introduces MS360°, a conceptual hybrid care model for the management of multiple sclerosis (MS). It integrates traditional on-site assessments with digital health technologies (DHT) to enable more continuous, personalised, and proactive disease management. Current MS care is often fragmented, limiting timely interventions and patient engagement. MS360° addresses these challenges by introducing a digital-first hybrid framework for continuous data collection through remote monitoring, wearable sensors, and telemedicine. This data can be used to dynamically steer structured patient pathways and trigger targeted on-site assessments and interventions such as neurological examinations, imaging, laboratory assessments, and standardised functional tests based on predefined thresholds and patient profiles. The interaction of multidisciplinary teams, structured care pathways and bidirectional data flow enables timely clinical decision-making, stratified patient management and early detection of disease progression. Digital tools can further enhance patient engagement and lifestyle management, promoting adherence and outcomes. New technologies, including artificial intelligence and digital twins, are being discussed as potential future extensions for precision care, workflow optimisation, and risk prediction. MS360° provides a quality-driven conceptual framework, offering a roadmap for integrating digital innovations into patient-centred MS care.
这一观点介绍了MS360°,一个概念混合护理模式的管理多发性硬化症(MS)。它将传统的现场评估与数字卫生技术(DHT)相结合,以实现更持续、个性化和主动的疾病管理。目前的多发性硬化症治疗往往是分散的,限制了及时的干预和患者的参与。MS360°通过引入数字优先的混合框架,通过远程监控、可穿戴传感器和远程医疗进行连续数据收集,从而解决了这些挑战。这些数据可用于动态引导结构化的患者路径,并触发有针对性的现场评估和干预措施,如神经学检查、成像、实验室评估和基于预定义阈值和患者概况的标准化功能测试。多学科团队的相互作用、结构化的护理路径和双向数据流使及时的临床决策、分层的患者管理和早期发现疾病进展成为可能。数字工具可以进一步加强患者参与和生活方式管理,促进依从性和结果。包括人工智能和数字孪生在内的新技术正在被讨论,作为未来精准护理、工作流程优化和风险预测的潜在扩展。MS360°提供了一个质量驱动的概念框架,为将数字创新整合到以患者为中心的MS护理中提供了路线图。
{"title":"MS360°: a conceptual digital-first, data-driven hybrid care framework for personalised multiple sclerosis management.","authors":"Isabel Voigt,Lars Masanneck,Marc Pawlitzki,Hernan Inojosa,Sven G Meuth,Tjalf Ziemssen","doi":"10.1038/s41746-026-02461-4","DOIUrl":"https://doi.org/10.1038/s41746-026-02461-4","url":null,"abstract":"This perspective introduces MS360°, a conceptual hybrid care model for the management of multiple sclerosis (MS). It integrates traditional on-site assessments with digital health technologies (DHT) to enable more continuous, personalised, and proactive disease management. Current MS care is often fragmented, limiting timely interventions and patient engagement. MS360° addresses these challenges by introducing a digital-first hybrid framework for continuous data collection through remote monitoring, wearable sensors, and telemedicine. This data can be used to dynamically steer structured patient pathways and trigger targeted on-site assessments and interventions such as neurological examinations, imaging, laboratory assessments, and standardised functional tests based on predefined thresholds and patient profiles. The interaction of multidisciplinary teams, structured care pathways and bidirectional data flow enables timely clinical decision-making, stratified patient management and early detection of disease progression. Digital tools can further enhance patient engagement and lifestyle management, promoting adherence and outcomes. New technologies, including artificial intelligence and digital twins, are being discussed as potential future extensions for precision care, workflow optimisation, and risk prediction. MS360° provides a quality-driven conceptual framework, offering a roadmap for integrating digital innovations into patient-centred MS care.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"1 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147359263","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Evaluating a digital decision aid for atrial fibrillation rhythm control in a hybrid implementation-effectiveness trial. 在一项混合实施-有效性试验中评估心房颤动节律控制的数字决策辅助。
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-06 DOI: 10.1038/s41746-026-02405-y
Meghan Reading Turchioe,Afra Shamnath,David Slotwiner,Yihong Zhao,Deepak Saluja,Seth Goldbarg,JoonHyuk Kim,Paul Varosy,Angelo Biviano
Digital decision aids significantly improve shared decision-making outcomes, but barriers to implementation in clinical settings remain. We conducted a Hybrid Type 2 implementation-effectiveness trial of an atrial fibrillation rhythm control decision aid (clinicaltrials.gov NCT04993807; registered 08/06/2021) among 75 older adults across two sites. Guided by the RE-AIM framework, we assessed decision quality and implementation outcomes. While the decision aid was highly acceptable and broadly adopted, changes in decisional conflict and self-efficacy varied widely, with no significant average improvement across the cohort. Subgroup and qualitative analyses revealed that the decision aid was most effective when delivered to the right patient, at the right time, and in the right clinical context. Barriers included variability in health literacy, digital access, and timing of delivery relative to the clinical decision-making process. Findings underscore the challenges of deploying digital interventions within real-world workflows and highlight the importance of targeting decision support tools based on patient readiness, literacy, and care context.
数字化决策有助于显著改善共享决策结果,但在临床环境中实施的障碍仍然存在。我们在两个地点的75名老年人中进行了一项房颤节律控制决策辅助的混合型实施有效性试验(clinicaltrials.gov NCT04993807;注册日期为2021年6月8日)。在RE-AIM框架的指导下,我们评估了决策质量和实施结果。虽然决策辅助被高度接受和广泛采用,但决策冲突和自我效能的变化差异很大,在整个队列中没有显着的平均改善。亚组分析和定性分析显示,当在正确的时间和正确的临床环境中向正确的患者提供决策辅助时,决策辅助是最有效的。障碍包括卫生知识的可变性、数字获取以及相对于临床决策过程的交付时间。研究结果强调了在现实工作流程中部署数字干预措施的挑战,并强调了基于患者准备情况、素养和护理环境的目标决策支持工具的重要性。
{"title":"Evaluating a digital decision aid for atrial fibrillation rhythm control in a hybrid implementation-effectiveness trial.","authors":"Meghan Reading Turchioe,Afra Shamnath,David Slotwiner,Yihong Zhao,Deepak Saluja,Seth Goldbarg,JoonHyuk Kim,Paul Varosy,Angelo Biviano","doi":"10.1038/s41746-026-02405-y","DOIUrl":"https://doi.org/10.1038/s41746-026-02405-y","url":null,"abstract":"Digital decision aids significantly improve shared decision-making outcomes, but barriers to implementation in clinical settings remain. We conducted a Hybrid Type 2 implementation-effectiveness trial of an atrial fibrillation rhythm control decision aid (clinicaltrials.gov NCT04993807; registered 08/06/2021) among 75 older adults across two sites. Guided by the RE-AIM framework, we assessed decision quality and implementation outcomes. While the decision aid was highly acceptable and broadly adopted, changes in decisional conflict and self-efficacy varied widely, with no significant average improvement across the cohort. Subgroup and qualitative analyses revealed that the decision aid was most effective when delivered to the right patient, at the right time, and in the right clinical context. Barriers included variability in health literacy, digital access, and timing of delivery relative to the clinical decision-making process. Findings underscore the challenges of deploying digital interventions within real-world workflows and highlight the importance of targeting decision support tools based on patient readiness, literacy, and care context.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"225 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147368378","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Clarifying validation terminologies in healthcare 澄清医疗保健中的验证术语
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-05 DOI: 10.1038/s41746-026-02471-2
Amanda Dy, Sandra M. Buetow, Andrew J. Bredemeyer, Monika Lamba Saini, Fabienne Lucas, Shannon Bennett, Kim R. M. Blenman, Keith Wharton Jr., Sunil Singhal, M. E. de Baca, Kevin Schap, Matthew G. Hanna, Staci J. Kearney, Norman Zerbe, Roberto Salgado, Jithesh Veetil, Jansen N. Seheult, David S. McClintock, April Khademi, Jochen K. Lennerz
Validation is a cornerstone of reliability and trust in diagnostics, yet discipline-specific assumptions and unspoken contextual differences often lead to miscommunication, misalignment, and avoidable delays. As AI/ML becomes more integrated into healthcare, there is a growing necessity to re-examine how the term validation is used and understood. We highlight inconsistencies in the use of the term validation through an analysis of 94 themes across five domains, including Communication Science (n = 12), AI/ML (n = 26), Clinical and Laboratory Practice (n = 19), Regulatory Science (n = 22), and Business (n = 15). We emphasize how persistent reliance on domain-specific implied definitions impedes interdisciplinary alignment. Rather than advocating for a single definition, we derived five consensus proposals that collectively advocate for more specific and context-aware additions to the term validation to support clarity, reliability, and compliance across disciplines. Our goal is to support clearer communication and provide useful strategies that inform the development, regulation, and use of digital health technologies.
验证是诊断可靠性和信任的基石,但是特定学科的假设和未说出来的上下文差异经常导致错误沟通、不一致和可避免的延迟。随着AI/ML越来越多地集成到医疗保健中,越来越有必要重新审视术语验证的使用和理解方式。通过对五个领域94个主题的分析,我们强调了术语验证使用的不一致性,包括通信科学(n = 12)、人工智能/机器学习(n = 26)、临床和实验室实践(n = 19)、监管科学(n = 22)和商业(n = 15)。我们强调持久依赖于特定领域的隐含定义如何阻碍跨学科的协调。我们没有提倡单一的定义,而是得出了五个一致的建议,它们共同提倡对术语验证进行更具体和上下文敏感的添加,以支持跨学科的清晰度、可靠性和遵从性。我们的目标是支持更清晰的沟通,并提供有用的战略,为数字卫生技术的开发、管理和使用提供信息。
{"title":"Clarifying validation terminologies in healthcare","authors":"Amanda Dy, Sandra M. Buetow, Andrew J. Bredemeyer, Monika Lamba Saini, Fabienne Lucas, Shannon Bennett, Kim R. M. Blenman, Keith Wharton Jr., Sunil Singhal, M. E. de Baca, Kevin Schap, Matthew G. Hanna, Staci J. Kearney, Norman Zerbe, Roberto Salgado, Jithesh Veetil, Jansen N. Seheult, David S. McClintock, April Khademi, Jochen K. Lennerz","doi":"10.1038/s41746-026-02471-2","DOIUrl":"https://doi.org/10.1038/s41746-026-02471-2","url":null,"abstract":"Validation is a cornerstone of reliability and trust in diagnostics, yet discipline-specific assumptions and unspoken contextual differences often lead to miscommunication, misalignment, and avoidable delays. As AI/ML becomes more integrated into healthcare, there is a growing necessity to re-examine how the term validation is used and understood. We highlight inconsistencies in the use of the term validation through an analysis of 94 themes across five domains, including Communication Science (n = 12), AI/ML (n = 26), Clinical and Laboratory Practice (n = 19), Regulatory Science (n = 22), and Business (n = 15). We emphasize how persistent reliance on domain-specific implied definitions impedes interdisciplinary alignment. Rather than advocating for a single definition, we derived five consensus proposals that collectively advocate for more specific and context-aware additions to the term validation to support clarity, reliability, and compliance across disciplines. Our goal is to support clearer communication and provide useful strategies that inform the development, regulation, and use of digital health technologies.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"61 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147350655","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Automated detection of new cerebral infarctions and prognostic implications using deep learning on serial MRI 在连续MRI上使用深度学习自动检测新的脑梗死和预后意义
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-05 DOI: 10.1038/s41746-026-02511-x
Hwan-ho Cho, Joonwon Lee, Jeonghoon Bae, Dongwhane Lee, Hyung Chan Kim, Suk Yoon Lee, Jung Hwa Seo, Woo-Keun Seo, Jin-Man Jung, Hyunjin Park, Seongho Park
We developed and externally validated a deep learning model to automatically detect new ischemic lesions on serial FLAIR MRI scans in patients with stroke. Manual interpretation of follow-up imaging is labor-intensive and variable, and silent brain infarctions (SBIs) are frequently missed despite their prognostic importance. Using 25,451 paired slices from 1055 patients across two hospitals, we trained a convolutional neural network with supervised contrastive learning to classify new lesion occurrence. The model achieved an area under the receiver operating characteristic curve of 0.89 in both internal and external validation cohorts. To evaluate clinical relevance, we further analyzed an independent asymptomatic cohort of 307 patients with a median follow-up of two years. Patients classified as SBI-positive by the model showed a significantly higher risk of subsequent symptomatic stroke than those without SBI. In multivariable Cox regression adjusted for age and major vascular risk factors, model-positive patients had a 3.8-fold increased risk of stroke recurrence. These findings indicate that AI can identify clinically meaningful SBIs that are under-recognized in routine practice and independently associated with stroke recurrence. Automated lesion detection may provide a reproducible imaging biomarker for risk stratification, supporting standardized interpretation of follow-up MRI and informing secondary stroke prevention strategies.
我们开发并外部验证了一个深度学习模型,该模型可以在中风患者的连续FLAIR MRI扫描中自动检测新的缺血性病变。随访影像的人工解释是劳动密集型和可变的,尽管无症状脑梗死(sbi)具有预后重要性,但经常被遗漏。使用来自两家医院1055名患者的25,451对切片,我们训练了一个带有监督对比学习的卷积神经网络,以分类新病变的发生。在内部和外部验证队列中,该模型的受试者工作特征曲线下面积均为0.89。为了评估临床相关性,我们进一步分析了307例无症状患者的独立队列,中位随访时间为两年。经模型分类为SBI阳性的患者出现后续症状性卒中的风险明显高于无SBI的患者。在校正年龄和主要血管危险因素的多变量Cox回归中,模型阳性患者卒中复发风险增加3.8倍。这些研究结果表明,人工智能可以识别出在常规实践中未被充分认识且与卒中复发独立相关的有临床意义的sbi。自动病变检测可以为风险分层提供可重复的成像生物标志物,支持后续MRI的标准化解释,并为二级卒中预防策略提供信息。
{"title":"Automated detection of new cerebral infarctions and prognostic implications using deep learning on serial MRI","authors":"Hwan-ho Cho, Joonwon Lee, Jeonghoon Bae, Dongwhane Lee, Hyung Chan Kim, Suk Yoon Lee, Jung Hwa Seo, Woo-Keun Seo, Jin-Man Jung, Hyunjin Park, Seongho Park","doi":"10.1038/s41746-026-02511-x","DOIUrl":"https://doi.org/10.1038/s41746-026-02511-x","url":null,"abstract":"We developed and externally validated a deep learning model to automatically detect new ischemic lesions on serial FLAIR MRI scans in patients with stroke. Manual interpretation of follow-up imaging is labor-intensive and variable, and silent brain infarctions (SBIs) are frequently missed despite their prognostic importance. Using 25,451 paired slices from 1055 patients across two hospitals, we trained a convolutional neural network with supervised contrastive learning to classify new lesion occurrence. The model achieved an area under the receiver operating characteristic curve of 0.89 in both internal and external validation cohorts. To evaluate clinical relevance, we further analyzed an independent asymptomatic cohort of 307 patients with a median follow-up of two years. Patients classified as SBI-positive by the model showed a significantly higher risk of subsequent symptomatic stroke than those without SBI. In multivariable Cox regression adjusted for age and major vascular risk factors, model-positive patients had a 3.8-fold increased risk of stroke recurrence. These findings indicate that AI can identify clinically meaningful SBIs that are under-recognized in routine practice and independently associated with stroke recurrence. Automated lesion detection may provide a reproducible imaging biomarker for risk stratification, supporting standardized interpretation of follow-up MRI and informing secondary stroke prevention strategies.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"27 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147350611","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Shifting the retinal foundation models paradigm from slices to volumes for optical coherence tomography 将视网膜基础模型范例从切片转移到光学相干断层扫描的体积
IF 15.2 1区 医学 Q1 HEALTH CARE SCIENCES & SERVICES Pub Date : 2026-03-05 DOI: 10.1038/s41746-026-02496-7
Raphael Judkiewicz, Eran Berkowitz, Meishar Meisel, Tomer Michaeli, Joachim A. Behar
Optical Coherence Tomography (OCT) is essential in ophthalmology for cross-sectional imaging of the retina. Pretrained foundation models facilitate task-specific model development by enabling fine-tuning with limited labeled data. However, current foundation models rely on a single B-scan (usually the central slice), overlooking volumetric context. This research investigates video foundation models to capture full 3D retinal structure and improve diagnostic performance. V-JEPA, a state-of-the-art video foundation model, was benchmarked against retinal foundation models (RETFound, VisionFM) and a natural image foundation model (DINOv2). All were fine-tuned to detect Age-related Macular Degeneration or Glaucomatous Optic Neuropathy using five OCT datasets. V-JEPA consistently equaled or outperformed image-based models, achieving an average AUROC of 0.94 (0.80–0.99), versus 0.90 (0.76–0.98) for the best image model, a statistically significant improvement (p < 0.001). To our knowledge, this is the first application of transformer-based video models to volumetric OCT, highlighting their promise in 3D medical imaging.
光学相干断层扫描(OCT)是必不可少的眼科视网膜的横断成像。预训练的基础模型通过对有限的标记数据进行微调来促进特定于任务的模型开发。然而,目前的基础模型依赖于单个b扫描(通常是中心切片),忽略了体积背景。本研究研究视频基础模型,以捕捉全3D视网膜结构和提高诊断性能。V-JEPA是一种最先进的视频基础模型,与视网膜基础模型(RETFound, VisionFM)和自然图像基础模型(DINOv2)进行基准测试。使用5个OCT数据集对所有数据进行微调以检测年龄相关性黄斑变性或青光眼视神经病变。V-JEPA的平均AUROC为0.94(0.80-0.99),而最佳图像模型的平均AUROC为0.90(0.76-0.98),具有统计学上的显著提高(p < 0.001)。据我们所知,这是基于变压器的视频模型首次应用于体积OCT,突出了它们在3D医学成像中的前景。
{"title":"Shifting the retinal foundation models paradigm from slices to volumes for optical coherence tomography","authors":"Raphael Judkiewicz, Eran Berkowitz, Meishar Meisel, Tomer Michaeli, Joachim A. Behar","doi":"10.1038/s41746-026-02496-7","DOIUrl":"https://doi.org/10.1038/s41746-026-02496-7","url":null,"abstract":"Optical Coherence Tomography (OCT) is essential in ophthalmology for cross-sectional imaging of the retina. Pretrained foundation models facilitate task-specific model development by enabling fine-tuning with limited labeled data. However, current foundation models rely on a single B-scan (usually the central slice), overlooking volumetric context. This research investigates video foundation models to capture full 3D retinal structure and improve diagnostic performance. V-JEPA, a state-of-the-art video foundation model, was benchmarked against retinal foundation models (RETFound, VisionFM) and a natural image foundation model (DINOv2). All were fine-tuned to detect Age-related Macular Degeneration or Glaucomatous Optic Neuropathy using five OCT datasets. V-JEPA consistently equaled or outperformed image-based models, achieving an average AUROC of 0.94 (0.80–0.99), versus 0.90 (0.76–0.98) for the best image model, a statistically significant improvement (p < 0.001). To our knowledge, this is the first application of transformer-based video models to volumetric OCT, highlighting their promise in 3D medical imaging.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"130 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-03-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147346791","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
NPJ Digital Medicine
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1