NPJ Digital Medicine最新文献

英文中文

Neck-to-knee dixon MRI thigh volume as a superior mass biomarker for Sarcopenia: evidence from the UK biobank. 颈部到膝盖的dixon MRI大腿体积作为肌肉减少症的优越肿块生物标志物：来自英国生物银行的证据。

IF 15.1 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-05 DOI: 10.1038/s41746-026-02379-x

Hyeon Su Kim, Hyunwoo Park, Junseok Kang, Hyunbin Kim, Bonsang Gu, Bhola Shivam, Jun-Il Yoo

Sarcopenia assessment requires biomarkers capturing muscle-specific strength beyond single-slice measurements. We developed an automated MRI framework segmenting 27 pelvic-thigh musculoskeletal structures to investigate muscle distribution as functional biomarkers. Among 37,004 UK Biobank participants (64.5 ± 7.9 years), transformer-based segmentation achieved Dice similarity coefficient of 0.896. Dixon MRI-derived thigh muscle volume showed exceptional DEXA concordance (r = 0.936). Posterior/anterior (P/A) muscle ratio independently predicted adverse outcomes: weak grip strength (OR 1.60, 95%CI 1.45-1.77), sarcopenia (OR 1.42, 95%CI 1.13-1.78), mortality (OR 1.49, 95%CI 1.23-1.81), and falls (OR 1.12, 95%CI 1.05-1.20), all p < 0.005, while left/right asymmetry showed no associations. Automated MRI phenotyping reveals muscle distribution patterns, particularly reduced anterior compartment volume, predict functional decline independent of total muscle mass, supporting evolution toward composition-aware sarcopenia criteria.

肌肉减少症的评估需要生物标志物捕捉肌肉特异性强度，而不是单片测量。我们开发了一个自动MRI框架，分割27个骨盆-大腿肌肉骨骼结构，以研究肌肉分布作为功能生物标志物。在37,004名UK Biobank参与者（64.5±7.9岁）中，基于变压器的分割获得了0.896的Dice相似系数。Dixon mri衍生的大腿肌肉体积显示异常的DEXA一致性（r = 0.936）。后/前（P/A）肌比独立预测不良结局：握力弱（OR 1.60, 95%CI 1.45-1.77）、肌肉减少（OR 1.42, 95%CI 1.13-1.78）、死亡率（OR 1.49, 95%CI 1.23-1.81）和跌倒（OR 1.12, 95%CI 1.05-1.20），均为P

引用次数: 0

Scaling medical device regulatory science using large language models. 使用大型语言模型扩展医疗器械监管科学。

IF 15.1 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-05 DOI: 10.1038/s41746-026-02353-7

Hanyang Li, Xiao He, Adarsh Subbaswamy, Patrick Vossler, Alexej Gossmann, Karandeep Singh, Jean Feng

Advances in artificial intelligence (AI) and machine learning (ML) have led to a surge in AI/ML-enabled medical devices, posing new challenges for regulators because best practices for developing, testing, and monitoring these devices are still emerging. Consequently, there is a critical need for up-to-date data analyses of the regulatory landscape to inform policy-making. However, such analyses have historically relied upon manual annotation efforts because regulatory documents are unstructured, complex, multi-modal, and filled with jargon. Efforts to automate annotation using simple natural language processing methods have achieved limited success, as they lack the reasoning needed to interpret regulatory materials. Recent progress in large language models (LLMs) presents an unprecedented opportunity to unlock information embedded in regulatory documents. This work conducts the first wide-ranging validation study of LLMs for scaling data analyses in the field of medical device regulatory science. Evaluating LLM outputs using expert manual annotations and "LLM-as-a-judge," we find that LLMs can accurately extract attributes spanning pre- and post-market settings, with accuracy rates often reaching 80% or higher. We then demonstrate how LLMs can scale up analyses in three applications: (1) monitoring device validation practices, (2) coding medical device reports, and (3) identifying potential risk factors for post-market adverse events.

人工智能（AI）和机器学习（ML）的进步导致支持AI/ML的医疗设备激增，这给监管机构带来了新的挑战，因为开发、测试和监控这些设备的最佳实践仍在不断涌现。因此，迫切需要对监管环境进行最新数据分析，以便为决策提供信息。然而，这种分析历来依赖于手工注释工作，因为规范性文档是非结构化的、复杂的、多模式的，并且充满了术语。使用简单的自然语言处理方法实现自动化注释的努力取得了有限的成功，因为它们缺乏解释法规材料所需的推理。大型语言模型（llm）的最新进展为解锁嵌入在监管文件中的信息提供了前所未有的机会。这项工作进行了第一次广泛的法学硕士验证研究，以扩大医疗器械监管科学领域的数据分析。使用专家手动注释和“LLM作为法官”评估LLM输出，我们发现LLM可以准确地提取跨越市场前和市场后设置的属性，准确率通常达到80%或更高。然后，我们展示了llm如何在三个应用中扩展分析：(1)监测设备验证实践，(2)编码医疗设备报告，以及(3)识别上市后不良事件的潜在风险因素。

{"title":"Scaling medical device regulatory science using large language models.","authors":"Hanyang Li, Xiao He, Adarsh Subbaswamy, Patrick Vossler, Alexej Gossmann, Karandeep Singh, Jean Feng","doi":"10.1038/s41746-026-02353-7","DOIUrl":"https://doi.org/10.1038/s41746-026-02353-7","url":null,"abstract":"<p><p>Advances in artificial intelligence (AI) and machine learning (ML) have led to a surge in AI/ML-enabled medical devices, posing new challenges for regulators because best practices for developing, testing, and monitoring these devices are still emerging. Consequently, there is a critical need for up-to-date data analyses of the regulatory landscape to inform policy-making. However, such analyses have historically relied upon manual annotation efforts because regulatory documents are unstructured, complex, multi-modal, and filled with jargon. Efforts to automate annotation using simple natural language processing methods have achieved limited success, as they lack the reasoning needed to interpret regulatory materials. Recent progress in large language models (LLMs) presents an unprecedented opportunity to unlock information embedded in regulatory documents. This work conducts the first wide-ranging validation study of LLMs for scaling data analyses in the field of medical device regulatory science. Evaluating LLM outputs using expert manual annotations and \"LLM-as-a-judge,\" we find that LLMs can accurately extract attributes spanning pre- and post-market settings, with accuracy rates often reaching 80% or higher. We then demonstrate how LLMs can scale up analyses in three applications: (1) monitoring device validation practices, (2) coding medical device reports, and (3) identifying potential risk factors for post-market adverse events.</p>","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":" ","pages":""},"PeriodicalIF":15.1,"publicationDate":"2026-02-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146126041","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Large-scale self-supervised video foundation model for intelligent surgery 智能手术的大规模自监督视频基础模型

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-04 DOI: 10.1038/s41746-026-02403-0

Shu Yang, Fengtao Zhou, Leon Mayer, Fuxiang Huang, Yiliang Chen, Yihui Wang, Sunan He, Yuxiang Nie, Xi Wang, Yueming Jin, Huihui Sun, Shuchang Xu, Alex Qinyang Liu, Zheng Li, Jing Qin, Jeremy YuenChun Teoh, Lena Maier-Hein, Hao Chen

引用次数: 0

Medicare advantage becoming a disadvantage with use of artificial intelligence in prior authorization review 在事先授权审查中使用人工智能，医疗保险优势成为劣势

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-04 DOI: 10.1038/s41746-026-02387-x

Sara Raza, Sara Gerke, Christina Silcox, Rachele Hendricks-Sturrup, Carmel Shachar

引用次数: 0

Text-image alignment for ILD imaging: linking CXR evidence to CT quantification ILD成像的文本-图像对齐：将CXR证据与CT量化联系起来

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-04 DOI: 10.1038/s41746-025-02292-9

Jiani Gao, Yijiu Ren, Fengjing Yang, Xuefei Hu, Changbo Sun, Sihua Wang, Chang Chen

引用次数: 0

StructSAM: structure-aware prompt adaptation for robust lung cancer lesion segmentation in CT StructSAM：结构感知的快速适应在CT中稳健的肺癌病变分割

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-03 DOI: 10.1038/s41746-025-02306-6

Mengjie Liu, Yuxin Yao, Jinyong Jia, Jiali Yao, Zhengze Huang, Ziyang Zeng, Guangjin Pu, Yan Wu, Yuqi Bai, Bin Wang, Lili Jiang

Accurate delineation of lung lesions in computed tomography (CT) scans is critical for diagnosis, staging, and treatment planning, yet remains a challenging task. While foundation models like the Segment Anything Model (SAM) excel in natural images, they often falter in medical imaging due to low contrast, ambiguous boundaries, and a lack of 3D context. To address these limitations, we propose StructSAM, a structure-aware prompt adaptation framework designed for robust volumetric segmentation, with a primary focus on lung cancer. StructSAM injects anatomical priors into the prompt pathway, employs a 3D inter-slice aggregator for volumetric consistency, and leverages PEFT for scalability. Experiments on the LIDC-IDRI dataset demonstrate that StructSAM achieves state-of-the-art accuracy on lung nodule segmentation, outperforming both classical architectures and SAM-based adaptations. Crucially, extended cross-organ evaluations on KiTS19 and MSD Pancreas datasets reveal that StructSAM effectively generalizes to other anatomical structures, highlighting its robustness to domain shifts. These findings suggest that embedding structural priors into foundation models is a promising strategy toward generic, clinically reliable, and efficient medical image segmentation.

在计算机断层扫描（CT）中准确描绘肺病变对诊断、分期和治疗计划至关重要，但仍然是一项具有挑战性的任务。虽然像分段任意模型（SAM）这样的基础模型在自然图像中表现出色，但由于对比度低、边界模糊和缺乏3D背景，它们在医学成像中往往表现不佳。为了解决这些限制，我们提出了StructSAM，这是一个结构感知的快速适应框架，专为鲁棒的体积分割而设计，主要关注肺癌。StructSAM在提示路径中注入解剖先验，采用3D片间聚合器实现体积一致性，并利用PEFT实现可扩展性。在LIDC-IDRI数据集上的实验表明，StructSAM在肺结节分割方面达到了最先进的精度，优于经典架构和基于sam的自适应。至关重要的是，对KiTS19和MSD胰腺数据集的跨器官扩展评估表明，StructSAM有效地推广到其他解剖结构，突出了其对结构域转移的鲁棒性。这些发现表明，在基础模型中嵌入结构先验是实现通用、临床可靠和高效的医学图像分割的一种很有前途的策略。

{"title":"StructSAM: structure-aware prompt adaptation for robust lung cancer lesion segmentation in CT","authors":"Mengjie Liu, Yuxin Yao, Jinyong Jia, Jiali Yao, Zhengze Huang, Ziyang Zeng, Guangjin Pu, Yan Wu, Yuqi Bai, Bin Wang, Lili Jiang","doi":"10.1038/s41746-025-02306-6","DOIUrl":"https://doi.org/10.1038/s41746-025-02306-6","url":null,"abstract":"Accurate delineation of lung lesions in computed tomography (CT) scans is critical for diagnosis, staging, and treatment planning, yet remains a challenging task. While foundation models like the Segment Anything Model (SAM) excel in natural images, they often falter in medical imaging due to low contrast, ambiguous boundaries, and a lack of 3D context. To address these limitations, we propose StructSAM, a structure-aware prompt adaptation framework designed for robust volumetric segmentation, with a primary focus on lung cancer. StructSAM injects anatomical priors into the prompt pathway, employs a 3D inter-slice aggregator for volumetric consistency, and leverages PEFT for scalability. Experiments on the LIDC-IDRI dataset demonstrate that StructSAM achieves state-of-the-art accuracy on lung nodule segmentation, outperforming both classical architectures and SAM-based adaptations. Crucially, extended cross-organ evaluations on KiTS19 and MSD Pancreas datasets reveal that StructSAM effectively generalizes to other anatomical structures, highlighting its robustness to domain shifts. These findings suggest that embedding structural priors into foundation models is a promising strategy toward generic, clinically reliable, and efficient medical image segmentation.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"253 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146102070","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Digital health interventions for perioperative patient-reported outcomes: a network meta-analysis 围手术期患者报告结果的数字健康干预：网络荟萃分析

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-03 DOI: 10.1038/s41746-026-02398-8

Ziyue Luo, Ruihao Zhou, Jingwen Wei, Kailei Nong, Xiran Peng, Lu Chen, Peiyi Li, Sisi Deng, Mengchan Ou, Ling Ye, Yaqiang Wang, Guo Chen, Xuechao Hao, Sheyu Li, Tao Zhu

Digital health interventions (DHIs), delivered via digital platforms such as internet-based programs, mobile applications or short messages, may improve patient-reported outcomes (PROs), but comparative effectiveness is unclear. We conducted a network meta-analysis of randomized controlled trials in adults undergoing elective surgery under general anesthesia, identified in PubMed, Embase, CENTRAL, and Web of Science to March 1, 2025. Standardized mean differences (SMDs), mean differences (MDs) with minimal important differences (MIDs), and 95% CIs were estimated. Risk of bias was assessed with RoB 2 and certainty of evidence with GRADE. Fifty-six trials (6,154 patients) were included. Extended reality (XR) most effectively reduced perioperative anxiety (SMD 0.60; 95% CI 0.37–0.84; MD 8.05; MID 6.71; moderate-certainty). For postoperative pain, mobile applications (SMD 0.64; 95% CI 0.32–0.95; MD 1.36; MID 1.0; moderate-certainty) and XR (SMD 0.51; 95% CI 0.26–0.76; MD 1.09; MID 1.0; moderate-certainty) were probably effective. For quality of life, 2D video yielded the greatest gain (SMD 0.99; 95% CI 0.11–1.88; MD 0.11; MID 0.05; high-certainty). XR also improved satisfaction (SMD 1.27; 95% CI 0.63–1.91; MD 1.91; MID 0.75; moderate-certainty). These findings suggest that DHIs may improve perioperative PROs.

通过基于互联网的项目、移动应用程序或短信等数字平台提供的数字健康干预措施（DHIs）可能会改善患者报告的结果（PROs），但相对有效性尚不清楚。我们对在全身麻醉下接受择期手术的成人随机对照试验进行了网络荟萃分析，这些试验已在PubMed、Embase、CENTRAL和Web of Science上确认至2025年3月1日。估计标准化平均差异（SMDs）、最小重要差异（MIDs）的平均差异（MDs）和95% ci。偏倚风险用RoB 2评估，证据确定性用GRADE评估。纳入56项试验（6154例患者）。扩展现实（XR）最有效地减少围手术期焦虑（SMD 0.60; 95% CI 0.37-0.84; MD 8.05; MID 6.71；中等确定性）。对于术后疼痛，移动应用（SMD 0.64; 95% CI 0.32-0.95; MD 1.36; MID 1.0；中等确定性）和XR （SMD 0.51; 95% CI 0.26-0.76; MD 1.09; MID 1.0；中等确定性）可能有效。对于生活质量，2D视频获得了最大的增益（SMD 0.99; 95% CI 0.11 - 1.88; MD 0.11; MID 0.05；高确定性）。XR也提高了满意度（SMD 1.27; 95% CI 0.63-1.91; MD 1.91; MID 0.75；中等确定性）。这些发现提示DHIs可以改善围手术期的PROs。

{"title":"Digital health interventions for perioperative patient-reported outcomes: a network meta-analysis","authors":"Ziyue Luo, Ruihao Zhou, Jingwen Wei, Kailei Nong, Xiran Peng, Lu Chen, Peiyi Li, Sisi Deng, Mengchan Ou, Ling Ye, Yaqiang Wang, Guo Chen, Xuechao Hao, Sheyu Li, Tao Zhu","doi":"10.1038/s41746-026-02398-8","DOIUrl":"https://doi.org/10.1038/s41746-026-02398-8","url":null,"abstract":"Digital health interventions (DHIs), delivered via digital platforms such as internet-based programs, mobile applications or short messages, may improve patient-reported outcomes (PROs), but comparative effectiveness is unclear. We conducted a network meta-analysis of randomized controlled trials in adults undergoing elective surgery under general anesthesia, identified in PubMed, Embase, CENTRAL, and Web of Science to March 1, 2025. Standardized mean differences (SMDs), mean differences (MDs) with minimal important differences (MIDs), and 95% CIs were estimated. Risk of bias was assessed with RoB 2 and certainty of evidence with GRADE. Fifty-six trials (6,154 patients) were included. Extended reality (XR) most effectively reduced perioperative anxiety (SMD 0.60; 95% CI 0.37–0.84; MD 8.05; MID 6.71; moderate-certainty). For postoperative pain, mobile applications (SMD 0.64; 95% CI 0.32–0.95; MD 1.36; MID 1.0; moderate-certainty) and XR (SMD 0.51; 95% CI 0.26–0.76; MD 1.09; MID 1.0; moderate-certainty) were probably effective. For quality of life, 2D video yielded the greatest gain (SMD 0.99; 95% CI 0.11–1.88; MD 0.11; MID 0.05; high-certainty). XR also improved satisfaction (SMD 1.27; 95% CI 0.63–1.91; MD 1.91; MID 0.75; moderate-certainty). These findings suggest that DHIs may improve perioperative PROs.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"1 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146102073","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Predicting adverse events for risk stratification of chemotherapy based stem cell mobilization in multiple myeloma 预测多发性骨髓瘤中基于干细胞动员的化疗风险分层的不良事件

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-03 DOI: 10.1038/s41746-026-02394-y

F. Schwarz, L. Levien, M. Maulhardt, G. Wulf, N. Brökers, E. Aydilek

Autologous stem-cell transplantation is a fundamental therapy for multiple myeloma. Although inpatient chemo-based stem-cell mobilization (SCM) is standard care in Germany, outpatient approaches could ease healthcare constraints. We analyzed 109 myeloma patients undergoing SCM and collection at the University Medical Center Göttingen for safety. We then trained machine learning models to predict adverse events (AEs) requiring hospitalization and to forecast AE onset timing for optimized ward management. In our cohort, 97% achieved successful collection, but 69% experienced severe AEs necessitating hospitalization. Simulations suggest a risk-stratified outpatient protocol could cut bed usage by at least one third without compromising safety. Classification models accurately predicted some AE types (e.g., elevated creatinine, ROC-AUC 1.0), though neutropenic fever remained challenging (ROC-AUC 0.67). Regression models forecast AE onset with a mean error of just over one day. These results outline a data-driven roadmap for safely adopting outpatient SCM and optimizing resource allocation in clinical practice.

自体干细胞移植是多发性骨髓瘤的基本治疗方法。虽然住院化疗干细胞动员（SCM）是德国的标准治疗，门诊方法可以缓解医疗限制。为了安全起见，我们分析了109名骨髓瘤患者在大学医学中心Göttingen接受SCM和收集。然后，我们训练机器学习模型来预测需要住院治疗的不良事件（AE），并预测AE的发作时间，以优化病房管理。在我们的队列中，97%成功收集，但69%发生严重ae，需要住院治疗。模拟表明，风险分层门诊方案可以在不影响安全性的情况下减少至少三分之一的床位使用。分类模型准确预测了一些AE类型（如肌酐升高，ROC-AUC 1.0），尽管中性粒细胞减少热仍然具有挑战性（ROC-AUC 0.67）。回归模型预测AE发作的平均误差刚刚超过一天。这些结果概述了一个数据驱动的路线图，以安全采用门诊SCM和优化临床实践中的资源分配。

{"title":"Predicting adverse events for risk stratification of chemotherapy based stem cell mobilization in multiple myeloma","authors":"F. Schwarz, L. Levien, M. Maulhardt, G. Wulf, N. Brökers, E. Aydilek","doi":"10.1038/s41746-026-02394-y","DOIUrl":"https://doi.org/10.1038/s41746-026-02394-y","url":null,"abstract":"Autologous stem-cell transplantation is a fundamental therapy for multiple myeloma. Although inpatient chemo-based stem-cell mobilization (SCM) is standard care in Germany, outpatient approaches could ease healthcare constraints. We analyzed 109 myeloma patients undergoing SCM and collection at the University Medical Center Göttingen for safety. We then trained machine learning models to predict adverse events (AEs) requiring hospitalization and to forecast AE onset timing for optimized ward management. In our cohort, 97% achieved successful collection, but 69% experienced severe AEs necessitating hospitalization. Simulations suggest a risk-stratified outpatient protocol could cut bed usage by at least one third without compromising safety. Classification models accurately predicted some AE types (e.g., elevated creatinine, ROC-AUC 1.0), though neutropenic fever remained challenging (ROC-AUC 0.67). Regression models forecast AE onset with a mean error of just over one day. These results outline a data-driven roadmap for safely adopting outpatient SCM and optimizing resource allocation in clinical practice.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"4 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146102069","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Towards accurate and interpretable competency-based assessment: enhancing clinical competency assessment through multimodal AI and anomaly detection 迈向准确和可解释的基于能力的评估：通过多模态人工智能和异常检测加强临床能力评估

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-03 DOI: 10.1038/s41746-025-02299-2

Sapir Gershov, Fadi Mahameed, Aeyal Raz, Shlomi Laufer

Artificial Intelligence (AI) is reshaping medical education, particularly in the domain of competency-based assessment, where current methods remain subjective and resource-intensive. We introduce a multimodal AI framework that integrates video, audio, and patient monitor data to provide objective and interpretable competency assessments. Using 90 anesthesia residents, we established “ideal” performance benchmarks and trained an anomaly detection model (MEMTO) to quantify deviations from these benchmarks. Competency scores derived from these deviations showed strong alignment with expert ratings (Spearman’s ρ = 0.78; ICC = 0.75) and demonstrated high ranking precision (Relative L2-distance = 0.12). SHAP analysis revealed that communication and eye contact with the patient monitor are key drivers of variability. By linking AI-assisted anomaly detection with interpretable feedback, our framework addresses critical challenges of fairness, reliability, and transparency in simulation-based education. This work provides actionable evidence for integrating AI into medical training and advancing scalable, equitable evaluation of competence.

人工智能（AI）正在重塑医学教育，特别是在基于能力的评估领域，目前的方法仍然是主观的和资源密集型的。我们引入了一个多模式人工智能框架，该框架集成了视频、音频和患者监护数据，以提供客观和可解释的能力评估。我们选取了90名麻醉住院医师，建立了“理想”性能基准，并训练了异常检测模型（MEMTO）来量化偏离这些基准的偏差。从这些偏差中得出的能力分数显示出与专家评级的强烈一致性（Spearman的ρ = 0.78; ICC = 0.75），并显示出较高的排名精度（相对l2 -距离= 0.12）。SHAP分析显示，与监护仪的沟通和眼神接触是导致变化的关键因素。通过将人工智能辅助异常检测与可解释的反馈联系起来，我们的框架解决了基于模拟的教育中公平性、可靠性和透明度的关键挑战。这项工作为将人工智能纳入医疗培训和推进可扩展的、公平的能力评估提供了可操作的证据。

{"title":"Towards accurate and interpretable competency-based assessment: enhancing clinical competency assessment through multimodal AI and anomaly detection","authors":"Sapir Gershov, Fadi Mahameed, Aeyal Raz, Shlomi Laufer","doi":"10.1038/s41746-025-02299-2","DOIUrl":"https://doi.org/10.1038/s41746-025-02299-2","url":null,"abstract":"Artificial Intelligence (AI) is reshaping medical education, particularly in the domain of competency-based assessment, where current methods remain subjective and resource-intensive. We introduce a multimodal AI framework that integrates video, audio, and patient monitor data to provide objective and interpretable competency assessments. Using 90 anesthesia residents, we established “ideal” performance benchmarks and trained an anomaly detection model (MEMTO) to quantify deviations from these benchmarks. Competency scores derived from these deviations showed strong alignment with expert ratings (Spearman’s ρ = 0.78; ICC = 0.75) and demonstrated high ranking precision (Relative L2-distance = 0.12). SHAP analysis revealed that communication and eye contact with the patient monitor are key drivers of variability. By linking AI-assisted anomaly detection with interpretable feedback, our framework addresses critical challenges of fairness, reliability, and transparency in simulation-based education. This work provides actionable evidence for integrating AI into medical training and advancing scalable, equitable evaluation of competence.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"44 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146102077","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

Impact of broadband availability and digital literacy on video telehealth use among cancer patients 宽带可用性和数字素养对癌症患者视频远程医疗使用的影响

IF 15.2 1区医学 Q1 HEALTH CARE SCIENCES & SERVICES

NPJ Digital Medicine

Pub Date : 2026-02-03 DOI: 10.1038/s41746-026-02397-9

Joshua C. Pritchett, Pravesh Sharma, Ming Huang, Ruchita Dholakia, Tabetha A. Brockman, James P. Moriarty, Celia C. Kamath, Hannah Ahn, Paul A. Decker, Ruoxiang Jiang, Jonathan Ticku, Nandita Khera, LaPrincess C. Brewer, Jon C. Tilburt, Bijan J. Borah, Christi A. Patten, Tufia C. Haddad

Video telehealth visits (VTV) have emerged as a critical tool for oncology care delivery, with potential to address longstanding access disparities. We examined the association between broadband internet availability, individual digital literacy factors, and VTV utilization among patients with cancer. In a retrospective cohort of 13,897 patients across a multi-site practice, VTV utilization was significantly lower in areas with ≤1 internet service provider (ISP) offering download speeds ≥25 Mbps (p = 0.0009). Validation in a regional cohort (n = 6665) confirmed lower VTV utilization in low-broadband areas. Among 1134 surveyed patients, higher digital literacy was the strongest predictor of VTV use (OR 2.5; p < 0.001), even where broadband was limited. This study demonstrates that while both broadband availability and digital literacy independently influence VTV utilization, individual digital skills can partially offset structural limitations, underscoring the need for concurrent investment in broadband infrastructure and targeted digital literacy initiatives to advance access to care.

视频远程医疗访问（VTV）已成为肿瘤护理提供的关键工具，有可能解决长期存在的获取差距。我们研究了宽带互联网可用性、个人数字素养因素和癌症患者VTV使用之间的关系。在一项多地点实践的13897名患者的回顾性队列研究中，在≤1个互联网服务提供商（ISP）提供下载速度≥25 Mbps的地区，VTV利用率明显较低（p = 0.0009）。区域队列验证（n = 6665）证实低宽带地区的VTV利用率较低。在1134名接受调查的患者中，更高的数字素养是VTV使用的最强预测因子（OR 2.5; p < 0.001），即使在宽带有限的地方也是如此。本研究表明，虽然宽带可用性和数字素养都独立影响着VTV的利用，但个人数字技能可以部分抵消结构性限制，强调需要同时投资宽带基础设施和有针对性的数字素养举措，以促进获得医疗服务。

{"title":"Impact of broadband availability and digital literacy on video telehealth use among cancer patients","authors":"Joshua C. Pritchett, Pravesh Sharma, Ming Huang, Ruchita Dholakia, Tabetha A. Brockman, James P. Moriarty, Celia C. Kamath, Hannah Ahn, Paul A. Decker, Ruoxiang Jiang, Jonathan Ticku, Nandita Khera, LaPrincess C. Brewer, Jon C. Tilburt, Bijan J. Borah, Christi A. Patten, Tufia C. Haddad","doi":"10.1038/s41746-026-02397-9","DOIUrl":"https://doi.org/10.1038/s41746-026-02397-9","url":null,"abstract":"Video telehealth visits (VTV) have emerged as a critical tool for oncology care delivery, with potential to address longstanding access disparities. We examined the association between broadband internet availability, individual digital literacy factors, and VTV utilization among patients with cancer. In a retrospective cohort of 13,897 patients across a multi-site practice, VTV utilization was significantly lower in areas with ≤1 internet service provider (ISP) offering download speeds ≥25 Mbps (p = 0.0009). Validation in a regional cohort (n = 6665) confirmed lower VTV utilization in low-broadband areas. Among 1134 surveyed patients, higher digital literacy was the strongest predictor of VTV use (OR 2.5; p < 0.001), even where broadband was limited. This study demonstrates that while both broadband availability and digital literacy independently influence VTV utilization, individual digital skills can partially offset structural limitations, underscoring the need for concurrent investment in broadband infrastructure and targeted digital literacy initiatives to advance access to care.","PeriodicalId":19349,"journal":{"name":"NPJ Digital Medicine","volume":"87 1","pages":""},"PeriodicalIF":15.2,"publicationDate":"2026-02-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"146102071","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 0

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

NPJ Digital Medicine

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀