Accurate prediction of disease-risk factors from volumetric medical scans by a deep vision model pre-trained with 2D scans

IF 26.8 1区医学 Q1 ENGINEERING, BIOMEDICAL Nature Biomedical Engineering Pub Date : 2024-10-01 DOI:10.1038/s41551-024-01257-9

Oren Avram, Berkin Durmus, Nadav Rakocz, Giulia Corradetti, Ulzee An, Muneeswar G. Nittala, Prerit Terway, Akos Rudas, Zeyuan Johnson Chen, Yu Wakatsuki, Kazutaka Hirabayashi, Swetha Velaga, Liran Tiosano, Federico Corvi, Aditya Verma, Ayesha Karamat, Sophiana Lindenberg, Deniz Oncel, Louay Almidani, Victoria Hull, Sohaib Fasih-Ahmad, Houri Esmaeilkhanian, Maxime Cannesson, Charles C. Wykoff, Elior Rahmani, Corey W. Arnold, Bolei Zhou, Noah Zaitlen, Ilan Gronau, Sriram Sankararaman, Jeffrey N. Chiang, Srinivas R. Sadda, Eran Halperin

{"title":"Accurate prediction of disease-risk factors from volumetric medical scans by a deep vision model pre-trained with 2D scans","authors":"Oren Avram, Berkin Durmus, Nadav Rakocz, Giulia Corradetti, Ulzee An, Muneeswar G. Nittala, Prerit Terway, Akos Rudas, Zeyuan Johnson Chen, Yu Wakatsuki, Kazutaka Hirabayashi, Swetha Velaga, Liran Tiosano, Federico Corvi, Aditya Verma, Ayesha Karamat, Sophiana Lindenberg, Deniz Oncel, Louay Almidani, Victoria Hull, Sohaib Fasih-Ahmad, Houri Esmaeilkhanian, Maxime Cannesson, Charles C. Wykoff, Elior Rahmani, Corey W. Arnold, Bolei Zhou, Noah Zaitlen, Ilan Gronau, Sriram Sankararaman, Jeffrey N. Chiang, Srinivas R. Sadda, Eran Halperin","doi":"10.1038/s41551-024-01257-9","DOIUrl":null,"url":null,"abstract":"<p>The application of machine learning to tasks involving volumetric biomedical imaging is constrained by the limited availability of annotated datasets of three-dimensional (3D) scans for model training. Here we report a deep-learning model pre-trained on 2D scans (for which annotated data are relatively abundant) that accurately predicts disease-risk factors from 3D medical-scan modalities. The model, which we named SLIViT (for ‘slice integration by vision transformer’), preprocesses a given volumetric scan into 2D images, extracts their feature map and integrates it into a single prediction. We evaluated the model in eight different learning tasks, including classification and regression for six datasets involving four volumetric imaging modalities (computed tomography, magnetic resonance imaging, optical coherence tomography and ultrasound). SLIViT consistently outperformed domain-specific state-of-the-art models and was typically as accurate as clinical specialists who had spent considerable time manually annotating the analysed scans. Automating diagnosis tasks involving volumetric scans may save valuable clinician hours, reduce data acquisition costs and duration, and help expedite medical research and clinical applications.</p>","PeriodicalId":19063,"journal":{"name":"Nature Biomedical Engineering","volume":"17 1","pages":""},"PeriodicalIF":26.8000,"publicationDate":"2024-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature Biomedical Engineering","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1038/s41551-024-01257-9","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}

引用次数: 0

Abstract

The application of machine learning to tasks involving volumetric biomedical imaging is constrained by the limited availability of annotated datasets of three-dimensional (3D) scans for model training. Here we report a deep-learning model pre-trained on 2D scans (for which annotated data are relatively abundant) that accurately predicts disease-risk factors from 3D medical-scan modalities. The model, which we named SLIViT (for ‘slice integration by vision transformer’), preprocesses a given volumetric scan into 2D images, extracts their feature map and integrates it into a single prediction. We evaluated the model in eight different learning tasks, including classification and regression for six datasets involving four volumetric imaging modalities (computed tomography, magnetic resonance imaging, optical coherence tomography and ultrasound). SLIViT consistently outperformed domain-specific state-of-the-art models and was typically as accurate as clinical specialists who had spent considerable time manually annotating the analysed scans. Automating diagnosis tasks involving volumetric scans may save valuable clinician hours, reduce data acquisition costs and duration, and help expedite medical research and clinical applications.

Abstract Image

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

用二维扫描预训练的深度视觉模型从体积医学扫描中准确预测疾病风险因素

由于用于模型训练的三维（3D）扫描注释数据集有限，机器学习在涉及容积生物医学成像任务中的应用受到限制。在此，我们报告了一种在二维扫描（注释数据相对丰富）上预先训练的深度学习模型，该模型能从三维医学扫描模式中准确预测疾病风险因素。我们将该模型命名为 SLIViT（意为 "通过视觉转换器进行切片整合"），它将给定的容积扫描预处理为二维图像，提取其特征图，并将其整合为一个预测结果。我们在八个不同的学习任务中对该模型进行了评估，包括涉及四种容积成像模式（计算机断层扫描、磁共振成像、光学相干断层扫描和超声波）的六个数据集的分类和回归。SLIViT 的表现始终优于特定领域的最先进模型，其准确性通常不亚于花费大量时间手动标注分析扫描结果的临床专家。将涉及容积扫描的诊断任务自动化，可以节省临床医生的宝贵时间，降低数据采集成本，缩短数据采集时间，并有助于加快医学研究和临床应用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Nature Biomedical Engineering Medicine-Medicine (miscellaneous)

CiteScore

45.30

自引率

1.10%

发文量

138

期刊介绍： Nature Biomedical Engineering is an online-only monthly journal that was launched in January 2017. It aims to publish original research, reviews, and commentary focusing on applied biomedicine and health technology. The journal targets a diverse audience, including life scientists who are involved in developing experimental or computational systems and methods to enhance our understanding of human physiology. It also covers biomedical researchers and engineers who are engaged in designing or optimizing therapies, assays, devices, or procedures for diagnosing or treating diseases. Additionally, clinicians, who make use of research outputs to evaluate patient health or administer therapy in various clinical settings and healthcare contexts, are also part of the target audience.