Adil Oezsoy, James Alexander Brooks, Marko van Treeck, Yvonne Doerffel, Ulrike Morgera, Jens Berger, Marco Gustav, Oliver Lester Saldanha, Tom Luedde, Jakob Nikolas Kather, Tobias Paul Seraphin, Michael Kallenbach
{"title":"Weakly supervised deep learning can analyze focal liver lesions in contrast-enhanced ultrasound.","authors":"Adil Oezsoy, James Alexander Brooks, Marko van Treeck, Yvonne Doerffel, Ulrike Morgera, Jens Berger, Marco Gustav, Oliver Lester Saldanha, Tom Luedde, Jakob Nikolas Kather, Tobias Paul Seraphin, Michael Kallenbach","doi":"10.1159/000545098","DOIUrl":null,"url":null,"abstract":"<p><p>Introduction Assessing the malignancy of focal liver lesions is an important yet challenging aspect of routine patient care. Contrast-enhanced ultrasound (CEUS) has proved to be a highly reliable tool but is very dependent on the examiner's expertise. The emergence of artificial intelligence has opened doors to algorithms that could potentially aid in the diagnostic process. In this study, we evaluate the performance of a weakly supervised deep learning model in classifying focal liver lesions (FLL) as malignant or benign. Methods Our retrospective feasibility study was based on a cohort of patients from a tertiary care hospital in Germany undergoing routine CEUS examination to evaluate malignancy of FLL. We trained a weakly supervised attention-based multiple instance learning algorithm during 5-fold cross-validation to distinguish malignant from benign liver tumors, without using any manual annotations, only case labels. We aggregated the on-average best performing cross-validation cycle and tested this combined model on a held-out test set. We evaluated its performance using standard performance metrics and developed explainability methods to gain insight into the model's decisions. Results We enrolled 370 patients, comprising a total of 955,938 images extracted from CEUS videos or manually captured during the examination. Our combined model was able to identify malignant lesions with a mean area under the receiver operating curve of 0.844 in the cross-validation experiment and 0.94 (95% CI 0.89 - 0.99) in the held-out test set. The accuracy, sensitivity, specificity, and F1-Score of the combined model in finding malignant lesions in the held-out test, yielded 80.0%, 81.8%, 84.6%, and 0.81, respectively. Our exploratory analysis using visual explainability methods revealed that the model appears to prioritize information that is also highly relevant to expert clinicians in this task. Conclusions Weakly supervised deep learning can classify malignancy in CEUS examinations of FLLs and thus might one day be able to assist doctors' decision-making in clinical routine.</p>","PeriodicalId":11315,"journal":{"name":"Digestion","volume":" ","pages":"1-21"},"PeriodicalIF":3.0000,"publicationDate":"2025-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digestion","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1159/000545098","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction Assessing the malignancy of focal liver lesions is an important yet challenging aspect of routine patient care. Contrast-enhanced ultrasound (CEUS) has proved to be a highly reliable tool but is very dependent on the examiner's expertise. The emergence of artificial intelligence has opened doors to algorithms that could potentially aid in the diagnostic process. In this study, we evaluate the performance of a weakly supervised deep learning model in classifying focal liver lesions (FLL) as malignant or benign. Methods Our retrospective feasibility study was based on a cohort of patients from a tertiary care hospital in Germany undergoing routine CEUS examination to evaluate malignancy of FLL. We trained a weakly supervised attention-based multiple instance learning algorithm during 5-fold cross-validation to distinguish malignant from benign liver tumors, without using any manual annotations, only case labels. We aggregated the on-average best performing cross-validation cycle and tested this combined model on a held-out test set. We evaluated its performance using standard performance metrics and developed explainability methods to gain insight into the model's decisions. Results We enrolled 370 patients, comprising a total of 955,938 images extracted from CEUS videos or manually captured during the examination. Our combined model was able to identify malignant lesions with a mean area under the receiver operating curve of 0.844 in the cross-validation experiment and 0.94 (95% CI 0.89 - 0.99) in the held-out test set. The accuracy, sensitivity, specificity, and F1-Score of the combined model in finding malignant lesions in the held-out test, yielded 80.0%, 81.8%, 84.6%, and 0.81, respectively. Our exploratory analysis using visual explainability methods revealed that the model appears to prioritize information that is also highly relevant to expert clinicians in this task. Conclusions Weakly supervised deep learning can classify malignancy in CEUS examinations of FLLs and thus might one day be able to assist doctors' decision-making in clinical routine.
期刊介绍:
''Digestion'' concentrates on clinical research reports: in addition to editorials and reviews, the journal features sections on Stomach/Esophagus, Bowel, Neuro-Gastroenterology, Liver/Bile, Pancreas, Metabolism/Nutrition and Gastrointestinal Oncology. Papers cover physiology in humans, metabolic studies and clinical work on the etiology, diagnosis, and therapy of human diseases. It is thus especially cut out for gastroenterologists employed in hospitals and outpatient units. Moreover, the journal''s coverage of studies on the metabolism and effects of therapeutic drugs carries considerable value for clinicians and investigators beyond the immediate field of gastroenterology.