Shiva Sabazade MD , Marco A. Lumia Michalski MD , Jakub Bartoszek MD , Maria Fili MD, PhD , Mats Holmström MS , Gustav Stålhammar MD, PhD
{"title":"Development and Validation of a Deep Learning Algorithm for Differentiation of Choroidal Nevi from Small Melanoma in Fundus Photographs","authors":"Shiva Sabazade MD , Marco A. Lumia Michalski MD , Jakub Bartoszek MD , Maria Fili MD, PhD , Mats Holmström MS , Gustav Stålhammar MD, PhD","doi":"10.1016/j.xops.2024.100613","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>To develop and validate a deep learning algorithm capable of differentiating small choroidal melanomas from nevi.</div></div><div><h3>Design</h3><div>Retrospective multicenter cohort study.</div></div><div><h3>Participants</h3><div>A total of 802 images from 688 patients diagnosed with choroidal nevi or melanoma.</div></div><div><h3>Methods</h3><div>Wide field and standard field fundus photographs were collected from patients diagnosed with choroidal nevi or melanoma by ocular oncologists during clinical examinations. A lesion was classified as a nevus if it was followed for at least 5 years without being rediagnosed as melanoma. A neural network optimized for image classification was trained and validated on cohorts of 495 and 168 images and subsequently tested on independent sets of 86 and 53 images.</div></div><div><h3>Main Outcome Measures</h3><div>Area under the curve (AUC) in receiver operating characteristic analysis for differentiating small choroidal melanomas from nevi.</div></div><div><h3>Results</h3><div>The algorithm achieved an AUC of 0.88 in both test cohorts, outperforming ophthalmologists using the Mushroom shape, Orange pigment, Large size, Enlargement, and Subretinal fluid (AUC 0.77) and To Find Small Ocular Melanoma Using Helpful Hints Daily (AUC 0.67) risk factors (DeLong’s test, <em>P</em> < 0.001). The algorithm performed equally well for wide field and standard field photos (AUC 0.89 for both when analyzed separately). Using an optimal operating point of 0.63 (on a scale from 0.00 to 1.00) determined from the training and validation datasets, the algorithm achieved 100% sensitivity and 74% specificity in the first test cohort (F-score 0.72), and 80% sensitivity and 81% specificity in the second (F-score 0.71), which consisted of images from external clinics nationwide. It outperformed 12 ophthalmologists in sensitivity (Mann–Whitney <em>U</em>, <em>P</em> = 0.006) but not specificity (<em>P</em> = 0.54). The algorithm showed higher sensitivity than both resident and consultant ophthalmologists (Dunn's test, <em>P</em> = 0.04 and <em>P</em> = 0.006, respectively) but not ocular oncologists (<em>P</em> > 0.99, all <em>P</em> values Bonferroni corrected).</div></div><div><h3>Conclusions</h3><div>This study develops and validates a deep learning algorithm for differentiating small choroidal melanomas from nevi, matching or surpassing the discriminatory performance of experienced human ophthalmologists. Further research will aim to validate its utility in clinical settings.</div></div><div><h3>Financial Disclosure(s)</h3><div>Financial DisclosuresProprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</div></div>","PeriodicalId":74363,"journal":{"name":"Ophthalmology science","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666914524001490","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
To develop and validate a deep learning algorithm capable of differentiating small choroidal melanomas from nevi.
Design
Retrospective multicenter cohort study.
Participants
A total of 802 images from 688 patients diagnosed with choroidal nevi or melanoma.
Methods
Wide field and standard field fundus photographs were collected from patients diagnosed with choroidal nevi or melanoma by ocular oncologists during clinical examinations. A lesion was classified as a nevus if it was followed for at least 5 years without being rediagnosed as melanoma. A neural network optimized for image classification was trained and validated on cohorts of 495 and 168 images and subsequently tested on independent sets of 86 and 53 images.
Main Outcome Measures
Area under the curve (AUC) in receiver operating characteristic analysis for differentiating small choroidal melanomas from nevi.
Results
The algorithm achieved an AUC of 0.88 in both test cohorts, outperforming ophthalmologists using the Mushroom shape, Orange pigment, Large size, Enlargement, and Subretinal fluid (AUC 0.77) and To Find Small Ocular Melanoma Using Helpful Hints Daily (AUC 0.67) risk factors (DeLong’s test, P < 0.001). The algorithm performed equally well for wide field and standard field photos (AUC 0.89 for both when analyzed separately). Using an optimal operating point of 0.63 (on a scale from 0.00 to 1.00) determined from the training and validation datasets, the algorithm achieved 100% sensitivity and 74% specificity in the first test cohort (F-score 0.72), and 80% sensitivity and 81% specificity in the second (F-score 0.71), which consisted of images from external clinics nationwide. It outperformed 12 ophthalmologists in sensitivity (Mann–Whitney U, P = 0.006) but not specificity (P = 0.54). The algorithm showed higher sensitivity than both resident and consultant ophthalmologists (Dunn's test, P = 0.04 and P = 0.006, respectively) but not ocular oncologists (P > 0.99, all P values Bonferroni corrected).
Conclusions
This study develops and validates a deep learning algorithm for differentiating small choroidal melanomas from nevi, matching or surpassing the discriminatory performance of experienced human ophthalmologists. Further research will aim to validate its utility in clinical settings.
Financial Disclosure(s)
Financial DisclosuresProprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.