Melinda Y. Chang MD , Gena Heidary MD, PhD , Shannon Beres MD , Stacy L. Pineles MD , Eric D. Gaier MD, PhD , Ryan Gise MD , Mark Reid PhD , Kleanthis Avramidis MEng , Mohammad Rostami PhD , Shrikanth Narayanan PhD , Pediatric Optic Nerve Investigator Group (PONIG)
{"title":"Artificial Intelligence to Differentiate Pediatric Pseudopapilledema and True Papilledema on Fundus Photographs","authors":"Melinda Y. Chang MD , Gena Heidary MD, PhD , Shannon Beres MD , Stacy L. Pineles MD , Eric D. Gaier MD, PhD , Ryan Gise MD , Mark Reid PhD , Kleanthis Avramidis MEng , Mohammad Rostami PhD , Shrikanth Narayanan PhD , Pediatric Optic Nerve Investigator Group (PONIG)","doi":"10.1016/j.xops.2024.100496","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><p>To develop and test an artificial intelligence (AI) model to aid in differentiating pediatric pseudopapilledema from true papilledema on fundus photographs.</p></div><div><h3>Design</h3><p>Multicenter retrospective study.</p></div><div><h3>Subjects</h3><p>A total of 851 fundus photographs from 235 children (age < 18 years) with pseudopapilledema and true papilledema.</p></div><div><h3>Methods</h3><p>Four pediatric neuro-ophthalmologists at 4 different institutions contributed fundus photographs of children with confirmed diagnoses of papilledema or pseudopapilledema. An AI model to classify fundus photographs as papilledema or pseudopapilledema was developed using a DenseNet backbone and a tribranch convolutional neural network. We performed 10-fold cross-validation and separately analyzed an external test set. The AI model’s performance was compared with 2 masked human expert pediatric neuro-ophthalmologists, who performed the same classification task.</p></div><div><h3>Main Outcome Measures</h3><p>Accuracy, sensitivity, and specificity of the AI model compared with human experts.</p></div><div><h3>Results</h3><p>The area under receiver operating curve of the AI model was 0.77 for the cross-validation set and 0.81 for the external test set. The accuracy of the AI model was 70.0% for the cross-validation set and 73.9% for the external test set. The sensitivity of the AI model was 73.4% for the cross-validation set and 90.4% for the external test set. The AI model’s accuracy was significantly higher than human experts on the cross validation set (<em>P</em> < 0.002), and the model’s sensitivity was significantly higher on the external test set (<em>P</em> = 0.0002). The specificity of the AI model and human experts was similar (56.4%–67.3%). Moreover, the AI model was significantly more sensitive at detecting mild papilledema than human experts, whereas AI and humans performed similarly on photographs of moderate-to-severe papilledema. On review of the external test set, only 1 child (with nearly resolved pseudotumor cerebri) had both eyes with papilledema incorrectly classified as pseudopapilledema.</p></div><div><h3>Conclusions</h3><p>When classifying fundus photographs of pediatric papilledema and pseudopapilledema, our AI model achieved > 90% sensitivity at detecting papilledema, superior to human experts. Due to the high sensitivity and low false negative rate, AI may be useful to triage children with suspected papilledema requiring work-up to evaluate for serious underlying neurologic conditions.</p></div><div><h3>Financial Disclosure(s)</h3><p>Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.</p></div>","PeriodicalId":74363,"journal":{"name":"Ophthalmology science","volume":null,"pages":null},"PeriodicalIF":3.2000,"publicationDate":"2024-02-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666914524000320/pdfft?md5=a4155e55d0289e14935dfa99b5052346&pid=1-s2.0-S2666914524000320-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ophthalmology science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666914524000320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"OPHTHALMOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
To develop and test an artificial intelligence (AI) model to aid in differentiating pediatric pseudopapilledema from true papilledema on fundus photographs.
Design
Multicenter retrospective study.
Subjects
A total of 851 fundus photographs from 235 children (age < 18 years) with pseudopapilledema and true papilledema.
Methods
Four pediatric neuro-ophthalmologists at 4 different institutions contributed fundus photographs of children with confirmed diagnoses of papilledema or pseudopapilledema. An AI model to classify fundus photographs as papilledema or pseudopapilledema was developed using a DenseNet backbone and a tribranch convolutional neural network. We performed 10-fold cross-validation and separately analyzed an external test set. The AI model’s performance was compared with 2 masked human expert pediatric neuro-ophthalmologists, who performed the same classification task.
Main Outcome Measures
Accuracy, sensitivity, and specificity of the AI model compared with human experts.
Results
The area under receiver operating curve of the AI model was 0.77 for the cross-validation set and 0.81 for the external test set. The accuracy of the AI model was 70.0% for the cross-validation set and 73.9% for the external test set. The sensitivity of the AI model was 73.4% for the cross-validation set and 90.4% for the external test set. The AI model’s accuracy was significantly higher than human experts on the cross validation set (P < 0.002), and the model’s sensitivity was significantly higher on the external test set (P = 0.0002). The specificity of the AI model and human experts was similar (56.4%–67.3%). Moreover, the AI model was significantly more sensitive at detecting mild papilledema than human experts, whereas AI and humans performed similarly on photographs of moderate-to-severe papilledema. On review of the external test set, only 1 child (with nearly resolved pseudotumor cerebri) had both eyes with papilledema incorrectly classified as pseudopapilledema.
Conclusions
When classifying fundus photographs of pediatric papilledema and pseudopapilledema, our AI model achieved > 90% sensitivity at detecting papilledema, superior to human experts. Due to the high sensitivity and low false negative rate, AI may be useful to triage children with suspected papilledema requiring work-up to evaluate for serious underlying neurologic conditions.
Financial Disclosure(s)
Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.