Ting-Wei Wang MD, PhD , Yu-Chieh Shiao MD , Jia-Sheng Hong PhD , Wei-Kai Lee PhD , Ming-Sheng Hsu MD , Hao-Min Cheng MD, PhD , Huai-Che Yang MD, PhD , Cheng-Chia Lee MD, PhD , Hung-Chuan Pan MD, PhD , Weir Chiang You MD, PhD , Jiing-Feng Lirng MD , Wan-Yuo Guo MD, PhD , Yu-Te Wu PhD
{"title":"Artificial Intelligence Detection and Segmentation Models: A Systematic Review and Meta-Analysis of Brain Tumors in Magnetic Resonance Imaging","authors":"Ting-Wei Wang MD, PhD , Yu-Chieh Shiao MD , Jia-Sheng Hong PhD , Wei-Kai Lee PhD , Ming-Sheng Hsu MD , Hao-Min Cheng MD, PhD , Huai-Che Yang MD, PhD , Cheng-Chia Lee MD, PhD , Hung-Chuan Pan MD, PhD , Weir Chiang You MD, PhD , Jiing-Feng Lirng MD , Wan-Yuo Guo MD, PhD , Yu-Te Wu PhD","doi":"10.1016/j.mcpdig.2024.01.002","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>To thoroughly analyze factors affecting the generalization ability of deep learning algorithms on brain tumor detection and segmentation models.</p></div><div><h3>Patients and Methods</h3><p>We searched PubMed, Embase, Web of Science, Cochrane Library, and IEEE from inception to July 25, 2023, and 19 studies with 12,000 patients were identified. The criteria required studies to use magnetic resonance imaging (MRI) for brain tumor detection and segmentation, offer clear performance metrics, and use external validation data sets. The study focused on outcomes such as sensitivity and Dice score. Study quality was assessed using QUADAS-2 and CLAIM tools. The meta-analysis evaluated varying algorithms and their performance across different validation data sets.</p></div><div><h3>Results</h3><p>MRI hardware as the manufacturer may contribute to data set diversity, impacting AI model generalizability. The study found that the best algorithms had a pooled lesion-wise Dice score of 84%, with pooled sensitivities of 87% (patient-wise) and 86% (lesion-wise). Post-2022 methodologies highlighted evolving artificial intelligence techniques. Performance differences were evident among tumor types, likely due to size disparities. 3D models outperformed their 2D and ensemble counterparts in detection. Although specific preprocessing techniques improved segmentation outcomes, some hindered detection.</p></div><div><h3>Conclusion</h3><p>The study underscores the potential of deep learning in improving brain tumor diagnostics and treatment planning. We also identify the need for further research, including developing a comprehensive diversity index, expanded meta-analyses, and using generative adversarial networks for data diversification, paving the way for AI-driven advancements in oncological patient care.</p></div><div><h3>Trial Registration</h3><p>PROPERO (CRD42023459108).</p></div>","PeriodicalId":74127,"journal":{"name":"Mayo Clinic Proceedings. Digital health","volume":"2 1","pages":"Pages 75-91"},"PeriodicalIF":0.0000,"publicationDate":"2024-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2949761224000038/pdfft?md5=462accb0c195aebed809efe8ef0de1df&pid=1-s2.0-S2949761224000038-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mayo Clinic Proceedings. Digital health","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949761224000038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
To thoroughly analyze factors affecting the generalization ability of deep learning algorithms on brain tumor detection and segmentation models.
Patients and Methods
We searched PubMed, Embase, Web of Science, Cochrane Library, and IEEE from inception to July 25, 2023, and 19 studies with 12,000 patients were identified. The criteria required studies to use magnetic resonance imaging (MRI) for brain tumor detection and segmentation, offer clear performance metrics, and use external validation data sets. The study focused on outcomes such as sensitivity and Dice score. Study quality was assessed using QUADAS-2 and CLAIM tools. The meta-analysis evaluated varying algorithms and their performance across different validation data sets.
Results
MRI hardware as the manufacturer may contribute to data set diversity, impacting AI model generalizability. The study found that the best algorithms had a pooled lesion-wise Dice score of 84%, with pooled sensitivities of 87% (patient-wise) and 86% (lesion-wise). Post-2022 methodologies highlighted evolving artificial intelligence techniques. Performance differences were evident among tumor types, likely due to size disparities. 3D models outperformed their 2D and ensemble counterparts in detection. Although specific preprocessing techniques improved segmentation outcomes, some hindered detection.
Conclusion
The study underscores the potential of deep learning in improving brain tumor diagnostics and treatment planning. We also identify the need for further research, including developing a comprehensive diversity index, expanded meta-analyses, and using generative adversarial networks for data diversification, paving the way for AI-driven advancements in oncological patient care.