Yi-Fan Kang , Lie Yang , Kai Xu , Bin-Bin Hu , Lan-Jun Cai , Yin-Hao Liu , Xiang Lu
{"title":"A lightweight intelligent laryngeal cancer detection system for rural areas","authors":"Yi-Fan Kang , Lie Yang , Kai Xu , Bin-Bin Hu , Lan-Jun Cai , Yin-Hao Liu , Xiang Lu","doi":"10.1016/j.amjoto.2024.104474","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><p>Early diagnosis of laryngeal cancer (LC) is crucial, particularly in rural areas. Despite existing studies on deep learning models for LC identification, challenges remain in selecting suitable models for rural areas with shortages of laryngologists and limited computer resources. We present the intelligent laryngeal cancer detection system (ILCDS), a deep learning-based solution tailored for effective LC screening in resource-constrained rural areas.</p></div><div><h3>Methods</h3><p>We compiled a dataset comprised of 2023 laryngoscopic images and applied data augmentation techniques for dataset expansion. Subsequently, we utilized eight deep learning models—AlexNet, VGG, ResNet, DenseNet, MobileNet, ShuffleNet, Vision Transformer, and Swin Transformer—for LC identification. A comprehensive evaluation of their performances and efficiencies was conducted, and the most suitable model was selected to assemble the ILCDS.</p></div><div><h3>Results</h3><p>Regarding performance, all models attained an average accuracy exceeding 90 % on the test set. Particularly noteworthy are VGG, DenseNet, and MobileNet, which exceeded an accuracy of 95 %, with scores of 95.32 %, 95.75 %, and 95.99 %, respectively. Regarding efficiency, MobileNet excels owing to its compact size and fast inference speed, making it an ideal model for integration into ILCDS.</p></div><div><h3>Conclusion</h3><p>The ILCDS demonstrated promising accuracy in LC detection while maintaining modest computational resource requirements, indicating its potential to enhance LC screening accuracy and alleviate the workload on otolaryngologists in rural areas.</p></div>","PeriodicalId":7591,"journal":{"name":"American Journal of Otolaryngology","volume":"45 6","pages":"Article 104474"},"PeriodicalIF":1.8000,"publicationDate":"2024-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American Journal of Otolaryngology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0196070924002606","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OTORHINOLARYNGOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
Early diagnosis of laryngeal cancer (LC) is crucial, particularly in rural areas. Despite existing studies on deep learning models for LC identification, challenges remain in selecting suitable models for rural areas with shortages of laryngologists and limited computer resources. We present the intelligent laryngeal cancer detection system (ILCDS), a deep learning-based solution tailored for effective LC screening in resource-constrained rural areas.
Methods
We compiled a dataset comprised of 2023 laryngoscopic images and applied data augmentation techniques for dataset expansion. Subsequently, we utilized eight deep learning models—AlexNet, VGG, ResNet, DenseNet, MobileNet, ShuffleNet, Vision Transformer, and Swin Transformer—for LC identification. A comprehensive evaluation of their performances and efficiencies was conducted, and the most suitable model was selected to assemble the ILCDS.
Results
Regarding performance, all models attained an average accuracy exceeding 90 % on the test set. Particularly noteworthy are VGG, DenseNet, and MobileNet, which exceeded an accuracy of 95 %, with scores of 95.32 %, 95.75 %, and 95.99 %, respectively. Regarding efficiency, MobileNet excels owing to its compact size and fast inference speed, making it an ideal model for integration into ILCDS.
Conclusion
The ILCDS demonstrated promising accuracy in LC detection while maintaining modest computational resource requirements, indicating its potential to enhance LC screening accuracy and alleviate the workload on otolaryngologists in rural areas.
期刊介绍:
Be fully informed about developments in otology, neurotology, audiology, rhinology, allergy, laryngology, speech science, bronchoesophagology, facial plastic surgery, and head and neck surgery. Featured sections include original contributions, grand rounds, current reviews, case reports and socioeconomics.