G. Nirmala , P. Prathap Nayudu , A. Ranjith Kumar , Renuka Sagar
{"title":"Automatic cervical cancer classification using adaptive vision transformer encoder with CNN for medical application","authors":"G. Nirmala , P. Prathap Nayudu , A. Ranjith Kumar , Renuka Sagar","doi":"10.1016/j.patcog.2024.111201","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and early cervical cancer screening can reduce the mortality rate of cervical cancer patients. The Pap test, often known as a Pap smear, is one of the frequently used methods for the early diagnosis of cervical cancer. However, manual analysis can be time-consuming. Previous approaches have faced challenges such as low accuracy, increased computing complexity, larger feature dimensionality, poor reliability, and increased time consumption due to subpar hyper-parameter optimization. This paper proposes an automatic cervical cancer classification system using a deep learning algorithm to address these issues. The proposed system consists of three stages: pre-processing, segmentation, and classification. Initially, images are collected and pre-processed through normalization, smoothing, and resizing. The pre-processed images are then passed to the segmentation stage, where an Adaptive Deep Residual Aggregation Network is utilized (ADRAN). After segmentation, the images are classified into seven categories: Carcinoma_in_situ, Light_dysplastic, Moderate_dysplastic, Normal_columnar, Normal_Intermediate, Normal_superficial, and Severe_dysplastic using an Adaptive Vision Transformer Encoder (AVTE) with CNN. To improve the efficiency of the transformer learning network, the hyperparameters of AVTE with CNN are optimized using an Adaptive Cat Swarm Optimization algorithm (ACSO). The efficiency of the presented technique is evaluated based on various metrics, and experimentation is conducted using the Herlev dataset.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"160 ","pages":"Article 111201"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S003132032400952X","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate and early cervical cancer screening can reduce the mortality rate of cervical cancer patients. The Pap test, often known as a Pap smear, is one of the frequently used methods for the early diagnosis of cervical cancer. However, manual analysis can be time-consuming. Previous approaches have faced challenges such as low accuracy, increased computing complexity, larger feature dimensionality, poor reliability, and increased time consumption due to subpar hyper-parameter optimization. This paper proposes an automatic cervical cancer classification system using a deep learning algorithm to address these issues. The proposed system consists of three stages: pre-processing, segmentation, and classification. Initially, images are collected and pre-processed through normalization, smoothing, and resizing. The pre-processed images are then passed to the segmentation stage, where an Adaptive Deep Residual Aggregation Network is utilized (ADRAN). After segmentation, the images are classified into seven categories: Carcinoma_in_situ, Light_dysplastic, Moderate_dysplastic, Normal_columnar, Normal_Intermediate, Normal_superficial, and Severe_dysplastic using an Adaptive Vision Transformer Encoder (AVTE) with CNN. To improve the efficiency of the transformer learning network, the hyperparameters of AVTE with CNN are optimized using an Adaptive Cat Swarm Optimization algorithm (ACSO). The efficiency of the presented technique is evaluated based on various metrics, and experimentation is conducted using the Herlev dataset.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.