Vision Transformer based Deep Learning Model for Monkeypox Detection

Dipanjali Kundu, Umme Raihan Siddiqi, Md. Mahbubur Rahman
{"title":"Vision Transformer based Deep Learning Model for Monkeypox Detection","authors":"Dipanjali Kundu, Umme Raihan Siddiqi, Md. Mahbubur Rahman","doi":"10.1109/ICCIT57492.2022.10054797","DOIUrl":null,"url":null,"abstract":"Images of skin lesions may be used to detect this virus, which is a reliable method for identifying the pox virus group. However, early identification and prediction are difficult due to the virus’s resemblance to other pox viruses. An intelligent computer-aided detection system may be a great alternative to relying on labor-intensive human identification. Therefore, in this research an machine learning and deep learning classification method for monkeypox prediction has been proposed and trained, and tested over 1300 skin lesion images. A comparative analysis of machine learning algorithms (K-NN and SVM) and Deep learning algorithms (Vision Transformer, RestNet50) to establish the efficacy of this study. Layered Convolutional Neural Network (CNN) with transfer learning and pretrained models such as RestNet50 integrated, together with customized hyperparameters for extracting the features from the input images. The feed-forward, which is also completely integrated, helped the algorithm divide the visuals into two categories–chickenpox and monkeypox. Among the ML model, the K-NN achieves the best accuracy of 84%. However, The Vision Transformer(ViT) outperforms the other models with an accuracy of 93%. In Addition to it, we analyze our pretrained model to achieve the desired outcome based on the relevant existing model as already established to the end user.","PeriodicalId":255498,"journal":{"name":"2022 25th International Conference on Computer and Information Technology (ICCIT)","volume":"93 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 25th International Conference on Computer and Information Technology (ICCIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIT57492.2022.10054797","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Images of skin lesions may be used to detect this virus, which is a reliable method for identifying the pox virus group. However, early identification and prediction are difficult due to the virus’s resemblance to other pox viruses. An intelligent computer-aided detection system may be a great alternative to relying on labor-intensive human identification. Therefore, in this research an machine learning and deep learning classification method for monkeypox prediction has been proposed and trained, and tested over 1300 skin lesion images. A comparative analysis of machine learning algorithms (K-NN and SVM) and Deep learning algorithms (Vision Transformer, RestNet50) to establish the efficacy of this study. Layered Convolutional Neural Network (CNN) with transfer learning and pretrained models such as RestNet50 integrated, together with customized hyperparameters for extracting the features from the input images. The feed-forward, which is also completely integrated, helped the algorithm divide the visuals into two categories–chickenpox and monkeypox. Among the ML model, the K-NN achieves the best accuracy of 84%. However, The Vision Transformer(ViT) outperforms the other models with an accuracy of 93%. In Addition to it, we analyze our pretrained model to achieve the desired outcome based on the relevant existing model as already established to the end user.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于视觉变换的猴痘检测深度学习模型
皮肤病变图像可用于检测该病毒,这是识别痘病毒群的可靠方法。然而,由于该病毒与其他痘病毒相似,早期识别和预测是困难的。智能计算机辅助检测系统可能是依赖劳动密集型的人类识别的一个很好的替代方案。因此,本研究提出了一种用于猴痘预测的机器学习和深度学习分类方法,并对1300多张皮肤病变图像进行了训练和测试。对机器学习算法(K-NN和SVM)和深度学习算法(Vision Transformer, RestNet50)进行对比分析,以确定本研究的有效性。结合了迁移学习和RestNet50等预训练模型的分层卷积神经网络(CNN),以及从输入图像中提取特征的自定义超参数。前馈,也是完全集成的,帮助算法将视觉分为两类——水痘和猴痘。在ML模型中,K-NN达到了84%的最佳准确率。然而,视觉变压器(ViT)以93%的准确率优于其他模型。除此之外,我们还分析我们的预训练模型,以实现基于已经建立的最终用户的相关现有模型的期望结果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
SlotFinder: A Spatio-temporal based Car Parking System Land Cover and Land Use Detection using Semi-Supervised Learning Comparative Analysis of Process Scheduling Algorithm using AI models Throughput Optimization of IEEE 802.15.4e TSCH-Based Scheduling: A Deep Neural Network (DNN) Scheme Towards Developing a Voice-Over-Guided System for Visually Impaired People to Learn Writing the Alphabets
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1