VDRNet19: a dense residual deep learning model using stochastic gradient descent with momentum optimizer based on VGG-structure for classifying dementia

M. Pandiyarajan, R. S. Valarmathi
{"title":"VDRNet19: a dense residual deep learning model using stochastic gradient descent with momentum optimizer based on VGG-structure for classifying dementia","authors":"M. Pandiyarajan, R. S. Valarmathi","doi":"10.1007/s41870-024-02103-6","DOIUrl":null,"url":null,"abstract":"<p>Dementia disease is a syndrome caused by various disorders and conditions that affect the brain which causes gradual decline in neurological function commonly observed in older individuals. The disease is categorized into three stages in our research: Mild dementia (MD), Non-dementia (ND) and very mild dementia (VMD). Magnetic Resonance Imaging (MRI) scan of the brain is used for diagnosing dementia. In this research, a dense residual deep learning model using stochastic gradient descent with momentum optimizer based on VGG-structure for classifying dementia (VDRNet19) is proposed, which can detect all three stages of dementia The proposed model is trained and tested with the Open Access Series of Imaging and Studies (OASIS) dataset. In this work, the Contrast Limited Adaptive Histogram Equalization (CLAHE) image enhancement method is employed to preprocess the raw for analysis. In order to confront the imbalance in dataset, augmentation techniques are used. As a result, a balanced dataset comprising a total of 1941 images across the three classes are obtained. Initially, six existing models including DenseNet201, VGG19, ResNet152, AlzheimerNet [13], MobileNetV2 and ensemble of pretrained networks were trained and tested to attain 93.84%, 92.42%, 91.1%, 89.73%, 87.67% and 94.86% of test accuracies respectively. DenseNet201, VGG19, ResNet152 yields the highest accuracy, which is the backbone to design the proposed model. VDRNet19 using optimizer as stochastic gradient descent with momentum, 0.01 as learning rate, achieves the highest testing accuracy of 97.26%. This study compared six pre-trained models alongside the proposed model in terms of performance metrics to determine if the VDRNet19 model excels in classifying the three classes. An ablation study was conducted to validate the chosen hyperparameters. Results indicate that the proposed model surpasses traditional methods in classifying dementia stages from brain MRI scan images.</p>","PeriodicalId":14138,"journal":{"name":"International Journal of Information Technology","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Information Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s41870-024-02103-6","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Dementia disease is a syndrome caused by various disorders and conditions that affect the brain which causes gradual decline in neurological function commonly observed in older individuals. The disease is categorized into three stages in our research: Mild dementia (MD), Non-dementia (ND) and very mild dementia (VMD). Magnetic Resonance Imaging (MRI) scan of the brain is used for diagnosing dementia. In this research, a dense residual deep learning model using stochastic gradient descent with momentum optimizer based on VGG-structure for classifying dementia (VDRNet19) is proposed, which can detect all three stages of dementia The proposed model is trained and tested with the Open Access Series of Imaging and Studies (OASIS) dataset. In this work, the Contrast Limited Adaptive Histogram Equalization (CLAHE) image enhancement method is employed to preprocess the raw for analysis. In order to confront the imbalance in dataset, augmentation techniques are used. As a result, a balanced dataset comprising a total of 1941 images across the three classes are obtained. Initially, six existing models including DenseNet201, VGG19, ResNet152, AlzheimerNet [13], MobileNetV2 and ensemble of pretrained networks were trained and tested to attain 93.84%, 92.42%, 91.1%, 89.73%, 87.67% and 94.86% of test accuracies respectively. DenseNet201, VGG19, ResNet152 yields the highest accuracy, which is the backbone to design the proposed model. VDRNet19 using optimizer as stochastic gradient descent with momentum, 0.01 as learning rate, achieves the highest testing accuracy of 97.26%. This study compared six pre-trained models alongside the proposed model in terms of performance metrics to determine if the VDRNet19 model excels in classifying the three classes. An ablation study was conducted to validate the chosen hyperparameters. Results indicate that the proposed model surpasses traditional methods in classifying dementia stages from brain MRI scan images.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
VDRNet19:使用基于 VGG 结构的随机梯度下降与动量优化器的密集残差深度学习模型,用于痴呆症分类
痴呆症是由各种影响大脑的疾病和病症引起的综合征,导致神经功能逐渐衰退,常见于老年人。在我们的研究中,这种疾病被分为三个阶段:轻度痴呆(MD)、非痴呆(ND)和极轻度痴呆(VMD)。脑部磁共振成像(MRI)扫描用于诊断痴呆症。本研究提出了一种基于 VGG 结构的使用随机梯度下降与动量优化器的密集残差深度学习模型(VDRNet19),用于痴呆症分类,该模型可检测痴呆症的所有三个阶段。在这项工作中,采用了对比度受限自适应直方图均衡化(CLAHE)图像增强方法对原始图像进行预处理,以便进行分析。为了解决数据集中的不平衡问题,使用了增强技术。结果,得到了一个由三个类别共 1941 幅图像组成的平衡数据集。最初,对包括 DenseNet201、VGG19、ResNet152、AlzheimerNet [13]、MobileNetV2 和预训练网络集合在内的六个现有模型进行了训练和测试,测试准确率分别达到 93.84%、92.42%、91.1%、89.73%、87.67% 和 94.86%。其中,DenseNet201、VGG19 和 ResNet152 的准确率最高,是设计所提模型的基础。VDRNet19 的优化器为随机梯度下降,学习率为 0.01,测试准确率最高,达到 97.26%。本研究比较了六个预先训练的模型和所提出模型的性能指标,以确定 VDRNet19 模型是否能出色地对三个类别进行分类。为了验证所选的超参数,还进行了一项消融研究。结果表明,在根据脑磁共振成像扫描图像对痴呆症阶段进行分类方面,所提出的模型超越了传统方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Statistical cryptanalysis of seven classical lightweight ciphers CNN-BO-LSTM: an ensemble framework for prognosis of liver cancer Architecting lymphoma fusion: PROMETHEE-II guided optimization of combination therapeutic synergy RBCA-ETS: enhancing extractive text summarization with contextual embedding and word-level attention RAMD and transient analysis of a juice clarification unit in sugar plants
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1