An efficient vision transformer for Alzheimer’s disease classification using magnetic resonance images

IF 4.9 2区 医学 Q1 ENGINEERING, BIOMEDICAL Biomedical Signal Processing and Control Pub Date : 2024-11-27 DOI:10.1016/j.bspc.2024.107263
Si-Yuan Lu , Yu-Dong Zhang , Yu-Dong Yao
{"title":"An efficient vision transformer for Alzheimer’s disease classification using magnetic resonance images","authors":"Si-Yuan Lu ,&nbsp;Yu-Dong Zhang ,&nbsp;Yu-Dong Yao","doi":"10.1016/j.bspc.2024.107263","DOIUrl":null,"url":null,"abstract":"<div><div>Alzheimer’s disease (AD) is the most common dementia that is often seen among the elderly. AD can cause the loss of cognitive ability and memory, which can result in death as AD is progressive. The exact cause of AD is still in research, but it is believed to be related to genes, diet, and environment. One observation of AD is the shrinkage of the hippocampus and frontal lobe cortex. Magnetic resonance imaging (MRI) is often employed in the diagnosis of AD as it can produce clear images of the soft tissues. In this study, a new computer-aided diagnosis (CAD) method named RanCom-ViT, is proposed to interpret the brain MRI slices automatically and precisely for AD diagnosis with better global representation learning and efficiency. A pre-trained vision transformer (ViT) is chosen as the backbone because ViTs with attention modules can achieve better performance than convolutional neural networks on larger datasets. Then, a novel token compression block is proposed to improve the efficiency of the RanCom-ViT by removing the less important tokens. Further, the classification head of the RanCom-ViT is enhanced by a random vector functional-link structure to obtain better classification performance in AD diagnosis. A large public brain MRI dataset is utilized in the evaluation experiments of the proposed RanCom-ViT, and it achieved an overall accuracy of 99.54% with a double throughput than the benchmark model. The results reveal that the RanCom-ViT outperforms several existing state-of-the-art AD diagnosis methods in terms of accuracy, and the token compression method contributes to higher efficiency.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"101 ","pages":"Article 107263"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424013211","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Alzheimer’s disease (AD) is the most common dementia that is often seen among the elderly. AD can cause the loss of cognitive ability and memory, which can result in death as AD is progressive. The exact cause of AD is still in research, but it is believed to be related to genes, diet, and environment. One observation of AD is the shrinkage of the hippocampus and frontal lobe cortex. Magnetic resonance imaging (MRI) is often employed in the diagnosis of AD as it can produce clear images of the soft tissues. In this study, a new computer-aided diagnosis (CAD) method named RanCom-ViT, is proposed to interpret the brain MRI slices automatically and precisely for AD diagnosis with better global representation learning and efficiency. A pre-trained vision transformer (ViT) is chosen as the backbone because ViTs with attention modules can achieve better performance than convolutional neural networks on larger datasets. Then, a novel token compression block is proposed to improve the efficiency of the RanCom-ViT by removing the less important tokens. Further, the classification head of the RanCom-ViT is enhanced by a random vector functional-link structure to obtain better classification performance in AD diagnosis. A large public brain MRI dataset is utilized in the evaluation experiments of the proposed RanCom-ViT, and it achieved an overall accuracy of 99.54% with a double throughput than the benchmark model. The results reveal that the RanCom-ViT outperforms several existing state-of-the-art AD diagnosis methods in terms of accuracy, and the token compression method contributes to higher efficiency.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用磁共振图像进行阿尔茨海默病分类的高效视觉转换器
阿尔茨海默病(AD)是老年人最常见的痴呆症。阿兹海默病会导致认知能力和记忆力丧失,由于阿兹海默病是渐进性的,因此可能导致死亡。阿尔茨海默病的确切病因仍在研究中,但相信与基因、饮食和环境有关。对注意力缺失症的一个观察结果是海马体和额叶皮层的萎缩。由于磁共振成像(MRI)可以产生清晰的软组织图像,因此经常被用于诊断注意力缺失症。本研究提出了一种名为 "RanCom-ViT "的新型计算机辅助诊断(CAD)方法,可自动、精确地解读脑部核磁共振成像切片,用于诊断注意力缺失症,并具有更好的全局表示学习能力和效率。之所以选择预先训练好的视觉变换器(ViT)作为骨干,是因为带有注意力模块的视觉变换器能在更大的数据集上获得比卷积神经网络更好的性能。然后,我们提出了一个新颖的标记压缩块,通过去除不太重要的标记来提高 RanCom-ViT 的效率。此外,通过随机向量功能链接结构增强了 RanCom-ViT 的分类头,从而在 AD 诊断中获得更好的分类性能。在对所提出的 RanCom-ViT 进行评估实验时,使用了一个大型公共脑核磁共振数据集,其总体准确率达到 99.54%,吞吐量是基准模型的两倍。结果表明,RanCom-ViT 在准确率方面优于现有的几种最先进的注意力缺失诊断方法,令牌压缩方法有助于提高效率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Biomedical Signal Processing and Control
Biomedical Signal Processing and Control 工程技术-工程:生物医学
CiteScore
9.80
自引率
13.70%
发文量
822
审稿时长
4 months
期刊介绍: Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management. Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.
期刊最新文献
Efficient contact-based registration for minimally invasive anterior hip arthroplasty A dense kernel point convolutional neural network for chronic liver disease classification with hybrid chaotic slime mould and giant trevally optimizer A study of complex network features for electrocardiograms and its Applications in atrial fibrillation recognition Frequency information enhanced half instance normalization network for denoising electrocardiograms Editorial Board
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1