Afolabi Salami Alausa, Jose M Sanchez-Bornot, Abdoreza Asadpour, Paula L McClean, KongFatt Wong-Lin
{"title":"Alzheimer's Disease Classification Confidence of Individuals using Deep Learning on Heterogeneous Data","authors":"Afolabi Salami Alausa, Jose M Sanchez-Bornot, Abdoreza Asadpour, Paula L McClean, KongFatt Wong-Lin","doi":"10.1101/2024.08.02.24311397","DOIUrl":null,"url":null,"abstract":"Making accurate diagnosis of Alzheimer's disease (AD) is crucial for effective treatment and management. Although deep learning has been applied to AD classification, it is typically performed at group level, the data used are not sufficiently heterogeneous and comprehensive, and decision confidence is not evaluated at individual (single patient) level. This paper proposed a more practical deep learning approach that not only detects AD stages of individuals, but also provides its corresponding confidence estimation. In particular, in addition to a convolutional neural network (CNN), we incorporated a softmax confidence metric based on the network's output activity to evaluate its classification confidence. Further, we applied this approach to a heterogeneous and comprehensive data that comprised cognitive and functional assessments, tau-PET and MRI neuroimaging, medical/family history, demographic, and APoE genotype. Importantly, we utilised leave-one-out cross-validation to train the CNN and classify an individual's healthy control, mild cognitive impairment or AD state, while concurrently estimating each output decision's confidence. We showed that, over different confidence softmax temperature values, CNN could attain classification accuracies at 83-85% for the three classes while having robust confidence scores of 78-83%. Further improvement in confidence breakdown was achieved using the optimal temperature value in confidence evaluation, with higher confidence scores for correct than error decisions. Overall, the computed classification confidence of an individual may aid clinicians and other stakeholders in understanding the reliability of the model's decision outcome and offer better trust. The implication of this work may extend to other classification applications, in which the confidence level of a single deep learning-based decision can be evaluated.","PeriodicalId":501367,"journal":{"name":"medRxiv - Neurology","volume":"109 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Neurology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.08.02.24311397","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Making accurate diagnosis of Alzheimer's disease (AD) is crucial for effective treatment and management. Although deep learning has been applied to AD classification, it is typically performed at group level, the data used are not sufficiently heterogeneous and comprehensive, and decision confidence is not evaluated at individual (single patient) level. This paper proposed a more practical deep learning approach that not only detects AD stages of individuals, but also provides its corresponding confidence estimation. In particular, in addition to a convolutional neural network (CNN), we incorporated a softmax confidence metric based on the network's output activity to evaluate its classification confidence. Further, we applied this approach to a heterogeneous and comprehensive data that comprised cognitive and functional assessments, tau-PET and MRI neuroimaging, medical/family history, demographic, and APoE genotype. Importantly, we utilised leave-one-out cross-validation to train the CNN and classify an individual's healthy control, mild cognitive impairment or AD state, while concurrently estimating each output decision's confidence. We showed that, over different confidence softmax temperature values, CNN could attain classification accuracies at 83-85% for the three classes while having robust confidence scores of 78-83%. Further improvement in confidence breakdown was achieved using the optimal temperature value in confidence evaluation, with higher confidence scores for correct than error decisions. Overall, the computed classification confidence of an individual may aid clinicians and other stakeholders in understanding the reliability of the model's decision outcome and offer better trust. The implication of this work may extend to other classification applications, in which the confidence level of a single deep learning-based decision can be evaluated.