{"title":"Identifying Parkinson’s Disease using Multimodal Approach and Deep Learning","authors":"Mahsa Mohaghegh, Jaya Gascon","doi":"10.1109/citisia53721.2021.9719945","DOIUrl":null,"url":null,"abstract":"Parkinson’s disease is a progressive neurological disease resulting in motor impairments that decrease patients’ quality of life. It is a condition currently without a cure. This study proposes an approach to Parkinson’s disease detection using multimodal analysis, combining handwriting and voice data. We introduce the use of image transformer architectures to classify Parkinson’s disease patients from healthy subjects, as early diagnosis of Parkinson’s disease contributes to the management of motor symptoms. Data-efficient image transformer with self-supervised learning on DINO obtained an accuracy of above 90% on a combination of spiral and meander drawings from the NewHandPD dataset. In comparison, an audio spectrogram transformer obtained an accuracy of above 80% on the sustained vowel phonations of /a/ and /o/ from the PC-GITA corpus. This work considers using a multimodal approach in identifying Parkinson’s disease and the usability of transformer architectures in image and audio spectrogram classification tasks.","PeriodicalId":252063,"journal":{"name":"2021 6th International Conference on Innovative Technology in Intelligent System and Industrial Applications (CITISIA)","volume":"304 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 6th International Conference on Innovative Technology in Intelligent System and Industrial Applications (CITISIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/citisia53721.2021.9719945","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Parkinson’s disease is a progressive neurological disease resulting in motor impairments that decrease patients’ quality of life. It is a condition currently without a cure. This study proposes an approach to Parkinson’s disease detection using multimodal analysis, combining handwriting and voice data. We introduce the use of image transformer architectures to classify Parkinson’s disease patients from healthy subjects, as early diagnosis of Parkinson’s disease contributes to the management of motor symptoms. Data-efficient image transformer with self-supervised learning on DINO obtained an accuracy of above 90% on a combination of spiral and meander drawings from the NewHandPD dataset. In comparison, an audio spectrogram transformer obtained an accuracy of above 80% on the sustained vowel phonations of /a/ and /o/ from the PC-GITA corpus. This work considers using a multimodal approach in identifying Parkinson’s disease and the usability of transformer architectures in image and audio spectrogram classification tasks.