Ngoc Truc Ngan Ho, Paulina Gonzalez, Gideon K. Gogovi
{"title":"Writing the Signs: An Explainable Machine Learning Approach for Alzheimer's Disease Classification from Handwriting","authors":"Ngoc Truc Ngan Ho, Paulina Gonzalez, Gideon K. Gogovi","doi":"10.1049/htl2.70006","DOIUrl":null,"url":null,"abstract":"<p>Alzheimer's disease is a global health challenge, emphasizing the need for early detection to enable timely intervention and improve outcomes. This study analyzes handwriting data from individuals with and without Alzheimer's to identify predictive features across copying, graphic and memory-based tasks. Machine learning models, including Random Forest, Bootstrap Aggregating (Bagging), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Adaptive Boosting (AdaBoost) and Gradient Boosting, were applied to classify patients, with SHapley Additive exPlanations (SHAP) enhancing model interpretability. Time-related features were crucial in copying and graphic tasks, reflecting cognitive processing speed, while pressure-related features were significant in memory tasks, indicating recall confidence. Simpler graphic tasks showed strong discriminatory power, aiding early detection. Performance metrics demonstrated model effectiveness: For memory tasks, Random Forest achieved the highest accuracy (<span></span><math>\n <semantics>\n <mrow>\n <mn>0.840</mn>\n <mo>±</mo>\n <mn>0.038</mn>\n </mrow>\n <annotation>$0.840 \\pm 0.038$</annotation>\n </semantics></math>), while Bagged SVC was the lowest (<span></span><math>\n <semantics>\n <mrow>\n <mn>0.617</mn>\n <mo>±</mo>\n <mn>0.046</mn>\n </mrow>\n <annotation>$0.617 \\pm 0.046$</annotation>\n </semantics></math>). Copying tasks recorded a peak accuracy of <span></span><math>\n <semantics>\n <mrow>\n <mn>0.804</mn>\n <mo>±</mo>\n <mn>0.075</mn>\n </mrow>\n <annotation>$0.804 \\pm 0.075$</annotation>\n </semantics></math> with Gradient Boost and a low of <span></span><math>\n <semantics>\n <mrow>\n <mn>0.566</mn>\n <mo>±</mo>\n <mn>0.032</mn>\n </mrow>\n <annotation>$0.566 \\pm 0.032$</annotation>\n </semantics></math> for Bagged SVC. Graphic tasks reached <span></span><math>\n <semantics>\n <mrow>\n <mn>0.799</mn>\n <mo>±</mo>\n <mn>0.041</mn>\n </mrow>\n <annotation>$0.799 \\pm 0.041$</annotation>\n </semantics></math> with Gradient Boost and 0.643 ± 0.071 with AdaBoost. For all tasks combined, Random Forest excelled (<span></span><math>\n <semantics>\n <mrow>\n <mn>0.854</mn>\n <mo>±</mo>\n <mn>0.033</mn>\n </mrow>\n <annotation>$0.854 \\pm 0.033$</annotation>\n </semantics></math>), while Gradient Boost performed worst (<span></span><math>\n <semantics>\n <mrow>\n <mn>0.598</mn>\n <mo>±</mo>\n <mn>0.151</mn>\n </mrow>\n <annotation>$0.598 \\pm 0.151$</annotation>\n </semantics></math>). These results highlight handwriting analysis's potential in Alzheimer's detection.</p>","PeriodicalId":37474,"journal":{"name":"Healthcare Technology Letters","volume":"12 1","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-02-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1049/htl2.70006","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Healthcare Technology Letters","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1049/htl2.70006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Alzheimer's disease is a global health challenge, emphasizing the need for early detection to enable timely intervention and improve outcomes. This study analyzes handwriting data from individuals with and without Alzheimer's to identify predictive features across copying, graphic and memory-based tasks. Machine learning models, including Random Forest, Bootstrap Aggregating (Bagging), Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), Adaptive Boosting (AdaBoost) and Gradient Boosting, were applied to classify patients, with SHapley Additive exPlanations (SHAP) enhancing model interpretability. Time-related features were crucial in copying and graphic tasks, reflecting cognitive processing speed, while pressure-related features were significant in memory tasks, indicating recall confidence. Simpler graphic tasks showed strong discriminatory power, aiding early detection. Performance metrics demonstrated model effectiveness: For memory tasks, Random Forest achieved the highest accuracy (), while Bagged SVC was the lowest (). Copying tasks recorded a peak accuracy of with Gradient Boost and a low of for Bagged SVC. Graphic tasks reached with Gradient Boost and 0.643 ± 0.071 with AdaBoost. For all tasks combined, Random Forest excelled (), while Gradient Boost performed worst (). These results highlight handwriting analysis's potential in Alzheimer's detection.
期刊介绍:
Healthcare Technology Letters aims to bring together an audience of biomedical and electrical engineers, physical and computer scientists, and mathematicians to enable the exchange of the latest ideas and advances through rapid online publication of original healthcare technology research. Major themes of the journal include (but are not limited to): Major technological/methodological areas: Biomedical signal processing Biomedical imaging and image processing Bioinstrumentation (sensors, wearable technologies, etc) Biomedical informatics Major application areas: Cardiovascular and respiratory systems engineering Neural engineering, neuromuscular systems Rehabilitation engineering Bio-robotics, surgical planning and biomechanics Therapeutic and diagnostic systems, devices and technologies Clinical engineering Healthcare information systems, telemedicine, mHealth.