Rui Li , Eric Harshfield , Steven Bell , Michael Burkhart , Anil Tuladhar , Saima Hilal , Daniel Tozer , Francesca Chappell , Stephen Makin , Jessica Lo , Joanna Wardlaw , Frank-Erik de Leeuw , Christopher Chen , Zoe Kourtzi , Hugh Markus
{"title":"预测脑小血管疾病中的痴呆症:机器学习与传统统计模型的比较","authors":"Rui Li , Eric Harshfield , Steven Bell , Michael Burkhart , Anil Tuladhar , Saima Hilal , Daniel Tozer , Francesca Chappell , Stephen Makin , Jessica Lo , Joanna Wardlaw , Frank-Erik de Leeuw , Christopher Chen , Zoe Kourtzi , Hugh Markus","doi":"10.1016/j.cccb.2024.100235","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><p>Cerebral small vessel disease (SVD) contributes to 45% of dementia cases worldwide. Only a minority of SVD patients develop dementia, yet we lack a reliable model for predicting incident dementia in SVD. Most attempts to date have relied on traditional statistical approaches, whereas machine learning (ML) methods are increasingly used for clinical prediction in other settings.</p></div><div><h3>Methods</h3><p>We investigated whether ML methods improved prediction of incident dementia in SVD over traditional statistical. We included three cohorts with varying SVD severity (RUN DMC, n=503; SCANS, n=121; HARMONISATION, n=265). Baseline demographics, vascular risk factors, cognitive scores, and MRI features of SVD were used for prediction. We conducted both survival analysis and classification analysis predicting 3-year dementia risk. For each analysis, several ML methods were evaluated against standard Cox or logistic regression. Finally, we compared the feature importance ranking by different models.</p></div><div><h3>Results</h3><p>We included 789 participants without missing data in the survival analysis, among whom 108 (13.7%) developed dementia during a median (IQR) follow-up period of 5.4 (4.1, 8.7) years. After excluding those censored before three years, we included 750 participants in the classification analysis, among whom 48 (6.4%) developed dementia by year 3. Comparing statistical and ML models, only the regularised Cox/logistic regression models outperformed their statistical counterparts overall, but not significantly so in survival analysis. Baseline cognitive scores were highly predictive, and all methods ranked global cognition as the most important feature.</p></div><div><h3>Discussion</h3><p>ML survival or classification models brought little improvement over traditional statistical approaches in predicting incident dementia in SVD. ML approaches may be better suited to prediction problems using a larger number of input variables.</p></div>","PeriodicalId":72549,"journal":{"name":"Cerebral circulation - cognition and behavior","volume":"6 ","pages":"Article 100235"},"PeriodicalIF":1.9000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666245024000369/pdfft?md5=1a8a1ebdfff1743e3d65cbda2e2d4fe6&pid=1-s2.0-S2666245024000369-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Predicting Incident Dementia in Cerebral Small Vessel Disease: Comparison of Machine Learning and Traditional Statistical Models\",\"authors\":\"Rui Li , Eric Harshfield , Steven Bell , Michael Burkhart , Anil Tuladhar , Saima Hilal , Daniel Tozer , Francesca Chappell , Stephen Makin , Jessica Lo , Joanna Wardlaw , Frank-Erik de Leeuw , Christopher Chen , Zoe Kourtzi , Hugh Markus\",\"doi\":\"10.1016/j.cccb.2024.100235\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Introduction</h3><p>Cerebral small vessel disease (SVD) contributes to 45% of dementia cases worldwide. Only a minority of SVD patients develop dementia, yet we lack a reliable model for predicting incident dementia in SVD. Most attempts to date have relied on traditional statistical approaches, whereas machine learning (ML) methods are increasingly used for clinical prediction in other settings.</p></div><div><h3>Methods</h3><p>We investigated whether ML methods improved prediction of incident dementia in SVD over traditional statistical. We included three cohorts with varying SVD severity (RUN DMC, n=503; SCANS, n=121; HARMONISATION, n=265). Baseline demographics, vascular risk factors, cognitive scores, and MRI features of SVD were used for prediction. We conducted both survival analysis and classification analysis predicting 3-year dementia risk. For each analysis, several ML methods were evaluated against standard Cox or logistic regression. Finally, we compared the feature importance ranking by different models.</p></div><div><h3>Results</h3><p>We included 789 participants without missing data in the survival analysis, among whom 108 (13.7%) developed dementia during a median (IQR) follow-up period of 5.4 (4.1, 8.7) years. After excluding those censored before three years, we included 750 participants in the classification analysis, among whom 48 (6.4%) developed dementia by year 3. Comparing statistical and ML models, only the regularised Cox/logistic regression models outperformed their statistical counterparts overall, but not significantly so in survival analysis. Baseline cognitive scores were highly predictive, and all methods ranked global cognition as the most important feature.</p></div><div><h3>Discussion</h3><p>ML survival or classification models brought little improvement over traditional statistical approaches in predicting incident dementia in SVD. ML approaches may be better suited to prediction problems using a larger number of input variables.</p></div>\",\"PeriodicalId\":72549,\"journal\":{\"name\":\"Cerebral circulation - cognition and behavior\",\"volume\":\"6 \",\"pages\":\"Article 100235\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666245024000369/pdfft?md5=1a8a1ebdfff1743e3d65cbda2e2d4fe6&pid=1-s2.0-S2666245024000369-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Cerebral circulation - cognition and behavior\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666245024000369\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cerebral circulation - cognition and behavior","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666245024000369","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
Predicting Incident Dementia in Cerebral Small Vessel Disease: Comparison of Machine Learning and Traditional Statistical Models
Introduction
Cerebral small vessel disease (SVD) contributes to 45% of dementia cases worldwide. Only a minority of SVD patients develop dementia, yet we lack a reliable model for predicting incident dementia in SVD. Most attempts to date have relied on traditional statistical approaches, whereas machine learning (ML) methods are increasingly used for clinical prediction in other settings.
Methods
We investigated whether ML methods improved prediction of incident dementia in SVD over traditional statistical. We included three cohorts with varying SVD severity (RUN DMC, n=503; SCANS, n=121; HARMONISATION, n=265). Baseline demographics, vascular risk factors, cognitive scores, and MRI features of SVD were used for prediction. We conducted both survival analysis and classification analysis predicting 3-year dementia risk. For each analysis, several ML methods were evaluated against standard Cox or logistic regression. Finally, we compared the feature importance ranking by different models.
Results
We included 789 participants without missing data in the survival analysis, among whom 108 (13.7%) developed dementia during a median (IQR) follow-up period of 5.4 (4.1, 8.7) years. After excluding those censored before three years, we included 750 participants in the classification analysis, among whom 48 (6.4%) developed dementia by year 3. Comparing statistical and ML models, only the regularised Cox/logistic regression models outperformed their statistical counterparts overall, but not significantly so in survival analysis. Baseline cognitive scores were highly predictive, and all methods ranked global cognition as the most important feature.
Discussion
ML survival or classification models brought little improvement over traditional statistical approaches in predicting incident dementia in SVD. ML approaches may be better suited to prediction problems using a larger number of input variables.