{"title":"Machine learning techniques for independent gait recovery prediction in acute anterior circulation ischemic stroke.","authors":"Jiangping Ma, Yuanjie Xie","doi":"10.1186/s12984-025-01548-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aimed to develop and validate a machine learning-based predictive model for gait recovery in patients with acute anterior circulation ischemic stroke.</p><p><strong>Methods: </strong>Between May and November 2023, 237 patients with acute anterior circulation ischemic stroke were enrolled. Patients were randomly divided into training and validation sets at a 7:3 ratio. Thirty-one medical characteristics were collected, and the Least Absolute Shrinkage and Selection Operator (LASSO) regression was applied to screen predictor variables. Predictive models were developed using the Random Survival Forest (RSF) and COX regression methods. The optimal model was identified based on C-index values. The SHapley Additive exPlanations (SHAP) method was employed to interpret the RSF model globally and locally.</p><p><strong>Results: </strong>Ten predictors were identified through LASSO regression, including age, gender, periventricular white matter hyperintensities (PVWMH), Montreal Cognitive Assessment (MoCA), National Institutes of Health Stroke Scale (NIHSS), enlarged perivascular spaces in basal ganglia (BG-EPVS), lacunes, parietal infarction, basal ganglia infarction, and Timed Up & Go (TUG) test score. The C-index values of the COX regression and RSF models were 0.741 and 0.761 in the training set and 0.705 and 0.725 in the validation set, respectively. SHAP analysis of the RSF model identified BG-EPVS, TUG, MoCA, age, and PVWMH as the top five most influential predictors of gait recovery.</p><p><strong>Conclusion: </strong>The RSF model demonstrated superior performance to the COX regression model in predicting gait recovery, offering a reliable tool for clinical decision-making regarding stroke patients' prognoses.</p>","PeriodicalId":16384,"journal":{"name":"Journal of NeuroEngineering and Rehabilitation","volume":"22 1","pages":"19"},"PeriodicalIF":5.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11786359/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of NeuroEngineering and Rehabilitation","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1186/s12984-025-01548-5","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: This study aimed to develop and validate a machine learning-based predictive model for gait recovery in patients with acute anterior circulation ischemic stroke.
Methods: Between May and November 2023, 237 patients with acute anterior circulation ischemic stroke were enrolled. Patients were randomly divided into training and validation sets at a 7:3 ratio. Thirty-one medical characteristics were collected, and the Least Absolute Shrinkage and Selection Operator (LASSO) regression was applied to screen predictor variables. Predictive models were developed using the Random Survival Forest (RSF) and COX regression methods. The optimal model was identified based on C-index values. The SHapley Additive exPlanations (SHAP) method was employed to interpret the RSF model globally and locally.
Results: Ten predictors were identified through LASSO regression, including age, gender, periventricular white matter hyperintensities (PVWMH), Montreal Cognitive Assessment (MoCA), National Institutes of Health Stroke Scale (NIHSS), enlarged perivascular spaces in basal ganglia (BG-EPVS), lacunes, parietal infarction, basal ganglia infarction, and Timed Up & Go (TUG) test score. The C-index values of the COX regression and RSF models were 0.741 and 0.761 in the training set and 0.705 and 0.725 in the validation set, respectively. SHAP analysis of the RSF model identified BG-EPVS, TUG, MoCA, age, and PVWMH as the top five most influential predictors of gait recovery.
Conclusion: The RSF model demonstrated superior performance to the COX regression model in predicting gait recovery, offering a reliable tool for clinical decision-making regarding stroke patients' prognoses.
目的:本研究旨在开发并验证基于机器学习的急性前循环缺血性脑卒中患者步态恢复预测模型。方法:2023年5月至11月,237例急性前循环缺血性脑卒中患者入组。患者按7:3的比例随机分为训练组和验证组。收集31个医学特征,应用最小绝对收缩和选择算子(LASSO)回归筛选预测变量。采用随机生存森林(RSF)和COX回归方法建立预测模型。根据c -指标值确定最优模型。采用SHapley加性解释(SHAP)方法对RSF模型进行全局和局部解释。结果:通过LASSO回归确定了10个预测因素,包括年龄、性别、心室周围白质高信号(PVWMH)、蒙特利尔认知评估(MoCA)、美国国立卫生研究院卒中量表(NIHSS)、基底节区血管周围间隙增大(BG-EPVS)、凹窝、顶叶梗死、基底节区梗死、Timed Up & Go (TUG)测试分数。COX回归模型和RSF模型在训练集中的c指数值分别为0.741和0.761,在验证集中的c指数值分别为0.705和0.725。RSF模型的SHAP分析确定BG-EPVS、TUG、MoCA、年龄和PVWMH是步态恢复的前五大最具影响力的预测因子。结论:RSF模型在预测脑卒中患者步态恢复方面优于COX回归模型,为脑卒中患者预后的临床决策提供了可靠的工具。
期刊介绍:
Journal of NeuroEngineering and Rehabilitation considers manuscripts on all aspects of research that result from cross-fertilization of the fields of neuroscience, biomedical engineering, and physical medicine & rehabilitation.