{"title":"Determinants of Visual Impairment Among Chinese Middle-Aged and Older Adults: Risk Prediction Model Using Machine Learning Algorithms.","authors":"Lijun Mao, Zhen Yu, Luotao Lin, Manoj Sharma, Hualing Song, Hailei Zhao, Xianglong Xu","doi":"10.2196/59810","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Visual impairment (VI) is a prevalent global health issue, affecting over 2.2 billion people worldwide, with nearly half of the Chinese population aged 60 years and older being affected. Early detection of high-risk VI is essential for preventing irreversible vision loss among Chinese middle-aged and older adults. While machine learning (ML) algorithms exhibit significant predictive advantages, their application in predicting VI risk among the general middle-aged and older adult population in China remains limited.</p><p><strong>Objective: </strong>This study aimed to predict VI and identify its determinants using ML algorithms.</p><p><strong>Methods: </strong>We used 19,047 participants from 4 waves of the China Health and Retirement Longitudinal Study (CHARLS) that were conducted between 2011 and 2018. To envisage the prevalence of VI, we generated a geographical distribution map. Additionally, we constructed a model using indicators of a self-reported questionnaire, a physical examination, and blood biomarkers as predictors. Multiple ML algorithms, including gradient boosting machine, distributed random forest, the generalized linear model, deep learning, and stacked ensemble, were used for prediction. We plotted receiver operating characteristic and calibration curves to assess the predictive performance. Variable importance analysis was used to identify key predictors.</p><p><strong>Results: </strong>Among all participants, 33.9% (6449/19,047) had VI. Qinghai, Chongqing, Anhui, and Sichuan showed the highest VI rates, while Beijing and Xinjiang had the lowest. The generalized linear model, gradient boosting machine, and stacked ensemble achieved acceptable area under curve values of 0.706, 0.710, and 0.715, respectively, with the stacked ensemble performing best. Key predictors included hearing impairment, self-expectation of health status, pain, age, hand grip strength, depression, night sleep duration, high-density lipoprotein cholesterol, and arthritis or rheumatism.</p><p><strong>Conclusions: </strong>Nearly one-third of middle-aged and older adults in China had VI. The prevalence of VI shows regional variations, but there are no distinct east-west or north-south distribution differences. ML algorithms demonstrate accurate predictive capabilities for VI. The combination of prediction models and variable importance analysis provides valuable insights for the early identification and intervention of VI among Chinese middle-aged and older adults.</p>","PeriodicalId":36245,"journal":{"name":"JMIR Aging","volume":"7 ","pages":"e59810"},"PeriodicalIF":5.0000,"publicationDate":"2024-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11481821/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Aging","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/59810","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Visual impairment (VI) is a prevalent global health issue, affecting over 2.2 billion people worldwide, with nearly half of the Chinese population aged 60 years and older being affected. Early detection of high-risk VI is essential for preventing irreversible vision loss among Chinese middle-aged and older adults. While machine learning (ML) algorithms exhibit significant predictive advantages, their application in predicting VI risk among the general middle-aged and older adult population in China remains limited.
Objective: This study aimed to predict VI and identify its determinants using ML algorithms.
Methods: We used 19,047 participants from 4 waves of the China Health and Retirement Longitudinal Study (CHARLS) that were conducted between 2011 and 2018. To envisage the prevalence of VI, we generated a geographical distribution map. Additionally, we constructed a model using indicators of a self-reported questionnaire, a physical examination, and blood biomarkers as predictors. Multiple ML algorithms, including gradient boosting machine, distributed random forest, the generalized linear model, deep learning, and stacked ensemble, were used for prediction. We plotted receiver operating characteristic and calibration curves to assess the predictive performance. Variable importance analysis was used to identify key predictors.
Results: Among all participants, 33.9% (6449/19,047) had VI. Qinghai, Chongqing, Anhui, and Sichuan showed the highest VI rates, while Beijing and Xinjiang had the lowest. The generalized linear model, gradient boosting machine, and stacked ensemble achieved acceptable area under curve values of 0.706, 0.710, and 0.715, respectively, with the stacked ensemble performing best. Key predictors included hearing impairment, self-expectation of health status, pain, age, hand grip strength, depression, night sleep duration, high-density lipoprotein cholesterol, and arthritis or rheumatism.
Conclusions: Nearly one-third of middle-aged and older adults in China had VI. The prevalence of VI shows regional variations, but there are no distinct east-west or north-south distribution differences. ML algorithms demonstrate accurate predictive capabilities for VI. The combination of prediction models and variable importance analysis provides valuable insights for the early identification and intervention of VI among Chinese middle-aged and older adults.
背景:视力损伤(VI)是一个普遍的全球健康问题,影响着全球 22 亿多人,其中近一半的中国 60 岁及以上人口受到影响。要防止中国中老年人出现不可逆转的视力损失,早期发现高风险视力损伤至关重要。虽然机器学习(ML)算法具有显著的预测优势,但其在预测中国中老年人视力减退风险方面的应用仍然有限:本研究旨在利用 ML 算法预测 VI 并确定其决定因素:我们使用了中国健康与退休纵向研究(CHARLS)的 19047 名参与者,这些参与者来自 2011 年至 2018 年间进行的 4 次波次研究。为了解 VI 的流行情况,我们绘制了一张地理分布图。此外,我们还利用自我报告问卷、体检和血液生物标志物指标作为预测因子,构建了一个模型。预测中使用了多种 ML 算法,包括梯度提升机、分布式随机森林、广义线性模型、深度学习和堆叠集合。我们绘制了接收者操作特征曲线和校准曲线来评估预测性能。我们使用变量重要性分析来确定关键预测因子:在所有参与者中,33.9%(6449/19047)的人患有 VI。青海、重庆、安徽和四川的 VI 率最高,而北京和新疆的 VI 率最低。广义线性模型、梯度提升机和堆叠集合的曲线下面积分别为 0.706、0.710 和 0.715,其中堆叠集合的表现最佳。主要预测因素包括听力障碍、对健康状况的自我预期、疼痛、年龄、手部握力、抑郁、夜间睡眠时间、高密度脂蛋白胆固醇以及关节炎或风湿病:结论:中国近三分之一的中老年人患有 VI。结论:中国近三分之一的中老年人患有椎管狭窄,椎管狭窄的患病率存在地区差异,但并不存在明显的东西或南北分布差异。ML 算法显示了对 VI 的准确预测能力。预测模型与变量重要性分析相结合,为中国中老年人VI的早期识别和干预提供了有价值的见解。