Chong Yang She, Wen Ying Fan, Yun Yun Li, Yong Tao, Zu Fei Li
{"title":"Predicting Diabetic Retinopathy Using a Machine Learning Approach Informed by Whole-Exome Sequencing Studies.","authors":"Chong Yang She, Wen Ying Fan, Yun Yun Li, Yong Tao, Zu Fei Li","doi":"10.3967/bes2025.002","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To establish and validate a novel diabetic retinopathy (DR) risk-prediction model using a whole-exome sequencing (WES)-based machine learning (ML) method.</p><p><strong>Methods: </strong>WES was performed to identify potential single nucleotide polymorphism (SNP) or mutation sites in a DR pedigree comprising 10 members. A prediction model was established and validated in a cohort of 420 type 2 diabetic patients based on both genetic and demographic features. The contribution of each feature was assessed using Shapley Additive explanation analysis. The efficacies of the models with and without SNP were compared.</p><p><strong>Results: </strong>WES revealed that seven SNPs/mutations ( <i>rs116911833 in TRIM7, 1997T>C in LRBA, 1643T>C in PRMT10,</i> rs117858678 in <i>C9orf152</i>, <i>rs201922794 in CLDN25</i>, <i>rs146694895 in SH3GLB2</i>, and <i>rs201407189 in FANCC</i>) were associated with DR. Notably, the model including <i>rs146694895</i> <i>and rs201407189</i> achieved better performance in predicting DR (accuracy: 80.2%; sensitivity: 83.3%; specificity: 76.7%; area under the receiver operating characteristic curve [AUC]: 80.0%) than the model without these SNPs (accuracy: 79.4%; sensitivity: 80.3%; specificity: 78.3%; AUC: 79.3%).</p><p><strong>Conclusion: </strong>Novel SNP sites associated with DR were identified in the DR pedigree. Inclusion of <i>rs146694895</i> and <i>rs201407189</i> significantly enhanced the performance of the ML-based DR prediction model.</p>","PeriodicalId":93903,"journal":{"name":"Biomedical and environmental sciences : BES","volume":"38 1","pages":"67-78"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical and environmental sciences : BES","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3967/bes2025.002","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To establish and validate a novel diabetic retinopathy (DR) risk-prediction model using a whole-exome sequencing (WES)-based machine learning (ML) method.
Methods: WES was performed to identify potential single nucleotide polymorphism (SNP) or mutation sites in a DR pedigree comprising 10 members. A prediction model was established and validated in a cohort of 420 type 2 diabetic patients based on both genetic and demographic features. The contribution of each feature was assessed using Shapley Additive explanation analysis. The efficacies of the models with and without SNP were compared.
Results: WES revealed that seven SNPs/mutations ( rs116911833 in TRIM7, 1997T>C in LRBA, 1643T>C in PRMT10, rs117858678 in C9orf152, rs201922794 in CLDN25, rs146694895 in SH3GLB2, and rs201407189 in FANCC) were associated with DR. Notably, the model including rs146694895and rs201407189 achieved better performance in predicting DR (accuracy: 80.2%; sensitivity: 83.3%; specificity: 76.7%; area under the receiver operating characteristic curve [AUC]: 80.0%) than the model without these SNPs (accuracy: 79.4%; sensitivity: 80.3%; specificity: 78.3%; AUC: 79.3%).
Conclusion: Novel SNP sites associated with DR were identified in the DR pedigree. Inclusion of rs146694895 and rs201407189 significantly enhanced the performance of the ML-based DR prediction model.