Identification of Parkinson's disease using MRI and genetic data from the PPMI cohort: an improved machine learning fusion approach.

IF 4.5 2区 医学 Q2 GERIATRICS & GERONTOLOGY Frontiers in Aging Neuroscience Pub Date : 2025-02-04 eCollection Date: 2025-01-01 DOI:10.3389/fnagi.2025.1510192
Yifeng Yang, Liangyun Hu, Yang Chen, Weidong Gu, Guangwu Lin, YuanZhong Xie, Shengdong Nie
{"title":"Identification of Parkinson's disease using MRI and genetic data from the PPMI cohort: an improved machine learning fusion approach.","authors":"Yifeng Yang, Liangyun Hu, Yang Chen, Weidong Gu, Guangwu Lin, YuanZhong Xie, Shengdong Nie","doi":"10.3389/fnagi.2025.1510192","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aim to leverage advanced machine learning techniques to develop and validate novel MRI imaging features and single nucleotide polymorphism (SNP) gene data fusion methodologies to enhance the early identification and diagnosis of Parkinson's disease (PD).</p><p><strong>Methods: </strong>We leveraged a comprehensive dataset from the Parkinson's Progression Markers Initiative (PPMI), which includes high-resolution neuroimaging data, genetic single-nucleotide polymorphism (SNP) profiles, and detailed clinical information from individuals with early-stage PD and healthy controls. Two multi-modal fusion strategies were used: feature-level fusion, where we employed a hybrid feature selection algorithm combining Fisher discriminant analysis, an ensemble Lasso (EnLasso) method, and partial least squares (PLS) regression to identify and integrate the most informative features from neuroimaging and genetic data; and decision-level fusion, where we developed an adaptive ensemble stacking (AE_Stacking) model to synergistically integrate the predictions from multiple base classifiers trained on individual modalities.</p><p><strong>Results: </strong>The AE_Stacking model achieving the highest average balanced accuracy of 95.36% and an area under the receiver operating characteristic curve (AUC) of 0.974, significantly outperforming feature-level fusion and single-modal models (<i>p</i> < 0.05). Furthermore, by analyzing the features selected across multiple iterations of our models, we identified stable brain region features [lh 6r (FD) and rh 46 (GI)] and key genetic markers (rs356181 and rs2736990 SNPs within the SNCA gene region; rs213202 SNP within the VPS52 gene region), highlighting their potential as reliable early diagnostic indicators for the disease.</p><p><strong>Conclusion: </strong>The AE_Stacking model, trained on MRI and genetic data, demonstrates potential in distinguishing individuals with PD. Our findings enhance understanding of the disease and advance us toward the goal of precision medicine for neurodegenerative disorder.</p>","PeriodicalId":12450,"journal":{"name":"Frontiers in Aging Neuroscience","volume":"17 ","pages":"1510192"},"PeriodicalIF":4.5000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11832485/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Aging Neuroscience","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/fnagi.2025.1510192","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study aim to leverage advanced machine learning techniques to develop and validate novel MRI imaging features and single nucleotide polymorphism (SNP) gene data fusion methodologies to enhance the early identification and diagnosis of Parkinson's disease (PD).

Methods: We leveraged a comprehensive dataset from the Parkinson's Progression Markers Initiative (PPMI), which includes high-resolution neuroimaging data, genetic single-nucleotide polymorphism (SNP) profiles, and detailed clinical information from individuals with early-stage PD and healthy controls. Two multi-modal fusion strategies were used: feature-level fusion, where we employed a hybrid feature selection algorithm combining Fisher discriminant analysis, an ensemble Lasso (EnLasso) method, and partial least squares (PLS) regression to identify and integrate the most informative features from neuroimaging and genetic data; and decision-level fusion, where we developed an adaptive ensemble stacking (AE_Stacking) model to synergistically integrate the predictions from multiple base classifiers trained on individual modalities.

Results: The AE_Stacking model achieving the highest average balanced accuracy of 95.36% and an area under the receiver operating characteristic curve (AUC) of 0.974, significantly outperforming feature-level fusion and single-modal models (p < 0.05). Furthermore, by analyzing the features selected across multiple iterations of our models, we identified stable brain region features [lh 6r (FD) and rh 46 (GI)] and key genetic markers (rs356181 and rs2736990 SNPs within the SNCA gene region; rs213202 SNP within the VPS52 gene region), highlighting their potential as reliable early diagnostic indicators for the disease.

Conclusion: The AE_Stacking model, trained on MRI and genetic data, demonstrates potential in distinguishing individuals with PD. Our findings enhance understanding of the disease and advance us toward the goal of precision medicine for neurodegenerative disorder.

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
利用MRI和来自PPMI队列的遗传数据识别帕金森病:一种改进的机器学习融合方法
目的:本研究旨在利用先进的机器学习技术开发和验证新的MRI成像特征和单核苷酸多态性(SNP)基因数据融合方法,以增强帕金森病(PD)的早期识别和诊断。方法:我们利用了来自帕金森进展标记计划(PPMI)的综合数据集,其中包括高分辨率神经成像数据,遗传单核苷酸多态性(SNP)谱,以及早期PD患者和健康对照者的详细临床信息。使用了两种多模式融合策略:特征级融合,其中我们采用混合特征选择算法,结合Fisher判别分析,集成Lasso (EnLasso)方法和偏最小二乘(PLS)回归来识别和整合来自神经影像学和遗传数据的最具信息量的特征;决策级融合,其中我们开发了一个自适应集成堆叠(AE_Stacking)模型,以协同集成来自多个基于单个模式训练的基分类器的预测。结果:AE_Stacking模型的平均平衡精度最高,达到95.36%,受试者工作特征曲线下面积(AUC)为0.974,显著优于特征级融合和单模态模型(p < 0.05)。此外,通过分析我们的模型在多次迭代中选择的特征,我们确定了稳定的脑区域特征[lh 6r (FD)和rh 46 (GI)]和关键遗传标记(SNCA基因区域内的rs356181和rs2736990 snp;rs213202 SNP位于VPS52基因区域内),突出了它们作为该疾病可靠的早期诊断指标的潜力。结论:基于MRI和遗传数据训练的AE_Stacking模型在区分PD个体方面具有潜力。我们的发现增强了对这种疾病的理解,并推动我们朝着神经退行性疾病的精准医学目标迈进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Frontiers in Aging Neuroscience
Frontiers in Aging Neuroscience GERIATRICS & GERONTOLOGY-NEUROSCIENCES
CiteScore
6.30
自引率
8.30%
发文量
1426
期刊介绍: Frontiers in Aging Neuroscience is a leading journal in its field, publishing rigorously peer-reviewed research that advances our understanding of the mechanisms of Central Nervous System aging and age-related neural diseases. Specialty Chief Editor Thomas Wisniewski at the New York University School of Medicine is supported by an outstanding Editorial Board of international researchers. This multidisciplinary open-access journal is at the forefront of disseminating and communicating scientific knowledge and impactful discoveries to researchers, academics, clinicians and the public worldwide.
期刊最新文献
Rethinking cervical deep lymphovenous anastomosis in Alzheimer's disease: problems and prospects. Efficacy and safety of robot-assisted vs. stereotactic framework deep brain stimulation for Parkinson's disease: a retrospective study. The role of glutathione in cognition, cognitive effort, and cognitive endurance in young and older adults. Editorial: Mental, sensory, physical and life style parameters related to cognitive decline in aging. Altered attention and visuospatial abnormalities in Parkinson's disease: links to attention networks and basal ganglia.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1