Longitudinal clustering analysis and prediction of Parkinson's disease progression using radiomics and hybrid machine learning.

IF 2.3, Zone 2 (Medicine), Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING. Quantitative Imaging in Medicine and Surgery. Pub Date: 2022-02-01. DOI: 10.21037/qims-21-425
Mohammad R Salmanpour, Mojtaba Shamsaei, Ghasem Hajianfar, Hamid Soltanian-Zadeh, Arman Rahmim
Pages: 906-919. Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8739095/pdf/qims-12-02-906.pdf
Citations: 15

Abstract

Background: We employed machine learning approaches to (I) determine distinct progression trajectories in Parkinson's disease (PD) (unsupervised clustering task), and (II) predict progression trajectories (supervised prediction task), from early (years 0 and 1) data, making use of clinical and imaging features.

Methods: We studied PD subjects drawn from longitudinal datasets (years 0, 1, 2 and 4; Parkinson's Progression Markers Initiative). We extracted and analyzed 981 features, including motor, non-motor, and radiomics features, the latter extracted for each region of interest (ROI: left/right caudate and putamen) using our standardized environment for radiomics analysis (SERA) software. Segmentation of ROIs on dopamine transporter single-photon emission computed tomography (DAT SPECT) images was performed via magnetic resonance images (MRI). After performing cross-sectional clustering on 885 subjects (original dataset) to identify disease subtypes, we identified optimal longitudinal trajectories using hybrid machine learning systems (HMLSs), including principal component analysis (PCA) + K-means algorithm (KMA) followed by the Bayesian information criterion (BIC), Calinski-Harabasz criterion (CHC), and elbow criterion (EC). Subsequently, prediction of the identified trajectories from early-year data was performed using multiple HMLSs including 16 dimension reduction algorithms (DRAs) and 10 classification algorithms.
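
The PCA + K-means step with criterion-based selection of the number of clusters can be sketched roughly as follows. This is a minimal illustration using scikit-learn on synthetic data, not the authors' implementation: the function name `score_cluster_counts`, the number of principal components, the candidate cluster counts, and the data are all assumptions. Only the Calinski-Harabasz score and the K-means inertia (the quantity usually plotted for an elbow criterion) are computed; BIC is left out because it needs a likelihood model on top of K-means.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA
from sklearn.metrics import calinski_harabasz_score
from sklearn.preprocessing import StandardScaler


def score_cluster_counts(features, k_values=range(2, 7), n_components=5, seed=0):
    """Return {k: (calinski_harabasz, inertia)} for K-means on PCA-reduced data.

    Hypothetical helper: names and defaults are illustrative only.
    """
    scaled = StandardScaler().fit_transform(features)
    reduced = PCA(n_components=n_components, random_state=seed).fit_transform(scaled)
    scores = {}
    for k in k_values:
        km = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(reduced)
        scores[k] = (calinski_harabasz_score(reduced, km.labels_), km.inertia_)
    return scores


# Synthetic stand-in data: 3 well-separated groups of 60 "subjects", 20 features each.
rng = np.random.default_rng(0)
centers = rng.normal(scale=10.0, size=(3, 20))
X = np.vstack([c + rng.normal(size=(60, 20)) for c in centers])

scores = score_cluster_counts(X)
# Pick the cluster count with the highest Calinski-Harabasz score.
best_k = max(scores, key=lambda k: scores[k][0])
```

In practice the inertia values would be inspected alongside the Calinski-Harabasz scores, since the criteria do not always agree on a single cluster count.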

Results: We identified 3 distinct progression trajectories. Hotelling's T-squared test (HTST) showed that the identified trajectories were distinct. The trajectories included those with (I, II) disease escalation (2 trajectories, 27% and 38% of patients) and (III) stable disease (1 trajectory, 35% of patients). For trajectory prediction from early-year data, HMLSs including the stochastic neighbor embedding algorithm (SNEA, as a DRA) as well as the locally linear embedding algorithm (LLEA, as a DRA), linked with the new probabilistic neural network classifier (NPNNC, as a classifier), resulted in accuracies of 78.4% and 79.2%, respectively, while other HMLSs such as SNEA + Lib_SVM (library for support vector machines) and t_SNE (t-distributed stochastic neighbor embedding) + NPNNC resulted in 76.5% and 76.1%, respectively.
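
As an illustration of how a Hotelling's T-squared test compares two groups on several features at once, the sketch below implements the standard two-sample form with a pooled covariance matrix. The abstract does not say which variant or software was used, so the function `hotelling_t2_two_sample` and the synthetic groups are assumptions made for the example.

```python
import numpy as np
from scipy import stats


def hotelling_t2_two_sample(a, b):
    """Two-sample Hotelling's T-squared test with pooled covariance.

    Rows are subjects, columns are features (at least 2 features).
    Returns (t2, f_stat, p_value).
    """
    a, b = np.asarray(a, dtype=float), np.asarray(b, dtype=float)
    n1, n2, p = a.shape[0], b.shape[0], a.shape[1]
    diff = a.mean(axis=0) - b.mean(axis=0)
    pooled = ((n1 - 1) * np.cov(a, rowvar=False)
              + (n2 - 1) * np.cov(b, rowvar=False)) / (n1 + n2 - 2)
    t2 = (n1 * n2) / (n1 + n2) * diff @ np.linalg.solve(pooled, diff)
    # Convert T^2 to an F statistic with (p, n1 + n2 - p - 1) degrees of freedom.
    f_stat = t2 * (n1 + n2 - p - 1) / (p * (n1 + n2 - 2))
    p_value = stats.f.sf(f_stat, p, n1 + n2 - p - 1)
    return t2, f_stat, p_value


# Synthetic example: two groups of 50 subjects, 3 features, means shifted by 2.
rng = np.random.default_rng(0)
group_a = rng.normal(size=(50, 3))
group_b = rng.normal(size=(50, 3)) + 2.0

t2, f_stat, p_value = hotelling_t2_two_sample(group_a, group_b)
```

A small p-value indicates that the mean feature vectors of the two groups differ; with three trajectories, such a test would be applied to each pair.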

Conclusions: This study moves beyond cross-sectional PD subtyping to clustering of longitudinal disease trajectories. We conclude that combining medical information with SPECT-based radiomics features, and optimal utilization of HMLSs, can identify distinct disease trajectories in PD patients, and enable effective prediction of disease trajectories from early year data.
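
A hybrid system of the kind described, a dimension reduction algorithm chained to a classifier, might look like the sketch below. It pairs scikit-learn's locally linear embedding with `SVC` (which is built on LIBSVM) purely for illustration: this pairing, the hyperparameters, and the synthetic three-class data are assumptions, and the NPNNC classifier from the study is not reproduced here.

```python
from sklearn.datasets import make_classification
from sklearn.manifold import LocallyLinearEmbedding
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for early-year features with 3 trajectory labels.
X, y = make_classification(
    n_samples=300, n_features=40, n_informative=10, n_classes=3, random_state=0
)

# Dimension reduction followed by classification, fitted inside each CV fold
# so that the embedding never sees the held-out subjects.
hybrid = make_pipeline(
    StandardScaler(),
    LocallyLinearEmbedding(n_neighbors=15, n_components=5, random_state=0),
    SVC(kernel="rbf"),
)

fold_accuracies = cross_val_score(hybrid, X, y, cv=5)
mean_accuracy = fold_accuracies.mean()
```

Wrapping both steps in one pipeline keeps the reduction step from leaking information across folds, which matters when comparing many reduction and classifier combinations.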
