Baiyuan Yang, Yongyun Zhu, Kelu Li, Fang Wang, Bin Liu, Qian Zhou, Yuchao Tai, Zhaochao Liu, Lin Yang, Ruiqiong Ba, Chunyan Lei, Hui Ren, Zhong Xu, Ailan Pang, Xinglong Yang
{"title":"基于代谢组学和蛋白质组学的机器学习模型预测帕金森病的认知障碍","authors":"Baiyuan Yang, Yongyun Zhu, Kelu Li, Fang Wang, Bin Liu, Qian Zhou, Yuchao Tai, Zhaochao Liu, Lin Yang, Ruiqiong Ba, Chunyan Lei, Hui Ren, Zhong Xu, Ailan Pang, Xinglong Yang","doi":"10.1038/s41531-024-00795-y","DOIUrl":null,"url":null,"abstract":"<p>There is an urgent need to identify predictive biomarkers of Parkinson’s disease (PD) with cognitive impairment (PDCI) in order to individualize patient management, ensure timely intervention, and improve prognosis. The aim of this study was to screen for these biomarkers by comparing the plasma proteome and metabolome of PD patients with or without cognitive impairment. Proteomics and metabolomics analyses were performed on a discover cohort. A machine learning model was used to identify candidate protein and metabolite biomarkers of PDCI, which were validated in an independent cohort. The predictive ability of these biomarkers for PDCI was evaluated by plotting receiver operating characteristic curves and calculating the area under the curve (AUC). Moreover, we assessed the predictive ability of these proteins in combination with neuroimaging. In the discover cohort (<i>n</i> = 100), we identified 25 protein features with best results in the machine learning model, including top-ranked PSAP and H3C15. The two-proteins were used for model construction, achieving an Area under the curve (AUC) of 0.951 in the train set and AUC of 0.981 in the test set. Similarly, the model gives a rank list of endogenous metabolite features, Glycocholic Acid and 6-Methylnicotinamide were two top features. Combining these two markers further got the AUC of 0.969 in train set and 0.867 in the test set. To validate the performance of the protein biomarkers, we performed targeted analysis of selected proteins (H3C15 and PSAP) and proteins likely associated with PDCI (NCAM2 and LAMB2) using parallel reaction monitoring in validation cohort (<i>n</i> = 116). The AUC of the classifier built with H3C15 and PSAP is 0.813. Moreover, when combining H3C15, PSAP, NCAM2, and LAMB2, the model achieved AUC of 0.983 in the train set, AUC of 0.981 in the test set, and AUC of 0.839 in the validation set. Furthermore, we verified that these protein markers we discovered can improve the predictive effect of neuroimaging on PDCI: the classifier built with neuroimaging features had AUC of 0.833, which improved to 0.905 when combined with H3C15. Taken together, our integrated proteomics and metabolomics analysis successfully identified potential biomarkers for PDCI. Additionally, H3C15 showed promise in enhancing the predictive performance of neuroimaging for cognitive impairment.</p>","PeriodicalId":19706,"journal":{"name":"NPJ Parkinson's Disease","volume":"14 1","pages":""},"PeriodicalIF":6.7000,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning model base on metabolomics and proteomics to predict cognitive impairment in Parkinson’s disease\",\"authors\":\"Baiyuan Yang, Yongyun Zhu, Kelu Li, Fang Wang, Bin Liu, Qian Zhou, Yuchao Tai, Zhaochao Liu, Lin Yang, Ruiqiong Ba, Chunyan Lei, Hui Ren, Zhong Xu, Ailan Pang, Xinglong Yang\",\"doi\":\"10.1038/s41531-024-00795-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>There is an urgent need to identify predictive biomarkers of Parkinson’s disease (PD) with cognitive impairment (PDCI) in order to individualize patient management, ensure timely intervention, and improve prognosis. The aim of this study was to screen for these biomarkers by comparing the plasma proteome and metabolome of PD patients with or without cognitive impairment. Proteomics and metabolomics analyses were performed on a discover cohort. A machine learning model was used to identify candidate protein and metabolite biomarkers of PDCI, which were validated in an independent cohort. The predictive ability of these biomarkers for PDCI was evaluated by plotting receiver operating characteristic curves and calculating the area under the curve (AUC). Moreover, we assessed the predictive ability of these proteins in combination with neuroimaging. In the discover cohort (<i>n</i> = 100), we identified 25 protein features with best results in the machine learning model, including top-ranked PSAP and H3C15. The two-proteins were used for model construction, achieving an Area under the curve (AUC) of 0.951 in the train set and AUC of 0.981 in the test set. Similarly, the model gives a rank list of endogenous metabolite features, Glycocholic Acid and 6-Methylnicotinamide were two top features. Combining these two markers further got the AUC of 0.969 in train set and 0.867 in the test set. To validate the performance of the protein biomarkers, we performed targeted analysis of selected proteins (H3C15 and PSAP) and proteins likely associated with PDCI (NCAM2 and LAMB2) using parallel reaction monitoring in validation cohort (<i>n</i> = 116). The AUC of the classifier built with H3C15 and PSAP is 0.813. Moreover, when combining H3C15, PSAP, NCAM2, and LAMB2, the model achieved AUC of 0.983 in the train set, AUC of 0.981 in the test set, and AUC of 0.839 in the validation set. Furthermore, we verified that these protein markers we discovered can improve the predictive effect of neuroimaging on PDCI: the classifier built with neuroimaging features had AUC of 0.833, which improved to 0.905 when combined with H3C15. Taken together, our integrated proteomics and metabolomics analysis successfully identified potential biomarkers for PDCI. Additionally, H3C15 showed promise in enhancing the predictive performance of neuroimaging for cognitive impairment.</p>\",\"PeriodicalId\":19706,\"journal\":{\"name\":\"NPJ Parkinson's Disease\",\"volume\":\"14 1\",\"pages\":\"\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2024-10-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"NPJ Parkinson's Disease\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1038/s41531-024-00795-y\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"NPJ Parkinson's Disease","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1038/s41531-024-00795-y","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}
Machine learning model base on metabolomics and proteomics to predict cognitive impairment in Parkinson’s disease
There is an urgent need to identify predictive biomarkers of Parkinson’s disease (PD) with cognitive impairment (PDCI) in order to individualize patient management, ensure timely intervention, and improve prognosis. The aim of this study was to screen for these biomarkers by comparing the plasma proteome and metabolome of PD patients with or without cognitive impairment. Proteomics and metabolomics analyses were performed on a discover cohort. A machine learning model was used to identify candidate protein and metabolite biomarkers of PDCI, which were validated in an independent cohort. The predictive ability of these biomarkers for PDCI was evaluated by plotting receiver operating characteristic curves and calculating the area under the curve (AUC). Moreover, we assessed the predictive ability of these proteins in combination with neuroimaging. In the discover cohort (n = 100), we identified 25 protein features with best results in the machine learning model, including top-ranked PSAP and H3C15. The two-proteins were used for model construction, achieving an Area under the curve (AUC) of 0.951 in the train set and AUC of 0.981 in the test set. Similarly, the model gives a rank list of endogenous metabolite features, Glycocholic Acid and 6-Methylnicotinamide were two top features. Combining these two markers further got the AUC of 0.969 in train set and 0.867 in the test set. To validate the performance of the protein biomarkers, we performed targeted analysis of selected proteins (H3C15 and PSAP) and proteins likely associated with PDCI (NCAM2 and LAMB2) using parallel reaction monitoring in validation cohort (n = 116). The AUC of the classifier built with H3C15 and PSAP is 0.813. Moreover, when combining H3C15, PSAP, NCAM2, and LAMB2, the model achieved AUC of 0.983 in the train set, AUC of 0.981 in the test set, and AUC of 0.839 in the validation set. Furthermore, we verified that these protein markers we discovered can improve the predictive effect of neuroimaging on PDCI: the classifier built with neuroimaging features had AUC of 0.833, which improved to 0.905 when combined with H3C15. Taken together, our integrated proteomics and metabolomics analysis successfully identified potential biomarkers for PDCI. Additionally, H3C15 showed promise in enhancing the predictive performance of neuroimaging for cognitive impairment.
期刊介绍:
npj Parkinson's Disease is a comprehensive open access journal that covers a wide range of research areas related to Parkinson's disease. It publishes original studies in basic science, translational research, and clinical investigations. The journal is dedicated to advancing our understanding of Parkinson's disease by exploring various aspects such as anatomy, etiology, genetics, cellular and molecular physiology, neurophysiology, epidemiology, and therapeutic development. By providing free and immediate access to the scientific and Parkinson's disease community, npj Parkinson's Disease promotes collaboration and knowledge sharing among researchers and healthcare professionals.