Plasma protein-based identification of neuroimage-driven subtypes in mild cognitive impairment via protein-protein interaction aware explainable graph propagational network
Sunghong Park, Doyoon Kim, Heirim Lee, Chang Hyung Hong, Sang Joon Son, Hyun Woong Roh, Dokyoon Kim, Yonghyun Nam, Dong-gi Lee, Hyunjung Shin, Hyun Goo Woo
Computers in Biology and Medicine, vol. 183, Article 109303. Published 2024-10-28.
DOI: 10.1016/j.compbiomed.2024.109303
URL: https://www.sciencedirect.com/science/article/pii/S001048252401388X
Abstract
As an early indicator of dementia, mild cognitive impairment (MCI) requires subtype-specific treatment for the effective prevention and management of dementia progression. Based on its neuropathological characteristics, MCI can be classified into Alzheimer's disease (AD)-related cognitive impairment (ADCI) and subcortical vascular cognitive impairment (SVCI), which are more likely to progress to AD and subcortical vascular dementia (SVD), respectively. For identifying MCI subtypes, plasma protein biomarkers have recently been seen as promising tools because they are minimally invasive and cost-effective in diagnostic procedures. Furthermore, the application of machine learning (ML) has improved the precision of biomarker discovery and of the resulting diagnostics. Nevertheless, previous ML-based studies often fail to consider interactions between proteins, which are essential in complex neurodegenerative disorders such as MCI and dementia. Although protein-protein interactions (PPIs) have been employed in network models, these models frequently do not fully capture the diverse properties of PPIs because their awareness of the network is only local. This limitation increases the likelihood of overlooking critical components and amplifying the impact of noisy interactions. In this study, we introduce a new graph-based ML model for classifying MCI subtypes, called eXplainable Graph Propagational Network (XGPN). The proposed method extracts the globally interactive effects between proteins by propagating the independent effect of each plasma protein over the PPI network; MCI subtypes are then predicted by estimating the risk effect of each protein. Moreover, the process of model training and the outcome of subtype classification are fully explainable owing to the simplicity and transparency of XGPN's architecture. The experimental results indicated that the interactive effect between proteins contributed significantly to the differences between MCI subtype groups, yielding an average improvement of 10.0% in classification performance over existing methods, while also identifying key biomarkers and their impact on ADCI and SVCI.
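The abstract does not state XGPN's propagation rule, so the following is only a rough illustration of the general idea of spreading per-protein effects over a PPI graph. It substitutes a standard label-propagation style update, f <- alpha * S f + (1 - alpha) * y, where S is the symmetrically normalized adjacency matrix. The function names, the `alpha` parameter, the toy network, and the linear risk score are all assumptions made for this sketch, not details taken from the paper.

```python
from math import sqrt


def normalize_adjacency(adj):
    """Return S = D^(-1/2) A D^(-1/2) for a dense, non-negative adjacency matrix."""
    n = len(adj)
    deg = [sum(row) for row in adj]
    # A non-zero edge implies both endpoint degrees are positive,
    # so isolated nodes simply keep all-zero rows.
    return [
        [adj[i][j] / sqrt(deg[i] * deg[j]) if adj[i][j] else 0.0 for j in range(n)]
        for i in range(n)
    ]


def propagate(adj, independent_effect, alpha=0.5, tol=1e-10, max_iter=1000):
    """Iterate f <- alpha * S f + (1 - alpha) * y until the update is below tol.

    This is a generic graph-smoothing rule used as a stand-in; it is an
    assumption of this sketch, not the update rule of XGPN.
    """
    s = normalize_adjacency(adj)
    y = list(independent_effect)
    n = len(y)
    f = y[:]
    for _ in range(max_iter):
        new_f = [
            alpha * sum(s[i][j] * f[j] for j in range(n)) + (1 - alpha) * y[i]
            for i in range(n)
        ]
        if max(abs(a - b) for a, b in zip(new_f, f)) < tol:
            return new_f
        f = new_f
    return f


def risk_score(propagated_effect, protein_levels):
    """Hypothetical linear risk score: sum of effect * measured protein level."""
    return sum(e * x for e, x in zip(propagated_effect, protein_levels))


# Toy PPI network (hypothetical): P0 - P1 - P2 form a path, P3 is isolated.
toy_adj = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
# Only P0 carries an independent effect before propagation.
toy_independent = [1.0, 0.0, 0.0, 0.0]

toy_propagated = propagate(toy_adj, toy_independent, alpha=0.5)
print(toy_propagated)
```

In this toy setting, P1 and P2 pick up a share of P0's effect through the network, with the share shrinking as the distance from P0 grows, while the isolated protein P3 receives none. Setting `alpha=0` disables propagation and returns the independent effects unchanged, which gives a simple baseline for judging how much the interaction structure contributes.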
About the journal:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.