Sunghong Park, Chang Hyung Hong, Sang Joon Son, Hyun Woong Roh, Doyoon Kim, Hyunjung Shin, Hyun Goo Woo
{"title":"Identification of molecular subtypes of dementia by using blood-proteins interaction-aware graph propagational network.","authors":"Sunghong Park, Chang Hyung Hong, Sang Joon Son, Hyun Woong Roh, Doyoon Kim, Hyunjung Shin, Hyun Goo Woo","doi":"10.1093/bib/bbae428","DOIUrl":null,"url":null,"abstract":"<p><p>Plasma protein biomarkers have been considered promising tools for diagnosing dementia subtypes due to their low variability, cost-effectiveness, and minimal invasiveness in diagnostic procedures. Machine learning (ML) methods have been applied to enhance accuracy of the biomarker discovery. However, previous ML-based studies often overlook interactions between proteins, which are crucial in complex disorders like dementia. While protein-protein interactions (PPIs) have been used in network models, these models often fail to fully capture the diverse properties of PPIs due to their local awareness. This drawback increases the chance of neglecting critical components and magnifying the impact of noisy interactions. In this study, we propose a novel graph-based ML model for dementia subtype diagnosis, the graph propagational network (GPN). By propagating the independent effect of plasma proteins on PPI network, the GPN extracts the globally interactive effects between proteins. Experimental results showed that the interactive effect between proteins yielded to further clarify the differences between dementia subtype groups and contributed to the performance improvement where the GPN outperformed existing methods by 10.4% on average.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":null,"pages":null},"PeriodicalIF":6.8000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11370639/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbae428","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Plasma protein biomarkers have been considered promising tools for diagnosing dementia subtypes due to their low variability, cost-effectiveness, and minimal invasiveness in diagnostic procedures. Machine learning (ML) methods have been applied to enhance accuracy of the biomarker discovery. However, previous ML-based studies often overlook interactions between proteins, which are crucial in complex disorders like dementia. While protein-protein interactions (PPIs) have been used in network models, these models often fail to fully capture the diverse properties of PPIs due to their local awareness. This drawback increases the chance of neglecting critical components and magnifying the impact of noisy interactions. In this study, we propose a novel graph-based ML model for dementia subtype diagnosis, the graph propagational network (GPN). By propagating the independent effect of plasma proteins on PPI network, the GPN extracts the globally interactive effects between proteins. Experimental results showed that the interactive effect between proteins yielded to further clarify the differences between dementia subtype groups and contributed to the performance improvement where the GPN outperformed existing methods by 10.4% on average.
血浆蛋白生物标志物因其变异性低、成本效益高、诊断过程中的侵入性小而被认为是诊断痴呆症亚型的有前途的工具。机器学习(ML)方法已被用于提高生物标记物发现的准确性。然而,以往基于 ML 的研究往往忽略了蛋白质之间的相互作用,而这在痴呆症等复杂疾病中至关重要。虽然蛋白质-蛋白质相互作用(PPIs)已被用于网络模型,但这些模型往往由于其局部意识而无法完全捕捉到 PPIs 的各种特性。这一缺陷增加了忽略关键成分和放大嘈杂相互作用影响的几率。在本研究中,我们提出了一种用于痴呆症亚型诊断的基于图的新型 ML 模型--图传播网络(GPN)。通过在 PPI 网络上传播血浆蛋白的独立效应,GPN 提取了蛋白之间的全局交互效应。实验结果表明,蛋白质之间的交互效应进一步明确了痴呆症亚型组之间的差异,并有助于提高性能,GPN的性能比现有方法平均高出10.4%。
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.