Qing Su , Wenting Liu , Xiaoyan Liu , Pixiong Su , Boqia Xie
{"title":"Bioinformatics-focused identification of metabolomic Markers in coronary microvascular disease","authors":"Qing Su , Wenting Liu , Xiaoyan Liu , Pixiong Su , Boqia Xie","doi":"10.1016/j.compbiomed.2025.109992","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Coronary microvascular disease (CMVD), marked by dysfunction of the small coronary vessels, poses significant diagnostic challenges due to the complexity and high cost of current procedures like the index of microcirculatory resistance (IMR). This study aimed to identify metabolomic biomarkers from coronary artery samples to facilitate CMVD diagnosis using advanced bioinformatics techniques—specifically, random forest algorithms and generalized linear models (GLMs)—to develop more cost-effective blood-based diagnostics.</div></div><div><h3>Methods</h3><div>In this prospective study, 68 patients scheduled for coronary angiography and IMR assessment were enrolled. Plasma samples obtained from their coronary arteries were analyzed using untargeted metabolomics with liquid chromatography-mass spectrometry. Advanced bioinformatics methods were applied: random forest algorithms were utilized for feature selection to identify significant metabolites, and GLMs were constructed for predictive modeling. The diagnostic performance of the models was evaluated through receiver operating characteristic (ROC) curve analysis.</div></div><div><h3>Results</h3><div>The random forest analysis identified the top 10 metabolites that significantly contributed to the classification of CMVD. The GLM built using these metabolites demonstrated excellent diagnostic accuracy, achieving area under the ROC curve (AUC) values of 0.984 in the initial (discovery) cohort and 0.938 in the subsequent (validation) cohort. The use of mathematical modeling enhanced the robustness and interpretability of the biomarker selection process.</div></div><div><h3>Conclusions</h3><div>Advanced bioinformatics techniques, including random forest algorithms and GLMs, effectively identified key metabolites associated with CMVD. While the collection of coronary artery blood samples is invasive due to the necessity of coronary angiography, this method offers a more practical and cost-effective alternative to IMR measurement, potentially improving the diagnostic approach for CMVD.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"189 ","pages":"Article 109992"},"PeriodicalIF":7.0000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525003439","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Coronary microvascular disease (CMVD), marked by dysfunction of the small coronary vessels, poses significant diagnostic challenges due to the complexity and high cost of current procedures like the index of microcirculatory resistance (IMR). This study aimed to identify metabolomic biomarkers from coronary artery samples to facilitate CMVD diagnosis using advanced bioinformatics techniques—specifically, random forest algorithms and generalized linear models (GLMs)—to develop more cost-effective blood-based diagnostics.
Methods
In this prospective study, 68 patients scheduled for coronary angiography and IMR assessment were enrolled. Plasma samples obtained from their coronary arteries were analyzed using untargeted metabolomics with liquid chromatography-mass spectrometry. Advanced bioinformatics methods were applied: random forest algorithms were utilized for feature selection to identify significant metabolites, and GLMs were constructed for predictive modeling. The diagnostic performance of the models was evaluated through receiver operating characteristic (ROC) curve analysis.
Results
The random forest analysis identified the top 10 metabolites that significantly contributed to the classification of CMVD. The GLM built using these metabolites demonstrated excellent diagnostic accuracy, achieving area under the ROC curve (AUC) values of 0.984 in the initial (discovery) cohort and 0.938 in the subsequent (validation) cohort. The use of mathematical modeling enhanced the robustness and interpretability of the biomarker selection process.
Conclusions
Advanced bioinformatics techniques, including random forest algorithms and GLMs, effectively identified key metabolites associated with CMVD. While the collection of coronary artery blood samples is invasive due to the necessity of coronary angiography, this method offers a more practical and cost-effective alternative to IMR measurement, potentially improving the diagnostic approach for CMVD.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.