Integration of single-cell and bulk RNA sequencing data using machine learning identifies oxidative stress-related genes LUM and PCOLCE2 as potential biomarkers for heart failure.
Chaofang Li, Ruijinlin Hao, Chuanfu Li, Li Liu, Zhengnian Ding
{"title":"Integration of single-cell and bulk RNA sequencing data using machine learning identifies oxidative stress-related genes LUM and PCOLCE2 as potential biomarkers for heart failure.","authors":"Chaofang Li, Ruijinlin Hao, Chuanfu Li, Li Liu, Zhengnian Ding","doi":"10.1016/j.ijbiomac.2025.140793","DOIUrl":null,"url":null,"abstract":"<p><p>Oxidative stress (OS) is a pivotal mechanism driving the progression of cardiovascular diseases, particularly heart failure (HF). However, the comprehensive characterisation of OS-related genes in HF remains largely unexplored. In the present study, we analysed single-cell RNA sequencing datasets from the Gene Expression Omnibus and OS gene sets from GeneCards. We identified 167 OS-related genes potentially linked to HF by applying algorithms, such as AUCell, UCell, singscore, ssgsea, and AddModuleScore, combined with correlation analysis. Subsequently, we used feature selection algorithms, including least absolute shrinkage and selection operator, XGBoost, Boruta, random forest, gradient boosting machines, decision trees, and support vector machine recursive feature elimination, to identify lumican (LUM) and procollagen C-endopeptidase enhancer 2 (PCOLCE2) as key biomarker candidates with significant diagnostic potential. Bulk RNA-sequencing confirmed their elevated expression in patients with HF, highlighting their predictive utility. Single-cell analysis further revealed their upregulation primarily in fibroblasts, emphasising their cell-specific role in HF. To validate these findings, we developed a transverse aortic constriction-induced HF mouse model that showed enhanced cardiac OS activity and significant PCOLCE2 upregulation in the HF group. These results provide strong evidence of the involvement of OS-related mechanisms in HF. Herein, we propose a diagnostic strategy that provides novel insights into the molecular mechanisms underlying HF. However, further studies are required to validate its clinical utility and ensure its application in the diagnosis of HF.</p>","PeriodicalId":333,"journal":{"name":"International Journal of Biological Macromolecules","volume":" ","pages":"140793"},"PeriodicalIF":8.5000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biological Macromolecules","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1016/j.ijbiomac.2025.140793","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/8 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Oxidative stress (OS) is a pivotal mechanism driving the progression of cardiovascular diseases, particularly heart failure (HF). However, the comprehensive characterisation of OS-related genes in HF remains largely unexplored. In the present study, we analysed single-cell RNA sequencing datasets from the Gene Expression Omnibus and OS gene sets from GeneCards. We identified 167 OS-related genes potentially linked to HF by applying algorithms, such as AUCell, UCell, singscore, ssgsea, and AddModuleScore, combined with correlation analysis. Subsequently, we used feature selection algorithms, including least absolute shrinkage and selection operator, XGBoost, Boruta, random forest, gradient boosting machines, decision trees, and support vector machine recursive feature elimination, to identify lumican (LUM) and procollagen C-endopeptidase enhancer 2 (PCOLCE2) as key biomarker candidates with significant diagnostic potential. Bulk RNA-sequencing confirmed their elevated expression in patients with HF, highlighting their predictive utility. Single-cell analysis further revealed their upregulation primarily in fibroblasts, emphasising their cell-specific role in HF. To validate these findings, we developed a transverse aortic constriction-induced HF mouse model that showed enhanced cardiac OS activity and significant PCOLCE2 upregulation in the HF group. These results provide strong evidence of the involvement of OS-related mechanisms in HF. Herein, we propose a diagnostic strategy that provides novel insights into the molecular mechanisms underlying HF. However, further studies are required to validate its clinical utility and ensure its application in the diagnosis of HF.
期刊介绍:
The International Journal of Biological Macromolecules is a well-established international journal dedicated to research on the chemical and biological aspects of natural macromolecules. Focusing on proteins, macromolecular carbohydrates, glycoproteins, proteoglycans, lignins, biological poly-acids, and nucleic acids, the journal presents the latest findings in molecular structure, properties, biological activities, interactions, modifications, and functional properties. Papers must offer new and novel insights, encompassing related model systems, structural conformational studies, theoretical developments, and analytical techniques. Each paper is required to primarily focus on at least one named biological macromolecule, reflected in the title, abstract, and text.