Fubo Wang, Chengbang Wang, Shaohua Chen, Chunmeng Wei, Jin Ji, Yan Liu, Leifeng Liang, Yifeng Chen, Xing Li, Lin Zhao, Xiaolei Shi, Yu Fang, Weimin Lu, Tianman Li, Zhe Liu, Wenhao Lu, Tingting Li, Xiangui Hu, Mugan Li, Fuchen Liu, Xing He, Jiannan Wen, Zuheng Wang, Wenxuan Zhou, Zehui Chen, Yonggang Hong, Shaohua Zhang, Xiao Li, Rongbin Zhou, Linjian Mo, Duobing Zhang, Tianyu Li, Qingyun Zhang, Li Wang, Xuedong Wei, Bo Yang, Shenglin Huang, Huiyong Zhang, Guijian Pang, Liu Ouyang, Zhenguang Wang, Jiwen Cheng, Bin Xu, Zengnan Mo
{"title":"Identification of blood-derived exosomal tumor RNA signatures as noninvasive diagnostic biomarkers for multi-cancer: a multi-phase, multi-center study","authors":"Fubo Wang, Chengbang Wang, Shaohua Chen, Chunmeng Wei, Jin Ji, Yan Liu, Leifeng Liang, Yifeng Chen, Xing Li, Lin Zhao, Xiaolei Shi, Yu Fang, Weimin Lu, Tianman Li, Zhe Liu, Wenhao Lu, Tingting Li, Xiangui Hu, Mugan Li, Fuchen Liu, Xing He, Jiannan Wen, Zuheng Wang, Wenxuan Zhou, Zehui Chen, Yonggang Hong, Shaohua Zhang, Xiao Li, Rongbin Zhou, Linjian Mo, Duobing Zhang, Tianyu Li, Qingyun Zhang, Li Wang, Xuedong Wei, Bo Yang, Shenglin Huang, Huiyong Zhang, Guijian Pang, Liu Ouyang, Zhenguang Wang, Jiwen Cheng, Bin Xu, Zengnan Mo","doi":"10.1186/s12943-025-02271-4","DOIUrl":null,"url":null,"abstract":"Cancer remains a leading global cause of mortality, making early detection crucial for improving survival outcomes. The study aims to develop a machine learning-enabled blood-derived exosomal RNA profiling platform for multi-cancer detection and localization. In this multi-phase, multi-center study, we analyzed RNA from exosomes derived from peripheral blood plasma in 818 participants across eight cancer types during the discovery phase. Machine learning techniques were applied to identify potential pan-cancer biomarkers. During the screening and model validation phases, the sample size was progressively expanded to 1,385 participants in two steps, while the candidate biomarkers were refined into a set of 12 exosomal tumor RNA signatures (ETR.sig). In the subsequent model construction phase, diagnostic models were developed using the expanded cohort and ETR.sig. Statistical analyses included the calculation of receiver operating characteristic (ROC) curves and AUC values to assess the models' ability to distinguish cancer cases from controls and determine tumor origins. To further validate and explore the biological relevance of the identified biomarkers, we integrated tissue RNA-seq, single-cell data, and clinical information. Machine learning analysis initially identified 33 candidate biomarkers, which were narrowed down to 20 ETR.sig in the screening phase and 12 ETR.sig in the validation phase. In the model construction phase, a diagnostic model based on ETR.sig, built using the Random Forest (RF) algorithm, showed excellent performance with an AUC of 0.915 for distinguishing pan-cancer from controls. The multi-class classification model also demonstrated strong classification power, with macro-average and micro-average AUCs of 0.983 and 0.985, respectively, for differentiating between eight cancer types. Additionally, tumor origin classification using the RF-based diagnostic models achieved high AUC values: BRCA 0.976, COAD 0.98, KIRC 0.947, LIHC 0.967, LUAD 0.853, OV 0.972, PAAD 0.977, and PRAD 0.898. Integration of tissue RNA-seq, single-cell data, and clinical information revealed key associations between ETR.sig-related genes and tumor development. The study demonstrates the robust potential of exosomal RNA as a minimally invasive biomarker resource for cancer detection. The developed ETR.sig platform offers a promising tool for precision oncology and broad-spectrum cancer screening, integrating advanced computational models with nanoscale vesicle biology for accurate and rapid diagnosis.","PeriodicalId":19000,"journal":{"name":"Molecular Cancer","volume":"28 1","pages":""},"PeriodicalIF":27.7000,"publicationDate":"2025-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12943-025-02271-4","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Cancer remains a leading global cause of mortality, making early detection crucial for improving survival outcomes. The study aims to develop a machine learning-enabled blood-derived exosomal RNA profiling platform for multi-cancer detection and localization. In this multi-phase, multi-center study, we analyzed RNA from exosomes derived from peripheral blood plasma in 818 participants across eight cancer types during the discovery phase. Machine learning techniques were applied to identify potential pan-cancer biomarkers. During the screening and model validation phases, the sample size was progressively expanded to 1,385 participants in two steps, while the candidate biomarkers were refined into a set of 12 exosomal tumor RNA signatures (ETR.sig). In the subsequent model construction phase, diagnostic models were developed using the expanded cohort and ETR.sig. Statistical analyses included the calculation of receiver operating characteristic (ROC) curves and AUC values to assess the models' ability to distinguish cancer cases from controls and determine tumor origins. To further validate and explore the biological relevance of the identified biomarkers, we integrated tissue RNA-seq, single-cell data, and clinical information. Machine learning analysis initially identified 33 candidate biomarkers, which were narrowed down to 20 ETR.sig in the screening phase and 12 ETR.sig in the validation phase. In the model construction phase, a diagnostic model based on ETR.sig, built using the Random Forest (RF) algorithm, showed excellent performance with an AUC of 0.915 for distinguishing pan-cancer from controls. The multi-class classification model also demonstrated strong classification power, with macro-average and micro-average AUCs of 0.983 and 0.985, respectively, for differentiating between eight cancer types. Additionally, tumor origin classification using the RF-based diagnostic models achieved high AUC values: BRCA 0.976, COAD 0.98, KIRC 0.947, LIHC 0.967, LUAD 0.853, OV 0.972, PAAD 0.977, and PRAD 0.898. Integration of tissue RNA-seq, single-cell data, and clinical information revealed key associations between ETR.sig-related genes and tumor development. The study demonstrates the robust potential of exosomal RNA as a minimally invasive biomarker resource for cancer detection. The developed ETR.sig platform offers a promising tool for precision oncology and broad-spectrum cancer screening, integrating advanced computational models with nanoscale vesicle biology for accurate and rapid diagnosis.
期刊介绍:
Molecular Cancer is a platform that encourages the exchange of ideas and discoveries in the field of cancer research, particularly focusing on the molecular aspects. Our goal is to facilitate discussions and provide insights into various areas of cancer and related biomedical science. We welcome articles from basic, translational, and clinical research that contribute to the advancement of understanding, prevention, diagnosis, and treatment of cancer.
The scope of topics covered in Molecular Cancer is diverse and inclusive. These include, but are not limited to, cell and tumor biology, angiogenesis, utilizing animal models, understanding metastasis, exploring cancer antigens and the immune response, investigating cellular signaling and molecular biology, examining epidemiology, genetic and molecular profiling of cancer, identifying molecular targets, studying cancer stem cells, exploring DNA damage and repair mechanisms, analyzing cell cycle regulation, investigating apoptosis, exploring molecular virology, and evaluating vaccine and antibody-based cancer therapies.
Molecular Cancer serves as an important platform for sharing exciting discoveries in cancer-related research. It offers an unparalleled opportunity to communicate information to both specialists and the general public. The online presence of Molecular Cancer enables immediate publication of accepted articles and facilitates the presentation of large datasets and supplementary information. This ensures that new research is efficiently and rapidly disseminated to the scientific community.