{"title":"An In Silico Analysis Reveals an EMT-Associated Gene Signature for Predicting Recurrence of Early-Stage Lung Adenocarcinoma","authors":"Yi Han, F. Wong, Di Wang, C. Kahlert","doi":"10.1177/11769351221100727","DOIUrl":null,"url":null,"abstract":"Background: The potential micrometastasis tends to cause recurrence of lung adenocarcinoma (LUAD) after surgical resection and consequently leads to an increase in the mortality risk. Compelling evidence has suggested the underlying mechanisms of tumor metastasis could involve the activation of an epithelial-mesenchymal transition (EMT) program. Hence, the objective of this study was to develop an EMT-associated gene signature for predicting the recurrence of early-stage LUAD. Methods: The mRNA expression data of patients with early-stage LUAD were downloaded from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) available databases. Gene Set Variation Analysis (GSVA) was first performed to provide an assessment of EMT phenotype, whereas Weighted Gene Co-expression Network Analysis (WGCNA) was constructed to determine EMT-associated key modules and genes. Based on the genes, a novel EMT-associated signature for predicting the recurrence of early-stage LUAD was identified using a least absolute shrinkage and selection operator (LASSO) algorithm and a stepwise Cox proportional hazards regression model. Kaplan-Meier survival analysis, receiver operating characteristic (ROC) curves and Cox regression analyses were used to estimate the performance of the identified gene signature. Results: GSVA revealed diverse EMT states in the early-stage LUAD. Further correlation analyses showed that the EMT states presented high correlations with several hallmarks of cancers, tumor purity, tumor microenvironment cells, and immune checkpoint genes. More importantly, Kaplan-Meier survival analyses indicated that patients with high EMT scores had worse recurrence-free survival (RFS) and overall survival (OS) than those with low EMT scores. A novel 5-gene signature (AGL, ECM1, ENPP1, SNX7, and TSPAN12) was established based on the EMT-associated genes from WGCNA and this signature successfully predicted that the high-risk patients had a higher recurrence rate compared with the low-risk patients. In further analyses, the signature represented robust prognostic values in 2 independent validation cohorts (GEO and TCGA datasets) and a combined GEO cohort as evaluated by Kaplan-Meier survival (P-value < .0001) and ROC analysis (AUC = 0.781). Moreover, the signature was corroborated to be independent of clinical factors by univariate and multivariate Cox regression analyses. Interestingly, the combination of the signature-based recurrence risk and tumor-node-metastasis (TNM) stage showed a superior predictive ability on the recurrence of patients with early-stage LUAD. Conclusion: Our study suggests that patients with early-stage LUAD exhibit diverse EMT states that play a vital role in tumor recurrence. The novel and promising EMT-associated 5-gene signature identified and validated in this study may be applied to predict the recurrence of early-stage LUAD, facilitating risk stratification, recurrence monitoring, and individualized management for the patients after surgical resection.","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11769351221100727","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 2
Abstract
Background: The potential micrometastasis tends to cause recurrence of lung adenocarcinoma (LUAD) after surgical resection and consequently leads to an increase in the mortality risk. Compelling evidence has suggested the underlying mechanisms of tumor metastasis could involve the activation of an epithelial-mesenchymal transition (EMT) program. Hence, the objective of this study was to develop an EMT-associated gene signature for predicting the recurrence of early-stage LUAD. Methods: The mRNA expression data of patients with early-stage LUAD were downloaded from Gene Expression Omnibus (GEO) and The Cancer Genome Atlas (TCGA) available databases. Gene Set Variation Analysis (GSVA) was first performed to provide an assessment of EMT phenotype, whereas Weighted Gene Co-expression Network Analysis (WGCNA) was constructed to determine EMT-associated key modules and genes. Based on the genes, a novel EMT-associated signature for predicting the recurrence of early-stage LUAD was identified using a least absolute shrinkage and selection operator (LASSO) algorithm and a stepwise Cox proportional hazards regression model. Kaplan-Meier survival analysis, receiver operating characteristic (ROC) curves and Cox regression analyses were used to estimate the performance of the identified gene signature. Results: GSVA revealed diverse EMT states in the early-stage LUAD. Further correlation analyses showed that the EMT states presented high correlations with several hallmarks of cancers, tumor purity, tumor microenvironment cells, and immune checkpoint genes. More importantly, Kaplan-Meier survival analyses indicated that patients with high EMT scores had worse recurrence-free survival (RFS) and overall survival (OS) than those with low EMT scores. A novel 5-gene signature (AGL, ECM1, ENPP1, SNX7, and TSPAN12) was established based on the EMT-associated genes from WGCNA and this signature successfully predicted that the high-risk patients had a higher recurrence rate compared with the low-risk patients. In further analyses, the signature represented robust prognostic values in 2 independent validation cohorts (GEO and TCGA datasets) and a combined GEO cohort as evaluated by Kaplan-Meier survival (P-value < .0001) and ROC analysis (AUC = 0.781). Moreover, the signature was corroborated to be independent of clinical factors by univariate and multivariate Cox regression analyses. Interestingly, the combination of the signature-based recurrence risk and tumor-node-metastasis (TNM) stage showed a superior predictive ability on the recurrence of patients with early-stage LUAD. Conclusion: Our study suggests that patients with early-stage LUAD exhibit diverse EMT states that play a vital role in tumor recurrence. The novel and promising EMT-associated 5-gene signature identified and validated in this study may be applied to predict the recurrence of early-stage LUAD, facilitating risk stratification, recurrence monitoring, and individualized management for the patients after surgical resection.
期刊介绍:
The field of cancer research relies on advances in many other disciplines, including omics technology, mass spectrometry, radio imaging, computer science, and biostatistics. Cancer Informatics provides open access to peer-reviewed high-quality manuscripts reporting bioinformatics analysis of molecular genetics and/or clinical data pertaining to cancer, emphasizing the use of machine learning, artificial intelligence, statistical algorithms, advanced imaging techniques, data visualization, and high-throughput technologies. As the leading journal dedicated exclusively to the report of the use of computational methods in cancer research and practice, Cancer Informatics leverages methodological improvements in systems biology, genomics, proteomics, metabolomics, and molecular biochemistry into the fields of cancer detection, treatment, classification, risk-prediction, prevention, outcome, and modeling.