{"title":"Identification of Prognostic Biomarkers for Breast Cancer Metastasis Using Penalized Additive Hazards Regression Model.","authors":"Leili Tapak, Omid Hamidi, Payam Amini, Saeid Afshar, Siamak Salimy, Irina Dinu","doi":"10.1177/11769351231157942","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Breast cancer (BC) has been reported as one of the most common cancers diagnosed in females throughout the world. Survival rate of BC patients is affected by metastasis. So, exploring its underlying mechanisms and identifying related biomarkers to monitor BC relapse/recurrence using new statistical methods is essential. This study investigated the high-dimensional gene-expression profiles of BC patients using penalized additive hazards regression models.</p><p><strong>Methods: </strong>A publicly available dataset related to the time to metastasis in BC patients (GSE2034) was used. There was information of 22 283 genes expression profiles related to 286 BC patients. Penalized additive hazards regression models with different penalties, including LASSO, SCAD, SICA, MCP and Elastic net were used to identify metastasis related genes.</p><p><strong>Results: </strong>Five regression models with penalties were applied in the additive hazards model and jointly found 9 genes including <i>SNU13</i>, <i>CLINT1</i>, <i>MAPK9</i>, <i>ABCC5</i>, <i>NKX3</i>-1, <i>NCOR2</i>, <i>COL2A1</i>, and <i>ZNF219</i>. According the median of the prognostic index calculated using the regression coefficients of the penalized additive hazards model, the patients were labeled as high/low risk groups. A significant difference was detected in the survival curves of the identified groups. The selected genes were examined using validation data and were significantly associated with the hazard of metastasis.</p><p><strong>Conclusion: </strong>This study showed that <i>MAPK9</i>, <i>NKX3</i>-1, <i>NCOR1</i>, <i>ABCC5</i>, and <i>CD44</i> are the potential recurrence and metastatic predictors in breast cancer and can be taken into account as candidates for further research in tumorigenesis, invasion, metastasis, and epithelial-mesenchymal transition of breast cancer.</p>","PeriodicalId":35418,"journal":{"name":"Cancer Informatics","volume":null,"pages":null},"PeriodicalIF":2.4000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/c1/12/10.1177_11769351231157942.PMC10034277.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/11769351231157942","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Breast cancer (BC) has been reported as one of the most common cancers diagnosed in females throughout the world. Survival rate of BC patients is affected by metastasis. So, exploring its underlying mechanisms and identifying related biomarkers to monitor BC relapse/recurrence using new statistical methods is essential. This study investigated the high-dimensional gene-expression profiles of BC patients using penalized additive hazards regression models.
Methods: A publicly available dataset related to the time to metastasis in BC patients (GSE2034) was used. There was information of 22 283 genes expression profiles related to 286 BC patients. Penalized additive hazards regression models with different penalties, including LASSO, SCAD, SICA, MCP and Elastic net were used to identify metastasis related genes.
Results: Five regression models with penalties were applied in the additive hazards model and jointly found 9 genes including SNU13, CLINT1, MAPK9, ABCC5, NKX3-1, NCOR2, COL2A1, and ZNF219. According the median of the prognostic index calculated using the regression coefficients of the penalized additive hazards model, the patients were labeled as high/low risk groups. A significant difference was detected in the survival curves of the identified groups. The selected genes were examined using validation data and were significantly associated with the hazard of metastasis.
Conclusion: This study showed that MAPK9, NKX3-1, NCOR1, ABCC5, and CD44 are the potential recurrence and metastatic predictors in breast cancer and can be taken into account as candidates for further research in tumorigenesis, invasion, metastasis, and epithelial-mesenchymal transition of breast cancer.
期刊介绍:
The field of cancer research relies on advances in many other disciplines, including omics technology, mass spectrometry, radio imaging, computer science, and biostatistics. Cancer Informatics provides open access to peer-reviewed high-quality manuscripts reporting bioinformatics analysis of molecular genetics and/or clinical data pertaining to cancer, emphasizing the use of machine learning, artificial intelligence, statistical algorithms, advanced imaging techniques, data visualization, and high-throughput technologies. As the leading journal dedicated exclusively to the report of the use of computational methods in cancer research and practice, Cancer Informatics leverages methodological improvements in systems biology, genomics, proteomics, metabolomics, and molecular biochemistry into the fields of cancer detection, treatment, classification, risk-prediction, prevention, outcome, and modeling.