[基于机器学习的乳腺浸润性癌蛋白编码基因标志物鉴定]。

Yue Wu, Kai-Yuan Min, Jiang-Feng Liu, Wan-Feng Liang, Ye-Hong Yang, Gang Hu, Jun-Tao Yang
{"title":"[基于机器学习的乳腺浸润性癌蛋白编码基因标志物鉴定]。","authors":"Yue Wu, Kai-Yuan Min, Jiang-Feng Liu, Wan-Feng Liang, Ye-Hong Yang, Gang Hu, Jun-Tao Yang","doi":"10.3881/j.issn.1000-503X.15717","DOIUrl":null,"url":null,"abstract":"<p><p>Objective To screen out the biomarkers linked to prognosis of breast invasive carcinoma based on the analysis of transcriptome data by random forest (RF),extreme gradient boosting (XGBoost),light gradient boosting machine (LightGBM),and categorical boosting (CatBoost). Methods We obtained the expression data of breast invasive carcinoma from The Cancer Genome Atlas and employed DESeq2,<i>t</i>-test,and Cox univariate analysis to identify the differentially expressed protein-coding genes associated with survival prognosis in human breast invasive carcinoma samples.Furthermore,RF,XGBoost,LightGBM,and CatBoost models were established to mine the protein-coding gene markers related to the prognosis of breast invasive cancer and the model performance was compared.The expression data of breast cancer from the Gene Expression Omnibus was used for validation. Results A total of 151 differentially expressed protein-coding genes related to survival prognosis were screened out.The machine learning model established with C3orf80,UGP2,and SPC25 demonstrated the best performance. Conclusions Three protein-coding genes (UGP2,C3orf80,and SPC25) were screened out to identify breast invasive carcinoma.This study provides a new direction for the treatment and diagnosis of breast invasive carcinoma.</p>","PeriodicalId":6919,"journal":{"name":"中国医学科学院学报","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"[Identification of Protein-Coding Gene Markers in Breast Invasive Carcinoma Based on Machine Learning].\",\"authors\":\"Yue Wu, Kai-Yuan Min, Jiang-Feng Liu, Wan-Feng Liang, Ye-Hong Yang, Gang Hu, Jun-Tao Yang\",\"doi\":\"10.3881/j.issn.1000-503X.15717\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Objective To screen out the biomarkers linked to prognosis of breast invasive carcinoma based on the analysis of transcriptome data by random forest (RF),extreme gradient boosting (XGBoost),light gradient boosting machine (LightGBM),and categorical boosting (CatBoost). Methods We obtained the expression data of breast invasive carcinoma from The Cancer Genome Atlas and employed DESeq2,<i>t</i>-test,and Cox univariate analysis to identify the differentially expressed protein-coding genes associated with survival prognosis in human breast invasive carcinoma samples.Furthermore,RF,XGBoost,LightGBM,and CatBoost models were established to mine the protein-coding gene markers related to the prognosis of breast invasive cancer and the model performance was compared.The expression data of breast cancer from the Gene Expression Omnibus was used for validation. Results A total of 151 differentially expressed protein-coding genes related to survival prognosis were screened out.The machine learning model established with C3orf80,UGP2,and SPC25 demonstrated the best performance. Conclusions Three protein-coding genes (UGP2,C3orf80,and SPC25) were screened out to identify breast invasive carcinoma.This study provides a new direction for the treatment and diagnosis of breast invasive carcinoma.</p>\",\"PeriodicalId\":6919,\"journal\":{\"name\":\"中国医学科学院学报\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-04-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"中国医学科学院学报\",\"FirstCategoryId\":\"1087\",\"ListUrlMain\":\"https://doi.org/10.3881/j.issn.1000-503X.15717\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"中国医学科学院学报","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.3881/j.issn.1000-503X.15717","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

目的 基于随机森林(RF)、极梯度增强(XGBoost)、光梯度增强机(LightGBM)和分类增强(CatBoost)对转录组数据的分析,筛选出与乳腺浸润癌预后相关的生物标志物。方法 我们从癌症基因组图谱(The Cancer Genome Atlas)中获得了乳腺浸润癌的表达数据,并采用 DESeq2、t 检验和 Cox 单变量分析方法确定了人类乳腺浸润癌样本中与生存预后相关的差异表达蛋白编码基因。此外,还建立了 RF、XGBoost、LightGBM 和 CatBoost 模型来挖掘与乳腺浸润癌预后相关的蛋白编码基因标记,并比较了模型的性能。结果 共筛选出 151 个与生存预后相关的差异表达蛋白编码基因,其中以 C3orf80、UGP2 和 SPC25 建立的机器学习模型表现最佳。结论 筛选出的三个蛋白编码基因(UGP2、C3orf80 和 SPC25)可用于鉴别乳腺浸润癌,该研究为乳腺浸润癌的治疗和诊断提供了新的方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
[Identification of Protein-Coding Gene Markers in Breast Invasive Carcinoma Based on Machine Learning].

Objective To screen out the biomarkers linked to prognosis of breast invasive carcinoma based on the analysis of transcriptome data by random forest (RF),extreme gradient boosting (XGBoost),light gradient boosting machine (LightGBM),and categorical boosting (CatBoost). Methods We obtained the expression data of breast invasive carcinoma from The Cancer Genome Atlas and employed DESeq2,t-test,and Cox univariate analysis to identify the differentially expressed protein-coding genes associated with survival prognosis in human breast invasive carcinoma samples.Furthermore,RF,XGBoost,LightGBM,and CatBoost models were established to mine the protein-coding gene markers related to the prognosis of breast invasive cancer and the model performance was compared.The expression data of breast cancer from the Gene Expression Omnibus was used for validation. Results A total of 151 differentially expressed protein-coding genes related to survival prognosis were screened out.The machine learning model established with C3orf80,UGP2,and SPC25 demonstrated the best performance. Conclusions Three protein-coding genes (UGP2,C3orf80,and SPC25) were screened out to identify breast invasive carcinoma.This study provides a new direction for the treatment and diagnosis of breast invasive carcinoma.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
中国医学科学院学报
中国医学科学院学报 Medicine-Medicine (all)
CiteScore
0.60
自引率
0.00%
发文量
6813
期刊介绍: Acta Academiae Medicinae Sinicae was founded in February 1979. It is a comprehensive medical academic journal published in China and abroad, supervised by the Ministry of Health of the People's Republic of China and sponsored by the Chinese Academy of Medical Sciences and Peking Union Medical College. The journal mainly reports the latest research results, work progress and dynamics in the fields of basic medicine, clinical medicine, pharmacy, preventive medicine, biomedicine, medical teaching and research, aiming to promote the exchange of medical information and improve the academic level of medicine. At present, the journal has been included in 10 famous foreign retrieval systems and their databases [Medline (PubMed online version), Elsevier, EMBASE, CA, WPRIM, ExtraMED, IC, JST, UPD and EBSCO-ASP]; and has been included in important domestic retrieval systems and databases [China Science Citation Database (Documentation and Information Center of the Chinese Academy of Sciences), China Core Journals Overview (Peking University Library), China Science and Technology Paper Statistical Source Database (China Science and Technology Core Journals) (China Institute of Scientific and Technological Information), China Science and Technology Journal Paper and Citation Database (China Institute of Scientific and Technological Information)].
期刊最新文献
Advances in Research on Application of Quantitative CT in Clinical Diagnosis and Treatment of Osteoporosis. Research Progress of Drugs in Prevention and Treatment of Nephrolithiasis. Thermal Ablation of Pulmonary Nodules by Electromagnetic Navigation Bronchoscopy Combined With Real-Time CT-Based 3D Fusion Navigation:Report of One Case. [Risk Factors for Returning of Pediatric Liver Transplant Recipients to the Intensive Care Unit]. [Development and Reliability and Validity Analysis of the Knowledge,Attitude,and Practice Evaluation Scale for Teachers' Early Childhood Sex Education].
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1