使用集合技术的机器学习预测前列腺癌的总生存期

IF 6.3 2区 医学 Q1 BIOLOGY Computers in biology and medicine Pub Date : 2025-05-01 Epub Date: 2025-03-12 DOI:10.1016/j.compbiomed.2025.110008
Declan Ikechukwu Emegano , Mubarak Taiwo Mustapha , Dilber Uzun Ozsahin , Ilker Ozsahin
{"title":"使用集合技术的机器学习预测前列腺癌的总生存期","authors":"Declan Ikechukwu Emegano ,&nbsp;Mubarak Taiwo Mustapha ,&nbsp;Dilber Uzun Ozsahin ,&nbsp;Ilker Ozsahin","doi":"10.1016/j.compbiomed.2025.110008","DOIUrl":null,"url":null,"abstract":"<div><div>Prostate adenocarcinoma (PAC) is a complex and common cancer in males and is one of the leading causes of cancer-related death globally. PAC is a multifaceted disease that encompasses different subtypes, including acinar and ductal adenocarcinoma, small cell carcinoma, neuroendocrine tumors, and transitional cell carcinoma with each subtype presenting distinct prognostic difficulties. Therefore, predicting the overall survival (OS) rate of individuals with PAC continues to be a substantial clinical barrier due to the diverse nature of the illness, coexisting medical conditions, and constraints associated with conventional diagnostic markers. As a result, we focus on using ensemble machine learning (ML) models to predict the OS of PAC patients.</div><div>We evaluated these eight (8) ensemble ML models: Random Forest (RF), AdaBoost, Gradient Boosting (GB), Extreme Gradient Boosting (XGB), LightGBM (LGBM), CatBoost, Hard Voting Classifier (HVC), and Support Vector Classifier (SVC), using the data set obtained from the Cancer Genome Atlas (TCGA) PanCancer Atlas. The ensemble ML models were evaluated using essential performance indicators, such as accuracy, precision, recall, F-1 score, and ROC AUC score. The results show that GB outperformed other models by obtaining a perfect score of 1.0 in accuracy, precision, recall, and F-1 score, and 0.99 as ROC AUC. Similarly, RF and AdaBoost exhibited robust efficiency, suggesting their potential in healthcare settings for predicting PAC survival. In conclusion, the study highlights the importance of ensemble techniques in improving prediction precision and underscores the need for further research in clinical settings.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"189 ","pages":"Article 110008"},"PeriodicalIF":6.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning prediction of overall survival in prostate adenocarcinoma using ensemble techniques\",\"authors\":\"Declan Ikechukwu Emegano ,&nbsp;Mubarak Taiwo Mustapha ,&nbsp;Dilber Uzun Ozsahin ,&nbsp;Ilker Ozsahin\",\"doi\":\"10.1016/j.compbiomed.2025.110008\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Prostate adenocarcinoma (PAC) is a complex and common cancer in males and is one of the leading causes of cancer-related death globally. PAC is a multifaceted disease that encompasses different subtypes, including acinar and ductal adenocarcinoma, small cell carcinoma, neuroendocrine tumors, and transitional cell carcinoma with each subtype presenting distinct prognostic difficulties. Therefore, predicting the overall survival (OS) rate of individuals with PAC continues to be a substantial clinical barrier due to the diverse nature of the illness, coexisting medical conditions, and constraints associated with conventional diagnostic markers. As a result, we focus on using ensemble machine learning (ML) models to predict the OS of PAC patients.</div><div>We evaluated these eight (8) ensemble ML models: Random Forest (RF), AdaBoost, Gradient Boosting (GB), Extreme Gradient Boosting (XGB), LightGBM (LGBM), CatBoost, Hard Voting Classifier (HVC), and Support Vector Classifier (SVC), using the data set obtained from the Cancer Genome Atlas (TCGA) PanCancer Atlas. The ensemble ML models were evaluated using essential performance indicators, such as accuracy, precision, recall, F-1 score, and ROC AUC score. The results show that GB outperformed other models by obtaining a perfect score of 1.0 in accuracy, precision, recall, and F-1 score, and 0.99 as ROC AUC. Similarly, RF and AdaBoost exhibited robust efficiency, suggesting their potential in healthcare settings for predicting PAC survival. In conclusion, the study highlights the importance of ensemble techniques in improving prediction precision and underscores the need for further research in clinical settings.</div></div>\",\"PeriodicalId\":10578,\"journal\":{\"name\":\"Computers in biology and medicine\",\"volume\":\"189 \",\"pages\":\"Article 110008\"},\"PeriodicalIF\":6.3000,\"publicationDate\":\"2025-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers in biology and medicine\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0010482525003592\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/3/12 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525003592","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/12 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

前列腺癌(PAC)是一种复杂而常见的男性癌症,是全球癌症相关死亡的主要原因之一。PAC是一种多面性疾病,包括不同的亚型,包括腺泡和导管腺癌、小细胞癌、神经内分泌肿瘤和移行细胞癌,每种亚型都有不同的预后困难。因此,由于疾病的多样性、共存的医疗条件以及与传统诊断标志物相关的限制,预测PAC患者的总生存率(OS)仍然是一个重大的临床障碍。因此,我们专注于使用集成机器学习(ML)模型来预测PAC患者的OS。我们评估了这八种集成ML模型:随机森林(RF)、AdaBoost、梯度增强(GB)、极端梯度增强(XGB)、LightGBM (LGBM)、CatBoost、硬投票分类器(HVC)和支持向量分类器(SVC),使用的数据集来自癌症基因组图集(TCGA) PanCancer Atlas。使用基本性能指标对集成ML模型进行评估,如准确性、精密度、召回率、F-1评分和ROC AUC评分。结果表明,GB模型在正确率、精密度、召回率和F-1得分上均获得1.0的满分,ROC AUC为0.99,优于其他模型。同样,RF和AdaBoost表现出强大的效率,表明它们在医疗保健环境中预测PAC生存的潜力。总之,该研究强调了集成技术在提高预测精度方面的重要性,并强调了在临床环境中进一步研究的必要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

摘要图片

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Machine learning prediction of overall survival in prostate adenocarcinoma using ensemble techniques
Prostate adenocarcinoma (PAC) is a complex and common cancer in males and is one of the leading causes of cancer-related death globally. PAC is a multifaceted disease that encompasses different subtypes, including acinar and ductal adenocarcinoma, small cell carcinoma, neuroendocrine tumors, and transitional cell carcinoma with each subtype presenting distinct prognostic difficulties. Therefore, predicting the overall survival (OS) rate of individuals with PAC continues to be a substantial clinical barrier due to the diverse nature of the illness, coexisting medical conditions, and constraints associated with conventional diagnostic markers. As a result, we focus on using ensemble machine learning (ML) models to predict the OS of PAC patients.
We evaluated these eight (8) ensemble ML models: Random Forest (RF), AdaBoost, Gradient Boosting (GB), Extreme Gradient Boosting (XGB), LightGBM (LGBM), CatBoost, Hard Voting Classifier (HVC), and Support Vector Classifier (SVC), using the data set obtained from the Cancer Genome Atlas (TCGA) PanCancer Atlas. The ensemble ML models were evaluated using essential performance indicators, such as accuracy, precision, recall, F-1 score, and ROC AUC score. The results show that GB outperformed other models by obtaining a perfect score of 1.0 in accuracy, precision, recall, and F-1 score, and 0.99 as ROC AUC. Similarly, RF and AdaBoost exhibited robust efficiency, suggesting their potential in healthcare settings for predicting PAC survival. In conclusion, the study highlights the importance of ensemble techniques in improving prediction precision and underscores the need for further research in clinical settings.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computers in biology and medicine
Computers in biology and medicine 工程技术-工程:生物医学
CiteScore
11.70
自引率
10.40%
发文量
1086
审稿时长
74 days
期刊介绍: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.
期刊最新文献
Prediction of tumor control probability in prostate cancer radiotherapy using a biophysical model incorporating cancer stem cell and hypoxia A multiscale modeling approach to study the role of mechanics and inflammation in the pathophysiology of articular cartilage Comparative analysis of traditional and deep learning time series architectures for influenza A infectious disease forecasting Policy optimization for multi-variant COVID-19 pandemic under unequal isolation and booster shot Model-based, data-driven and hybrid biometric person authentication using cardiac signals of electrical, mechanical and optical nature
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1