StackTHP: A stacking ensemble model for accurate prediction of tumor-homing peptides in cancer therapy

IF 6.3 2区 医学 Q1 BIOLOGY Computers in biology and medicine Pub Date : 2025-05-01 Epub Date: 2025-03-05 DOI:10.1016/j.compbiomed.2025.109958
Fazla Rabby Raihan , Lway Faisal Abdulrazak , Md. Ashikur Rahman , Md Mamun Ali , Sobhy M. Ibrahim , Kawsar Ahmed , Francis M. Bui , Imran Mahmud
{"title":"StackTHP: A stacking ensemble model for accurate prediction of tumor-homing peptides in cancer therapy","authors":"Fazla Rabby Raihan ,&nbsp;Lway Faisal Abdulrazak ,&nbsp;Md. Ashikur Rahman ,&nbsp;Md Mamun Ali ,&nbsp;Sobhy M. Ibrahim ,&nbsp;Kawsar Ahmed ,&nbsp;Francis M. Bui ,&nbsp;Imran Mahmud","doi":"10.1016/j.compbiomed.2025.109958","DOIUrl":null,"url":null,"abstract":"<div><div>The tumor-homing peptides (THPs) have emerged as one of the attractive resources for targeted cancer therapy, being able to bind and penetrate tumor cells selectively while ignoring adjacent healthy tissues. Therefore, the computational models to predict THPs became popular very rapidly, since laboratory methods are slow and resourceful. Herein, we are proposing StackTHP, a newly developed stacking-ensemble model aimed at further improving THP prediction accuracy. StackTHP implements multiple feature extraction methods, including amino acid composition (AAC), and pseudo amino acid composition (PAAC) together with classical machine learning classifiers like Extra Trees, Random Forest, and AdaBoost, while the logistic regression-based meta-classifier is used for the stacking framework. StackTHP outperformed all other models, producing an accuracy of 91.92 %, Matthew's correlation coefficient (MCC) of 0.8415, AUC of 0.977 on benchmark datasets, indicates that it is better than approaches attempted earlier and provides a robust solution for proceeding towards the discovery and development of peptide-based cancer therapies. Future research will focus on the application of StackTHP over more diverse sets of data along with some hybrid methods to enhance the prediction capability. The dataset and the code are available at the following link: <span><span>https://github.com/Ashikur562/StackTHP</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"189 ","pages":"Article 109958"},"PeriodicalIF":6.3000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525003099","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/3/5 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

The tumor-homing peptides (THPs) have emerged as one of the attractive resources for targeted cancer therapy, being able to bind and penetrate tumor cells selectively while ignoring adjacent healthy tissues. Therefore, the computational models to predict THPs became popular very rapidly, since laboratory methods are slow and resourceful. Herein, we are proposing StackTHP, a newly developed stacking-ensemble model aimed at further improving THP prediction accuracy. StackTHP implements multiple feature extraction methods, including amino acid composition (AAC), and pseudo amino acid composition (PAAC) together with classical machine learning classifiers like Extra Trees, Random Forest, and AdaBoost, while the logistic regression-based meta-classifier is used for the stacking framework. StackTHP outperformed all other models, producing an accuracy of 91.92 %, Matthew's correlation coefficient (MCC) of 0.8415, AUC of 0.977 on benchmark datasets, indicates that it is better than approaches attempted earlier and provides a robust solution for proceeding towards the discovery and development of peptide-based cancer therapies. Future research will focus on the application of StackTHP over more diverse sets of data along with some hybrid methods to enhance the prediction capability. The dataset and the code are available at the following link: https://github.com/Ashikur562/StackTHP.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
StackTHP:用于准确预测肿瘤归巢肽在癌症治疗中的堆叠集成模型
肿瘤归巢肽(tumor-homing peptide, THPs)由于能够选择性地结合和穿透肿瘤细胞而忽略邻近的健康组织,已成为靶向癌症治疗的重要资源之一。因此,预测THPs的计算模型变得非常流行,因为实验室方法缓慢且资源丰富。在此,我们提出了一种新开发的堆栈集成模型StackTHP,旨在进一步提高THP的预测精度。StackTHP与Extra Trees、Random Forest、AdaBoost等经典机器学习分类器一起实现了氨基酸组成(AAC)、伪氨基酸组成(PAAC)等多种特征提取方法,而堆叠框架则采用基于逻辑回归的元分类器。StackTHP优于所有其他模型,准确率为91.92%,马修相关系数(MCC)为0.8415,基准数据集的AUC为0.977,表明它比之前尝试的方法更好,并为发现和开发基于肽的癌症治疗提供了一个强大的解决方案。未来的研究将集中在StackTHP在更多样化的数据集上的应用,以及一些混合方法来增强预测能力。数据集和代码可从以下链接获得:https://github.com/Ashikur562/StackTHP。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Computers in biology and medicine
Computers in biology and medicine 工程技术-工程:生物医学
CiteScore
11.70
自引率
10.40%
发文量
1086
审稿时长
74 days
期刊介绍: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.
期刊最新文献
Cervical cancer image analysis: Detection and segmentation using self-guided quantum GANs and musical chairs optimization Integrative adaptive indexes from noisy routine haematological markers can predict and discriminate health status and biological age Multiphysics learning with graph neural networks for thrombosis prediction in intracranial aneurysms Long-term clonal analysis using stochastic models reveals heterogeneity and quiescence of hematopoietic stem cells A lightweight and explainable cardiac signal framework for screening-oriented cardiometabolic risk assessment
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1