AntiT2DMP-Pred: Leveraging feature fusion and optimization for superior machine learning prediction of type 2 diabetes mellitus

IF 4.2 3区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS Methods Pub Date : 2025-02-01 DOI:10.1016/j.ymeth.2025.01.003
Shaherin Basith , Balachandran Manavalan , Gwang Lee
{"title":"AntiT2DMP-Pred: Leveraging feature fusion and optimization for superior machine learning prediction of type 2 diabetes mellitus","authors":"Shaherin Basith ,&nbsp;Balachandran Manavalan ,&nbsp;Gwang Lee","doi":"10.1016/j.ymeth.2025.01.003","DOIUrl":null,"url":null,"abstract":"<div><div>Pancreatic α-amylase breaks down starch into isomaltose and maltose, which are further hydrolyzed by α-glucosidase in the intestine into monosaccharides, rapidly raising blood sugar levels and contributing to type 2 diabetes mellitus (T2DM). Synthetic inhibitors of carbohydrate-digesting enzymes are used to manage T2DM but may harm organ function over time. Bioactive peptides offer a safer alternative, avoiding such adverse effects. Computational methods for predicting antidiabetic peptides (ADPs) can significantly reduce the time and cost of experimental testing. While machine learning (ML) has been applied to identify ADPs, advancements in data analysis and algorithms continue to drive progress in the field. To address this, we developed AntiT2DMP-Pred, the first ML-based tool specifically designed for predicting type 2 antidiabetic peptides (T2ADPs). This tool employs a feature fusion strategy, combining ten highly discriminative feature descriptors chosen from a pool of 32 descriptors and eight ML algorithms, tested across a range of baseline models. AntiT2DMP-Pred demonstrated excellent performance, surpassing both baseline and feature-optimized models, with an accuracy (ACC) and Matthews’ correlation coefficient (MCC) of 0.976 and 0.953 on the training dataset, and an ACC and MCC of 0.957 and 0.851 on the independent dataset. The web server (<span><span>https://balalab-skku.org/AntiT2DMP-Pred</span><svg><path></path></svg></span>) is freely accessible, enabling researchers worldwide to utilize it in their experimental workflows and contribute to the discovery and understanding of T2ADPs, ultimately supporting peptide-based therapeutic development for diabetes management.</div></div>","PeriodicalId":390,"journal":{"name":"Methods","volume":"234 ","pages":"Pages 264-274"},"PeriodicalIF":4.2000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1046202325000039","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Pancreatic α-amylase breaks down starch into isomaltose and maltose, which are further hydrolyzed by α-glucosidase in the intestine into monosaccharides, rapidly raising blood sugar levels and contributing to type 2 diabetes mellitus (T2DM). Synthetic inhibitors of carbohydrate-digesting enzymes are used to manage T2DM but may harm organ function over time. Bioactive peptides offer a safer alternative, avoiding such adverse effects. Computational methods for predicting antidiabetic peptides (ADPs) can significantly reduce the time and cost of experimental testing. While machine learning (ML) has been applied to identify ADPs, advancements in data analysis and algorithms continue to drive progress in the field. To address this, we developed AntiT2DMP-Pred, the first ML-based tool specifically designed for predicting type 2 antidiabetic peptides (T2ADPs). This tool employs a feature fusion strategy, combining ten highly discriminative feature descriptors chosen from a pool of 32 descriptors and eight ML algorithms, tested across a range of baseline models. AntiT2DMP-Pred demonstrated excellent performance, surpassing both baseline and feature-optimized models, with an accuracy (ACC) and Matthews’ correlation coefficient (MCC) of 0.976 and 0.953 on the training dataset, and an ACC and MCC of 0.957 and 0.851 on the independent dataset. The web server (https://balalab-skku.org/AntiT2DMP-Pred) is freely accessible, enabling researchers worldwide to utilize it in their experimental workflows and contribute to the discovery and understanding of T2ADPs, ultimately supporting peptide-based therapeutic development for diabetes management.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
AntiT2DMP-Pred:利用特征融合和优化进行2型糖尿病的卓越机器学习预测。
胰腺α-淀粉酶将淀粉分解成异麦芽糖和麦芽糖,在肠道内α-葡萄糖苷酶进一步水解成单糖,迅速升高血糖水平,导致2型糖尿病(T2DM)。碳水化合物消化酶的合成抑制剂用于控制2型糖尿病,但随着时间的推移可能会损害器官功能。生物活性肽提供了一个更安全的选择,避免了这样的副作用。预测抗糖尿病肽(ADPs)的计算方法可以显著减少实验测试的时间和成本。虽然机器学习(ML)已被应用于识别adp,但数据分析和算法的进步继续推动该领域的进步。为了解决这个问题,我们开发了AntiT2DMP-Pred,这是第一个专门用于预测2型抗糖尿病肽(T2ADPs)的基于ml的工具。该工具采用特征融合策略,结合从32个描述符和8个ML算法中选择的10个高度判别性的特征描述符,并在一系列基线模型中进行了测试。AntiT2DMP-Pred表现出优异的性能,超过了基线模型和特征优化模型,在训练数据集上的准确率(ACC)和马修斯相关系数(MCC)分别为0.976和0.953,在独立数据集上的ACC和MCC分别为0.957和0.851。web服务器(https://balalab-skku.org/AntiT2DMP-Pred)是免费访问的,使世界各地的研究人员能够在他们的实验工作流程中使用它,并有助于发现和了解t2adp,最终支持基于肽的糖尿病治疗开发。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Methods
Methods 生物-生化研究方法
CiteScore
9.80
自引率
2.10%
发文量
222
审稿时长
11.3 weeks
期刊介绍: Methods focuses on rapidly developing techniques in the experimental biological and medical sciences. Each topical issue, organized by a guest editor who is an expert in the area covered, consists solely of invited quality articles by specialist authors, many of them reviews. Issues are devoted to specific technical approaches with emphasis on clear detailed descriptions of protocols that allow them to be reproduced easily. The background information provided enables researchers to understand the principles underlying the methods; other helpful sections include comparisons of alternative methods giving the advantages and disadvantages of particular methods, guidance on avoiding potential pitfalls, and suggestions for troubleshooting.
期刊最新文献
Production of AFM wedged cantilevers for stress-relaxation experiments: Uniaxial loading of soft, spherical cells Latest trends & strategies in ocular drug delivery A high sensitivity assay of UBE3A ubiquitin ligase activity HybProm: An attention-assisted hybrid CNN-BiLSTM model for the interpretable prediction of DNA promoter A transferability-guided protein-ligand interaction prediction method
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1