iMPT-FRAKEL: A Simple Multi-label Web-server that Only Uses Fingerprints to Identify which Metabolic Pathway Types Compounds can Participate In

Q3 Computer Science Open Bioinformatics Journal Pub Date : 2020-08-18 DOI:10.2174/1875036202013010083
Yan-juan Jia, Lei Chen, Jian-Peng Zhou, Min Liu
{"title":"iMPT-FRAKEL: A Simple Multi-label Web-server that Only Uses Fingerprints to Identify which Metabolic Pathway Types Compounds can Participate In","authors":"Yan-juan Jia, Lei Chen, Jian-Peng Zhou, Min Liu","doi":"10.2174/1875036202013010083","DOIUrl":null,"url":null,"abstract":"\n \n Metabolic pathway is one of the most basic biological pathways in living organisms. It consists of a series of chemical reactions and provides the necessary molecules and energies for organisms. To date, lots of metabolic pathways have been detected. However, there still exist hidden participants (compounds and enzymes) for some metabolic pathways due to the complexity and diversity of metabolic pathways. It is necessary to develop quick, reliable, and non-animal-involved prediction model to recognize metabolic pathways for any compound.\n \n \n \n In this study, a multi-label classifier, namely iMPT-FRAKEL, was developed for identifying which metabolic pathway types that compounds can participate in. Compounds and 12 metabolic pathway types were retrieved from KEGG. Each compound was represented by its fingerprints, which was the most widely used form for representing compounds and can be extracted from its SMILES format. A popular multi-label classification scheme, Random k-Labelsets (RAKEL) algorithm, was adopted to build the classifier. Classic machine learning algorithm, Support Vector Machine (SVM) with RBF kernel, was selected as the basic classification algorithm. Ten-fold cross-validation was used to evaluate the performance of the iMPT-FRAKEL. In addition, a web-server version of such classifier was set up, which can be assessed at http://cie.shmtu.edu.cn/impt/index.\n \n \n \n iMPT-FRAKEL yielded the accuracy of 0.804, exact match of 0.745 and hamming loss of 0.039. Comparison results indicated that such classifier was superior to other models, including models with Binary Relevance (BR) or other classification algorithms.\n \n \n \n The proposed classifier employed limited prior knowledge of compounds but gives satisfying performance for recognizing metabolic pathways of compounds.\n","PeriodicalId":38956,"journal":{"name":"Open Bioinformatics Journal","volume":"1 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2020-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Open Bioinformatics Journal","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2174/1875036202013010083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 4

Abstract

Metabolic pathway is one of the most basic biological pathways in living organisms. It consists of a series of chemical reactions and provides the necessary molecules and energies for organisms. To date, lots of metabolic pathways have been detected. However, there still exist hidden participants (compounds and enzymes) for some metabolic pathways due to the complexity and diversity of metabolic pathways. It is necessary to develop quick, reliable, and non-animal-involved prediction model to recognize metabolic pathways for any compound. In this study, a multi-label classifier, namely iMPT-FRAKEL, was developed for identifying which metabolic pathway types that compounds can participate in. Compounds and 12 metabolic pathway types were retrieved from KEGG. Each compound was represented by its fingerprints, which was the most widely used form for representing compounds and can be extracted from its SMILES format. A popular multi-label classification scheme, Random k-Labelsets (RAKEL) algorithm, was adopted to build the classifier. Classic machine learning algorithm, Support Vector Machine (SVM) with RBF kernel, was selected as the basic classification algorithm. Ten-fold cross-validation was used to evaluate the performance of the iMPT-FRAKEL. In addition, a web-server version of such classifier was set up, which can be assessed at http://cie.shmtu.edu.cn/impt/index. iMPT-FRAKEL yielded the accuracy of 0.804, exact match of 0.745 and hamming loss of 0.039. Comparison results indicated that such classifier was superior to other models, including models with Binary Relevance (BR) or other classification algorithms. The proposed classifier employed limited prior knowledge of compounds but gives satisfying performance for recognizing metabolic pathways of compounds.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
iMPT-FRAKEL:一个简单的多标签网络服务器,仅使用指纹来识别化合物可以参与的代谢途径类型
代谢途径是生物体内最基本的生物途径之一。它由一系列化学反应组成,为生物体提供必要的分子和能量。迄今为止,已经发现了许多代谢途径。然而,由于代谢途径的复杂性和多样性,一些代谢途径仍然存在隐藏的参与者(化合物和酶)。开发快速、可靠、不涉及动物的预测模型来识别任何化合物的代谢途径是很有必要的。在本研究中,我们开发了一个多标签分类器,即iMPT-FRAKEL,用于识别化合物可以参与哪些代谢途径类型。从KEGG中检索到化合物和12种代谢途径类型。每种化合物都用指纹图谱表示,指纹图谱是最广泛使用的表示化合物的形式,可以从其SMILES格式中提取。采用了一种流行的多标签分类方案Random k-Labelsets (RAKEL)算法来构建分类器。选择经典的机器学习算法——RBF核支持向量机(SVM)作为基本分类算法。采用10倍交叉验证来评价iMPT-FRAKEL的性能。此外,还建立了该分类器的web服务器版本,可以在http://cie.shmtu.edu.cn/impt/index上进行评估。iMPT-FRAKEL的准确度为0.804,精确匹配度为0.745,汉明损失为0.039。对比结果表明,该分类器优于其他模型,包括具有二元相关性(Binary Relevance, BR)的模型或其他分类算法。该分类器利用了有限的化合物先验知识,但在识别化合物代谢途径方面取得了令人满意的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Open Bioinformatics Journal
Open Bioinformatics Journal Computer Science-Computer Science (miscellaneous)
CiteScore
2.40
自引率
0.00%
发文量
4
期刊介绍: The Open Bioinformatics Journal is an Open Access online journal, which publishes research articles, reviews/mini-reviews, letters, clinical trial studies and guest edited single topic issues in all areas of bioinformatics and computational biology. The coverage includes biomedicine, focusing on large data acquisition, analysis and curation, computational and statistical methods for the modeling and analysis of biological data, and descriptions of new algorithms and databases. The Open Bioinformatics Journal, a peer reviewed journal, is an important and reliable source of current information on the developments in the field. The emphasis will be on publishing quality articles rapidly and freely available worldwide.
期刊最新文献
Decision-making Support System for Predicting and Eliminating Malnutrition and Anemia Immunoinformatics Approach for the Design of Chimeric Vaccine Against Whitmore Disease A New Deep Learning Model based on Neuroimaging for Predicting Alzheimer's Disease Early Prediction of Covid-19 Samples from Chest X-ray Images using Deep Learning Approach Electronic Health Record (EHR) System Development for Study on EHR Data-based Early Prediction of Diabetes Using Machine Learning Algorithms
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1