基于Chou伪氨基酸组成概念的金属蛋白酶家族的机器学习预测。

Majid Mohammad Beigi, Mohaddeseh Behjati, Hassan Mohabatkar
{"title":"基于Chou伪氨基酸组成概念的金属蛋白酶家族的机器学习预测。","authors":"Majid Mohammad Beigi,&nbsp;Mohaddeseh Behjati,&nbsp;Hassan Mohabatkar","doi":"10.1007/s10969-011-9120-4","DOIUrl":null,"url":null,"abstract":"<p><p>Matrix metalloproteinase (MMPs) and disintegrin and metalloprotease (ADAMs) belong to the zinc-dependent metalloproteinase family of proteins. These proteins participate in various physiological and pathological states. Thus, prediction of these proteins using amino acid sequence would be helpful. We have developed a method to predict these proteins based on the features derived from Chou's pseudo amino acid composition (PseAAC) server and support vector machine (SVM) as a powerful machine learning approach. With this method, for ADAMs and MMPs families, an overall accuracy and Matthew's correlation coefficient (MCC) of 95.89 and 0.90% were achieved respectively. Furthermore, the method is able to predict two major subclasses of MMP family; Furin-activated secreted MMPs and Type II trans-membrane; with MCC of 0.89 and 0.91%, respectively. The overall accuracy for Furin-activated secreted MMPs and Type II trans-membrane was 98.18 and 99.07, respectively. Our data demonstrates an effective classification of Metalloproteinase family based on the concept of PseAAC and SVM.</p>","PeriodicalId":73957,"journal":{"name":"Journal of structural and functional genomics","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2011-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1007/s10969-011-9120-4","citationCount":"111","resultStr":"{\"title\":\"Prediction of metalloproteinase family based on the concept of Chou's pseudo amino acid composition using a machine learning approach.\",\"authors\":\"Majid Mohammad Beigi,&nbsp;Mohaddeseh Behjati,&nbsp;Hassan Mohabatkar\",\"doi\":\"10.1007/s10969-011-9120-4\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Matrix metalloproteinase (MMPs) and disintegrin and metalloprotease (ADAMs) belong to the zinc-dependent metalloproteinase family of proteins. These proteins participate in various physiological and pathological states. Thus, prediction of these proteins using amino acid sequence would be helpful. We have developed a method to predict these proteins based on the features derived from Chou's pseudo amino acid composition (PseAAC) server and support vector machine (SVM) as a powerful machine learning approach. With this method, for ADAMs and MMPs families, an overall accuracy and Matthew's correlation coefficient (MCC) of 95.89 and 0.90% were achieved respectively. Furthermore, the method is able to predict two major subclasses of MMP family; Furin-activated secreted MMPs and Type II trans-membrane; with MCC of 0.89 and 0.91%, respectively. The overall accuracy for Furin-activated secreted MMPs and Type II trans-membrane was 98.18 and 99.07, respectively. Our data demonstrates an effective classification of Metalloproteinase family based on the concept of PseAAC and SVM.</p>\",\"PeriodicalId\":73957,\"journal\":{\"name\":\"Journal of structural and functional genomics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1007/s10969-011-9120-4\",\"citationCount\":\"111\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of structural and functional genomics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1007/s10969-011-9120-4\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2011/12/3 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of structural and functional genomics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10969-011-9120-4","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2011/12/3 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 111

摘要

基质金属蛋白酶(MMPs)和崩解素和金属蛋白酶(ADAMs)属于锌依赖性金属蛋白酶家族。这些蛋白质参与各种生理和病理状态。因此,利用氨基酸序列对这些蛋白质进行预测是有帮助的。我们开发了一种基于Chou的伪氨基酸组成(PseAAC)服务器和支持向量机(SVM)的特征来预测这些蛋白质的方法,作为一种强大的机器学习方法。该方法对ADAMs和MMPs家族的总体准确率和马修相关系数(MCC)分别达到95.89和0.90%。此外,该方法能够预测MMP家族的两个主要亚类;furin激活的分泌型MMPs和II型跨膜;MCC分别为0.89和0.91%。furin激活的分泌型MMPs和II型跨膜的总体准确性分别为98.18和99.07。我们的数据证明了基于PseAAC和SVM概念的金属蛋白酶家族的有效分类。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Prediction of metalloproteinase family based on the concept of Chou's pseudo amino acid composition using a machine learning approach.

Matrix metalloproteinase (MMPs) and disintegrin and metalloprotease (ADAMs) belong to the zinc-dependent metalloproteinase family of proteins. These proteins participate in various physiological and pathological states. Thus, prediction of these proteins using amino acid sequence would be helpful. We have developed a method to predict these proteins based on the features derived from Chou's pseudo amino acid composition (PseAAC) server and support vector machine (SVM) as a powerful machine learning approach. With this method, for ADAMs and MMPs families, an overall accuracy and Matthew's correlation coefficient (MCC) of 95.89 and 0.90% were achieved respectively. Furthermore, the method is able to predict two major subclasses of MMP family; Furin-activated secreted MMPs and Type II trans-membrane; with MCC of 0.89 and 0.91%, respectively. The overall accuracy for Furin-activated secreted MMPs and Type II trans-membrane was 98.18 and 99.07, respectively. Our data demonstrates an effective classification of Metalloproteinase family based on the concept of PseAAC and SVM.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Structural Genomics: General Applications Classification of ligand molecules in PDB with graph match-based structural superposition HOMCOS: an updated server to search and model complex 3D structures. NLDB: a database for 3D protein-ligand interactions in enzymatic reactions. Toward the next step in G protein-coupled receptor research: a knowledge-driven analysis for the next potential targets in drug discovery
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1