Fer-COCL: A Novel Method Based on Multiple Deep Learning Algorithms for Identifying Fertility-Related Proteins

IF 2.9 2区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY Match-Communications in Mathematical and in Computer Chemistry Pub Date : 2023-07-01 DOI:10.46793/match.90-3.537z
Shenmin Zhang, Xinjie Li, Hongyan Shi, Yuanyuan Jing, Yunyun Liang, Yusen Zhang
{"title":"Fer-COCL: A Novel Method Based on Multiple Deep Learning Algorithms for Identifying Fertility-Related Proteins","authors":"Shenmin Zhang, Xinjie Li, Hongyan Shi, Yuanyuan Jing, Yunyun Liang, Yusen Zhang","doi":"10.46793/match.90-3.537z","DOIUrl":null,"url":null,"abstract":"The survival of species depends on the fertility of organisms. It is also worthwhile to study the proteins that can regulate the reproductive activity of organisms. Since biological experiments are laborious to confirm proteins, it has become a priority that develop relevant computational models to predict the function of fertility-related proteins. With the development of machine learning, pertinent various algorithms can be the key to identifying fertility-related proteins. In this work, we develop a model Fer-COCL based on deep learning. The model consists of multiple features as well as multiple deep learning algorithms. First, we extract features using Amino acid composition (AAC), Dipeptide composition (DPC), CTD transition (CTDT) and deviation between the dipeptide and the expected mean (DDE). After that, the spliced features are fed into the classifier. The data processed jointly by convolutional neural network and long short-term memory is input to the fully connected layer for classification. After evaluating the model using 10-fold cross-validation, the accuracy of the two data sets reaches 97.1% and 98.3%, respectively. The results indicate that the model is efficient and accurate, facilitating biologists' research on biological fertility. In addition, a free online tool for predicting the function of fertility-related proteins is available at http://fercocl.zhanglab.site/.","PeriodicalId":51115,"journal":{"name":"Match-Communications in Mathematical and in Computer Chemistry","volume":null,"pages":null},"PeriodicalIF":2.9000,"publicationDate":"2023-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Match-Communications in Mathematical and in Computer Chemistry","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.46793/match.90-3.537z","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

The survival of species depends on the fertility of organisms. It is also worthwhile to study the proteins that can regulate the reproductive activity of organisms. Since biological experiments are laborious to confirm proteins, it has become a priority that develop relevant computational models to predict the function of fertility-related proteins. With the development of machine learning, pertinent various algorithms can be the key to identifying fertility-related proteins. In this work, we develop a model Fer-COCL based on deep learning. The model consists of multiple features as well as multiple deep learning algorithms. First, we extract features using Amino acid composition (AAC), Dipeptide composition (DPC), CTD transition (CTDT) and deviation between the dipeptide and the expected mean (DDE). After that, the spliced features are fed into the classifier. The data processed jointly by convolutional neural network and long short-term memory is input to the fully connected layer for classification. After evaluating the model using 10-fold cross-validation, the accuracy of the two data sets reaches 97.1% and 98.3%, respectively. The results indicate that the model is efficient and accurate, facilitating biologists' research on biological fertility. In addition, a free online tool for predicting the function of fertility-related proteins is available at http://fercocl.zhanglab.site/.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ferc - cocl:一种基于多个深度学习算法的鉴定生育相关蛋白的新方法
物种的生存取决于生物体的繁殖力。研究能够调节生物体生殖活动的蛋白质也是值得的。由于生物实验很难确认蛋白质,因此开发相关的计算模型来预测生育相关蛋白质的功能已成为当务之急。随着机器学习的发展,相关的各种算法可以成为识别生育相关蛋白的关键。在这项工作中,我们开发了一个基于深度学习的Fer-COCL模型。该模型由多个特征和多个深度学习算法组成。首先,我们利用氨基酸组成(AAC)、二肽组成(DPC)、CTD过渡(CTDT)和二肽与预期均值之间的偏差(DDE)提取特征。之后,将拼接后的特征输入到分类器中。将卷积神经网络与长短期记忆共同处理的数据输入到全连接层进行分类。采用10倍交叉验证对模型进行评估后,两组数据集的准确率分别达到97.1%和98.3%。结果表明,该模型高效、准确,为生物学家研究生物生育力提供了方便。此外,一个预测生育相关蛋白质功能的免费在线工具可在http://fercocl.zhanglab.site/上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
4.40
自引率
26.90%
发文量
71
审稿时长
2 months
期刊介绍: MATCH Communications in Mathematical and in Computer Chemistry publishes papers of original research as well as reviews on chemically important mathematical results and non-routine applications of mathematical techniques to chemical problems. A paper acceptable for publication must contain non-trivial mathematics or communicate non-routine computer-based procedures AND have a clear connection to chemistry. Papers are published without any processing or publication charge.
期刊最新文献
ChemCNet: An Explainable Integrated Model for Intelligent Analyzing Chemistry Synthesis Reactions Asymptotic Distribution of Degree-Based Topological Indices Note on the Minimum Bond Incident Degree Indices of k-Cyclic Graphs Sombor Index of Hypergraphs The ABC Index Conundrum's Complete Solution
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1