A hierarchical neural network for predicting protein functions

J. C. Nievola, E. Paraiso, A. Freitas
{"title":"A hierarchical neural network for predicting protein functions","authors":"J. C. Nievola, E. Paraiso, A. Freitas","doi":"10.1109/BIBE.2015.7367651","DOIUrl":null,"url":null,"abstract":"This paper introduces the use of a modified feedforward neural network to cope with the problem of predicting protein functions. Since this kind of classification task is inherently hierarchical, this work proposes the use of two different architectures for the modified feedforward neural network, both mimicking the hierarchical nature of the classes (protein functions) to be predicted. The first approach consists of four feed-forward neural networks in cascade, each one taking as input the classification obtained by the previous network, which means, the input to a network is the classes that could be assigned to the protein at the immediately higher (parent) level in the class hierarchy. The second approach is an extension of the first one, which also adds as input to each sub-network the attributes of the protein being classified. In both situations, it was used two kinds of feed-forward architectures: an Adaline network, which is composed of a single layer of adjustable weights, and a MLP (\"Multi-Layer Perceptron\"), composed by two layers of adjustable weights. Both approaches were compared with a baseline consisting of a single MLP that maps the input attributes to the classes of the lowest level in the hierarchy. The MLP was built with the input layer, plus one hidden layer and one output layer. The three approaches were compared on eight datasets, the first four involving the prediction of GPCR (G-Protein Coupled Receptor) functions and the second four datasets involving the prediction of enzymes functions. The results show that a big-bang hierarchical neural network, based on the MLP paradigm, using a top-down evaluation for new instances has better behavior in hierarchical problems, when compared to its flat version.","PeriodicalId":422807,"journal":{"name":"2015 IEEE 15th International Conference on Bioinformatics and Bioengineering (BIBE)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 15th International Conference on Bioinformatics and Bioengineering (BIBE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBE.2015.7367651","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

This paper introduces the use of a modified feedforward neural network to cope with the problem of predicting protein functions. Since this kind of classification task is inherently hierarchical, this work proposes the use of two different architectures for the modified feedforward neural network, both mimicking the hierarchical nature of the classes (protein functions) to be predicted. The first approach consists of four feed-forward neural networks in cascade, each one taking as input the classification obtained by the previous network, which means, the input to a network is the classes that could be assigned to the protein at the immediately higher (parent) level in the class hierarchy. The second approach is an extension of the first one, which also adds as input to each sub-network the attributes of the protein being classified. In both situations, it was used two kinds of feed-forward architectures: an Adaline network, which is composed of a single layer of adjustable weights, and a MLP ("Multi-Layer Perceptron"), composed by two layers of adjustable weights. Both approaches were compared with a baseline consisting of a single MLP that maps the input attributes to the classes of the lowest level in the hierarchy. The MLP was built with the input layer, plus one hidden layer and one output layer. The three approaches were compared on eight datasets, the first four involving the prediction of GPCR (G-Protein Coupled Receptor) functions and the second four datasets involving the prediction of enzymes functions. The results show that a big-bang hierarchical neural network, based on the MLP paradigm, using a top-down evaluation for new instances has better behavior in hierarchical problems, when compared to its flat version.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
预测蛋白质功能的层次神经网络
本文介绍了一种改进的前馈神经网络来解决蛋白质功能的预测问题。由于这种分类任务本质上是分层的,因此这项工作提出了对改进的前馈神经网络使用两种不同的架构,两者都模仿要预测的类(蛋白质功能)的分层性质。第一种方法由4个前馈级联神经网络组成,每个神经网络都以前一个网络获得的分类作为输入,这意味着,网络的输入是在类层次结构中可以分配给蛋白质的更高(亲本)层次的类。第二种方法是第一种方法的扩展,它也将被分类蛋白质的属性作为输入添加到每个子网络中。在这两种情况下,它都使用了两种前馈架构:由单层可调权值组成的Adaline网络和由两层可调权值组成的MLP(多层感知器)。将这两种方法与由单个MLP组成的基线进行比较,该MLP将输入属性映射到层次结构中最低级别的类。MLP由输入层、隐藏层和输出层组成。在8个数据集上对这三种方法进行了比较,前四个数据集涉及GPCR (g蛋白偶联受体)功能的预测,后四个数据集涉及酶功能的预测。结果表明,基于MLP范式的大爆炸层次神经网络,对新实例使用自顶向下评估,与扁平版本相比,在层次问题中具有更好的行为。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Automated SOSORT-recommended angles measurement in patients with adolescent idiopathic scoliosis Estimating changes in a cognitive performance using heart rate variability Some examples on the performance of density functional theory in the description of bioinorganic systems and processes Modeling the metabolism of escherichia coli under oxygen gradients with dynamically changing flux bounds An automated approach to conduct effective on-site presumptive drug tests
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1