首页 > 最新文献

Southeast Europe Journal of Soft Computing最新文献

英文 中文
Artificial Neural Networks in Bacteria Taxonomic Classification 人工神经网络在细菌分类中的应用
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.158
M. Can, Osman Gursoy
In 1980s, the face of the microbiology dramatically changed with the rRNA-based phylogenetic classifications, by Carl Woese. He delineated the three main branches of life. He used the technique not only to explore microbial diversity but also as a method for bacterial annotation. Today, rRNA-based analysis remains a central method in microbiology. Many researchers followed this track, using several new generations of Artificial Neural Networks they obtained high accuracies using available datasets of their time. Recently the number of known bacteria increased enormously. In this article we used ANN's to annotate bacterial 16S rRNA gene sequences from five selected phylums in Greengenes database taxonomy: Proteobacteria, Firmicutes, Bacteroidetes, Actinobacteria, and Chloroflexi. 93% average accuracy is obtained in classif-ications. When we used the bundle testing technique, the average accuracy easily raised to 100%.
20世纪80年代,卡尔·沃斯提出了基于rrna的系统发育分类,微生物学的面貌发生了巨大的变化。他描绘了生命的三个主要分支。他不仅将该技术用于探索微生物多样性,还将其作为细菌注释的方法。今天,基于rrna的分析仍然是微生物学的核心方法。许多研究人员沿着这条轨道,使用了几代新一代的人工神经网络,他们利用当时可用的数据集获得了很高的精度。最近,已知细菌的数量急剧增加。本文采用人工神经网络对Proteobacteria、Firmicutes、Bacteroidetes、Actinobacteria和Chloroflexi这5个门类的细菌16S rRNA基因序列进行了标注,平均准确率达到93%。当我们使用集束测试技术时,平均精度很容易提高到100%。
{"title":"Artificial Neural Networks in Bacteria Taxonomic Classification","authors":"M. Can, Osman Gursoy","doi":"10.21533/SCJOURNAL.V7I2.158","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.158","url":null,"abstract":"In 1980s, the face of the microbiology dramatically changed with the rRNA-based phylogenetic classifications, by Carl Woese. He delineated the three main branches of life. He used the technique not only to explore microbial diversity but also as a method for bacterial annotation. Today, rRNA-based analysis remains a central method in microbiology. Many researchers followed this track, using several new generations of Artificial Neural Networks they obtained high accuracies using available datasets of their time. Recently the number of known bacteria increased enormously. In this article we used ANN's to annotate bacterial 16S rRNA gene sequences from five selected phylums in Greengenes database taxonomy: Proteobacteria, Firmicutes, Bacteroidetes, Actinobacteria, and Chloroflexi. 93% average accuracy is obtained in classif-ications. When we used the bundle testing technique, the average accuracy easily raised to 100%.","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114428398","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Risk Exposition of Prices in Agricultural Commodities Using Options and Futures 利用期权和期货分析农产品价格的风险
Pub Date : 2018-11-28 DOI: 10.21533/scjournal.v7i2.164
J. Karabegović
Changes and fluctuations in commodity prices exert different effects on value chain participants, depending on the position they have in the chain. Agricultural commodities are exposed to a set of different factors influencing the prices of the commodities. They are influenced by the season, weather shocks, demand and supply forces, household income, tastes and preferences of the consumers. Observing the most recent history, high price fluctuations have been observed during the financial crisis in 2008. One out of many approaches for hedging the price risk is the usage of financial derivatives. This study will be concerned with the volatility modelling methods with the help of futures and options for corn and soya. Methods used for modelling the volatility a Black Scholes Implied Volatility. The simplest method in ARCH family, namely the GARCH (1,1) method will be used for modelling volatility based on the historical futures data dating back to 2005. The implied volatility is derived solving back the Black – Scholes Model, only this time looking for sigma. The sole purpose of the thesis is to examine which of the two methods has a better predictive power. Model comparison is done with the help of forecast regression models. The regression models have shown the difficulty in assessing which model has more accurate predictive power. The Adjusted R2 for both models in both cases is relatively low. However, the GARCH (1,1) model has slightly higher values for this indicator. Even the GARCH (1,1) model h a better performance, due to the relatively low adjusted R2 values, no stable conclusion regarding the model performance can be derived. assets(Hull, 2008). Options are financial instruments in the form of a contract that are traded between the writer of an option and an option holder, and it provides the re GARCH and
商品价格的变化和波动对价值链参与者产生不同的影响,取决于他们在价值链中的位置。农产品的价格受到一系列不同因素的影响。它们受季节、天气冲击、供需力量、家庭收入、消费者口味和偏好的影响。从最近的历史来看,2008年金融危机期间出现了高价格波动。许多对冲价格风险的方法之一是使用金融衍生品。本文以玉米和大豆期货期权为研究对象,探讨波动性建模方法。对布莱克斯科尔斯隐含波动率进行建模的方法。基于2005年以来的历史期货数据,我们将使用ARCH家族中最简单的方法GARCH(1,1)方法对波动率进行建模。隐含波动率是通过求解布莱克-斯科尔斯模型推导出来的,只是这次寻找的是西格玛。本文的唯一目的是检验这两种方法中哪一种具有更好的预测能力。利用预测回归模型进行模型比较。回归模型表明,很难评估哪种模型具有更准确的预测能力。在这两种情况下,两种型号的调整后R2都相对较低。然而,GARCH(1,1)模型对该指标的值略高。即使GARCH(1,1)模型具有较好的性能,但由于调整后的R2值相对较低,无法得出关于模型性能的稳定结论。资产(船体,2008)。期权是以合约的形式在期权出售者和期权持有人之间进行交易的金融工具,它提供了价格和价格
{"title":"Risk Exposition of Prices in Agricultural Commodities Using Options and Futures","authors":"J. Karabegović","doi":"10.21533/scjournal.v7i2.164","DOIUrl":"https://doi.org/10.21533/scjournal.v7i2.164","url":null,"abstract":"Changes and fluctuations in commodity prices exert different effects on value chain participants, depending on the position they have in the chain. Agricultural commodities are exposed to a set of different factors influencing the prices of the commodities. They are influenced by the season, weather shocks, demand and supply forces, household income, tastes and preferences of the consumers. Observing the most recent history, high price fluctuations have been observed during the financial crisis in 2008. One out of many approaches for hedging the price risk is the usage of financial derivatives. This study will be concerned with the volatility modelling methods with the help of futures and options for corn and soya. Methods used for modelling the volatility a Black Scholes Implied Volatility. The simplest method in ARCH family, namely the GARCH (1,1) method will be used for modelling volatility based on the historical futures data dating back to 2005. The implied volatility is derived solving back the Black – Scholes Model, only this time looking for sigma. The sole purpose of the thesis is to examine which of the two methods has a better predictive power. Model comparison is done with the help of forecast regression models. The regression models have shown the difficulty in assessing which model has more accurate predictive power. The Adjusted R2 for both models in both cases is relatively low. However, the GARCH (1,1) model has slightly higher values for this indicator. Even the GARCH (1,1) model h a better performance, due to the relatively low adjusted R2 values, no stable conclusion regarding the model performance can be derived. assets(Hull, 2008). Options are financial instruments in the form of a contract that are traded between the writer of an option and an option holder, and it provides the re GARCH and","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"152 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122429473","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Validation Tools for Predicted Linear B-Epitopes: Antigenicity 预测线性b表位的验证工具:抗原性
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.160
A. Abidi, M. Can
{"title":"Validation Tools for Predicted Linear B-Epitopes: Antigenicity","authors":"A. Abidi, M. Can","doi":"10.21533/SCJOURNAL.V7I2.160","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.160","url":null,"abstract":"","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"64 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116962272","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Authorship Authentication of Short Messages from Social Networks Using Recurrent Artificial Neural Networks 基于递归人工神经网络的社交网络短消息作者身份认证
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.163
N. M. Demir
Dataset consists of 17000 tweets collected from Twitter, as 500 tweets for each of 34 authors that meet certain criteria. Raw data is collected by using the software Nvivo. The collected raw data is preprocessed to extract frequencies of 200 features. In the data analysis 128 of features are eliminated since they are rare in tweets. As a progressive presentation, five – ten – fifteen – twenty - thirty and thirty four of these 34 authors are selected each time. Since recurrent artificial neural networks are more stable and iterations converge more quickly, in this work this architecture is preferred. In general, ANNs are more successful in distinguishing two classes, therefore for N authors, N×N neural networks are trained for pair wise classification. These N×N experts then organized as N special teams (CANNT) to aggregate decisions of these N×N experts. Number of authors is seen not so effective on the accuracy of the authentication, and around 80% accuracy is achieved for any number of authors.
数据集由从Twitter收集的17000条推文组成,34位符合特定标准的作者每人500条推文。使用Nvivo软件收集原始数据。对采集到的原始数据进行预处理,提取200个特征的频率。在数据分析中,128个特征被淘汰,因为它们在推文中很少见。作为一种递进的呈现,每次从这34位作者中选出5位、15位、23位和34位。由于循环人工神经网络更稳定,迭代收敛更快,在这项工作中,这种架构是首选的。一般来说,人工神经网络在区分两个类别方面更成功,因此对于N个作者,N×N神经网络被训练用于配对分类。这些N×N专家然后组成N个特别小组(can)来汇总这些N×N专家的决策。作者的数量对身份验证的准确性没有太大影响,对于任何数量的作者,准确率都可以达到80%左右。
{"title":"Authorship Authentication of Short Messages from Social Networks Using Recurrent Artificial Neural Networks","authors":"N. M. Demir","doi":"10.21533/SCJOURNAL.V7I2.163","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.163","url":null,"abstract":"Dataset consists of 17000 tweets collected from Twitter, as 500 tweets for each of 34 authors that meet certain criteria. Raw data is collected by using the software Nvivo. The collected raw data is preprocessed to extract frequencies of 200 features. In the data analysis 128 of features are eliminated since they are rare in tweets. As a progressive presentation, five – ten – fifteen – twenty - thirty and thirty four of these 34 authors are selected each time. Since recurrent artificial neural networks are more stable and iterations converge more quickly, in this work this architecture is preferred. In general, ANNs are more successful in distinguishing two classes, therefore for N authors, N×N neural networks are trained for pair wise classification. These N×N experts then organized as N special teams (CANNT) to aggregate decisions of these N×N experts. Number of authors is seen not so effective on the accuracy of the authentication, and around 80% accuracy is achieved for any number of authors.","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134340486","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Longest Common Subsequences in Bacteria Taxonomic Classification 细菌分类中的最长公共子序列
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.166
M. Can, Osman Gursoy
In 1980s, Carl Woese made a ground breaking contribution to microbiology using rRNA-genes for phylogenetic classifications. He used it not only to explore microbial diversity but also as a method for bacterial annotation. Today, rRNA-based analysis remains a central method in microbiology. Many researchers followed this track, using several new generations of Artificial Neural Networks obtained high accuracies using available datasets of their time. By the time, the number of bacteria increased enormously. In this article we used Longest Common Subsequence similarity measure to classify bacterial 16S rRNA gene sequences of 1.820.414 bacteria in SILVA, 3.196.038 bacteria in RDP, and 198.509 bacteria in Greengenes. The last two taxonomy have six taxonomical levels, phylum, class, order, family, genus, and species, while SILVA has two more levels subclass and suborder, but lacks species level. The majority of classifications (98%) were of high accuracy (98%).
20世纪80年代,卡尔·沃斯(Carl Woese)利用rrna基因进行系统发育分类,对微生物学做出了开创性的贡献。他不仅用它来探索微生物的多样性,而且作为细菌注释的一种方法。今天,基于rrna的分析仍然是微生物学的核心方法。许多研究人员沿着这条轨道,使用了几代新一代的人工神经网络,利用当时可用的数据集获得了很高的精度。到那时,细菌的数量急剧增加。本文采用最长公共子序列相似性度量对SILVA的1.820.414株细菌、RDP的3.196.038株细菌和Greengenes的198.509株细菌的16S rRNA基因序列进行了分类。后两种分类法有门、纲、目、科、属和种6个级别,而SILVA有亚纲和亚目两个级别,但缺乏种级别。大多数分类(98%)准确率高(98%)。
{"title":"Longest Common Subsequences in Bacteria Taxonomic Classification","authors":"M. Can, Osman Gursoy","doi":"10.21533/SCJOURNAL.V7I2.166","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.166","url":null,"abstract":"In 1980s, Carl Woese made a ground breaking contribution to microbiology using rRNA-genes for phylogenetic classifications. He used it not only to explore microbial diversity but also as a method for bacterial annotation. Today, rRNA-based analysis remains a central method in microbiology. Many researchers followed this track, using several new generations of Artificial Neural Networks obtained high accuracies using available datasets of their time. By the time, the number of bacteria increased enormously. In this article we used Longest Common Subsequence similarity measure to classify bacterial 16S rRNA gene sequences of 1.820.414 bacteria in SILVA, 3.196.038 bacteria in RDP, and 198.509 bacteria in Greengenes. The last two taxonomy have six taxonomical levels, phylum, class, order, family, genus, and species, while SILVA has two more levels subclass and suborder, but lacks species level. The majority of classifications (98%) were of high accuracy (98%).","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"117 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124142091","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Validation Tools for Predicted Linear B-Epitopes: Beta Turns 预测线性b表位的验证工具:β匝数
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.162
A. Abidi
{"title":"Validation Tools for Predicted Linear B-Epitopes: Beta Turns","authors":"A. Abidi","doi":"10.21533/SCJOURNAL.V7I2.162","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.162","url":null,"abstract":"","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114562020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Assessment of Accuracies of Protein 3-Dimensional Prediction Software 蛋白质三维预测软件的准确性评价
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.161
R. Gosto
Protein 3-dimensional structure prediction is determination of the 3-dimensional structure of a protein from its amino acid sequence by using protein structure prediction software. By understanding protein’s 3-dimensional structure, we should be able to figure out the function of the said protein. We already have several protein prediction software, but the purpose of this study is to determine how accurate they are, and if the results presented are true and to what extent. To determine how accurate protein 3-dimensional structure prediction software are, we compared x-ray crystallography determined protein structures to software predicted 3-dimensonal protein structures. All of the software used showed good accuracy, and according to our results, “i-Tasser” software was the most accurate, closely followed by RaptorX.
蛋白质三维结构预测是利用蛋白质结构预测软件从蛋白质的氨基酸序列中确定蛋白质的三维结构。通过了解蛋白质的三维结构,我们应该能够找出所述蛋白质的功能。我们已经有了几种蛋白质预测软件,但本研究的目的是确定它们的准确性,以及所呈现的结果是否真实以及在何种程度上真实。为了确定蛋白质三维结构预测软件的准确性,我们将x射线晶体学确定的蛋白质结构与软件预测的三维蛋白质结构进行了比较。所有使用的软件都显示出良好的准确性,根据我们的结果,“i-Tasser”软件是最准确的,其次是RaptorX。
{"title":"Assessment of Accuracies of Protein 3-Dimensional Prediction Software","authors":"R. Gosto","doi":"10.21533/SCJOURNAL.V7I2.161","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.161","url":null,"abstract":"Protein 3-dimensional structure prediction is determination of the 3-dimensional structure of a protein from its amino acid sequence by using protein structure prediction software. By understanding protein’s 3-dimensional structure, we should be able to figure out the function of the said protein. We already have several protein prediction software, but the purpose of this study is to determine how accurate they are, and if the results presented are true and to what extent. To determine how accurate protein 3-dimensional structure prediction software are, we compared x-ray crystallography determined protein structures to software predicted 3-dimensonal protein structures. All of the software used showed good accuracy, and according to our results, “i-Tasser” software was the most accurate, closely followed by RaptorX.","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"65 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132789796","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Computational Geometry Applications 计算几何应用
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.159
A. Selimi, M. Saračević
Computational geometry is an integral part of mathematics and computer science deals with the algorithmic solution of geometry problems. From the beginning to today, computer geometry links different areas of science and techniques, such as the theory of algorithms, combinatorial and Euclidean geometry, but including data structures and optimization. Today, computational geometry has a great deal of application in computer graphics, geometric modeling, computer vision, and geodesic path, motion planning and parallel computing. The complex calculations and theories in the field of geometry are long time studied and developed, but from the aspect of application in modern information technologies they still are in the beginning. In this research is given the applications of computational geometry in polygon triangulation, manufacturing of objects with molds, point location, and robot motion planning.
计算几何是数学的一个组成部分,计算机科学处理几何问题的算法解。从一开始到今天,计算机几何连接了不同的科学和技术领域,如算法理论、组合几何和欧几里得几何,但包括数据结构和优化。如今,计算几何在计算机图形学、几何建模、计算机视觉、测地线路径、运动规划和并行计算等领域有着广泛的应用。几何领域的复杂计算和理论研究和发展已经很长时间了,但从在现代信息技术中的应用来看,它们还处于起步阶段。本文研究了计算几何在多边形三角剖分、模具制造、点定位和机器人运动规划等方面的应用。
{"title":"Computational Geometry Applications","authors":"A. Selimi, M. Saračević","doi":"10.21533/SCJOURNAL.V7I2.159","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.159","url":null,"abstract":"Computational geometry is an integral part of mathematics and computer science deals with the algorithmic solution of geometry problems. From the beginning to today, computer geometry links different areas of science and techniques, such as the theory of algorithms, combinatorial and Euclidean geometry, but including data structures and optimization. Today, computational geometry has a great deal of application in computer graphics, geometric modeling, computer vision, and geodesic path, motion planning and parallel computing. The complex calculations and theories in the field of geometry are long time studied and developed, but from the aspect of application in modern information technologies they still are in the beginning. In this research is given the applications of computational geometry in polygon triangulation, manufacturing of objects with molds, point location, and robot motion planning.","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"76 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133864138","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
A Recurrent Neural Network Linear B-Epitope Predictor: BIRUNI 递归神经网络线性b表位预测器:BIRUNI
Pub Date : 2018-11-28 DOI: 10.21533/SCJOURNAL.V7I2.165
A. Abidi
Experimental methods used for characterizing epitopes that play a vital role in the development of peptide vaccines, in diagnosis of diseases, and also for allergy research are time consuming and need huge resources. There are many online epitope prediction tools are available that can help scientists in short listing the candidate peptides. To predict B-cell epitopes in an antigenic sequence, Jordan recurrent neural network (BIRUNI) is found to besuccessful. To train and test neural networks, 262.583 B epitopes are retrieved from IEDB database. 99.9% of these epitopes have lengths in the interval 6-25 amino acids. For each of these lengths, committees of 11 expert recurrent neural networks are trained. To train these experts alongside epitopes, non-epitopes are needed. Non-epitopes are created as random sequences of amino acids of the same length followed by a filtering process. To distinguish epitopes and non-epitopes, the votes of eleven experts are aggregated by majority vote. An overall accuracy of 97.23% is achieved. Then these experts are used to predict the Linear Bepitopes of five antigens, Plasmodium Falciparum, Human Polio Virus Sabin Strain, Meningitis, Plasmodium Vivax and Mycobacterium Tuberculosis. The success of BIRUNU is compared with the five online prediction tools ABCPRED, BCPRED, K&T, BEPIPRED, and AAP.It is seen that BIRUNI outperforms all of them in the average.
抗原表位在多肽疫苗的开发、疾病的诊断和过敏研究中起着至关重要的作用,用于表征抗原表位的实验方法耗时且需要大量资源。有许多在线表位预测工具可用,可以帮助科学家在候选肽短列表。为了预测抗原序列中的b细胞表位,Jordan递归神经网络(BIRUNI)被发现是成功的。为了训练和测试神经网络,从IEDB数据库中检索了262.583个B表位。99.9%的表位长度在6-25个氨基酸之间。对于每一个长度,由11个专家组成的循环神经网络委员会都要接受训练。为了训练这些专家,除了表位之外,还需要非表位。非表位是由相同长度的氨基酸随机序列通过过滤过程产生的。为了区分表位和非表位,11位专家的投票以多数投票的方式汇总。总体准确率达到97.23%。然后利用这些专家来预测恶性疟原虫、人类脊髓灰质炎病毒沙宾株、脑膜炎、间日疟原虫和结核分枝杆菌这五种抗原的线性倍人猿。BIRUNU的成功与5种在线预测工具ABCPRED、BCPRED、K&T、BEPIPRED和AAP进行了比较。可以看出,BIRUNI的平均表现优于所有这些。
{"title":"A Recurrent Neural Network Linear B-Epitope Predictor: BIRUNI","authors":"A. Abidi","doi":"10.21533/SCJOURNAL.V7I2.165","DOIUrl":"https://doi.org/10.21533/SCJOURNAL.V7I2.165","url":null,"abstract":"Experimental methods used for characterizing epitopes that play a vital role in the development of peptide vaccines, in diagnosis of diseases, and also for allergy research are time consuming and need huge resources. There are many online epitope prediction tools are available that can help scientists in short listing the candidate peptides. To predict B-cell epitopes in an antigenic sequence, Jordan recurrent neural network (BIRUNI) is found to besuccessful. To train and test neural networks, 262.583 B epitopes are retrieved from IEDB database. 99.9% of these epitopes have lengths in the interval 6-25 amino acids. For each of these lengths, committees of 11 expert recurrent neural networks are trained. To train these experts alongside epitopes, non-epitopes are needed. Non-epitopes are created as random sequences of amino acids of the same length followed by a filtering process. To distinguish epitopes and non-epitopes, the votes of eleven experts are aggregated by majority vote. An overall accuracy of 97.23% is achieved. Then these experts are used to predict the Linear Bepitopes of five antigens, Plasmodium Falciparum, Human Polio Virus Sabin Strain, Meningitis, Plasmodium Vivax and Mycobacterium Tuberculosis. The success of BIRUNU is compared with the five online prediction tools ABCPRED, BCPRED, K&T, BEPIPRED, and AAP.It is seen that BIRUNI outperforms all of them in the average.","PeriodicalId":243185,"journal":{"name":"Southeast Europe Journal of Soft Computing","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2018-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126882441","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
期刊
Southeast Europe Journal of Soft Computing
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1