J. Vargas, J. A. Velasco, G. Alvarez, Diego Linares, E. Bravo
{"title":"Automatic segmentation of Potyviridae family polyproteins","authors":"J. Vargas, J. A. Velasco, G. Alvarez, Diego Linares, E. Bravo","doi":"10.1504/IJBRA.2015.073238","DOIUrl":null,"url":null,"abstract":"We describe an automatic segmentation method for polyproteins of the viruses belonging to the Potyviridae family. It uses machine learning techniques in order to predict the cleavage site which define the segments in which said polyproteins are cut in their process of functional maturation. The segmentation application is publicly available for use on a website and it can be accessed through the web service interface too. The prediction models have an average sensitivity of 0.79 and a Matthews correlation coefficient average of 0.23. This method is capable of predicting correctly (coinciding with previously published segmentation) the segmentation of sequences which come from Potyvirus and Rymovirus, genera. However accurate prediction capabilities are affected when faced with either atypical sequences or viruses belonging to less common genera in the Potyviridae family. Future work will focus on establishing greater flexibility in this sense.","PeriodicalId":35444,"journal":{"name":"International Journal of Bioinformatics Research and Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2015-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1504/IJBRA.2015.073238","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Bioinformatics Research and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJBRA.2015.073238","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Health Professions","Score":null,"Total":0}
引用次数: 0
Abstract
We describe an automatic segmentation method for polyproteins of the viruses belonging to the Potyviridae family. It uses machine learning techniques in order to predict the cleavage site which define the segments in which said polyproteins are cut in their process of functional maturation. The segmentation application is publicly available for use on a website and it can be accessed through the web service interface too. The prediction models have an average sensitivity of 0.79 and a Matthews correlation coefficient average of 0.23. This method is capable of predicting correctly (coinciding with previously published segmentation) the segmentation of sequences which come from Potyvirus and Rymovirus, genera. However accurate prediction capabilities are affected when faced with either atypical sequences or viruses belonging to less common genera in the Potyviridae family. Future work will focus on establishing greater flexibility in this sense.
期刊介绍:
Bioinformatics is an interdisciplinary research field that combines biology, computer science, mathematics and statistics into a broad-based field that will have profound impacts on all fields of biology. The emphasis of IJBRA is on basic bioinformatics research methods, tool development, performance evaluation and their applications in biology. IJBRA addresses the most innovative developments, research issues and solutions in bioinformatics and computational biology and their applications. Topics covered include Databases, bio-grid, system biology Biomedical image processing, modelling and simulation Bio-ontology and data mining, DNA assembly, clustering, mapping Computational genomics/proteomics Silico technology: computational intelligence, high performance computing E-health, telemedicine Gene expression, microarrays, identification, annotation Genetic algorithms, fuzzy logic, neural networks, data visualisation Hidden Markov models, machine learning, support vector machines Molecular evolution, phylogeny, modelling, simulation, sequence analysis Parallel algorithms/architectures, computational structural biology Phylogeny reconstruction algorithms, physiome, protein structure prediction Sequence assembly, search, alignment Signalling/computational biomedical data engineering Simulated annealing, statistical analysis, stochastic grammars.