Bruna Zamith Santos, G. Pereira, F. Nakano, R. Cerri
{"title":"Strategies for Selection of Positive and Negative Instances in the Hierarchical Classification of Transposable Elements","authors":"Bruna Zamith Santos, G. Pereira, F. Nakano, R. Cerri","doi":"10.1109/BRACIS.2018.00079","DOIUrl":null,"url":null,"abstract":"Transposable Elements (TEs) are DNA sequences capable of changing the gene's activity through transposition within the cells of a host. Once TEs insert themselves in other genes, they can change or reduce the activity of certain proteins, which in some cases could unfeasible the survival of such organisms or even provide genetic variability. A variety of methods has been proposed for the identification and classification of TEs, but most of them still involve a lot of manual work or are too class-specific, which restricts its applicability. Besides, the classes involved in such problems are often hierarchically structured, which is ignored by most of these methods. In this scenario, one problem that still needs further investigation is the use of strategies for selecting positive and negative instances during the induction of hierarchical models. Therefore, in this paper we explore four distinct strategies for selecting training instances, making use of several Machine Learning classifiers with different biases which were applied to the Hierarchical Classification of TEs using a local approach. Thus, we recommend the best strategy based on the results experimentally obtained.","PeriodicalId":405190,"journal":{"name":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 7th Brazilian Conference on Intelligent Systems (BRACIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BRACIS.2018.00079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Transposable Elements (TEs) are DNA sequences capable of changing the gene's activity through transposition within the cells of a host. Once TEs insert themselves in other genes, they can change or reduce the activity of certain proteins, which in some cases could unfeasible the survival of such organisms or even provide genetic variability. A variety of methods has been proposed for the identification and classification of TEs, but most of them still involve a lot of manual work or are too class-specific, which restricts its applicability. Besides, the classes involved in such problems are often hierarchically structured, which is ignored by most of these methods. In this scenario, one problem that still needs further investigation is the use of strategies for selecting positive and negative instances during the induction of hierarchical models. Therefore, in this paper we explore four distinct strategies for selecting training instances, making use of several Machine Learning classifiers with different biases which were applied to the Hierarchical Classification of TEs using a local approach. Thus, we recommend the best strategy based on the results experimentally obtained.