Strategies for Selection of Positive and Negative Instances in the Hierarchical Classification of Transposable Elements

Bruna Zamith Santos, G. Pereira, F. Nakano, R. Cerri
Published in: 2018 7th Brazilian Conference on Intelligent Systems (BRACIS)
Publication date: 2018-10-01
DOI: 10.1109/BRACIS.2018.00079 (https://doi.org/10.1109/BRACIS.2018.00079)
Citations: 6

Abstract

Transposable Elements (TEs) are DNA sequences capable of changing gene activity through transposition within the cells of a host. Once TEs insert themselves into other genes, they can change or reduce the activity of certain proteins, which in some cases can make the survival of the host organism unfeasible or even provide genetic variability. A variety of methods have been proposed for the identification and classification of TEs, but most of them still involve considerable manual work or are too class-specific, which restricts their applicability. Moreover, the classes involved in such problems are often hierarchically structured, a fact that most of these methods ignore. In this scenario, one problem that still needs further investigation is how to select positive and negative instances during the induction of hierarchical models. In this paper, we therefore explore four distinct strategies for selecting training instances, using several Machine Learning classifiers with different biases, applied to the Hierarchical Classification of TEs with a local approach. We recommend the best strategy based on the experimental results.
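To make the idea of selecting positive and negative instances for a local (per-node) classifier concrete, the following is a minimal sketch. The abstract does not name the four strategies evaluated, so the two policies shown here ("inclusive" and "siblings", as they are commonly described in the hierarchical classification literature) are an assumption and may not match the paper's. The toy hierarchy, the class labels, and the function names are all hypothetical.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy class hierarchy (child -> parent). Not the paper's TE taxonomy.
PARENT: Dict[str, str] = {
    "1": "root", "2": "root",
    "1.1": "1", "1.2": "1",
    "1.1.1": "1.1", "1.1.2": "1.1",
    "2.1": "2",
}

def ancestors(node: str) -> Set[str]:
    """All ancestors of `node`, excluding the artificial root."""
    out: Set[str] = set()
    parent = PARENT.get(node)
    while parent is not None and parent != "root":
        out.add(parent)
        parent = PARENT.get(parent)
    return out

def descendants(node: str) -> Set[str]:
    """All classes that have `node` somewhere above them."""
    return {n for n in PARENT if node in ancestors(n)}

def siblings(node: str) -> Set[str]:
    """Classes sharing the same parent as `node`."""
    return {n for n, p in PARENT.items() if p == PARENT[node] and n != node}

def select(instances: List[Tuple[str, str]], node: str,
           policy: str) -> Tuple[List[str], List[str]]:
    """Split instance ids into positives and negatives for the binary
    classifier trained at `node`. Each instance is (id, most specific class)."""
    # Both policies: positives are the node's class and all of its descendants.
    pos_classes = {node} | descendants(node)
    if policy == "inclusive":
        # Negatives: every class except the positives and the node's ancestors.
        neg_classes = set(PARENT) - pos_classes - ancestors(node)
    elif policy == "siblings":
        # Negatives: the node's siblings and their descendants only.
        neg_classes = set()
        for sib in siblings(node):
            neg_classes |= {sib} | descendants(sib)
    else:
        raise ValueError(f"unknown policy: {policy}")
    pos = [i for i, c in instances if c in pos_classes]
    neg = [i for i, c in instances if c in neg_classes]
    return pos, neg

# Hypothetical labelled instances: (sequence id, most specific class).
DATA = [("a", "1.1.1"), ("b", "1.1.2"), ("c", "1.2"),
        ("d", "2.1"), ("e", "1"), ("f", "1.1")]
```

With this data, the classifier for node "1.1" receives the same positives under both policies, while the "siblings" policy yields a smaller, more local negative set than the "inclusive" one. That difference in training-set size and composition is the kind of trade-off such strategies control.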