一种基于核苷酸结构三联体组合的新型Pre-microRNA鉴定鲁棒特征选择方法

Petra Stepanowsky, Jihoon Kim, L. Ohno-Machado
{"title":"一种基于核苷酸结构三联体组合的新型Pre-microRNA鉴定鲁棒特征选择方法","authors":"Petra Stepanowsky, Jihoon Kim, L. Ohno-Machado","doi":"10.1109/HISB.2012.20","DOIUrl":null,"url":null,"abstract":"MicroRNAs are a class of small non-coding RNAs that play an important role in post-transcriptional regulation of gene products. Identification of novel microRNA is difficult because the validated microRNA set is still small in size and diverse. Existing feature selection methods use different combinations of features related to the biogenesis of microRNAs, but performance evaluations are not comprehensive. We developed a robust feature selection method using a combination of three types of nucleotide-structure triplets, the minimum free energy of the secondary structure of precursor microRNAs and other extracted characteristics. We compared our new combination feature set and three other previously published sets using three different classifiers: logistic regression, support vector machine, and random forest. Our proposed feature set was not only robust across all classifier methods, but also had the highest classification performance, as measured by the area under the ROC curve.","PeriodicalId":375089,"journal":{"name":"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"A Robust Feature Selection Method for Novel Pre-microRNA Identification Using a Combination of Nucleotide-Structure Triplets\",\"authors\":\"Petra Stepanowsky, Jihoon Kim, L. Ohno-Machado\",\"doi\":\"10.1109/HISB.2012.20\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"MicroRNAs are a class of small non-coding RNAs that play an important role in post-transcriptional regulation of gene products. Identification of novel microRNA is difficult because the validated microRNA set is still small in size and diverse. Existing feature selection methods use different combinations of features related to the biogenesis of microRNAs, but performance evaluations are not comprehensive. We developed a robust feature selection method using a combination of three types of nucleotide-structure triplets, the minimum free energy of the secondary structure of precursor microRNAs and other extracted characteristics. We compared our new combination feature set and three other previously published sets using three different classifiers: logistic regression, support vector machine, and random forest. Our proposed feature set was not only robust across all classifier methods, but also had the highest classification performance, as measured by the area under the ROC curve.\",\"PeriodicalId\":375089,\"journal\":{\"name\":\"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology\",\"volume\":\"12 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2012-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/HISB.2012.20\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE Second International Conference on Healthcare Informatics, Imaging and Systems Biology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/HISB.2012.20","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

MicroRNAs是一类小的非编码rna,在基因产物的转录后调控中起重要作用。鉴定新的microRNA是困难的,因为验证的microRNA集仍然小的尺寸和多样性。现有的特征选择方法使用与microrna生物发生相关的特征的不同组合,但性能评估并不全面。我们开发了一种强大的特征选择方法,使用三种类型的核苷酸结构三联体,前体microRNAs二级结构的最小自由能和其他提取特征的组合。我们使用三种不同的分类器:逻辑回归、支持向量机和随机森林,将我们的新组合特征集和其他三个先前发表的特征集进行了比较。我们提出的特征集不仅在所有分类器方法中都具有鲁棒性,而且通过ROC曲线下的面积来衡量,还具有最高的分类性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
A Robust Feature Selection Method for Novel Pre-microRNA Identification Using a Combination of Nucleotide-Structure Triplets
MicroRNAs are a class of small non-coding RNAs that play an important role in post-transcriptional regulation of gene products. Identification of novel microRNA is difficult because the validated microRNA set is still small in size and diverse. Existing feature selection methods use different combinations of features related to the biogenesis of microRNAs, but performance evaluations are not comprehensive. We developed a robust feature selection method using a combination of three types of nucleotide-structure triplets, the minimum free energy of the secondary structure of precursor microRNAs and other extracted characteristics. We compared our new combination feature set and three other previously published sets using three different classifiers: logistic regression, support vector machine, and random forest. Our proposed feature set was not only robust across all classifier methods, but also had the highest classification performance, as measured by the area under the ROC curve.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Enhancing Twitter Data Analysis with Simple Semantic Filtering: Example in Tracking Influenza-Like Illnesses Aggregated Indexing of Biomedical Time Series Data Temporal Analysis of Physicians' EHR Workflow during Outpatient Visits Does Domain Knowledge Matter for Assertion Annotation in Clinical Texts? A Randomized Response Model for Privacy-Preserving Data Dissemination
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1