Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method.

IF 2.2 4区 工程技术 Q3 PHARMACOLOGY & PHARMACY Bioimpacts Pub Date : 2021-01-01 Epub Date: 2020-04-17 DOI:10.34172/bi.2021.17
Faegheh Golabi, Elnaz Mehdizadeh Aghdam, Mousa Shamsi, Mohammad Hossein Sedaaghi, Abolfazl Barzegar, Mohammad Saeid Hejazi
{"title":"Classification of seed members of five riboswitch families as short sequences based on the features extracted by Block Location-Based Feature Extraction (BLBFE) method.","authors":"Faegheh Golabi,&nbsp;Elnaz Mehdizadeh Aghdam,&nbsp;Mousa Shamsi,&nbsp;Mohammad Hossein Sedaaghi,&nbsp;Abolfazl Barzegar,&nbsp;Mohammad Saeid Hejazi","doi":"10.34172/bi.2021.17","DOIUrl":null,"url":null,"abstract":"<p><p><i><b>Introduction:</b></i> Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes' mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolutionary contribution, the need for bioinformatics tools of riboswitch detection is increasing. We have previously introduced an alignment independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of block location-based feature extraction strategy (BLBFE), which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. Besides, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods as some feature extraction strategies were investigated. <i><b>Methods:</b></i> The classifiers of the Decision tree, KNN, LDA, and Naïve Bayes, as well as k-fold cross-validation, were employed for all methods of feature extraction to compare their performances based on the criteria of accuracy, sensitivity, specificity, and f-score performance measures. <i><b>Results:</b></i> The outcome of the study showed that the BLBFE strategy classified the riboswitches indicating 87.65% average correct classification rate (CCR). Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. <i><b>Conclusion:</b></i> Our result approved the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups showing remarkable higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.</p>","PeriodicalId":48614,"journal":{"name":"Bioimpacts","volume":"11 2","pages":"101-109"},"PeriodicalIF":2.2000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/08/5b/bi-11-101.PMC8022236.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioimpacts","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.34172/bi.2021.17","RegionNum":4,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2020/4/17 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Riboswitches are short regulatory elements generally found in the untranslated regions of prokaryotes' mRNAs and classified into several families. Due to the binding possibility between riboswitches and antibiotics, their usage as engineered regulatory elements and also their evolutionary contribution, the need for bioinformatics tools of riboswitch detection is increasing. We have previously introduced an alignment independent algorithm for the identification of frequent sequential blocks in the families of riboswitches. Herein, we report the application of block location-based feature extraction strategy (BLBFE), which uses the locations of detected blocks on riboswitch sequences as features for classification of seed sequences. Besides, mono- and dinucleotide frequencies, k-mer, DAC, DCC, DACC, PC-PseDNC-General and SC-PseDNC-General methods as some feature extraction strategies were investigated. Methods: The classifiers of the Decision tree, KNN, LDA, and Naïve Bayes, as well as k-fold cross-validation, were employed for all methods of feature extraction to compare their performances based on the criteria of accuracy, sensitivity, specificity, and f-score performance measures. Results: The outcome of the study showed that the BLBFE strategy classified the riboswitches indicating 87.65% average correct classification rate (CCR). Moreover, the performance of the proposed feature extraction method was confirmed with average values of 94.31%, 85.01%, 95.45% and 85.38% for accuracy, sensitivity, specificity, and f-score, respectively. Conclusion: Our result approved the performance of the BLBFE strategy in the classification and discrimination of the riboswitch groups showing remarkable higher values of CCR, accuracy, sensitivity, specificity and f-score relative to previously studied feature extraction methods.

Abstract Image

Abstract Image

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于块定位特征提取(BLBFE)方法对5个核开关家族种子成员进行短序列分类。
核糖开关是一种短调控元件,通常存在于原核生物mrna的非翻译区,分为几个科。由于核糖开关与抗生素之间的结合可能性,它们作为工程调控元件的用途以及它们在进化中的贡献,对核糖开关检测的生物信息学工具的需求正在增加。我们之前已经介绍了一种不依赖于比对的算法,用于识别核糖开关家族中频繁的序列块。在此,我们报告了基于块位置的特征提取策略(BLBFE)的应用,该策略使用检测到的块在核糖开关序列上的位置作为种子序列分类的特征。此外,还研究了单核苷酸频率和二核苷酸频率、k-mer、DAC、DCC、DACC、PC-PseDNC-General和SC-PseDNC-General等特征提取策略。方法:采用决策树、KNN、LDA和Naïve贝叶斯分类器以及k-fold交叉验证对所有特征提取方法进行分类,以准确性、灵敏度、特异性和f-score性能指标为标准,比较它们的性能。结果:研究结果表明,BLBFE策略对核蛋白开关的平均正确分类率(CCR)为87.65%。此外,所提出的特征提取方法的准确性、灵敏度、特异性和f-score的平均值分别为94.31%、85.01%、95.45%和85.38%。结论:我们的研究结果证实了BLBFE策略在核糖体开关组的分类和区分方面的性能,与先前研究的特征提取方法相比,其CCR、准确性、灵敏度、特异性和f-score值都有显著提高。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Bioimpacts
Bioimpacts Pharmacology, Toxicology and Pharmaceutics-Pharmaceutical Science
CiteScore
4.80
自引率
7.70%
发文量
36
审稿时长
5 weeks
期刊介绍: BioImpacts (BI) is a peer-reviewed multidisciplinary international journal, covering original research articles, reviews, commentaries, hypotheses, methodologies, and visions/reflections dealing with all aspects of biological and biomedical researches at molecular, cellular, functional and translational dimensions.
期刊最新文献
A novel imidazo[1,2-a]pyridine derivative and its co-administration with curcumin exert anti-inflammatory effects by modulating the STAT3/NF-κB/iNOS/COX-2 signaling pathway in breast and ovarian cancer cell lines. Regulation of cell fate by cell imprinting approach in vitro. Cancer treatment comes to age: from one-size-fits-all to next-generation sequencing (NGS) technologies. Design and implementation of a lab-on-a-chip for assisted reproductive technologies. Integrating rapamycin with novel PI3K/Akt/mTOR inhibitor microRNAs on NOTCH1-driven T-cell acute lymphoblastic leukemia (T-ALL).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1