Design and Analysis of Stochastic Query Optimizer for Biobank Databases

Manik Sharma, Gurvinder Singh, Rajinder Singh, Jasbir Singh
{"title":"Design and Analysis of Stochastic Query Optimizer for Biobank Databases","authors":"Manik Sharma, Gurvinder Singh, Rajinder Singh, Jasbir Singh","doi":"10.1109/ICCSA.2015.17","DOIUrl":null,"url":null,"abstract":"The enduring revolution in the field of life sciences is producing biological data at phenomenal rate. The large volume of biological data is meaningful only when it is accessed and analyzed by different researchers. A flood of biological databases is creating a problem in finding an efficient query execution plan for the complex query as posed by bio-researchers. In this research paper, an effort has been made to effectively process the biological data collected in biobanks. The major objective of this research work is to find an optimal query execution plan for biobank database queries using the proposed Restricted Exhaustive Enumeration Approach (REA) and Entropy based Restricted Genetic Approach (ERGA). The result of different query optimization approaches viz. Exhaustive Enumeration, Restricted Exhaustive Enumeration, Simple Genetic Approach and Entropy Based Restricted Genetic Approach are compared with each other on the basis of usage of system resources. The results of Entropy Based Restricted Genetic Algorithm in finding query execution plan for Burbank queries are better than EA and SGAby 2-20% and 7-15% respectively. Furthermore, experimental results reveal that use of Inter-site parallel environment further optimized the results of ERGA by 0-4%.","PeriodicalId":197153,"journal":{"name":"2015 15th International Conference on Computational Science and Its Applications","volume":"29 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-06-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 15th International Conference on Computational Science and Its Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCSA.2015.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

The enduring revolution in the field of life sciences is producing biological data at phenomenal rate. The large volume of biological data is meaningful only when it is accessed and analyzed by different researchers. A flood of biological databases is creating a problem in finding an efficient query execution plan for the complex query as posed by bio-researchers. In this research paper, an effort has been made to effectively process the biological data collected in biobanks. The major objective of this research work is to find an optimal query execution plan for biobank database queries using the proposed Restricted Exhaustive Enumeration Approach (REA) and Entropy based Restricted Genetic Approach (ERGA). The result of different query optimization approaches viz. Exhaustive Enumeration, Restricted Exhaustive Enumeration, Simple Genetic Approach and Entropy Based Restricted Genetic Approach are compared with each other on the basis of usage of system resources. The results of Entropy Based Restricted Genetic Algorithm in finding query execution plan for Burbank queries are better than EA and SGAby 2-20% and 7-15% respectively. Furthermore, experimental results reveal that use of Inter-site parallel environment further optimized the results of ERGA by 0-4%.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
生物样本数据库随机查询优化器的设计与分析
生命科学领域的持久革命正在以惊人的速度产生生物数据。大量的生物数据只有在被不同的研究人员访问和分析时才有意义。生物数据库的泛滥给生物研究人员提出的复杂查询寻找有效的查询执行计划带来了难题。在本研究中,对生物库中收集的生物数据进行了有效的处理。本研究的主要目的是利用所提出的受限穷举枚举法(REA)和基于熵的受限遗传法(ERGA)为生物样本库数据库查询找到最优的查询执行计划。基于系统资源的使用情况,比较了穷尽枚举、受限穷尽枚举、简单遗传和基于熵的受限遗传四种查询优化方法的优化结果。基于熵的受限遗传算法在查找Burbank查询执行计划方面分别优于EA和SGAby 2-20%和7-15%。此外,实验结果表明,使用站点间并行环境可使ERGA结果优化0-4%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Image Segmentation Using Teaching-Learning-Based Optimization Algorithm and Fuzzy Entropy A Classification-Based Algorithm for Building 3D Maps of Environmental Objects A Coupling Simulation on Multigroup Radiation Diffusion and Heat Conduction Models PPGIS Case Studies Comparison and Future Questioning Human Smarties: The Human Communities of the Future
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1