Hadoop based clustering system for genome sequencing

Anju Ramesh Ekre, R. Mante
{"title":"Hadoop based clustering system for genome sequencing","authors":"Anju Ramesh Ekre, R. Mante","doi":"10.1109/ICONSTEM.2016.7560916","DOIUrl":null,"url":null,"abstract":"Genomics is an interdisciplinary branch of science that is bringing vital changes in the field of medicine and agriculture. It is believed that the scientific and technological advancements in 21st century will be related to the processing, manipulation and analysis of the vast information that is generated from genome sequencing of living organisms. A scientific and big data research domain includes the problem of genome sequencing. Genome sequence is also called as read sequence. Next-Generation sequencing is playing a crucial role in the development and advancements of read alignment algorithms. Computer scientists, mathematician and physicists are together helping for this research of alignment. However, increase in the data size and faster data access requirement for the scientists and researchers are increasing which is leading advancements in genome alignment towards acceleration approach. This paper includes a MapReduce acceleration scheme for faster sequence alignment. It works on multiple commodity hardware. With the use of MapReduce programming along with the clustering algorithm for distribution of genome data on multiple nodes may reduce the time, also it can lead towards accuracy in genome sequencing.","PeriodicalId":256750,"journal":{"name":"2016 Second International Conference on Science Technology Engineering and Management (ICONSTEM)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Second International Conference on Science Technology Engineering and Management (ICONSTEM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICONSTEM.2016.7560916","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

Abstract

Genomics is an interdisciplinary branch of science that is bringing vital changes in the field of medicine and agriculture. It is believed that the scientific and technological advancements in 21st century will be related to the processing, manipulation and analysis of the vast information that is generated from genome sequencing of living organisms. A scientific and big data research domain includes the problem of genome sequencing. Genome sequence is also called as read sequence. Next-Generation sequencing is playing a crucial role in the development and advancements of read alignment algorithms. Computer scientists, mathematician and physicists are together helping for this research of alignment. However, increase in the data size and faster data access requirement for the scientists and researchers are increasing which is leading advancements in genome alignment towards acceleration approach. This paper includes a MapReduce acceleration scheme for faster sequence alignment. It works on multiple commodity hardware. With the use of MapReduce programming along with the clustering algorithm for distribution of genome data on multiple nodes may reduce the time, also it can lead towards accuracy in genome sequencing.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于Hadoop的基因组测序集群系统
基因组学是一门跨学科的科学分支,正在给医学和农业领域带来重大变化。人们认为,21世纪的科技进步将与生物基因组测序产生的大量信息的处理、操纵和分析有关。科学和大数据研究领域包括基因组测序问题。基因组序列又称读序列。下一代测序在读取比对算法的发展和进步中起着至关重要的作用。计算机科学家、数学家和物理学家正在共同帮助这项对准研究。然而,数据量的增加和对科学家和研究人员更快的数据访问需求正在增加,这导致了基因组比对朝着加速方法的发展。本文包括一个MapReduce加速方案,用于更快的序列对齐。它适用于多种商用硬件。利用MapReduce编程和聚类算法将基因组数据分布在多个节点上,可以减少时间,也可以提高基因组测序的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
A novel approach for detecting facial image spoofing using local ternary pattern A comparative analysis on multilevel inverter with and without harmonics injection method Modeling, analysis and control of INOSLC (Improved Negative Output Super-Lift Luo Converter) using PI controller Hadoop based clustering system for genome sequencing Fake biometric detection using image quality assessment: Application to iris, fingerprint recognition
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1