A Fast Reference-Free Genome Compression Using Deep Neural Networks

Zeinab Nazemi Absardi, R. Javidan
{"title":"A Fast Reference-Free Genome Compression Using Deep Neural Networks","authors":"Zeinab Nazemi Absardi, R. Javidan","doi":"10.1109/BdKCSE48644.2019.9010661","DOIUrl":null,"url":null,"abstract":"Recent development of DNA sequencing technologies has led to a significant increase in genomic data volume. Such a big amount of genome data needs appropriate data storage, data management, and data transfer policies. Compressing genomes can be used for efficient data management. Auto-encoder is a kind of deep neural networks, due to its ability to reduce the dimension of data is suitable for this purpose. In this paper, a new method for genome compression with auto-encoders based on deep neural networks is proposed. It is the first time that an auto-encoder is used to compress the genomes. Experimental results showed that the proposed method can achieve a compression ratio of up to 5 and 92 percent compression accuracy in case of reference-free genome compression. Moreover, after the auto-encoder training stage, the trained network will have a very short compression time which makes it suitable for real-time applications.","PeriodicalId":206080,"journal":{"name":"2019 Big Data, Knowledge and Control Systems Engineering (BdKCSE)","volume":"109 6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 Big Data, Knowledge and Control Systems Engineering (BdKCSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BdKCSE48644.2019.9010661","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Recent development of DNA sequencing technologies has led to a significant increase in genomic data volume. Such a big amount of genome data needs appropriate data storage, data management, and data transfer policies. Compressing genomes can be used for efficient data management. Auto-encoder is a kind of deep neural networks, due to its ability to reduce the dimension of data is suitable for this purpose. In this paper, a new method for genome compression with auto-encoders based on deep neural networks is proposed. It is the first time that an auto-encoder is used to compress the genomes. Experimental results showed that the proposed method can achieve a compression ratio of up to 5 and 92 percent compression accuracy in case of reference-free genome compression. Moreover, after the auto-encoder training stage, the trained network will have a very short compression time which makes it suitable for real-time applications.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于深度神经网络的快速无参考基因组压缩
近年来DNA测序技术的发展导致了基因组数据量的显著增加。如此庞大的基因组数据需要合适的数据存储、数据管理和数据传输策略。压缩基因组可以用于有效的数据管理。自编码器是深度神经网络的一种,由于其对数据降维的能力适合于此目的。提出了一种基于深度神经网络的自编码器基因组压缩新方法。这是首次使用自编码器来压缩基因组。实验结果表明,在无参比的情况下,该方法可以实现高达5%的压缩比和92%的压缩精度。此外,经过自编码器训练阶段,训练后的网络压缩时间非常短,适合于实时应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Forecasting the Spot Price of P1A Shipping Route Building Lepida ScpA BigData Infrastructure Knowledge Based Modelling of Complex Interconnected Systems Using Big Data for Data Leak Prevention Analysis of the Functionalities of a Shared ICS Security Operations Center
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1