Compression Performance Analysis of Different File Formats

Han Yang, Guangjun Qin, Yongqing Hu
{"title":"Compression Performance Analysis of Different File Formats","authors":"Han Yang, Guangjun Qin, Yongqing Hu","doi":"arxiv-2308.12275","DOIUrl":null,"url":null,"abstract":"In data storage and transmission, file compression is a common technique for\nreducing the volume of data, reducing data storage space and transmission time\nand bandwidth. However, there are significant differences in the compression\nperformance of different types of file formats, and the benefits vary. In this\npaper, 22 file formats with approximately 178GB of data were collected and the\nZlib algorithm was used for compression experiments to compare performance in\norder to investigate the compression gains of different file types. The\nexperimental results show that some file types are poorly compressed, with\nalmost constant file size and long compression time, resulting in lower gains;\nsome other file types are significantly reduced in file size and compression\ntime after compression, which can effectively reduce the data volume. Based on\nthe above experimental results, this paper will then selectively reduce the\ndata volume by compression in data storage and transmission for the file types\nin order to obtain the maximum compression yield.","PeriodicalId":501310,"journal":{"name":"arXiv - CS - Other Computer Science","volume":"9 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Other Computer Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2308.12275","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In data storage and transmission, file compression is a common technique for reducing the volume of data, reducing data storage space and transmission time and bandwidth. However, there are significant differences in the compression performance of different types of file formats, and the benefits vary. In this paper, 22 file formats with approximately 178GB of data were collected and the Zlib algorithm was used for compression experiments to compare performance in order to investigate the compression gains of different file types. The experimental results show that some file types are poorly compressed, with almost constant file size and long compression time, resulting in lower gains; some other file types are significantly reduced in file size and compression time after compression, which can effectively reduce the data volume. Based on the above experimental results, this paper will then selectively reduce the data volume by compression in data storage and transmission for the file types in order to obtain the maximum compression yield.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
不同文件格式的压缩性能分析
在数据存储和传输中,文件压缩是减少数据量、减少数据存储空间、减少传输时间和带宽的常用技术。但是,不同类型的文件格式在压缩性能上存在显著差异,其好处也各不相同。本文收集了22种文件格式约178GB的数据,并使用zlib算法进行压缩实验,比较性能,以研究不同文件类型的压缩收益。实验结果表明,有些文件类型压缩效果较差,文件大小几乎不变,压缩时间长,导致增益较低;有些文件类型压缩后文件大小和压缩时间明显减小,可以有效地减少数据量。在上述实验结果的基础上,本文将针对文件类型在数据存储和传输中选择性地压缩数据量,以获得最大的压缩产量。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Artificial Intelligence-based Smart Port Logistics Metaverse for Enhancing Productivity, Environment, and Safety in Port Logistics: A Case Study of Busan Port Evaluating the Usability of Qualified Electronic Signatures: Systematized Use Cases and Design Paradigms A Brief Discussion on the Philosophical Principles and Development Directions of Data Circulation Predicting Star Scientists in the Field of Artificial Intelligence: A Machine Learning Approach A Match Made in Semantics: Physics-infused Digital Twins for Smart Building Automation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1