Document Image Analysis in Compressed Domain-Limitations, Applications & Challenges

Kavita V. Horadi
{"title":"Document Image Analysis in Compressed Domain-Limitations, Applications & Challenges","authors":"Kavita V. Horadi","doi":"10.1109/ICECA49313.2020.9297593","DOIUrl":null,"url":null,"abstract":"Document image analysis plays a vital role in this digital era. Recent developments in the IT industry have led to the growth of digital data in various fields like medical, government offices, education sector, banks, social media, digital library, and so on. Advancement in the recent technologies has paved their way to convert the traditional offices into paperless offices. Also, the growth of digital libraries, e-governance, and internet based applications has led to the increase in the volume of digital data, which mainly include texts, graphs, images, audio and video as various components in the document image by resulting in the development of complex document images, which are used for archival and transmission on regular basis. This paper proposes an idea for processing the document image in its compressed version by particularly focusing on how content matching and structural analysis can be performed in the compressed representation of document image. This gives an insight on the importance of processing document images in its compressed domain. Due to the exponential growth of data, the data is stored in compressed form. There is an actual need for investigating further research from the perspective of dealing directly with the compressed representation of document images as a remedy to the ever-increasing big data-related challenges. This paper also discusses the various applications of document images and opens up the challenges faced by the researchers in addressing these applications. An overview of the state of the art datasets available in the literature in the area of document image analysis is also addressed","PeriodicalId":297285,"journal":{"name":"2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA)","volume":"90 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECA49313.2020.9297593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Document image analysis plays a vital role in this digital era. Recent developments in the IT industry have led to the growth of digital data in various fields like medical, government offices, education sector, banks, social media, digital library, and so on. Advancement in the recent technologies has paved their way to convert the traditional offices into paperless offices. Also, the growth of digital libraries, e-governance, and internet based applications has led to the increase in the volume of digital data, which mainly include texts, graphs, images, audio and video as various components in the document image by resulting in the development of complex document images, which are used for archival and transmission on regular basis. This paper proposes an idea for processing the document image in its compressed version by particularly focusing on how content matching and structural analysis can be performed in the compressed representation of document image. This gives an insight on the importance of processing document images in its compressed domain. Due to the exponential growth of data, the data is stored in compressed form. There is an actual need for investigating further research from the perspective of dealing directly with the compressed representation of document images as a remedy to the ever-increasing big data-related challenges. This paper also discusses the various applications of document images and opens up the challenges faced by the researchers in addressing these applications. An overview of the state of the art datasets available in the literature in the area of document image analysis is also addressed
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
压缩域文档图像分析——限制、应用与挑战
文档图像分析在数字时代起着至关重要的作用。IT行业的最新发展导致了医疗、政府办公室、教育部门、银行、社交媒体、数字图书馆等各个领域的数字数据的增长。最近技术的进步为传统办公室向无纸化办公室的转变铺平了道路。此外,数字图书馆、电子政务和基于互联网的应用的增长导致了数字数据量的增加,主要包括文本、图形、图像、音频和视频作为文件图像的各个组成部分,从而导致了复杂文件图像的发展,这些文件图像用于定期存档和传输。本文提出了一种处理压缩版文档图像的思路,特别关注如何在文档图像的压缩表示中进行内容匹配和结构分析。这说明了在压缩域中处理文档图像的重要性。由于数据呈指数级增长,数据以压缩形式存储。实际需要从直接处理文档图像压缩表示的角度进一步调查研究,以解决日益增加的大数据相关挑战。本文还讨论了文档图像的各种应用,并揭示了研究人员在解决这些应用时所面临的挑战。在文献图像分析领域的文献中提供的最新数据集的概述也被处理
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Analysis of Prosodic features for the degree of emotions of an Assamese Emotional Speech MCU system based on IEC61508 for Autonomous Functional safety platform Comparative analysis of facial recognition models using video for real time attendance monitoring system Analysis of using IoT Sensors in Healthcare units Supported by Cloud Computing Human Friendly Smart Trolley with Automatic Billing System
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1