Document Image Analysis in Compressed Domain-Limitations, Applications & Challenges
Kavita V. Horadi
2020 4th International Conference on Electronics, Communication and Aerospace Technology (ICECA)
Published: 2020-11-05
DOI: 10.1109/ICECA49313.2020.9297593 (https://doi.org/10.1109/ICECA49313.2020.9297593)
Citation count: 0
Abstract
Document image analysis plays a vital role in the digital era. Recent developments in the IT industry have led to the growth of digital data in fields such as medicine, government, education, banking, social media, and digital libraries. Advances in technology have paved the way for converting traditional offices into paperless ones. The growth of digital libraries, e-governance, and internet-based applications has further increased the volume of digital data, which mainly includes text, graphs, images, audio, and video as components of a document image. The result is complex document images that are archived and transmitted on a regular basis. This paper proposes an idea for processing a document image in its compressed form, focusing on how content matching and structural analysis can be performed directly on the compressed representation. This gives insight into the importance of processing document images in the compressed domain. Because data is growing exponentially, it is stored in compressed form. Further research is therefore needed on dealing directly with the compressed representation of document images as a remedy for the ever-increasing challenges of big data. This paper also discusses the various applications of document images and the challenges researchers face in addressing them. An overview of the state-of-the-art datasets available in the document image analysis literature is also provided.