Alan Koshy, N. Mj, Prof. Shyna A, Prof. Ansamma John
{"title":"文本图像中高质量文本提取的预处理技术","authors":"Alan Koshy, N. Mj, Prof. Shyna A, Prof. Ansamma John","doi":"10.1109/ICIICT1.2019.8741488","DOIUrl":null,"url":null,"abstract":"In this age of digitization, there is a growing need to preserve physical copies of documents such as historical text. It is important in digitization to capture every aspect of the document which is infeasible due to challenges such as fading, creases, and shadows. Various approaches have been put forth to improve upon text extraction by means of preprocessing. This paper analyses the effect of applying some general preprocessing methods such as Thresholding, Morphology, and Blurring and enhancements of quality in the output obtained. Experimental results show that preprocessing improves the visual and structural quality of the document to a certain extent.","PeriodicalId":118897,"journal":{"name":"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)","volume":"7 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Preprocessing Techniques for High Quality Text Extraction from Text Images\",\"authors\":\"Alan Koshy, N. Mj, Prof. Shyna A, Prof. Ansamma John\",\"doi\":\"10.1109/ICIICT1.2019.8741488\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this age of digitization, there is a growing need to preserve physical copies of documents such as historical text. It is important in digitization to capture every aspect of the document which is infeasible due to challenges such as fading, creases, and shadows. Various approaches have been put forth to improve upon text extraction by means of preprocessing. This paper analyses the effect of applying some general preprocessing methods such as Thresholding, Morphology, and Blurring and enhancements of quality in the output obtained. Experimental results show that preprocessing improves the visual and structural quality of the document to a certain extent.\",\"PeriodicalId\":118897,\"journal\":{\"name\":\"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)\",\"volume\":\"7 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICIICT1.2019.8741488\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIICT1.2019.8741488","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Preprocessing Techniques for High Quality Text Extraction from Text Images
In this age of digitization, there is a growing need to preserve physical copies of documents such as historical text. It is important in digitization to capture every aspect of the document which is infeasible due to challenges such as fading, creases, and shadows. Various approaches have been put forth to improve upon text extraction by means of preprocessing. This paper analyses the effect of applying some general preprocessing methods such as Thresholding, Morphology, and Blurring and enhancements of quality in the output obtained. Experimental results show that preprocessing improves the visual and structural quality of the document to a certain extent.