V. Manikandan, V. Venkatachalam, M. Kirthiga, K. Harini, N. Devarajan
{"title":"An enhanced algorithm for Character Segmentation in document image processing","authors":"V. Manikandan, V. Venkatachalam, M. Kirthiga, K. Harini, N. Devarajan","doi":"10.1109/ICCIC.2010.5705728","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition consists of various steps like skew detection, segmentation of columns, lines, words, and characters before feeding the isolated character to an optical character recognition system. Several methodologies are followed to perform these steps using conventional Hough Transformation. In this paper, a new algorithm is proposed to perform all those steps involved in document image processing. The algorithm is implemented for skew detection, column and line segmentation and Character Segmentation. This can be extended to all other steps like character recognition. The novelty of this approach lies in “the consideration of any image, as one formed by several black and white lines of various lengths and at various angles”. The pixel values of the binary image are stored in an array. All the pixel values in the array are compared with their horizontally adjacent pixel values, row by row, for the presence of collinear points (i.e., a line). It is done by detecting the continuity of either the white or black pixels accordingly. Once the continuity is detected, the starting and end coordinates are displayed as an intermediate result. A new image will be generated as a result, which indicates the pixel area of line, identified from the input image. The algorithm is applied for English and other regional languages.","PeriodicalId":246468,"journal":{"name":"2010 IEEE International Conference on Computational Intelligence and Computing Research","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Computational Intelligence and Computing Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIC.2010.5705728","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 9
Abstract
Optical Character Recognition consists of various steps like skew detection, segmentation of columns, lines, words, and characters before feeding the isolated character to an optical character recognition system. Several methodologies are followed to perform these steps using conventional Hough Transformation. In this paper, a new algorithm is proposed to perform all those steps involved in document image processing. The algorithm is implemented for skew detection, column and line segmentation and Character Segmentation. This can be extended to all other steps like character recognition. The novelty of this approach lies in “the consideration of any image, as one formed by several black and white lines of various lengths and at various angles”. The pixel values of the binary image are stored in an array. All the pixel values in the array are compared with their horizontally adjacent pixel values, row by row, for the presence of collinear points (i.e., a line). It is done by detecting the continuity of either the white or black pixels accordingly. Once the continuity is detected, the starting and end coordinates are displayed as an intermediate result. A new image will be generated as a result, which indicates the pixel area of line, identified from the input image. The algorithm is applied for English and other regional languages.