An enhanced algorithm for Character Segmentation in document image processing

2010 IEEE International Conference on Computational Intelligence and Computing Research Pub Date : 2010-12-01 DOI:10.1109/ICCIC.2010.5705728

V. Manikandan, V. Venkatachalam, M. Kirthiga, K. Harini, N. Devarajan

{"title":"An enhanced algorithm for Character Segmentation in document image processing","authors":"V. Manikandan, V. Venkatachalam, M. Kirthiga, K. Harini, N. Devarajan","doi":"10.1109/ICCIC.2010.5705728","DOIUrl":null,"url":null,"abstract":"Optical Character Recognition consists of various steps like skew detection, segmentation of columns, lines, words, and characters before feeding the isolated character to an optical character recognition system. Several methodologies are followed to perform these steps using conventional Hough Transformation. In this paper, a new algorithm is proposed to perform all those steps involved in document image processing. The algorithm is implemented for skew detection, column and line segmentation and Character Segmentation. This can be extended to all other steps like character recognition. The novelty of this approach lies in “the consideration of any image, as one formed by several black and white lines of various lengths and at various angles”. The pixel values of the binary image are stored in an array. All the pixel values in the array are compared with their horizontally adjacent pixel values, row by row, for the presence of collinear points (i.e., a line). It is done by detecting the continuity of either the white or black pixels accordingly. Once the continuity is detected, the starting and end coordinates are displayed as an intermediate result. A new image will be generated as a result, which indicates the pixel area of line, identified from the input image. The algorithm is applied for English and other regional languages.","PeriodicalId":246468,"journal":{"name":"2010 IEEE International Conference on Computational Intelligence and Computing Research","volume":"54 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"9","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2010 IEEE International Conference on Computational Intelligence and Computing Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIC.2010.5705728","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 9

Abstract

Optical Character Recognition consists of various steps like skew detection, segmentation of columns, lines, words, and characters before feeding the isolated character to an optical character recognition system. Several methodologies are followed to perform these steps using conventional Hough Transformation. In this paper, a new algorithm is proposed to perform all those steps involved in document image processing. The algorithm is implemented for skew detection, column and line segmentation and Character Segmentation. This can be extended to all other steps like character recognition. The novelty of this approach lies in “the consideration of any image, as one formed by several black and white lines of various lengths and at various angles”. The pixel values of the binary image are stored in an array. All the pixel values in the array are compared with their horizontally adjacent pixel values, row by row, for the presence of collinear points (i.e., a line). It is done by detecting the continuity of either the white or black pixels accordingly. Once the continuity is detected, the starting and end coordinates are displayed as an intermediate result. A new image will be generated as a result, which indicates the pixel area of line, identified from the input image. The algorithm is applied for English and other regional languages.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

文档图像处理中字符分割的改进算法

光学字符识别包括各种步骤，如倾斜检测，分割列、行、词和字符，然后将孤立的字符输入光学字符识别系统。要使用传统的霍夫变换执行这些步骤，需要遵循几种方法。本文提出了一种新的算法来完成文档图像处理中涉及的所有步骤。该算法实现了倾斜检测、列线分割和字符分割。这可以扩展到所有其他步骤，如字符识别。这种方法的新颖之处在于“将任何图像视为由几条不同长度和不同角度的黑白线组成的图像”。二值图像的像素值存储在一个数组中。数组中的所有像素值与其水平相邻的像素值逐行比较，以确定是否存在共线点(即一条线)。它是通过检测相应的白色或黑色像素的连续性来完成的。一旦检测到连续性，开始和结束坐标将作为中间结果显示。结果将生成一个新的图像，该图像表示从输入图像中识别出的线的像素区域。该算法适用于英语和其他区域语言。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2010 IEEE International Conference on Computational Intelligence and Computing Research

自引率

0.00%

发文量

期刊最新文献

Network security and management Impact of black hole attack on multicast in ad hoc network (IBAMA) Evolutionary computing techniques for model order formulation of linear time invariant discrete systems RC5 encryption using key derived from fingerprint image Malayalam word sense disambiguation