A Proposed Hybrid OCR System for Arabic and Indian Numerical Postal Codes
Y. Alginahi, A. A. Siddiqi
2009 International Conference on Computer Technology and Development, 2009-11-13
DOI: 10.1109/ICCTD.2009.162 (https://doi.org/10.1109/ICCTD.2009.162)
Citation count: 4
Abstract
Arabic text recognition poses unique technical challenges and has been addressed in document analysis research more recently than text recognition for other languages. Automatic Optical Character Recognition (OCR) systems for Arabic/Indian numerals are used by postal services in many countries, but these systems still make errors when reading the crucial information needed to distribute mail efficiently. Fast and efficient recognition methods are therefore needed to read postal codes from mail envelopes correctly. This study aims to recognize essential information, such as postal codes on mail envelopes, by applying OCR methods. The proposed system is a hybrid of three different feature extraction methods and classification methods. It systematically compares the performance of each method and ensures that every numeral is either recognized or rejected. The results show a recognition rate of 99.4%.
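
The abstract does not state how the three classifiers' outputs are combined or when a numeral is rejected. As an illustration only, the recognize-or-reject step could be sketched as a simple agreement vote. The function names, the `min_agree` threshold, and the rule of rejecting a whole postal code when any digit is rejected are all assumptions made for this sketch, not the authors' method.

```python
from collections import Counter
from typing import Optional, Sequence


def combine_with_rejection(votes: Sequence[int], min_agree: int = 2) -> Optional[int]:
    """Return the digit that at least `min_agree` classifiers agree on.

    `votes` holds one predicted digit per classifier (hypothetical inputs).
    Returns None to signal rejection when agreement is too weak.
    """
    if not votes:
        return None
    digit, count = Counter(votes).most_common(1)[0]
    return digit if count >= min_agree else None


def read_postal_code(per_digit_votes: Sequence[Sequence[int]],
                     min_agree: int = 2) -> Optional[str]:
    """Combine votes digit by digit; reject the whole code if any digit is rejected."""
    digits = [combine_with_rejection(v, min_agree) for v in per_digit_votes]
    if any(d is None for d in digits):
        return None
    return "".join(str(d) for d in digits)
```

Raising `min_agree` to 3 would require all three classifiers to agree. That trades a higher rejection rate for fewer misread codes, which may suit mail sorting, where a rejected envelope can be routed to manual handling.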