{"title":"A skew detection and correction technique for Arabic script text-line based on subwords bounding","authors":"Atallah Al-Shatnawi","doi":"10.1109/ICCIC.2014.7238501","DOIUrl":null,"url":null,"abstract":"Text-line skew detection and correction is the first step in Arabic document recognition and analysis. It is a crucial pre-processing stage of Arabic Character Recognition (ACR). It has a direct effect on the dependability and efficiency of other system stages such as baseline detection, segmentation and feature extraction stages. In this paper an efficient skew detection and correction method for Arabic handwritten text-line based on sub-words bounding is presented. It is constructed from three stages including: pre-processing, skew detection and skew correction stages. The proposed method estimates a text-line baseline based on calculating the middle point for its sub-words bounding. Then align the text-line components on the estimated baseline. The proposed method is implemented on 3960 text-line handwritten images, which were written by 40 writers. It is discussed with the horizontal projection method in terms of effectiveness. The proposed method obtained an accuracy ratio of 96.15%, and takes 6.7 seconds as average time. It can also automatically detect text baselines of documents with any orientation.","PeriodicalId":187874,"journal":{"name":"2014 IEEE International Conference on Computational Intelligence and Computing Research","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE International Conference on Computational Intelligence and Computing Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIC.2014.7238501","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
Text-line skew detection and correction is the first step in Arabic document recognition and analysis. It is a crucial pre-processing stage of Arabic Character Recognition (ACR). It has a direct effect on the dependability and efficiency of other system stages such as baseline detection, segmentation and feature extraction stages. In this paper an efficient skew detection and correction method for Arabic handwritten text-line based on sub-words bounding is presented. It is constructed from three stages including: pre-processing, skew detection and skew correction stages. The proposed method estimates a text-line baseline based on calculating the middle point for its sub-words bounding. Then align the text-line components on the estimated baseline. The proposed method is implemented on 3960 text-line handwritten images, which were written by 40 writers. It is discussed with the horizontal projection method in terms of effectiveness. The proposed method obtained an accuracy ratio of 96.15%, and takes 6.7 seconds as average time. It can also automatically detect text baselines of documents with any orientation.