{"title":"A novel approach for Kannada text extraction","authors":"S. Seeri, S. Giraddi, B. Prashant","doi":"10.1109/ICPRIME.2012.6208387","DOIUrl":null,"url":null,"abstract":"Popularity of the digital cameras is increasing rapidly day by day because of advanced applications and availability of digital cameras. The detection and extraction of text regions in an image is a well known problem in the computer vision. Text in images contains useful semantic information which can be used to fully understand the images. Proposed method aims at detecting and extracting Kannada text from government organization signboard images acquired by digital camera. Segmentation is performed using edge detection method and heuristic features are used to remove the non text regions. Kannada text identification is performed using the structural feature boundary length of the object strokes. Rule based method is employed to validate the objects as Kannada text. The proposed method is effective, efficient and encouraging results are obtained. It has the precision rate of 84.21%, recall rate of 83.16% and Kannada text identification accuracy of 75.77%. Hence proposed method is robust with font size, small orientation and alignment of text.","PeriodicalId":148511,"journal":{"name":"International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICPRIME.2012.6208387","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
Abstract
Popularity of the digital cameras is increasing rapidly day by day because of advanced applications and availability of digital cameras. The detection and extraction of text regions in an image is a well known problem in the computer vision. Text in images contains useful semantic information which can be used to fully understand the images. Proposed method aims at detecting and extracting Kannada text from government organization signboard images acquired by digital camera. Segmentation is performed using edge detection method and heuristic features are used to remove the non text regions. Kannada text identification is performed using the structural feature boundary length of the object strokes. Rule based method is employed to validate the objects as Kannada text. The proposed method is effective, efficient and encouraging results are obtained. It has the precision rate of 84.21%, recall rate of 83.16% and Kannada text identification accuracy of 75.77%. Hence proposed method is robust with font size, small orientation and alignment of text.