Gururaj Mukarambi, Satishkumar Mallapa, B. V. Dhandra
{"title":"Script identification from camera based Tri-Lingual document","authors":"Gururaj Mukarambi, Satishkumar Mallapa, B. V. Dhandra","doi":"10.1109/SSPS.2017.8071593","DOIUrl":null,"url":null,"abstract":"In this paper, an algorithm is proposed for Trilingual Script Identification System in block wise for camera captured images. The Local Binary Pattern (LBP) features are used for Kannada, Hindi and English images for testing the performance of a proposed algorithm, a dataset of 6000 neat block images are considered. For each script a total of 2000 images are used for the proposed method. The segmentation technique is used to segment the document image in blocks. Block of sizes 128×128, 256×256, 512×512 and 1024×1024 for Kannada, Hindi and English have been considered. The LBP features are extracted in 8 neighbors, there by generating 59 features and submitted to KNN and SVM classifiers to classify the underlying image. The identification accuracy for KNN and SVM classifiers are respectively 96.60% and 98.00% for block size 128×128, 98.71% and 98.07% for block size 256×256, 99.70% and 98.00% for block size 512×512 and further 94.90% and 99.01% for block size 1024×1024 respectively. The optimal accuracy is 99.01% for SVM classifier for block size 1024×1024. The proposed method is independent of thinning.","PeriodicalId":382353,"journal":{"name":"2017 Third International Conference on Sensing, Signal Processing and Security (ICSSS)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Third International Conference on Sensing, Signal Processing and Security (ICSSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SSPS.2017.8071593","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, an algorithm is proposed for Trilingual Script Identification System in block wise for camera captured images. The Local Binary Pattern (LBP) features are used for Kannada, Hindi and English images for testing the performance of a proposed algorithm, a dataset of 6000 neat block images are considered. For each script a total of 2000 images are used for the proposed method. The segmentation technique is used to segment the document image in blocks. Block of sizes 128×128, 256×256, 512×512 and 1024×1024 for Kannada, Hindi and English have been considered. The LBP features are extracted in 8 neighbors, there by generating 59 features and submitted to KNN and SVM classifiers to classify the underlying image. The identification accuracy for KNN and SVM classifiers are respectively 96.60% and 98.00% for block size 128×128, 98.71% and 98.07% for block size 256×256, 99.70% and 98.00% for block size 512×512 and further 94.90% and 99.01% for block size 1024×1024 respectively. The optimal accuracy is 99.01% for SVM classifier for block size 1024×1024. The proposed method is independent of thinning.