Offline handwritten Gurmukhi character recognition: study of different feature-classifier combinations

DAR '12 Pub Date : 2012-12-16 DOI:10.1145/2432553.2432571
Munish Kumar, R. Sharma, M. Jindal
{"title":"Offline handwritten Gurmukhi character recognition: study of different feature-classifier combinations","authors":"Munish Kumar, R. Sharma, M. Jindal","doi":"10.1145/2432553.2432571","DOIUrl":null,"url":null,"abstract":"Offline handwritten character recognition (OHCR) is the method of converting handwritten text into machine processable layout. Since late sixties, efforts have been made for offline handwritten character recognition throughout the world. Principal Component Analysis (PCA) has also been used for extracting representative features for character recognition. In order to assess the prominence of features in offline handwritten Gurmukhi character recognition, we have recognized offline handwritten Gurmukhi characters with different combinations of features and classifiers. The recognition system first sets up a skeleton of the character so that significant feature information about the character can be extracted. For the purpose of classification, we have used k-NN, Linear-SVM, Polynomial-SVM and RBF-SVM based approaches. In present work, we have collected 7,000 samples of isolated offline handwritten Gurmukhi characters from 200 different writers. The set of basic 35 akhars of Gurmukhi has been considered here. A partitioning policy for selecting the training and testing patterns has also been experimented in present work. We have used zoning feature; diagonal feature; directional feature; intersection and open end points feature; transition feature; parabola curve fitting based feature and power curve fitting based feature extraction technique in order to find the feature set for a given character. The proposed system achieves a recognition accuracy of 94.8% when PCA is not applied and a recognition accuracy of 97.7% when PCA is applied.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"36","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DAR '12","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2432553.2432571","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 36

Abstract

Offline handwritten character recognition (OHCR) is the method of converting handwritten text into machine processable layout. Since late sixties, efforts have been made for offline handwritten character recognition throughout the world. Principal Component Analysis (PCA) has also been used for extracting representative features for character recognition. In order to assess the prominence of features in offline handwritten Gurmukhi character recognition, we have recognized offline handwritten Gurmukhi characters with different combinations of features and classifiers. The recognition system first sets up a skeleton of the character so that significant feature information about the character can be extracted. For the purpose of classification, we have used k-NN, Linear-SVM, Polynomial-SVM and RBF-SVM based approaches. In present work, we have collected 7,000 samples of isolated offline handwritten Gurmukhi characters from 200 different writers. The set of basic 35 akhars of Gurmukhi has been considered here. A partitioning policy for selecting the training and testing patterns has also been experimented in present work. We have used zoning feature; diagonal feature; directional feature; intersection and open end points feature; transition feature; parabola curve fitting based feature and power curve fitting based feature extraction technique in order to find the feature set for a given character. The proposed system achieves a recognition accuracy of 94.8% when PCA is not applied and a recognition accuracy of 97.7% when PCA is applied.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
离线手写Gurmukhi字符识别:不同特征分类器组合的研究
离线手写字符识别(OHCR)是将手写文本转换为机器可处理的布局的方法。自60年代末以来,世界各地都在努力进行离线手写字符识别。主成分分析(PCA)也被用于提取字符识别的代表性特征。为了评估离线手写廓尔穆克文字识别中特征的突出程度,我们采用不同的特征和分类器组合对离线手写廓尔穆克文字进行了识别。识别系统首先建立字符的骨架,以便提取字符的重要特征信息。为了分类的目的,我们使用了k-NN、线性支持向量机、多项式支持向量机和基于RBF-SVM的方法。在目前的工作中,我们从200个不同的作者那里收集了7000个孤立的离线手写古尔穆克汉字样本。这里已经考虑了古尔木基语的基本35阿哈。本文还尝试了一种用于选择训练和测试模式的分区策略。我们使用了分区功能;对角特征;定向功能;交叉点和开端点特征;过渡特性;基于抛物线曲线拟合的特征和基于幂曲线拟合的特征提取技术,以找到给定字符的特征集。该系统在不使用主成分分析时的识别准确率为94.8%,使用主成分分析时的识别准确率为97.7%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Model based table cell detection and content extraction from degraded document images Assamese online handwritten digit recognition system using hidden Markov models On performance analysis of end-to-end OCR systems of Indic scripts A data acquisition and analysis system for palm leaf documents in Telugu Bangla date field extraction in offline handwritten documents
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1