首页 > 最新文献

DAR '12最新文献

英文 中文
Bangla date field extraction in offline handwritten documents 脱机手写文档中的孟加拉语日期字段提取
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432561
Ranju Mandal, P. Roy, U. Pal
Date is a useful information for various application (e.g. date wise document indexing) and automatic extraction of date information involves difficult challenges due to writing styles of different individuals, touching characters and confusion among identification of numerals, punctuation and texts. In this paper, we present a framework for indexing/retrieval of Bangla date patterns from handwritten documents. The method first classifies word components of each text line into month and non-month class using word level feature. Next, non-month words are segmented into individual components and classified into one of text, digit or punctuation. Using this information of word and character level components, the date patterns are searched. First using voting approach and then using regular expression we detect the candidate lines for numeric and semi-numeric date. Dynamic Time Warping (DTW) matching of profile based features is used for classification of month/non-month words. Numerals and punctuations are classified using gradient based feature and SVM classifier. The experiment is performed on Bangla handwritten dataset and the results demonstrate the effectiveness of the proposed system.
日期是一种有用的信息,适用于各种应用(例如日期智能文档索引),由于不同人的写作风格、触摸字符以及数字、标点和文本识别的混淆,日期信息的自动提取面临着困难的挑战。在本文中,我们提出了一个从手写文档中索引/检索孟加拉语日期模式的框架。该方法首先利用词级特征将每个文本行的词成分分为月类和非月类。其次,将非月份词分割成单独的组成部分,分为文本、数字或标点。使用单词和字符级组件的这些信息,搜索日期模式。首先使用投票方法,然后使用正则表达式检测数字和半数字日期的候选行。基于特征的动态时间翘曲(DTW)匹配用于月份/非月份词的分类。采用梯度特征和SVM分类器对数字和标点符号进行分类。在孟加拉语手写数据集上进行了实验,结果证明了该系统的有效性。
{"title":"Bangla date field extraction in offline handwritten documents","authors":"Ranju Mandal, P. Roy, U. Pal","doi":"10.1145/2432553.2432561","DOIUrl":"https://doi.org/10.1145/2432553.2432561","url":null,"abstract":"Date is a useful information for various application (e.g. date wise document indexing) and automatic extraction of date information involves difficult challenges due to writing styles of different individuals, touching characters and confusion among identification of numerals, punctuation and texts. In this paper, we present a framework for indexing/retrieval of Bangla date patterns from handwritten documents. The method first classifies word components of each text line into month and non-month class using word level feature. Next, non-month words are segmented into individual components and classified into one of text, digit or punctuation. Using this information of word and character level components, the date patterns are searched. First using voting approach and then using regular expression we detect the candidate lines for numeric and semi-numeric date. Dynamic Time Warping (DTW) matching of profile based features is used for classification of month/non-month words. Numerals and punctuations are classified using gradient based feature and SVM classifier. The experiment is performed on Bangla handwritten dataset and the results demonstrate the effectiveness of the proposed system.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115860443","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
A data acquisition and analysis system for palm leaf documents in Telugu 泰卢固语棕榈叶文献数据采集与分析系统
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432578
P. N. Sastry, R. Krishnan
This paper briefly reviews the progress in the field of hand written character recognition (HWCR) applied to the Indian languages with a special emphasis on the palm leaf character recognition (PLCR) techniques. The various methodologies and techniques for character recognition (CR) have been discussed in the paper. HWCR applied to historical documents like Palm leaves and old hand written manuscripts is much more challenging due to the limited progress in this area. These documents containing texts and treaties on a host of subjects are of both national and historical importance. Characters on the palm leaf have the additional properties like depth, an added feature which can be gainfully exploited during Palm Leaf Character Recognition (PLCR). The unique method of data collection initiated with isolated Telugu characters from palm leaf manuscripts, and the building of the palm leaf character database is described in this paper. A comparative analysis of the results for PLCR obtained by various techniques are also presented.
本文简要回顾了印度语手写字符识别(HWCR)技术的研究进展,重点介绍了棕榈叶字符识别技术。本文讨论了字符识别(CR)的各种方法和技术。由于该领域的进展有限,将HWCR应用于棕榈叶和旧手抄本等历史文件更具挑战性。这些文件载有关于许多主题的案文和条约,具有国家和历史重要性。棕榈叶上的字符具有额外的属性,如深度,这是在棕榈叶字符识别(PLCR)中可以有效利用的附加功能。本文介绍了利用棕榈叶手稿中孤立的泰卢固语文字进行数据采集的独特方法,以及棕榈叶文字数据库的建立。本文还比较分析了各种技术对PLCR的测量结果。
{"title":"A data acquisition and analysis system for palm leaf documents in Telugu","authors":"P. N. Sastry, R. Krishnan","doi":"10.1145/2432553.2432578","DOIUrl":"https://doi.org/10.1145/2432553.2432578","url":null,"abstract":"This paper briefly reviews the progress in the field of hand written character recognition (HWCR) applied to the Indian languages with a special emphasis on the palm leaf character recognition (PLCR) techniques. The various methodologies and techniques for character recognition (CR) have been discussed in the paper. HWCR applied to historical documents like Palm leaves and old hand written manuscripts is much more challenging due to the limited progress in this area. These documents containing texts and treaties on a host of subjects are of both national and historical importance. Characters on the palm leaf have the additional properties like depth, an added feature which can be gainfully exploited during Palm Leaf Character Recognition (PLCR). The unique method of data collection initiated with isolated Telugu characters from palm leaf manuscripts, and the building of the palm leaf character database is described in this paper. A comparative analysis of the results for PLCR obtained by various techniques are also presented.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"220 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115129845","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Benchmarking recognition results on camera captured word image data sets 对相机捕获的文字图像数据集的识别结果进行基准测试
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432572
D. Kumar, M. Prasad, A. Ramakrishnan
We have benchmarked the maximum obtainable recognition accuracy on five publicly available standard word image data sets using semi-automated segmentation and a commercial OCR. These images have been cropped from camera captured scene images, born digital images (BDI) and street view images. Using the Matlab based tool developed by us, we have annotated at the pixel level more than 3600 word images from the five data sets. The word images binarized by the tool, as well as by our own midline analysis and propagation of segmentation (MAPS) algorithm are recognized using the trial version of Nuance Omnipage OCR and these two results are compared with the best reported in the literature. The benchmark word recognition rates obtained on ICDAR 2003, Sign evaluation, Street view, Born-digital and ICDAR 2011 data sets are 83.9%, 89.3%, 79.6%, 88.5% and 86.7%, respectively. The results obtained from MAPS binarized word images without the use of any lexicon are 64.5% and 71.7% for ICDAR 2003 and 2011 respectively, and these values are higher than the best reported values in the literature of 61.1% and 41.2%, respectively. MAPS results of 82.8% for BDI 2011 dataset matches the performance of the state of the art method based on power law transform.
我们使用半自动分割和商用OCR对五个公开可用的标准单词图像数据集进行了最大识别精度的基准测试。这些图像是从相机拍摄的场景图像、原生数字图像(BDI)和街景图像中裁剪而成的。使用我们开发的基于Matlab的工具,我们在像素级标注了来自5个数据集的3600多张单词图像。使用Nuance Omnipage OCR试用版对该工具二值化的单词图像以及我们自己的中线分析和传播分割(MAPS)算法进行识别,并将这两种结果与文献中报道的最佳结果进行比较。在ICDAR 2003、Sign evaluation、Street view、Born-digital和ICDAR 2011数据集上获得的基准词识别率分别为83.9%、89.3%、79.6%、88.5%和86.7%。在ICDAR 2003和2011中,不使用任何词典的MAPS二值化词图像的结果分别为64.5%和71.7%,高于文献报道的最佳值61.1%和41.2%。BDI 2011数据集的MAPS结果有82.8%与基于幂律变换的最先进方法的性能相匹配。
{"title":"Benchmarking recognition results on camera captured word image data sets","authors":"D. Kumar, M. Prasad, A. Ramakrishnan","doi":"10.1145/2432553.2432572","DOIUrl":"https://doi.org/10.1145/2432553.2432572","url":null,"abstract":"We have benchmarked the maximum obtainable recognition accuracy on five publicly available standard word image data sets using semi-automated segmentation and a commercial OCR. These images have been cropped from camera captured scene images, born digital images (BDI) and street view images. Using the Matlab based tool developed by us, we have annotated at the pixel level more than 3600 word images from the five data sets. The word images binarized by the tool, as well as by our own midline analysis and propagation of segmentation (MAPS) algorithm are recognized using the trial version of Nuance Omnipage OCR and these two results are compared with the best reported in the literature. The benchmark word recognition rates obtained on ICDAR 2003, Sign evaluation, Street view, Born-digital and ICDAR 2011 data sets are 83.9%, 89.3%, 79.6%, 88.5% and 86.7%, respectively. The results obtained from MAPS binarized word images without the use of any lexicon are 64.5% and 71.7% for ICDAR 2003 and 2011 respectively, and these values are higher than the best reported values in the literature of 61.1% and 41.2%, respectively. MAPS results of 82.8% for BDI 2011 dataset matches the performance of the state of the art method based on power law transform.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116629334","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Assamese online handwritten digit recognition system using hidden Markov models 使用隐马尔可夫模型的阿萨姆在线手写数字识别系统
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432573
G. S. Reddy, Bandita Sarma, R. Naik, S. Prasanna, C. Mahanta
This work describes the development of Assamese online handwritten digit recognition system. Assamese numerals are the same as the Bangla numerals. A large database of handwritten numerals is collected and partitioned into two parts of equal size. The first part is used for developing the Hidden Markov Models (HMM) based digit models. The (x, y) coordinates and their first and second time derivatives are used as features. The second part of the database is tested against the models to evaluate the performance. The digit recognition system provides an average recognition performance of 96.02%. A large amount of confusion is observed among the numerals 5 & 6. The new distance feature is used as an additional feature and the models are retrained. The performance for numeral 5 & 6 increases from 91.60% & 95.40% to 95.30% & 94.90%. As a result, the confusion reduces significantly and the average recognition performance increases to 97.14%.
本工作描述了阿萨姆邦在线手写数字识别系统的开发。阿萨姆语的数字与孟加拉语的数字相同。收集了一个手写数字的大型数据库,并将其分为大小相等的两个部分。第一部分用于开发基于隐马尔可夫模型(HMM)的数字模型。(x, y)坐标及其一阶和二阶导数用作特征。数据库的第二部分针对模型进行测试,以评估性能。该数字识别系统的平均识别率为96.02%。在数字5和6之间观察到大量的混淆。新的距离特征被用作附加特征,模型被重新训练。数字5和6的性能从91.60%和95.40%提高到95.30%和94.90%。结果,混淆明显减少,平均识别性能提高到97.14%。
{"title":"Assamese online handwritten digit recognition system using hidden Markov models","authors":"G. S. Reddy, Bandita Sarma, R. Naik, S. Prasanna, C. Mahanta","doi":"10.1145/2432553.2432573","DOIUrl":"https://doi.org/10.1145/2432553.2432573","url":null,"abstract":"This work describes the development of Assamese online handwritten digit recognition system. Assamese numerals are the same as the Bangla numerals. A large database of handwritten numerals is collected and partitioned into two parts of equal size. The first part is used for developing the Hidden Markov Models (HMM) based digit models. The (x, y) coordinates and their first and second time derivatives are used as features. The second part of the database is tested against the models to evaluate the performance. The digit recognition system provides an average recognition performance of 96.02%. A large amount of confusion is observed among the numerals 5 & 6. The new distance feature is used as an additional feature and the models are retrained. The performance for numeral 5 & 6 increases from 91.60% & 95.40% to 95.30% & 94.90%. As a result, the confusion reduces significantly and the average recognition performance increases to 97.14%.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"47 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114421119","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Offline handwritten word recognition in Hindi 脱机手写词识别在印地语
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432563
R. Sitaram, Shrang Jain, Hariharan Ravishankar
This paper discusses the Hindi offline handwritten word recognizer (HWR) that we are developing. For the purpose of training and testing the offline HWR, we have created a Hindi handwritten word and character database from 100 writers. In our HWR we use two-pass Dynamic Programming algorithm to match the test word against each word in the lexicon by initially segmenting the test word image into probable characters. We extract directional element features (DEF) on each character image segment and statistically model them. Currently we are achieving word recognition accuracies of 91.23% to 79.94% on 10 to 30 vocabulary words.
本文讨论了我们正在开发的印地语离线手写词识别器(HWR)。为了培训和测试离线HWR,我们创建了一个来自100位作家的印地语手写单词和字符数据库。在我们的HWR中,我们使用两次动态规划算法,通过最初将测试词图像分割成可能的字符,将测试词与词典中的每个词进行匹配。我们提取每个字符图像片段上的方向元素特征(DEF),并对其进行统计建模。目前我们在10 ~ 30个词汇上的单词识别准确率达到了91.23% ~ 79.94%。
{"title":"Offline handwritten word recognition in Hindi","authors":"R. Sitaram, Shrang Jain, Hariharan Ravishankar","doi":"10.1145/2432553.2432563","DOIUrl":"https://doi.org/10.1145/2432553.2432563","url":null,"abstract":"This paper discusses the Hindi offline handwritten word recognizer (HWR) that we are developing. For the purpose of training and testing the offline HWR, we have created a Hindi handwritten word and character database from 100 writers. In our HWR we use two-pass Dynamic Programming algorithm to match the test word against each word in the lexicon by initially segmenting the test word image into probable characters. We extract directional element features (DEF) on each character image segment and statistically model them. Currently we are achieving word recognition accuracies of 91.23% to 79.94% on 10 to 30 vocabulary words.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124502440","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Development of an Assamese OCR using Bangla OCR 使用孟加拉语OCR开发阿萨姆语OCR
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432566
Subhankar Ghosh, P. Bora, Sanjib Das, B. Chaudhuri
This paper refers to the development of an OCR for the Assamese language by modifying an existing OCR for the Bangla language. This modification is feasible because the Assamese script is similar, except for a few characters, to the Bangla script. The OCR incorporates a two stage recognizer using SVM classifier with no post-processing. A spell-checker capable of detecting most errors and interactively recommending some corrections is implemented. The OCR is tested with about 1800 pages of good quality printed documents. The accuracy achieved is about 97%.
本文指的是通过修改现有的孟加拉语OCR,为阿萨姆语开发OCR。这种修改是可行的,因为阿萨姆文除了几个字符外,与孟加拉文很相似。OCR结合了一个两阶段识别器,使用支持向量机分类器,没有后处理。实现了一个能够检测大多数错误并交互式推荐一些更正的拼写检查器。OCR测试了大约1800页高质量的打印文件。准确率达到97%左右。
{"title":"Development of an Assamese OCR using Bangla OCR","authors":"Subhankar Ghosh, P. Bora, Sanjib Das, B. Chaudhuri","doi":"10.1145/2432553.2432566","DOIUrl":"https://doi.org/10.1145/2432553.2432566","url":null,"abstract":"This paper refers to the development of an OCR for the Assamese language by modifying an existing OCR for the Bangla language. This modification is feasible because the Assamese script is similar, except for a few characters, to the Bangla script. The OCR incorporates a two stage recognizer using SVM classifier with no post-processing. A spell-checker capable of detecting most errors and interactively recommending some corrections is implemented. The OCR is tested with about 1800 pages of good quality printed documents. The accuracy achieved is about 97%.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"96 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122517795","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Line segmentation of handwritten Gurmukhi manuscripts Gurmukhi手写体手稿的线段分割
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432568
S. Jindal, Gurpreet Singh Lehal
The development of an OCR system for recognition of old Gurmukhi handwritten manuscripts is a complex task involving many difficulties. Historical documents are affected by problems of ageing and repeated use and many other uncontrollable factors. Segmentation is one of the important phase of an OCR, as accuracy of an OCR depends upon the accuracy of segmentation. The writing styles of historical documents make the activity of segmentation extremely difficult. Segmentation includes line, word and character segmentation. In this paper, we have discussed a method for segmenting lines for Gurmukhi handwritten manuscripts.
开发用于古穆克语手写体识别的OCR系统是一项复杂的任务,涉及许多困难。历史文献受到老化、重复使用等诸多不可控因素的影响。分割是OCR的一个重要阶段,因为OCR的准确性取决于分割的准确性。历史文献的写作风格使得分割活动极其困难。切分包括行切分、词切分和字符切分。本文讨论了古穆克语手写体的线段分割方法。
{"title":"Line segmentation of handwritten Gurmukhi manuscripts","authors":"S. Jindal, Gurpreet Singh Lehal","doi":"10.1145/2432553.2432568","DOIUrl":"https://doi.org/10.1145/2432553.2432568","url":null,"abstract":"The development of an OCR system for recognition of old Gurmukhi handwritten manuscripts is a complex task involving many difficulties. Historical documents are affected by problems of ageing and repeated use and many other uncontrollable factors. Segmentation is one of the important phase of an OCR, as accuracy of an OCR depends upon the accuracy of segmentation. The writing styles of historical documents make the activity of segmentation extremely difficult. Segmentation includes line, word and character segmentation. In this paper, we have discussed a method for segmenting lines for Gurmukhi handwritten manuscripts.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121241451","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A syntactic PR approach to Telugu handwritten character recognition 泰卢固语手写字符识别的句法PR方法
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432579
Samita Pradhan, A. Negi
This paper shows a character recognition mechanism based on a syntactic PR approach that uses the trie data structure for efficient recognition. It uses approximate matching of the string for classification. During the preprocessing an input character image is transformed into a skeletonized image and discrete curves are found using a 3 x 3 pixel region. A trie, which we call as a sequence trie is used for a look up approach at a lower level to encode a discrete curve pattern of pixels. The sequence of such discrete curves from the input pattern is looked up in the sequence trie. The encoding of several such sequence numbers for the thinned character constructs a pattern string. Approximate string matching is used to compare the encoded pattern string from a template character with the pattern string obtained from the input character. We consider the approximate matching of the string instead of the exact matching to make the approach robust in the presence of noise. Another trie data structure (called pattern trie) is used for the efficient storage and retrieval for approximate matching of the string. We make use of the trie since it takes O(m) in worst case where m is the length of the longest string in the trie. For the approximate string matching we use look ahead with a branch and bound scheme in the trie. Here we apply our method on 43 Telugu characters from the basic Telugu characters for demonstration. The proposed approach has recognised all the test characters given here correctly, however more extensive testing on realistic data is required.
本文提出了一种基于句法PR方法的字符识别机制,该机制使用trie数据结构进行高效识别。它使用字符串的近似匹配进行分类。在预处理过程中,将输入字符图像转换为骨架图像,并使用3 x 3像素区域找到离散曲线。我们称之为序列三阶树的三阶树用于较低层次的查找方法,以编码像素的离散曲线模式。在序列树中查找来自输入模式的这些离散曲线的序列。对几个这样的字符序列号进行编码,就构成了一个模式字符串。近似字符串匹配用于比较来自模板字符的编码模式字符串与从输入字符获得的模式字符串。我们考虑字符串的近似匹配而不是精确匹配,以使该方法在存在噪声的情况下具有鲁棒性。另一种trie数据结构(称为模式trie)用于有效地存储和检索字符串的近似匹配。我们利用这个树,因为在最坏的情况下,它需要O(m),其中m是树中最长字符串的长度。对于近似的字符串匹配,我们在tree中使用分支定界模式的forward。在此,我们从泰卢固语的基本字符中选取43个泰卢固语字符来进行验证。所提出的方法已经正确地识别了这里给出的所有测试特征,但是需要对实际数据进行更广泛的测试。
{"title":"A syntactic PR approach to Telugu handwritten character recognition","authors":"Samita Pradhan, A. Negi","doi":"10.1145/2432553.2432579","DOIUrl":"https://doi.org/10.1145/2432553.2432579","url":null,"abstract":"This paper shows a character recognition mechanism based on a syntactic PR approach that uses the trie data structure for efficient recognition. It uses approximate matching of the string for classification. During the preprocessing an input character image is transformed into a skeletonized image and discrete curves are found using a 3 x 3 pixel region. A trie, which we call as a sequence trie is used for a look up approach at a lower level to encode a discrete curve pattern of pixels. The sequence of such discrete curves from the input pattern is looked up in the sequence trie. The encoding of several such sequence numbers for the thinned character constructs a pattern string. Approximate string matching is used to compare the encoded pattern string from a template character with the pattern string obtained from the input character. We consider the approximate matching of the string instead of the exact matching to make the approach robust in the presence of noise. Another trie data structure (called pattern trie) is used for the efficient storage and retrieval for approximate matching of the string. We make use of the trie since it takes O(m) in worst case where m is the length of the longest string in the trie. For the approximate string matching we use look ahead with a branch and bound scheme in the trie. Here we apply our method on 43 Telugu characters from the basic Telugu characters for demonstration. The proposed approach has recognised all the test characters given here correctly, however more extensive testing on realistic data is required.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129226446","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
An empirical intrinsic mode based characterization of Indian scripts 基于经验内在模式的印度文字表征
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432575
Kavita Bhardwaj, S. Chaudhury, Sumantra Dutta Roy
In this paper, we describe a novel technique for Document script identification(DSI) from printed documents, using Empirical Mode Decomposition (EMD). The intrinsic decomposition nature can adaptively decompose script images into a series of modes representing different local features of script images. In this method, Radon transformed script images are decomposed into finite set of IMFs (Intrinsic Mode Functions). The energy concentration in a particular orientation characterises a script texture as it indicates the dominance of individual script in that direction. We demonstrate how the proposed method use these IMFs as feature vectors to distinguish various scripts.
在本文中,我们描述了一种利用经验模式分解(EMD)从打印文档中识别文档脚本(DSI)的新技术。固有的分解特性可以自适应地将文字图像分解为一系列模式,这些模式代表着文字图像的不同局部特征。该方法将Radon变换后的脚本图像分解为有限的内禀模态函数集。能量集中在一个特定的方向表征一个脚本纹理,因为它表明在该方向上的单个脚本的优势。我们演示了所提出的方法如何使用这些imf作为特征向量来区分各种脚本。
{"title":"An empirical intrinsic mode based characterization of Indian scripts","authors":"Kavita Bhardwaj, S. Chaudhury, Sumantra Dutta Roy","doi":"10.1145/2432553.2432575","DOIUrl":"https://doi.org/10.1145/2432553.2432575","url":null,"abstract":"In this paper, we describe a novel technique for Document script identification(DSI) from printed documents, using Empirical Mode Decomposition (EMD). The intrinsic decomposition nature can adaptively decompose script images into a series of modes representing different local features of script images. In this method, Radon transformed script images are decomposed into finite set of IMFs (Intrinsic Mode Functions). The energy concentration in a particular orientation characterises a script texture as it indicates the dominance of individual script in that direction. We demonstrate how the proposed method use these IMFs as feature vectors to distinguish various scripts.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126513387","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Recognition of Kannada characters extracted from scene images 从场景图像中提取卡纳达语字符的识别
Pub Date : 2012-12-16 DOI: 10.1145/2432553.2432557
D. Kumar, A. Ramakrishnan
In this paper, we describe a method for feature extraction and classification of characters manually isolated from scene or natural images. Characters in a scene image may be affected by low resolution, uneven illumination or occlusion. We propose a novel method to perform binarization on gray scale images by minimizing energy functional. Discrete Cosine Transform and Angular Radial Transform are used to extract the features from characters after normalization for scale and translation. We have evaluated our method on the complete test set of Chars74k dataset for English and Kannada scripts consisting of handwritten and synthesized characters, as well as characters extracted from camera captured images. We utilize only synthesized and handwritten characters from this dataset as training set. Nearest neighbor classification is used in our experiments.
在本文中,我们描述了一种从场景或自然图像中手动分离的特征提取和分类方法。场景图像中的人物可能受到低分辨率、不均匀光照或遮挡的影响。提出了一种利用最小化能量泛函对灰度图像进行二值化的新方法。分别使用离散余弦变换和角径向变换对归一化后的字符进行特征提取。我们在Chars74k数据集的完整测试集上对我们的方法进行了评估,该测试集包括英语和卡纳达语的手写和合成字符,以及从相机捕获的图像中提取的字符。我们只使用该数据集中的合成字符和手写字符作为训练集。在我们的实验中使用了最近邻分类。
{"title":"Recognition of Kannada characters extracted from scene images","authors":"D. Kumar, A. Ramakrishnan","doi":"10.1145/2432553.2432557","DOIUrl":"https://doi.org/10.1145/2432553.2432557","url":null,"abstract":"In this paper, we describe a method for feature extraction and classification of characters manually isolated from scene or natural images. Characters in a scene image may be affected by low resolution, uneven illumination or occlusion. We propose a novel method to perform binarization on gray scale images by minimizing energy functional. Discrete Cosine Transform and Angular Radial Transform are used to extract the features from characters after normalization for scale and translation. We have evaluated our method on the complete test set of Chars74k dataset for English and Kannada scripts consisting of handwritten and synthesized characters, as well as characters extracted from camera captured images. We utilize only synthesized and handwritten characters from this dataset as training set. Nearest neighbor classification is used in our experiments.","PeriodicalId":410986,"journal":{"name":"DAR '12","volume":"72 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-12-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122268114","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
期刊
DAR '12
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1