首页 > 最新文献

2012 International Conference on Frontiers in Handwriting Recognition最新文献

英文 中文
Building a Personal Handwriting Recognizer on an Android Device 在Android设备上构建个人手写识别器
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.189
Debarshi Dutta, Aruni Roy Chowdhury, U. Bhattacharya, S. K. Parui
The wide usage of touch-screen based mobile devices has led to a large volume of the users preferring touch-based interaction with the machine, as opposed to traditional input via keyboards/mice. To exploit this, we focus on the Android platform to design a personalized handwriting recognition system that is acceptably fast, light-weight, possessing a user-friendly interface with minimally-intrusive correction and auto-personalization mechanisms. Since cursive writing on smaller screens is not usual, here we study non-cursive handwriting only. The recognition is done at character level using nearest-neighbor matching to a small, automatically user-adaptive and dynamically updating library of character-class template gestures.
基于触摸屏的移动设备的广泛使用导致大量用户更喜欢基于触摸的机器交互,而不是传统的通过键盘/鼠标输入。为了利用这一点,我们以Android为平台,设计了一个个性化的手写识别系统,该系统具有可接受的速度,重量轻,具有用户友好的界面,具有最小的侵入性校正和自动个性化机制。由于小屏幕上的草书书写并不常见,这里我们只研究非草书书写。识别是在字符级别使用最近邻匹配一个小的,自动用户自适应和动态更新的字符类模板手势库。
{"title":"Building a Personal Handwriting Recognizer on an Android Device","authors":"Debarshi Dutta, Aruni Roy Chowdhury, U. Bhattacharya, S. K. Parui","doi":"10.1109/ICFHR.2012.189","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.189","url":null,"abstract":"The wide usage of touch-screen based mobile devices has led to a large volume of the users preferring touch-based interaction with the machine, as opposed to traditional input via keyboards/mice. To exploit this, we focus on the Android platform to design a personalized handwriting recognition system that is acceptably fast, light-weight, possessing a user-friendly interface with minimally-intrusive correction and auto-personalization mechanisms. Since cursive writing on smaller screens is not usual, here we study non-cursive handwriting only. The recognition is done at character level using nearest-neighbor matching to a small, automatically user-adaptive and dynamically updating library of character-class template gestures.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"504 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116681248","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Handwritten and Machine Printed Text Separation in Document Images Using the Bag of Visual Words Paradigm 基于视觉词袋范式的文档图像手写体与机印体文本分离
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.207
Konstantinos Zagoris, I. Pratikakis, A. Antonacopoulos, B. Gatos, N. Papamarkos
In a number of types of documents, ranging from forms to archive documents and books with annotations, machine printed and handwritten text may be present in the same document image, giving rise to significant issues within a digitisation and recognition pipeline. It is therefore necessary to separate the two types of text before applying different recognition methodologies to each. In this paper, a new approach is proposed which strives towards identifying and separating handwritten from machine printed text using the Bag of Visual Words paradigm (BoVW). Initially, blocks of interest are detected in the document image. For each block, a descriptor is calculated based on the BoVW. The final characterization of the blocks as Handwritten, Machine Printed or Noise is made by a Support Vector Machine classifier. The promising performance of the proposed approach is shown by using a consistent evaluation methodology which couples meaningful measures along with a new dataset.
在许多类型的文件中,从表格到档案文件和带有注释的书籍,机器打印和手写的文本可能出现在相同的文件图像中,这在数字化和识别管道中引起了重大问题。因此,在对每种文本应用不同的识别方法之前,有必要将两种类型的文本分开。本文提出了一种新的方法,利用视觉词袋范式(BoVW)来识别和分离手写文本和机器打印文本。最初,在文档图像中检测感兴趣的块。对于每个块,一个描述符是基于BoVW计算的。通过支持向量机分类器将块的最终特征描述为手写,机器打印或噪声。通过使用一致的评估方法,将有意义的度量与新的数据集结合起来,表明了所提出方法的良好性能。
{"title":"Handwritten and Machine Printed Text Separation in Document Images Using the Bag of Visual Words Paradigm","authors":"Konstantinos Zagoris, I. Pratikakis, A. Antonacopoulos, B. Gatos, N. Papamarkos","doi":"10.1109/ICFHR.2012.207","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.207","url":null,"abstract":"In a number of types of documents, ranging from forms to archive documents and books with annotations, machine printed and handwritten text may be present in the same document image, giving rise to significant issues within a digitisation and recognition pipeline. It is therefore necessary to separate the two types of text before applying different recognition methodologies to each. In this paper, a new approach is proposed which strives towards identifying and separating handwritten from machine printed text using the Bag of Visual Words paradigm (BoVW). Initially, blocks of interest are detected in the document image. For each block, a descriptor is calculated based on the BoVW. The final characterization of the blocks as Handwritten, Machine Printed or Noise is made by a Support Vector Machine classifier. The promising performance of the proposed approach is shown by using a consistent evaluation methodology which couples meaningful measures along with a new dataset.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"21 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126138599","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 27
On the possibility of instance-based stroke recovery 基于实例的中风恢复的可能性
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.248
Yutaro Iwakiri, Soma Shiraishi, Yaokai Feng, S. Uchida
This paper tackles the stroke recovery problem, which is a typical ill-posed reverse problem, by an instance-based method. The basic idea of the instance-based stroke recovery is to refer to the drawing order of a similar instance. The instance-based method has a strong merit that it can deal with multi-stroke characters and other complex characters without any special consideration. However, it requires a sufficient numbers of instances to cover those various characters. As an initial trial of the instance-based stroke recovery method, this paper describes the principle of the method and then provides several experimental results. The experimental results indicate the potential of the proposed method on recovering the drawing order of complex characters, as expected.
本文采用基于实例的方法解决了典型的不适定逆问题——冲程恢复问题。基于实例的行程恢复的基本思想是参考类似实例的绘制顺序。基于实例的方法在处理多笔划字符和其他复杂字符时不需要特别考虑,具有很强的优点。然而,它需要足够数量的实例来涵盖这些不同的字符。作为基于实例的冲程恢复方法的初步试验,本文介绍了该方法的原理,并给出了几个实验结果。实验结果表明,该方法在恢复复杂字符的绘制顺序方面具有一定的潜力。
{"title":"On the possibility of instance-based stroke recovery","authors":"Yutaro Iwakiri, Soma Shiraishi, Yaokai Feng, S. Uchida","doi":"10.1109/ICFHR.2012.248","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.248","url":null,"abstract":"This paper tackles the stroke recovery problem, which is a typical ill-posed reverse problem, by an instance-based method. The basic idea of the instance-based stroke recovery is to refer to the drawing order of a similar instance. The instance-based method has a strong merit that it can deal with multi-stroke characters and other complex characters without any special consideration. However, it requires a sufficient numbers of instances to cover those various characters. As an initial trial of the instance-based stroke recovery method, this paper describes the principle of the method and then provides several experimental results. The experimental results indicate the potential of the proposed method on recovering the drawing order of complex characters, as expected.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123584719","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Keyword Spotting Framework Using Dynamic Background Model 基于动态背景模型的关键词定位框架
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.223
Manish Kumar, Zhixin Shi, S. Setlur, V. Govindaraju, R. Sitaram
An important task in Keyword Spotting in handwritten documents is to separate Keywords from Non Keywords. Very often this is achieved by learning a filler or background model. A common method of building a background model is to allow all possible sequences or transitions of characters. However, due to large variation in handwriting styles, allowing all possible sequences of characters as background might result in an increased false reject. A weak background model could result in high false accept. We propose a novel way of learning the background model dynamically. The approach first used in word spotting in speech uses a feature vector of top K local scores per character and top N global scores of matching hypotheses. A two class classifier is learned on these features to classify between Keyword and Non Keyword.
手写体文档关键字识别的一项重要任务是将关键字与非关键字区分开来。这通常是通过学习填充或背景模型来实现的。建立背景模型的一种常用方法是允许所有可能的字符序列或转换。然而,由于笔迹风格的差异很大,允许所有可能的字符序列作为背景可能会导致错误拒绝的增加。弱背景模型可能导致高误接受。提出了一种动态学习背景模型的新方法。该方法首先用于语音中的单词识别,使用每个字符的前K个局部分数和匹配假设的前N个全局分数的特征向量。利用这些特征学习两类分类器对关键字和非关键字进行分类。
{"title":"Keyword Spotting Framework Using Dynamic Background Model","authors":"Manish Kumar, Zhixin Shi, S. Setlur, V. Govindaraju, R. Sitaram","doi":"10.1109/ICFHR.2012.223","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.223","url":null,"abstract":"An important task in Keyword Spotting in handwritten documents is to separate Keywords from Non Keywords. Very often this is achieved by learning a filler or background model. A common method of building a background model is to allow all possible sequences or transitions of characters. However, due to large variation in handwriting styles, allowing all possible sequences of characters as background might result in an increased false reject. A weak background model could result in high false accept. We propose a novel way of learning the background model dynamically. The approach first used in word spotting in speech uses a feature vector of top K local scores per character and top N global scores of matching hypotheses. A two class classifier is learned on these features to classify between Keyword and Non Keyword.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116070672","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
New Protocol Design for Wordspotting Assistance System: Case Study of the Collaborative Library Model - ARMARIUS 词语识别辅助系统的新协议设计:以协同图书馆模式ARMARIUS为例
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.242
Abir Chaari, Fadoua Drira, A. Alimi, Elöd Egyed-Zsigmond, Frank Lebourgeois
The cultural heritage is full of important manuscript collections preserved in digital libraries. The need to annotate and enrich the scanned documents is claimed by some users to keep traces in the system for a further use. Moreover, the reuse of annotations could help other users to accomplish repetitive tasks in a semi-automatic way. One manuscript annotation technique is the word spotting. It is a process that seeks in a document for all the fragments that are similar to the one specified by the user. The main focus of this research work is to propose a solution integrating and encapsulating the word spotting algorithm in digital libraries. This solution involves, in particular, the specification and the implementation of an architecture to integrate the image processing tool using Restful Web services. The proposed prototype is tested on the ARMARIUS digital library. This library is one of the collaborative digital archiving models that stores ancient digitized manuscripts.
文化遗产中有大量保存在数字图书馆的重要手稿收藏。一些用户声称需要对扫描的文档进行注释和丰富,以便在系统中保留痕迹以供进一步使用。此外,注释的重用可以帮助其他用户以半自动的方式完成重复的任务。一种手稿注释技术是单词标记。这是一个在文档中寻找与用户指定的片段相似的所有片段的过程。本文的研究重点是提出一种集成和封装数字图书馆中单词识别算法的解决方案。该解决方案特别涉及使用Restful Web服务集成图像处理工具的体系结构的规范和实现。该原型在ARMARIUS数字图书馆上进行了测试。该图书馆是保存古代数字化手稿的协作数字存档模式之一。
{"title":"New Protocol Design for Wordspotting Assistance System: Case Study of the Collaborative Library Model - ARMARIUS","authors":"Abir Chaari, Fadoua Drira, A. Alimi, Elöd Egyed-Zsigmond, Frank Lebourgeois","doi":"10.1109/ICFHR.2012.242","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.242","url":null,"abstract":"The cultural heritage is full of important manuscript collections preserved in digital libraries. The need to annotate and enrich the scanned documents is claimed by some users to keep traces in the system for a further use. Moreover, the reuse of annotations could help other users to accomplish repetitive tasks in a semi-automatic way. One manuscript annotation technique is the word spotting. It is a process that seeks in a document for all the fragments that are similar to the one specified by the user. The main focus of this research work is to propose a solution integrating and encapsulating the word spotting algorithm in digital libraries. This solution involves, in particular, the specification and the implementation of an architecture to integrate the image processing tool using Restful Web services. The proposed prototype is tested on the ARMARIUS digital library. This library is one of the collaborative digital archiving models that stores ancient digitized manuscripts.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"71 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121998507","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Stroke Segmentation and Recognition from Bangla Online Handwritten Text 孟加拉语在线手写体笔画分割与识别
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.275
Nilanjana Bhattacharya, U. Pal
This paper deals with recognition of online handwritten Bangla (Bengali) text. Here, at first, we segment cursive words into strokes. A stroke may represent a character or a part of a character. We selected a set of Bangla words written by different groups of people such that they contain all basic characters, all vowel and consonant modifiers and almost all types of possible joining among them. For segmentation of text into strokes, we discovered some rules analyzing different joining patterns of Bangla characters. Combination of online and offline information was used for segmentation. We achieved correct segmentation rate of 97.89% on the dataset. We manually analyzed different strokes to create a ground truth set of distinct stroke classes for result verification and we obtained 85 stroke classes. Directional features were used in SVM for recognition and we achieved correct stroke recognition rate of 97.68%.
本文研究了在线手写体孟加拉文本的识别问题。在这里,首先,我们将草书词分成笔画。一个笔画可以代表一个字符或字符的一部分。我们选择了一组由不同群体的人写的孟加拉语单词,这些单词包含了所有的基本字符,所有的元音和辅音修饰语以及它们之间几乎所有可能的连接类型。对于文字笔画的分割,我们通过分析不同的孟加拉文字连接方式,发现了一些规律。采用线上和线下信息相结合的方法进行分割。我们在数据集上实现了97.89%的正确分割率。我们手动分析了不同的笔画,以创建不同笔画类的基本真实集,用于结果验证,我们获得了85个笔画类。在支持向量机中使用方向特征进行识别,正确笔画识别率达到97.68%。
{"title":"Stroke Segmentation and Recognition from Bangla Online Handwritten Text","authors":"Nilanjana Bhattacharya, U. Pal","doi":"10.1109/ICFHR.2012.275","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.275","url":null,"abstract":"This paper deals with recognition of online handwritten Bangla (Bengali) text. Here, at first, we segment cursive words into strokes. A stroke may represent a character or a part of a character. We selected a set of Bangla words written by different groups of people such that they contain all basic characters, all vowel and consonant modifiers and almost all types of possible joining among them. For segmentation of text into strokes, we discovered some rules analyzing different joining patterns of Bangla characters. Combination of online and offline information was used for segmentation. We achieved correct segmentation rate of 97.89% on the dataset. We manually analyzed different strokes to create a ground truth set of distinct stroke classes for result verification and we obtained 85 stroke classes. Directional features were used in SVM for recognition and we achieved correct stroke recognition rate of 97.68%.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129904973","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Invited Lecture II: Evaluating the Probability of Identification in the Forensic Sciences 特邀讲座II:评估法医学鉴定的可能性
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.301
S. Srihari
Forensic identification is the task of determining whether or not observed evidence arose from a known source. In every forensic domain, it is useful to determine the probability that the evidence and known can be attributed to the same individual so that identification/exclusion opinions can be accompanied by a probability statement. At present, in most forensic domains outside of DNA, it is not possible to make such a statement since the necessary probability distributions cannot be computed with reasonable accuracy. It involves determining a likelihood ratio (LR) -the ratio of the joint probability of the evidence and source under the identification hypothesis (that the evidence came from the source) and under the exclusion hypothesis (that the evidence did not arise from the source). The joint probability approach is computationally and statistically infeasible when the number of variables is even moderately large, e.g., the number of parameters to be determined is exponential with the number of variables. An approximate method is to replace the joint probability by another probability: that of (dis)imilarity between evidence and object under the two hypotheses. While this distance-based approach reduces to linear complexity with the number of variables, it is an oversimplification. A third method, which decomposes the LR into a product of two factors, one based on distance and the other on rarity, has intuitive appeal-forensic examiners assign higher importance to rare attributes in the evidence. Theoretical discussions of the three approaches and empirical evaluations done with several data types (continuous features, binary features, multinomial and graph) will be described. Experiments with handwriting using binary and multinomial features show that the distance and rarity method is significantly better than the distance only method. Work was done with Yi Tang.
法医鉴定是确定观察到的证据是否来自已知来源的任务。在每个法医领域,确定证据和已知证据可以归因于同一个人的概率是有用的,以便识别/排除意见可以伴随着概率陈述。目前,在DNA以外的大多数法医领域,由于无法以合理的准确性计算必要的概率分布,因此不可能做出这样的陈述。它涉及确定似然比(LR)——在识别假设(证据来自于来源)和排除假设(证据不是来自于来源)下证据和来源的联合概率之比。当变量数量甚至中等大时,如待确定的参数数量与变量数量呈指数关系时,联合概率方法在计算和统计上都是不可行的。一种近似的方法是用另一种概率来代替联合概率,即在两种假设下证据与对象之间(不)相似的概率。虽然这种基于距离的方法减少了变量数量的线性复杂性,但它过于简化了。第三种方法是将LR分解为两个因素的乘积,一个是基于距离,另一个是基于稀有性,这种方法具有直观的吸引力——法医审查员将证据中的稀有性属性赋予更高的重要性。本文将描述这三种方法的理论讨论,以及用几种数据类型(连续特征、二元特征、多项特征和图)进行的实证评估。用二值特征和多项特征对手写体进行的实验表明,距离和稀有性方法明显优于仅使用距离的方法。工作是和一唐一起完成的。
{"title":"Invited Lecture II: Evaluating the Probability of Identification in the Forensic Sciences","authors":"S. Srihari","doi":"10.1109/ICFHR.2012.301","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.301","url":null,"abstract":"Forensic identification is the task of determining whether or not observed evidence arose from a known source. In every forensic domain, it is useful to determine the probability that the evidence and known can be attributed to the same individual so that identification/exclusion opinions can be accompanied by a probability statement. At present, in most forensic domains outside of DNA, it is not possible to make such a statement since the necessary probability distributions cannot be computed with reasonable accuracy. It involves determining a likelihood ratio (LR) -the ratio of the joint probability of the evidence and source under the identification hypothesis (that the evidence came from the source) and under the exclusion hypothesis (that the evidence did not arise from the source). The joint probability approach is computationally and statistically infeasible when the number of variables is even moderately large, e.g., the number of parameters to be determined is exponential with the number of variables. An approximate method is to replace the joint probability by another probability: that of (dis)imilarity between evidence and object under the two hypotheses. While this distance-based approach reduces to linear complexity with the number of variables, it is an oversimplification. A third method, which decomposes the LR into a product of two factors, one based on distance and the other on rarity, has intuitive appeal-forensic examiners assign higher importance to rare attributes in the evidence. Theoretical discussions of the three approaches and empirical evaluations done with several data types (continuous features, binary features, multinomial and graph) will be described. Experiments with handwriting using binary and multinomial features show that the distance and rarity method is significantly better than the distance only method. Work was done with Yi Tang.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129119562","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Grouping of Handwritten Bangla Basic Characters, Numerals and Vowel Modifiers for Multilayer Classification 手写体孟加拉语基本字、数字和元音修饰语的分层分类
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.206
Khondker Nayef Reza, Mumit Khan
For better performance in multilayer or hierarchical classification of handwritten text, appropriate grouping of similar symbols is very important. Here we aim to develop a reliable grouping schema for the similar looking basic characters, numerals and vowel modifiers of Bangla language. We experimented with thickened and thinned segmented handwritten text to compare which type of image is better for which group. For classification we chose Support Vector Machine (SVM) as it outperforms other classifiers in this field. We used both “one against one” and “one against all” strategies for multiclass SVM and compared their performance.
为了在多层或分层的手写文本分类中获得更好的性能,对相似符号进行适当的分组是非常重要的。本文旨在为孟加拉语相似的基本字、数字和元音修饰语建立一个可靠的分组模式。我们实验了加厚和稀释的分割手写文本,以比较哪种类型的图像更适合哪一组。对于分类,我们选择支持向量机(SVM),因为它优于该领域的其他分类器。我们对多类支持向量机分别采用了“一对一”和“一对全”两种策略,并比较了它们的性能。
{"title":"Grouping of Handwritten Bangla Basic Characters, Numerals and Vowel Modifiers for Multilayer Classification","authors":"Khondker Nayef Reza, Mumit Khan","doi":"10.1109/ICFHR.2012.206","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.206","url":null,"abstract":"For better performance in multilayer or hierarchical classification of handwritten text, appropriate grouping of similar symbols is very important. Here we aim to develop a reliable grouping schema for the similar looking basic characters, numerals and vowel modifiers of Bangla language. We experimented with thickened and thinned segmented handwritten text to compare which type of image is better for which group. For classification we chose Support Vector Machine (SVM) as it outperforms other classifiers in this field. We used both “one against one” and “one against all” strategies for multiclass SVM and compared their performance.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129240122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
MfrDB: Database of Annotated On-Line Mathematical Formulae MfrDB:在线数学公式注释数据库
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.231
Jan Stria, Martin Bresler, D. Prusa, Václav Hlaváč
This paper announces a ground truthed database of on-line handwritten mathematical formulae. It have recently been collected in our group in connection with the research on methods for structural pattern recognition. Unlike the availability of handwritten characters or texts, collections of structural objects are rather scarce, thus we would like to provide them to the community. We also present the methodology and tools used for data acquisition. Finally, we report on our experiment with the automatic generation of additional samples. The process utilizes the dataset to extract statistical descriptions of symbols alignments and relative sizes.
本文提出了一种地面真实的在线手写数学公式数据库。这是我们小组最近在研究结构模式识别方法时收集到的。与手写体或文本的可用性不同,结构对象的集合相当稀缺,因此我们希望将它们提供给社区。我们还介绍了用于数据采集的方法和工具。最后,我们报告了自动生成附加样本的实验。该过程利用数据集提取符号对齐和相对大小的统计描述。
{"title":"MfrDB: Database of Annotated On-Line Mathematical Formulae","authors":"Jan Stria, Martin Bresler, D. Prusa, Václav Hlaváč","doi":"10.1109/ICFHR.2012.231","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.231","url":null,"abstract":"This paper announces a ground truthed database of on-line handwritten mathematical formulae. It have recently been collected in our group in connection with the research on methods for structural pattern recognition. Unlike the availability of handwritten characters or texts, collections of structural objects are rather scarce, thus we would like to provide them to the community. We also present the methodology and tools used for data acquisition. Finally, we report on our experiment with the automatic generation of additional samples. The process utilizes the dataset to extract statistical descriptions of symbols alignments and relative sizes.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116500518","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Online Arabic Handwritten Digits Recognition 在线阿拉伯语手写数字识别
Pub Date : 2012-09-18 DOI: 10.1109/ICFHR.2012.249
S. Abdelazeem, Maha El Meseery, Hany Ahmed
This paper fills a void in the literature of online Arabic handwritten digits recognition as no systems are dedicated to this problem. The two main contributions of this paper are introducing a large online Arabic handwritten digits dataset and developing an efficient online Arabic handwritten digits recognition system. In the dataset, we collected 30,000 online Arabic digits from 300 writers. The developed system uses a combination of temporal and spatial features to recognize those digits. The system achieved 98.73% recognition rate. Comparison with a commercial product demonstrates the superiority of the proposed system.
本文填补了在线阿拉伯文手写数字识别文献的空白,因为没有专门的系统来解决这个问题。本文的两个主要贡献是介绍了一个大型的在线阿拉伯语手写数字数据集和开发了一个高效的在线阿拉伯语手写数字识别系统。在数据集中,我们从300位作者那里收集了30,000个在线阿拉伯数字。开发的系统使用时间和空间特征的组合来识别这些数字。系统的识别率达到了98.73%。与商业产品的比较表明了该系统的优越性。
{"title":"Online Arabic Handwritten Digits Recognition","authors":"S. Abdelazeem, Maha El Meseery, Hany Ahmed","doi":"10.1109/ICFHR.2012.249","DOIUrl":"https://doi.org/10.1109/ICFHR.2012.249","url":null,"abstract":"This paper fills a void in the literature of online Arabic handwritten digits recognition as no systems are dedicated to this problem. The two main contributions of this paper are introducing a large online Arabic handwritten digits dataset and developing an efficient online Arabic handwritten digits recognition system. In the dataset, we collected 30,000 online Arabic digits from 300 writers. The developed system uses a combination of temporal and spatial features to recognize those digits. The system achieved 98.73% recognition rate. Comparison with a commercial product demonstrates the superiority of the proposed system.","PeriodicalId":291062,"journal":{"name":"2012 International Conference on Frontiers in Handwriting Recognition","volume":"24 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2012-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125254571","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 16
期刊
2012 International Conference on Frontiers in Handwriting Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1