Ninth International Conference on Document Analysis and Recognition (ICDAR 2007): Latest Papers

Elastic Matching of Online Handwritten Tamil and Telugu Scripts Using Local Features
L. Prasanth, V. Jagadeesh Babu, R. Raghunath Sharma, G. V. Prabhakara Rao, Dinesh Mandalapu
This paper describes character-based elastic matching using local features for recognizing online handwritten data. Dynamic time warping (DTW) is used with four feature sets: (1) x-y features; (2) shape context (SC) and tangent angle (TA) features; (3) the generalized shape context (GSC) feature; and (4) x-y features together with normalized first and second derivatives and curvature features. A nearest-neighbor classifier with DTW distance is used as the classifier. In comparison, the SC and TA feature set was the slowest, and the fourth set gave the best recognition rate. Results are reported for online handwritten Tamil and Telugu data. On Telugu data we obtained an accuracy of 90.6% at a speed of 0.166 symbols/sec. To increase the speed, we propose a two-stage recognition scheme, with which we obtained an accuracy of 89.77% at a speed of 3.977 symbols/sec.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.106
Citations: 58
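The DTW-based nearest-neighbor matching described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function names `dtw_distance` and `nearest_neighbor`, the Euclidean local cost, the unconstrained warping path, and the toy templates are all assumptions, and none of the paper's four feature sets is reproduced here.

```python
import math

def dtw_distance(a, b):
    """DTW distance between two sequences of equal-dimension feature vectors."""
    n, m = len(a), len(b)
    inf = float("inf")
    # cost[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = math.dist(a[i - 1], b[j - 1])  # Euclidean local cost (assumed)
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

def nearest_neighbor(query, templates):
    """Return the label of the template with the smallest DTW distance to query."""
    return min(templates, key=lambda t: dtw_distance(query, t[1]))[0]

# Toy (label, point sequence) templates; purely illustrative x-y features.
templates = [
    ("line", [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0)]),
    ("hook", [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0)]),
]
query = [(0.0, 0.0), (0.5, 0.5), (1.0, 1.0), (2.0, 2.0)]
print(nearest_neighbor(query, templates))  # prints "line"
```

Because every query is compared against every template with a quadratic-time alignment, this kind of classifier is slow on large template sets, which is consistent with the motivation the abstract gives for a faster two-stage scheme.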
Off-line Signature Verification System Performance against Image Acquisition Resolution
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.4377032
J. Vargas-Bonilla, M. A. Ferrer-Ballester, C. Travieso-González, J. B. Alonso
This paper analyzes how changing the image resolution affects the performance of an off-line signature verification system. The geometrical features used by the analyzed system are based on two vectors that represent the envelope and the interior stroke distribution in polar and Cartesian coordinates. Image resolution is progressively reduced from an initial 600 dpi down to 45 dpi. The robustness of the system against random and simple forgeries is tested with a hidden Markov model. The results show that 150 dpi offers a good trade-off between performance and image resolution for static features.
Citations: 3
Computer Assisted Transcription of Handwritten Text Images
A. Toselli, Verónica Romero, Luis Rodríguez, E. Vidal
To date, automatic handwriting recognition systems are far from perfect, and they often need a post-editing step in which a human checks and corrects their results. We propose a new interactive, on-line framework that, rather than aiming at full automation, assists the human in the recognition-transcription process itself; that is, it facilitates and speeds up the transcription of handwritten texts. This framework combines the efficiency of automatic handwriting recognition systems with the accuracy of the human transcriber. The result is a cost-effective, perfect transcription of the handwritten text images.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.86
Citations: 50
A Suffix Tree Based Handwritten Chinese Address Recognition System
Y. Jiang, X. Ding, Z. Ren
The main contribution of this paper is a suffix-tree-based data structure for automatic reading of handwritten Chinese addresses. Since many papers have already discussed destination address block (DAB) location for Chinese mail, we do not elaborate on it here. Instead, we focus on improving address matching performance after DAB location. As in some conventional methods, the extracted text lines are pre-segmented into a series of radicals. We then build a hierarchical structure of substrings from the recognized characters of valid radical combinations. Coarse address candidates are selected at the same time. In address matching, we incorporate postcode information to filter redundant addresses. The pre-segmented radicals are compared with each candidate address, and a cost function combining recognition cost and structural cost is evaluated for the final decision. In the system, character segmentation, recognition, string searching, and matching are considered jointly by taking advantage of lexicon knowledge. The suffix tree greatly facilitates substring generation and lets the matching process start from any character, so that even fragmentary information can be collected. Therefore, our algorithm is more robust to intervening noise and irregular writing styles. Finally, we test on 1,000 handwritten Chinese envelopes and achieve a correct rate of 85.30% at an average of 3.0 seconds per mail piece.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.36
Citations: 10
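To illustrate why indexing suffixes lets matching "start from any character", here is a small sketch that indexes every suffix of each lexicon entry so that any recognized fragment can be looked up as a substring. It is a simplification under stated assumptions: it uses an uncompressed nested-dict suffix trie rather than a true suffix tree, the names `build_suffix_trie` and `entries_containing` and the three-entry lexicon are hypothetical, and it ignores the radical segmentation, postcode filtering, and cost function described in the abstract.

```python
def build_suffix_trie(lexicon):
    """Index every suffix of every lexicon entry in a nested-dict trie."""
    root = {}
    for entry_id, text in enumerate(lexicon):
        for start in range(len(text)):
            node = root
            for ch in text[start:]:
                node = node.setdefault(ch, {})
                # "$ids" (never a single character) records which entries
                # contain the substring spelled out on the path to this node.
                node.setdefault("$ids", set()).add(entry_id)
    return root

def entries_containing(trie, fragment):
    """Return ids of lexicon entries that contain `fragment` as a substring."""
    node = trie
    for ch in fragment:
        if ch not in node:
            return set()
        node = node[ch]
    return set(node.get("$ids", set()))

# Hypothetical address lexicon, for illustration only.
lexicon = ["北京市海淀区清华大学", "北京市朝阳区建国路", "上海市浦东新区世纪大道"]
trie = build_suffix_trie(lexicon)
print(entries_containing(trie, "海淀区"))  # prints {0}
```

A fragment recognized in the middle of an address line, such as a district name, retrieves its candidate addresses directly, without requiring the leading characters to have been recognized correctly.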
Machine Dating of Handwritten Manuscripts
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.4377017
Utpal Garain, S. K. Parui, T. Paquet, L. Heutte
This paper presents a pioneering study on automatic dating of handwritten manuscripts. Analysis of handwriting style forms the core of the dating method. Initially, it is hypothesized that a manuscript can be dated, to a certain level of accuracy, by looking at the way it is written. The hypothesis is then verified with real samples of known dates. A general framework is proposed for machine dating of handwritten manuscripts. Experiments on a database containing manuscripts of Gustave Flaubert (1821-1880), the famous French novelist, report about 62% accuracy when manuscripts are dated within a range of five calendar years of their exact year of writing.
Citations: 7
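One plausible reading of the evaluation criterion in this abstract is a tolerance-based accuracy: a prediction counts as correct when it falls within a fixed number of years of the true year. The sketch below assumes that reading (an absolute difference of at most `tolerance` years); the function name `accuracy_within` and the sample years are hypothetical and are not taken from the paper.

```python
def accuracy_within(predicted_years, true_years, tolerance=5):
    """Fraction of predictions within `tolerance` years of the true year."""
    if len(predicted_years) != len(true_years):
        raise ValueError("inputs must have the same length")
    if not true_years:
        raise ValueError("inputs must not be empty")
    hits = sum(abs(p - t) <= tolerance for p, t in zip(predicted_years, true_years))
    return hits / len(true_years)

# Made-up years for illustration: the differences are 2, 6, 1 and 20 years.
predicted = [1850, 1862, 1871, 1840]
actual = [1852, 1856, 1870, 1860]
print(accuracy_within(predicted, actual))  # prints 0.5
```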
Arabic Handwriting Texture Analysis for Writer Identification Using the DWT-Lifting Scheme
S. Gazzah, N. Amara
In this paper, we present an approach for writer identification using off-line Arabic handwriting. The proposed method analyzes handwriting texture with 2D discrete wavelet transforms computed using the lifting scheme. We compare the textural features extracted by 9 different wavelet transform functions. A modular multilayer perceptron classifier is used. Experiments show that writer identification accuracy reaches its best level at an average rate of 95.68%. The experiments were carried out on a database of 180 text samples. The text was chosen to guarantee coverage of the various internal shapes and letter positions within an Arabic subword.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.62
Citations: 46
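For readers unfamiliar with the lifting scheme, the sketch below shows one level of a 1D wavelet transform built from the three lifting steps (split, predict, update), together with its exact inverse. It assumes the unnormalized Haar wavelet, which is only one possible choice; the abstract does not name its 9 wavelet functions. The function names and the `subband_energy` feature are illustrative assumptions, and a 2D transform would apply the same steps along rows and then columns.

```python
def haar_lift(signal):
    """One level of the (unnormalized) Haar wavelet via lifting."""
    if len(signal) % 2:
        raise ValueError("signal length must be even")
    even, odd = signal[0::2], signal[1::2]               # split
    detail = [o - e for e, o in zip(even, odd)]          # predict: odd from even
    approx = [e + d / 2 for e, d in zip(even, detail)]   # update: keep pair mean
    return approx, detail

def haar_unlift(approx, detail):
    """Invert haar_lift by undoing the update and predict steps in reverse."""
    even = [a - d / 2 for a, d in zip(approx, detail)]
    odd = [d + e for e, d in zip(even, detail)]
    out = []
    for e, o in zip(even, odd):
        out.extend((e, o))
    return out

def subband_energy(coeffs):
    """Mean squared coefficient: a simple texture descriptor (assumed here)."""
    return sum(c * c for c in coeffs) / len(coeffs)

approx, detail = haar_lift([4, 6, 10, 2])
print(approx, detail)               # prints [5.0, 6.0] [2, -8]
print(haar_unlift(approx, detail))  # prints [4.0, 6.0, 10.0, 2.0]
```

Each lifting step is undone simply by reversing its sign, so the transform is invertible by construction and can be computed in place.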
A New Method for Writer Identification and Verification Based on Farsi/Arabic Handwritten Texts
F. Nejad, M. Rahmati
Most studies of writer identification are based on English documents, and to our knowledge no research has been reported on Farsi or Arabic documents. In this paper, we propose a text-dependent method for off-line writer identification and verification based on Farsi handwriting. Following an idea presented in previous studies, we treat handwriting as a texture image; after a normalization step, Gabor filters are applied to the image and new features are extracted. The key property of the proposed method is its use of a bank of Gabor filters suited to the structure of Farsi handwritten texts and to the vision system. We also propose a new method, based on moments and a nonlinear transform, for extracting features from the output of the Gabor filters. In addition, by defining a confidence criterion, we propose a new method for writer verification. A comparison with other methods shows that the proposed method achieves better performance on Farsi handwriting from 40 people.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.24
Citations: 56
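A bank of Gabor filters, as mentioned in this abstract, is a set of oriented band-pass kernels at several wavelengths and orientations. The sketch below builds such a bank from the standard real-valued Gabor function (a Gaussian envelope multiplied by a cosine carrier). The parameter values and the names `gabor_kernel` and `gabor_bank` are assumptions for illustration; the paper's actual filter design and its moment-based features are not reproduced here.

```python
import math

def gabor_kernel(size, wavelength, theta, sigma, gamma=1.0, psi=0.0):
    """Real part of a 2D Gabor kernel as a size x size list of lists."""
    if size % 2 == 0:
        raise ValueError("size must be odd")
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the carrier wave runs along direction theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr * xr + (gamma * yr) ** 2) / (2 * sigma ** 2))
            row.append(envelope * math.cos(2 * math.pi * xr / wavelength + psi))
        kernel.append(row)
    return kernel

def gabor_bank(size, wavelengths, n_orientations, sigma):
    """One kernel per (wavelength, orientation) pair, orientations in [0, pi)."""
    return [
        gabor_kernel(size, w, k * math.pi / n_orientations, sigma)
        for w in wavelengths
        for k in range(n_orientations)
    ]

# Illustrative settings: 2 wavelengths x 4 orientations = 8 kernels.
bank = gabor_bank(size=5, wavelengths=[4.0, 8.0], n_orientations=4, sigma=2.0)
print(len(bank))  # prints 8
```

In a texture-based pipeline, each kernel would be convolved with the normalized handwriting image and statistics of the responses would serve as features.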
On Segmentation of Documents in Complex Scripts
K. S. S. Kumar, S. Kumar, C. V. Jawahar
Document image segmentation algorithms primarily aim at separating text and graphics in the presence of complex layouts. However, for many non-Latin scripts, segmentation becomes a challenge because of the characteristics of the script. In this paper, we empirically demonstrate that algorithms that are successful for Latin scripts may not be very effective for Indic and other complex scripts. We explain this by the differences in the spatial distribution of symbols in the scripts. We argue that, for accurate results, the visual information used for segmentation needs to be augmented with other information such as script models.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.194
Citations: 25
Simultaneous Layout Style and Logical Entity Recognition in a Heterogeneous Collection of Documents
Siyuan Chen, Song Mao, G. Thoma
Logical entity recognition in heterogeneous collections of document page images remains a challenging problem, since the performance of traditional supervised methods degrades dramatically when there are many distinct layout styles. In this paper we present an unsupervised method in which layout style information is used explicitly in both the training and recognition phases. We represent the layout style, local features, and logical labels of the physical regions of a document compactly by an ordered labeled X-Y tree. The style dissimilarity of two document pages is represented by the distance between their respective trees. During the training phase, document pages with true logical labels in the training set are classified into distinct layout styles by unsupervised clustering. During the recognition phase, the layout style and logical entities of an input document are recognized simultaneously by matching the input tree to the trees in the closest-matching layout style cluster of the training set. Experimental results show that our algorithm is robust to balanced and unbalanced style cluster sizes, zone over-segmentation, zone length variation, and variation in tree representations of the same layout style.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.231
Citations: 10
Text Input System Using Online Overlapped Handwriting Recognition for Mobile Devices
Yojiro Tonouchi, A. Kawamura
This paper proposes a novel online overlapped handwriting recognition system for mobile devices such as cellular phones. Users can input characters continuously, without pauses, in a single writing area. The system has three features: a small writing area, quick response, and direct operation with handwritten gestures, which make it well suited to such devices. The system realizes a new handwriting interface similar to touch-typing. We evaluated the system in two experiments: character recognition performance and text entry speed for Japanese sentences. These experiments showed the effectiveness of the proposed system.
Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.243
Citations: 7