首页 > 最新文献

Proceedings of Sixth International Conference on Document Analysis and Recognition最新文献

英文 中文
Visual exploration and functional document labeling 视觉探索和功能文档标记
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953901
V. Eglin, Antoine Gagneux
This paper presents a new approach to textual data labeling based on texture analysis. Texture is used here to show the impact of document composition on visual exploration. We demonstrate how textural properties are well adapted to typography characterization by categorizing document regions into visual text classes (such as headings, head- and footnotes, paragraphs, abstracts, etc.). We reference and classify different types of text fonts according to their visual aspect and the visual impression that emerges from the textual data. Experiments on a set of various document images show a good accuracy and robustness for our method.
提出了一种基于纹理分析的文本数据标注方法。这里使用纹理来显示文档构成对视觉探索的影响。我们通过将文档区域分类为可视文本类(如标题、头部和脚注、段落、摘要等)来演示纹理属性如何很好地适应排版特性。我们根据不同类型的文本字体的视觉方面和从文本数据中产生的视觉印象来参考和分类。在一组不同文档图像上的实验表明,该方法具有良好的准确性和鲁棒性。
{"title":"Visual exploration and functional document labeling","authors":"V. Eglin, Antoine Gagneux","doi":"10.1109/ICDAR.2001.953901","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953901","url":null,"abstract":"This paper presents a new approach to textual data labeling based on texture analysis. Texture is used here to show the impact of document composition on visual exploration. We demonstrate how textural properties are well adapted to typography characterization by categorizing document regions into visual text classes (such as headings, head- and footnotes, paragraphs, abstracts, etc.). We reference and classify different types of text fonts according to their visual aspect and the visual impression that emerges from the textual data. Experiments on a set of various document images show a good accuracy and robustness for our method.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"31 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129572697","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Prediction of handwriting legibility 预测笔迹的易读性
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953935
M. E. Dehkordi, N. Sherkat, Tony Allen
This paper describes an independent handwriting style classifier that has been designed to select the best recognizer for a given style of writing. For this purpose a definition of handwriting legibility has been defined and a method has been implemented that can predict this legibility. The technique consists of two phases. In the feature extraction phase, a set of 16 features is extracted from the image contour. These features have been selected from amongst a set of pre-recognition features as those features that contribute the most (95%) to a discriminant between legible and illegible words. In the classification phase, a Probability Neural Network based on Bayesian decision is introduced to predict the legibility of unknown handwriting using a Parzen method to estimate a class conditional density function from the available training data.
本文描述了一个独立的手写风格分类器,该分类器被设计用于为给定的书写风格选择最佳识别器。为此,定义了手写体易读性的定义,并实现了预测这种易读性的方法。该技术包括两个阶段。在特征提取阶段,从图像轮廓中提取一组16个特征。这些特征是从一组预识别特征中挑选出来的,因为这些特征对区分易读和难读的单词贡献最大(95%)。在分类阶段,引入基于贝叶斯决策的概率神经网络,利用Parzen方法从可用的训练数据中估计类条件密度函数来预测未知笔迹的易读性。
{"title":"Prediction of handwriting legibility","authors":"M. E. Dehkordi, N. Sherkat, Tony Allen","doi":"10.1109/ICDAR.2001.953935","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953935","url":null,"abstract":"This paper describes an independent handwriting style classifier that has been designed to select the best recognizer for a given style of writing. For this purpose a definition of handwriting legibility has been defined and a method has been implemented that can predict this legibility. The technique consists of two phases. In the feature extraction phase, a set of 16 features is extracted from the image contour. These features have been selected from amongst a set of pre-recognition features as those features that contribute the most (95%) to a discriminant between legible and illegible words. In the classification phase, a Probability Neural Network based on Bayesian decision is introduced to predict the legibility of unknown handwriting using a Parzen method to estimate a class conditional density function from the available training data.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129687418","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
Character recognition experiments using Unipen data 使用Unipen数据的字符识别实验
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953836
M. Parizeau, Alexandre Lemieux, Christian Gagné
This paper presents experiments that compare the performances of several versions of a regional-fuzzy representation (RFR) developed for cursive handwriting recognition (CHR). These experiments are conducted using a common neural network classifier namely a multilayer perceptron (MLP) trained with backpropagation. Results are given for isolated digits, isolated lower-case letters and lower-case letters extracted from phrases, from the Unipen database. Data set Train-R01/V07 is used for training while DevTest-R01/V02 is used for testing. The best overall representation yields recognition rates of respectively 97.0% and 85.6% for isolated digits and lower case, and 84.4% for lower-case extracted from phrases.
本文通过实验比较了几种不同版本的区域模糊表示(RFR)在草书手写识别(CHR)中的性能。这些实验是使用一种常见的神经网络分类器即多层感知器(MLP)进行的。给出了从Unipen数据库中提取的孤立数字、孤立小写字母和从短语中提取的小写字母的结果。数据集Train-R01/V07用于训练,数据集DevTest-R01/V02用于测试。最佳的整体表示对孤立数字和小写字母的识别率分别为97.0%和85.6%,对从短语中提取的小写字母的识别率为84.4%。
{"title":"Character recognition experiments using Unipen data","authors":"M. Parizeau, Alexandre Lemieux, Christian Gagné","doi":"10.1109/ICDAR.2001.953836","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953836","url":null,"abstract":"This paper presents experiments that compare the performances of several versions of a regional-fuzzy representation (RFR) developed for cursive handwriting recognition (CHR). These experiments are conducted using a common neural network classifier namely a multilayer perceptron (MLP) trained with backpropagation. Results are given for isolated digits, isolated lower-case letters and lower-case letters extracted from phrases, from the Unipen database. Data set Train-R01/V07 is used for training while DevTest-R01/V02 is used for testing. The best overall representation yields recognition rates of respectively 97.0% and 85.6% for isolated digits and lower case, and 84.4% for lower-case extracted from phrases.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130311759","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
Application of slant correction to handwritten Japanese address recognition 倾斜校正在手写日文地址识别中的应用
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953874
Yimei Ding, Masayuki Okada, F. Kimura, Y. Miyake
We describe the application of slant correction to handwritten Japanese address recognition. Horizontal strokes of Japanese characters in handwritten Japanese addresses often have a slant to the right upper direction. We propose an iterative modified 8-directional chain code method, that is adapted to slant estimation for these addresses. Recognition experiments for the IPTP cdrom2 database are performed to evaluate the effectiveness of slant correction. Comparative results show that based on slant correction, the accuracy of character segmentation and recognition are both improved, which lead to the accuracy improvement of address recognition.
我们描述了倾斜校正在手写日文地址识别中的应用。在手写的日文地址中,日文字符的横笔画常向右上方向倾斜。我们提出了一种迭代改进的8向链码方法,该方法适用于这些地址的倾斜估计。通过IPTP cdrom2数据库的识别实验,对倾斜校正的有效性进行了评价。对比结果表明,基于倾斜校正的字符分割和识别精度都得到了提高,从而提高了地址识别的精度。
{"title":"Application of slant correction to handwritten Japanese address recognition","authors":"Yimei Ding, Masayuki Okada, F. Kimura, Y. Miyake","doi":"10.1109/ICDAR.2001.953874","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953874","url":null,"abstract":"We describe the application of slant correction to handwritten Japanese address recognition. Horizontal strokes of Japanese characters in handwritten Japanese addresses often have a slant to the right upper direction. We propose an iterative modified 8-directional chain code method, that is adapted to slant estimation for these addresses. Recognition experiments for the IPTP cdrom2 database are performed to evaluate the effectiveness of slant correction. Comparative results show that based on slant correction, the accuracy of character segmentation and recognition are both improved, which lead to the accuracy improvement of address recognition.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"35 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128343936","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
An efficient method for tuning handwriting parameters 一种有效的调整手写参数的方法
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953793
Gregory N. Hullender
Many problems in recognition involve making linear combinations of results from various experts. Computing the coefficients can be expensive because a single test run can take many minutes, hours, or even days, and many values need to be evaluated to find an optimum. This paper describes an algorithm that can make a good approximation to the optimum value for a parameter in a single pass over the tuning data and also outlines methods for tuning several parameters at once.
识别中的许多问题涉及到对不同专家的结果进行线性组合。计算系数的成本可能很高,因为一次测试运行可能需要花费数分钟、数小时甚至数天的时间,并且需要评估许多值才能找到最优值。本文描述了一种算法,该算法可以在单遍调优数据中很好地逼近参数的最优值,并概述了一次调优多个参数的方法。
{"title":"An efficient method for tuning handwriting parameters","authors":"Gregory N. Hullender","doi":"10.1109/ICDAR.2001.953793","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953793","url":null,"abstract":"Many problems in recognition involve making linear combinations of results from various experts. Computing the coefficients can be expensive because a single test run can take many minutes, hours, or even days, and many values need to be evaluated to find an optimum. This paper describes an algorithm that can make a good approximation to the optimum value for a parameter in a single pass over the tuning data and also outlines methods for tuning several parameters at once.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117092724","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Lexicon-driven handwritten character string recognition for Japanese address reading 日文地址读取的词典驱动手写字符串识别
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953912
Cheng-Lin Liu, Masashi Koga, H. Fujisawa
Proposes a handwritten character string recognition method for Japanese mail address reading on very large vocabulary. The recognition is performed by classification-embedded lexicon matching based on over-segmentation. The lexicon contains 111,349 address phrases and is represented in a trie structure. In recognition, the input text line image is matched with all lexicon entries by beam search to obtain reliable character segmentation and retrieve valid phrases. A classifier is embedded in lexicon matching to select from a dynamic set the characters matched with a candidate pattern. The beam search and the character classification jointly enable accurate phrase identification in real time. In experiments on 3,589 live mail images, the proposed method achieved correct rate of 83.68% with error rate less than 1%.
提出了一种针对超大词汇量的日语邮件地址识别的手写字符串识别方法。通过基于过分割的分类嵌入词典匹配实现识别。该词典包含111,349个地址短语,并以trie结构表示。在识别中,输入的文本行图像通过光束搜索与所有词典条目进行匹配,获得可靠的字符分割并检索有效短语。在词典匹配中嵌入分类器,从动态集中选择与候选模式匹配的字符。波束搜索和字符分类相结合,实现了实时准确的短语识别。在3589张实时邮件图像的实验中,该方法的正确率达到83.68%,错误率小于1%。
{"title":"Lexicon-driven handwritten character string recognition for Japanese address reading","authors":"Cheng-Lin Liu, Masashi Koga, H. Fujisawa","doi":"10.1109/ICDAR.2001.953912","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953912","url":null,"abstract":"Proposes a handwritten character string recognition method for Japanese mail address reading on very large vocabulary. The recognition is performed by classification-embedded lexicon matching based on over-segmentation. The lexicon contains 111,349 address phrases and is represented in a trie structure. In recognition, the input text line image is matched with all lexicon entries by beam search to obtain reliable character segmentation and retrieve valid phrases. A classifier is embedded in lexicon matching to select from a dynamic set the characters matched with a candidate pattern. The beam search and the character classification jointly enable accurate phrase identification in real time. In experiments on 3,589 live mail images, the proposed method achieved correct rate of 83.68% with error rate less than 1%.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"14 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125936588","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 9
Active radical modeling for handwritten Chinese characters 手写体汉字的主动自由基建模
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953790
D. Shi, S. Gunn, R. Damper
Handwritten Chinese character recognition is one of the most difficult problems of pattern recognition. Since the majority of Chinese characters are made up from just a small set of primitive structures (radicals), this paper describes an approach to active radical modeling for such handwritten characters. The most significant characteristic of our method is that radicals can be found robustly without stroke extraction, and the principal variations of the radical can be encoded in a small number of parameters. In the training phase, the example radicals are represented by manually-labeled 'landmark' points. Then a small number of principal components of the eigenvectors are calculated to capture the main variation of the training examples from the mean radical. In the matching phase, each radical model is fitted to the image evidence by adjusting the shape parameters in terms of chamfer distance minimization. Initial experiments are conducted on 1,100 loosely-constrained Chinese character categories written by 200 different writers. The correct matching rate is 95.8%, showing that our radical modeling is effective and capable of forming a sound basis for handwritten Chinese character recognition.
手写体汉字识别是模式识别的难点之一。由于大多数汉字都是由一小部分原始结构(部首)组成的,本文描述了一种针对这类手写汉字的主动部首建模方法。该方法最显著的特点是不需要提取笔划就可以稳健性地找到自由基,并且自由基的主要变化可以用少量的参数进行编码。在训练阶段,示例基由手动标记的“地标”点表示。然后计算特征向量的少量主成分,以从均值根中捕获训练样例的主要变化。在匹配阶段,根据最小倒角距离调整形状参数,将每个径向模型拟合到图像证据中。最初的实验是对200位不同作者所写的1100个松散约束的汉字类别进行的。正确匹配率为95.8%,表明我们的激进建模是有效的,能够为手写体汉字识别奠定良好的基础。
{"title":"Active radical modeling for handwritten Chinese characters","authors":"D. Shi, S. Gunn, R. Damper","doi":"10.1109/ICDAR.2001.953790","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953790","url":null,"abstract":"Handwritten Chinese character recognition is one of the most difficult problems of pattern recognition. Since the majority of Chinese characters are made up from just a small set of primitive structures (radicals), this paper describes an approach to active radical modeling for such handwritten characters. The most significant characteristic of our method is that radicals can be found robustly without stroke extraction, and the principal variations of the radical can be encoded in a small number of parameters. In the training phase, the example radicals are represented by manually-labeled 'landmark' points. Then a small number of principal components of the eigenvectors are calculated to capture the main variation of the training examples from the mean radical. In the matching phase, each radical model is fitted to the image evidence by adjusting the shape parameters in terms of chamfer distance minimization. Initial experiments are conducted on 1,100 loosely-constrained Chinese character categories written by 200 different writers. The correct matching rate is 95.8%, showing that our radical modeling is effective and capable of forming a sound basis for handwritten Chinese character recognition.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"62 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127238913","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A feedback-based approach for segmenting handwritten legal amounts on bank cheques 一种基于反馈的方法,用于分割银行支票上手写的法定金额
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953914
Jun Zhou, C. Suen, Ke Liu
The proposed feedback-based approach is implemented in two steps. In the first step, segmentation is done according to the structural features between the connected components in the legal amounts. In the second step, a feedback process is introduced to re-segment the parts that could not be identified in the first step. Then a multiple neural network classifier is used to verify the re-segmentation result. The confidence value produced by the classifier is used to determine the best segmentation points. This approach is tested on a CENPARMI database and the result indicates that the correct segmentation rate increased by 13.4% from the previous approach.
所提出的基于反馈的方法分两步实现。第一步,根据法定金额中关联成分之间的结构特征进行分割。在第二步中,引入反馈过程对第一步中无法识别的部件进行重新分割。然后利用多神经网络分类器对再分割结果进行验证。使用分类器产生的置信度值来确定最佳分割点。在CENPARMI数据库上对该方法进行了测试,结果表明,该方法的正确分割率比前一种方法提高了13.4%。
{"title":"A feedback-based approach for segmenting handwritten legal amounts on bank cheques","authors":"Jun Zhou, C. Suen, Ke Liu","doi":"10.1109/ICDAR.2001.953914","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953914","url":null,"abstract":"The proposed feedback-based approach is implemented in two steps. In the first step, segmentation is done according to the structural features between the connected components in the legal amounts. In the second step, a feedback process is introduced to re-segment the parts that could not be identified in the first step. Then a multiple neural network classifier is used to verify the re-segmentation result. The confidence value produced by the classifier is used to determine the best segmentation points. This approach is tested on a CENPARMI database and the result indicates that the correct segmentation rate increased by 13.4% from the previous approach.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"52 4","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113974147","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
Experimental results on the construction of multiple classifiers recognizing handwritten numerals 手写体数字多分类器构建的实验结果
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953941
H. Kang, Seong-Whan Lee
Only a few studies have been conducted on how to select multiple classifiers from the pool of available classifiers for showing good performance. The selection problem of classifiers on how to select or how many to select still remains an important issue. In this paper, provided that the number of selected classifiers is constrained in advance, a number of selection criteria are proposed and applied to the construction of multiple classifiers. All the sets of classifiers are examined by the selection criteria under the constraint of the number of selected classifiers, and then some of those sets are selected as the candidates of multiple classifier systems. The multiple classifier system candidates were evaluated by the experiments recognizing UCI handwritten numerals.
如何从可用的分类器池中选择多个分类器以显示良好的性能,这方面的研究很少。分类器的选择问题是如何选择或选择多少仍然是一个重要的问题。本文在预先限定所选分类器数量的前提下,提出了若干选择准则,并将其应用于多分类器的构建。在被选择分类器数量的约束下,对所有分类器集进行选择标准的检验,然后从中选择一些分类器集作为多个分类器系统的候选集。通过对UCI手写数字的识别实验,对多个候选分类器系统进行了评价。
{"title":"Experimental results on the construction of multiple classifiers recognizing handwritten numerals","authors":"H. Kang, Seong-Whan Lee","doi":"10.1109/ICDAR.2001.953941","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953941","url":null,"abstract":"Only a few studies have been conducted on how to select multiple classifiers from the pool of available classifiers for showing good performance. The selection problem of classifiers on how to select or how many to select still remains an important issue. In this paper, provided that the number of selected classifiers is constrained in advance, a number of selection criteria are proposed and applied to the construction of multiple classifiers. All the sets of classifiers are examined by the selection criteria under the constraint of the number of selected classifiers, and then some of those sets are selected as the candidates of multiple classifier systems. The multiple classifier system candidates were evaluated by the experiments recognizing UCI handwritten numerals.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"43 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122850062","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
A recognition method of machine-printed monetary amounts based on the two-dimensional segmentation and the bottom-up parsing 一种基于二维分割和自底向上解析的机印货币数量识别方法
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953929
Masashi Koga, Ryuji Mine, H. Sako, H. Fujisawa
We propose a new method to recognize the machine-printed monetary amount based on two-dimensional segmentation and bottom-up parsing. In conventional segmentation-based methods, the system segments the image only along the direction of the character line. This new method segments the image both horizontally and vertically, and extracts candidates of character segments correctly if there are many noises or characters are fragmented. A parsing module detects the optimal sequence of candidate segments using linguistic knowledge. In our method, a context-free grammar describes the linguistic constraints in the monetary amounts. We devised a new bottom-up parsing technique that interprets the results of character classification of the two-dimensionally segmented sub-images. We tested the validity of the new method using 1,314 images, and found that it improves the recognition accuracy significantly.
提出了一种基于二维分割和自底向上解析的机器印钞金额识别方法。在传统的基于分割的方法中,系统只沿着字符线的方向分割图像。该方法对图像进行水平和垂直分割,在噪声较多或字符被分割的情况下,正确提取候选字符段。解析模块使用语言知识检测候选片段的最佳序列。在我们的方法中,上下文无关的语法描述了货币数量中的语言约束。我们设计了一种新的自底向上的解析技术来解释二维分割子图像的特征分类结果。我们用1314张图像测试了新方法的有效性,发现它显著提高了识别精度。
{"title":"A recognition method of machine-printed monetary amounts based on the two-dimensional segmentation and the bottom-up parsing","authors":"Masashi Koga, Ryuji Mine, H. Sako, H. Fujisawa","doi":"10.1109/ICDAR.2001.953929","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953929","url":null,"abstract":"We propose a new method to recognize the machine-printed monetary amount based on two-dimensional segmentation and bottom-up parsing. In conventional segmentation-based methods, the system segments the image only along the direction of the character line. This new method segments the image both horizontally and vertically, and extracts candidates of character segments correctly if there are many noises or characters are fragmented. A parsing module detects the optimal sequence of candidate segments using linguistic knowledge. In our method, a context-free grammar describes the linguistic constraints in the monetary amounts. We devised a new bottom-up parsing technique that interprets the results of character classification of the two-dimensionally segmented sub-images. We tested the validity of the new method using 1,314 images, and found that it improves the recognition accuracy significantly.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130150978","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
期刊
Proceedings of Sixth International Conference on Document Analysis and Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1