首页 > 最新文献

Proceedings of Sixth International Conference on Document Analysis and Recognition最新文献

英文 中文
Training with positive and negative data samples: effects on a classifier for hand-drawn geometric shapes 正负数据样本训练:对手绘几何形状分类器的影响
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953939
Hanaa Barakat, D. Blostein
It is quite common in document analysis and symbol recognition to rely on a priori knowledge about the nature of the document in order to locate candidate symbols. It is desirable, but less common, for a segmentation procedure to rely on "a posteriori" feedback from a non-human-guided process to adjust for segmentation errors. For this method to succeed, the feedback must come from a reliable classifier (one that is able to reject negative symbols including miss-segmented symbols). This paper examines the use of positive and negative training data on a nearest-neighbour classifier for hand-drawn geometric shapes. We explore the issues involved in the development of a reliable classifier using this method, and we discuss the trade-off between reliability and correctness.
在文档分析和符号识别中,依靠对文档性质的先验知识来定位候选符号是很常见的。这是可取的,但不太常见,分割过程依赖于“后验”反馈从一个非人工引导的过程来调整分割错误。为了使该方法成功,反馈必须来自可靠的分类器(能够拒绝包括未分割符号在内的负符号)。本文研究了在手绘几何形状的最近邻分类器上使用正训练数据和负训练数据。我们探讨了使用这种方法开发可靠分类器所涉及的问题,并讨论了可靠性和正确性之间的权衡。
{"title":"Training with positive and negative data samples: effects on a classifier for hand-drawn geometric shapes","authors":"Hanaa Barakat, D. Blostein","doi":"10.1109/ICDAR.2001.953939","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953939","url":null,"abstract":"It is quite common in document analysis and symbol recognition to rely on a priori knowledge about the nature of the document in order to locate candidate symbols. It is desirable, but less common, for a segmentation procedure to rely on \"a posteriori\" feedback from a non-human-guided process to adjust for segmentation errors. For this method to succeed, the feedback must come from a reliable classifier (one that is able to reject negative symbols including miss-segmented symbols). This paper examines the use of positive and negative training data on a nearest-neighbour classifier for hand-drawn geometric shapes. We explore the issues involved in the development of a reliable classifier using this method, and we discuss the trade-off between reliability and correctness.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"69 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124948498","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Binarising camera images for OCR 二值化相机图像的OCR
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953754
M. Seeger, C. Dance
We describe a binarisation method designed specifically for OCR of low quality camera images: background surface thresholding or BST. This method is robust to lighting variations and produces images with very little noise and consistent stroke width. BST computes a "surface" of background intensities at every point in the image and performs adaptive thresholding based on this result. The surface is estimated by identifying regions of low-resolution text and interpolating neighbouring background intensities into these regions. The final threshold is a combination of this surface and a global offset. According to our evaluation BST produces considerably fewer OCR errors than Niblack's local average method while also being more runtime efficient.
我们描述了一种专为低质量相机图像的OCR设计的二值化方法:背景表面阈值或BST。该方法对光照变化具有鲁棒性,并且产生的图像具有非常小的噪声和一致的笔画宽度。BST在图像中的每个点计算背景强度的“表面”,并基于该结果执行自适应阈值分割。通过识别低分辨率文本的区域并将邻近的背景强度插值到这些区域来估计表面。最后的阈值是这个表面和一个全局偏移量的组合。根据我们的评估,BST比Niblack的局部平均方法产生的OCR错误要少得多,同时运行时效率也更高。
{"title":"Binarising camera images for OCR","authors":"M. Seeger, C. Dance","doi":"10.1109/ICDAR.2001.953754","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953754","url":null,"abstract":"We describe a binarisation method designed specifically for OCR of low quality camera images: background surface thresholding or BST. This method is robust to lighting variations and produces images with very little noise and consistent stroke width. BST computes a \"surface\" of background intensities at every point in the image and performs adaptive thresholding based on this result. The surface is estimated by identifying regions of low-resolution text and interpolating neighbouring background intensities into these regions. The final threshold is a combination of this surface and a global offset. According to our evaluation BST produces considerably fewer OCR errors than Niblack's local average method while also being more runtime efficient.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124997249","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 75
On-line recognition of UML diagrams UML图的在线识别
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953813
E. Lank, Jeb S. Thorley, Sean Chen, D. Blostein
Unified Modeling Language (UML) diagrams are widely used by software engineers to describe the structure of software systems. Early in the software design cycle, software engineers informally sketch initial UML diagrams on paper or whiteboards. The information provided by these UML diagrams needs to be made available to computer assisted software engineering (CASE) tools. In order to smooth this transition from paper to electronic form, we have developed an online recognition system for UML diagrams. The system accepts input from an electronic whiteboard, a data tablet or a mouse. Efforts have been made to separate the domain-independent and domain-specific parts of the recognition system. The kernel of the system is retargetable, providing a general front end for online recognition of any glyph-based diagram notation. The kernel is extended with UML-specific routines for segmentation, recognition of glyphs, and recognition of glyph relationships.
统一建模语言(UML)图被软件工程师广泛用于描述软件系统的结构。在软件设计周期的早期,软件工程师非正式地在纸上或白板上勾画出最初的UML图。这些UML图提供的信息需要提供给计算机辅助软件工程(CASE)工具。为了顺利地从纸质形式过渡到电子形式,我们为UML图开发了一个在线识别系统。该系统接受来自电子白板、数据平板或鼠标的输入。人们已经努力将识别系统的领域独立部分和特定领域部分分开。该系统的核心是可重新定位的,为任何基于符号的图表符号的在线识别提供了一个通用的前端。内核扩展了特定于uml的例程,用于分割、识别字形和识别字形关系。
{"title":"On-line recognition of UML diagrams","authors":"E. Lank, Jeb S. Thorley, Sean Chen, D. Blostein","doi":"10.1109/ICDAR.2001.953813","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953813","url":null,"abstract":"Unified Modeling Language (UML) diagrams are widely used by software engineers to describe the structure of software systems. Early in the software design cycle, software engineers informally sketch initial UML diagrams on paper or whiteboards. The information provided by these UML diagrams needs to be made available to computer assisted software engineering (CASE) tools. In order to smooth this transition from paper to electronic form, we have developed an online recognition system for UML diagrams. The system accepts input from an electronic whiteboard, a data tablet or a mouse. Efforts have been made to separate the domain-independent and domain-specific parts of the recognition system. The kernel of the system is retargetable, providing a general front end for online recognition of any glyph-based diagram notation. The kernel is extended with UML-specific routines for segmentation, recognition of glyphs, and recognition of glyph relationships.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115996551","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 38
Online recognition of sketched electrical diagrams 草图电气图的在线识别
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953832
Jean-Philippe Valois, Myriam Côté, M. Cheriet
In this paper, a model-based scheme for recognizing and beautifying online hand-drawn sketches of electric diagrams is presented. The system uses a structural and topological relations matching mechanism that allows scale, translation, rotation invariant recognition. A simple prototype was developed and preliminary experimental results show how this technique, although simple, is efficient in recognizing such sketches.
本文提出了一种基于模型的在线手绘电图识别与美化方案。该系统采用结构和拓扑关系匹配机制,允许尺度、平移、旋转不变识别。开发了一个简单的原型,初步的实验结果表明,该技术虽然简单,但在识别此类草图方面是有效的。
{"title":"Online recognition of sketched electrical diagrams","authors":"Jean-Philippe Valois, Myriam Côté, M. Cheriet","doi":"10.1109/ICDAR.2001.953832","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953832","url":null,"abstract":"In this paper, a model-based scheme for recognizing and beautifying online hand-drawn sketches of electric diagrams is presented. The system uses a structural and topological relations matching mechanism that allows scale, translation, rotation invariant recognition. A simple prototype was developed and preliminary experimental results show how this technique, although simple, is efficient in recognizing such sketches.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128242996","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
A distributed scheme for lexicon-driven handwritten word recognition and its application to large vocabulary problems 词典驱动手写词识别的分布式方案及其在大词汇量问题中的应用
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953872
Alessandro Lameiras Koerich, R. Sabourin, C. Suen
Many offline handwritten word recognition systems have been proposed since the early nineties. Most systems reported high recognition rates, however, they overlooked a very important factor in the process: speed factor. The authors explore the potential for speeding up an offline handwritten word recognition system via concurrency. The goal of the system is to achieve both full accuracy and high speed when taking into account large vocabularies. This was accomplished by integrating the recognition process with multiprocessing and distributed computing concepts. Experimental results showed that the multiprocessing environment is very promising in enhancing a sequential offline handwritten word recognition system performance.
自九十年代初以来,已经提出了许多离线手写单词识别系统。大多数系统都报告了很高的识别率,然而,它们忽略了过程中一个非常重要的因素:速度因素。作者探索了通过并发加速离线手写单词识别系统的潜力。该系统的目标是在考虑大词汇量时实现完全的准确性和高速度。这是通过将识别过程与多处理和分布式计算概念相结合来实现的。实验结果表明,多处理环境在提高顺序离线手写单词识别系统的性能方面是非常有前途的。
{"title":"A distributed scheme for lexicon-driven handwritten word recognition and its application to large vocabulary problems","authors":"Alessandro Lameiras Koerich, R. Sabourin, C. Suen","doi":"10.1109/ICDAR.2001.953872","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953872","url":null,"abstract":"Many offline handwritten word recognition systems have been proposed since the early nineties. Most systems reported high recognition rates, however, they overlooked a very important factor in the process: speed factor. The authors explore the potential for speeding up an offline handwritten word recognition system via concurrency. The goal of the system is to achieve both full accuracy and high speed when taking into account large vocabularies. This was accomplished by integrating the recognition process with multiprocessing and distributed computing concepts. Experimental results showed that the multiprocessing environment is very promising in enhancing a sequential offline handwritten word recognition system performance.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"11 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130667965","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Advanced character recognition 6610 高级字符识别6610
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953744
G. Nagy
ECSE 6610 Advanced Character Recognition. Principles and practice of the recognition of isolated or connected typeset, hand-printed, and cursive characters. Review of optical digitization, supervised and unsupervised estimation of classifier parameters, bias and variance, expectation maximization, the curse of dimensionality. Advanced classification techniques including classifier combinations, support vector machines, hidden Markov methods, styles, language context, adaptation, segmentation-free classifiers, indirect symbolic correlation. Prereq: ECSE 2610, Probability, Linear Algebra. Spring term annually.
高级字符识别。单字或连字、手印字和草书字识别的原则和实践。回顾光学数字化,分类器参数的监督和无监督估计,偏差和方差,期望最大化,维数诅咒。高级分类技术包括分类器组合、支持向量机、隐马尔可夫方法、风格、语言上下文、自适应、无分割分类器、间接符号关联。预修课程:ECSE 2610,概率,线性代数。每年春季学期。
{"title":"Advanced character recognition 6610","authors":"G. Nagy","doi":"10.1109/ICDAR.2001.953744","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953744","url":null,"abstract":"ECSE 6610 Advanced Character Recognition. Principles and practice of the recognition of isolated or connected typeset, hand-printed, and cursive characters. Review of optical digitization, supervised and unsupervised estimation of classifier parameters, bias and variance, expectation maximization, the curse of dimensionality. Advanced classification techniques including classifier combinations, support vector machines, hidden Markov methods, styles, language context, adaptation, segmentation-free classifiers, indirect symbolic correlation. Prereq: ECSE 2610, Probability, Linear Algebra. Spring term annually.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"50 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123354202","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An a priori indicator of the discrimination power of discrete hidden Markov models 离散隐马尔可夫模型判别能力的先验指标
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953812
Frédéric Grandidier, R. Sabourin, M. Gilloux, C. Suen
During the development of a hidden Markov model based handwriting recognition system, the testing phase takes a non-negligible amount of computation time. This is especially true for real application where the lexicon size is large. In order to shorten the development process, we propose an indicator of the system discrimination power. This indicator is calculated during training and its final value is obtained at the end of the training phase, without more calculation. Its definition consists of a modification of the observation probability of the validation corpus by the trained system. Some experiments were carried out and the results show clearly the correlation between this indicator and recognition rates.
在基于隐马尔可夫模型的手写识别系统的开发过程中,测试阶段的计算时间是不可忽略的。对于词典量很大的实际应用程序尤其如此。为了缩短发展过程,我们提出了制度歧视权的指标。该指标在训练时计算,在训练阶段结束时得到最终值,无需再进行计算。它的定义包括被训练的系统对验证语料库的观测概率的修改。进行了一些实验,结果清楚地显示了该指标与识别率之间的相关性。
{"title":"An a priori indicator of the discrimination power of discrete hidden Markov models","authors":"Frédéric Grandidier, R. Sabourin, M. Gilloux, C. Suen","doi":"10.1109/ICDAR.2001.953812","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953812","url":null,"abstract":"During the development of a hidden Markov model based handwriting recognition system, the testing phase takes a non-negligible amount of computation time. This is especially true for real application where the lexicon size is large. In order to shorten the development process, we propose an indicator of the system discrimination power. This indicator is calculated during training and its final value is obtained at the end of the training phase, without more calculation. Its definition consists of a modification of the observation probability of the validation corpus by the trained system. Some experiments were carried out and the results show clearly the correlation between this indicator and recognition rates.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"151 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123497685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 1
Applying fast segmentation techniques at a binary image represented by a set of non-overlapping blocks 对一组非重叠块表示的二值图像应用快速分割技术
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953965
B. Gatos, N. Papamarkos
Run length smoothing algorithm (RLSA) and projection profiles are among the fundamental algorithms in binary image processing, mainly used for segmentation of monochrome images. In this paper, fast RLSA and projection profiles are applied to binary images represented by a set of nonoverlapping rectangular blocks. The representation of binary images using rectangular blocks as primitives has been used with great success for several image processing tasks, such as image compression, Hough transform fast implementation and skeletonization. We show that this representation can be applied with great success for fast RLSA application and fast projection profiles evaluation. The experimental results demonstrate that starting from a block represented binary image we can apply RLSA and evaluate projection profiles in significant less CPU time. The average time gain is recorded at 60% and 88%, respectively.
运行长度平滑算法(RLSA)和投影轮廓是二值图像处理中的基本算法,主要用于单色图像的分割。本文将快速RLSA和投影轮廓应用于由一组不重叠的矩形块表示的二值图像。以矩形块为基元的二值图像表示方法在图像压缩、霍夫变换快速实现和骨架化等多个图像处理任务中得到了成功的应用。结果表明,该方法可以成功地用于快速RLSA应用和快速投影轮廓评估。实验结果表明,从块表示的二值图像开始,我们可以在更少的CPU时间内应用RLSA并评估投影轮廓。平均时间增益分别为60%和88%。
{"title":"Applying fast segmentation techniques at a binary image represented by a set of non-overlapping blocks","authors":"B. Gatos, N. Papamarkos","doi":"10.1109/ICDAR.2001.953965","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953965","url":null,"abstract":"Run length smoothing algorithm (RLSA) and projection profiles are among the fundamental algorithms in binary image processing, mainly used for segmentation of monochrome images. In this paper, fast RLSA and projection profiles are applied to binary images represented by a set of nonoverlapping rectangular blocks. The representation of binary images using rectangular blocks as primitives has been used with great success for several image processing tasks, such as image compression, Hough transform fast implementation and skeletonization. We show that this representation can be applied with great success for fast RLSA application and fast projection profiles evaluation. The experimental results demonstrate that starting from a block represented binary image we can apply RLSA and evaluate projection profiles in significant less CPU time. The average time gain is recorded at 60% and 88%, respectively.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121422132","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
A real-world evaluation of a generic document recognition method applied to a military form of the 19th century 对19世纪军事形式的通用文件识别方法的实际评估
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953894
Bertrand Coüasnon, L. Pasquer
In this paper we present a real-world evaluation of DMOS, a new generic document recognition method. This method uses a new grammatical formalism (EPF) and an associated parser able to introduce context in segmentation. We have implemented this DMOS method to build an automatic generator of structured document recognition systems. We already produced three recognition systems by only changing the EPF grammar: one on musical scores, one on mathematical formulae and one on recursive table structures. We present here a specific light grammar to automatically recognize quite damaged 19th century military forms. The quality of those forms is far from perfect: table lines are not well printed, paper is so thin that there are transparency problems (the forms are two-sided) but the biggest problem comes from small paper sheets hiding part of the structure. The evaluation of this system has been made onto 5268 images and the results show that the system did not make any mistake. Moreover it can recognize the entire structure in 97.2% of the forms (the other 2.8% are automatically set apart).
本文对一种新的通用文档识别方法DMOS进行了实际评价。该方法使用了一种新的语法形式(EPF)和一个相关的解析器,能够在分词中引入上下文。我们实现了这种DMOS方法来构建结构化文档识别系统的自动生成器。我们已经通过仅仅改变EPF语法产生了三个识别系统:一个关于乐谱,一个关于数学公式,一个关于递归表结构。我们在这里提出一个特定的轻语法来自动识别相当损坏的19世纪军事形式。这些表格的质量远非完美:表格线条没有很好地打印,纸张太薄,存在透明度问题(表格是双面的),但最大的问题是小纸张隐藏了部分结构。对5268幅图像进行了评价,结果表明该系统没有出现任何错误。此外,它可以识别97.2%的表格的整个结构(其他2.8%是自动分离的)。
{"title":"A real-world evaluation of a generic document recognition method applied to a military form of the 19th century","authors":"Bertrand Coüasnon, L. Pasquer","doi":"10.1109/ICDAR.2001.953894","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953894","url":null,"abstract":"In this paper we present a real-world evaluation of DMOS, a new generic document recognition method. This method uses a new grammatical formalism (EPF) and an associated parser able to introduce context in segmentation. We have implemented this DMOS method to build an automatic generator of structured document recognition systems. We already produced three recognition systems by only changing the EPF grammar: one on musical scores, one on mathematical formulae and one on recursive table structures. We present here a specific light grammar to automatically recognize quite damaged 19th century military forms. The quality of those forms is far from perfect: table lines are not well printed, paper is so thin that there are transparency problems (the forms are two-sided) but the biggest problem comes from small paper sheets hiding part of the structure. The evaluation of this system has been made onto 5268 images and the results show that the system did not make any mistake. Moreover it can recognize the entire structure in 97.2% of the forms (the other 2.8% are automatically set apart).","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"10 12","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"113962157","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Synthetic data for Arabic OCR system development 阿拉伯语OCR系统开发的综合数据
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953967
V. Märgner, M. Pechwitz
A system for the automatic generation of synthetic databases for the development or evaluation of Arabic word or text recognition systems (Arabic OCR) is presented. The proposed system works without any scanning of printed paper. Firstly Arabic text has to be typeset using a standard typesetting system. Secondly a noise-free bitmap of the document and the corresponding ground truth (GT) is automatically generated. Finally, an image distortion can be superimposed to the character or word image to simulate the expected real world noise of the intended application. All necessary modules are presented together with some examples. Special problems caused by specific features of Arabic, such as printing from right to left, many diacritical points, variation in the height of characters, and changes in the relative position to the writing line, are suggested. The synthetic data set was used to train and test a recognition system based on hidden Markov model (HMM), which was originally developed for German cursive script, for Arabic printed words. Recognition results with different synthetic data sets are presented.
提出了一种用于开发或评价阿拉伯语文字识别系统(OCR)的自动合成数据库的系统。该系统无需扫描印刷纸张即可工作。首先,阿拉伯文本必须使用标准排版系统进行排版。其次,自动生成文档的无噪声位图和相应的ground truth (GT);最后,可以将图像失真叠加到字符或单词图像上,以模拟预期应用程序的预期真实世界噪声。介绍了所有必要的模块,并给出了一些示例。由于阿拉伯语的特殊特点,如从右向左印刷、许多变音符点、字符高度的变化以及与书写线的相对位置的变化,提出了一些特殊问题。该合成数据集用于训练和测试基于隐马尔可夫模型(HMM)的识别系统,该系统最初是为德文草书开发的,用于识别阿拉伯印刷文字。给出了不同合成数据集的识别结果。
{"title":"Synthetic data for Arabic OCR system development","authors":"V. Märgner, M. Pechwitz","doi":"10.1109/ICDAR.2001.953967","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953967","url":null,"abstract":"A system for the automatic generation of synthetic databases for the development or evaluation of Arabic word or text recognition systems (Arabic OCR) is presented. The proposed system works without any scanning of printed paper. Firstly Arabic text has to be typeset using a standard typesetting system. Secondly a noise-free bitmap of the document and the corresponding ground truth (GT) is automatically generated. Finally, an image distortion can be superimposed to the character or word image to simulate the expected real world noise of the intended application. All necessary modules are presented together with some examples. Special problems caused by specific features of Arabic, such as printing from right to left, many diacritical points, variation in the height of characters, and changes in the relative position to the writing line, are suggested. The synthetic data set was used to train and test a recognition system based on hidden Markov model (HMM), which was originally developed for German cursive script, for Arabic printed words. Recognition results with different synthetic data sets are presented.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"51 11","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114009550","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 43
期刊
Proceedings of Sixth International Conference on Document Analysis and Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1