Proceedings of Sixth International Conference on Document Analysis and Recognition: Latest Articles

A recursive analysis for form cell recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953879

Hiroshi Shinjo, Eiichi Hadano, K. Marukawa, Y. Shima, H. Sako

It is very difficult to analyze form structures because of breaks in lines and additional noise in the form image. This paper focuses on cell recognition in low-quality form images. The recognition method has two features that make cell recognition robust. One is a grid representation using several types of intersections and the terminal points of the frame lines. The other is the recursive modification of the representation. A new representation is created from the previous one by determining the breaks in the lines and hypothesizing the locations of the missed intersections. The modification is processed recursively until the representation has perfect consistency and all form cells are detected. In an experiment using 1565 form samples, all cells in 1538 samples (98.3% of 1565 samples) were recognized correctly by this method.
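
The idea of repairing a representation repeatedly until it stops changing can be illustrated with a toy sketch. This is not the authors' grid representation: it assumes one frame line has already been reduced to 1-D segments `(start, end)`, and the names `bridge_breaks`, `repair_until_consistent` and `max_gap` are hypothetical.

```python
def bridge_breaks(segments, max_gap):
    """Merge collinear 1-D segments whose gap is at most max_gap."""
    merged = []
    for start, end in sorted(segments):
        if merged and start - merged[-1][1] <= max_gap:
            # Treat the small gap as a break in the same line and close it.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def repair_until_consistent(segments, max_gap):
    """Re-apply the repair step until the representation no longer changes."""
    current = sorted(segments)
    while True:
        updated = bridge_breaks(current, max_gap)
        if updated == current:
            return current
        current = updated


if __name__ == "__main__":
    print(repair_until_consistent([(0, 10), (12, 30), (50, 60)], max_gap=3))
```

In this toy case a single pass already reaches the fixed point; the loop only shows the "modify until consistent" control flow described in the abstract.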

Citations: 19

Sentence recognition through hybrid neuro-Markovian modeling

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953886

S. Marukatat, T. Artières, P. Gallinari, B. Dorizzi

This paper focuses on designing a handwriting recognition system that handles on-line signals, i.e. the temporal handwriting signal captured through an electronic pen or a digitizing tablet. We present some new results concerning a hybrid on-line handwriting recognition system based on Hidden Markov Models (HMMs) and Neural Networks (NNs), which has already been presented in several contributions. In our approach, a letter model is a Left-Right HMM whose emission probability densities are approximated with mixtures of predictive multilayer perceptrons. The basic letter models are cascaded in order to build models for words and sentences. At the word level, recognition is performed using a dictionary organized as a tree structure. At the sentence level, a word-predecessor conditioned frame synchronous beam search algorithm simultaneously performs segmentation into words and word recognition. It proceeds by building a word graph from which a set of candidate sentences may be extracted. Word and sentence recognition performance is evaluated on parts of the UNIPEN international database.
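
As a rough illustration of decoding with a Left-Right HMM, the sketch below runs plain Viterbi decoding over precomputed log emission scores. It is a simplification under stated assumptions: it is not the word-predecessor conditioned beam search of the paper, the emission scores are given numbers rather than outputs of predictive multilayer perceptrons, and one shared pair of transition scores (`log_stay`, `log_next`) is assumed for every state. All names are hypothetical.

```python
import math

NEG_INF = float("-inf")


def viterbi_left_right(log_emit, log_stay, log_next):
    """Best state path through a left-right HMM.

    log_emit[t][s] is the log emission score of frame t in state s.
    From state s the model may stay in s or advance to s + 1.
    The path must start in state 0 and end in the last state.
    Returns (best_log_score, state_path).
    """
    n_frames, n_states = len(log_emit), len(log_emit[0])
    score = [[NEG_INF] * n_states for _ in range(n_frames)]
    back = [[0] * n_states for _ in range(n_frames)]
    score[0][0] = log_emit[0][0]
    for t in range(1, n_frames):
        for s in range(n_states):
            stay = score[t - 1][s] + log_stay
            move = score[t - 1][s - 1] + log_next if s > 0 else NEG_INF
            if move > stay:
                score[t][s], back[t][s] = move, s - 1
            else:
                score[t][s], back[t][s] = stay, s
            score[t][s] += log_emit[t][s]
    # Trace back from the final state at the last frame.
    path = [n_states - 1]
    for t in range(n_frames - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return score[-1][-1], path


if __name__ == "__main__":
    emit = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
    log_emit = [[math.log(p) for p in row] for row in emit]
    print(viterbi_left_right(log_emit, math.log(0.5), math.log(0.5)))
```

A beam search, as named in the abstract, would additionally prune low-scoring hypotheses at each frame instead of keeping every state alive.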

Citations: 38

Three approaches to "industrial" table spotting

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953842

B. Klein, Serdar Gökkus, T. Kieninger, A. Dengel

This paper introduces three approaches that enable an industrial, comprehensive document analysis system to spot tables in documents. Searching for a set of known table headers (approach 1) works rather well in a significant number of documents. But this approach, though implemented to tolerate OCR errors, is not tolerant enough of some kinds of even minor aberrations. This not only degrades the recognition results but, even worse, makes users feel uncomfortable. Pragmatically trying to mimic what the human eye might key on leads to our two further, complementary approaches: searching for layout structures which resemble parts of columns (approach 2), and searching for groupings of similar lines (approach 3). To be suitable for our system, the approaches must be very simple to implement, simple to explain to users, computationally cheap, and combinable. In the domain of health insurers, which receive huge amounts of so-called medical liquidations on a daily basis, we obtain very good results. On document samples representative of the everyday practice of five customers (health insurance companies), tables were spotted as well and as fast as the customers expected of the system. We thus consider our current approaches a step towards cognitive adequacy.

Citations: 36

Investigation of a novel self-configurable multiple classifier system for character recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953936

K. Sirlantzis, M. Fairhurst

In this paper we introduce a global optimisation technique, namely a genetic algorithm, into a parallel multiclassifier system design process. As few similar systems have been proposed to date, our main focus in this study is to explore the statistical properties of the self-configuration process in order to enhance our understanding of its internal operational mechanism and to propose possible improvements. To this end, we tested our system in a series of character recognition tasks ranging from printed to handwritten data. Subsequently, we compare its performance with that of two alternative multiple classifier combination strategies. Finally, we investigate, over a set of cross-validating experiments, the relation between the performances of the individual classifiers and their variability, and the frequency with which each of them is chosen to participate in the final configuration generated by the genetic algorithm.

Citations: 8

Comparing adaptation techniques for on-line handwriting recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953837

A. Brakensiek, A. Kosmala, G. Rigoll

This paper describes an online handwriting recognition system with a focus on adaptation techniques. Our hidden Markov model (HMM)-based recognition system for cursive German script can be adapted to the writing style of a new writer using either retraining based on the EM (expectation maximization) approach or adaptation according to the MAP (maximum a posteriori) or MLLR (maximum likelihood linear regression) criterion. The performance of the resulting writer-dependent system increases significantly even if the amount of adaptation data is very small (about 6 words). So this approach is also applicable to online systems in hand-held computers such as PDAs. Special attention was paid to comparing the performance of the different adaptation techniques with different amounts of adaptation data, ranging from a few words up to 100 words per writer.
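
To make the MAP criterion concrete, the sketch below applies the standard MAP update for a single Gaussian mean, which interpolates between the writer-independent prior mean and the adaptation data. It is a minimal illustration, not the system described in the abstract: the scalar feature, the prior weight `tau` and the function name `map_adapt_mean` are assumptions.

```python
def map_adapt_mean(prior_mean, samples, tau, weights=None):
    """MAP estimate of a Gaussian mean.

    prior_mean: mean of the writer-independent model.
    samples:    adaptation observations assigned to this Gaussian.
    tau:        prior weight; larger values trust the prior more.
    weights:    optional occupancy weights per sample (default 1.0 each).
    """
    if weights is None:
        weights = [1.0] * len(samples)
    occupancy = sum(weights)
    weighted_sum = sum(w * x for w, x in zip(weights, samples))
    return (tau * prior_mean + weighted_sum) / (tau + occupancy)


if __name__ == "__main__":
    # With little data the estimate stays near the prior; with more it moves to the data.
    print(map_adapt_mean(0.0, [1.0] * 2, tau=10.0))
    print(map_adapt_mean(0.0, [1.0] * 100, tau=10.0))
```

This behaviour is one reason MAP-style adaptation is usable with very small amounts of adaptation data: with no data at all the estimate equals the prior mean.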

Citations: 35

Off-line signature verification using HMM for random, simple and skilled forgeries

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953942

E. Justino, Flávio Bortolozzi, R. Sabourin

The problem of signature verification is in theory a pattern recognition task used to discriminate two classes, original and forged signatures. Even after many efforts to develop new verification techniques for static signature verification, the influence of the forgery types has not been extensively studied. This paper reports a contribution to signature verification that considers different forgery types in an HMM framework. The experiments have shown that the error rates for simple and random forgeries are very close. This reflects real applications, in which simple forgeries represent the principal fraudulent case. In addition, the experiments show promising results in skilled forgery verification using simple static and pseudodynamic features.

Citations: 222

A modular system to recognize numerical amounts on Brazilian bank cheques

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953819

Luiz Oliveira, R. Sabourin, Flávio Bortolozzi, C. Suen

The paper presents a modular system to recognize numerical amounts on Brazilian bank cheques. The system uses a segmentation-based recognition approach and the recognition function is based on a recognition and verification strategy. Our approach consists of combining the outputs from different levels such as segmentation, recognition and post-processing in a probabilistic model. A new feature set is introduced to the verifier module in order to detect segmentation effects such as over-segmentation and under-segmentation. Finally, we present experimental results on two databases: numerical amounts and NIST SD19. The latter aims at validating the concept of a modular system and showing the robustness of the system on a well-known database.

Citations: 30

Binarizing document image using coplanar prefilter

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953750

Liying Fan, Lixin Fan, C. Tan

We propose a novel coplanar filter, which exploits the coplanarity of gray-level distribution of neighboring pixels, to pre-filter the document images. Experiments show that the proposed filter exhibits the following desired properties for document image binarization: 1) impulsive noise removal, 2) piecewise smoothing, and 3) sharp edge preservation.

Citations: 10

Learning of structural descriptions of graphic symbols using deformable template matching

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953831

Ernest Valveny, E. Martí

Accurate symbol recognition in graphic documents needs an accurate representation of the symbols to be recognized. If structural approaches are used for recognition, symbols have to be described in terms of their shape, using structural relationships among extracted features. Unlike in statistical pattern recognition, in structural methods symbols are usually defined manually from expert knowledge rather than inferred automatically from sample images. In this work we explain one approach to learning, from examples, a representative structural description of a symbol, thus providing better information about shape variability. The description of a symbol is based on a probabilistic model. It consists of a set of lines described by the mean and the variance of the line parameters, which provide information about the model of the symbol and its shape variability, respectively. The representation of each image in the sample set as a set of lines is achieved using deformable template matching.
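
A description made of per-parameter means and variances can be sketched as follows. The sketch assumes the line parameters (here an angle and a length, both hypothetical choices) have already been extracted from each sample image; it does not implement the deformable template matching step, and it treats the parameters as independent Gaussians, which the abstract does not state. The function names are illustrative.

```python
import math
from statistics import fmean, pvariance


def learn_line_model(samples):
    """Estimate (mean, variance) for each parameter of one line.

    samples: one parameter tuple per training image, e.g. (angle, length).
    """
    columns = list(zip(*samples))
    return [(fmean(col), pvariance(col)) for col in columns]


def log_likelihood(model, observed, min_var=1e-6):
    """Log-likelihood of an observed line under independent Gaussians."""
    total = 0.0
    for (mean, var), x in zip(model, observed):
        var = max(var, min_var)  # guard against zero variance
        total += -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)
    return total


if __name__ == "__main__":
    model = learn_line_model([(10.0, 50.0), (12.0, 54.0), (14.0, 52.0)])
    print(model)
    print(log_likelihood(model, (12.0, 52.0)), log_likelihood(model, (20.0, 40.0)))
```

The mean plays the role of the symbol model and the variance captures how much each line is allowed to vary, so an observation close to the means scores higher than a distant one.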

Citations: 9

A system for real-time recognition of handwritten mathematical formulas

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953948

Kenichi Toyozumi, K. Mori, Y. Suenaga, Takahiro Suzuki

This paper presents an expanded system for the online recognition of handwritten mathematical formulas. The target formulas are strokes drawn on a data tablet. The system recognizes such strokes as components of mathematical formulas on the basis of their positions and combinations. It accepts general mathematical expressions, including matrix structures. Each recognition result is output as LaTeX source code. The system also has a preview function that presents the recognition result more intuitively. In recognition experiments, the system proved fairly feasible for handling handwritten mathematical formulas in real time.

Citations: 14
