Proceedings of Sixth International Conference on Document Analysis and Recognition: latest articles

Automatic identification of English, Chinese, Arabic, Devnagari and Bangla script line

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953896

U. Pal, B. Chaudhuri

In general, a document page may contain several script forms. For optical character recognition (OCR) of such a document page, it is necessary to separate the scripts before feeding them to their individual OCR systems. An automatic technique for the identification of printed Roman, Chinese, Arabic, Devnagari and Bangla text lines from a single document is proposed. Shape-based features, statistical features and some features obtained from the concept of a water reservoir are used for script identification. The proposed scheme has an accuracy of about 97.33%.

Citations: 78

Separation of overlapping text from graphics

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953752

Ruini Cao, C. Tan

The separation of overlapping text from graphics is a challenging problem in document image analysis. This paper proposes a specific method for detecting and extracting characters that are touching graphics. It is based on the observation that the constituent strokes of characters are usually short segments in comparison with those of graphics. It combines line continuation with the feature line width to decompose and reconstruct segments underlying the region of intersection. Experimental results showed that the proposed method improved the percentage of correctly detected text as well as the accuracy of character recognition significantly.

Citations: 30

A new component based algorithm for newspaper layout analysis

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953970

Fei Liu, Yupin Luo, D. Hu, Masataka Yoshikawa

The aim of layout analysis is to extract the geometric structure from a document image. It is a process of labeling homogeneous regions of a document image. To analyze complex newspaper layouts, this paper proposes a new component-based bottom-up algorithm. With a novel homogeneity-related definition of distance, it maintains a dynamic minimal-distance mechanism to decide the component merging sequence. Under restricting rules derived heuristically from the newspaper layout, we obtain the preferred analysis result. Experimental results show that the proposed approach is effective.

Citations: 27

DMOS: a generic document recognition method, application to an automatic generator of musical scores, mathematical formulae and table structures recognition systems

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953786

Bertrand Coüasnon

Genericity in structured document recognition is a difficult challenge. We therefore propose a new generic document recognition method, called DMOS (Description and MOdification of Segmentation), that is made up of a new grammatical formalism, called EPF (Enhanced Position Formalism) and an associated parser which is able to introduce context in segmentation. We implement this method to obtain a generator of document recognition systems. This generator can automatically produce new recognition systems. It is only necessary to describe the document with an EPF grammar, which is then simply compiled. In this way, we have developed various recognition systems: one on musical scores, one on mathematical formulae and one on recursive table structures. We have also defined a specific application to damaged military forms of the 19th Century. We have been able to test the generated system on 5,000 of these military forms. This has permitted us to validate the DMOS method on a real-world application.

Citations: 67

Individuality of handwriting: a validation study

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953764

S. Srihari, Sung-Hyuk Cha, Hina Arora, Sangjik Lee

Motivated by several rulings in United States courts concerning expert testimony in general and handwriting testimony in particular, we undertook a study to objectively validate the hypothesis that handwriting is individualistic. Handwriting samples of 1500 individuals, representative of the US population with respect to gender, age, ethnic groups, etc., were obtained. Analyzing differences in handwriting was done by using computer algorithms for extracting features from scanned images of handwriting. Attributes characteristic of the handwriting were obtained, e.g., line separation, slant, character shapes, etc. These attributes, which are a subset of attributes used by expert document examiners, were used to quantitatively establish individuality by using machine learning approaches. Using global attributes of handwriting and very few characters in the writing, the ability to determine the writer with a high degree of confidence was established. The work is a step towards providing scientific support for admitting handwriting evidence in court. The mathematical approach and the resulting software also have the promise of aiding the expert document examiner.

Citations: 123

Genetic algorithms for feature selection and weighting, a review and study

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953980

Faten Hussein, R. Ward, N. Kharma

Our aim is: a) to present a comprehensive survey of previous attempts at using genetic algorithms (GA) for feature selection in pattern recognition applications, with a special focus on character recognition; and b) to report on work that uses GA to optimize the weights of the classification module of a character recognition system. The main purpose of feature selection is to reduce the number of features by eliminating irrelevant and redundant ones, while maintaining or enhancing classification accuracy. Many search algorithms have been used for feature selection. Among those, GA have proven to be an effective computational method, especially in situations where the search space is mathematically uncharacterized, not fully understood, and/or high-dimensional.
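As an illustration of the general idea of GA-based feature selection mentioned in this abstract, the following is a minimal sketch, not the authors' method. It evolves binary feature masks with tournament selection, one-point crossover, bit-flip mutation and elitism. The function names, all parameter values and the toy fitness function are assumptions made for the example; in a real system the fitness would typically be a classifier's validation accuracy on the selected features.

```python
import random

def ga_select(fitness, n_features, pop_size=20, generations=40, p_mut=0.05, seed=0):
    """Search for a binary feature mask that maximises fitness(mask)."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_features)] for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        nxt = [best[:]]  # elitism: carry the best mask forward unchanged
        while len(nxt) < pop_size:
            # tournament selection of two parents (tournament size 3)
            a = max(rng.sample(pop, 3), key=fitness)
            b = max(rng.sample(pop, 3), key=fitness)
            cut = rng.randrange(1, n_features)  # one-point crossover
            child = a[:cut] + b[cut:]
            # bit-flip mutation
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
            nxt.append(child)
        pop = nxt
        best = max(pop, key=fitness)
    return best

# Toy fitness (hypothetical): features 0-2 are "relevant", the rest are noise.
# Relevant features are rewarded and every selected feature costs a little.
RELEVANT = {0, 1, 2}

def toy_fitness(mask):
    hits = sum(1 for i, bit in enumerate(mask) if bit and i in RELEVANT)
    return hits - 0.1 * sum(mask)

best_mask = ga_select(toy_fitness, n_features=10)
print(best_mask, toy_fitness(best_mask))
```

The size penalty in the toy fitness mirrors the stated goal of reducing the number of features without losing accuracy. Feature weighting, the second topic of the paper, would replace the bits with real-valued weights.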

Citations: 80

Flexible Web document analysis for delivery to narrow-bandwidth devices

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953951

Gerald Penn, Jianying Hu, Hengbin Luo, Ryan T. McDonald

We propose a set of baseline heuristics for identifying genuinely tabular information and news links in HTML documents. A prototype implementation of these heuristics is described for delivering content from news providers' home pages to a narrow-bandwidth device such as a portable digital assistant or cellular phone display. Its evaluation on 75 Web sites is provided, along with a discussion of topics for future research.

Citations: 59

Form document identification using line structure based features

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953881

Kuo-Chin Fan, Mei-Lin Chang, Yuan-Kai Wang

Form recognition is one of the special applications of document analysis (DA). We present a novel form recognition method that analyzes the line structure embedded in an input form document. First, all vertical and horizontal lines embedded in the form image are extracted. By analyzing the crossing relationships among horizontal and vertical lines, a line crossing relationship matrix is built, with each row corresponding to one horizontal line and each column corresponding to one vertical line. Moreover, two line distance relationship matrices, one horizontal and one vertical, are built by analyzing the distance relationships among the horizontal lines and among the vertical lines, respectively. Finally, the recognition task is performed by matching these three matrices. Experimental results demonstrate the feasibility and efficiency of the proposed method in recognizing form documents.
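The three matrices described in this abstract can be sketched as follows. This is a hedged illustration only: the line representation (`(y, x1, x2)` for horizontal lines, `(x, y1, y2)` for vertical lines), the tolerance parameter and the use of absolute position differences as the "distance relationship" are assumptions, since the abstract does not define them. Line extraction and the matching step are omitted.

```python
def crossing_matrix(h_lines, v_lines, tol=0):
    """Rows are horizontal lines (y, x1, x2); columns are vertical lines (x, y1, y2).

    An entry is 1 when the two segments cross or touch, within `tol` pixels.
    """
    return [
        [int(x1 - tol <= x <= x2 + tol and y1 - tol <= y <= y2 + tol)
         for (x, y1, y2) in v_lines]
        for (y, x1, x2) in h_lines
    ]

def distance_matrix(positions):
    """Pairwise absolute distances between parallel lines (assumed definition)."""
    return [[abs(a - b) for b in positions] for a in positions]

# Hypothetical form: three horizontal and three vertical rulings.
h_lines = [(0, 0, 100), (50, 0, 100), (100, 0, 50)]   # (y, x1, x2)
v_lines = [(0, 0, 100), (50, 0, 100), (100, 0, 50)]   # (x, y1, y2)

cross = crossing_matrix(h_lines, v_lines)
h_dist = distance_matrix([y for (y, _, _) in h_lines])
v_dist = distance_matrix([x for (x, _, _) in v_lines])

print(cross)   # [[1, 1, 1], [1, 1, 1], [1, 1, 0]]
print(h_dist)  # [[0, 50, 100], [50, 0, 50], [100, 50, 0]]
```

In this example the bottom horizontal line stops at x = 50, so it does not reach the rightmost vertical line, which gives the single 0 in the crossing matrix.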

Citations: 9

Arabic hand-written text-line extraction

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953799

Abderrazak Zahour, B. Taconet, P. Mercy, Said Ramdane

This paper describes a method for text-line extraction. The typical segmentation of a printed binary document is based on horizontal projection analysis followed by regrouping of the connected components. These techniques cannot be used for unconstrained handwritten text because the data frequently contain undulations and shifts in the baseline, baseline-skew variability and inter-line distance variability. We therefore consider that the border line in an unconstrained handwritten document should be a collection of horizontal line segments. From this point of view, we use a method based on partial contour following to detect the separating lines. In the current version of our algorithm, we first perform text slant detection and estimate the number of text lines using partial projection. Then we carry out a partial contour following of every line, first in the direction of the writing, then in the opposite direction. After this processing, the adjacent lines are separated. In the experimental section, we describe the application of the algorithm to text-line extraction. The image database contains about one hundred handwritten Arabic texts written by different writers. Results on the assignment of diacritical points are also reported.
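For context, the baseline that this abstract contrasts itself with, horizontal projection analysis for printed text, can be sketched in a few lines. This is not the paper's contour-following method. The function name, the `min_ink` threshold and the tiny test image are assumptions made for the illustration, and the image is taken to be a binary matrix where 1 marks an ink pixel.

```python
def text_line_bands(image, min_ink=1):
    """Return (top_row, bottom_row) bands whose rows contain at least min_ink ink pixels."""
    profile = [sum(row) for row in image]  # horizontal projection: ink count per row
    bands, start = [], None
    for y, ink in enumerate(profile):
        if ink >= min_ink and start is None:
            start = y                      # a text band begins
        elif ink < min_ink and start is not None:
            bands.append((start, y - 1))   # a blank row closes the band
            start = None
    if start is not None:                  # band running to the last row
        bands.append((start, len(profile) - 1))
    return bands

# Hypothetical 6x4 binary page with two "text lines".
page = [
    [0, 0, 0, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 1],
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]

bands = text_line_bands(page)
print(bands)  # [(1, 2), (4, 4)]
```

The sketch relies on blank rows between lines. When baselines undulate or neighbouring lines overlap vertically, as the abstract notes for unconstrained handwriting, no blank row exists and the bands merge, which is the motivation for working with partial projections and contour following instead.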

Citations: 127

Planar Markov modeling for Arabic writing recognition: advancement state

Proceedings of Sixth International Conference on Document Analysis and Recognition

Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953757

H. Miled, N. Amara

In this paper, we show how planar hidden Markov models (PHMM) can offer great potential to solve difficult Arabic character recognition problems, especially cursiveness. A convenient architecture is defined for printed Arabic sub-words. It yields a solution that is easy to implement for modeling the different morphological variations of Arabic writing, i.e., vertical and variable horizontal linkages. A more flexible architecture, developed for Arabic handwritten words, is under test. The proposed structure is able to absorb the variability of handwriting. Indeed, the experiments have shown promising results and directions for further improvements. In the present paper, we describe both retained architectures, showing the applicability of the PHMM to the complexities of Arabic. This is owed precisely to the definition of the PHMM, which makes it possible to efficiently follow the natural variations in bands of the Arabic script.

Citations: 40
