Proceedings of 3rd International Conference on Document Analysis and Recognition最新文献

英文中文

Clustering and classification of document structure-a machine learning approach 文档结构的聚类和分类——一种机器学习方法

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.601965

A. Dengel, F. Dubiel

We describe a system which is capable of learning the presentation of document logical structures, exemplarily shown for business letters. Presenting a set of instances to the system, it clusters them into structural concepts and induces a concept hierarchy. This concept hierarchy is taken as a source for classifying future input. The paper introduces the different learning steps, describes how the resulting concept hierarchy is applied for logical labeling and reports on the results.

我们描述了一个能够学习文档逻辑结构表示的系统，例如商业信函。它向系统提供一组实例，将它们聚类成结构概念，并归纳出概念层次结构。这个概念层次结构被作为对未来输入进行分类的来源。本文介绍了不同的学习步骤，描述了如何将产生的概念层次用于逻辑标记和报告结果。

引用次数: 54

Document image analysis using integrated image and neural processing 文献图像分析采用综合图像和神经处理

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.599005

D. Le, G. Thoma, H. Wechsler

In this paper we present robust algorithms for detecting the page orientation (portrait/landscape) and the degree of skew for binary document images, and a method for classification of binary document images into textual or non-textual data blocks using neural network models. The performance of four neural network models are compared in terms of training times, memory requirements, and classification accuracy, and it was found that the radial basis functions performed best. The experiments show the feasibility of building an integrated document analysis system for page orientation and skew angle detection, and textual block classification.

在本文中，我们提出了检测二进制文档图像的页面方向(纵向/横向)和倾斜程度的鲁棒算法，以及使用神经网络模型将二进制文档图像分类为文本或非文本数据块的方法。从训练时间、记忆需求和分类精度三个方面比较了四种神经网络模型的性能，发现径向基函数表现最好。实验结果表明，构建一个集成的文档分析系统进行页面方向、倾斜角度检测和文本块分类是可行的。

引用次数: 10

Extraction of reference lines from documents with grey-level background using sub-images of wavelets 利用小波子图像从具有灰度背景的文档中提取参考线

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.601961

Y. Tang, Hong Ma, Dihua Xi, Y. Cheng, C. Suen

Based on wavelets, a new theoretical method has been developed to process form documents. In this method, two-dimensional multiresolution analysis (MSA), wavelet decomposition algorithm, and compactly supported orthonormal wavelets are used to transform a document image into sub-images. According to these sub-images, the reference lines of forms can be extracted, and knowledge about the geometric structure of the document can be acquired. Experiments prove that this new method can be applied to process documents with promising results.

提出了一种基于小波的表单文档处理新方法。该方法利用二维多分辨率分析(MSA)、小波分解算法和紧支持的标准正交小波将文档图像转换为子图像。根据这些子图像提取表单的参考线，获取文档的几何结构知识。实验证明，该方法可用于文件处理，效果良好。

引用次数: 3

Joint feature and classifier design for OCR OCR联合特征与分类器设计

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.602113

Dz-Mou Jung, G. Nagy

Shift-invariant, custom designed n-tuple features are combined with a probabilistic decision tree to classify isolated printed characters. The feature probabilities are estimated using a novel compound Bayesian procedure in order to delay the fall-off in classification accuracy with tree size due to a small sample set. On a ten-class confusion set of eight-point characters, the method yields error rates under 1% with only 3 training samples per class.

平移不变、自定义设计的n元组特征与概率决策树相结合，对孤立的打印字符进行分类。使用一种新的复合贝叶斯过程来估计特征概率，以延迟由于样本集小而导致的分类精度随树大小的下降。在8点字符的10类混淆集上，该方法产生的错误率低于1%，每个类只有3个训练样本。

引用次数: 2

Semi-automatic delineation of regions in floor plans 在平面图上半自动地划定区域

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.602062

Kathy Ryall, S. Shieber, J. Marks, Murray Mazer

We propose a technique that uses a proximity metric for delineating partially or fully bounded regions of a scanned bitmap that depicts a building floor plan. A proximity field is defined over the bitmap, which is used both to identify the centers of subjective regions in the image and to assign pixels to regions by proximity. The region boundaries generated by the method tend to match well the subjective boundaries of regions in the image. We discuss incorporation of the technique in a semi-automated interactive system for region identification in floor plans. In contrast to area-filling techniques for delineating areal regions of images, our approach works robustly for partially bounded regions. Furthermore, the frailties of the method that do remain, unlike those of alternative techniques, are well-moderated by simple human intervention.

我们提出了一种技术，该技术使用接近度量来描绘描绘建筑物平面图的扫描位图的部分或完全有界区域。在位图上定义一个接近场，它既用于识别图像中主观区域的中心，又用于通过接近度将像素分配给区域。该方法生成的区域边界往往与图像中主观区域边界匹配较好。我们讨论了将该技术结合到一个半自动化交互系统中，用于楼层平面图的区域识别。与用于描绘图像区域的区域填充技术相比，我们的方法对部分有界区域具有鲁棒性。此外，与其他技术不同的是，这种方法的弱点仍然存在，它可以通过简单的人为干预得到很好的缓和。

引用次数: 18

Reading handwritten US census forms 阅读手写的美国人口普查表格

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.598949

S. Madhvanath, V. Govindaraju, V. Ramanaprasad, Dar-Shyang Lee, S. Srihari

Commercial forms-reading systems for extraction of data from forms do not meet acceptable accuracy requirements on forms filled out by hand. In December 1993, NIST called industry and research organizations working in the area of handwriting recognition to participate in a test to determine the state of the art in the area. A database of form images containing actual responses received by the US Census Bureau was provided. The handwritten responses are very loosely constrained in terms of writing style, format of response and choice of text. The sizes of the lexicons provided are very large (about 50000 entries) and yet the coverage is incomplete (about 70%). In this paper we discuss the approach taken by CEDAR to automate the task of reading the census forms. The subtasks of field extraction and phrase recognition are described.

用于从表格中提取数据的商业表格读取系统不符合手工填写表格的可接受的准确性要求。1993年12月，NIST召集手写识别领域的工业和研究组织参加一个测试，以确定该领域的技术水平。提供了包含美国人口普查局收到的实际答复的表格图像数据库。手写回复在写作风格、回复格式和文本选择方面的限制非常宽松。所提供的词典的大小非常大(大约50000个条目)，但覆盖率不完整(大约70%)。在本文中，我们讨论了雪松采用的方法，使阅读人口普查表格的任务自动化。描述了字段提取和短语识别的子任务。

引用次数: 23

Circular histogram thresholding for color image segmentation 圆形直方图阈值分割彩色图像

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.601986

Din-Chang Tseng, Yao-Fu Li, Cheng-Tan Tung

A circular histogram thresholding for color image segmentation is proposed. A circular hue histogram is first constructed based on a UCS (I,H,S) color space. The histogram is automatically smoothed by a scale-space filter, then transformed into traditional histogram form, and finally recursively thresholded based on the maximum principle of variance. Three comparisons of performance are reported: (i) the proposed thresholding on the circular histogram with that on a traditional histogram; (ii) the proposed thresholding with clustering; and (iii) thresholding based on a UCS hue attribute with that based on a non-UCS hue attribute. Benefits of the proposed approach are confirmed in experiments.

提出了一种圆形直方图阈值分割方法。首先基于UCS (I,H,S)颜色空间构造圆形色调直方图。通过尺度空间滤波器对直方图进行自动平滑处理，然后将直方图转换为传统的直方图形式，最后根据方差最大原理递归阈值化。报告了三种性能比较:(i)提出的圆形直方图阈值与传统直方图阈值的比较;(ii)建议的聚类阈值法;(iii)基于UCS色相属性的阈值与基于非UCS色相属性的阈值。实验验证了该方法的有效性。

引用次数: 42

A method for table structure analysis using DP matching 一种基于DP匹配的表结构分析方法

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.601964

Y. Hirayama

This paper presents a novel method for table structure analysis. Many documents have table areas, and some have both table and figure areas. It is very important to be able to classify table and figure areas automatically. Furthermore, in tables, the column and row in which a character string is located are very important pieces of information. To detect and analyze table areas, the following method is applied: First, areas that may contain tables or figures are distinguished from text areas by the presence of horizontal and vertical lines. Next, the areas are assumed to be table areas and are analyzed as such. A judgment is made on whether each of the areas can in fact be a table area or not; in this way, the actual table areas are detected. Finally, the structures of the areas are analyzed and character strings in the areas are arranged by using the DP matching method. This method was applied to sixty-five pages of Japanese technical papers, magazines, manuals for software programs, and pages including 34 table areas, 48 line drawing areas, and 35 image areas. As a result, 96.6 percent of the areas were detected correctly and 91.7 percent of the tables were analyzed and arranged correctly.

本文提出了一种新的表结构分析方法。许多文档都有表格区域，有些文档既有表格区域又有图形区域。能够对表格和图形区域进行自动分类是非常重要的。此外，在表中，字符串所在的列和行是非常重要的信息。为了检测和分析表格区域，应用以下方法:首先，通过水平线和垂直线的存在将可能包含表格或图形的区域与文本区域区分开来。接下来，假定这些区域是表区域，并按表区域进行分析。判断每个区域是否实际上可以成为一个表区域;通过这种方式，可以检测到实际的表区域。最后，对区域的结构进行了分析，并采用DP匹配方法对区域中的字符串进行了排列。该方法应用于65页的日本技术论文、杂志、软件程序手册，包括34个表格区域、48个线条绘制区域和35个图像区域。结果，96.6%的区域被正确识别，91.7%的表格被正确分析和排列。

{"title":"A method for table structure analysis using DP matching","authors":"Y. Hirayama","doi":"10.1109/ICDAR.1995.601964","DOIUrl":"https://doi.org/10.1109/ICDAR.1995.601964","url":null,"abstract":"This paper presents a novel method for table structure analysis. Many documents have table areas, and some have both table and figure areas. It is very important to be able to classify table and figure areas automatically. Furthermore, in tables, the column and row in which a character string is located are very important pieces of information. To detect and analyze table areas, the following method is applied: First, areas that may contain tables or figures are distinguished from text areas by the presence of horizontal and vertical lines. Next, the areas are assumed to be table areas and are analyzed as such. A judgment is made on whether each of the areas can in fact be a table area or not; in this way, the actual table areas are detected. Finally, the structures of the areas are analyzed and character strings in the areas are arranged by using the DP matching method. This method was applied to sixty-five pages of Japanese technical papers, magazines, manuals for software programs, and pages including 34 table areas, 48 line drawing areas, and 35 image areas. As a result, 96.6 percent of the areas were detected correctly and 91.7 percent of the tables were analyzed and arranged correctly.","PeriodicalId":273519,"journal":{"name":"Proceedings of 3rd International Conference on Document Analysis and Recognition","volume":"23 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"1995-08-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127806687","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 40

Model matching in intelligent document understanding 智能文档理解中的模型匹配

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.598997

Gary S. D. Farrow, C. Xydeas, J. Oakley

Intelligent Document Understanding (IDU) is the process of converting scanned document pages into an electronic, processable form. We have previously presented a IDU system architecture suitable for this task which uses a hybrid bottom-up/top-down control strategy. In this paper we focus on a specific subproblem that arises within the chosen framework, concerned with selecting an appropriate page layout structure. A detailed analysis of the problem using an error propagation model, allows computationally simple search strategies to be developed. A multistage layout formation algorithm is proposed and its performance is critically assessed when implemented using two different Layout Object selection criterion. The first selection criterion is based on a maximal column area coverage; the second is based on a probabilistic Layout Object selection. Both techniques have been incorporated into the hybrid IDU system and the results presented indicate its superiority over previously reported systems.

智能文档理解(IDU)是将扫描的文档页面转换为可处理的电子形式的过程。我们之前已经提出了一种适合此任务的IDU系统架构，该架构使用自底向上/自顶向下混合控制策略。在本文中，我们将重点关注所选框架中出现的特定子问题，涉及选择适当的页面布局结构。使用错误传播模型对问题进行详细分析，可以开发计算简单的搜索策略。提出了一种多阶段布局形成算法，并采用两种不同的布局对象选择准则对其性能进行了严格评估。第一个选择标准是基于最大的列面积覆盖;第二种是基于概率布局对象选择。这两种技术都被纳入混合IDU系统，结果表明其优于先前报道的系统。

引用次数: 3

A knowledge-based approach to the layout analysis 基于知识的布局分析方法

Proceedings of 3rd International Conference on Document Analysis and Recognition

Pub Date : 1995-08-14 DOI: 10.1109/ICDAR.1995.599037

F. Esposito, D. Malerba, G. Semeraro

In this paper, we present a hybrid approach to the problem of the document analysis in which the document image is segmented by means of a top-down technique and then basic blocks are grouped bottom-up in order to form complex layout components. In this latter process, called layout analysis, only generic knowledge on typesetting conventions is exploited. Such knowledge is independent of the particular class of processed documents and turns out to be valuable for a wide range of documents. Preliminary results of the layout analysis system LEX (Layout EXpert) show the methodological validity of this approach.

在本文中，我们提出了一种混合方法来解决文档分析问题，该方法采用自顶向下的方法对文档图像进行分割，然后自底向上对基本块进行分组，以形成复杂的布局组件。在后一个过程中，称为布局分析，只利用关于排版惯例的一般知识。这种知识独立于处理过的文档的特定类别，并且对各种文档都很有价值。布局分析系统LEX (layout EXpert)的初步结果表明了该方法的有效性。

引用次数: 35

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Proceedings of 3rd International Conference on Document Analysis and Recognition

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀