Proceedings of the Fourth International Conference on Document Analysis and Recognition: Latest Papers

An interactive system to extract structured text from a geometrical representation

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619868

Benoit Poirier, M. Dagenais

The proliferation of electronic document formats impedes the dissemination and management of documents. Indeed, a common format with structural information is required to support document indexing and navigation. While in some formats it is easy to decode and preserve the document structure information, often the only easily obtainable representation is PostScript, where only the geometrical information remains. Even if an organization is willing to convert all its document-producing activities to a structure-preserving format such as HTML, the existing documents still need to be converted. The paper addresses the difficult problem of extracting the structure of a document from a geometrical representation. An interactive tool to extract the document content and structure from a geometric representation (PostScript) has been developed. It successfully analyzes several documents produced with different tools and produces structural information using the HyperText Markup Language (HTML). The end user, when presented with the extracted document structure, can interactively modify it if needed. The tool is easily extended to recognize new constructs and is aimed at organizations needing to convert numerous documents for searching and browsing on intranets or on the Internet.

Citations: 4

Combining multiple representations and classifiers for pen-based handwritten digit recognition

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620583

F. Alimoglu, Ethem Alpaydin

We investigate techniques to combine multiple representations of a handwritten digit to increase classification accuracy without significantly increasing system complexity or recognition time. We compare multiexpert and multistage combination techniques and discuss in detail, in a comparative manner, methods for combining multiple learners: voting, mixture of experts, stacking, boosting and cascading. In pen-based handwritten character recognition, the input is the dynamic movement of the pen tip over the pressure-sensitive tablet. There is also the image formed as a result of this movement. On a real-world database, we notice that the two multilayer perceptron (MLP) neural network classifiers using these representations separately make errors on different patterns, implying that a suitable combination of the two would lead to higher accuracy. Thus we implement and compare voting, mixture of experts, stacking and cascading. Combined classifiers have a lower error rate than individual ones. The final combined system of two MLPs has lower complexity and memory requirements than a single k-nearest-neighbor classifier using one of the representations.
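
To illustrate the simplest of the combination schemes named in this abstract, the following is a minimal sketch of voting by averaging class posteriors from two classifiers. It is not the authors' implementation: the function name `average_vote`, the three-class setup and the probability values are assumptions made up for the example.

```python
from typing import Sequence


def average_vote(posteriors: Sequence[Sequence[float]]) -> int:
    """Return the class index with the highest mean posterior across classifiers."""
    n_classes = len(posteriors[0])
    if any(len(p) != n_classes for p in posteriors):
        raise ValueError("all classifiers must score the same set of classes")
    # Average the scores each classifier assigns to every class.
    mean = [sum(p[c] for p in posteriors) / len(posteriors) for c in range(n_classes)]
    # Pick the class with the largest averaged score.
    return max(range(n_classes), key=mean.__getitem__)


# Hypothetical outputs for one sample, restricted to three classes for brevity.
dynamic_mlp = [0.50, 0.40, 0.10]  # classifier fed the pen-movement representation
static_mlp = [0.20, 0.70, 0.10]   # classifier fed the image representation
print(average_vote([dynamic_mlp, static_mlp]))
```

Each classifier alone would pick a different class here; averaging lets the more confident one decide, which is the intuition behind combining classifiers that err on different patterns.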

Citations: 118

Performance comparison of several feature selection methods based on node pruning in handwritten character recognition

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619805

Kyusik Chung, Jongmin Yoon

The paper presents a performance comparison of several feature selection methods based on neural network node pruning. Assuming the features are extracted and presented as the inputs of a three-layer perceptron classifier, we apply five feature selection methods before, during or after neural network training in order to prune only the input nodes of the network. Four of them are node pruning methods: the node saliency method, the node sensitivity method, and two interactive pruning methods using different contribution measures. The last one is a statistical method based on principal component analysis (PCA). The first two prune input nodes during training, whereas the last three do so before or after network training. Using gradient features and up-down and left-right hole concavity features, we perform several experiments on handwritten English alphabet and digit recognition, with and without pruning, using each of the five feature selection algorithms. The experimental results show that the node saliency method outperforms the others.

Citations: 14

Surfing an ODBMS (maintaining WWW documents with O2)

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620627

F. Buddrus, Marco Bellavia

This paper deals with the idea of an object-oriented DBMS (O2) working as an Internet server. The approach introduced offers fine-grained storage and retrieval of World Wide Web (WWW) documents, providing sophisticated document maintenance and supporting referential integrity, document integration, integration with application objects, access control, and versioning. In contrast to comparable systems, it is transparent to WWW users, i.e., it requires no proprietary query languages or protocols. While achieving the full benefits of a modern document repository, it keeps the ease of use of the traditional file system it replaces.

Citations: 4

HMM word recognition engine

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620559

D. Guillevic, C. Suen

We describe a hidden Markov model (HMM) based word recognition engine being developed to be integrated with the CENPARMI bank cheque processing system. The various modules are described in detail, and preliminary results are compared with our previous global feature recognition scheme. The engine is tested on words from a database of over 4,500 cheques of 1,400 writers.
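
The abstract does not say how the engine scores a word. As general background only, a discrete HMM is commonly scored with the forward algorithm, and a recognizer can then choose the word model with the highest likelihood. The sketch below assumes discrete observation symbols; the function name `forward_likelihood` and all parameter values are hypothetical and not taken from the paper.

```python
from typing import List, Sequence


def forward_likelihood(
    obs: Sequence[int],
    start: Sequence[float],
    trans: Sequence[Sequence[float]],
    emit: Sequence[Sequence[float]],
) -> float:
    """Likelihood P(obs | model) of a discrete HMM, via the forward algorithm."""
    n_states = len(start)
    # Initialisation: probability of starting in each state and emitting obs[0].
    alpha: List[float] = [start[s] * emit[s][obs[0]] for s in range(n_states)]
    # Induction: propagate through the transitions, then emit the next symbol.
    for symbol in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n_states)) * emit[s][symbol]
            for s in range(n_states)
        ]
    # Termination: sum over all states the model may end in.
    return sum(alpha)


# Hypothetical two-state left-to-right model over two observation symbols.
start = [1.0, 0.0]
trans = [[0.5, 0.5], [0.0, 1.0]]
emit = [[0.9, 0.1], [0.2, 0.8]]
print(forward_likelihood([0, 1], start, trans, emit))
```

For long observation sequences a practical implementation would work with log probabilities or rescale `alpha` at each step to avoid numerical underflow.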

Citations: 53

Recovery of temporal information of cursively handwritten words for on-line recognition

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620647

H. Bunke, R. Ammann, Guido Kaufmann, T. M. Ha, M. Schenkel, R. Seiler, F. Eggimann

On-line recognition differs from off-line recognition in that additional information about the drawing order of the strokes is available. This temporal information makes it easier to recognize handwritten texts with an on-line recognition system. In this paper we present a method for the recovery of the stroke order from static handwritten images. The algorithm was tested by classifying the words of an off-line database with a state-of-the-art on-line recognition system. On this database with 150 different words, written by four cooperative writers, a recognition rate of 97.4% was obtained.

Citations: 32

An object-oriented form description language and approach to handwritten form processing

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.619837

C. Cracknell, A. Downton, L. Du

Virtually all form processing systems rely upon a form description file which holds information specific to the form design. Such files are written in a form description language. This paper presents a novel form description language and a new approach to form processing which makes full use of the language. The form description language presented is novel in two ways: firstly, it allows the comprehensive utilisation of contextual information; secondly, it enables the storage of form models which economically describe the variations in form design that are allowed for a particular class of forms.

Citations: 4

Recognizing on-line handwritten Chinese character via FARG matching

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620578

Jing Zheng, Xiaoqing Ding, Youshou Wu

The paper presents a novel method for online handwritten Chinese character recognition. In our method, each category of character is described by a fuzzy attributed relational graph (FARG). A relaxation algorithm is developed to match the input pattern with every FARG. For decision making, a similarity measure is established via a statistical technique to calculate the matching degree between the input pattern and the reference FARG, according to which the recognition result is determined. The principle of our method makes it very robust against stroke connection and stroke order variation as well as stroke shape deformation. A database of 22530 samples collected from 6 subjects is used to test our recognition system, which can recognize 3755 categories of Chinese characters. The result shows that our method is very effective: a top-1 recognition rate of 98.8% and a top-10 recognition rate of 99.7% are reached.

Citations: 13

Measuring the effects of OCR errors on similarity linking

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620654

A. Myka, Ulrich Güntzer

The vector-space model offers an easy and robust model for information retrieval. In this model, the similarities between queries and documents, as well as the similarities between documents themselves, are important. Document similarities may be used to generate links between documents that lead users from one document to related ones. Studies have shown that the vector-space model is robust in the context of OCR processing if manually constructed queries are used. However, it is not clear whether this model, if used for hypertext construction, is robust with regard to the data corruption caused by OCR engines. In this paper, we describe the performance of automatic hypertext construction, based on the vector-space model, with regard to three different measures: the number of overtakings within the rankings used, the accumulated distance of a document's position within the rankings, and a comparison based on recall-precision graphs.
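
As a rough illustration of how OCR errors can affect document similarity in a vector-space model, the sketch below computes cosine similarity over raw term-frequency vectors. This is a simplification and not the setup used in the paper: the whitespace tokenization, the absence of term weighting such as idf, the function name `cosine_similarity` and the sample strings are all assumptions.

```python
import math
from collections import Counter


def cosine_similarity(text_a: str, text_b: str) -> float:
    """Cosine of the angle between the term-frequency vectors of two texts."""
    va = Counter(text_a.lower().split())
    vb = Counter(text_b.lower().split())
    # Only terms present in both texts contribute to the dot product.
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(c * c for c in va.values()))
    norm_b = math.sqrt(sum(c * c for c in vb.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


clean = "character recognition system"
ocr_output = "character rec0gnition system"  # hypothetical OCR error: 'o' read as '0'
print(cosine_similarity(clean, clean))
print(cosine_similarity(clean, ocr_output))
```

A single misrecognized character turns a shared term into two unrelated terms, so the similarity between the clean and the corrupted text drops even though the documents are nominally identical.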

Citations: 8

A study of moment functions and its use in Chinese character recognition

Pub Date : 1997-08-18 DOI: 10.1109/ICDAR.1997.620566

S. Liao, Q. Lu

Moment descriptors have been developed as features in pattern recognition since the moment method was first introduced. In this paper, new moment features for Chinese character recognition are proposed. These provide significant improvements in terms of Chinese character recognition, especially for those characters that are very close in shapes.
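
The abstract does not define the new moment features it proposes. For background only, the sketch below computes classical central geometric moments of a binary image, the kind of descriptor the moment method starts from. The function name `central_moment` and the 3x3 test image are assumptions for the example, and these are not the paper's new features.

```python
from typing import Sequence


def central_moment(image: Sequence[Sequence[int]], p: int, q: int) -> float:
    """Central moment mu_pq of a binary image given as rows of 0/1 pixel values."""
    mass = sum(sum(row) for row in image)  # m00: number of foreground pixels
    if mass == 0:
        raise ValueError("image has no foreground pixels")
    # Centroid (x_bar, y_bar); x indexes columns and y indexes rows.
    x_bar = sum(x * v for row in image for x, v in enumerate(row)) / mass
    y_bar = sum(y * v for y, row in enumerate(image) for v in row) / mass
    # Sum (x - x_bar)^p * (y - y_bar)^q over the foreground pixels.
    return sum(
        ((x - x_bar) ** p) * ((y - y_bar) ** q) * v
        for y, row in enumerate(image)
        for x, v in enumerate(row)
    )


# Hypothetical 3x3 image containing a single vertical stroke.
vertical_bar = [
    [0, 1, 0],
    [0, 1, 0],
    [0, 1, 0],
]
print(central_moment(vertical_bar, 2, 0))  # horizontal spread of the stroke
print(central_moment(vertical_bar, 0, 2))  # vertical spread of the stroke
```

Because the moments are taken about the centroid, they do not change when the stroke is translated within the image, which is one reason moment descriptors are used as shape features.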

Citations: 19
