Proceedings of 3rd International Conference on Document Analysis and Recognition
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.602070
A Markovian random field approach to information retrieval
D. Bouchaffra, J. Meunier
A Markovian random field approach is proposed for automatic information retrieval in full-text documents. We draw an analogy between the flow of connections between queries and document images and systems in statistical mechanics. The Markovian flow process (MFP) machine models the interaction between queries and document images as a dynamical system. It seeks to fit the user's queries by changing the set of descriptors contained in the document images, so the informational states of the collection are constantly transformed. A certain degradation of the system is associated with each state. We use a simulated annealing algorithm to isolate low-energy states, which correspond, in a certain sense, to the best "matching" between queries and images.
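The low-energy search the abstract relies on can be sketched generically. This is a minimal simulated-annealing loop; the bit-vector descriptors and Hamming-distance energy below are toy stand-ins, not the paper's actual MFP formulation.

```python
import math
import random

def simulated_annealing(energy, neighbor, state, t0=1.0, cooling=0.95, steps=2000):
    """Generic simulated annealing: accept worse states with probability
    exp(-dE/T) so the search can escape local minima, then cool T."""
    t = t0
    e = energy(state)
    best, best_e = state, e
    for _ in range(steps):
        cand = neighbor(state)
        ce = energy(cand)
        if ce <= e or random.random() < math.exp((e - ce) / t):
            state, e = cand, ce
            if e < best_e:
                best, best_e = state, e
        t *= cooling
    return best, best_e

# Toy "matching" energy: Hamming distance between a query's descriptor
# vector and a document's descriptor vector (both hypothetical).
random.seed(0)
query = [1, 0, 1, 1, 0, 1, 0, 0]

def energy(doc):
    return sum(q != d for q, d in zip(query, doc))

def neighbor(doc):
    flipped = list(doc)
    flipped[random.randrange(len(flipped))] ^= 1
    return flipped

best, e = simulated_annealing(energy, neighbor, [0] * 8)
```

At the end of the cooling schedule the best state found has zero mismatches with the query, i.e. the "best matching" state of this toy system.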
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.599030
Extracting individual features from moments for Chinese writer identification
Cheng-Lin Liu, Ru-Wei Dai, Ying-Jian Liu
For writer identification (WI) with indeterminate classes (writers) and objects (characters), a good strategy is to extract individual features with clear physical meanings and small dynamic ranges. This paper presents a new moment-based feature method for identifying Chinese writers, in which normalized individual features are derived from the geometric moments of character images. The extracted features are invariant to translation, scaling, and stroke width; they correspond explicitly to the human perception of shape, and their values fall within small dynamic ranges. Writer recognition and verification experiments demonstrate the effectiveness of the method, with promising results.
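The translation and scale normalization of geometric moments can be illustrated in a few lines. This is a pure-Python sketch of the normalized central moments eta_pq on a toy square image; the feature choice and test images are illustrative, not the paper's actual individual features.

```python
def moments_features(img):
    """Translation- and scale-normalized moments (eta20, eta02, eta11) of a
    binary image given as a list of rows of 0/1 ints."""
    pts = [(x, y) for y, row in enumerate(img) for x, v in enumerate(row) if v]
    m00 = len(pts)                              # zeroth moment = pixel count
    cx = sum(x for x, _ in pts) / m00           # centroid removes translation
    cy = sum(y for _, y in pts) / m00

    def eta(p, q):
        mu = sum((x - cx) ** p * (y - cy) ** q for x, y in pts)
        # Dividing by m00^((p+q)/2 + 1) removes the dependence on scale.
        return mu / m00 ** ((p + q) / 2 + 1)

    return eta(2, 0), eta(0, 2), eta(1, 1)

def square(side, ox=0, oy=0, size=16):
    """A side x side filled square at offset (ox, oy) in a size x size image."""
    img = [[0] * size for _ in range(size)]
    for y in range(oy, oy + side):
        for x in range(ox, ox + side):
            img[y][x] = 1
    return img

f_small = moments_features(square(4))
f_shifted = moments_features(square(4, ox=5, oy=7))
f_large = moments_features(square(8))
```

Shifting the square leaves the features exactly unchanged; doubling its size changes them only by small discretization effects, which is the sense in which such features have small, stable dynamic ranges.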
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.601980
Drawing capturing system using image enhancement
Norio Nakamura, K. Hosaka, Masakazu Nagura
The paper describes the properties of Ueda's (1985) image enhancement method for line drawings and its merits for practical use. The method can remove line discontinuities and mis-connections caused by scanning errors. It is applied to simple images to evaluate its effect quantitatively. The authors confirm that it is more efficient than other methods, and propose a drawing capturing system based on it that can build high-quality drawing databases faster than other systems.
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.602096
A simplified attributed graph grammar for high-level music recognition
S. Baumann
This paper describes a simplified attributed programmed graph grammar for representing and processing a priori knowledge about common music notation. The approach serves as a high-level recognition stage and is interlocked with the preceding low-level recognition phases of our complete optical music recognition system (DOREMIDI++). The implemented grammar rules and control diagrams form a declarative knowledge base that drives a transformation algorithm, which converts the results of the symbol recognition stages into a symbolic representation of the musical score.
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.602082
A system for scanning and segmenting cursively handwritten words into basic strokes
C. Privitera, R. Plamondon
This paper presents a segmentation method that partly mimics the cognitive-behavioral process by which human subjects recover motor-temporal information from the image of a handwritten word. The approach does not rely on any thinning procedure; instead it manipulates a different type of information, the curvature of the word contour. Starting from the maximum-curvature points, which roughly correspond to the beginnings of strokes, the algorithm scans the word following the natural course of the line and attempts to repeat the movement executed by the writer while generating the word. At each maximum-curvature point, the line is segmented and reconstructed by smooth interpolation of the innermost points of the line just covered. At the end of the scanning process, a temporal sequence of motor strokes is obtained that plausibly composes the originally intended movement.
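The role of maximum-curvature points as stroke boundaries can be sketched on a polyline. The turning-angle threshold below is a crude stand-in for the contour-curvature maxima the paper uses; the threshold value and the L-shaped test path are arbitrary.

```python
import math

def segment_at_curvature_maxima(points, angle_thresh=1.0):
    """Split a polyline at vertices where the turning angle (radians)
    exceeds a threshold -- a toy proxy for curvature-maximum points."""
    cuts = [0]
    for i in range(1, len(points) - 1):
        (x0, y0), (x1, y1), (x2, y2) = points[i - 1], points[i], points[i + 1]
        a1 = math.atan2(y1 - y0, x1 - x0)
        a2 = math.atan2(y2 - y1, x2 - x1)
        # Wrap the angle difference into (-pi, pi] before taking |.|.
        turn = abs(math.atan2(math.sin(a2 - a1), math.cos(a2 - a1)))
        if turn > angle_thresh:
            cuts.append(i)
    cuts.append(len(points) - 1)
    return [points[a:b + 1] for a, b in zip(cuts, cuts[1:])]

# An "L"-shaped trace: the sharp corner at (4, 0) becomes a cut point,
# yielding two strokes that share that point.
path = [(x, 0) for x in range(5)] + [(4, y) for y in range(1, 5)]
strokes = segment_at_curvature_maxima(path)
```

Segmenting at such points and keeping the shared boundary point mirrors how the recovered stroke sequence stays connected along the original trace.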
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.598966
False hits of tri-syllabic queries in a Chinese signature file
Tyne Liang, Suh-Yin Lee, Wei-Pang Yang
In applying the superimposed coding method to character-based Chinese text retrieval, we find two kinds of false hits for a multi-syllabic (multi-character) query. The first is a random false hit (RFH), caused by bits in a document signature being set accidentally by irrelevant characters. The other is an adjacency false hit (AFH), caused by the loss of character-sequence information during signature creation. Since many query terms are proper nouns and Chinese names, which often contain three characters (tri-syllabic), we derive a formula to estimate the RFH for tri-syllabic queries. As the AFH cannot be reduced by the single-character (monogram) hashing method, a method that hashes consecutive character pairs (bigrams) is designed to reduce both the AFH and the RFH. We find that an optimal weight assignment exists that minimizes the false hit rate in a combined scheme encoding both monogram and bigram keys in document signatures.
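Superimposed coding and the two false-hit types are easy to demonstrate. The hash scheme, signature width, and Latin-letter stand-ins for Chinese characters below are all hypothetical choices for illustration, not the paper's parameters.

```python
import hashlib

SIG_BITS = 64

def _hash_bits(key, nbits=2):
    """Map a key to nbits bit positions (a hypothetical hashing scheme)."""
    h = hashlib.md5(key.encode("utf-8")).digest()
    return {h[i] % SIG_BITS for i in range(nbits)}

def signature(text, use_bigrams=True):
    """Superimposed coding: OR together the bit sets of every character
    (monogram) and, optionally, every adjacent pair (bigram)."""
    bits = set()
    for ch in text:
        bits |= _hash_bits(ch)
    if use_bigrams:
        for a, b in zip(text, text[1:]):
            bits |= _hash_bits(a + b)
    return bits

def matches(query, doc_sig, use_bigrams=True):
    """A signature match: every query bit is set in the document signature."""
    return signature(query, use_bigrams) <= doc_sig

# True hit: "AB" really occurs in "XABY", so its bits are always covered.
hit = matches("AB", signature("XABY"))

# Adjacency false hit: "CADB" contains 'A' and 'B' but not adjacently, yet
# a monogram-only signature still matches. Encoding bigram keys as well
# typically rejects this case (not guaranteed: bit collisions can occur,
# which is the residual RFH the paper's weight assignment trades off).
afh = matches("AB", signature("CADB", use_bigrams=False), use_bigrams=False)
```

Because a signature match only checks bit coverage, it can never miss a true occurrence; the design question the paper studies is how to allocate bits between monogram and bigram keys to minimize the false hits.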
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.598986
A high quality vectorization combining local quality measures and global constraints
M. Röösli, G. Monagan
We present a vectorization system that generates vector data corresponding to the line structures of a raster image. The vector data consists of two primitives: straight line segments and circular arcs. The system measures the quality of each primitive it generates, so the vectorization not only produces high-quality vector data but also gives a precise description of that quality. This is crucial if the requirements of industrial applications are to be met. So that the quality of the vector data is not lost when primitives are assembled into line objects, geometric constraints are incorporated already at the vectorization level: constraints such as requiring segments to be parallel or perpendicular, circular arcs to be concentric, or the tangents of primitives to be equal at their connection point. After the constraints have been satisfied, the resulting primitives still fulfil the quality requirements they met before the constraints were imposed. The possibility of refitting the generated vector data under adapted constraints allows efficient interactive postprocessing of the data.
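A per-primitive quality measure of the kind described can be as simple as a fit residual. The least-squares line fit and RMS-residual score below are a toy version of quality scoring for the "straight line segment" primitive; the paper's actual measures are not specified here.

```python
def fit_line_quality(points):
    """Fit y = a*x + b by least squares and report the RMS residual as the
    primitive's quality measure (smaller is better)."""
    n = len(points)
    sx = sum(x for x, _ in points)
    sy = sum(y for _, y in points)
    sxx = sum(x * x for x, _ in points)
    sxy = sum(x * y for x, y in points)
    a = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    b = (sy - a * sx) / n
    rms = (sum((y - (a * x + b)) ** 2 for x, y in points) / n) ** 0.5
    return a, b, rms

# Collinear raster points fit the segment primitive with zero residual.
a, b, rms = fit_line_quality([(0.0, 1.0), (1.0, 3.0), (2.0, 5.0), (3.0, 7.0)])
```

Keeping `rms` alongside each fitted primitive is what lets a later constrained refit (e.g. forcing two segments perpendicular) verify that the result still meets the original quality bound.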
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.599040
ODIL: an SGML description language of the layout structure of documents
P. Lefèvre, François Reynaud
This paper describes an SGML coding format for the output of a document recognition prototype. Our proposal is a DTD named "ODIL" (Office Document Image description Language) that precisely describes the layout structure of a document after all recognition phases, including OCR. All layout objects of a document are defined as SGML elements, and their characteristics are defined by SGML attributes. The basic objects are blocks containing homogeneous information. Five types of information are supported by the ODIL language: text, photos, line graphics, tables, and mathematical formulas. The ODIL representation of the recognition results is well adapted to subsequent logical structure recognition. Starting from the ODIL DTD and using the RAINBOW transit DTD will make it possible to use SGML tools for logical structure recognition, which is viewed as an SGML up-conversion problem.
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.601963
Description and recognition of form and automated form data entry
Jinhui Liu, Xiaoqing Ding, Youshou Wu
In this paper we present a form description method in which frame lines constitute a so-called frame template that reflects the structure of a form either topologically or geometrically. An item-traversal algorithm is then proposed to locate and label the form's items. We have also developed a robust and fast frame-line detection method that makes this form description practical for form recognition. Experimental results show that our approach provides an effective way to convert printed forms into a computerized format, or to collect database information from printed forms.
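Frame-line detection of the kind the abstract mentions is often built on projection profiles. The row-fill-ratio detector below is a simple stand-in, not the paper's method; the threshold and toy form image are arbitrary.

```python
def detect_frame_lines(img, min_fill=0.8):
    """Find horizontal frame lines as rows whose black-pixel ratio exceeds
    min_fill (a projection-profile stand-in for frame-line detection)."""
    width = len(img[0])
    return [y for y, row in enumerate(img) if sum(row) / width >= min_fill]

# A toy 8x10 binary form image with horizontal rules at rows 0, 4 and 7.
W = 10
img = [[0] * W for _ in range(8)]
for y in (0, 4, 7):
    img[y] = [1] * W
img[2][3] = 1  # stray content pixel, stays below the fill threshold
rules = detect_frame_lines(img)
```

The detected rules (with an analogous pass over columns) are what would populate the frame template, and the cells they bound are the items the traversal algorithm then labels.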
Pub Date: 1995-08-14 | DOI: 10.1109/ICDAR.1995.599038
Knowledge-based derivation of document logical structure
Debashish Niyogi, S. Srihari
Analyzing a document image to derive a symbolic description of its structure and contents involves using spatial domain knowledge to classify the different printed blocks (e.g., text paragraphs), group them into logical units (e.g., newspaper stories), and determine the reading order of the text blocks within each unit. These steps convert the physical structure of a document into its logical structure. We have developed a computational model for document logical structure derivation in which a rule-based control strategy uses the data obtained from analyzing a digitized document image and makes inferences with a multi-level knowledge base of document layout rules. The knowledge-based document logical structure derivation system (DeLoS) based on this model consists of a hierarchical rule-based control system that guides the block classification, grouping, and read-ordering operations; a global data structure that stores the document image data and incremental inferences; and a domain knowledge base that encodes the rules governing document layout.