Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)最新文献

英文中文

An Efficient Feature Extraction and Dimensionality Reduction Scheme for Isolated Greek Handwritten Character Recognition 孤立希腊手写体字符识别的高效特征提取与降维方案

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.49

G. Vamvakas, B. Gatos, Sergios Petridis, N. Stamatopoulos

In this paper, we present an off-line methodology for isolated Greek handwritten character recognition based on efficient feature extraction followed by a suitable feature vector dimensionality reduction scheme. Extracted features are based on (i) horizontal and vertical zones, (ii) the projections of the character profiles, (Hi) distances from the character boundaries and (iv) profiles from the character edges. The combination of these types of features leads to a 325- dimensional feature vector. At a next step, a dimensionality reduction technique is applied, according to which the dimension of the feature space is lowered down to comprise only the features pertinent to the discrimination of characters into the given set of letters. In this paper, we also present a new Greek handwritten database of 36,960 characters that we created in order to measure the performance of the proposed methodology.

在本文中，我们提出了一种离线的孤立希腊手写字符识别方法，该方法基于有效的特征提取，然后是合适的特征向量降维方案。提取的特征是基于(i)水平和垂直区域，(ii)字符轮廓的投影，(Hi)到字符边界的距离，(iv)到字符边缘的轮廓。这些类型的特征的组合导致325维特征向量。下一步，应用降维技术，根据该技术，特征空间的维数被降低，仅包含与将字符区分为给定字母集相关的特征。在本文中，我们还提出了一个新的希腊文手写数据库，该数据库包含36,960个字符，我们创建了该数据库，以衡量所提出方法的性能。

引用次数: 38

Robust Page Segmentation Based on Smearing and Error Correction Unifying Top-down and Bottom-up Approaches 统一自顶向下和自底向上方法的基于涂抹和纠错的鲁棒页面分割

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.225

Huaigu Cao, R. Prasad, P. Natarajan, Ehry MacRostie

In this paper we present a robust multi-pass page segmentation algorithm. The first pass uses a modified smearing algorithm and the second pass performs a hybrid of bottom-up and top-down segmentation on the output of the first pass. Unlike traditional approaches, the bottom-up and top-down steps are based on primitive results of a smearing based page segmentation algorithm. Therefore, "split" and "merge" processes start with text blocks that are mostly true text blocks but a few of them are either touching or broken. We present experimental results on newspaper and journal documents from different languages to demonstrate the robustness and language independence of our approach.

本文提出了一种鲁棒的多通道页面分割算法。第一次通过使用修改的涂抹算法，第二次通过在第一次通过的输出上执行自下而上和自上而下的混合分割。与传统方法不同，自底向上和自顶向下的步骤是基于基于涂抹的页面分割算法的原始结果。因此，“分裂”和“合并”过程从文本块开始，这些文本块大多是真正的文本块，但其中一些不是触摸就是破碎。我们提供了不同语言的报纸和期刊文档的实验结果，以证明我们的方法的鲁棒性和语言独立性。

引用次数: 24

A General Method of Segmentation-Recognition Collaboration Applied to Pairs of Touching and Overlapping Symbols 一种用于接触重叠符号对分割-识别协同的通用方法

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.11

C. Renaudin

In this paper, we present a general method to segment and recognize pairs of touching and overlapping symbols. The method, whose purpose is to lead to a generic approach in future works, is based on the evaluation of several segmentation candidates produced by grouping elements of an over-segmentation. A pure general method would involve a high number of segmentation candidates. An adaptive introduction of heuristics, representing the knowledge dedicated to the applicative domain, allows to decrease this number. We have largely tested the method on pairs of touching and overlapping digits.

本文提出了一种通用的分割和识别触摸和重叠符号对的方法。该方法是基于对过度分割的分组元素产生的若干候选分割的评估，其目的是在未来的工作中导致一种通用的方法。一个纯粹的通用方法将涉及大量的分割候选。自适应地引入启发式，表示专用于应用领域的知识，可以减少这个数字。我们已经在成对触摸和重叠的数字上对该方法进行了大量测试。

引用次数: 13

SVM Based Scheme for Thai and English Script Identification 基于支持向量机的泰英文字识别方案

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.237

S. Chanda, O. R. Terrades, U. Pal

In some Thai documents, a single text line of a document page may contain both Thai and English scripts. For the optical character recognition (OCR) of such a document page it is better to identify, at first, Thai and English script portions and then to use individual OCR system of the respective scripts on these identified portions. In this paper, a SVM based method is proposed for identification of word-wise printed English and Thai scripts from a single line of a document page. Here, at first, the document is segmented into lines and then lines are segmented into character groups (words). In the proposed scheme, we identify the script of the individual character group combining different character features obtained from structural shape, profile, component overlapping information, topological properties, water reservoir concept etc. Based on the experiment on 6110 data we obtained 99.36% script identification accuracy from the proposed scheme.

在某些泰语文档中，文档页面的单个文本行可能同时包含泰语和英语脚本。对于这种文档页面的光学字符识别(OCR)，最好先识别泰语和英语脚本部分，然后在这些已识别的部分上使用各自脚本的单独OCR系统。本文提出了一种基于支持向量机的方法，用于从文档页面的单行中识别打印的英语和泰语脚本。在这里，首先将文档分割成行，然后将行分割成字符组(单词)。在该方案中，我们结合从结构形状、剖面、组件重叠信息、拓扑属性、水库概念等获得的不同特征特征，识别单个特征组的脚本。通过对6110个数据的实验，该方案的文字识别准确率达到99.36%。

引用次数: 32

Text and Layout Information Extraction from Document Files of Various Formats Based on the Analysis of Page Description Language 基于页面描述语言分析的各种格式文档文件文本和版面信息提取

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.242

T. Hirano, Y. Okano, Yasuhiro Okada, Fumio Yoda

We propose a document analysis method, which extracts text and layout information from document files of various formats. This method analyzes the page description language (PDL) data generated from a printed document. By converting the document to PDL data, this method can handle various document formats. Graphic elements such as text objects, image objects, and path objects in the PDL data are analyzed to extract text and layout information (character size, character position, and table position). By applying OCR to the image objects and the path objects, text images in source documents and vectorized font characters in engineering drawings are converted to text. Moreover, tables in various documents are detected by analyzing path objects. Therefore, it is possible to extract the full content information from document files of various formats as long as the document is printable.

提出了一种从各种格式的文档文件中提取文本和版面信息的文档分析方法。该方法分析从打印文档生成的页面描述语言(PDL)数据。通过将文档转换为PDL数据，该方法可以处理各种文档格式。分析PDL数据中的文本对象、图像对象和路径对象等图形元素，以提取文本和布局信息(字符大小、字符位置和表位置)。通过对图像对象和路径对象进行OCR处理，将源文档中的文本图像和工程图纸中的矢量化字体字符转换为文本。此外，还可以通过分析路径对象来检测各种文档中的表。因此，只要文档是可打印的，就可以从各种格式的文档文件中提取完整的内容信息。

引用次数: 6

A New Syntactic Approach to Graphic Symbol Recognition 图形符号识别的句法新方法

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.26

Yu Yajie, Wan Zhang, Wenyin Liu

This paper presents a novel syntactic symbol recognition approach to the vector based symbol recognition problem. Different from existing syntactic approaches, which usually describe the geometric relations among primitives, our method formulates a new model to describe the geometric information of a primitive with respect to the whole symbol object based on mathematical analysis. The mathematical model is theoretically rotation and scale invariant and experiments show its accuracy for vector based symbol recognition.

针对基于向量的符号识别问题，提出了一种新的句法符号识别方法。与现有的描述原语之间几何关系的语法方法不同，本文的方法在数学分析的基础上，建立了一种新的模型来描述原语相对于整个符号对象的几何信息。该数学模型在理论上是旋转和尺度不变的，实验证明了该模型对基于矢量的符号识别的准确性。

引用次数: 6

Performance Analysis Framework for Layout Analysis Methods 布局分析方法的性能分析框架

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.207

A. Antonacopoulos, D. Bridson

This paper presents a new framework for in-depth analysis of the performance of layout analysis methods. Contrary to existing approaches aimed at evaluation or benchmarking, the proposed framework provides detailed information at various levels that can be used by method developers to identify specific problems and improve their work. Complex layouts are supported as well as the flexible configuration of goal-oriented performance analysis scenarios. The comparison of segmentation results against the ground truth is performed in a very efficient way based on a decomposition of any region shape into an interval-based description. The framework has been validated using the dataset and method results of the ICDAR2005 Page Segmentation Competition.

本文提出了一个新的框架来深入分析布局分析方法的性能。与现有的旨在评估或基准测试的方法相反，建议的框架提供了方法开发人员可以使用的各个级别的详细信息，以确定具体问题并改进他们的工作。支持复杂的布局以及面向目标的性能分析场景的灵活配置。基于将任何区域形状分解为基于区间的描述，以一种非常有效的方式将分割结果与地面真实情况进行比较。使用ICDAR2005页面分割大赛的数据集和方法结果对该框架进行了验证。

引用次数: 25

A Blind Indic Script Recognizer for Multi-script Documents 用于多脚本文档的盲索引脚本识别器

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.2

P. Pati, A. Ramakrishnan

We report a hierarchical blind script identifier for 11 different Indian scripts. An initial grouping of the 11 scripts is accomplished at the first level of this hierarchy. At the subsequent level, we recognize the script in each group. The various nodes of this tree use different feature-classifier combinations. A database of 20,000 words of different font styles and sizes is collected and used for each script. Effectiveness of Gabor and Discrete Cosine Transform features has been independently evaluated using nearest neighbor, linear discriminant and support vector machine classifiers. The minimum and maximum accuracies obtained, using this hierarchical mechanism, are 92.2% and 97.6%, respectively.

我们报告了11种不同印度文字的分级盲文字标识符。11个脚本的初始分组是在这个层次结构的第一级完成的。在接下来的层次上，我们识别每一组中的脚本。该树的各个节点使用不同的特征分类器组合。收集了不同字体样式和大小的2万个单词的数据库，并用于每个脚本。使用最近邻、线性判别和支持向量机分类器对Gabor和离散余弦变换特征的有效性进行了独立评估。使用这种分层机制获得的最小和最大精度分别为92.2%和97.6%。

引用次数: 6

Segmentation of Very Low Resolution Screen-Rendered Text 非常低分辨率屏幕渲染文本的分割

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.229

S. Wachenfeld, Stefan Fleischer, Hans-Ulrich Klein, Xiaoyi Jiang

The lower the resolution of a given text is, the more difficult it becomes to segment it into single characters. The resolution of screen-rendered text can be very low. This paper focuses on smoothed screen-rendered text of very low resolution with typical x-heights of 4 to 7 pixels which is much lower than in other low resolution OCR situations. We propose a recognition-based segmentation algorithm which makes use of over segmentation by dynamic programming, candidate rating by single character classifiers and a graph based search algorithm for an optimal cut sequence. The algorithm is described in detail and experimental results are presented which show the performance on example screen- shot images taken from the public Screen-Word database.

给定文本的分辨率越低，将其分割成单个字符就越困难。屏幕渲染文本的分辨率可能非常低。本文关注的是非常低分辨率的平滑屏幕渲染文本，典型的x高度为4到7像素，这比其他低分辨率OCR情况要低得多。我们提出了一种基于识别的分割算法，该算法利用动态规划的过度分割、单字符分类器的候选排序和基于图的搜索算法来寻找最优切割序列。对该算法进行了详细的描述，并给出了从公共screen- word数据库中获取的截屏图像的实验结果。

引用次数: 9

Writer Identification Using Steered Hermite Features and SVM 基于导向赫米特特征和支持向量机的作家识别

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.271

A. Imdad, S. Bres, V. Eglin, C. Rivero-Moreno, H. Emptoz

Writer recognition is considered as a difficult problem to solve due to variations found in the writing, even from the same writer. In this paper, steered Hermite features are used to identify writer from a written document. We will show that steered Hermite features are highly useful for text images because they extract lot of information, notably for data characterized by oriented features, curves and segments. The algorithm we propose here, first calculates the steered Hermite features of the images which are then passed on to support vector machine for training and testing. The base of tests consists of sample of some lines of writings (five at most) of primarily diversified writings of authors from IAM database. With the proposed algorithm based on steered Hermite features, we were able to achieve an accuracy of around 83% percent for a set of 30 authors with non overlapping images of written text.

作者识别被认为是一个难以解决的问题，因为在写作中发现了差异，甚至来自同一作者。在本文中，使用导向的埃尔米特特征从书面文件中识别作者。我们将展示导向Hermite特征对文本图像非常有用，因为它们提取了大量信息，特别是对于定向特征、曲线和片段特征的数据。我们在这里提出的算法，首先计算图像的导向Hermite特征，然后将其传递给支持向量机进行训练和测试。测试的基础包括来自IAM数据库的作者主要多样化的作品的一些行(最多五行)的样本。使用基于导向Hermite特征的算法，我们能够在一组30位作者的无重叠书面文本图像中实现83%左右的准确率。

引用次数: 28

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀