Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)最新文献

英文中文

Combination of OCR Engines for Page Segmentation Based on Performance Evaluation 基于性能评估的组合OCR引擎页面分割

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.83

Miquel A. Ferrer, Ernest Valveny

In this paper we present a method to improve the performance of individual page segmentation engines based on the combination of the output of several engines. The rules of combination are designed after analyzing the results of each individual method. This analysis is performed using a performance evaluation framework that aims at characterizing each method according to its strengths and weaknesses rather than computing a single performance measure telling which is the "best" segmentation method.

在本文中，我们提出了一种基于多个引擎输出的组合来提高单个页面分割引擎性能的方法。在分析了每种方法的结果后，设计了组合规则。此分析使用性能评估框架执行，该框架旨在根据每种方法的优点和缺点来描述其特征，而不是计算单个性能度量来告诉哪个是“最佳”分割方法。

引用次数: 2

Arabic Handwriting Texture Analysis for Writer Identification Using the DWT-Lifting Scheme 基于dwt提升方案的阿拉伯笔迹纹理分析

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.62

S. Gazzah, N. Amara

In this paper, we present an approach for writer identification using off-line Arabic handwriting. The proposed method explores the handwriting texture analysis by 2D discrete wavelet transforms using lifting scheme. A comparative evaluation between textural features extracted by 9 different wavelet transform functions was done. A modular multilayer perceptron classifier was used. Experiments have shown that writer identification accuracies reach best performance levels with an average rate of 95.68%. Experiments have been carried out using a database of 180 text samples. The chosen text was made to guarantee the involvement of the various internal shapes and letter locations within an Arabic subword.

在本文中，我们提出了一种使用离线阿拉伯笔迹的作家识别方法。提出了一种基于二维离散小波变换的手写体纹理分析方法。对9种不同小波变换函数提取的纹理特征进行了对比评价。采用模块化多层感知器分类器。实验表明，作者识别准确率达到了最佳性能水平，平均准确率为95.68%。实验使用了一个包含180个文本样本的数据库。所选择的文本是为了保证各种内部形状和字母位置在阿拉伯语子词的参与。

引用次数: 46

On Segmentation of Documents in Complex Scripts 复杂文字文档的分词研究

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.194

K. S. S. Kumar, S. Kumar, C. V. Jawahar

Document image segmentation algorithms primarily aim at separating text and graphics in presence of complex layouts. However, for many non-Latin scripts, segmentation becomes a challenge due to the characteristics of the script. In this paper, we empirically demonstrate that successful algorithms for Latin scripts may not be very effective for Indic and complex scripts. We explain this based on the differences in the spatial distribution of symbols in the scripts. We argue that the visual information used for segmentation needs to be enhanced with other information like script models for accurate results.

文档图像分割算法的主要目标是在复杂的布局中分离文本和图形。然而，对于许多非拉丁文字，由于文字的特点，分割成为一个挑战。在本文中，我们通过经验证明，拉丁文字的成功算法可能对印度语和复杂的文字并不十分有效。我们根据文字中符号空间分布的差异来解释这一点。我们认为，用于分割的视觉信息需要与脚本模型等其他信息一起增强，以获得准确的结果。

引用次数: 25

Skew Detection for Chinese Handwriting by Horizontal Stroke Histogram 基于水平笔画直方图的汉字书写歪斜检测

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.233

Tong-Hua Su, Tian-Wen Zhang, Hu-Jie Huang, Yu Zhou

This paper proposes a skew detection method for real Chinese handwritten documents. After analyzing the characteristics of Chinese characters, it utilizes the horizontal stroke histogram. Its accuracy, ability to increase the recall rate of text line separation, and CPU time consuming are investigated using 853 real Chinese handwritten documents. The results show that: 1) the method can identify 98.83% of the skew angles within one degree, with an improvement of 8.44% than Wigner-Ville distribution (WVD) method; 2) when incorporated into text line separation, the recall rate has an improvement of 2.54% than WVD method; 3) the method only consumes one-twentieth of WVD method on the same test environment.

提出了一种针对真实中文手写文档的倾斜检测方法。在分析了汉字的特点后，采用了横画直方图。以853份真实中文手写文档为例，考察了该方法的准确率、提高文本行分离查全率的能力和CPU耗时。结果表明:1)该方法能在1度内识别出98.83%的倾斜角，比WVD方法提高了8.44%;2)结合文本行分离，召回率比WVD方法提高了2.54%;3)在相同的测试环境下，该方法的能耗仅为WVD法的二十分之一。

引用次数: 22

Layout Based Information Extraction from HTML Documents 从HTML文档中提取基于布局的信息

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.155

Radek Burget

We propose a method of information extraction from HTML documents based on modelling the visual information in the document. A page segmentation algorithm is used for detecting the document layout and subsequently, the extraction process is based on the analysis of mutual positions of the detected blocks and their visual features. This approach is more robust that the traditional DOM-based methods and it opens new possibilities for the extraction task specification.

提出了一种基于可视化信息建模的HTML文档信息提取方法。采用页面分割算法检测文档布局，然后根据检测块的相互位置及其视觉特征进行提取。这种方法比传统的基于dom的方法更健壮，并且为提取任务规范开辟了新的可能性。

引用次数: 23

Information Management System Using Structure Analysis of Paper/Electronic Documents and Its Applications 基于结构分析的纸质/电子文档信息管理系统及其应用

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.144

M. Seki, Masakazu Fujio, T. Nagasaki, Hiroshi Shinjo, K. Marukawa

An information management system using analyzing document structure is presented. The purpose is simultaneous management of information in various paper and electronic documents. The system contains image document analysis, PDF document analysis, and HTML document analysis. The two applications are presented and the developed prototypes are described. One application is document summarization. The other application is table understanding to correlate data to items.

提出了一种基于文档结构分析的信息管理系统。目的是同时管理各种纸质和电子文档中的信息。该系统包括图像文档分析、PDF文档分析和HTML文档分析。介绍了这两种应用，并对开发的样机进行了描述。一个应用是文档摘要。另一个应用程序是表理解，将数据与项关联起来。

引用次数: 4

Computer Assisted Transcription of Handwritten Text Images 手写文本图像的计算机辅助转录

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.86

A. Toselli, Verónica Romero, Luis Rodríguez, E. Vidal

To date, automatic handwriting recognition systems are far from being perfect and often they need a post editing where a human intervention is required to check and correct the results of such systems. We propose to have a new interactive, on-line framework which, rather than full automation, aims at assisting the human in the proper recognition- transcription process; that is, facilitate and speed up their transcription task of handwritten texts. This framework combines the efficiency of automatic handwriting recognition systems with the accuracy of the human transcriptor. The best result is a cost-effective perfect transcription of the handwriting text images.

到目前为止，自动手写识别系统还远远不够完善，通常需要后期编辑，需要人工干预来检查和纠正这些系统的结果。我们建议建立一个新的交互式在线框架，而不是完全自动化，旨在帮助人类进行适当的识别-转录过程;也就是说，方便和加快他们抄写手写文本的任务。该框架结合了自动手写识别系统的效率和人类转录器的准确性。最好的结果是具有成本效益的手写文本图像的完美转录。

引用次数: 50

Direction Code Based Features for Recognition of Online Handwritten Characters of Bangla 基于方向码特征的孟加拉语在线手写体识别

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.100

U. Bhattacharya, B. K. Gupta, S. K. Parui

In the present article, we describe a novel direction code based feature extraction approach for recognition of online Bangla handwritten basic characters. We have implemented the proposed approach on a database of 7043 online handwritten Bangla (a major script of the Indian subcontinent) character samples, which has been developed by us. This is a 50-class recognition problem and we achieved 93.90% and 83.61% recognition accuracies respectively on its training and test sets.

在本文中，我们描述了一种新的基于方向码的特征提取方法，用于在线孟加拉语手写基本字符的识别。我们已经在我们开发的7043个在线手写孟加拉语(印度次大陆的一种主要文字)字符样本数据库上实现了所提出的方法。这是一个50类的识别问题，我们在训练集和测试集上分别达到了93.90%和83.61%的识别准确率。

引用次数: 86

Off-Line Handwritten Character Recognition of Devnagari Script Devnagari文字的离线手写字符识别

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.189

U. Pal, N. Sharma, T. Wakabayashi, F. Kimura

In this paper we present a system towards the recognition of off-line handwritten characters of Devnagari, the most popular script in India. The features used for recognition purpose are mainly based on directional information obtained from the arc tangent of the gradient. To get the feature, at first, a 2times2 mean filtering is applied 4 times on the gray level image and a non-linear size normalization is done on the image. The normalized image is then segmented to 49times49 blocks and a Roberts filter is applied to obtain gradient image. Next, the arc tangent of the gradient (direction of gradient) is initially quantized into 32 directions and the strength of the gradient is accumulated with each of the quantized direction. Finally, the blocks and the directions are down sampled using Gaussian filter to get 392 dimensional feature vector. A modified quadratic classifier is applied on these features for recognition. We used 36172 handwritten data for testing our system and obtained 94.24% accuracy using 5-fold cross-validation scheme.

在本文中，我们提出了一个识别印度最流行的手写体Devnagari的离线手写字符的系统。用于识别目的的特征主要是基于从梯度的弧切线中获得的方向信息。为了得到特征，首先对灰度图像进行4次2times2均值滤波，并对图像进行非线性尺寸归一化。然后将归一化后的图像分割为49次49块，并应用罗伯茨滤波器得到梯度图像。接下来，将梯度的切弧(梯度方向)初始量化为32个方向，并在每个量化方向上累积梯度的强度。最后，利用高斯滤波对分块和方向进行下采样，得到392维特征向量。利用改进的二次分类器对这些特征进行识别。我们使用了36172个手写数据来测试我们的系统，使用5倍交叉验证方案获得了94.24%的准确率。

引用次数: 127

A Suffix Tree Based Handwritten Chinese Address Recognition System 基于后缀树的手写体中文地址识别系统

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Pub Date : 2007-09-23 DOI: 10.1109/ICDAR.2007.36

Y. Jiang, X. Ding, Z. Ren

The main contribution of the paper is that it presents a suffix tree based data structure for automatic handwritten Chinese address reading. Since lots of papers have discussed the destination address block (DAB) location for Chinese, we will not extend it in this paper. Instead, we pay more attention to improve the address matching performance after DAB location. As some conventional methods, the extracted text lines are pre-segmented into a series of radicals. We then build a hierarchical structure of sub-strings from the recognized characters of valid radical combinations. Coarse address candidates are selected at the same time. In address maching, we incorporate postcode information to filter redundant addresses. The pre- segmented radicals are compared with candidate address and a cost function combining recognition and structrual cost is evaluated for final decision. In the system, character segmentation, recognition, string searching and matching are considered synchronously by taking advantage of lexicon knowledge. Suffix tree can greatly facilitate the substring generation process and enable the matching process to start from any character to collect potentially bitty information. Therefore, our algorithms is more robust to the intervening noises and irregular writing styles. Finallly, we test 1,000 handwritten Chinese envelopes and achieve a correct rate of 85.30% in 3.0 seconds per mail averagely.

本文的主要贡献在于提出了一种基于后缀树的手写体中文地址自动读取数据结构。由于已有大量文献对中文的目的地址块(DAB)定位进行了讨论，本文将不作进一步的介绍。因此，我们更关注的是如何提高DAB定位后的地址匹配性能。与传统方法一样，将提取的文本行预先分割成一系列的词根。然后，我们从有效自由基组合的识别字符中构建子字符串的层次结构。同时选取粗地址候选者。在地址处理中，我们结合邮政编码信息来过滤冗余地址。将预分割的基与候选地址进行比较，并将识别和结构代价相结合的代价函数进行评估，以做出最终决定。该系统利用词汇知识，将字符分割、识别、字符串搜索和匹配同步进行。后缀树可以极大地简化子字符串的生成过程，使匹配过程可以从任何字符开始收集潜在的位信息。因此，我们的算法对干扰噪声和不规则书写风格具有更强的鲁棒性。最后，我们测试了1000个手写中文信封，平均每封邮件3.0秒的正确率达到了85.30%。

{"title":"A Suffix Tree Based Handwritten Chinese Address Recognition System","authors":"Y. Jiang, X. Ding, Z. Ren","doi":"10.1109/ICDAR.2007.36","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.36","url":null,"abstract":"The main contribution of the paper is that it presents a suffix tree based data structure for automatic handwritten Chinese address reading. Since lots of papers have discussed the destination address block (DAB) location for Chinese, we will not extend it in this paper. Instead, we pay more attention to improve the address matching performance after DAB location. As some conventional methods, the extracted text lines are pre-segmented into a series of radicals. We then build a hierarchical structure of sub-strings from the recognized characters of valid radical combinations. Coarse address candidates are selected at the same time. In address maching, we incorporate postcode information to filter redundant addresses. The pre- segmented radicals are compared with candidate address and a cost function combining recognition and structrual cost is evaluated for final decision. In the system, character segmentation, recognition, string searching and matching are considered synchronously by taking advantage of lexicon knowledge. Suffix tree can greatly facilitate the substring generation process and enable the matching process to start from any character to collect potentially bitty information. Therefore, our algorithms is more robust to the intervening noises and irregular writing styles. Finallly, we test 1,000 handwritten Chinese envelopes and achieve a correct rate of 85.30% in 3.0 seconds per mail averagely.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122396803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 10

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀