A character recognizer for Turkish language
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227855
Sait Ulas Korkmaz, G. Kirçiçegi, Y. Akinci, V. Atalay
This paper presents, in particular, a contextual postprocessing subsystem for a Turkish machine-printed character recognition system. The subsystem is based on positional binary 3-gram statistics for the Turkish language, an error-corrector parser, and a lexicon containing root words and their inflected forms. The error-corrector parser corrects character recognition (CR) alternatives using Turkish morphology.
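For illustration only, the sketch below shows one way positional binary 3-gram filtering of recognizer alternatives can be realized, assuming nothing more than a plain word list as the language sample; the paper additionally relies on Turkish-specific statistics and the morphological error-corrector parser, which are not reproduced here.

```python
# Minimal sketch of positional binary 3-gram filtering of recognizer
# candidates. The word list below is a toy stand-in for real language data.
from collections import defaultdict

def build_positional_trigrams(words):
    """Record, for each word position, which 3-grams occur at all (binary)."""
    table = defaultdict(set)
    for w in words:
        padded = f"^{w}$"                       # mark word boundaries
        for i in range(len(padded) - 2):
            table[i].add(padded[i:i + 3])
    return table

def is_plausible(candidate, table):
    """Accept a candidate only if every positional 3-gram has been seen."""
    padded = f"^{candidate}$"
    return all(padded[i:i + 3] in table.get(i, set())
               for i in range(len(padded) - 2))

# Toy usage: filter alternatives produced by a character recognizer.
lexicon = ["ev", "evler", "evlerde", "kitap", "kitaplar"]
trigrams = build_positional_trigrams(lexicon)
for alt in ["evlerde", "evlerdc", "k1tap"]:
    print(alt, is_plausible(alt, trigrams))
```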
{"title":"A character recognizer for Turkish language","authors":"Sait Ulas Korkmaz, G. Kirçiçegi, Y. Akinci, V. Atalay","doi":"10.1109/ICDAR.2003.1227855","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227855","url":null,"abstract":"This paper presents particularly a contextual postprocessing subsystem for a Turkish machine printedcharacter recognition system. The contextual postprocessing subsystem is based on positional binary 3-gram statistics for Turkish language, an error correctorparser and a lexicon, which contains root words and theinflected forms of the root words. Error corrector parseris used for correcting CR alternatives using TurkishMorphology.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"68 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128204457","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Model length adaptation of an HMM based cursive word recognition system
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227642
M. Schambach
On the basis of a well-accepted, HMM-based cursive script recognition system, an algorithm is proposed that automatically adapts the length of the models representing letter writing variants. An average improvement in recognition performance of about 2.72 percent was obtained. Two initialization methods for the algorithm have been tested; they show quite different behaviors, and both prove useful in different application areas. To gain deeper insight into the functioning of the algorithm, a method for visualizing letter HMMs is developed. It confirms the plausibility of most results, but also reveals the limitations of the proposed method; these, however, are mostly due to restrictions imposed by the training and recognition method of the underlying system.
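The paper's adaptation algorithm itself is not reproduced here; purely as a point of reference, the sketch below shows a simpler, commonly used heuristic for choosing the number of states of each letter model from the average length of the feature sequences aligned to that letter.

```python
# Reference heuristic only (not the paper's algorithm): set the number of
# HMM states per letter proportional to the mean aligned sequence length.
import numpy as np

def states_per_letter(aligned_lengths, frames_per_state=3,
                      min_states=1, max_states=12):
    """aligned_lengths: dict mapping a letter to a list of frame counts."""
    return {letter: int(np.clip(round(np.mean(lengths) / frames_per_state),
                                min_states, max_states))
            for letter, lengths in aligned_lengths.items()}

# Toy usage with made-up alignment statistics.
print(states_per_letter({"a": [9, 12, 10], "i": [4, 5, 3]}))
```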
{"title":"Model length adaptation of an HMM based cursive word recognition system","authors":"M. Schambach","doi":"10.1109/ICDAR.2003.1227642","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227642","url":null,"abstract":"On the basis of a well accepted, HMM-based cursive script recognition system, an algorithm which automatically adapts the length of the models representing the letter writing variants is proposed. An average improvement in recognition performance of about 2.72 percent could be obtained. Two initialization methods for the algorithm have been tested, which show quite different behaviors; both prove to be useful in different application areas. To get a deeper insight into the functioning of the algorithm a method for the visualization of letter HMMs is developed. It shows the plausibility of most results, but also the limitations of the proposed method. However, these are mostly due to given restrictions of the training and recognition method of the underlying system.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"39 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131995947","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Generation of synthetic training data for an HMM-based handwriting recognition system
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227736
Tamás Varga, H. Bunke
A perturbation model for generating synthetic text lines from existing cursively handwritten lines of text produced by human writers is presented. Our purpose is to improve the performance of an HMM-based off-line cursive handwriting recognition system by providing it with additional synthetic training data. Two kinds of perturbations are applied, geometrical transformations and thinning/thickening operations. The proposed perturbation model is evaluated under different experimental conditions.
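A minimal sketch of the two perturbation families named above, assuming a grayscale text-line image with dark ink on a light background (OpenCV is used here for convenience); the transformation parameters are illustrative and not the paper's values.

```python
# Sketch: geometric transformation (shear + vertical rescaling) followed by
# thickening or thinning of the strokes via grayscale morphology.
import numpy as np
import cv2

def perturb_line(img, shear=0.2, scale_y=1.05, thicken=True):
    h, w = img.shape
    # Geometric part: horizontal shear plus a slight vertical rescaling.
    M = np.float32([[1.0, shear, 0.0],
                    [0.0, scale_y, 0.0]])
    out_size = (int(w + abs(shear) * h), int(h * scale_y) + 1)
    warped = cv2.warpAffine(img, M, out_size,
                            flags=cv2.INTER_LINEAR, borderValue=255)
    # Morphological part: on dark-on-light images, erosion thickens the ink
    # strokes and dilation thins them.
    kernel = np.ones((2, 2), np.uint8)
    return cv2.erode(warped, kernel) if thicken else cv2.dilate(warped, kernel)

# Usage: generate several synthetic variants of one training line
# ("line.png" is a placeholder path).
line = cv2.imread("line.png", cv2.IMREAD_GRAYSCALE)
variants = [perturb_line(line, s, sy, t)
            for s in (-0.2, 0.0, 0.2)
            for sy in (0.95, 1.05)
            for t in (True, False)]
```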
{"title":"Generation of synthetic training data for an HMM-based handwriting recognition system","authors":"Tamás Varga, H. Bunke","doi":"10.1109/ICDAR.2003.1227736","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227736","url":null,"abstract":"A perturbation model for generating synthetic text lines from existing cursively handwritten lines of text produced by human writers is presented. Our purpose is to improve the performance of an HMM-based off-line cursive handwriting recognition system by providing it with additional synthetic training data. Two kinds of perturbations are applied, geometrical transformations and thinning/thickening operations. The proposed perturbation model is evaluated under different experimental conditions.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"342 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124228756","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
User-assisted archive document image analysis for digital library construction
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227715
Jingyu He, A. Downton
A configurable archive document image analysis system for digital library construction has been designed using rapid prototyping and top-down iterative development methods. This approach has proved essential for capturing the curators' expertise about existing card archive structures, content and databases. The design currently achieves about 93% correct segmentation of the required archive card fields overall, with 81.3% of all archive cards in a test set of 2000 images having all fields correctly segmented and labeled. Analysis of errors in the test set indicates that heavily annotated cards and non-standard card formats make up 5-10% of the overall archive, and that a significant proportion of these are unlikely to be resolvable without curatorial intervention.
{"title":"User-assisted archive document image analysis for digital library construction","authors":"Jingyu He, A. Downton","doi":"10.1109/ICDAR.2003.1227715","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227715","url":null,"abstract":"A configurable archive document image analysis system for digital library construction has been designed using rapid prototyping and top-down iterative development methods. This approach has been found to be essential in order to capture the curators' expertise about existing card archive structures, content and databases. The design currently achieves about 93% correct segmentation of the required archive card fields overall, with 81.3% of all archive cards in a testset of 2000 images having all fields correctly segmented and labeled. Analysis of errors in the testset indicates that heavily-annotated cards and non-standard card formats comprise 5-10% of the overall archive, and a significant proportion of these are unlikely to be resolvable without curatorial intervention.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127797774","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Texture feature characterization for logical pre-labeling
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227728
B. Allier, J. Duong, Antoine Gagneux, Pierre Mallet, H. Emptoz
In this article we present a study based on the use of texture features for logical pre-labeling. The aim of our work is to compute a large number of texture features over three sets of machine-printed document images and to study their joint discriminant power using SVM classifiers. The three corpora we use are the Archives of Savoie (AoS), composed of strongly structured documents; a subset of the UW3 database; and a third, composed of Web site images, that is not structured at all. The originality of our contribution is to bring together various methods that have been used for many years in our domain and to test them on documents with very different characteristics.
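For illustration only, the sketch below computes a handful of simple block-level texture descriptors and trains an SVM on them; the paper evaluates a far larger feature set, and the block extraction step and the corpora are assumed to be available.

```python
# Toy texture descriptors per block plus an SVM classifier for pre-labeling.
import numpy as np
from sklearn.svm import SVC

def texture_features(block):
    """block: 2-D array with 1 for ink pixels and 0 for background."""
    b = block.astype(np.float32)
    density = b.mean()                              # ink coverage
    h_trans = np.abs(np.diff(b, axis=1)).mean()     # horizontal transitions
    v_trans = np.abs(np.diff(b, axis=0)).mean()     # vertical transitions
    row_var = b.mean(axis=1).var()                  # text-line structure cue
    return np.array([density, h_trans, v_trans, row_var])

def train_prelabeler(blocks, labels):
    """blocks: list of binary arrays; labels: e.g. 'text', 'title', 'image'."""
    X = np.stack([texture_features(b) for b in blocks])
    clf = SVC(kernel="rbf", C=10.0, gamma="scale")
    clf.fit(X, labels)
    return clf
```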
{"title":"Texture feature characterization for logical pre-labeling","authors":"B. Allier, J. Duong, Antoine Gagneux, Pierre Mallet, H. Emptoz","doi":"10.1109/ICDAR.2003.1227728","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227728","url":null,"abstract":"In this article we present a study based on the use of texture features for logical pre-labeling. The aim of our work is to calculate a great number of texture features over three sets of machine-printed document images and to study their joint discriminant power using SVM classifiers. The three corpuses we use are: the Archives of Savoie (AoS), composed of strongly structured documents, a subset of the UW3 database, and a third that is not structured at all, since it is composed of Web site images. The originality of our contribution is to sum up various methods that have been used for many years in our domain, and to test them on documents having very different specificities.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"18 3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129116163","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Accelerating large character set recognition using pivots
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227670
Yiping Yang, Ondrej Velek, M. Nakagawa
This paper proposes a method to accelerate character recognition over a large character set by introducing pivots into the search space. We divide the feature space of character categories into smaller clusters and take the centroid of each cluster as a pivot. A given input pattern is compared with all the pivots, and only a limited number of clusters whose pivots have higher similarities (or smaller distances) to the input pattern are searched, thereby accelerating recognition. This rests on the assumption that the search space is a distance space. The method has been applied to pre-classification in a practical off-line Japanese character recognizer: pre-classification time is reduced to 61% while the pre-classification recognition rate within the top 40 candidates stays at the original 99.6%, and total recognition time is reduced to 70% of the original without sacrificing the recognition rate at all. If we let the pre-classification rate drop from 99.6% to 97.7%, pre-classification time is reduced to 35% and total recognition time to 51.5%, with the recognition rate falling from 98.3% to 96.3%.
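The sketch below illustrates the pivot idea, under the assumption that each character category is represented by a single mean feature vector; the cluster count, Euclidean distance, and candidate limit of 40 are illustrative choices.

```python
# Pivot-based pre-classification sketch: cluster the class means, keep the
# cluster centroids as pivots, and at run time search only the classes in
# the clusters whose pivots are nearest to the input.
import numpy as np
from sklearn.cluster import KMeans

class PivotPreclassifier:
    def __init__(self, class_means, class_labels, n_clusters=64):
        self.means = np.asarray(class_means)
        self.labels = np.asarray(class_labels)
        self.km = KMeans(n_clusters=n_clusters, n_init=10).fit(self.means)
        self.pivots = self.km.cluster_centers_

    def candidates(self, x, n_search=4, top_k=40):
        d_pivot = np.linalg.norm(self.pivots - x, axis=1)   # compare with pivots only
        nearest = np.argsort(d_pivot)[:n_search]            # clusters to search
        mask = np.isin(self.km.labels_, nearest)
        d = np.linalg.norm(self.means[mask] - x, axis=1)
        return self.labels[mask][np.argsort(d)[:top_k]]

# Toy usage with random "class mean" vectors for 3000 categories.
rng = np.random.default_rng(0)
means = rng.normal(size=(3000, 64))
pre = PivotPreclassifier(means, np.arange(3000))
print(pre.candidates(means[42])[:5])
```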
{"title":"Accelerating large character set recognition using pivots","authors":"Yiping Yang, Ondrej Velek, M. Nakagawa","doi":"10.1109/ICDAR.2003.1227670","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227670","url":null,"abstract":"This paper proposes a method to accelerate character recognition of a large character set by employing pivots into the search space. We divide the feature space of character categories into smaller clusters and derive the centroid of each cluster as a pivot. Given an input pattern, it is compared with all the pivots and only a limited number of clusters whose pivots have higher similarities (or smaller distances) to the input pattern are searched for with the result that we can accelerate the recognition speed. This is based on the assumption that the search space is a distance space. The method has been applied to pre-classification of a practical off-line Japanese character recognizer with the result that the pre-classification time is reduced to 61 % while keeping its pre-classification recognition rate up to 40 candidates as the same as the original 99.6% and the total recognition time is reduced to 70% of the original time without sacrificing the recognition rate at all. If we sacrifice the pre-classification rate from 99.6% to 97.7%, then its time is reduced to 35% and the total recognition time is reduced to 51.5% with recognition rate as 96.3% from 98.3%.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"538 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116336252","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Visualizing multimedia content on paper documents: components of key frame selection for Video Paper
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227695
J. Hull, B. Erol, J. Graham, Dar-Shyang Lee
The components of a key frame selection algorithm for a paper-based multimedia browsing interface called Video Paper are described. Analysis of video image frames is combined with the results of processing the closed caption to select key frames that are printed on a paper document together with the closed caption. Bar codes positioned near the key frames allow a user to play the video from the corresponding times. This paper describes several component techniques that are being investigated for key frame selection in the Video Paper system, including face detection and text recognition. The Video Paper system implementation is also discussed.
{"title":"Visualizing multimedia content on paper documents: components of key frame selection for Video Paper","authors":"J. Hull, B. Erol, J. Graham, Dar-Shyang Lee","doi":"10.1109/ICDAR.2003.1227695","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227695","url":null,"abstract":"The components of a key frame selection algorithm for a paper-based multimedia browsing interface called Video Paper are described. Analysis of video image frames is combined with the results of processing the closed caption to select key frames that are printed on a paper document together with the closed caption. Bar codes positioned near the key frames allow a user to play the video from the corresponding times. This paper describes several component techniques that are being investigated for key frame selection in the Video Paper system, including face detection and text recognition. The Video Paper system implementation is also discussed.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"9 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116852883","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Proper names extraction from fax images combining textual and image features
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227724
Laurence Likforman-Sulem, Pascal Vaillant, François Yvon
Within a unified messaging system, a crucial task is to provide the user with key information about every message received, such as keywords reflecting the subject of the message or the name of the sender. In the case of facsimiles, however, this information is not as easy to detect as in e-mails, since no standard headers are defined. The aim of the presented work is to identify and extract specific information (the name of the sender) from a fax cover page. For this purpose, document image analysis methods (OCR, physical block selection) and text analysis methods (optimized dictionary lookup, local grammar rules) are implemented to work in parallel. The fusion of their results yields a more accurate guess than any of the methods would achieve separately.
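Only the fusion step is sketched below, and only schematically: each detector is assumed to return scored sender-name hypotheses, which are combined by a weighted sum; the detectors themselves and the weights are placeholders, not the paper's.

```python
# Schematic fusion of two hypothesis lists (image-based and text-based).
def fuse(image_hyps, text_hyps, w_image=0.4, w_text=0.6):
    scores = {}
    for name, s in image_hyps:
        scores[name] = scores.get(name, 0.0) + w_image * s
    for name, s in text_hyps:
        scores[name] = scores.get(name, 0.0) + w_text * s
    return max(scores, key=scores.get) if scores else None

# Toy usage: the name supported by both sources wins.
print(fuse([("Dupont", 0.7), ("Durand", 0.5)], [("Dupont", 0.6)]))
```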
{"title":"Proper names extraction from fax images combining textual and image features","authors":"Laurence Likforman-Sulem, Pascal Vaillant, François Yvon","doi":"10.1109/ICDAR.2003.1227724","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227724","url":null,"abstract":"In the frame of a unified messaging system, a crucial task of the system is to provide the user with key information on every message received, like keywords reflecting the object of the message, or the name of the sender. However, in the case of facsimiles, this information is not as easy to detect as in the case of e-mails, since no standard headers are defined. The aim of the presented work is to identify and extract specific information (the name of the sender) from a fax cover page. For this purpose, methods based on image document analysis (OCR recognition, physical blocks selection), and text analysis methods (optimized dictionary lookup, local grammar rules), are implemented to work in parallel. The fusion of their results brings a more accurate guess than any of the methods would achieve separately.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117180630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Discerning structure from freeform handwritten notes
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227628
Michael Shilman, Zile Wei, Sashi Raghupathy, P. Simard, David Jones
This paper presents an integrated approach to parsing textual structure in freeform handwritten notes. Text-graphics classification and text layout analysis are classical problems in printed document analysis, but the irregularity in handwriting and content in freeform notes reveals limitations in existing approaches. We advocate an integrated technique that solves the layout analysis and classification problems simultaneously: the problems are so tightly coupled that it is not possible to solve one without the other for real user notes. We tune and evaluate our approach on a large corpus of unscripted user files and reflect on the difficult recognition scenarios that we have encountered in practice.
{"title":"Discerning structure from freeform handwritten notes","authors":"Michael Shilman, Zile Wei, Sashi Raghupathy, P. Simard, David Jones","doi":"10.1109/ICDAR.2003.1227628","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227628","url":null,"abstract":"This paper presents an integrated approach to parsing textual structure in freeform handwritten notes. Text-graphics classification and text layout analysis are classical problems in printed document analysis, but the irregularity in handwriting and content in freeform notes reveals limitations in existing approaches. We advocate an integrated technique that solves the layout analysis and classification problems simultaneously: the problems are so tightly coupled that it is not possible to solve one without the other for real user notes. We tune and evaluate our approach on a large corpus of unscripted user files and reflect on the difficult recognition scenarios that we have encountered in practice.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"16 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128032035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Handwriting recognition using position sensitive letter n-gram matching
Pub Date: 2003-08-03 | DOI: 10.1109/ICDAR.2003.1227730
A. El-Nasan, S. Veeramachaneni, G. Nagy
We propose a further improvement of a handwriting recognition method that avoids segmentation while being able to recognize words never before seen in handwritten form. The method is based on the fact that few pairs of English words share exactly the same set of letter bigrams, and even fewer share longer n-grams. The lexical n-gram matches between every word in a lexicon and a set of reference words can be precomputed. A position-based match function then detects the matches between the handwritten signal of a query word and each reference word. We show that, with a reasonable set of reference words, recognition of lexicon words exceeds 90%.
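A toy sketch of the lexical half of the idea: for every lexicon word we precompute which reference words it shares at least one letter bigram with, and the resulting profile serves as a lookup key. Positions and longer n-grams, as well as the signal-level matching against the handwriting, are omitted.

```python
# Precompute bigram-sharing profiles between lexicon and reference words.
def bigrams(word):
    return {word[i:i + 2] for i in range(len(word) - 1)}

def match_profile(word, references):
    """True/False per reference: does `word` share any bigram with it?"""
    wb = bigrams(word)
    return tuple(bool(wb & bigrams(r)) for r in references)

def build_index(lexicon, references):
    index = {}
    for w in lexicon:
        index.setdefault(match_profile(w, references), []).append(w)
    return index

# Toy usage: few words share the same profile, so it acts as a near-unique key.
refs = ["north", "water", "light", "sound"]
print(build_index(["northern", "lighting", "soundly", "watered"], refs))
```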
{"title":"Handwriting recognition using position sensitive letter n-gram matching","authors":"A. El-Nasan, S. Veeramachaneni, G. Nagy","doi":"10.1109/ICDAR.2003.1227730","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227730","url":null,"abstract":"We propose further improvement of a handwriting recognition method that avoids segmentation while able to recognize words that were never seen before in handwritten form. This method is based on the fact that few pairs of English words share exactly the same set of letter bigrams and even fewer share longer n-grams. The lexical n-gram matches between every word in a lexicon and a set of reference words can be precomputed. A position-based match function then detects the matches between the handwritten signal of a query word and each reference word. We show that with a reasonable set of reference words, the recognition of lexicon words exceeds 90%.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"261 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124272716","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}