Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.最新文献

英文中文

Recognition of arrows in line drawings based on the aggregation of geometric criteria using the Choquet integral 利用Choquet积分基于几何准则的集合来识别线条图中的箭头

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227677

L. Wendling, S. Tabbone

A new way to detect arrows in line drawings is proposed in this paper. Our approach is based on the definition of the structure of such a symbol. Signatures of angular areas are computed and axiomatic properties and geometric characteristics are checked using the Choquet integral. Finally an experimental application on line-drawing documents shows the interest of our approach.

本文提出了一种新的线条图中箭头的检测方法。我们的方法是基于这样一个符号的结构的定义。计算了角区域的特征，并用Choquet积分检验了角区域的公理性质和几何特征。最后，在线条绘制文档上的实验应用表明了我们方法的有趣之处。

引用次数: 10

Towards a ptolemaic model for OCR OCR的托勒密模型

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227819

S. Veeramachaneni, G. Nagy

In style-constrained classification often there are onlya few samples of each style and class, and the correspondencesbetween styles in the training set and the test setare unknown. To avoid gross misestimates of the classifierparameters it is therefore important to model the patterndistributions accurately. We offer empirical evidence for intuitivelyappealing assumptions, in feature spaces appropriatefor symbolic patterns, for (1) tetrahedral configurationsof class means that suggests linear style-adaptive classification,(2) improved estimates of classification boundariesby taking into account the asymmetric configuration of thepatterns with respect to the directions toward other classes,and (3) pattern-correlated style variability.

在风格约束的分类中，通常每种风格和类别只有几个样本，并且训练集和测试集中风格之间的对应关系是未知的。为了避免对分类器参数的严重错误估计，因此准确地建模模式分布是很重要的。在适合符号模式的特征空间中，我们为直观吸引人的假设提供了经验证据，用于(1)类意味的四面体配置表明线性风格自适应分类，(2)通过考虑模式相对于其他类的方向的不对称配置来改进分类边界的估计，以及(3)模式相关的风格可变性。

引用次数: 6

Writer identification using edge-based directional features 使用基于边缘的方向特征的写作者识别

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227797

M. Bulacu, Lambert Schomaker, L. Vuurpijl

This paper evaluates the performance of edge-based directionalprobability distributions as features in writer identificationin comparison to a number of non-angular features.It is noted that the joint probability distribution of theangle combination of two "hinged" edge fragments outperformsall other individual features. Combining features mayimprove the performance. Limitations of the method pertainto the amount of handwritten material needed in orderto obtain reliable distribution estimates. The global featurestreated in this study are sensitive to major style variation(upper- vs lower case), slant, and forged styles, whichnecessitates the use of other features in realistic forensicwriter identification procedures.

本文评估了基于边缘的方向概率分布作为特征在作家识别中的性能，并与许多非角度特征进行了比较。值得注意的是，两个“铰接”边缘碎片的角度组合的联合概率分布优于所有其他单独的特征。结合功能可以提高性能。该方法的局限性在于为了获得可靠的分布估计所需的手写材料的数量。本研究处理的整体特征对主要风格变化(大写与小写)、倾斜和伪造风格很敏感，这就需要在现实的法医鉴定过程中使用其他特征。

引用次数: 198

Automated detection and segmentation of table of contents page from document images 自动检测和分割目录页从文档图像

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227697

Sekhar Mandal, S. Chowdhury, A. Das, B. Chanda

With an aim to extract the structural information from the table of contents (TOC) to help develop a digital document library, the requirement of identifying/segmenting the TOC page is obvious. The objective to create a digital document library is to provide a non-labour intensive, cheap and flexible way of storing, representing and managing the paper document in electronic form to facilitate indexing, viewing, printing and extracting the intended portions. Information from the TOC pages is to be extracted for use in a document database for effective retrieval of the required pages. We present a fully automatic identification and segmentation of a table of contents (TOC) page from a scanned document.

为了从目录(TOC)中提取结构信息，以帮助开发数字文档库，对TOC页面进行识别/分割的需求是显而易见的。建立数码文件图书馆的目的，是提供一种不需要耗费大量人力、廉价和灵活的方式，以电子形式储存、表示和管理纸质文件，方便索引、浏览、打印和提取所需的部分。从TOC页面中提取的信息将用于文档数据库中，以便有效地检索所需的页面。我们提出了一个完全自动识别和分割目录(TOC)页从扫描文档。

引用次数: 34

Video text recognition using feature compensation as category-dependent feature extraction 基于特征补偿的视频文本识别分类相关特征提取

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227741

M. Mori

When recognizing multiple fonts, geometric features,such as the directional information of strokes, are generallyrobust against deformation but are weak against degradation.This paper describes a category-dependent feature extractionmethod that uses a feature compensation techniqueto overcome this weakness. Our proposed method estimatesthe degree of degradation of an input pattern by comparingthe input pattern and the template of each category. Thisestimation enables us to compensate the degradation in featurevalues. We apply the proposed method to the recognitionof video text suffering from degradation and deformation.Recognition experiments using characters extractedfrom videos show that the proposed method is superior tothe conventional alternatives in resisting degradation.

在识别多种字体时，几何特征(如笔画的方向信息)通常对变形具有鲁棒性，但对退化则较弱。本文提出了一种基于分类的特征提取方法，利用特征补偿技术克服了这一缺点。我们提出的方法通过比较输入模式和每个类别的模板来估计输入模式的退化程度。这种估计使我们能够补偿特征值的退化。我们将该方法应用于有退化和变形的视频文本的识别。利用视频中提取的字符进行识别实验，结果表明该方法在抗退化方面优于传统方法。

引用次数: 9

Features for word spotting in historical manuscripts 历史手稿中的单词点错功能

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227662

T. Rath, R. Manmatha

For the transition from traditional to digital libraries, the large number of handwritten manuscripts that exist pose a great challenge. Easy access to such collections requires an index, which is currently created manually at great cost. Because automatic handwriting recognizers fail on historical manuscripts, the word spotting technique has been developed: the words in a collection are matched as images and grouped into clusters which contain all instances of the same word. By annotating "interesting" clusters, an index that links words to the locations where they occur can be built automatically. Due to the noise in historical documents, selecting the right features for matching words is crucial. We analyzed a range of features suitable for matching words using dynamic time warping (DTW), which aligns and compares sets of features extracted from two images. Each feature's individual performance was measured on a test set. With an average precision of 72%, a combination of features outperforms competing techniques in speed and precision.

对于传统图书馆向数字图书馆的过渡来说，现存的大量手写体手稿构成了巨大的挑战。方便地访问这样的集合需要索引，而目前手工创建索引的成本很高。由于自动手写识别器无法识别历史手稿，因此人们开发了单词识别技术:将集合中的单词作为图像进行匹配，并将其分组为包含同一单词的所有实例的簇。通过标注“有趣的”集群，可以自动建立一个索引，将单词链接到它们出现的位置。由于历史文献中存在噪声，选择合适的特征进行匹配至关重要。我们使用动态时间扭曲(DTW)分析了一系列适合匹配单词的特征，DTW对从两幅图像中提取的特征集进行对齐和比较。在一个测试集上测量每个特征的单独性能。平均精度为72%，这些特征的组合在速度和精度上优于竞争对手的技术。

引用次数: 255

ICDAR 2003 robust reading competitions ICDAR 2003强大的阅读比赛

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227749

S. Lucas, A. Panaretos, Luis Sosa, Anthony Tang, Shirley Wong, Robert Young

This paper describes the robust reading competitions forICDAR 2003. With the rapid growth in research over thelast few years on recognizing text in natural scenes, thereis an urgent need to establish some common benchmarkdatasets, and gain a clear understanding of the current stateof the art. We use the term robust reading to refer to text imagesthat are beyond the capabilities of current commercialOCR packages. We chose to break down the robust readingproblem into three sub-problems, and run competitionsfor each stage, and also a competition for the best overallsystem. The sub-problems we chose were text locating,character recognition and word recognition.By breaking down the problem in this way, we hope togain a better understanding of the state of the art in eachof the sub-problems. Furthermore, our methodology involvesstoring detailed results of applying each algorithm toeach image in the data sets, allowing researchers to study indepth the strengths and weaknesses of each algorithm. Thetext locating contest was the only one to have any entries.We report the results of this contest, and show cases wherethe leading algorithms succeed and fail.

本文描述了icdar 2003的稳健阅读竞赛。随着近年来自然场景文本识别研究的快速发展，迫切需要建立一些通用的基准数据集，并对当前的技术状况有一个清晰的认识。我们使用稳健读取一词来指超出当前商用ocr软件包能力的文本图像。我们选择将稳健阅读问题分解为三个子问题，并对每个阶段进行比赛，同时也进行最佳整体系统的比赛。我们选择的子问题是文本定位、字符识别和单词识别。通过以这种方式分解问题，我们希望更好地理解每个子问题的现状。此外，我们的方法包括存储将每种算法应用于数据集中的每个图像的详细结果，使研究人员能够深入研究每种算法的优缺点。文本定位比赛是唯一有参赛作品的比赛。我们报告了这次比赛的结果，并展示了领先算法的成功和失败的案例。

{"title":"ICDAR 2003 robust reading competitions","authors":"S. Lucas, A. Panaretos, Luis Sosa, Anthony Tang, Shirley Wong, Robert Young","doi":"10.1109/ICDAR.2003.1227749","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227749","url":null,"abstract":"This paper describes the robust reading competitions forICDAR 2003. With the rapid growth in research over thelast few years on recognizing text in natural scenes, thereis an urgent need to establish some common benchmarkdatasets, and gain a clear understanding of the current stateof the art. We use the term robust reading to refer to text imagesthat are beyond the capabilities of current commercialOCR packages. We chose to break down the robust readingproblem into three sub-problems, and run competitionsfor each stage, and also a competition for the best overallsystem. The sub-problems we chose were text locating,character recognition and word recognition.By breaking down the problem in this way, we hope togain a better understanding of the state of the art in eachof the sub-problems. Furthermore, our methodology involvesstoring detailed results of applying each algorithm toeach image in the data sets, allowing researchers to study indepth the strengths and weaknesses of each algorithm. Thetext locating contest was the only one to have any entries.We report the results of this contest, and show cases wherethe leading algorithms succeed and fail.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132157086","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}

引用次数: 618

Lexical post-processing optimization for handwritten word recognition 手写体单词识别的词法后处理优化

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227711

S. Carbonnel, É. Anquetil

This paper presents a lexical post-processing optimization for handwritten word recognition. The aim of this work is to explore the combination of different lexical post-processing approaches in order to optimize the recognition rate, the recognition time and memory requirements. The present method focuses on the following tasks: a lexicon organization with word filtering, based on holistic word features to deal with large vocabulary (creation of static sublexicon compressed in a tree structure); a dedicated string matching algorithm for online handwriting (to compensate for the recognition and the segmentation errors); and a specific exploration strategy of the results provided by the analytical word recognition process. Experimental results are reported using several lexicon sizes (about 1000, 7000 and 25000 entries) to evaluate different optimization strategies according to the recognition rate, computational cost and memory requirements.

提出了一种用于手写体单词识别的词法后处理优化方法。本研究的目的是探索不同词法后处理方法的组合，以优化识别率、识别时间和记忆要求。本方法主要关注以下任务:基于整体词特征的词过滤词典组织，以处理大型词汇表(创建压缩为树结构的静态子词典);用于在线手写的专用字符串匹配算法(补偿识别和分割错误);并对结果提供了具体的探索策略，分析了单词识别过程。根据识别率、计算成本和内存需求，使用不同的词典大小(约1000、7000和25000个条目)来评估不同的优化策略。

引用次数: 19

A case restoration approach to named entity tagging in degraded documents 退化文档中命名实体标记的案例恢复方法

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227756

R. Srihari, Cheng Niu, W. Li, Jihong Ding

This paper describes a novel approach to namedentity (NE) tagging on degraded documents. NE taggingis the process of identifying salient text strings inunstructured text, corresponding to names of people,places, organizations, times/dates, etc. Although NEtagging is typically part of a larger informationextraction process, it has other applications, such asimproving search in an information retrieval system, andpost-processing the results of an OCR system. We focuson degraded documents, i.e. case insensitive documentsthat lack orthographic information. Examples includeoutput of speech recognition systems, as well as e-mail.The traditional approach involves retraining an NEtagger on degraded text, a cumbersome operation. Thispaper describes an approach whereby text is first"restored" to its implicit case sensitive form, andsubsequently processed by the original NE tagger.Results show that this new approach leads to far lessprecision loss in NE tagging of degraded documents.

本文描述了一种在退化文档上进行命名实体(NE)标注的新方法。网元标记是在非结构化文本中识别显著文本字符串的过程，对应于人名、地点、组织、时间/日期等。虽然NEtagging通常是较大的信息提取过程的一部分，但它还有其他应用，例如改进信息检索系统中的搜索，以及对OCR系统的结果进行后处理。我们关注退化文档，即缺乏正字法信息的不区分大小写的文档。例子包括语音识别系统的输出，以及电子邮件。传统的方法涉及对退化文本重新训练NEtagger，这是一个繁琐的操作。本文描述了一种方法，即文本首先“恢复”到其隐式区分大小写的形式，然后由原始的NE标注器处理。结果表明，该方法对退化文档的NE标注精度损失较小。

引用次数: 4

Analysis and recognition of Asian scripts-the state of the art 对亚洲文字的分析和识别——艺术的现状

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

Pub Date : 2003-08-03 DOI: 10.1109/ICDAR.2003.1227785

C. Suen, S. Mori, Soohyung Kim, C. Leung

This paper summarizes the research activities of the pastdecade on the recognition of handwritten scripts used inChina, Japan, and Korea. It presents the recognitionmethodologies, features explored, databases used, andclassification schemes investigated. In addition, it includes adescription of the performance of numerous recognitionsystems found in both academic and industrial researchlaboratories. Recent achievements and applications are alsopresented. A list of relevant references is attached togetherwith our remarks on this subject.

本文综述了近十年来中、日、韩三国在手写体识别方面的研究活动。它介绍了识别方法、探索的特征、使用的数据库和调查的分类方案。此外，它还包括在学术和工业研究实验室中发现的许多识别系统的性能描述。介绍了近年来的研究成果和应用情况。随函附上我们对这个问题的评论，附上有关参考资料的清单。

引用次数: 40

首页上一页

下一页尾页

类型

全部化学•材料生命科学医学物理工程技术环境•农林材料科学地球科学法学管理学化学环境科学与生态学计算机科学教育学经济学农林科学人文科学生物学数学物理与天体物理心理学综合性期刊其他工业工程理学历史学农学文学信息工程

数据库

全部 ACS Publications Elsevier ieeexplore Springer The Royal Society of Chemistry Wiley

期刊

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.

全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.

﹀