首页 > 最新文献

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)最新文献

英文 中文
Logical Structure Analysis for Form Images with Arbitrary Layout by Belief Propagation 基于信念传播的任意布局表单图像逻辑结构分析
A. Minagawa, Y. Fujii, Hiroaki Takebe, K. Fujimoto
A new method for analyzing the specific logical structure of forms with unknown layout is proposed. This method uses both the target form image and a generic logical structure as inputs, and models two types of relationships probabilistically: that between strings and logical components, and that between neighboring strings having different logical components. This modeling approach allows strings to be assigned to logical components softly but robustly, and allows the use of an intuitive Bayesian probability network similar to the generic logical structure. Based on this probability network model, strings corresponding to logical components can be determined by belief propagation. This method is demonstrated to be effective by conducting tests on three types of forms.
提出了一种分析未知布局形式具体逻辑结构的新方法。该方法使用目标形式图像和通用逻辑结构作为输入,并对两种类型的关系进行概率建模:字符串和逻辑组件之间的关系,以及具有不同逻辑组件的相邻字符串之间的关系。这种建模方法允许将字符串温和但健壮地分配给逻辑组件,并允许使用类似于通用逻辑结构的直观贝叶斯概率网络。在此概率网络模型的基础上,通过信念传播确定逻辑组件对应的字符串。通过对三种类型的表单进行测试,证明了这种方法的有效性。
{"title":"Logical Structure Analysis for Form Images with Arbitrary Layout by Belief Propagation","authors":"A. Minagawa, Y. Fujii, Hiroaki Takebe, K. Fujimoto","doi":"10.1109/ICDAR.2007.162","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.162","url":null,"abstract":"A new method for analyzing the specific logical structure of forms with unknown layout is proposed. This method uses both the target form image and a generic logical structure as inputs, and models two types of relationships probabilistically: that between strings and logical components, and that between neighboring strings having different logical components. This modeling approach allows strings to be assigned to logical components softly but robustly, and allows the use of an intuitive Bayesian probability network similar to the generic logical structure. Based on this probability network model, strings corresponding to logical components can be determined by belief propagation. This method is demonstrated to be effective by conducting tests on three types of forms.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129885483","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An SVM-Based High-accurate Recognition Approach for Handwritten Numerals by Using Difference Features 基于svm的差分特征手写体数字高精度识别方法
Kaizhu Huang, Jun Sun, Y. Hotta, K. Fujimoto, S. Naoi
Handwritten numeral recognition is an important pattern recognition task. It can be widely used in various domains, e.g., bank money recognition, which requires a very high recognition rate. As a state-of-the-art classifier, support vector machine (SVM), has been extensively used in this area. Typically, SVM is trained in a batch model, i.e., all data points are simultaneously input for training the classification boundary. However, some slightly exceptional data, only accounting for a small proportion, are critical for the recognition rates. Training a classifier among all the data may possibly treat such legal but slightly exceptional samples as "noise ". In this paper, we propose a novel approach to attack this problem. This approach exploits a two-stage framework by using difference features. In the first stage, a regular SVM is trained on all the training data; in the second stage, only the samples misclassified in the first stage are specially considered. Therefore, the performance can be lifted. The number of misclassifications is often small because of the good performance of SVM. This will present difficulties in training an accurate SVM engine only for these misclassified samples. We then further propose a multi-way to binary approach using difference features. This approach successfully transforms multi-category classification to binary classification and expands the training samples greatly. In order to evaluate the proposed method, experiments are performed on 10,000 handwritten numeral samples extracted from real banks forms. This new algorithm achieves 99.0% accuracy. In comparison, the traditional SVM only gets 98.4%.
手写体数字识别是一项重要的模式识别任务。它可以广泛应用于各个领域,例如对识别率要求很高的银行货币识别。支持向量机作为一种最新的分类器,在这一领域得到了广泛的应用。SVM通常采用批处理模型进行训练,即同时输入所有数据点训练分类边界。然而,一些稍微异常的数据,只占很小的比例,对识别率至关重要。在所有数据中训练分类器可能会将这些合法但略有例外的样本视为“噪声”。在本文中,我们提出了一种新的方法来解决这个问题。这种方法通过使用差异特征来利用两阶段框架。第一阶段,在所有训练数据上训练一个正则支持向量机;在第二阶段,只考虑第一阶段误分类的样本。因此,性能可以提升。由于支持向量机的良好性能,错误分类的数量往往很少。这将给只针对这些错误分类的样本训练准确的SVM引擎带来困难。然后,我们进一步提出了一种利用差分特征的多向二值化方法。该方法成功地将多类别分类转化为二值分类,极大地扩展了训练样本。为了评估所提出的方法,对从真实银行表格中提取的10,000个手写数字样本进行了实验。该算法的准确率达到99.0%。相比之下,传统支持向量机的准确率只有98.4%。
{"title":"An SVM-Based High-accurate Recognition Approach for Handwritten Numerals by Using Difference Features","authors":"Kaizhu Huang, Jun Sun, Y. Hotta, K. Fujimoto, S. Naoi","doi":"10.1109/ICDAR.2007.57","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.57","url":null,"abstract":"Handwritten numeral recognition is an important pattern recognition task. It can be widely used in various domains, e.g., bank money recognition, which requires a very high recognition rate. As a state-of-the-art classifier, support vector machine (SVM), has been extensively used in this area. Typically, SVM is trained in a batch model, i.e., all data points are simultaneously input for training the classification boundary. However, some slightly exceptional data, only accounting for a small proportion, are critical for the recognition rates. Training a classifier among all the data may possibly treat such legal but slightly exceptional samples as \"noise \". In this paper, we propose a novel approach to attack this problem. This approach exploits a two-stage framework by using difference features. In the first stage, a regular SVM is trained on all the training data; in the second stage, only the samples misclassified in the first stage are specially considered. Therefore, the performance can be lifted. The number of misclassifications is often small because of the good performance of SVM. This will present difficulties in training an accurate SVM engine only for these misclassified samples. We then further propose a multi-way to binary approach using difference features. This approach successfully transforms multi-category classification to binary classification and expands the training samples greatly. In order to evaluate the proposed method, experiments are performed on 10,000 handwritten numeral samples extracted from real banks forms. This new algorithm achieves 99.0% accuracy. In comparison, the traditional SVM only gets 98.4%.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129640739","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A Review of Shape Descriptors for Document Analysis 用于文献分析的形状描述符综述
O. R. Terrades, S. Tabbone, Ernest Valveny
Shape descriptors play an important role in many document analysis application. In this paper we review some of the shape descriptors proposed in the last years from a new point of view. We propose the definitions of descriptor and primitive and introduce the notion of feature extraction method. With these definitions, we propose a new classification of shape descriptors that permits to classify according to their properties pointing out their strengths and weaknesses.
形状描述符在许多文档分析应用中起着重要的作用。本文从一个新的角度回顾了近年来提出的一些形状描述子。提出了描述符和原语的定义,并引入了特征提取方法的概念。根据这些定义,我们提出了一种新的形状描述符分类,允许根据它们的属性进行分类,指出它们的优缺点。
{"title":"A Review of Shape Descriptors for Document Analysis","authors":"O. R. Terrades, S. Tabbone, Ernest Valveny","doi":"10.1109/ICDAR.2007.33","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.33","url":null,"abstract":"Shape descriptors play an important role in many document analysis application. In this paper we review some of the shape descriptors proposed in the last years from a new point of view. We propose the definitions of descriptor and primitive and introduce the notion of feature extraction method. With these definitions, we propose a new classification of shape descriptors that permits to classify according to their properties pointing out their strengths and weaknesses.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131956609","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 30
Handwritten Numeral Recognition of Six Popular Indian Scripts 六种流行印度文字的手写数字识别
U. Pal, N. Sharma, T. Wakabayashi, F. Kimura
India is a multi-lingual multi-script country but there is not much work towards handwritten character recognition of Indian languages. In this paper we propose a modified quadratic classifier based scheme towards the recognition of off-line handwritten numerals of six popular Indian scripts. Here we consider Devnagari, Bangla, Telugu, Oriya, Kannada and Tamil scripts for our experiment. The features used in the classifier are obtained from the directional information of the numerals. For feature computation, the bounding box of a numeral is segmented into blocks and the directional features are computed in each of the blocks. These blocks are then down sampled by a Gaussian filter and the features obtained from the down sampled blocks are fed to a modified quadratic classifier for recognition. Here we have used two sets of feature. We have used 64 dimensional features for high-speed recognition and 400 dimensional features for high-accuracy recognition in our proposed system. A five-fold cross validation technique has been used for result computation and we obtained 99.56%, 98.99%, 99.37%, 98.40%, 98.71% and 98.51% accuracy from Devnagari, Bangla, Telugu, Oriya, Kannada, and Tamil scripts, respectively.
印度是一个多语言多文字的国家,但印度语言的手写字符识别工作并不多。本文提出了一种基于改进二次分类器的六种常用印度文字离线手写数字识别方案。在这里,我们考虑了德文加里语、孟加拉语、泰卢固语、奥里亚语、卡纳达语和泰米尔语。分类器中使用的特征是从数字的方向信息中获得的。对于特征计算,将数字的边界框分割成块,并在每个块中计算方向特征。然后通过高斯滤波器对这些块进行下采样,并将从下采样块中获得的特征馈送到改进的二次分类器中进行识别。这里我们使用了两组特性。在我们提出的系统中,我们使用64维特征进行高速识别,使用400维特征进行高精度识别。采用五重交叉验证技术进行结果计算,德文加里语、孟加拉语、泰卢固语、奥里亚语、卡纳达语和泰米尔语的准确率分别为99.56%、98.99%、99.37%、98.40%、98.71%和98.51%。
{"title":"Handwritten Numeral Recognition of Six Popular Indian Scripts","authors":"U. Pal, N. Sharma, T. Wakabayashi, F. Kimura","doi":"10.1109/ICDAR.2007.129","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.129","url":null,"abstract":"India is a multi-lingual multi-script country but there is not much work towards handwritten character recognition of Indian languages. In this paper we propose a modified quadratic classifier based scheme towards the recognition of off-line handwritten numerals of six popular Indian scripts. Here we consider Devnagari, Bangla, Telugu, Oriya, Kannada and Tamil scripts for our experiment. The features used in the classifier are obtained from the directional information of the numerals. For feature computation, the bounding box of a numeral is segmented into blocks and the directional features are computed in each of the blocks. These blocks are then down sampled by a Gaussian filter and the features obtained from the down sampled blocks are fed to a modified quadratic classifier for recognition. Here we have used two sets of feature. We have used 64 dimensional features for high-speed recognition and 400 dimensional features for high-accuracy recognition in our proposed system. A five-fold cross validation technique has been used for result computation and we obtained 99.56%, 98.99%, 99.37%, 98.40%, 98.71% and 98.51% accuracy from Devnagari, Bangla, Telugu, Oriya, Kannada, and Tamil scripts, respectively.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"140 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132084313","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 189
WEB Image Classification Based on the Fusion of Image and Text Classifiers 基于图像和文本分类器融合的WEB图像分类
P. R. Kalva, F. Enembreck, Alessandro Lameiras Koerich
This paper presents a novel method for the classification of images that combines information extracted from the images and contextual information. The main hypothesis is that contextual information related to an image can contribute in the image classification process. First, independent classifiers are designed to deal with images and text. From the images color, shape and texture features are extracted. These features are used with a neural network (NN) classifier to carry out image classification. On the other hand, contextual information is processed and used with a Naive Bayes (NB) classifier. At the end, the outputs of both classifiers are combined through heuristic rules. Experimental results on a database of more than 5,000 HTML documents have shown that the combination of classifiers provides a meaningful improvement (about 16%) in the correct image classification rate relative to the results provided by the NN classifier alone.
本文提出了一种将图像提取信息与上下文信息相结合的图像分类方法。主要假设是与图像相关的上下文信息可以在图像分类过程中发挥作用。首先,设计独立的分类器来处理图像和文本。从图像中提取颜色、形状和纹理特征。将这些特征与神经网络(NN)分类器一起进行图像分类。另一方面,使用朴素贝叶斯(NB)分类器处理和使用上下文信息。最后,通过启发式规则将两个分类器的输出组合起来。在超过5000个HTML文档的数据库上的实验结果表明,相对于单独使用NN分类器提供的结果,分类器的组合在正确的图像分类率方面提供了有意义的提高(约16%)。
{"title":"WEB Image Classification Based on the Fusion of Image and Text Classifiers","authors":"P. R. Kalva, F. Enembreck, Alessandro Lameiras Koerich","doi":"10.1109/ICDAR.2007.264","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.264","url":null,"abstract":"This paper presents a novel method for the classification of images that combines information extracted from the images and contextual information. The main hypothesis is that contextual information related to an image can contribute in the image classification process. First, independent classifiers are designed to deal with images and text. From the images color, shape and texture features are extracted. These features are used with a neural network (NN) classifier to carry out image classification. On the other hand, contextual information is processed and used with a Naive Bayes (NB) classifier. At the end, the outputs of both classifiers are combined through heuristic rules. Experimental results on a database of more than 5,000 HTML documents have shown that the combination of classifiers provides a meaningful improvement (about 16%) in the correct image classification rate relative to the results provided by the NN classifier alone.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"46 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127629144","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A Unified Framework for Symbol Segmentation and Recognition of Handwritten Mathematical Expressions 手写体数学表达式符号分割与识别的统一框架
Yu Shi, HaiYang Li, F. Soong
A symbol decoding and graph generation algorithm for online handwritten mathematical expression recognition is formulated. It differs from our previous system and most other systems in two aspects: (1) it embeds stroke grouping into symbol identification to form a unified probabilistic framework for symbol recognition; and (2) a symbol graph rather than a list of symbol sequence hypotheses is generated, which makes post-processing with new information possible. Experimental results show that high quality symbol graph can be generated by the proposed algorithm. Symbol sequence corresponding to the best path in the graph demonstrates much higher symbol recognition accuracy than before, especially after rescoring with trigram. Math formula recognition performance is significantly improved.
提出了一种用于在线手写数学表达式识别的符号解码和图形生成算法。它与我们以前的系统和大多数其他系统的不同之处在于:(1)将笔画分组嵌入到符号识别中,形成统一的符号识别概率框架;(2)生成符号图而不是符号序列假设列表,使得对新信息进行后处理成为可能。实验结果表明,该算法可以生成高质量的符号图。图中最佳路径对应的符号序列的符号识别精度比之前有了很大的提高,特别是在用三元图进行评分后。数学公式识别性能显著提高。
{"title":"A Unified Framework for Symbol Segmentation and Recognition of Handwritten Mathematical Expressions","authors":"Yu Shi, HaiYang Li, F. Soong","doi":"10.1109/ICDAR.2007.38","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.38","url":null,"abstract":"A symbol decoding and graph generation algorithm for online handwritten mathematical expression recognition is formulated. It differs from our previous system and most other systems in two aspects: (1) it embeds stroke grouping into symbol identification to form a unified probabilistic framework for symbol recognition; and (2) a symbol graph rather than a list of symbol sequence hypotheses is generated, which makes post-processing with new information possible. Experimental results show that high quality symbol graph can be generated by the proposed algorithm. Symbol sequence corresponding to the best path in the graph demonstrates much higher symbol recognition accuracy than before, especially after rescoring with trigram. Math formula recognition performance is significantly improved.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"157 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129205757","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Text Line Detection in Unconstrained Handwritten Documents Using a Block-Based Hough Transform Approach 基于块的Hough变换方法的无约束手写文档文本行检测
G. Louloudis, B. Gatos, C. Halatsis
In this paper we present a new text line detection method for unconstrained handwritten documents. The proposed technique is based on a strategy that consists of three distinct steps. The first step includes image binarization and enhancement, connected component extraction and average character height estimation. In the second step, a block-based Hough transform is used for the detection of potential text lines while a third step is used to correct possible splitting, to detect text lines that the previous step did not reveal and, finally, to separate vertically connected characters and assign them to text lines. The performance evaluation of the proposed approach is based on a consistent and concrete evaluation methodology.
本文提出了一种新的无约束手写文档文本行检测方法。所提出的技术基于由三个不同步骤组成的策略。第一步包括图像二值化和增强、连通分量提取和平均特征高度估计。在第二步中,使用基于块的霍夫变换来检测潜在的文本行,而第三步用于纠正可能的分割,以检测前一步未显示的文本行,最后分离垂直连接的字符并将其分配给文本行。提出的方法的性能评价是基于一个一致的和具体的评价方法。
{"title":"Text Line Detection in Unconstrained Handwritten Documents Using a Block-Based Hough Transform Approach","authors":"G. Louloudis, B. Gatos, C. Halatsis","doi":"10.1109/ICDAR.2007.244","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.244","url":null,"abstract":"In this paper we present a new text line detection method for unconstrained handwritten documents. The proposed technique is based on a strategy that consists of three distinct steps. The first step includes image binarization and enhancement, connected component extraction and average character height estimation. In the second step, a block-based Hough transform is used for the detection of potential text lines while a third step is used to correct possible splitting, to detect text lines that the previous step did not reveal and, finally, to separate vertically connected characters and assign them to text lines. The performance evaluation of the proposed approach is based on a consistent and concrete evaluation methodology.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123172676","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Character-Stroke Detection for Text-Localization and Extraction 文本定位与提取中的字符笔画检测
Krishna Subramanian, P. Natarajan, M. Decerbo, D. Castañón
In this paper, we present a new approach for analysis of images for text-localization and extraction. Our approach puts very few constraints on the font, size and color of text and is capable of handling both scene text and artificial text well. In this paper, we exploit two well-known features of text: approximately constant stroke width and local contrast, and develop a fast, simple, and effective algorithm to detect character strokes. We also show how these can be used for accurate extraction and motivate some advantages of using this approach for text localization over other color-space segmentation based approaches. We analyze the performance of our stroke detection algorithm on images collected for the robust-reading competitions at ICDAR 2003.
本文提出了一种新的图像分析方法,用于文本定位和提取。我们的方法对文本的字体、大小和颜色几乎没有限制,并且能够很好地处理场景文本和人工文本。本文利用文本的两个众所周知的特征:近似恒定笔画宽度和局部对比度,开发了一种快速、简单、有效的笔画检测算法。我们还展示了如何使用这些方法进行准确的提取,并激发了使用这种方法进行文本定位比其他基于颜色空间分割的方法的一些优点。我们分析了我们的笔划检测算法在ICDAR 2003鲁棒阅读竞赛中收集的图像上的性能。
{"title":"Character-Stroke Detection for Text-Localization and Extraction","authors":"Krishna Subramanian, P. Natarajan, M. Decerbo, D. Castañón","doi":"10.1109/ICDAR.2007.79","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.79","url":null,"abstract":"In this paper, we present a new approach for analysis of images for text-localization and extraction. Our approach puts very few constraints on the font, size and color of text and is capable of handling both scene text and artificial text well. In this paper, we exploit two well-known features of text: approximately constant stroke width and local contrast, and develop a fast, simple, and effective algorithm to detect character strokes. We also show how these can be used for accurate extraction and motivate some advantages of using this approach for text localization over other color-space segmentation based approaches. We analyze the performance of our stroke detection algorithm on images collected for the robust-reading competitions at ICDAR 2003.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123468674","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 48
Matching Local Descriptors for Image Identification on Cultural Databases 文化数据库图像识别的局部描述符匹配
Eduardo Valle, M. Cord, S. Philipp-Foliguet
In this paper we present a new method for high- dimensional descriptor matching, based on the KD-tree, which is a classic method for nearest neighbours search. This new method, which we name 3-way tree, avoids the boundary effects that disrupt the KD-tree in higher dimensionalities, by the addition of redundant, overlapping sub-trees. That way, more precision is obtained for the same querying times. We evaluate our method in the context of image identification for cultural collections, a task which can greatly benefit from the use of high-dimensional local descriptors computed around Pol (Points of Interest).
本文提出了一种基于kd树的高维描述子匹配方法,该方法是一种经典的最近邻搜索方法。这种新方法,我们称之为三向树,通过添加冗余的、重叠的子树,避免了在更高维度上破坏kd树的边界效应。这样,对于相同的查询次数,可以获得更高的精度。我们在文化藏品图像识别的背景下评估了我们的方法,这项任务可以从使用围绕Pol(兴趣点)计算的高维局部描述符中受益匪浅。
{"title":"Matching Local Descriptors for Image Identification on Cultural Databases","authors":"Eduardo Valle, M. Cord, S. Philipp-Foliguet","doi":"10.1109/ICDAR.2007.164","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.164","url":null,"abstract":"In this paper we present a new method for high- dimensional descriptor matching, based on the KD-tree, which is a classic method for nearest neighbours search. This new method, which we name 3-way tree, avoids the boundary effects that disrupt the KD-tree in higher dimensionalities, by the addition of redundant, overlapping sub-trees. That way, more precision is obtained for the same querying times. We evaluate our method in the context of image identification for cultural collections, a task which can greatly benefit from the use of high-dimensional local descriptors computed around Pol (Points of Interest).","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126548491","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Word image based latent semantic indexing for conceptual querying in document image databases 基于词图像的潜在语义索引在文档图像数据库中的概念查询
Sameek Banerjee, Gaurav Harit, S. Chaudhury
In this paper we present an application of latent semantic analysis (LSA) for indexing and retrieval of document images with text. The query is specified as a set of word images and the documents which best match with the query representation in the the latent semantic space are retrieved. We show through extensive experiments on a large database that use of LSA for document images provides improvements in retrieval precision as is the case with electronic text documents.
本文提出了一种潜在语义分析(LSA)在带文本的文档图像索引和检索中的应用。查询被指定为一组单词图像,并在潜在语义空间中检索与查询表示最匹配的文档。我们通过在一个大型数据库上进行的大量实验表明,对文档图像使用LSA可以提高检索精度,就像电子文本文档的情况一样。
{"title":"Word image based latent semantic indexing for conceptual querying in document image databases","authors":"Sameek Banerjee, Gaurav Harit, S. Chaudhury","doi":"10.1109/ICDAR.2007.269","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.269","url":null,"abstract":"In this paper we present an application of latent semantic analysis (LSA) for indexing and retrieval of document images with text. The query is specified as a set of word images and the documents which best match with the query representation in the the latent semantic space are retrieved. We show through extensive experiments on a large database that use of LSA for document images provides improvements in retrieval precision as is the case with electronic text documents.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"110 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122561751","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1