首页 > 最新文献

Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition最新文献

英文 中文
Empirical error based optimization of SVM kernels: application to digit image recognition 基于经验误差的SVM核优化:在数字图像识别中的应用
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030925
N. Ayat, M. Cheriet, C. Suen
We address the problem of optimizing kernel parameters in support vector machine modeling, especially when the number of parameters is greater than one as in polynomial kernels and KMOD, our newly introduced kernel. The present work is an extended experimental study of the framework proposed by Chapelle et al. (2001) for optimizing SVM kernels using an analytic upper bound of the error. However our optimization scheme minimizes an empirical error estimate using a quasi-Newton optimization method. To assess our method, the approach is further used for adapting KMOD, RBF and polynomial kernels on synthetic data and NIST database. The method shows a much faster convergence with satisfactory results in comparison with the simple gradient descent method.
我们解决了支持向量机建模中优化核参数的问题,特别是当参数数量大于1时,如多项式核和KMOD(我们新引入的核)。目前的工作是对Chapelle等人(2001)提出的框架的扩展实验研究,该框架使用误差的解析上界来优化支持向量机核。然而,我们的优化方案使用准牛顿优化方法最小化经验误差估计。为了评估我们的方法,进一步将该方法用于在合成数据和NIST数据库上适配KMOD、RBF和多项式核。与简单的梯度下降法相比,该方法收敛速度快,结果令人满意。
{"title":"Empirical error based optimization of SVM kernels: application to digit image recognition","authors":"N. Ayat, M. Cheriet, C. Suen","doi":"10.1109/IWFHR.2002.1030925","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030925","url":null,"abstract":"We address the problem of optimizing kernel parameters in support vector machine modeling, especially when the number of parameters is greater than one as in polynomial kernels and KMOD, our newly introduced kernel. The present work is an extended experimental study of the framework proposed by Chapelle et al. (2001) for optimizing SVM kernels using an analytic upper bound of the error. However our optimization scheme minimizes an empirical error estimate using a quasi-Newton optimization method. To assess our method, the approach is further used for adapting KMOD, RBF and polynomial kernels on synthetic data and NIST database. The method shows a much faster convergence with satisfactory results in comparison with the simple gradient descent method.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125802046","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Recognition of handwritten month words on bank cheques 识别银行支票上手写的月份
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030895
Qizhi Xu, Jinho Kim, L. Lam, C. Suen
This paper describes an off-line system which recognizes unconstrained handwritten month words extracted from Canadian bank cheques. A segmentation based grapheme level HMM (hidden Markov model) classifier and two multilayer perceptron classifiers with different architectures and different features have been developed in CENPARMI for the recognition of month words. In this paper, a combination method with an effective conditional topology is presented, and the most widely used combination rules including Vote, Sum and Product, are experimented. A new modified Product rule is also proposed, which has produced the best recognition rate of 85.36% when tested on a real-life standard Canadian bank cheque database.
本文描述了一种离线系统,该系统可以识别从加拿大银行支票中提取的无约束手写月份词。在CENPARMI中开发了一种基于分词的字素级隐马尔可夫模型(HMM)分类器和两种具有不同结构和不同特征的多层感知器分类器,用于月词识别。本文提出了一种具有有效条件拓扑的组合方法,并对使用最广泛的组合规则Vote、Sum和Product进行了实验。提出了一种新的修改后的Product规则,该规则在真实的标准加拿大银行支票数据库上进行了测试,产生了85.36%的最佳识别率。
{"title":"Recognition of handwritten month words on bank cheques","authors":"Qizhi Xu, Jinho Kim, L. Lam, C. Suen","doi":"10.1109/IWFHR.2002.1030895","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030895","url":null,"abstract":"This paper describes an off-line system which recognizes unconstrained handwritten month words extracted from Canadian bank cheques. A segmentation based grapheme level HMM (hidden Markov model) classifier and two multilayer perceptron classifiers with different architectures and different features have been developed in CENPARMI for the recognition of month words. In this paper, a combination method with an effective conditional topology is presented, and the most widely used combination rules including Vote, Sum and Product, are experimented. A new modified Product rule is also proposed, which has produced the best recognition rate of 85.36% when tested on a real-life standard Canadian bank cheque database.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"167 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124674020","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Continuous approach to segmentation of handwritten text 手写文本的连续分割方法
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030950
L. M. Mestetskii, I. Reyer, T. Sederberg
A new approach to the segmentation of handwritten text is presented that is based on approximating a binary raster image with a set of polygons and building a continuous skeleton of those polygons. Polygons and skeletons are then used in extraction of lines, removing of spots and artifacts, extraction of words from lines and extraction of strokes from words.
提出了一种新的手写体文本分割方法,该方法基于一组多边形逼近二值光栅图像并构建这些多边形的连续骨架。然后使用多边形和骨架来提取线条,去除斑点和伪影,从线条中提取单词以及从单词中提取笔画。
{"title":"Continuous approach to segmentation of handwritten text","authors":"L. M. Mestetskii, I. Reyer, T. Sederberg","doi":"10.1109/IWFHR.2002.1030950","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030950","url":null,"abstract":"A new approach to the segmentation of handwritten text is presented that is based on approximating a binary raster image with a set of polygons and building a continuous skeleton of those polygons. Polygons and skeletons are then used in extraction of lines, removing of spots and artifacts, extraction of words from lines and extraction of strokes from words.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130114627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Handwritten document retrieval 手写文档检索
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030915
Gregory F. Russell, M. Perrone, Yi-Min Chee, A. Ziq
This paper investigates the use of both typed and handwritten queries to retrieve handwritten documents. The recognition-based approach reported here is novel in that it expands documents in a fashion analogous to query expansion: Individual documents are expanded using N-best lists which embody additional statistical information from a hidden Markov model (HMM) based handwriting recognizer used to transcribe each of the handwritten documents. This additional information enables the retrieval methods to be robust to machine transcription errors, retrieving documents which otherwise would be unretrievable. Cross-writer experiments on a database of 10985 words in 108 documents from 108 writers, and within-writer experiments in a probabilistic framework, on a database of 537724 words in 3342 documents from 43 writers, indicate that significant improvements in retrieval performance can be achieved. The second database is the largest database of on-line handwritten documents known to its.
本文研究了使用打字和手写查询来检索手写文档。这里报告的基于识别的方法是新颖的,因为它以类似于查询扩展的方式扩展文档:使用N-best列表扩展单个文档,其中包含来自用于转录每个手写文档的基于隐马尔可夫模型(HMM)的手写识别器的附加统计信息。这些额外的信息使检索方法对机器转录错误具有鲁棒性,检索否则无法检索的文档。在108位作者的108篇文档中的10985个单词的数据库上进行了跨作者实验,在概率框架下对43位作者的3342篇文档中的537724个单词的数据库进行了跨作者实验,结果表明检索性能得到了显著提高。第二个数据库是目前已知的最大的在线手写文档数据库。
{"title":"Handwritten document retrieval","authors":"Gregory F. Russell, M. Perrone, Yi-Min Chee, A. Ziq","doi":"10.1109/IWFHR.2002.1030915","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030915","url":null,"abstract":"This paper investigates the use of both typed and handwritten queries to retrieve handwritten documents. The recognition-based approach reported here is novel in that it expands documents in a fashion analogous to query expansion: Individual documents are expanded using N-best lists which embody additional statistical information from a hidden Markov model (HMM) based handwriting recognizer used to transcribe each of the handwritten documents. This additional information enables the retrieval methods to be robust to machine transcription errors, retrieving documents which otherwise would be unretrievable. Cross-writer experiments on a database of 10985 words in 108 documents from 108 writers, and within-writer experiments in a probabilistic framework, on a database of 537724 words in 3342 documents from 43 writers, indicate that significant improvements in retrieval performance can be achieved. The second database is the largest database of on-line handwritten documents known to its.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"26 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132359741","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
DP matching using Kalman filter as pre-processing in on-line signature verification 在线签名验证中基于卡尔曼滤波的DP匹配预处理
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030960
Masahiro Tanaka, Yumi Ishino, Hironori Shimada, T. Inoue
This paper proposes the point-wise matching method which is to be used as the pre-processing in online signature verification using only the positional information. After this pre-processing, various dynamical or local features of the signature can be used in verification. The test signature and the model one are to be matched point-wise by applying time-variant linear transformation. Kalman filter and the smoother are used for estimating the time-variant transformation parameters. Numerical experiment shows quite a good performance for real online signatures.
提出了一种仅利用位置信息进行在线签名验证的逐点匹配预处理方法。经过预处理后,可以利用签名的各种动态或局部特征进行验证。采用时变线性变换对测试信号和模型信号进行逐点匹配。利用卡尔曼滤波和平滑器估计时变变换参数。数值实验表明,该算法对实际在线签名具有较好的性能。
{"title":"DP matching using Kalman filter as pre-processing in on-line signature verification","authors":"Masahiro Tanaka, Yumi Ishino, Hironori Shimada, T. Inoue","doi":"10.1109/IWFHR.2002.1030960","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030960","url":null,"abstract":"This paper proposes the point-wise matching method which is to be used as the pre-processing in online signature verification using only the positional information. After this pre-processing, various dynamical or local features of the signature can be used in verification. The test signature and the model one are to be matched point-wise by applying time-variant linear transformation. Kalman filter and the smoother are used for estimating the time-variant transformation parameters. Numerical experiment shows quite a good performance for real online signatures.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"196 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121278696","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Hidden Markov model length optimization for handwriting recognition systems 手写识别系统的隐马尔可夫模型长度优化
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030938
Matthias Zimmermann, H. Bunke
This paper investigates the use of three different schemes to optimize the number of states of linear left-to-right hidden Markov models (HMM). In the first method, we describe the fixed length modeling scheme where each character model is assigned the same number of states. The second method considered is the Bakis length modeling where the number of model states is set to a given fraction of the average number of observations of the corresponding character. In the third modeling scheme the number of model states is set to a specified quantile of the corresponding character length histogram. This method is called quantile length modeling. A comparison of different length modeling schemes was carried out with a handwriting recognition system using off-line images of cursively handwritten English words from the IAM database. For the fixed length modeling, a recognition rate of 61% was achieved. Using the Bakis or quantile length modeling the word recognition rates could be improved to over 69%.
本文研究了使用三种不同的方案来优化线性从左到右隐马尔可夫模型(HMM)的状态数。在第一种方法中,我们描述了固定长度的建模方案,其中每个字符模型被分配相同数量的状态。第二种考虑的方法是Bakis长度建模,其中将模型状态的数量设置为相应字符的平均观测数的给定分数。在第三种建模方案中,将模型状态的数量设置为相应字符长度直方图的指定分位数。这种方法称为分位数长度建模。利用IAM数据库中草书手写英语单词的离线图像,对不同长度建模方案进行了比较。对于固定长度建模,识别率达到61%。使用Bakis或分位数长度建模,单词识别率可以提高到69%以上。
{"title":"Hidden Markov model length optimization for handwriting recognition systems","authors":"Matthias Zimmermann, H. Bunke","doi":"10.1109/IWFHR.2002.1030938","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030938","url":null,"abstract":"This paper investigates the use of three different schemes to optimize the number of states of linear left-to-right hidden Markov models (HMM). In the first method, we describe the fixed length modeling scheme where each character model is assigned the same number of states. The second method considered is the Bakis length modeling where the number of model states is set to a given fraction of the average number of observations of the corresponding character. In the third modeling scheme the number of model states is set to a specified quantile of the corresponding character length histogram. This method is called quantile length modeling. A comparison of different length modeling schemes was carried out with a handwriting recognition system using off-line images of cursively handwritten English words from the IAM database. For the fixed length modeling, a recognition rate of 61% was achieved. Using the Bakis or quantile length modeling the word recognition rates could be improved to over 69%.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"388 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132351977","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 77
A hybrid large vocabulary handwritten word recognition system using neural networks with hidden Markov models 基于隐马尔可夫模型的神经网络混合大词汇手写词识别系统
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030893
Alessandro Lameiras Koerich, Yann Leydier, R. Sabourin, C. Suen
We present a hybrid recognition system that integrates hidden Markov models (HMM) with neural networks (NN) in a probabilistic framework. The input data is processed first by a lexicon-driven word recognizer based on HMMs to generate a list of the candidate N-best-scoring word hypotheses as well as the segmentation of such word hypotheses into characters. An NN classifier is used to generate a score for each segmented character and in the end, the scores from the HMM and the NN classifiers are combined to optimize performance. Experimental results show that for an 80,000-word vocabulary, the hybrid HMM/NN system improves by about 10% the word recognition rate over the HMM system alone.
提出了一种在概率框架下将隐马尔可夫模型(HMM)与神经网络(NN)相结合的混合识别系统。输入数据首先由基于hmm的词典驱动的词识别器处理,生成候选n个得分最高的词假设列表,并将这些词假设分割成字符。使用神经网络分类器为每个被分割的字符生成分数,最后,将HMM和神经网络分类器的分数相结合以优化性能。实验结果表明,对于8万单词的词汇表,混合HMM/NN系统比单独的HMM系统提高了约10%的单词识别率。
{"title":"A hybrid large vocabulary handwritten word recognition system using neural networks with hidden Markov models","authors":"Alessandro Lameiras Koerich, Yann Leydier, R. Sabourin, C. Suen","doi":"10.1109/IWFHR.2002.1030893","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030893","url":null,"abstract":"We present a hybrid recognition system that integrates hidden Markov models (HMM) with neural networks (NN) in a probabilistic framework. The input data is processed first by a lexicon-driven word recognizer based on HMMs to generate a list of the candidate N-best-scoring word hypotheses as well as the segmentation of such word hypotheses into characters. An NN classifier is used to generate a score for each segmented character and in the end, the scores from the HMM and the NN classifiers are combined to optimize performance. Experimental results show that for an 80,000-word vocabulary, the hybrid HMM/NN system improves by about 10% the word recognition rate over the HMM system alone.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"79 20","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131879726","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 45
Confident assessment of children's handwritten responses 对儿童手写回答的自信评估
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030961
J. Allan, Tony Allen, N. Sherkat
This paper introduces a novel approach for the automatic assessment of children's responses to standardised English exam questions. The constrained nature of the question and answer medium is exploited to produce an automatic assessment mechanism that is both highly accurate and produces a reasonable level of response yield. It is shown that the novel approach can achieve 100% scoring accuracy on 44% of all responses compared to a traditional lexical approach that has an error rate of 41%. When a thresholding method, similar to that used in the novel approach is applied, the traditional approach can achieve an accuracy of 100% but with a response yield of only 5%. The approach introduced in this paper is thus shown to have a significant advantage over the traditional lexical based assessment.
本文介绍了一种自动评估儿童英语标准化试题反应的新方法。利用问答媒介的约束性来产生一种自动评估机制,这种机制既高度准确,又能产生合理水平的响应量。结果表明,与错误率为41%的传统词汇方法相比,该方法在44%的回答中可以达到100%的评分准确率。当使用与新方法相似的阈值方法时,传统方法可以达到100%的准确率,但响应收率仅为5%。因此,本文介绍的方法比传统的基于词汇的评估具有显著的优势。
{"title":"Confident assessment of children's handwritten responses","authors":"J. Allan, Tony Allen, N. Sherkat","doi":"10.1109/IWFHR.2002.1030961","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030961","url":null,"abstract":"This paper introduces a novel approach for the automatic assessment of children's responses to standardised English exam questions. The constrained nature of the question and answer medium is exploited to produce an automatic assessment mechanism that is both highly accurate and produces a reasonable level of response yield. It is shown that the novel approach can achieve 100% scoring accuracy on 44% of all responses compared to a traditional lexical approach that has an error rate of 41%. When a thresholding method, similar to that used in the novel approach is applied, the traditional approach can achieve an accuracy of 100% but with a response yield of only 5%. The approach introduced in this paper is thus shown to have a significant advantage over the traditional lexical based assessment.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134161727","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A fast algorithm for finding k-nearest neighbors with non-metric dissimilarity 非度量不相似度的k近邻快速查找算法
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030877
Bin Zhang, S. Srihari
Fast nearest neighbor (NN) finding has been extensively studied. While some fast NN algorithms using metrics rely on the essential properties of metric spaces, the others using non-metric measures fail for large-size templates. However in some applications with very large size templates, the best performance is achieved by NN methods based on the dissimilarity measures resulting in a special space where computations cannot be pruned by the algorithms based-on the triangular inequality. For such NN methods, the existing fast algorithms except condensing algorithms are not applicable. In this paper, a fast hierarchical search algorithm is proposed to find k-NNs using a non-metric measure in a binary feature space. Experiments with handwritten digit recognition show that the new algorithm reduces on average dissimilarity computations by more than 90% while losing the accuracy by less than 0.1%, with a 10% increase in memory.
快速近邻(NN)的发现已经得到了广泛的研究。虽然一些使用度量的快速神经网络算法依赖于度量空间的基本属性,但其他使用非度量度量的算法在大尺寸模板中失败。然而,在一些模板尺寸非常大的应用中,基于不相似度度量的神经网络方法可以获得最佳性能,这导致了一个特殊的空间,在这个空间中,基于三角不等式的算法无法对计算进行修剪。对于这种神经网络方法,现有的除压缩算法外的快速算法都不适用。本文提出了一种快速分层搜索算法,利用非度量度量在二元特征空间中查找k- nn。手写体数字识别实验表明,新算法平均减少了90%以上的不相似度计算,而准确率损失小于0.1%,内存增加了10%。
{"title":"A fast algorithm for finding k-nearest neighbors with non-metric dissimilarity","authors":"Bin Zhang, S. Srihari","doi":"10.1109/IWFHR.2002.1030877","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030877","url":null,"abstract":"Fast nearest neighbor (NN) finding has been extensively studied. While some fast NN algorithms using metrics rely on the essential properties of metric spaces, the others using non-metric measures fail for large-size templates. However in some applications with very large size templates, the best performance is achieved by NN methods based on the dissimilarity measures resulting in a special space where computations cannot be pruned by the algorithms based-on the triangular inequality. For such NN methods, the existing fast algorithms except condensing algorithms are not applicable. In this paper, a fast hierarchical search algorithm is proposed to find k-NNs using a non-metric measure in a binary feature space. Experiments with handwritten digit recognition show that the new algorithm reduces on average dissimilarity computations by more than 90% while losing the accuracy by less than 0.1%, with a 10% increase in memory.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132315921","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 20
Script and nature differentiation for Arabic and Latin text images 阿拉伯文和拉丁文文字图像的文字和性质区分
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030928
S. Kanoun, A. Ennaji, Y. Lecourtier, A. Alimi
A method for Arabic and Latin text block differentiation for printed and handwritten scripts is proposed. This method is based on a morphological analysis for each script at the text block level and a geometrical analysis at the line and the connected component level. In this paper, we present a brief survey, of existing methods used for scripts differentiation as well as a general characteristics of Arabic and Latin scripts. Then, We describe our method for the differentiation of these last scripts. We finally show two experimental results on two different data sets. 400 text blocks constitute the first one and 335 text blocks compose the second.
提出了一种区分印刷和手写阿拉伯文和拉丁文文本块的方法。该方法基于在文本块级别对每个脚本进行形态学分析,以及在行和连接组件级别对每个脚本进行几何分析。在本文中,我们简要介绍了现有的文字区分方法以及阿拉伯文和拉丁文的一般特征。然后,我们描述了我们的方法来区分这些最后的脚本。最后给出了两个不同数据集上的实验结果。400个文本块组成第一个,335个文本块组成第二个。
{"title":"Script and nature differentiation for Arabic and Latin text images","authors":"S. Kanoun, A. Ennaji, Y. Lecourtier, A. Alimi","doi":"10.1109/IWFHR.2002.1030928","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030928","url":null,"abstract":"A method for Arabic and Latin text block differentiation for printed and handwritten scripts is proposed. This method is based on a morphological analysis for each script at the text block level and a geometrical analysis at the line and the connected component level. In this paper, we present a brief survey, of existing methods used for scripts differentiation as well as a general characteristics of Arabic and Latin scripts. Then, We describe our method for the differentiation of these last scripts. We finally show two experimental results on two different data sets. 400 text blocks constitute the first one and 335 text blocks compose the second.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"187 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"114736584","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
期刊
Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1