首页 > 最新文献

Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition最新文献

英文 中文
Bigram-based post-processing for online handwriting recognition using correctness evaluation 基于双元图的在线手写识别的后处理
Pub Date : 2002-11-07 DOI: 10.1109/IWFHR.2002.1030892
A. Nakamura, H. Kawajiri
An approach to bigram-based linguistic processing for online handwriting text recognition is described. A probability of correctness for each recognition result is derived from a feature set which consists of bigram probabilities and recognition scores. Using the probability of correctness, the number of candidates accepted to the post-processing step and the weight value balancing recognition scores with bigram scores are adaptively controlled. The proposed method is evaluated in experiments using the HANDS-kuchibue online handwritten character database. Results show that the method is effective in reducing candidates, improving accuracy, and saving computational costs.
描述了一种基于双字母的在线手写文本识别语言处理方法。每个识别结果的正确概率由双图概率和识别分数组成的特征集导出。利用正确概率,自适应控制接受后处理步骤的候选数和识别分数与二元图分数平衡的权重值。在HANDS-kuchibue在线手写体数据库中对该方法进行了实验验证。结果表明,该方法在减少候选点、提高精度和节省计算成本方面是有效的。
{"title":"Bigram-based post-processing for online handwriting recognition using correctness evaluation","authors":"A. Nakamura, H. Kawajiri","doi":"10.1109/IWFHR.2002.1030892","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030892","url":null,"abstract":"An approach to bigram-based linguistic processing for online handwriting text recognition is described. A probability of correctness for each recognition result is derived from a feature set which consists of bigram probabilities and recognition scores. Using the probability of correctness, the number of candidates accepted to the post-processing step and the weight value balancing recognition scores with bigram scores are adaptively controlled. The proposed method is evaluated in experiments using the HANDS-kuchibue online handwritten character database. Results show that the method is effective in reducing candidates, improving accuracy, and saving computational costs.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"26 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125380744","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
Confidence modeling for verification post-processing for handwriting recognition 用于手写识别验证后处理的置信度建模
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030880
J. Pitrelli, M. Perrone
We apply confidence-scoring techniques to verify the output of a handwriting recognizer. We evaluate a variety of scoring functions, including likelihood ratios and estimated posterior probabilities of correctness, in a postprocessing mode to generate confidence scores at the character or word level. Using the post-processor in conjunction with an HMM-based on-line handwriting recognizer for large-vocabulary word recognition, receiver-operating-characteristic (ROC) curves reveal that our post-processor is able to reject correctly 90% of recognizer errors while only falsely rejecting 33% of correctly-recognized words. For isolated-digit recognition, we achieve a correct rejection rate of 90% while keeping false rejection down to 13%.
我们应用置信度评分技术来验证手写识别器的输出。我们在后处理模式中评估各种评分函数,包括似然比和估计的正确性后验概率,以生成字符或单词级别的置信度评分。将后置处理器与基于hmm的在线手写识别器结合使用,用于大词汇量的单词识别,接受者工作特征(ROC)曲线显示,我们的后置处理器能够正确拒绝90%的识别器错误,而仅错误拒绝33%的正确识别单词。对于孤立数字识别,我们实现了90%的正确拒斥率,同时将错误拒斥率降至13%。
{"title":"Confidence modeling for verification post-processing for handwriting recognition","authors":"J. Pitrelli, M. Perrone","doi":"10.1109/IWFHR.2002.1030880","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030880","url":null,"abstract":"We apply confidence-scoring techniques to verify the output of a handwriting recognizer. We evaluate a variety of scoring functions, including likelihood ratios and estimated posterior probabilities of correctness, in a postprocessing mode to generate confidence scores at the character or word level. Using the post-processor in conjunction with an HMM-based on-line handwriting recognizer for large-vocabulary word recognition, receiver-operating-characteristic (ROC) curves reveal that our post-processor is able to reject correctly 90% of recognizer errors while only falsely rejecting 33% of correctly-recognized words. For isolated-digit recognition, we achieve a correct rejection rate of 90% while keeping false rejection down to 13%.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"117165435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
Genetic engineering of handwriting representations 手写表示的基因工程
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030900
Alexandre Lemieux, Christian Gagné, M. Parizeau
This paper presents experiments with genetically engineered feature sets for recognition of online handwritten characters. These representations stem from a nondescript decomposition of the character frame into a set of rectangular regions, possibly overlapping each represented by a vector of 7 fuzzy variables. Efficient new feature sets are automatically discovered using genetic programming techniques. Recognition experiments conducted on isolated digits of the Unipen database yield improvements of more than 3% over a previously, manually designed representation where region positions and sizes were fixed.
本文提出了一种基于基因工程特征集的在线手写字符识别实验。这些表示源于将字符帧分解为一组矩形区域,这些矩形区域可能重叠,每个区域由7个模糊变量向量表示。利用遗传编程技术自动发现高效的新特征集。在Unipen数据库的孤立数字上进行的识别实验比以前手动设计的区域位置和大小固定的表示提高了3%以上。
{"title":"Genetic engineering of handwriting representations","authors":"Alexandre Lemieux, Christian Gagné, M. Parizeau","doi":"10.1109/IWFHR.2002.1030900","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030900","url":null,"abstract":"This paper presents experiments with genetically engineered feature sets for recognition of online handwritten characters. These representations stem from a nondescript decomposition of the character frame into a set of rectangular regions, possibly overlapping each represented by a vector of 7 fuzzy variables. Efficient new feature sets are automatically discovered using genetic programming techniques. Recognition experiments conducted on isolated digits of the Unipen database yield improvements of more than 3% over a previously, manually designed representation where region positions and sizes were fixed.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"106 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123414776","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 18
On line signature verification: Fusion of a Hidden Markov Model and a neural network via a support vector machine 在线签名验证:基于支持向量机的隐马尔可夫模型与神经网络的融合
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030918
Marc Fuentes, S. Garcia-Salicetti, B. Dorizzi
We propose in this work to perform on-line signature verification by the fusion of two complementary verification modules. The first one considers a signature as a sequence of points and models the genuine signatures of a given signer by a Hidden Markov Model (HMM). Forgeries are used to compute a decision threshold. In the second module, global parameters of a signature are the inputs of a two-classes neural network trained for each signer on both the genuine and "other" signatures (genuine signatures of other signers). Fusion of the scores given by these two experts through a Support Vector Machine (SVM), allows improving the results over those of each module, on Philips' Database.
在这项工作中,我们建议通过融合两个互补的验证模块来进行在线签名验证。第一种方法将签名视为一个点序列,利用隐马尔可夫模型(HMM)对给定签名者的真实签名进行建模。赝品用于计算决策阈值。在第二个模块中,签名的全局参数是为每个签名者在真实签名和“其他”签名(其他签名者的真实签名)上训练的两类神经网络的输入。通过支持向量机(SVM)融合这两位专家给出的分数,可以在飞利浦的数据库中改进每个模块的结果。
{"title":"On line signature verification: Fusion of a Hidden Markov Model and a neural network via a support vector machine","authors":"Marc Fuentes, S. Garcia-Salicetti, B. Dorizzi","doi":"10.1109/IWFHR.2002.1030918","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030918","url":null,"abstract":"We propose in this work to perform on-line signature verification by the fusion of two complementary verification modules. The first one considers a signature as a sequence of points and models the genuine signatures of a given signer by a Hidden Markov Model (HMM). Forgeries are used to compute a decision threshold. In the second module, global parameters of a signature are the inputs of a two-classes neural network trained for each signer on both the genuine and \"other\" signatures (genuine signatures of other signers). Fusion of the scores given by these two experts through a Support Vector Machine (SVM), allows improving the results over those of each module, on Philips' Database.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"29 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115039355","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 62
Multidimensional multistage k-NN classifiers for handwritten digit recognition 手写数字识别的多维多阶段k-NN分类器
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030878
Iratxe Soraluze Arriola, Clemente Rodríguez Lafuente, F. Boto, A. Pérez
This paper analyses the application of multistage classifiers based on the k-NN rule to the automatic classification of handwritten digits. The discriminating capacity of a k-NN classifier increases as the size and dimensionality of the reference pattern set (RPS) increases. This supposes a problem for k-NN classifiers in real applications: the high computational cost required. In order to accelerate the process of calculating the distance to each pattern of the RPS, some authors propose the use of condensing techniques. These methods try to reduce the size of the RPS without losing classification power. Our alternative proposal is based on hierarchical classifiers with rejection techniques and incremental learning that reduce the computational cost of the classifier. We have used 270,000 digits (160,000 digits for training and 110, 000 for the test) of the NIST Special Data Bases 19 and 3 (SD19 and SD3) as experimental data sets. The best non -hierarchical classifier achieves a hit rate of 99.50%. The hierarchical classifier achieves the same hit ratio, but with 24.5 times lower computational cost than best non-hierarchical classifier found in our experimentation and 6 times lower than Hart's Algorithm.
本文分析了基于k-NN规则的多阶段分类器在手写体数字自动分类中的应用。k-NN分类器的识别能力随着参考模式集(RPS)的大小和维数的增加而增加。这为k-NN分类器在实际应用中提出了一个问题:所需的高计算成本。为了加速计算到RPS的每个模式的距离的过程,一些作者提出使用压缩技术。这些方法试图在不损失分类能力的情况下减小RPS的大小。我们的替代方案是基于层次分类器的拒绝技术和增量学习,减少分类器的计算成本。我们使用了NIST特殊数据库19和3 (SD19和SD3)的27万位数(16万用于训练,11万用于测试)作为实验数据集。最好的非分层分类器达到99.50%的命中率。层次分类器实现了相同的命中率,但计算成本比我们实验中发现的最佳非层次分类器低24.5倍,比Hart算法低6倍。
{"title":"Multidimensional multistage k-NN classifiers for handwritten digit recognition","authors":"Iratxe Soraluze Arriola, Clemente Rodríguez Lafuente, F. Boto, A. Pérez","doi":"10.1109/IWFHR.2002.1030878","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030878","url":null,"abstract":"This paper analyses the application of multistage classifiers based on the k-NN rule to the automatic classification of handwritten digits. The discriminating capacity of a k-NN classifier increases as the size and dimensionality of the reference pattern set (RPS) increases. This supposes a problem for k-NN classifiers in real applications: the high computational cost required. In order to accelerate the process of calculating the distance to each pattern of the RPS, some authors propose the use of condensing techniques. These methods try to reduce the size of the RPS without losing classification power. Our alternative proposal is based on hierarchical classifiers with rejection techniques and incremental learning that reduce the computational cost of the classifier. We have used 270,000 digits (160,000 digits for training and 110, 000 for the test) of the NIST Special Data Bases 19 and 3 (SD19 and SD3) as experimental data sets. The best non -hierarchical classifier achieves a hit rate of 99.50%. The hierarchical classifier achieves the same hit ratio, but with 24.5 times lower computational cost than best non-hierarchical classifier found in our experimentation and 6 times lower than Hart's Algorithm.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"120 10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122707994","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 7
Creation of classifier ensembles for handwritten word recognition using feature selection algorithms 使用特征选择算法创建用于手写单词识别的分类器集成
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030906
Simon Günter, H. Bunke
The study of multiple classifier systems has become an area of intensive research in pattern recognition. Also in handwriting, recognition, systems combining several classifiers have been investigated. In the paper new methods for the creation of classifier ensembles based on feature selection algorithms are introduced. These new methods are evaluated and compared to existing approaches in the context of handwritten word recognition, using a hidden Markov model recognizer as basic classifier.
多分类器系统的研究已成为模式识别领域的一个热点。此外,在手写,识别,系统结合几个分类器进行了研究。本文介绍了基于特征选择算法的分类器集成的新方法。使用隐马尔可夫模型识别器作为基本分类器,对这些新方法进行了评估,并与现有的手写单词识别方法进行了比较。
{"title":"Creation of classifier ensembles for handwritten word recognition using feature selection algorithms","authors":"Simon Günter, H. Bunke","doi":"10.1109/IWFHR.2002.1030906","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030906","url":null,"abstract":"The study of multiple classifier systems has become an area of intensive research in pattern recognition. Also in handwriting, recognition, systems combining several classifiers have been investigated. In the paper new methods for the creation of classifier ensembles based on feature selection algorithms are introduced. These new methods are evaluated and compared to existing approaches in the context of handwritten word recognition, using a hidden Markov model recognizer as basic classifier.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"150 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122765677","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 62
Extraction of place-name from natural scenes 自然景观中地名的提取
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030916
Takuma Yamaguchi, Y. Nakano
This paper proposes an experimental character recognition system to recognise place-names written on signboards. Recently, mobile phones with digital cameras and handy digital cameras have be come popular, so we think this system is useful. In experiments, we tested a total of 112 natural scene images with 320 characters. We obtained a correct character recognition rate of 99% and a place-name recognition rate of 98% (with a rejection rate 2%).
本文提出了一种用于标识地名识别的实验性字符识别系统。最近,带数码相机的手机和便携式数码相机已经开始流行,所以我们认为这个系统是有用的。在实验中,我们共测试了112张包含320个字符的自然场景图像。我们获得了99%的正确字符识别率和98%的地名识别率(拒绝率为2%)。
{"title":"Extraction of place-name from natural scenes","authors":"Takuma Yamaguchi, Y. Nakano","doi":"10.1109/IWFHR.2002.1030916","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030916","url":null,"abstract":"This paper proposes an experimental character recognition system to recognise place-names written on signboards. Recently, mobile phones with digital cameras and handy digital cameras have be come popular, so we think this system is useful. In experiments, we tested a total of 112 natural scene images with 320 characters. We obtained a correct character recognition rate of 99% and a place-name recognition rate of 98% (with a rejection rate 2%).","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"121 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116838601","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Handwritten text recognition through writer adaptation 手写文字识别通过作家的适应
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030937
A. Nosary, T. Paquet, L. Heutte, A. Bensefia
Handwritten text recognition is a problem rarely studied out of specific applications for which lexical knowledge can constrain the vocabulary to a limited one. In the case of handwritten text recognition, additional information can be exploited to characterize the specificity of the writing. This knowledge can help the recognition system to find coherent solutions from both the lexical and the morphological points of view. We present the principles of a handwritten text recognition system based on the online learning of the writer shapes. The proposed scheme is shown to improve the recognition rates on a sample of fifteen writings, unknown to the system.
手写体文本识别是一个很少被研究的问题,因为在特定的应用中,词汇知识将词汇限制在一个有限的范围内。在手写文本识别的情况下,可以利用附加信息来表征书写的特异性。这些知识可以帮助识别系统从词汇和形态的角度找到连贯的解决方案。提出了一种基于写作者形状在线学习的手写体文本识别系统的原理。结果表明,该方法可以提高系统对15种未知文字的识别率。
{"title":"Handwritten text recognition through writer adaptation","authors":"A. Nosary, T. Paquet, L. Heutte, A. Bensefia","doi":"10.1109/IWFHR.2002.1030937","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030937","url":null,"abstract":"Handwritten text recognition is a problem rarely studied out of specific applications for which lexical knowledge can constrain the vocabulary to a limited one. In the case of handwritten text recognition, additional information can be exploited to characterize the specificity of the writing. This knowledge can help the recognition system to find coherent solutions from both the lexical and the morphological points of view. We present the principles of a handwritten text recognition system based on the online learning of the writer shapes. The proposed scheme is shown to improve the recognition rates on a sample of fifteen writings, unknown to the system.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"181 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121727035","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
A new warping technique for normalizing likelihood of multiple classifiers and its effectiveness in combined on-line/off-line japanese character recognition 一种新的多分类器归一化似然的扭曲技术及其在线/离线联合识别日文字符的有效性
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030905
Ondrej Velek, Stefan Jäger, M. Nakagawa
We propose a technique for normalizing likelihood of multiple classifiers prior to their combination. Our technique takes classifier-specific likelihood characteristics into account and maps them to a common, ideal characteristic allowing fair combination under arbitrary combination schemes. For each classifier, a simple warping process aligns the likelihood with the accumulated recognition rate, so that recognition rate becomes a uniformly increasing function of likelihood. For combining normalized likelihood values, we investigate several elementary combination rules, such as sum-rule or max-rule. We achieved a significant performance gain of more than five percent, compared to the best single recognition rate, showing both the effectiveness of our method for classifier combination and the benefit of combining on-line Japanese character recognition with stroke order and stroke number independent off-line recognition. Moreover, our experiments provide additional empirical evidence for the good performance of the sum rule in comparison with other elementary combination rules, as has already been observed by other research groups.
我们提出了一种在多个分类器组合之前对其可能性进行归一化的技术。我们的技术考虑了分类器特定的似然特征,并将它们映射到一个共同的、理想的特征,允许在任意组合方案下进行公平组合。对于每个分类器,一个简单的扭曲过程将似然与累积识别率对齐,使识别率成为似然的均匀递增函数。为了组合归一化似然值,我们研究了几种基本组合规则,如和规则或最大规则。与最佳的单一识别率相比,我们取得了超过5%的显著性能增益,这既显示了我们的分类器组合方法的有效性,也显示了将在线日文字符识别与笔画顺序和笔画数独立的离线识别相结合的好处。此外,与其他基本组合规则相比,我们的实验为求和规则的良好性能提供了额外的经验证据,正如其他研究组已经观察到的那样。
{"title":"A new warping technique for normalizing likelihood of multiple classifiers and its effectiveness in combined on-line/off-line japanese character recognition","authors":"Ondrej Velek, Stefan Jäger, M. Nakagawa","doi":"10.1109/IWFHR.2002.1030905","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030905","url":null,"abstract":"We propose a technique for normalizing likelihood of multiple classifiers prior to their combination. Our technique takes classifier-specific likelihood characteristics into account and maps them to a common, ideal characteristic allowing fair combination under arbitrary combination schemes. For each classifier, a simple warping process aligns the likelihood with the accumulated recognition rate, so that recognition rate becomes a uniformly increasing function of likelihood. For combining normalized likelihood values, we investigate several elementary combination rules, such as sum-rule or max-rule. We achieved a significant performance gain of more than five percent, compared to the best single recognition rate, showing both the effectiveness of our method for classifier combination and the benefit of combining on-line Japanese character recognition with stroke order and stroke number independent off-line recognition. Moreover, our experiments provide additional empirical evidence for the good performance of the sum rule in comparison with other elementary combination rules, as has already been observed by other research groups.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"212 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115802134","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Linguistic integration information in the AABATAS Arabic text analysis system AABATAS阿拉伯语文本分析系统中的语言整合信息
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030941
S. Kanoun, A. Ennaji, Y. Lecourtier, A. Alimi
An Arabic text analysis system called AABATAS (affixal approach-based Arabic text analysis system) is proposed. AABATAS recognizes and categorizes the words while identifying their morphological and grammatical characteristics. It is based on a new approach for Arabic word recognition called affixal approach. This affixal approach is guided by the structural properties of language. A dynamic decomposition-recognition mechanism is used in our system and leads to generate a set of reliable solutions for each word. This mechanism attempts to identify, the word basic morphemes: the prefix, the infix, the suffix and the root contrary to the existing approaches that are usually based on the recognition of the whole word or the pseudo-word or the letter. In this paper, we briefly present the general characteristics of Arabic texts as well as a succinct survey of the existing approaches used for their recognition. We then describe the structural properties of the Arabic language and the two systems based on these last properties. The first one concerns a word recognition process and the second is devoted to text analysis. We finally show two experimental results; one on a data set of 545 words and another on a text example.
提出了一种基于词缀法的阿拉伯语文本分析系统AABATAS。AABATAS在识别单词的形态和语法特征的同时对其进行识别和分类。它是基于一种新的阿拉伯语单词识别方法,称为词缀方法。这种词缀方法受语言结构特性的指导。我们的系统采用动态分解识别机制,为每个单词生成一组可靠的解。该机制试图识别词的基本语素:前缀、中缀、后缀和词根,而不是现有的基于全词、假词或字母识别的方法。在本文中,我们简要介绍了阿拉伯语文本的一般特征,并简要介绍了用于识别阿拉伯语文本的现有方法。然后,我们描述了阿拉伯语的结构特性和基于这些最后的特性的两个系统。第一个是单词识别过程,第二个是文本分析。最后给出了两个实验结果;一个在545个单词的数据集上,另一个在文本示例上。
{"title":"Linguistic integration information in the AABATAS Arabic text analysis system","authors":"S. Kanoun, A. Ennaji, Y. Lecourtier, A. Alimi","doi":"10.1109/IWFHR.2002.1030941","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030941","url":null,"abstract":"An Arabic text analysis system called AABATAS (affixal approach-based Arabic text analysis system) is proposed. AABATAS recognizes and categorizes the words while identifying their morphological and grammatical characteristics. It is based on a new approach for Arabic word recognition called affixal approach. This affixal approach is guided by the structural properties of language. A dynamic decomposition-recognition mechanism is used in our system and leads to generate a set of reliable solutions for each word. This mechanism attempts to identify, the word basic morphemes: the prefix, the infix, the suffix and the root contrary to the existing approaches that are usually based on the recognition of the whole word or the pseudo-word or the letter. In this paper, we briefly present the general characteristics of Arabic texts as well as a succinct survey of the existing approaches used for their recognition. We then describe the structural properties of the Arabic language and the two systems based on these last properties. The first one concerns a word recognition process and the second is devoted to text analysis. We finally show two experimental results; one on a data set of 545 words and another on a text example.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"77 1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116357531","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
期刊
Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1