首页 > 最新文献

Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition最新文献

英文 中文
On-line recognition of handwritten Arabic characters using a Kohonen neural network 基于Kohonen神经网络的手写阿拉伯文字在线识别
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030958
N. Mezghani, A. Mitiche, M. Cheriet
Neural networks have been applied to various pattern classification and recognition problems for their learning ability, discrimination power and generalization ability The neural network most referenced in the pattern recognition literature are the multi-layer perceptron, the Kohonen associative memory and the Capenter-Grossberg ART network. The Kohonen memory runs an unsupervised clustering algorithm. It is easily trained and has attractive properties such as topological ordering and good generalization. In this study an on-line system for the recognition of handwriting Arabic characters using a Kohonen network is investigated. The input of the neural network is a feature vector of elliptic Fourier coefficients extracted from the handwritten dynamic representation. Experimental results show that the network successfully recognizes both clearly and roughly written characters with good performance.
神经网络以其学习能力、判别能力和泛化能力被广泛应用于各种模式分类和识别问题,模式识别文献中引用最多的神经网络是多层感知器、Kohonen联想记忆和capter - grossberg ART网络。Kohonen内存运行一种无监督聚类算法。它易于训练,并且具有拓扑有序和良好泛化等吸引人的特性。本文研究了一种基于Kohonen网络的手写阿拉伯文在线识别系统。神经网络的输入是从手写动态表示中提取的椭圆傅里叶系数特征向量。实验结果表明,该网络既能识别清晰的汉字,又能识别粗略的汉字,并且具有良好的性能。
{"title":"On-line recognition of handwritten Arabic characters using a Kohonen neural network","authors":"N. Mezghani, A. Mitiche, M. Cheriet","doi":"10.1109/IWFHR.2002.1030958","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030958","url":null,"abstract":"Neural networks have been applied to various pattern classification and recognition problems for their learning ability, discrimination power and generalization ability The neural network most referenced in the pattern recognition literature are the multi-layer perceptron, the Kohonen associative memory and the Capenter-Grossberg ART network. The Kohonen memory runs an unsupervised clustering algorithm. It is easily trained and has attractive properties such as topological ordering and good generalization. In this study an on-line system for the recognition of handwriting Arabic characters using a Kohonen network is investigated. The input of the neural network is a feature vector of elliptic Fourier coefficients extracted from the handwritten dynamic representation. Experimental results show that the network successfully recognizes both clearly and roughly written characters with good performance.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129041798","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 59
Representations and metrics for off-line handwriting segmentation 离线手写分割的表示和度量
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030948
T. Breuel
Segmentation is a key step in many off-line handwriting recognition systems but, to date, there are almost no ground truth segmentation databases and no widely accepted and formally defined metrics for segmentation performance. This paper proposes a representation of segmentations and presegmentations in terms of color images. Such representations allow convenient interchange of ground truth and hypothesized segmentations in the form of standard image formats. The paper formally defines the notions of oversegmentation and undersegmentation in terms of the maximal bipartite match between corresponding pixels. It also defines a number of metrics that quantify the frequency and extent of events in handwriting like kerning, splitting, and merging of characters. It is hoped that these metrics and representations will find wider use in the community and serve as a basis for creating standard training and test databases of segmentation data.
分割是许多离线手写识别系统的关键步骤,但迄今为止,几乎没有真实分割数据库,也没有广泛接受和正式定义的分割性能指标。提出了一种基于彩色图像的分割和预分割的表示方法。这样的表示允许以标准图像格式的形式方便地交换地面真相和假设分割。本文从相应像素间最大二部匹配的角度正式定义了过分割和欠分割的概念。它还定义了一些量化手写事件的频率和范围的指标,如字距调整、分割和合并字符。希望这些度量和表示将在社区中得到更广泛的应用,并作为创建分割数据标准训练和测试数据库的基础。
{"title":"Representations and metrics for off-line handwriting segmentation","authors":"T. Breuel","doi":"10.1109/IWFHR.2002.1030948","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030948","url":null,"abstract":"Segmentation is a key step in many off-line handwriting recognition systems but, to date, there are almost no ground truth segmentation databases and no widely accepted and formally defined metrics for segmentation performance. This paper proposes a representation of segmentations and presegmentations in terms of color images. Such representations allow convenient interchange of ground truth and hypothesized segmentations in the form of standard image formats. The paper formally defines the notions of oversegmentation and undersegmentation in terms of the maximal bipartite match between corresponding pixels. It also defines a number of metrics that quantify the frequency and extent of events in handwriting like kerning, splitting, and merging of characters. It is hoped that these metrics and representations will find wider use in the community and serve as a basis for creating standard training and test databases of segmentation data.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"2012 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121351981","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
A study on the use of CDHMM for large vocabulary off-line recognition of handwritten Chinese characters 基于CDHMM的手写体汉字大词汇离线识别研究
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030932
Yong Ge, Qinah Huo
We (2002) have investigate how to use Gaussian mixture continuous-density hidden Markov models (CDHMMs) for handwritten Chinese character modeling and recognition. We have identified and developed a set of techniques that can be used to construct a practical CDHMM-based off-line recognition system for a large vocabulary of handwritten Chinese characters. We have reported elsewhere the key techniques that contribute to the high recognition accuracy. In this paper we describe how to make our recognizer compact without sacrificing too much of the recognition accuracy. We also report the results of a series of experiments that were performed to help us make a good decision when we face several design choices.
我们(2002)研究了如何使用高斯混合连续密度隐马尔可夫模型(cdhmm)进行手写体汉字建模和识别。我们已经确定并开发了一套技术,可用于构建一个实用的基于cdhmm的大型手写体汉字离线识别系统。我们已经在其他地方报道了有助于提高识别精度的关键技术。在本文中,我们描述了如何在不牺牲太多识别精度的情况下使我们的识别器更紧凑。我们还报告了一系列实验的结果,这些实验是为了帮助我们在面对几个设计选择时做出正确的决定。
{"title":"A study on the use of CDHMM for large vocabulary off-line recognition of handwritten Chinese characters","authors":"Yong Ge, Qinah Huo","doi":"10.1109/IWFHR.2002.1030932","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030932","url":null,"abstract":"We (2002) have investigate how to use Gaussian mixture continuous-density hidden Markov models (CDHMMs) for handwritten Chinese character modeling and recognition. We have identified and developed a set of techniques that can be used to construct a practical CDHMM-based off-line recognition system for a large vocabulary of handwritten Chinese characters. We have reported elsewhere the key techniques that contribute to the high recognition accuracy. In this paper we describe how to make our recognizer compact without sacrificing too much of the recognition accuracy. We also report the results of a series of experiments that were performed to help us make a good decision when we face several design choices.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"7 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131985031","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 12
Stroke level HMMs for on-line handwriting recognition 笔画水平hmm在线手写识别
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030914
T. Artières, P. Gallinari
The recent development of new terminals such as phones, mobile computers, e-books, etc, raises the needs for new interface modalities, in order to replace or complement the traditional mouse/keyboard interface. Ideally, these new interfaces should use limited computing resources and should be easy to adapt to a specific user and to a large variety of user needs. We propose here a new handwriting recognition system that is an attempt to handle these constraints. We evaluate its performances and ability to adapt to new users on a part of the Unipen database.
随着手机、移动电脑、电子书等新型终端的发展,人们对新的界面模式提出了需求,以取代或补充传统的鼠标/键盘界面。理想情况下,这些新的接口应该使用有限的计算资源,并且应该易于适应特定的用户和各种各样的用户需求。我们在这里提出了一个新的手写识别系统,试图处理这些限制。我们在Unipen数据库的一部分上评估其性能和适应新用户的能力。
{"title":"Stroke level HMMs for on-line handwriting recognition","authors":"T. Artières, P. Gallinari","doi":"10.1109/IWFHR.2002.1030914","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030914","url":null,"abstract":"The recent development of new terminals such as phones, mobile computers, e-books, etc, raises the needs for new interface modalities, in order to replace or complement the traditional mouse/keyboard interface. Ideally, these new interfaces should use limited computing resources and should be easy to adapt to a specific user and to a large variety of user needs. We propose here a new handwriting recognition system that is an attempt to handle these constraints. We evaluate its performances and ability to adapt to new users on a part of the Unipen database.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"57 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130999160","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 29
Feature sets evaluation for handwritten word recognition 手写文字识别的特征集评价
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030951
José Josemar de Oliveira, J. Carvalho, C. Freitas, R. Sabourin
This paper presents a baseline system used to evaluate feature sets for word recognition. The main goal is to determine an optimum feature set to represent the handwritten names for the months of the year in Brazilian Portuguese language. Three kinds of features are evaluated: perceptual, directional and topological. The evaluation shows that taken in isolation, the perceptual feature set produces the best results for the lexicon used. These results can be further improved combining the feature sets. The baseline system developed obtains an average recognition rate of 87%. This can be considered a good result considering that no explicit segmentation is performed.
本文提出了一种用于评价词识别特征集的基线系统。主要目标是确定一个最佳的功能集来表示巴西葡萄牙语中一年中的月份的手写名称。评估了三种特征:感性、方向性和拓扑性。评估表明,孤立地考虑,感知特征集对所使用的词典产生了最好的结果。结合特性集可以进一步改进这些结果。开发的基线系统平均识别率为87%。考虑到没有执行显式分割,这可以被认为是一个很好的结果。
{"title":"Feature sets evaluation for handwritten word recognition","authors":"José Josemar de Oliveira, J. Carvalho, C. Freitas, R. Sabourin","doi":"10.1109/IWFHR.2002.1030951","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030951","url":null,"abstract":"This paper presents a baseline system used to evaluate feature sets for word recognition. The main goal is to determine an optimum feature set to represent the handwritten names for the months of the year in Brazilian Portuguese language. Three kinds of features are evaluated: perceptual, directional and topological. The evaluation shows that taken in isolation, the perceptual feature set produces the best results for the lexicon used. These results can be further improved combining the feature sets. The baseline system developed obtains an average recognition rate of 87%. This can be considered a good result considering that no explicit segmentation is performed.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"10 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131076666","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 11
On-line handwriting recognition using character bigram match vectors 在线手写识别使用字符双字母匹配向量
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030886
A. El-Nasan, M. Perrone
Describes an adaptive, partial-word-level, writer,dependent, handwriting recognition system that utilizes the character n-gram statistics of the English language. The system exploits the linguistic property that very few pairs of English words share exactly the same set of character bigrams. This property is used to bring linguistic context to the recognition stage. The recognition is based on, estimating the probability of bigram co-occurrences between words. Preliminary experiments using naive features and limited training sets show that the system can recognize over 60% of words it has never seen before in handwritten form. The system has only few trainable parameters. In addition, incremental training is computationally inexpensive.
描述了一种自适应的、部分单词级的、依赖于书写者的手写识别系统,该系统利用了英语语言的字符n-gram统计。该系统利用了语言特性,即很少有英语单词对共享完全相同的字符集。这一特性用于将语言语境带入识别阶段。该识别是基于估计单词之间双字共现的概率。使用朴素特征和有限训练集的初步实验表明,该系统可以识别超过60%以前从未见过的手写单词。该系统只有很少的可训练参数。此外,增量训练在计算上是廉价的。
{"title":"On-line handwriting recognition using character bigram match vectors","authors":"A. El-Nasan, M. Perrone","doi":"10.1109/IWFHR.2002.1030886","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030886","url":null,"abstract":"Describes an adaptive, partial-word-level, writer,dependent, handwriting recognition system that utilizes the character n-gram statistics of the English language. The system exploits the linguistic property that very few pairs of English words share exactly the same set of character bigrams. This property is used to bring linguistic context to the recognition stage. The recognition is based on, estimating the probability of bigram co-occurrences between words. Preliminary experiments using naive features and limited training sets show that the system can recognize over 60% of words it has never seen before in handwritten form. The system has only few trainable parameters. In addition, incremental training is computationally inexpensive.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"13 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128709128","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An algorithm for online strokes verification of Chinese characters using discrete features 基于离散特征的汉字笔画在线验证算法
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030933
C. K. Tan
Existing Chinese educational software tools teach the students at the primary or kindergarten levels the stroke movements and stroke order of each Chinese character by animating the individual strokes of the whole character, one stroke at a time. In this paper the author presents an algorithm which allows the student to hand-write a specified character online through the computer, and which checks if the individual stroke movements and the strokes' order are correct. The algorithm proposed here uses low-integer-valued coding to represent two categories of features: primitive-stroke features and character features. The proposed algorithm is capable of identifying the typical errors as incorrect movement for the primitive stroke, incorrect stroke type, incorrect relative lengths, incorrect position of a stroke from the rest, incorrect character which looks similar in appearance, incorrect order of strokes and insufficient or extra strokes.
现有的语文教育软件工具通过将整个汉字的单个笔画动画化,一笔一画地教授给小学或幼儿园的学生每个汉字的笔画动作和笔画顺序。本文提出了一种算法,该算法允许学生通过计算机在线手写指定的字符,并检查单个笔画的运动和笔画的顺序是否正确。本文提出的算法使用低整数编码来表示两类特征:原始笔画特征和字符特征。该算法能够识别基本笔画运动不正确、笔画类型不正确、笔画相对长度不正确、笔画位置与其他笔画位置不正确、笔画外观相似、笔画顺序不正确、笔画不足或多余等典型错误。
{"title":"An algorithm for online strokes verification of Chinese characters using discrete features","authors":"C. K. Tan","doi":"10.1109/IWFHR.2002.1030933","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030933","url":null,"abstract":"Existing Chinese educational software tools teach the students at the primary or kindergarten levels the stroke movements and stroke order of each Chinese character by animating the individual strokes of the whole character, one stroke at a time. In this paper the author presents an algorithm which allows the student to hand-write a specified character online through the computer, and which checks if the individual stroke movements and the strokes' order are correct. The algorithm proposed here uses low-integer-valued coding to represent two categories of features: primitive-stroke features and character features. The proposed algorithm is capable of identifying the typical errors as incorrect movement for the primitive stroke, incorrect stroke type, incorrect relative lengths, incorrect position of a stroke from the rest, incorrect character which looks similar in appearance, incorrect order of strokes and insufficient or extra strokes.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"37 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116322627","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 24
Hidden loop recovery for handwriting recognition 隐藏循环恢复手写识别
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030939
D. Doermann, N. Intrator, E. Rivlin, T. Steinherz
One significant challenge in the recognition of off-line handwriting is in the interpretation of loop structures. Although this information is readily available in online representation, close proximity of strokes often merges their centers making them difficult to identify. In this paper a novel approach to the recovery of hidden loops in off-line scanned document images is presented. The proposed algorithm seeks blobs that resemble truncated ellipses. We use a sophisticated form analysis method based on mutual distance measurements between the two sides of a symmetric shape. The experimental results are compared with the ground truth of the online representations of each off-line word image. More than 86% percent of the meaningful loops are handled correctly.
识别离线笔迹的一个重大挑战是循环结构的解释。虽然这些信息在网上很容易获得,但近距离的笔画通常会合并它们的中心,使它们难以识别。本文提出了一种从离线扫描文档图像中恢复隐藏环路的新方法。提出的算法寻找类似于截断的椭圆的斑点。我们使用了一种基于对称形状两边相互距离测量的复杂形式分析方法。实验结果与每个离线词图像的在线表示的基础真值进行了比较。超过86%的有意义的循环得到了正确处理。
{"title":"Hidden loop recovery for handwriting recognition","authors":"D. Doermann, N. Intrator, E. Rivlin, T. Steinherz","doi":"10.1109/IWFHR.2002.1030939","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030939","url":null,"abstract":"One significant challenge in the recognition of off-line handwriting is in the interpretation of loop structures. Although this information is readily available in online representation, close proximity of strokes often merges their centers making them difficult to identify. In this paper a novel approach to the recovery of hidden loops in off-line scanned document images is presented. The proposed algorithm seeks blobs that resemble truncated ellipses. We use a sophisticated form analysis method based on mutual distance measurements between the two sides of a symmetric shape. The experimental results are compared with the ground truth of the online representations of each off-line word image. More than 86% percent of the meaningful loops are handled correctly.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133827534","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 10
Writer adaptation techniques in off-line cursive word recognition 离线草书词识别中的写作者改编技术
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030924
A. Vinciarelli, Samy Bengio
This work presents the application of HMM adaptation techniques to the problem of off-line cursive script recognition. Instead of training a new model for each writer one first creates a unique model with a mixed database and then adapts it for each different writer using his own small dataset. Experiments on a publicly available benchmark database show that an adapted system has an accuracy higher than 80% even when less than 30 word samples are used during adaptation, while a system trained using the data of the single writer only needs at least 200 words (the estimate is a lower bound) in order to achieve the same performance as the adapted models.
本文提出了HMM自适应技术在离线草书识别中的应用。不是为每个作家训练一个新模型,而是首先使用混合数据库创建一个独特的模型,然后使用自己的小数据集对每个不同的作家进行调整。在一个公开可用的基准数据库上的实验表明,即使在适应过程中使用的单词样本少于30个,适应性系统的准确率也高于80%,而使用单个作者的数据训练的系统只需要至少200个单词(估计是一个下界)就可以达到与适应性模型相同的性能。
{"title":"Writer adaptation techniques in off-line cursive word recognition","authors":"A. Vinciarelli, Samy Bengio","doi":"10.1109/IWFHR.2002.1030924","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030924","url":null,"abstract":"This work presents the application of HMM adaptation techniques to the problem of off-line cursive script recognition. Instead of training a new model for each writer one first creates a unique model with a mixed database and then adapts it for each different writer using his own small dataset. Experiments on a publicly available benchmark database show that an adapted system has an accuracy higher than 80% even when less than 30 word samples are used during adaptation, while a system trained using the data of the single writer only needs at least 200 words (the estimate is a lower bound) in order to achieve the same performance as the adapted models.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130793319","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
Writer identification by writer's invariants 通过编写器的不变量来标识编写器
Pub Date : 2002-08-06 DOI: 10.1109/IWFHR.2002.1030922
A. Bensefia, A. Nosary, T. Paquet, L. Heutte
This communication deals with the problem of writer identification. If the assumption of writing individuality is true then graphical fragments that constitute it should be individual too. Therefore we propose a morphological grapheme based analysis to make writer identification. Template Matching is the core of the approach. The redundancy of the individual patterns in a writing, defined as the writer's invariants, allows to compress the handwritten texts while maintaining good identification performance. Two series of tests are reported. The first series is designed to evaluate the relevance of our approach of identification on a basis of 88 writers by evaluating the influence of the text representation (with or without invariants) on the quality of the method. The method gives about 97,7% of correct identification when using large compressed samples of handwriting. The second series of tests is designed to evaluate the influence of the sample size of the writing to be identified on the quality of the method. It is shown that writer identification can reach a correct identification rate of 92,9% using only samples of 50 graphemes of each writing.
这个通信处理的是作者身份的问题。如果写作个体性的假设是正确的,那么构成它的图形片段也应该是个体性的。因此,我们提出了一种基于词素分析的写作者识别方法。模板匹配是该方法的核心。书写中单个模式的冗余(定义为书写者的不变量)允许压缩手写文本,同时保持良好的识别性能。报告了两组试验。第一个系列旨在通过评估文本表示(带或不带不变量)对方法质量的影响,在88位作者的基础上评估我们识别方法的相关性。当使用大量压缩的手写样本时,该方法的正确率约为97.7%。第二组测试旨在评估待识别的写作样本量对方法质量的影响。结果表明,仅使用每种文字的50个字素样本,写作者识别的正确率就可以达到92.9%。
{"title":"Writer identification by writer's invariants","authors":"A. Bensefia, A. Nosary, T. Paquet, L. Heutte","doi":"10.1109/IWFHR.2002.1030922","DOIUrl":"https://doi.org/10.1109/IWFHR.2002.1030922","url":null,"abstract":"This communication deals with the problem of writer identification. If the assumption of writing individuality is true then graphical fragments that constitute it should be individual too. Therefore we propose a morphological grapheme based analysis to make writer identification. Template Matching is the core of the approach. The redundancy of the individual patterns in a writing, defined as the writer's invariants, allows to compress the handwritten texts while maintaining good identification performance. Two series of tests are reported. The first series is designed to evaluate the relevance of our approach of identification on a basis of 88 writers by evaluating the influence of the text representation (with or without invariants) on the quality of the method. The method gives about 97,7% of correct identification when using large compressed samples of handwriting. The second series of tests is designed to evaluate the influence of the sample size of the writing to be identified on the quality of the method. It is shown that writer identification can reach a correct identification rate of 92,9% using only samples of 50 graphemes of each writing.","PeriodicalId":114017,"journal":{"name":"Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2002-08-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115036705","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 86
期刊
Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1