首页 > 最新文献

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)最新文献

英文 中文
Using Grammar-Based Recognizers for Symbol Completion in Diagrammatic Sketches 用基于语法的识别器完成图解草图中的符号补全
G. Costagliola, V. Deufemia, M. Risi
Sketching is considered as a way to naturally express ideas during the early phases of design. For this reason, many efforts have been made to develop user interfaces and recognizers, which enable users to create sketches using pen-based devices. However, in some domains, such as in architectural and engineering fields, the drawing process turns out to be particularly tedious and time-consuming, since the symbols to be drawn may have a complex shape and recur many times in the sketches. In this paper we present a technique for symbol completion that allows users to rapidly draw diagrammatic sketches. The completion technique recovers the information on missing strokes by interacting with symbol recognizers, which are automatically generated from grammar specifications. Moreover, in order to maintain the sketch layout more familiar to the users, the added strokes are drawn according to the user drawing style.
在设计的早期阶段,草图被认为是一种自然表达想法的方式。出于这个原因,已经做出了许多努力来开发用户界面和识别器,使用户能够使用基于笔的设备创建草图。然而,在某些领域,如建筑和工程领域,绘制过程变得特别繁琐和耗时,因为要绘制的符号可能具有复杂的形状,并且在草图中反复出现多次。在本文中,我们提出了一种符号补全技术,允许用户快速绘制图解草图。补全技术通过与语法规范自动生成的符号识别器交互来恢复缺失笔画的信息。此外,为了保持草图布局更熟悉用户,增加的笔画是根据用户的绘画风格绘制的。
{"title":"Using Grammar-Based Recognizers for Symbol Completion in Diagrammatic Sketches","authors":"G. Costagliola, V. Deufemia, M. Risi","doi":"10.1109/ICDAR.2007.259","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.259","url":null,"abstract":"Sketching is considered as a way to naturally express ideas during the early phases of design. For this reason, many efforts have been made to develop user interfaces and recognizers, which enable users to create sketches using pen-based devices. However, in some domains, such as in architectural and engineering fields, the drawing process turns out to be particularly tedious and time-consuming, since the symbols to be drawn may have a complex shape and recur many times in the sketches. In this paper we present a technique for symbol completion that allows users to rapidly draw diagrammatic sketches. The completion technique recovers the information on missing strokes by interacting with symbol recognizers, which are automatically generated from grammar specifications. Moreover, in order to maintain the sketch layout more familiar to the users, the added strokes are drawn according to the user drawing style.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131179583","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Fast Selection of Small and Precise Candidate Sets from Dictionaries for Text Correction Tasks 从字典中快速选择小而精确的候选集用于文本校正任务
K. Schulz, S. Mihov, Petar Mitankin
Lexical text correction relies on a central step where approximate search in a dictionary is used to select the best correction suggestions for an ill-formed input token. In previous work we introduced the concept of a universal Levenshtein automaton and showed how to use these automata for efficiently selecting from a dictionary all entries within a fixed Levenshtein distance to the garbled input word. In this paper we look at refinements of the basic Levenshtein distance that yield more sensible notions of similarity in distinct text correction applications, e.g. OCR. We show that the concept of a universal Levenshtein automaton can be adapted to these refinements. In this way we obtain a method for selecting correction candidates which is very efficient, at the same time selecting small candidate sets with high recall.
词法文本校正依赖于一个中心步骤,其中使用字典中的近似搜索来为格式错误的输入标记选择最佳校正建议。在之前的工作中,我们介绍了通用Levenshtein自动机的概念,并展示了如何使用这些自动机有效地从字典中选择与乱码输入单词在固定Levenshtein距离内的所有条目。在本文中,我们研究了基本Levenshtein距离的改进,从而在不同的文本校正应用(例如OCR)中产生更合理的相似性概念。我们证明了通用Levenshtein自动机的概念可以适应这些改进。通过这种方法,我们获得了一种高效的选择校正候选的方法,同时选择了具有高召回率的小候选集。
{"title":"Fast Selection of Small and Precise Candidate Sets from Dictionaries for Text Correction Tasks","authors":"K. Schulz, S. Mihov, Petar Mitankin","doi":"10.1109/ICDAR.2007.119","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.119","url":null,"abstract":"Lexical text correction relies on a central step where approximate search in a dictionary is used to select the best correction suggestions for an ill-formed input token. In previous work we introduced the concept of a universal Levenshtein automaton and showed how to use these automata for efficiently selecting from a dictionary all entries within a fixed Levenshtein distance to the garbled input word. In this paper we look at refinements of the basic Levenshtein distance that yield more sensible notions of similarity in distinct text correction applications, e.g. OCR. We show that the concept of a universal Levenshtein automaton can be adapted to these refinements. In this way we obtain a method for selecting correction candidates which is very efficient, at the same time selecting small candidate sets with high recall.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"20 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131375563","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 15
iGesture: A General Gesture Recognition Framework 手势:一个通用的手势识别框架
B. Signer, U. Kurmann, M. Norrie
With the emergence of digital pen and paper interfaces, there is a need for gesture recognition tools for digital pen input. While there exists a variety of gesture recognition frameworks, none of them addresses the issues of supporting application developers as well as the designers of new recognition algorithms and, at the same time, can be integrated with new forms of input devices such as digital pens. We introduce iGesture, a Java-based gesture recognition framework focusing on extensibility and cross-application reusability by providing an integrated solution that includes tools for gesture recognition as well as the creation and management of gesture sets for the evaluation and optimisation of new or existing gesture recognition algorithms. In addition to traditional screen-based interaction, iGesture provides a digital pen and paper interface.
随着数字笔和纸界面的出现,需要针对数字笔输入的手势识别工具。虽然存在各种各样的手势识别框架,但它们都没有解决支持应用程序开发人员以及新识别算法设计人员的问题,同时也不能与数字笔等新形式的输入设备集成。我们介绍了iGesture,一个基于java的手势识别框架,通过提供一个集成的解决方案,包括手势识别工具,以及用于评估和优化新的或现有的手势识别算法的手势集的创建和管理,专注于可扩展性和跨应用程序可重用性。除了传统的基于屏幕的交互之外,iGesture还提供了数字笔和纸界面。
{"title":"iGesture: A General Gesture Recognition Framework","authors":"B. Signer, U. Kurmann, M. Norrie","doi":"10.1109/ICDAR.2007.139","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.139","url":null,"abstract":"With the emergence of digital pen and paper interfaces, there is a need for gesture recognition tools for digital pen input. While there exists a variety of gesture recognition frameworks, none of them addresses the issues of supporting application developers as well as the designers of new recognition algorithms and, at the same time, can be integrated with new forms of input devices such as digital pens. We introduce iGesture, a Java-based gesture recognition framework focusing on extensibility and cross-application reusability by providing an integrated solution that includes tools for gesture recognition as well as the creation and management of gesture sets for the evaluation and optimisation of new or existing gesture recognition algorithms. In addition to traditional screen-based interaction, iGesture provides a digital pen and paper interface.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"3 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"121867685","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 75
A Classifier of Similar Characters using Compound Mahalanobis Function based on Difference Subspace 基于差分子空间的复合Mahalanobis函数相似字符分类器
J. Hirayama, Hidehisa Nakayama, N. Kato
To distinguish similar characters, it is preferable to construct a classifier using a projective feature space which differentiates two similar categories. The classifier CMF has been proposed for a discriminant function, in similar characters recognition. In the CMF, a subspace is constructed by some eigenvectors, that corresponds to the smallest eigenvalues, is applied as projective feature space. A difference vector of two class-mean feature vectors are assumed as the difference between two similar categories, the CMF is constructed by projecting a feature vector onto this difference vector. In this paper, we propose new discriminant function expanding the CMF. In proposed method, we treat the Difference Subspace, which is difference between two subspaces as difference between two similar categories. The efficiency of the proposed new discriminant function has been demonstrated in similar characters recognition through extensive experiments on hand-written Japanese characters derived from the ETL9B database.
为了区分相似的字符,最好使用区分两个相似类别的射影特征空间构造分类器。提出了一种用于相似字符识别的判别函数CMF分类器。在CMF中,由若干对应于最小特征值的特征向量构成子空间,作为射影特征空间。假设两个类均值特征向量的差向量为两个相似类别之间的差,通过将特征向量投影到该差向量上构建CMF。本文提出了一种新的判别函数,对CMF进行了扩展。在该方法中,我们将两个子空间之间的差异视为两个相似范畴之间的差异。通过对来自ETL9B数据库的手写日文进行大量实验,证明了该判别函数在相似字符识别中的有效性。
{"title":"A Classifier of Similar Characters using Compound Mahalanobis Function based on Difference Subspace","authors":"J. Hirayama, Hidehisa Nakayama, N. Kato","doi":"10.1109/ICDAR.2007.4","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.4","url":null,"abstract":"To distinguish similar characters, it is preferable to construct a classifier using a projective feature space which differentiates two similar categories. The classifier CMF has been proposed for a discriminant function, in similar characters recognition. In the CMF, a subspace is constructed by some eigenvectors, that corresponds to the smallest eigenvalues, is applied as projective feature space. A difference vector of two class-mean feature vectors are assumed as the difference between two similar categories, the CMF is constructed by projecting a feature vector onto this difference vector. In this paper, we propose new discriminant function expanding the CMF. In proposed method, we treat the Difference Subspace, which is difference between two subspaces as difference between two similar categories. The efficiency of the proposed new discriminant function has been demonstrated in similar characters recognition through extensive experiments on hand-written Japanese characters derived from the ETL9B database.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"25 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134579434","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2
An EM Based Algorithm for Skew Detection 一种基于EM的倾斜检测算法
A. Egozi, I. Dinstein, J. Chapran, M. Fairhurst
We present a a statistical approach to skew detection, where the textual features of a document image are modeled as a mixture of straight lines in Gaussian noise. The EM algorithm is used to estimate the parameters of the mixture model and the skew angle estimate is extracted from the estimated parameters. Experiments prove that our method has some advantages over other existing methods in terms of accuracy and efficiency.
我们提出了一种歪斜检测的统计方法,其中文档图像的文本特征被建模为高斯噪声中直线的混合物。利用电磁算法对混合模型参数进行估计,并从估计参数中提取偏角估计。实验证明,该方法在精度和效率方面都优于现有的方法。
{"title":"An EM Based Algorithm for Skew Detection","authors":"A. Egozi, I. Dinstein, J. Chapran, M. Fairhurst","doi":"10.1109/ICDAR.2007.52","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.52","url":null,"abstract":"We present a a statistical approach to skew detection, where the textual features of a document image are modeled as a mixture of straight lines in Gaussian noise. The EM algorithm is used to estimate the parameters of the mixture model and the skew angle estimate is extracted from the estimated parameters. Experiments prove that our method has some advantages over other existing methods in terms of accuracy and efficiency.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"115793901","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Handwritten Chinese Character Recognition Using Modified LDA and Kernel FDA 基于改进LDA和核FDA的手写汉字识别
Duanduan Yang, Lianwen Jin
The effectiveness of kernel fisher discrimination analysis (KFDA) has been demonstrated by many pattern recognition applications. However, due to the large size of Gram matrix to be trained, how to use KFDA to solve large vocabulary pattern recognition task such as Chinese Characters recognition is still a challenging problem. In this paper, a two-stage KFDA approach is presented for handwritten Chinese character recognition. In the first stage, a new modified linear discriminant analysis method is developed to get the recognition candidates. In the second stage, KFDA is used to determine the final recognition result. Experiments on 1034 categories of Chinese character from 120 sets of handwriting samples shows that a 3.37% improvement of recognition rate is obtained, which suggests the effectiveness of the proposed method.
核费雪判别分析(KFDA)的有效性已被许多模式识别应用所证明。然而,由于待训练的Gram矩阵规模较大,如何利用KFDA解决像汉字识别这样的大词汇量模式识别任务仍然是一个具有挑战性的问题。本文提出了一种两阶段KFDA的手写体汉字识别方法。在第一阶段,提出了一种新的改进的线性判别分析方法来获得识别候选者。在第二阶段,由KFDA确定最终的识别结果。对120组手写样本中的1034类汉字进行了实验,结果表明,该方法的识别率提高了3.37%,表明了该方法的有效性。
{"title":"Handwritten Chinese Character Recognition Using Modified LDA and Kernel FDA","authors":"Duanduan Yang, Lianwen Jin","doi":"10.1109/ICDAR.2007.128","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.128","url":null,"abstract":"The effectiveness of kernel fisher discrimination analysis (KFDA) has been demonstrated by many pattern recognition applications. However, due to the large size of Gram matrix to be trained, how to use KFDA to solve large vocabulary pattern recognition task such as Chinese Characters recognition is still a challenging problem. In this paper, a two-stage KFDA approach is presented for handwritten Chinese character recognition. In the first stage, a new modified linear discriminant analysis method is developed to get the recognition candidates. In the second stage, KFDA is used to determine the final recognition result. Experiments on 1034 categories of Chinese character from 120 sets of handwriting samples shows that a 3.37% improvement of recognition rate is obtained, which suggests the effectiveness of the proposed method.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132559338","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 3
A hybrid approach for off-line Arabic handwriting recognition based on a Planar Hidden Markov modeling 一种基于平面隐马尔可夫建模的离线阿拉伯手写识别混合方法
Sameh Masmoudi Touj, N. Amara, H. Amiri
A novel approach for the Arabic handwriting recognition is presented. The use of a planar hidden Markov model (PHMM) has permitted to split the Arabic script into five homogeneous horizontal regions. Each region was described by a 1D-HMM. This modeling is based on different levels of segmentation: horizontal, natural and vertical. Both holistic and analytical approaches have been tested for the description of the median band of the Arabic writing. We show finally that a hybrid approach conducted to the improvement of the whole system performances.
提出了一种新的阿拉伯语手写识别方法。平面隐马尔可夫模型(PHMM)的使用允许将阿拉伯文字划分为五个均匀的水平区域。每个区域用1D-HMM来描述。这种建模基于不同层次的分割:水平、自然和垂直。对于阿拉伯文字的中间带的描述,整体性和分析性两种方法都进行了测试。最后,我们证明了一种混合方法可以改善整个系统的性能。
{"title":"A hybrid approach for off-line Arabic handwriting recognition based on a Planar Hidden Markov modeling","authors":"Sameh Masmoudi Touj, N. Amara, H. Amiri","doi":"10.1109/ICDAR.2007.14","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.14","url":null,"abstract":"A novel approach for the Arabic handwriting recognition is presented. The use of a planar hidden Markov model (PHMM) has permitted to split the Arabic script into five homogeneous horizontal regions. Each region was described by a 1D-HMM. This modeling is based on different levels of segmentation: horizontal, natural and vertical. Both holistic and analytical approaches have been tested for the description of the median band of the Arabic writing. We show finally that a hybrid approach conducted to the improvement of the whole system performances.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"134096675","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
A Two Stage Recognition Scheme for Handwritten Tamil Characters 手写泰米尔字符的两阶段识别方案
U. Bhattacharya, S. Ghosh, S. K. Parui
India is a multilingual multiscript country with more than 18 languages and 10 different major scripts. Not enough research work towards recognition of handwritten characters of these Indian scripts has been done. Tamil, an official as well as popular script of the southern part of India, Singapore, Malaysia, and Sri Lanka has a large character set which includes many compound characters. Only a few works towards handwriting recognition of this large character set has been reported in the literature. Recently, HP Labs India developed a database of handwritten Tamil characters. In the present paper, we describe an off-line recognition approach based on this database. The proposed method consists of two stages. In the first stage, we apply an unsupervised clustering method to create a smaller number of groups of handwritten Tamil character classes. In the second stage, we consider a supervised classification technique in each of these smaller groups for final recognition. The features considered in the two stages are different. The proposed two-stage recognition scheme provided acceptable classification accuracies on both the training and test sets of the present database.
印度是一个多语言多文字的国家,有超过18种语言和10种不同的主要文字。对这些印度文字的手写体的识别研究工作还不够。泰米尔语是印度南部、新加坡、马来西亚和斯里兰卡的一种官方和流行的文字,它有一个很大的字符集,其中包括许多复合字。只有少数的工作对这种大字符集的手写识别已在文献中报道。最近,惠普印度实验室开发了一个手写泰米尔文字数据库。在本文中,我们描述了一种基于该数据库的离线识别方法。该方法分为两个阶段。在第一阶段,我们应用无监督聚类方法来创建较少数量的手写泰米尔字符类组。在第二阶段,我们考虑在每个较小的组中使用监督分类技术进行最终识别。这两个阶段所考虑的特性是不同的。提出的两阶段识别方案在现有数据库的训练集和测试集上都提供了可接受的分类精度。
{"title":"A Two Stage Recognition Scheme for Handwritten Tamil Characters","authors":"U. Bhattacharya, S. Ghosh, S. K. Parui","doi":"10.1109/ICDAR.2007.37","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.37","url":null,"abstract":"India is a multilingual multiscript country with more than 18 languages and 10 different major scripts. Not enough research work towards recognition of handwritten characters of these Indian scripts has been done. Tamil, an official as well as popular script of the southern part of India, Singapore, Malaysia, and Sri Lanka has a large character set which includes many compound characters. Only a few works towards handwriting recognition of this large character set has been reported in the literature. Recently, HP Labs India developed a database of handwritten Tamil characters. In the present paper, we describe an off-line recognition approach based on this database. The proposed method consists of two stages. In the first stage, we apply an unsupervised clustering method to create a smaller number of groups of handwritten Tamil character classes. In the second stage, we consider a supervised classification technique in each of these smaller groups for final recognition. The features considered in the two stages are different. The proposed two-stage recognition scheme provided acceptable classification accuracies on both the training and test sets of the present database.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131686054","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 58
Pàtrà: A Novel Document Architecture for Integrating Handwriting with Audio-Visual Information Pàtrà:一种集成手写和视听信息的新型文档体系结构
Gaurav Harit, V. Mankar, S. Chaudhury
In this paper we present Patra - an integrated document architecture which incorporates handwritten illustrations captured and rendered in a temporal fashion synchronized with audio, video, text, and image data. The architecture of Patra permits non-linear growth in the form of multiple hierarchically organized play streams. Semantic metadata is also an integral part of Patra which serves a useful purpose of organizing such documents in a collection. We have developed an email application in which the users are provided with an authoring and rendering environment to compose, view, and reply to messages in the form of Patra.
在本文中,我们介绍了Patra——一个集成的文档架构,它包含了以与音频、视频、文本和图像数据同步的时间方式捕获和渲染的手写插图。Patra的架构允许以多种层次组织的游戏流的形式非线性增长。语义元数据也是Patra的一个组成部分,它有助于在集合中组织这样的文档。我们开发了一个电子邮件应用程序,为用户提供了一个创作和呈现环境,以便以Patra的形式编写、查看和回复消息。
{"title":"Pàtrà: A Novel Document Architecture for Integrating Handwriting with Audio-Visual Information","authors":"Gaurav Harit, V. Mankar, S. Chaudhury","doi":"10.1109/ICDAR.2007.204","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.204","url":null,"abstract":"In this paper we present Patra - an integrated document architecture which incorporates handwritten illustrations captured and rendered in a temporal fashion synchronized with audio, video, text, and image data. The architecture of Patra permits non-linear growth in the form of multiple hierarchically organized play streams. Semantic metadata is also an integral part of Patra which serves a useful purpose of organizing such documents in a collection. We have developed an email application in which the users are provided with an authoring and rendering environment to compose, view, and reply to messages in the form of Patra.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132187379","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 4
Hidden Markov Models for Online Handwritten Tamil Word Recognition 隐马尔可夫模型用于在线手写泰米尔语单词识别
A. Bharath, S. Madhvanath
Hidden Markov models (HMM) have long been a popular choice for Western cursive handwriting recognition following their success in speech recognition. Even for the recognition of Oriental scripts such as Chinese, Japanese and Korean, hidden Markov models are increasingly being used to model substrokes of characters. However, when it comes to Indie script recognition, the published work employing HMMs is limited, and generally focussed on isolated character recognition. In this effort, a data-driven HMM-based online handwritten word recognition system for Tamil, an Indie script, is proposed. The accuracies obtained ranged from 98% to 92.2% with different lexicon sizes (IK to 20 K words). These initial results are promising and warrant further research in this direction. The results are also encouraging to explore possibilities for adopting the approach to other Indie scripts as well.
隐马尔可夫模型(HMM)在语音识别领域取得成功后,一直是西方手写体识别领域的热门选择。即使是汉字、日文、韩文等东方文字的识别,也越来越多地使用隐马尔可夫模型来模拟汉字的笔划。然而,当涉及到独立脚本识别时,使用hmm的出版作品是有限的,并且通常集中在孤立的字符识别上。在此基础上,提出了一种基于数据驱动的独立语言泰米尔语的在线手写单词识别系统。在不同的词汇量(IK到20k单词)下,准确率在98%到92.2%之间。这些初步结果是有希望的,值得在这个方向上进一步研究。结果也鼓励我们探索将这种方法应用于其他独立脚本的可能性。
{"title":"Hidden Markov Models for Online Handwritten Tamil Word Recognition","authors":"A. Bharath, S. Madhvanath","doi":"10.1109/ICDAR.2007.131","DOIUrl":"https://doi.org/10.1109/ICDAR.2007.131","url":null,"abstract":"Hidden Markov models (HMM) have long been a popular choice for Western cursive handwriting recognition following their success in speech recognition. Even for the recognition of Oriental scripts such as Chinese, Japanese and Korean, hidden Markov models are increasingly being used to model substrokes of characters. However, when it comes to Indie script recognition, the published work employing HMMs is limited, and generally focussed on isolated character recognition. In this effort, a data-driven HMM-based online handwritten word recognition system for Tamil, an Indie script, is proposed. The accuracies obtained ranged from 98% to 92.2% with different lexicon sizes (IK to 20 K words). These initial results are promising and warrant further research in this direction. The results are also encouraging to explore possibilities for adopting the approach to other Indie scripts as well.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"63 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133060010","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 68
期刊
Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1