首页 > 最新文献

Proceedings of Sixth International Conference on Document Analysis and Recognition最新文献

英文 中文
Bayesian network modeling of strokes and their relationships for on-line handwriting recognition 在线手写识别中笔画及其关系的贝叶斯网络建模
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953760
Sung-Jung Cho, J. H. Kim
It is important to model strokes and their relationships for on-line handwriting recognition, because they reflect character structures. We propose to model them explicitly and statistically with Bayesian networks. A character is modeled with stroke models and their relationships. Strokes, parts of handwriting traces that are approximately linear, are modeled with a set of point models and their relationships. Points are modeled with conditional probability tables and distributions for pen status and X, Y positions in the 2-D space, given the information of related points. A Bayesian network is adopted to represent a character model, whose nodes correspond to point models and arcs their dependencies. The proposed system was tested on the recognition of on-line handwritten digits. It showed higher recognition rates than the HMM based recognizer with chaincode features and was comparable to other published systems.
对笔画及其关系进行建模对于在线手写识别非常重要,因为它们反映了字符结构。我们建议用贝叶斯网络对它们进行明确和统计的建模。一个人物是用笔画模型和它们的关系来建模的。笔画,笔迹痕迹中近似线性的部分,用一组点模型和它们之间的关系来建模。根据相关点的信息,用条件概率表和笔的状态以及在二维空间中的X、Y位置的分布对点进行建模。采用贝叶斯网络表示人物模型,其中节点对应点模型,弧线对应点模型的依赖关系。对该系统进行了在线手写体数字识别实验。它比基于HMM的链码特征识别器显示出更高的识别率,并且与其他已发表的系统相当。
{"title":"Bayesian network modeling of strokes and their relationships for on-line handwriting recognition","authors":"Sung-Jung Cho, J. H. Kim","doi":"10.1109/ICDAR.2001.953760","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953760","url":null,"abstract":"It is important to model strokes and their relationships for on-line handwriting recognition, because they reflect character structures. We propose to model them explicitly and statistically with Bayesian networks. A character is modeled with stroke models and their relationships. Strokes, parts of handwriting traces that are approximately linear, are modeled with a set of point models and their relationships. Points are modeled with conditional probability tables and distributions for pen status and X, Y positions in the 2-D space, given the information of related points. A Bayesian network is adopted to represent a character model, whose nodes correspond to point models and arcs their dependencies. The proposed system was tested on the recognition of on-line handwritten digits. It showed higher recognition rates than the HMM based recognizer with chaincode features and was comparable to other published systems.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133165944","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 63
A Bezier curve-based approach to shape description for Chinese calligraphy characters 基于贝塞尔曲线的汉字形状描述方法
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953798
Hsi-Ming Yang, Jainn-Jyh Lu, Hsi-Jian Lee
In this paper, we propose a method to vectorize Chinese characters in calligraphy documents. Our system can prevent the zigzag phenomena when the characters are enlarged. The system contains two modules: contour segment extraction and description. In the former, high curvature points on contours are detected as corner points, which divide the contour into several segments. In the latter, a contour segment can be described either by a straight line or a cubic Bezier curve. According to relations between the contour segment and the Bezier curve, control points are adjusted to fit the contour segment better. When the curve fitness cost is small enough, the shape is described well. The processing time of our curve fitting is about five seconds per A4 image, which has 4488 contour segments. Experimental results demonstrate that our system is efficient and promising.
本文提出了一种书法文档中汉字矢量化的方法。我们的系统可以防止字符放大时出现之字形现象。该系统包含轮廓段提取和轮廓描述两个模块。前者将轮廓上的高曲率点检测为角点,角点将轮廓分割为若干段;在后者中,轮廓段可以用直线或三次贝塞尔曲线来描述。根据轮廓段与Bezier曲线的关系,调整控制点以更好地拟合轮廓段。当曲线适应度代价足够小时,可以很好地描述曲线的形状。我们的曲线拟合处理时间约为每张A4图像5秒,该图像有4488个轮廓段。实验结果证明了该系统的有效性和应用前景。
{"title":"A Bezier curve-based approach to shape description for Chinese calligraphy characters","authors":"Hsi-Ming Yang, Jainn-Jyh Lu, Hsi-Jian Lee","doi":"10.1109/ICDAR.2001.953798","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953798","url":null,"abstract":"In this paper, we propose a method to vectorize Chinese characters in calligraphy documents. Our system can prevent the zigzag phenomena when the characters are enlarged. The system contains two modules: contour segment extraction and description. In the former, high curvature points on contours are detected as corner points, which divide the contour into several segments. In the latter, a contour segment can be described either by a straight line or a cubic Bezier curve. According to relations between the contour segment and the Bezier curve, control points are adjusted to fit the contour segment better. When the curve fitness cost is small enough, the shape is described well. The processing time of our curve fitting is about five seconds per A4 image, which has 4488 contour segments. Experimental results demonstrate that our system is efficient and promising.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"48 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"126870260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 53
Automatic signature verification based on the dynamic feature of pressure 基于压力动态特征的自动签名验证
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953945
K. Tanabe, M. Yoshihara, H. Kameya, S. Mori, S. Omata, Tatsuro Ito
A feasibility experiment on an automatic signature verification system based on writing pressure was conducted using a new device sensing the z-axis component of writing pressure. The data acquisitions were conducted over a half a year for checking both subjects' and device stabilities. The DP (dynamic programming) matching method provided 6% error rates for type I and type II errors. The nature of signature writing pressure was investigated.
利用传感书写压力z轴分量的装置,对基于书写压力的自动签名验证系统进行了可行性实验。数据采集进行了半年多的时间,以检查受试者和设备的稳定性。DP(动态规划)匹配方法对第一类和第二类错误提供了6%的错误率。对签名书写压力的性质进行了研究。
{"title":"Automatic signature verification based on the dynamic feature of pressure","authors":"K. Tanabe, M. Yoshihara, H. Kameya, S. Mori, S. Omata, Tatsuro Ito","doi":"10.1109/ICDAR.2001.953945","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953945","url":null,"abstract":"A feasibility experiment on an automatic signature verification system based on writing pressure was conducted using a new device sensing the z-axis component of writing pressure. The data acquisitions were conducted over a half a year for checking both subjects' and device stabilities. The DP (dynamic programming) matching method provided 6% error rates for type I and type II errors. The nature of signature writing pressure was investigated.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"369 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116118526","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
A graph-based segmentation and feature extraction framework for Arabic text recognition 基于图的阿拉伯语文本识别分割和特征提取框架
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953864
A. Elgammal, M. Ismail
This paper presents a graph-based framework for the segmentation of Arabic text. The same framework is used to extract font independent structural features from the text that are used in the recognition. The major contribution of this paper is a new graph-based structural segmentation approach based on the topological relation between the baseline and the line adjacency graph representation of the text. The text is segmented to sub-character units that we call "scripts". A structure analysis approach is used for recognition of these units. A different classifier is used to recognize dots and diacritic signs. The final character recognition is achieved by using a regular grammar that describes how characters are composed from scripts.
提出了一种基于图的阿拉伯语文本分词框架。使用相同的框架从识别中使用的文本中提取与字体无关的结构特征。本文的主要贡献是基于文本的基线和线邻接图表示之间的拓扑关系,提出了一种新的基于图的结构分割方法。文本被分割成子字符单元,我们称之为“脚本”。采用结构分析方法对这些单元进行识别。使用不同的分类器来识别点和变音符号。最后的字符识别是通过使用描述如何从脚本组成字符的规则语法来实现的。
{"title":"A graph-based segmentation and feature extraction framework for Arabic text recognition","authors":"A. Elgammal, M. Ismail","doi":"10.1109/ICDAR.2001.953864","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953864","url":null,"abstract":"This paper presents a graph-based framework for the segmentation of Arabic text. The same framework is used to extract font independent structural features from the text that are used in the recognition. The major contribution of this paper is a new graph-based structural segmentation approach based on the topological relation between the baseline and the line adjacency graph representation of the text. The text is segmented to sub-character units that we call \"scripts\". A structure analysis approach is used for recognition of these units. A different classifier is used to recognize dots and diacritic signs. The final character recognition is achieved by using a regular grammar that describes how characters are composed from scripts.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"19 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"122121959","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 47
Document understanding using probabilistic relaxation: application on tables of contents of periodicals 基于概率松弛的文献理解:在期刊目录上的应用
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953841
Frank Lebourgeois, H. Emptoz, S. Souafi-Bensafi
This paper describes a statistical model for a document understanding system, which uses both text attributes and document layouts. Probabilistic relaxation is used as a recognition scheme to find the hierarchical structure of the logical layout. This approach, commonly used for pixels classification in image analysis, can be applied to classify text blocks into logical classes according to local compatibility with other neighboring blocks at different hierarchical levels. It provides a logical layout that is globally compatible with the training model. We have tested this approach on reading tables of contents of periodicals for documents indexing. Probabilistic relaxation has interesting properties like high-speed training and the 'a priori' recognition rate, which provides the consistency of the model according to the features used, and the samples chosen among the training set.
本文描述了一个同时使用文本属性和文档布局的文档理解系统的统计模型。使用概率松弛作为识别方案来寻找逻辑布局的层次结构。这种方法通常用于图像分析中的像素分类,它可以根据文本块在不同层次上与其他相邻块的局部兼容性将文本块划分为逻辑类。它提供了一个与训练模型全局兼容的逻辑布局。我们在期刊目录检索中对这种方法进行了测试。概率松弛有一些有趣的特性,比如高速训练和“先验”识别率,它根据所使用的特征和在训练集中选择的样本提供模型的一致性。
{"title":"Document understanding using probabilistic relaxation: application on tables of contents of periodicals","authors":"Frank Lebourgeois, H. Emptoz, S. Souafi-Bensafi","doi":"10.1109/ICDAR.2001.953841","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953841","url":null,"abstract":"This paper describes a statistical model for a document understanding system, which uses both text attributes and document layouts. Probabilistic relaxation is used as a recognition scheme to find the hierarchical structure of the logical layout. This approach, commonly used for pixels classification in image analysis, can be applied to classify text blocks into logical classes according to local compatibility with other neighboring blocks at different hierarchical levels. It provides a logical layout that is globally compatible with the training model. We have tested this approach on reading tables of contents of periodicals for documents indexing. Probabilistic relaxation has interesting properties like high-speed training and the 'a priori' recognition rate, which provides the consistency of the model according to the features used, and the samples chosen among the training set.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"22 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125699925","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 26
Off-line hand-written character recognition using integrated 1D HMMs based on feature extraction filters 基于特征提取滤波器的集成一维hmm离线手写字符识别
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953824
Hiromitsu Nishimura, Masayoshi Tsutsumi, M. Maruyama, H. Miyao, Y. Nakano
The purpose of our research is to improve the recognition rate of an off-line handwritten character recognition system using HMM (hidden Markov model), so that we can use the system for practical application. Due to the insufficient recognition rate of ID HMM character recognition systems and the requirement for a huge number of learning samples to construct 2D HMM character recognition systems, HMM-based character recognition systems have not yet achieved sufficient recognition performance for practical use. In this research, we propose the character recognition method that integrates 4 simply structured 1D HMMs all of which are based on feature extraction using linear filters. The results of our evaluation experiment using the Hand-Printed Character Database (ETL6) showed that the first rank recognition rate of the test samples was 98.5% and that the cumulative recognition rate of top 3 candidates was 99.3%. Although our method is relatively easy to implement, it can work even better than 2D HMM method. These results show the proposed method is very effective.
我们的研究目的是为了提高隐马尔可夫模型(HMM)离线手写字符识别系统的识别率,以便我们将该系统用于实际应用。由于ID HMM字符识别系统的识别率不足,并且需要大量的学习样本来构建二维HMM字符识别系统,基于HMM的字符识别系统还没有达到足够的实际应用的识别性能。在本研究中,我们提出了一种基于线性滤波器特征提取的4种简单结构的一维hmm的字符识别方法。我们使用手印字符库(ETL6)进行的评价实验结果表明,测试样本的第一阶识别率为98.5%,前3名候选词的累计识别率为99.3%。虽然我们的方法相对容易实现,但它可以比2D HMM方法更好地工作。结果表明,该方法是非常有效的。
{"title":"Off-line hand-written character recognition using integrated 1D HMMs based on feature extraction filters","authors":"Hiromitsu Nishimura, Masayoshi Tsutsumi, M. Maruyama, H. Miyao, Y. Nakano","doi":"10.1109/ICDAR.2001.953824","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953824","url":null,"abstract":"The purpose of our research is to improve the recognition rate of an off-line handwritten character recognition system using HMM (hidden Markov model), so that we can use the system for practical application. Due to the insufficient recognition rate of ID HMM character recognition systems and the requirement for a huge number of learning samples to construct 2D HMM character recognition systems, HMM-based character recognition systems have not yet achieved sufficient recognition performance for practical use. In this research, we propose the character recognition method that integrates 4 simply structured 1D HMMs all of which are based on feature extraction using linear filters. The results of our evaluation experiment using the Hand-Printed Character Database (ETL6) showed that the first rank recognition rate of the test samples was 98.5% and that the cumulative recognition rate of top 3 candidates was 99.3%. Although our method is relatively easy to implement, it can work even better than 2D HMM method. These results show the proposed method is very effective.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"123559654","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 8
A multi-net local learning framework for pattern recognition 模式识别的多网络局部学习框架
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953808
Jian-xiong Dong, A. Krzyżak, C. Suen
This paper proposes a general local learning framework to effectively alleviate the complexities of classifier design by means of "divide and conquer" principle and ensemble method. The learning framework consists of quantization layer and ensemble layer. After GLVQ and MLP are applied to the framework, the proposed method is tested on MNIST handwritten digit database. The obtained performance is very promising, an error rate with 0.99%, which is comparable to that of LeNet5, one of the best classifiers on this database. Further, in contrast to LeNet5, our method is especially suitable for a large-scale real-world classification problem.
本文提出了一种通用的局部学习框架,通过“分而治之”原则和集成方法有效地缓解了分类器设计的复杂性。学习框架由量化层和集成层组成。将GLVQ和MLP应用于该框架后,在MNIST手写体数字数据库上进行了测试。获得的性能非常有希望,错误率为0.99%,与该数据库中最好的分类器之一LeNet5相当。此外,与LeNet5相比,我们的方法特别适合于大规模的现实世界分类问题。
{"title":"A multi-net local learning framework for pattern recognition","authors":"Jian-xiong Dong, A. Krzyżak, C. Suen","doi":"10.1109/ICDAR.2001.953808","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953808","url":null,"abstract":"This paper proposes a general local learning framework to effectively alleviate the complexities of classifier design by means of \"divide and conquer\" principle and ensemble method. The learning framework consists of quantization layer and ensemble layer. After GLVQ and MLP are applied to the framework, the proposed method is tested on MNIST handwritten digit database. The obtained performance is very promising, an error rate with 0.99%, which is comparable to that of LeNet5, one of the best classifiers on this database. Further, in contrast to LeNet5, our method is especially suitable for a large-scale real-world classification problem.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"49 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125092260","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 13
Baseline structure analysis of handwritten mathematics notation 手写数学符号的基线结构分析
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953892
R. Zanibbi, D. Blostein, J. Cordy
The structure of mathematics notation is particularly difficult to recognize in handwritten notation because irregular symbol placements are common. We present an efficient and robust method of parsing handwritten and typeset mathematics notation without backtracking. The system is designed to be easily adaptable to various dialects of mathematics notation. The following strategies are used: (1) separate the analysis of layout, syntax, and semantics, (2) recursively apply search functions and image partitioning to recognize dominant and nested baselines, and (3) use tree transformations to express computations in a compact, efficiently executable form.
数学符号的结构在手写符号中特别难以识别,因为不规则的符号位置是常见的。我们提出了一种有效的、鲁棒的解析手写和排版数学符号的方法。该系统被设计成易于适应各种数学符号的方言。使用以下策略:(1)将布局、语法和语义的分析分开;(2)递归地应用搜索函数和图像分区来识别主导基线和嵌套基线;(3)使用树转换以紧凑、高效的可执行形式表示计算。
{"title":"Baseline structure analysis of handwritten mathematics notation","authors":"R. Zanibbi, D. Blostein, J. Cordy","doi":"10.1109/ICDAR.2001.953892","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953892","url":null,"abstract":"The structure of mathematics notation is particularly difficult to recognize in handwritten notation because irregular symbol placements are common. We present an efficient and robust method of parsing handwritten and typeset mathematics notation without backtracking. The system is designed to be easily adaptable to various dialects of mathematics notation. The following strategies are used: (1) separate the analysis of layout, syntax, and semantics, (2) recursively apply search functions and image partitioning to recognize dominant and nested baselines, and (3) use tree transformations to express computations in a compact, efficiently executable form.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"50 5 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127086201","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 25
Improvement of video text recognition by character selection 基于字符选择的视频文本识别改进
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953954
T. Mita, O. Hori
This paper proposes a new method for improving the recognition accuracy of video text by exploiting the temporal redundancy of video. The proposed method divides the video into short segments and obtains several recognition results from some video segments. The video segments have various backgrounds because background image changes temporally due to camera-work or object motion. These recognition results from diverse backgrounds are integrated into a single text string after selecting the best recognition results of individual characters. The proposed method was tested on a large set of news video sequences. Experimental results show that the proposed method increased the number of correct characters by 3.1% and the number of strings which do not include any recognition errors by 8.1%.
本文提出了一种利用视频的时间冗余来提高视频文本识别精度的新方法。该方法将视频分割成多个短视频片段,并从多个视频片段中获得多个识别结果。视频片段具有不同的背景,因为背景图像由于摄像机工作或物体运动而暂时变化。这些来自不同背景的识别结果在选择单个字符的最佳识别结果后整合到一个单一的文本字符串中。在大量新闻视频序列上对该方法进行了测试。实验结果表明,该方法将正确字符数提高了3.1%,将不包含任何识别错误的字符串数提高了8.1%。
{"title":"Improvement of video text recognition by character selection","authors":"T. Mita, O. Hori","doi":"10.1109/ICDAR.2001.953954","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953954","url":null,"abstract":"This paper proposes a new method for improving the recognition accuracy of video text by exploiting the temporal redundancy of video. The proposed method divides the video into short segments and obtains several recognition results from some video segments. The video segments have various backgrounds because background image changes temporally due to camera-work or object motion. These recognition results from diverse backgrounds are integrated into a single text string after selecting the best recognition results of individual characters. The proposed method was tested on a large set of news video sequences. Experimental results show that the proposed method increased the number of correct characters by 3.1% and the number of strings which do not include any recognition errors by 8.1%.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130449992","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 21
Line detection and segmentation in historical church registers 历史教会寄存器中的行检测与分割
Pub Date : 2001-09-10 DOI: 10.1109/ICDAR.2001.953888
Markus Feldbach, Klaus D. Tönnies
For being able to automatically acquire the information recorded in church registers and other historical scriptures, the writing on these documents has to be recognized. This paper describes algorithms for transforming the paper documents into a representation of text apt to be used as input for an automatic text recognizer. The automatic recognition of old handwritten scriptures is difficult for two main reasons. Lines of text in general are not straight and ascenders and descenders of adjacent lines interfere. The algorithms described in this paper provide ways to reconstruct the path of the lines of text using an approach of gradually constructing line segments until a unique line of text is formed. In addition, the single lines are segmented and an output in form of a raster image is provided. The method was applied to church registers. They were written between the 17th and 19th Century. Line segmentation was found to be successful in 97% of all samples.
为了能够自动获取教会登记册和其他历史经文中记录的信息,必须识别这些文件上的文字。本文描述了一种将纸质文档转换为易于作为自动文本识别器输入的文本表示的算法。旧手写经文的自动识别困难主要有两个原因。文本行通常不是直的,相邻行的上下线相互干扰。本文描述的算法提供了使用逐步构建线段的方法来重建文本行路径的方法,直到形成唯一的文本行。此外,对单线进行分割并提供光栅图像形式的输出。该方法被应用于教堂登记。它们写于17至19世纪之间。在97%的样本中,线分割被发现是成功的。
{"title":"Line detection and segmentation in historical church registers","authors":"Markus Feldbach, Klaus D. Tönnies","doi":"10.1109/ICDAR.2001.953888","DOIUrl":"https://doi.org/10.1109/ICDAR.2001.953888","url":null,"abstract":"For being able to automatically acquire the information recorded in church registers and other historical scriptures, the writing on these documents has to be recognized. This paper describes algorithms for transforming the paper documents into a representation of text apt to be used as input for an automatic text recognizer. The automatic recognition of old handwritten scriptures is difficult for two main reasons. Lines of text in general are not straight and ascenders and descenders of adjacent lines interfere. The algorithms described in this paper provide ways to reconstruct the path of the lines of text using an approach of gradually constructing line segments until a unique line of text is formed. In addition, the single lines are segmented and an output in form of a raster image is provided. The method was applied to church registers. They were written between the 17th and 19th Century. Line segmentation was found to be successful in 97% of all samples.","PeriodicalId":277816,"journal":{"name":"Proceedings of Sixth International Conference on Document Analysis and Recognition","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2001-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129862826","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 71
期刊
Proceedings of Sixth International Conference on Document Analysis and Recognition
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1