首页 > 最新文献

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.最新文献

英文 中文
Automated segmentation of math-zones from document images 从文档图像中自动分割数学区域
S. Chowdhury, Sekhar Mandal, A. Das, B. Chanda
With an aim to high-level understanding of the mathematicalcontents in a document image the requirement ofmath-zone extraction and recognition technique is obvious.In this paper we present fully auotmatic segmentation ofdisplayed-math zones from the document image, using onlythe spatial layout information of math-formulas and equations,so as to help commercial OCR systems which cannotdiscern math-zones and also for the identification and arrangementof math symbols by others.
为了对文档图像中的数学内容进行高层次的理解,对数学区提取和识别技术的需求是显而易见的。本文提出了一种仅利用数学公式和方程的空间布局信息对文档图像中显示的数学区域进行全自动分割的方法,以帮助商用OCR系统识别数学区域,同时也为其他OCR系统对数学符号的识别和排列提供帮助。
{"title":"Automated segmentation of math-zones from document images","authors":"S. Chowdhury, Sekhar Mandal, A. Das, B. Chanda","doi":"10.1109/ICDAR.2003.1227763","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227763","url":null,"abstract":"With an aim to high-level understanding of the mathematicalcontents in a document image the requirement ofmath-zone extraction and recognition technique is obvious.In this paper we present fully auotmatic segmentation ofdisplayed-math zones from the document image, using onlythe spatial layout information of math-formulas and equations,so as to help commercial OCR systems which cannotdiscern math-zones and also for the identification and arrangementof math symbols by others.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"27 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131304825","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 34
Best practices for convolutional neural networks applied to visual document analysis 卷积神经网络应用于可视化文档分析的最佳实践
P. Simard, David Steinkraus, John C. Platt
Neural networks are a powerful technology forclassification of visual inputs arising from documents.However, there is a confusing plethora of different neuralnetwork methods that are used in the literature and inindustry. This paper describes a set of concrete bestpractices that document analysis researchers can use toget good results with neural networks. The mostimportant practice is getting a training set as large aspossible: we expand the training set by adding a newform of distorted data. The next most important practiceis that convolutional neural networks are better suited forvisual document tasks than fully connected networks. Wepropose that a simple "do-it-yourself" implementation ofconvolution with a flexible architecture is suitable formany visual document problems. This simpleconvolutional neural network does not require complexmethods, such as momentum, weight decay, structure-dependentlearning rates, averaging layers, tangent prop,or even finely-tuning the architecture. The end result is avery simple yet general architecture which can yieldstate-of-the-art performance for document analysis. Weillustrate our claims on the MNIST set of English digitimages.
神经网络是一种强大的技术,用于分类来自文档的视觉输入。然而,在文献和工业中使用的不同的神经网络方法令人困惑。本文描述了一组具体的最佳实践,文件分析研究人员可以使用神经网络获得良好的结果。最重要的实践是获得尽可能大的训练集:我们通过添加新形式的扭曲数据来扩展训练集。下一个最重要的实践是,卷积神经网络比完全连接的网络更适合于视觉文档任务。我们提出一个简单的“自己动手”的卷积实现,具有灵活的架构,适用于许多可视化文档问题。这个简单的卷积神经网络不需要复杂的方法,比如动量、权重衰减、结构相关学习率、平均层、切线支撑,甚至微调架构。最终的结果是非常简单而通用的架构,可以为文档分析提供最先进的性能。我们用MNIST的英语数字图像集来说明我们的主张。
{"title":"Best practices for convolutional neural networks applied to visual document analysis","authors":"P. Simard, David Steinkraus, John C. Platt","doi":"10.1109/ICDAR.2003.1227801","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227801","url":null,"abstract":"Neural networks are a powerful technology forclassification of visual inputs arising from documents.However, there is a confusing plethora of different neuralnetwork methods that are used in the literature and inindustry. This paper describes a set of concrete bestpractices that document analysis researchers can use toget good results with neural networks. The mostimportant practice is getting a training set as large aspossible: we expand the training set by adding a newform of distorted data. The next most important practiceis that convolutional neural networks are better suited forvisual document tasks than fully connected networks. Wepropose that a simple \"do-it-yourself\" implementation ofconvolution with a flexible architecture is suitable formany visual document problems. This simpleconvolutional neural network does not require complexmethods, such as momentum, weight decay, structure-dependentlearning rates, averaging layers, tangent prop,or even finely-tuning the architecture. The end result is avery simple yet general architecture which can yieldstate-of-the-art performance for document analysis. Weillustrate our claims on the MNIST set of English digitimages.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"4 6","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131505750","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 2755
Unsupervised feature selection using multi-objective genetic algorithms for handwritten word recognition 基于多目标遗传算法的无监督特征选择手写体单词识别
M. Morita, R. Sabourin, F. Bortolozzi, C. Y. Suen
In this paper a methodology for feature selection in unsupervisedlearning is proposed. It makes use of a multi-objectivegenetic algorithm where the minimization of thenumber of features and a validity index that measures thequality of clusters have been used to guide the search towardsthe more discriminant features and the best numberof clusters. The proposed strategy is evaluated usingtwo synthetic data sets and then it is applied to handwrittenmonth word recognition. Comprehensive experimentsdemonstrate the feasibility and efficiency of the proposedmethodology.
本文提出了一种无监督学习中特征选择的方法。它利用多目标遗传算法,其中特征数量的最小化和衡量聚类质量的有效性指标被用来指导搜索更有区别的特征和最佳数量的聚类。使用两个合成数据集对所提出的策略进行了评估,然后将其应用于手写月词识别。综合实验证明了该方法的可行性和有效性。
{"title":"Unsupervised feature selection using multi-objective genetic algorithms for handwritten word recognition","authors":"M. Morita, R. Sabourin, F. Bortolozzi, C. Y. Suen","doi":"10.1109/ICDAR.2003.1227746","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227746","url":null,"abstract":"In this paper a methodology for feature selection in unsupervisedlearning is proposed. It makes use of a multi-objectivegenetic algorithm where the minimization of thenumber of features and a validity index that measures thequality of clusters have been used to guide the search towardsthe more discriminant features and the best numberof clusters. The proposed strategy is evaluated usingtwo synthetic data sets and then it is applied to handwrittenmonth word recognition. Comprehensive experimentsdemonstrate the feasibility and efficiency of the proposedmethodology.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130088080","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 5
Recognition of on-line handwritten mathematical formulas in the E-chalk system 电子粉笔系统中在线手写数学公式的识别
E. Tapia, R. Rojas
In this article, we present a system for the recognition ofon-line handwritten mathematical formulas which is usedin the electronic chalkboard (E-chalk), a multimedia systemfor distance-teaching. We discuss the classification of symbolsand the construction of the tree of spatial relationshipsamong them. The classification is based on support vectormachines and the construction of formulas is based onbaseline structure analysis.
本文介绍了一种用于远程教学的多媒体系统——电子黑板(E-chalk)的在线手写数学公式识别系统。讨论了符号的分类和符号间空间关系树的构造。分类基于支持向量机,公式构建基于基线结构分析。
{"title":"Recognition of on-line handwritten mathematical formulas in the E-chalk system","authors":"E. Tapia, R. Rojas","doi":"10.1109/ICDAR.2003.1227805","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227805","url":null,"abstract":"In this article, we present a system for the recognition ofon-line handwritten mathematical formulas which is usedin the electronic chalkboard (E-chalk), a multimedia systemfor distance-teaching. We discuss the classification of symbolsand the construction of the tree of spatial relationshipsamong them. The classification is based on support vectormachines and the construction of formulas is based onbaseline structure analysis.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"32 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"116525650","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 98
Reference line extraction from form documents with complicated backgrounds 从复杂背景的表单文档中提取参考线
Dihua Xi, Seong-Whan Lee
Form document analysis is one of the most essential tasksin document analysis and recognition. One of the most fundamentaland crucial tasks is the extraction of the referencelines which are contained in almost all form documents.This paper presents an efficient methodology for the complicatedgrey-level form image processing. We construct anon-orthogonal wavelet with adjustable rectangle supportsand offer algorithms for the extraction of the reference linesbased on the strip growth method using the multiresolutionwavelet sub images. We have compared this system with thepopular Hough transform (HT) based and the novel orthogonalwavelet based methods. As shown in the experiments,the proposed algorithmdemonstrates high performance andfast speed for the complicated form images. This system isalso effective for the form images with slight skew.
格式文件分析是文件分析与识别中最重要的任务之一。最基本和最关键的任务之一是提取几乎所有表单文件中包含的参考资料。本文提出了一种处理复杂灰度形式图像的有效方法。我们构造了具有可调矩形支撑的非正交小波,并利用多分辨率小波子图像提出了基于条带生长法提取参考线的算法。我们将该系统与流行的基于霍夫变换(HT)和新的基于正交小波的方法进行了比较。实验结果表明,该算法在处理复杂形状图像时具有较高的性能和速度。该系统对于有轻微倾斜的表单图像也很有效。
{"title":"Reference line extraction from form documents with complicated backgrounds","authors":"Dihua Xi, Seong-Whan Lee","doi":"10.1109/ICDAR.2003.1227823","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227823","url":null,"abstract":"Form document analysis is one of the most essential tasksin document analysis and recognition. One of the most fundamentaland crucial tasks is the extraction of the referencelines which are contained in almost all form documents.This paper presents an efficient methodology for the complicatedgrey-level form image processing. We construct anon-orthogonal wavelet with adjustable rectangle supportsand offer algorithms for the extraction of the reference linesbased on the strip growth method using the multiresolutionwavelet sub images. We have compared this system with thepopular Hough transform (HT) based and the novel orthogonalwavelet based methods. As shown in the experiments,the proposed algorithmdemonstrates high performance andfast speed for the complicated form images. This system isalso effective for the form images with slight skew.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"4 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127173645","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Combination of pruned Kohonen maps for on-line arabic characters recognition 联机阿拉伯字符识别的组合修剪Kohonen地图
N. Mezghani, M. Cheriet, A. Mitiche
The purpose of this study is to investigate a methodfor high performance on-line Arabic characters recognition.This method is based on the use of Kohonen mapsand their corresponding confusion matrices which serve toprune them of error-causing nodes, and to combine themconsequently. We use two Kohonen maps obtained usingtwo distinct character representations, namely, Fourier descriptorsand tangents extracted along the characters on-linesignals. The two Kohonen maps are then combined usinga majority vote decision rule that, for each character,favors the most reliable map. This combination, withoutadding in any significant way to the process complexity, affordsa much better recognition rate.
本研究的目的是探讨一种高效的在线阿拉伯文字符识别方法。该方法基于Kohonen映射及其相应的混淆矩阵的使用,这些混淆矩阵用于标记导致错误的节点,并将它们组合在一起。我们使用两种不同的字符表示获得的两个Kohonen映射,即傅里叶描述子和沿字符在线信号提取的切线。然后使用多数投票决策规则将两张Kohonen地图组合在一起,对于每个角色来说,这有利于最可靠的地图。这种组合在不显著增加过程复杂性的情况下,提供了更好的识别率。
{"title":"Combination of pruned Kohonen maps for on-line arabic characters recognition","authors":"N. Mezghani, M. Cheriet, A. Mitiche","doi":"10.1109/ICDAR.2003.1227790","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227790","url":null,"abstract":"The purpose of this study is to investigate a methodfor high performance on-line Arabic characters recognition.This method is based on the use of Kohonen mapsand their corresponding confusion matrices which serve toprune them of error-causing nodes, and to combine themconsequently. We use two Kohonen maps obtained usingtwo distinct character representations, namely, Fourier descriptorsand tangents extracted along the characters on-linesignals. The two Kohonen maps are then combined usinga majority vote decision rule that, for each character,favors the most reliable map. This combination, withoutadding in any significant way to the process complexity, affordsa much better recognition rate.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"127763630","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 36
Document identity, authentication and ownership: the future of biometric verification 文件身份、认证和所有权:生物识别验证的未来
M. Fairhurst
Document security is an increasingly importantelement in the multi-faceted discipline ofdocument processing, and authentication ofindividual identity will play an increasinglyimportant future role in relation to questions ofdocument ownership, identity andconfidentiality. Biometrics-based techniques areemerging as key elements in the drive to addresssecurity and confidentiality in an effective way,yet past experience suggests that there are manypractical issues yet to be resolved if biometrictechnologies are to fulfill their potential in thedocument processing field. This paper addressessome aspects of biometric processing which arebecoming increasing priorities, and suggestshow a greater engagement of the documentprocessing community can help to bring aboutrefinements to existing approaches to biometricidentity checking.
在文件处理的多方面学科中,文件安全是一个越来越重要的因素,个人身份认证将在文件所有权、身份和机密性问题中发挥越来越重要的作用。基于生物特征的技术正在成为有效解决安全和保密性问题的关键因素,但过去的经验表明,如果生物特征技术要在文件处理领域发挥其潜力,还有许多实际问题有待解决。本文讨论了生物识别处理的一些方面,这些方面正变得越来越重要,并建议文件处理社区的更大参与可以帮助改进现有的生物识别身份检查方法。
{"title":"Document identity, authentication and ownership: the future of biometric verification","authors":"M. Fairhurst","doi":"10.1109/ICDAR.2003.1227828","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227828","url":null,"abstract":"Document security is an increasingly importantelement in the multi-faceted discipline ofdocument processing, and authentication ofindividual identity will play an increasinglyimportant future role in relation to questions ofdocument ownership, identity andconfidentiality. Biometrics-based techniques areemerging as key elements in the drive to addresssecurity and confidentiality in an effective way,yet past experience suggests that there are manypractical issues yet to be resolved if biometrictechnologies are to fulfill their potential in thedocument processing field. This paper addressessome aspects of biometric processing which arebecoming increasing priorities, and suggestshow a greater engagement of the documentprocessing community can help to bring aboutrefinements to existing approaches to biometricidentity checking.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"15 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"133550053","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 14
Progress in camera-based document image analysis 基于摄像机的文档图像分析研究进展
D. Doermann, Jian Liang, Huiping Li
The increasing availability of high performance, low priced, portable digital imaging devices has created a tremendous opportunity for supplementing traditional scanning for document image acquisition. Digital cameras attached to cellular phones, PDAs, or as standalone still or video devices are highly mobile and easy to use; they can capture images of any kind of document including very thick books, historical pages too fragile to touch, and text in scenes; and they are much more versatile than desktop scanners. Should robust solutions to the analysis of documents captured with such devices become available, there is clearly a demand from many domains. Traditional scanner-based document analysis techniques provide us with a good reference and starting point, but they cannot be used directly on camera-captured images. Camera captured images can suffer from low resolution, blur, and perspective distortion, as well as complex layout and interaction of the content and background. In this paper we present a survey of application domains, technical challenges and solutions for recognizing documents captured by digital cameras. We begin by describing typical imaging devices and the imaging process. We discuss document analysis from a single camera-captured image as well as multiple frames and highlight some sample applications under development and feasible ideas for future development.
高性能、低价、便携式数字成像设备的日益普及,为补充传统的文档图像采集扫描创造了巨大的机会。数码相机连接到移动电话,pda,或作为独立的静态或视频设备是高度移动和易于使用;它们可以捕捉任何类型文件的图像,包括非常厚的书、太脆弱而不能触摸的历史页面和场景中的文本;而且它们比桌面扫描器更通用。如果对用这些设备捕获的文档进行分析的健壮的解决方案可用,那么显然会有来自许多领域的需求。传统的基于扫描仪的文档分析技术为我们提供了一个很好的参考和起点,但它们不能直接用于相机捕获的图像。相机拍摄的图像可能会出现低分辨率、模糊和视角失真,以及内容和背景的复杂布局和交互。在本文中,我们提出了应用领域的调查,技术挑战和解决方案,以识别由数码相机捕获的文件。我们首先描述典型的成像设备和成像过程。我们讨论了单帧和多帧的文档分析,并重点介绍了一些正在开发的示例应用和未来发展的可行思路。
{"title":"Progress in camera-based document image analysis","authors":"D. Doermann, Jian Liang, Huiping Li","doi":"10.1109/ICDAR.2003.1227735","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227735","url":null,"abstract":"The increasing availability of high performance, low priced, portable digital imaging devices has created a tremendous opportunity for supplementing traditional scanning for document image acquisition. Digital cameras attached to cellular phones, PDAs, or as standalone still or video devices are highly mobile and easy to use; they can capture images of any kind of document including very thick books, historical pages too fragile to touch, and text in scenes; and they are much more versatile than desktop scanners. Should robust solutions to the analysis of documents captured with such devices become available, there is clearly a demand from many domains. Traditional scanner-based document analysis techniques provide us with a good reference and starting point, but they cannot be used directly on camera-captured images. Camera captured images can suffer from low resolution, blur, and perspective distortion, as well as complex layout and interaction of the content and background. In this paper we present a survey of application domains, technical challenges and solutions for recognizing documents captured by digital cameras. We begin by describing typical imaging devices and the imaging process. We discuss document analysis from a single camera-captured image as well as multiple frames and highlight some sample applications under development and feasible ideas for future development.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131056141","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 222
Confidence evaluation for combining diverse classifiers 多分类器组合置信度评价
Hongwei Hao, Cheng-Lin Liu, H. Sako
For combining classifiers at measurement level, thediverse outputs of classifiers should be transformed touniform measures that represent the confidence ofdecision, hopefully, the class probability or likelihood.This paper presents our experimental results of classifiercombination using confidence evaluation. We test threetypes of confidences: log-likelihood, exponential andsigmoid. For re-scaling the classifier outputs, we usethree scaling functions based on global normalizationand Gaussian density estimation. Experimental results inhandwritten digit recognition show that via confidenceevaluation, superior classification performance can beobtained using simple combination rules.
为了在度量水平上组合分类器,分类器的不同输出应该转换为代表决策置信度的统一度量,希望是类概率或似然。本文给出了基于置信度评价的分类器组合的实验结果。我们测试了三种类型的信心:对数似然,指数和s型。为了重新缩放分类器输出,我们使用了基于全局归一化和高斯密度估计的三个缩放函数。手写体数字识别的实验结果表明,通过置信度评价,使用简单的组合规则可以获得较好的分类性能。
{"title":"Confidence evaluation for combining diverse classifiers","authors":"Hongwei Hao, Cheng-Lin Liu, H. Sako","doi":"10.1109/ICDAR.2003.1227764","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227764","url":null,"abstract":"For combining classifiers at measurement level, thediverse outputs of classifiers should be transformed touniform measures that represent the confidence ofdecision, hopefully, the class probability or likelihood.This paper presents our experimental results of classifiercombination using confidence evaluation. We test threetypes of confidences: log-likelihood, exponential andsigmoid. For re-scaling the classifier outputs, we usethree scaling functions based on global normalizationand Gaussian density estimation. Experimental results inhandwritten digit recognition show that via confidenceevaluation, superior classification performance can beobtained using simple combination rules.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"12 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"131160839","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
Directional wavelet approach to remove document image interference 方向小波法去除文档图像干扰
Qian Wang, Tao Xia, C. Tan, Lida Li
In this paper, we propose a directional wavelet approachto remove images of interfering strokes coming from theback of a historical handwritten document due to seepingof ink during long period of storage. Our previous workrequired mapping of both sides of the document in orderto identify the interfering strokes to be eliminated. Perfectmapping, however, is difficult due to document skews,differing resolutions, non-availability of the reverseside and warped pages during scanning. The newapproach does not require double-sided mapping butinstead uses a directional wavelet transformto distinguish the foreground and reverse side strokes.Experiments have shown that the directional waveletoperation effectively removes the interfering strokes.
在本文中,我们提出了一种定向小波方法来去除由于长时间存储期间墨水渗出而来自历史手写文件背面的干扰笔画图像。我们之前的工作需要对文档的两面进行映射,以便识别要消除的干扰笔画。然而,由于文档倾斜、不同的分辨率、不可用的反面和扫描时扭曲的页面,完美的映射是困难的。新方法不需要双面映射,而是使用方向小波变换来区分前景和反面笔画。实验表明,定向小波运算能有效地去除干扰笔画。
{"title":"Directional wavelet approach to remove document image interference","authors":"Qian Wang, Tao Xia, C. Tan, Lida Li","doi":"10.1109/ICDAR.2003.1227759","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227759","url":null,"abstract":"In this paper, we propose a directional wavelet approachto remove images of interfering strokes coming from theback of a historical handwritten document due to seepingof ink during long period of storage. Our previous workrequired mapping of both sides of the document in orderto identify the interfering strokes to be eliminated. Perfectmapping, however, is difficult due to document skews,differing resolutions, non-availability of the reverseside and warped pages during scanning. The newapproach does not require double-sided mapping butinstead uses a directional wavelet transformto distinguish the foreground and reverse side strokes.Experiments have shown that the directional waveletoperation effectively removes the interfering strokes.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"79 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132558670","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 17
期刊
Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1