首页 > 最新文献

Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.最新文献

英文 中文
Indexing and retrieval of on-line handwritten documents 联机手写文件的索引和检索
Anil K. Jain, A. Namboodiri
Recent advances in on-line data capturing technologiesand its widespread deployment in devices like PDAsand notebook PCs is creating large amounts of handwrittendata that need to be archived and retrieved efficiently.Word-spotting, which is based on a direct comparison ofa handwritten keyword to words in the document, is commonlyused for indexing and retrieval. We propose a stringmatching-based method for word-spotting in on-line documents.The retrieval algorithm achieves a precision of92.3% at a recall rate of 90% on a database of 6,672 wordswritten by 10 different writers. Indexing experiments showan accuracy of 87.5% using a database of 3,872 on-linewords.
在线数据捕获技术的最新进展及其在pda和笔记本电脑等设备上的广泛应用正在产生大量的手写数据,这些数据需要有效地存档和检索。单词定位是基于对文档中手写关键字与单词的直接比较,通常用于索引和检索。我们提出了一种基于字符串匹配的在线文档单词识别方法。该检索算法在10位作者所写的6672个单词的数据库中,准确率达到92.3%,查全率达到90%。索引实验表明,使用3872个在线词的数据库,准确率为87.5%。
{"title":"Indexing and retrieval of on-line handwritten documents","authors":"Anil K. Jain, A. Namboodiri","doi":"10.1109/ICDAR.2003.1227743","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227743","url":null,"abstract":"Recent advances in on-line data capturing technologiesand its widespread deployment in devices like PDAsand notebook PCs is creating large amounts of handwrittendata that need to be archived and retrieved efficiently.Word-spotting, which is based on a direct comparison ofa handwritten keyword to words in the document, is commonlyused for indexing and retrieval. We propose a stringmatching-based method for word-spotting in on-line documents.The retrieval algorithm achieves a precision of92.3% at a recall rate of 90% on a database of 6,672 wordswritten by 10 different writers. Indexing experiments showan accuracy of 87.5% using a database of 3,872 on-linewords.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"142 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"125766997","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 62
Using irregular pyramid for text segmentation and binarization of gray scale images 利用不规则金字塔对灰度图像进行文本分割和二值化
Poh Kok Loo, C. Tan
Compared to binary images that most text extraction methods work on, gray scale images provide much more information for the extraction task. On the other hand complication also arises in determining the subject textual content from its background region (i.e. thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images are binarized before the actual text extraction, this paper proposes a new method by first segmenting individual subject areas with the help of an irregular pyramid to be followed by the binarization process. This permits the focus of attention only on the appropriate subject areas for the binarization process before text recognition. Our method overcomes the difficulty in global binarization to find a single value to fit all. It also avoids the common problem in most local thresholding technique of finding a suitable window size. As shown in our experimented result, our method performed well in both text segmentation and binarization by varying the sequence of processing.
与大多数文本提取方法所处理的二值图像相比,灰度图像为提取任务提供了更多的信息。另一方面,在实际文本提取过程开始之前,从其背景区域确定主题文本内容(即阈值化)也会产生复杂性。与通常在实际文本提取之前对文档图像进行二值化的处理顺序不同,本文提出了一种新的方法,首先利用不规则金字塔对单个主题区域进行分割,然后进行二值化处理。这允许将注意力集中在文本识别之前的二值化过程的适当主题领域上。我们的方法克服了全局二值化中难以找到一个值来拟合所有值的困难。它还避免了大多数局部阈值技术中常见的寻找合适窗口大小的问题。实验结果表明,通过改变处理顺序,我们的方法在文本分割和二值化方面都表现良好。
{"title":"Using irregular pyramid for text segmentation and binarization of gray scale images","authors":"Poh Kok Loo, C. Tan","doi":"10.1109/ICDAR.2003.1227733","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227733","url":null,"abstract":"Compared to binary images that most text extraction methods work on, gray scale images provide much more information for the extraction task. On the other hand complication also arises in determining the subject textual content from its background region (i.e. thresholding) before the actual text extraction process can begin. Differing from the usual sequence of processes where document images are binarized before the actual text extraction, this paper proposes a new method by first segmenting individual subject areas with the help of an irregular pyramid to be followed by the binarization process. This permits the focus of attention only on the appropriate subject areas for the binarization process before text recognition. Our method overcomes the difficulty in global binarization to find a single value to fit all. It also avoids the common problem in most local thresholding technique of finding a suitable window size. As shown in our experimented result, our method performed well in both text segmentation and binarization by varying the sequence of processing.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"67 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"124846021","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Handwritten word recognition based on structural characteristics and lexical support 基于结构特征和词汇支持的手写体单词识别
E. Kavallieratou, K. Sgarbas, N. Fakotakis, G. Kokkinakis
In this paper a handwritten recognition algorithm based on structural characteristics, histograms and profiles, is presented. The well-known horizontal and vertical histograms are used, in combination with the newly introduced radial histogram, out-in radial and in-out radial profiles for representing 32 /spl times/ 32 matrices of characters, as 280-dimension vectors. The recognition process has been supported by a lexical component based on dynamic acyclic FSAs (Finite-State-Automata).
提出了一种基于结构特征、直方图和轮廓的手写体识别算法。众所周知的水平直方图和垂直直方图与新引入的径向直方图相结合,用于表示32 /spl乘以/ 32个字符矩阵的外向径向和外向径向轮廓,作为280维向量。识别过程由基于动态无循环有限状态自动机的词法组件支持。
{"title":"Handwritten word recognition based on structural characteristics and lexical support","authors":"E. Kavallieratou, K. Sgarbas, N. Fakotakis, G. Kokkinakis","doi":"10.1109/ICDAR.2003.1227727","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227727","url":null,"abstract":"In this paper a handwritten recognition algorithm based on structural characteristics, histograms and profiles, is presented. The well-known horizontal and vertical histograms are used, in combination with the newly introduced radial histogram, out-in radial and in-out radial profiles for representing 32 /spl times/ 32 matrices of characters, as 280-dimension vectors. The recognition process has been supported by a lexical component based on dynamic acyclic FSAs (Finite-State-Automata).","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"8 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128471949","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 31
Methods, reports and survey for the comparison of diverse isolated character recognition results on the UNIPEN database UNIPEN数据库中不同孤立字符识别结果比较的方法、报告和调查
E. Ratzlaff
A framework of data organization methods and corresponding recognition results for UNIPEN databases is presented to enable the comparison of recognition results from different isolated character recognizers. A reproducible method for splitting the Train-R01/V07 data into an array of multi-writer and omni-writer training and testing pairs is proposed. Recognition results and uncertainties are provided for each pair, as well as results for the DevTest-R01/V02 character subsets, using an online scanning n-tuple recognizer. Several other published results are surveyed within this context. In sum, this report provides the reader multiple points of reference useful for comparing a number of published recognition results and a proposed framework that similarly allows private evaluation of unpublished recognition results.
为了比较不同孤立字符识别器的识别结果,提出了UNIPEN数据库的数据组织方法框架和相应的识别结果。提出了一种将Train-R01/V07数据分割成多写器和全写器训练和测试对阵列的可重复方法。使用在线扫描n元组识别器,提供了每个对的识别结果和不确定度,以及DevTest-R01/V02字符子集的结果。在此背景下调查了其他几个已发表的结果。总而言之,本报告为读者提供了多个参考点,可用于比较许多已发布的识别结果,并提出了一个框架,该框架同样允许对未发布的识别结果进行私人评估。
{"title":"Methods, reports and survey for the comparison of diverse isolated character recognition results on the UNIPEN database","authors":"E. Ratzlaff","doi":"10.1109/ICDAR.2003.1227737","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227737","url":null,"abstract":"A framework of data organization methods and corresponding recognition results for UNIPEN databases is presented to enable the comparison of recognition results from different isolated character recognizers. A reproducible method for splitting the Train-R01/V07 data into an array of multi-writer and omni-writer training and testing pairs is proposed. Recognition results and uncertainties are provided for each pair, as well as results for the DevTest-R01/V02 character subsets, using an online scanning n-tuple recognizer. Several other published results are surveyed within this context. In sum, this report provides the reader multiple points of reference useful for comparing a number of published recognition results and a proposed framework that similarly allows private evaluation of unpublished recognition results.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"52 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"128712365","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 61
Structured and unstructured document summarization:design of a commercial summarizer using Lexical chains 结构化和非结构化文档摘要:使用词法链的商业摘要器的设计
H. Alam, Aman Kumar, Mikako Nakamura, F. Rahman, Yuliya Tarnikova, C. Wilcox
The process of summarizing documents is becomingincreasingly important in the light of recent advances indocument creation/distribution technology, and theresulting influx of large numbers of documents in everyday life. This paper presents a document summarizer thatcombines document analysis, structural decomposition,XML representation and lexical chain analysis. Theproposed summarizer is compared to three commerciallyavailable summarizers and it is shown that it produceseither comparable or better summaries overall.
鉴于最近文件创建/分发技术的进步,以及由此导致的日常生活中大量文件的涌入,总结文件的过程变得越来越重要。本文提出了一个集文档分析、结构分解、XML表示和词法链分析于一体的文档摘要器。将提出的摘要器与三种市售的摘要器进行比较,结果表明,它总体上产生了可比较或更好的摘要。
{"title":"Structured and unstructured document summarization:design of a commercial summarizer using Lexical chains","authors":"H. Alam, Aman Kumar, Mikako Nakamura, F. Rahman, Yuliya Tarnikova, C. Wilcox","doi":"10.1109/ICDAR.2003.1227836","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227836","url":null,"abstract":"The process of summarizing documents is becomingincreasingly important in the light of recent advances indocument creation/distribution technology, and theresulting influx of large numbers of documents in everyday life. This paper presents a document summarizer thatcombines document analysis, structural decomposition,XML representation and lexical chain analysis. Theproposed summarizer is compared to three commerciallyavailable summarizers and it is shown that it produceseither comparable or better summaries overall.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"129072626","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 39
Symbols recognition by global-local structural approaches, based on the scenarios use,and with a XML representation of data 基于场景使用的全局-局部结构方法的符号识别,并使用数据的XML表示
Mathieu Delalandre, Stéphane Nicolas, É. Trupin, J. Ogier
This paper deals with the structural recognition ofsymbols on the documents. We have based our system ona combination of local and global structural approaches.The global approach groups the connected componentstogether according to some closeness and connectionconstraints. The local approach splits up each connectedcomponent into a graph of geometrical objects (vectors,arcs, curves). The extracted graphs are matched thanks toa structural classifier, which permits graph-subgraph andexact-inexact matching. The system adaptability isobtained thanks to the scenarios use. A XML datarepresentation is used, allowing the data manipulationsand the graphic representations of results.
本文研究了文献符号的结构识别问题。我们的系统是基于本地和全球结构方法的结合。全局方法根据一些紧密性和连接约束将连接的组件分组在一起。局部方法将每个连接的组件分割成几何对象(向量、弧、曲线)的图形。通过结构分类器对提取的图进行匹配,该分类器允许图-子图和精确-不精确匹配。通过场景使用,获得了系统的适应性。使用XML数据表示,允许数据操作和结果的图形表示。
{"title":"Symbols recognition by global-local structural approaches, based on the scenarios use,and with a XML representation of data","authors":"Mathieu Delalandre, Stéphane Nicolas, É. Trupin, J. Ogier","doi":"10.1109/ICDAR.2003.1227810","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227810","url":null,"abstract":"This paper deals with the structural recognition ofsymbols on the documents. We have based our system ona combination of local and global structural approaches.The global approach groups the connected componentstogether according to some closeness and connectionconstraints. The local approach splits up each connectedcomponent into a graph of geometrical objects (vectors,arcs, curves). The extracted graphs are matched thanks toa structural classifier, which permits graph-subgraph andexact-inexact matching. The system adaptability isobtained thanks to the scenarios use. A XML datarepresentation is used, allowing the data manipulationsand the graphic representations of results.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130727456","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 6
Online handwritten Japanese text recognition free from constrains on line direction and character orientation 在线手写日语文本识别,不受线方向和字符方向的限制
M. Nakagawa, M. Onuma
This paper describes an on-line handwritten Japanese text recognition method that is liberated from constraints on writing direction (line direction) and character orientation. This method estimates the line direction and character orientation using the time sequence information of pen-tip coordinates and employs writing-box-free recognition with context processing combined. The method can cope with a mixture of vertical, horizontal and skewed lines with arbitrary character orientations. It is expected useful for tablet PCs, interactive electronic whiteboards and so on.
本文描述了一种不受书写方向(线方向)和字符方向限制的在线手写体日语文本识别方法。该方法利用笔尖坐标的时间序列信息估计直线方向和字符方向,并将无书写盒识别与上下文处理相结合。该方法可以处理具有任意字符方向的垂直、水平和歪斜线的混合。预计可用于平板电脑、交互式电子白板等。
{"title":"Online handwritten Japanese text recognition free from constrains on line direction and character orientation","authors":"M. Nakagawa, M. Onuma","doi":"10.1109/ICDAR.2003.1227719","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227719","url":null,"abstract":"This paper describes an on-line handwritten Japanese text recognition method that is liberated from constraints on writing direction (line direction) and character orientation. This method estimates the line direction and character orientation using the time sequence information of pen-tip coordinates and employs writing-box-free recognition with context processing combined. The method can cope with a mixture of vertical, horizontal and skewed lines with arbitrary character orientations. It is expected useful for tablet PCs, interactive electronic whiteboards and so on.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"17 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132883380","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
Binary classification trees for multi-class classification problems 二叉分类树用于多类分类问题
Jin-Seon Lee, Il-Seok Oh
This paper proposes a binary classification tree aiming atsolving multi-class classification problems using binaryclassifiers. The tree design is achieved in a way that aclass group is partitioned into two distinct subgroups at anode. The node adopts the class-modular scheme toimprove the binary classification capability. Thepartitioning is formulated as an optimization problemand a genetic algorithm is proposed to solve theoptimization problem. The binary classification tree iscompared to the conventional methods in terms ofclassification accuracy and timing efficiency.Experiments were performed with numeral recognitionand touching-numeral pair recognition.
本文提出了一种利用二分类器解决多类分类问题的二分类树。树形设计的实现方式是将类组在阳极处划分为两个不同的子组。节点采用类模块化方案,提高了二值分类能力。将分区问题表述为一个优化问题,并提出了一种求解优化问题的遗传算法。二叉分类树在分类精度和时序效率方面与传统方法进行了比较。进行了数字识别和触摸-数字对识别实验。
{"title":"Binary classification trees for multi-class classification problems","authors":"Jin-Seon Lee, Il-Seok Oh","doi":"10.1109/ICDAR.2003.1227766","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227766","url":null,"abstract":"This paper proposes a binary classification tree aiming atsolving multi-class classification problems using binaryclassifiers. The tree design is achieved in a way that aclass group is partitioned into two distinct subgroups at anode. The node adopts the class-modular scheme toimprove the binary classification capability. Thepartitioning is formulated as an optimization problemand a genetic algorithm is proposed to solve theoptimization problem. The binary classification tree iscompared to the conventional methods in terms ofclassification accuracy and timing efficiency.Experiments were performed with numeral recognitionand touching-numeral pair recognition.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"59 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132006346","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 32
On-line signature verification using local shape analysis 基于局部形状分析的在线签名验证
M. Zou, Jianjun Tong, Chang-ping Liu, Zhengliang Lou
This paper presents a novel approach to the on-line signature verification using local shape analysis. First, segment the input signature into several segments using HMM (hidden Markov model). Then, combine two adjacent segments to form a long segment and get its spectral and tremor information using FFT (fast Fourier transformation). At last, accept it or reject it based on the similarity between the spectral and its prototype. In addition, we proposed a novel initialization algorithm to avoid the local optimal of the HMM's re-estimation and a novel algorithm to avoid losing the important information at cusps in preprocessing. Combining the local shape analysis with the local time-based comparison, we get promising experimental results.
提出了一种基于局部形状分析的在线签名验证方法。首先,使用隐马尔可夫模型(HMM)将输入签名分割成若干段。然后,将两个相邻的片段组合成一个长片段,利用快速傅里叶变换(FFT)得到其频谱和震颤信息。最后,根据光谱与其原型的相似度,对其进行接受或拒绝。此外,我们提出了一种新的初始化算法,以避免HMM重估计的局部最优,并提出了一种新的算法,以避免在预处理中丢失重要信息的尖端。将局部形状分析与基于局部时间的比较相结合,得到了令人满意的实验结果。
{"title":"On-line signature verification using local shape analysis","authors":"M. Zou, Jianjun Tong, Chang-ping Liu, Zhengliang Lou","doi":"10.1109/ICDAR.2003.1227680","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227680","url":null,"abstract":"This paper presents a novel approach to the on-line signature verification using local shape analysis. First, segment the input signature into several segments using HMM (hidden Markov model). Then, combine two adjacent segments to form a long segment and get its spectral and tremor information using FFT (fast Fourier transformation). At last, accept it or reject it based on the similarity between the spectral and its prototype. In addition, we proposed a novel initialization algorithm to avoid the local optimal of the HMM's re-estimation and a novel algorithm to avoid losing the important information at cusps in preprocessing. Combining the local shape analysis with the local time-based comparison, we get promising experimental results.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"6 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"132051081","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 28
A multiscale approach to restoring scanned color document images with show-through effects 带透显效果的彩色文档扫描图像的多尺度恢复方法
H. Nishida, Takeshi Suzuki
This paper describes a new approach to restoring scanned color document images where the backside image shows through the paper sheet. A new framework is presented for correcting show-through components using digital image processing techniques. First, the foreground components on the front side are separated from the background and backside components through locally adaptive binarization for each color component and edge magnitude thresholding. Background colors are estimated locally through color thresholding to generate a restored image, and then corrected adaptively through multiscale analysis along with comparison of edge distributions between the original and the restored image. The proposed method does not require specific input devices or the backside to be input; it is able to correct unneeded image components through analysis of the front side image alone. Experimental results are given to verify effectiveness of the proposed method.
本文介绍了一种恢复扫描彩色文档图像的新方法,其中背面图像通过纸张显示。提出了一种利用数字图像处理技术校正透显组件的新框架。首先,通过对每个颜色分量进行局部自适应二值化和边缘幅度阈值分割,将正面的前景分量与背景和背面分量分离。通过颜色阈值分割局部估计背景颜色,生成恢复图像,然后通过多尺度分析,比较原始图像和恢复图像的边缘分布,进行自适应校正。所建议的方法不需要特定的输入设备或背面输入;它能够通过单独分析正面图像来校正不需要的图像成分。实验结果验证了该方法的有效性。
{"title":"A multiscale approach to restoring scanned color document images with show-through effects","authors":"H. Nishida, Takeshi Suzuki","doi":"10.1109/ICDAR.2003.1227731","DOIUrl":"https://doi.org/10.1109/ICDAR.2003.1227731","url":null,"abstract":"This paper describes a new approach to restoring scanned color document images where the backside image shows through the paper sheet. A new framework is presented for correcting show-through components using digital image processing techniques. First, the foreground components on the front side are separated from the background and backside components through locally adaptive binarization for each color component and edge magnitude thresholding. Background colors are estimated locally through color thresholding to generate a restored image, and then corrected adaptively through multiscale analysis along with comparison of edge distributions between the original and the restored image. The proposed method does not require specific input devices or the backside to be input; it is able to correct unneeded image components through analysis of the front side image alone. Experimental results are given to verify effectiveness of the proposed method.","PeriodicalId":249193,"journal":{"name":"Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.","volume":"1 1","pages":"0"},"PeriodicalIF":0.0,"publicationDate":"2003-08-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"130249435","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":0,"RegionCategory":"","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 23
期刊
Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings.
全部 Acc. Chem. Res. ACS Applied Bio Materials ACS Appl. Electron. Mater. ACS Appl. Energy Mater. ACS Appl. Mater. Interfaces ACS Appl. Nano Mater. ACS Appl. Polym. Mater. ACS BIOMATER-SCI ENG ACS Catal. ACS Cent. Sci. ACS Chem. Biol. ACS Chemical Health & Safety ACS Chem. Neurosci. ACS Comb. Sci. ACS Earth Space Chem. ACS Energy Lett. ACS Infect. Dis. ACS Macro Lett. ACS Mater. Lett. ACS Med. Chem. Lett. ACS Nano ACS Omega ACS Photonics ACS Sens. ACS Sustainable Chem. Eng. ACS Synth. Biol. Anal. Chem. BIOCHEMISTRY-US Bioconjugate Chem. BIOMACROMOLECULES Chem. Res. Toxicol. Chem. Rev. Chem. Mater. CRYST GROWTH DES ENERG FUEL Environ. Sci. Technol. Environ. Sci. Technol. Lett. Eur. J. Inorg. Chem. IND ENG CHEM RES Inorg. Chem. J. Agric. Food. Chem. J. Chem. Eng. Data J. Chem. Educ. J. Chem. Inf. Model. J. Chem. Theory Comput. J. Med. Chem. J. Nat. Prod. J PROTEOME RES J. Am. Chem. Soc. LANGMUIR MACROMOLECULES Mol. Pharmaceutics Nano Lett. Org. Lett. ORG PROCESS RES DEV ORGANOMETALLICS J. Org. Chem. J. Phys. Chem. J. Phys. Chem. A J. Phys. Chem. B J. Phys. Chem. C J. Phys. Chem. Lett. Analyst Anal. Methods Biomater. Sci. Catal. Sci. Technol. Chem. Commun. Chem. Soc. Rev. CHEM EDUC RES PRACT CRYSTENGCOMM Dalton Trans. Energy Environ. Sci. ENVIRON SCI-NANO ENVIRON SCI-PROC IMP ENVIRON SCI-WAT RES Faraday Discuss. Food Funct. Green Chem. Inorg. Chem. Front. Integr. Biol. J. Anal. At. Spectrom. J. Mater. Chem. A J. Mater. Chem. B J. Mater. Chem. C Lab Chip Mater. Chem. Front. Mater. Horiz. MEDCHEMCOMM Metallomics Mol. Biosyst. Mol. Syst. Des. Eng. Nanoscale Nanoscale Horiz. Nat. Prod. Rep. New J. Chem. Org. Biomol. Chem. Org. Chem. Front. PHOTOCH PHOTOBIO SCI PCCP Polym. Chem.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1