An improved scene text and document image binarization scheme

R. Ghoshal, A. Banerjee
{"title":"An improved scene text and document image binarization scheme","authors":"R. Ghoshal, A. Banerjee","doi":"10.1109/RAIT.2018.8389021","DOIUrl":null,"url":null,"abstract":"Identification of text portions have a crucial impact on intelligent transport systems, document image processing, robotics and content based image retrieval systems. So, an accurate text identification method is necessary for text based scene image processing tasks such as OCR. Scene text image binarization plays an important role in any text identification algorithm and hence in the OCR performance. In this work a novel approach to natural scene text image binarization by tracking the text boundary based on edge and gray level variance information. Further, broken boundaries are linked to construct the complete boundary map. Here, an adaptive threshold is determined based on boundary edge information to binarize the image effectively. Compared to other well known binarization methods, our method has been proved more effective in cases where the natural scene images have low contrast, low resolution, non-uniform illumination and noise. Our experiments are conducted on the datasets of ICDAR 2003 Robust Reading Competition, ICDAR 2011 Born Digital Dataset, Street View Text (SVT) Dataset, DIBCO dataset and our laboratory made Bangla Dataset. The experimental results are satisfactory.","PeriodicalId":219972,"journal":{"name":"2018 4th International Conference on Recent Advances in Information Technology (RAIT)","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 4th International Conference on Recent Advances in Information Technology (RAIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAIT.2018.8389021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 10

Abstract

Identification of text portions have a crucial impact on intelligent transport systems, document image processing, robotics and content based image retrieval systems. So, an accurate text identification method is necessary for text based scene image processing tasks such as OCR. Scene text image binarization plays an important role in any text identification algorithm and hence in the OCR performance. In this work a novel approach to natural scene text image binarization by tracking the text boundary based on edge and gray level variance information. Further, broken boundaries are linked to construct the complete boundary map. Here, an adaptive threshold is determined based on boundary edge information to binarize the image effectively. Compared to other well known binarization methods, our method has been proved more effective in cases where the natural scene images have low contrast, low resolution, non-uniform illumination and noise. Our experiments are conducted on the datasets of ICDAR 2003 Robust Reading Competition, ICDAR 2011 Born Digital Dataset, Street View Text (SVT) Dataset, DIBCO dataset and our laboratory made Bangla Dataset. The experimental results are satisfactory.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
一种改进的场景文本和文档图像二值化方案
文本部分的识别对智能运输系统、文档图像处理、机器人技术和基于内容的图像检索系统具有至关重要的影响。因此,对于OCR等基于文本的场景图像处理任务,需要一种准确的文本识别方法。场景文本图像二值化在文本识别算法中起着重要的作用,影响着OCR的性能。本文提出了一种基于边缘和灰度方差信息跟踪文本边界的自然场景文本图像二值化方法。此外,将破碎的边界连接起来构建完整的边界图。该方法基于边界边缘信息确定自适应阈值,实现图像的有效二值化。与其他二值化方法相比,该方法在自然场景图像对比度低、分辨率低、光照不均匀和有噪声的情况下更为有效。实验采用ICDAR 2003鲁棒阅读大赛、ICDAR 2011 Born Digital数据集、街景文本(SVT)数据集、DIBCO数据集和我们实验室制作的孟加拉语数据集进行。实验结果令人满意。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Analysis of slope stability and detection of critical failure surface using gravitational search algorithm Prioritization of human errors in EOT crane operations and its visualisation using virtual simulation Impact of land use dynamics on land surface temperature in Jharia coalfield Application of fractional calculus to distinguish left ventricular hypertrophy with normal ECG Miniaturization of Vivaldi antenna for different wireless communication applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1