基于区域增长算法的广告展示板内容提取

Ramesh M. Badiger, Manjunath Y. Kammar, Ningappa T. Pujar
{"title":"基于区域增长算法的广告展示板内容提取","authors":"Ramesh M. Badiger, Manjunath Y. Kammar, Ningappa T. Pujar","doi":"10.1109/ICAECCT.2016.7942603","DOIUrl":null,"url":null,"abstract":"In recent years portable camera devices have gained increased popularity and embedded visual processing, text extraction from natural scene images like advertisement display boards, government office boards has become a key problem in everyday lives. The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. Here a novel method for extraction of text from advertisement display boards using Region growing algorithm is proposed. The proposed algorithm is generally composed of four stages, the colored image is converted into grayscale image and canny edge method is used to detect the edges of the image, the edge detected image is preprocessed by applying morphological operations and rule based method is used to remove the non text objects based on width, height and area, later finding the centroid point of the connected component of identified objects and finally proposed algorithm region growing method is used to start extracting the characters. The method is robust and insensitive to noise, blur, variation in font size and style, color, uneven thickness, and varying lightning conditions. The text extraction accuracy of 90.94% is achieved.","PeriodicalId":6629,"journal":{"name":"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)","volume":"17 1","pages":"303-307"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Content extraction from advertisement display boards utilizing Region growing algorithm\",\"authors\":\"Ramesh M. Badiger, Manjunath Y. Kammar, Ningappa T. Pujar\",\"doi\":\"10.1109/ICAECCT.2016.7942603\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years portable camera devices have gained increased popularity and embedded visual processing, text extraction from natural scene images like advertisement display boards, government office boards has become a key problem in everyday lives. The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. Here a novel method for extraction of text from advertisement display boards using Region growing algorithm is proposed. The proposed algorithm is generally composed of four stages, the colored image is converted into grayscale image and canny edge method is used to detect the edges of the image, the edge detected image is preprocessed by applying morphological operations and rule based method is used to remove the non text objects based on width, height and area, later finding the centroid point of the connected component of identified objects and finally proposed algorithm region growing method is used to start extracting the characters. The method is robust and insensitive to noise, blur, variation in font size and style, color, uneven thickness, and varying lightning conditions. The text extraction accuracy of 90.94% is achieved.\",\"PeriodicalId\":6629,\"journal\":{\"name\":\"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)\",\"volume\":\"17 1\",\"pages\":\"303-307\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAECCT.2016.7942603\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE International Conference on Advances in Electronics, Communication and Computer Technology (ICAECCT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAECCT.2016.7942603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

近年来,便携式摄像设备越来越普及,嵌入式视觉处理,从广告展板、政府办公板等自然场景图像中提取文本已成为人们日常生活中的关键问题。由于字体大小和颜色、文本对齐、照明变化和反射的变化,这个问题本质上是具有挑战性的。本文提出了一种基于区域增长算法的广告展示板文本提取方法。该算法一般由四个阶段组成:将彩色图像转换为灰度图像,使用canny边缘法对图像进行边缘检测,对检测到的图像进行形态学预处理,并使用基于规则的方法对基于宽度、高度和面积的非文本对象进行去除;然后找到识别对象的连通分量的质心点,最后采用提出的算法区域生长法开始提取特征。该方法具有鲁棒性,对噪声、模糊、字体大小和样式变化、颜色、厚度不均匀和闪电条件变化不敏感。文本提取准确率达到90.94%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Content extraction from advertisement display boards utilizing Region growing algorithm
In recent years portable camera devices have gained increased popularity and embedded visual processing, text extraction from natural scene images like advertisement display boards, government office boards has become a key problem in everyday lives. The problem is challenging in nature due to variations in the font size and color, text alignment, illumination change and reflections. Here a novel method for extraction of text from advertisement display boards using Region growing algorithm is proposed. The proposed algorithm is generally composed of four stages, the colored image is converted into grayscale image and canny edge method is used to detect the edges of the image, the edge detected image is preprocessed by applying morphological operations and rule based method is used to remove the non text objects based on width, height and area, later finding the centroid point of the connected component of identified objects and finally proposed algorithm region growing method is used to start extracting the characters. The method is robust and insensitive to noise, blur, variation in font size and style, color, uneven thickness, and varying lightning conditions. The text extraction accuracy of 90.94% is achieved.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Keynote speakers Emotweet: Sentiment Analysis tool for twitter Design of faster & power efficient sense amplifier using VLSI technology A comparative study on distance measuring approches for permutation representations An embedded system of dedicated and real-time fire detector and locator technology as an interactive response mechanism in fire occurrences
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1