利用结构特征对移动相机捕捉的招牌图像进行优化的Gurmukhi文本识别

Triptinder Pal Kaur, N. Garg
{"title":"利用结构特征对移动相机捕捉的招牌图像进行优化的Gurmukhi文本识别","authors":"Triptinder Pal Kaur, N. Garg","doi":"10.1109/ICACC.2015.65","DOIUrl":null,"url":null,"abstract":"Earlier, research was restricted to the images acquired by traditional scanners, however an innovative trend of research has emerged with the evolution of portable, high speed digital cameras and multimedia mobile phones comprising smart features. They provided us the opportunity to employ them for image acquisition as an alternate to traditional scanners for the recognition purpose. This subject has attracted numerous researchers, meanwhile it provides a means for automatic processing of substantial amount of data. Text to speech translation of recognized text from images can be ready to lend a hand for visually impaired people and for those who are unfamiliar with the language. This paper provides technical solution for the recognition of Gurmukhi text from the images of different signboards acquired by camera of different resolution. Segmentation is accomplished using vertical and horizontal projection histograms on the pre-processed image which breakdowns the text into lines, words and characters. Feature extraction and recognition on the segmented characters is accomplished by considering at least three corresponding structural features holes, endpoints and junctions. Consequently, our recognition is grounded on the location and number of these features extracted. The proposed algorithm was tested on 1300 images of Gurmukhi text acquired by camera and recognition rate of 90% demonstrates the precision of the system.","PeriodicalId":368544,"journal":{"name":"2015 Fifth International Conference on Advances in Computing and Communications (ICACC)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Optimized Gurmukhi Text Recognition from Signboard Images Captured by Mobile Camera Using Structural Features\",\"authors\":\"Triptinder Pal Kaur, N. Garg\",\"doi\":\"10.1109/ICACC.2015.65\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Earlier, research was restricted to the images acquired by traditional scanners, however an innovative trend of research has emerged with the evolution of portable, high speed digital cameras and multimedia mobile phones comprising smart features. They provided us the opportunity to employ them for image acquisition as an alternate to traditional scanners for the recognition purpose. This subject has attracted numerous researchers, meanwhile it provides a means for automatic processing of substantial amount of data. Text to speech translation of recognized text from images can be ready to lend a hand for visually impaired people and for those who are unfamiliar with the language. This paper provides technical solution for the recognition of Gurmukhi text from the images of different signboards acquired by camera of different resolution. Segmentation is accomplished using vertical and horizontal projection histograms on the pre-processed image which breakdowns the text into lines, words and characters. Feature extraction and recognition on the segmented characters is accomplished by considering at least three corresponding structural features holes, endpoints and junctions. Consequently, our recognition is grounded on the location and number of these features extracted. The proposed algorithm was tested on 1300 images of Gurmukhi text acquired by camera and recognition rate of 90% demonstrates the precision of the system.\",\"PeriodicalId\":368544,\"journal\":{\"name\":\"2015 Fifth International Conference on Advances in Computing and Communications (ICACC)\",\"volume\":\"30 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 Fifth International Conference on Advances in Computing and Communications (ICACC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACC.2015.65\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 Fifth International Conference on Advances in Computing and Communications (ICACC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACC.2015.65","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

早期,研究仅限于传统扫描仪获取的图像,但随着便携式高速数码相机和多媒体移动电话的发展,研究出现了创新趋势,包括智能功能。它们为我们提供了使用它们进行图像采集的机会,作为识别目的的传统扫描仪的替代方案。这一课题吸引了众多研究者,同时也为大量数据的自动处理提供了一种手段。从图像中识别文本的文本到语音翻译可以随时为视障人士和不熟悉语言的人提供帮助。本文为从不同分辨率的摄像机采集的不同广告牌图像中识别古穆克文字提供了技术解决方案。分割是通过在预处理图像上使用垂直和水平投影直方图来完成的,该直方图将文本分解为行、词和字符。通过考虑至少三个相应的结构特征——孔洞、端点和连接点,来完成对分割字符的特征提取和识别。因此,我们的识别是基于提取的这些特征的位置和数量。该算法在1300张相机采集的古穆克文字图像上进行了测试,识别率达到90%,证明了系统的精度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
Optimized Gurmukhi Text Recognition from Signboard Images Captured by Mobile Camera Using Structural Features
Earlier, research was restricted to the images acquired by traditional scanners, however an innovative trend of research has emerged with the evolution of portable, high speed digital cameras and multimedia mobile phones comprising smart features. They provided us the opportunity to employ them for image acquisition as an alternate to traditional scanners for the recognition purpose. This subject has attracted numerous researchers, meanwhile it provides a means for automatic processing of substantial amount of data. Text to speech translation of recognized text from images can be ready to lend a hand for visually impaired people and for those who are unfamiliar with the language. This paper provides technical solution for the recognition of Gurmukhi text from the images of different signboards acquired by camera of different resolution. Segmentation is accomplished using vertical and horizontal projection histograms on the pre-processed image which breakdowns the text into lines, words and characters. Feature extraction and recognition on the segmented characters is accomplished by considering at least three corresponding structural features holes, endpoints and junctions. Consequently, our recognition is grounded on the location and number of these features extracted. The proposed algorithm was tested on 1300 images of Gurmukhi text acquired by camera and recognition rate of 90% demonstrates the precision of the system.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Implementation of NTCIP in Road Traffic Controllers for Traffic Signal Coordination AutoScaling of VM in Private And Public Cloud Environment with Debt Assessment Fuzzy Cautious Adaptive Random Early Detection for Heterogeneous Network Enhancing the Accuracy of Movie Recommendation System Based on Probabilistic Data Structure and Graph Database Compact Band Notched UWB Filter for Wireless Communication Applications
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1