Pseudo Zernike moments based approach for text detection and localisation from lecture videos

IF 1.4 Q4 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS International Journal of Computational Science and Engineering Pub Date : 2016-01-01 DOI:10.1504/IJCSE.2016.10011674
Belkacem Soundes, Guezouli Larbi, Zidat Samir
{"title":"Pseudo Zernike moments based approach for text detection and localisation from lecture videos","authors":"Belkacem Soundes, Guezouli Larbi, Zidat Samir","doi":"10.1504/IJCSE.2016.10011674","DOIUrl":null,"url":null,"abstract":"Scene text presents challenging characteristics mainly related to acquisition circumstances and environmental changes resulting in low quality videos. In this paper, we present a scene text detection algorithm based on pseudo Zernike moments (PZMs) and stroke features from low resolution lecture videos. Algorithm mainly consists of three steps: slide detection, text detection and segmentation and non-text filtering. In lecture videos, slide region is a key object carrying almost all important information; hence slide region has to be extracted and segmented from other scene objects considered as background for later processing. Slide region detection and segmentation is done by applying pseudo Zernike moment's based on RGB frames. Text detection and extraction is performed using PZMs segmentation over V channel of HSV colour space, and then stroke feature is used to filter out non-text region and to remove false positives. The algorithm is robust to illumination, low resolution and uneven luminance from compressed videos. Effectiveness of PZM description leads to very few false positives comparing to other approached. Moreover resulting images can be used directly by OCR engines and no more processing is needed.","PeriodicalId":47380,"journal":{"name":"International Journal of Computational Science and Engineering","volume":"40 1","pages":"274-283"},"PeriodicalIF":1.4000,"publicationDate":"2016-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computational Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/IJCSE.2016.10011674","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 1

Abstract

Scene text presents challenging characteristics mainly related to acquisition circumstances and environmental changes resulting in low quality videos. In this paper, we present a scene text detection algorithm based on pseudo Zernike moments (PZMs) and stroke features from low resolution lecture videos. Algorithm mainly consists of three steps: slide detection, text detection and segmentation and non-text filtering. In lecture videos, slide region is a key object carrying almost all important information; hence slide region has to be extracted and segmented from other scene objects considered as background for later processing. Slide region detection and segmentation is done by applying pseudo Zernike moment's based on RGB frames. Text detection and extraction is performed using PZMs segmentation over V channel of HSV colour space, and then stroke feature is used to filter out non-text region and to remove false positives. The algorithm is robust to illumination, low resolution and uneven luminance from compressed videos. Effectiveness of PZM description leads to very few false positives comparing to other approached. Moreover resulting images can be used directly by OCR engines and no more processing is needed.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于伪泽尼克矩的演讲视频文本检测和定位方法
场景文本呈现出具有挑战性的特征,主要与获取环境和环境变化有关,导致视频质量低。在本文中,我们提出了一种基于伪泽尼克矩(PZMs)和笔画特征的低分辨率演讲视频场景文本检测算法。算法主要包括三个步骤:幻灯片检测、文本检测与分割和非文本过滤。在讲课视频中,幻灯片区域是承载几乎所有重要信息的关键对象;因此,必须从作为背景的其他场景对象中提取和分割滑动区域,以供后续处理。采用基于RGB帧的伪泽尼克矩进行滑动区域检测和分割。在HSV颜色空间的V通道上使用PZMs分割进行文本检测和提取,然后使用笔画特征过滤掉非文本区域并去除误报。该算法对压缩视频的光照、低分辨率和不均匀亮度具有较强的鲁棒性。与其他方法相比,PZM描述的有效性导致很少的误报。此外,生成的图像可以直接由OCR引擎使用,而不需要更多的处理。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
International Journal of Computational Science and Engineering
International Journal of Computational Science and Engineering COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-
CiteScore
4.00
自引率
40.00%
发文量
73
期刊介绍: Computational science and engineering is an emerging and promising discipline in shaping future research and development activities in both academia and industry, in fields ranging from engineering, science, finance, and economics, to arts and humanities. New challenges arise in the modelling of complex systems, sophisticated algorithms, advanced scientific and engineering computing and associated (multidisciplinary) problem-solving environments. Because the solution of large and complex problems must cope with tight timing schedules, powerful algorithms and computational techniques, are inevitable. IJCSE addresses the state of the art of all aspects of computational science and engineering with emphasis on computational methods and techniques for science and engineering applications.
期刊最新文献
Integer wavelet transform based data hiding scheme for electrocardiogram signals protection Integrated power information operation and maintenance system based on D3QN algorithm with experience replay Availability assessment and sensitivity analysis of an MBaaS platform SLIC-SSA: an image segmentation method based on superpixel and sparrow search algorithm Time series models for web service activity prediction
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1