An approach for Bangla and Devanagari video text recognition

MOCR '13 Pub Date : 2013-08-24 DOI:10.1145/2505377.2505389

P. Banerjee, B. Chaudhuri

引用次数: 11

Abstract

Extraction and recognition of Bangla text from video frame images is challenging due to fonts type and style variation, complex color background, low-resolution, low contrast etc. In this paper, we propose an algorithm for extraction and recognition of Bangla and Devanagari text form video frames with complex background. Here, a two-step approach has been proposed. After text localization, the text line is segmented into words using information based on line contours. First order gradient values of the text blocks are used to find the word gap. Next, an Adaptive SIS binarization technique is applied on each word. Next this binarized text block is sent to a state of the art OCR for recognition.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

一种孟加拉语和德文语视频文本识别方法

由于字体类型和样式变化、背景颜色复杂、分辨率低、对比度低等原因，从视频帧图像中提取和识别孟加拉语文本具有挑战性。本文提出了一种基于复杂背景的孟加拉语和印度语视频帧文本提取与识别算法。在这里，提出了一个两步走的方法。文本定位后，使用基于线轮廓的信息将文本线分割成单词。使用文本块的一阶梯度值来查找单词间隙。其次，对每个单词应用自适应SIS二值化技术。接下来，这个二值化的文本块被发送到最先进的OCR进行识别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

MOCR '13

自引率

0.00%

发文量

期刊最新文献

Can we build language-independent OCR using LSTM networks? Recognition of offline handwritten numerals using an ensemble of MLPs combined by Adaboost Word level script recognition for Uighur document mixed with English script An approach for Bangla and Devanagari video text recognition HMM-based script identification for OCR