{"title":"Robust Binarization for Video Text Recognition","authors":"Z. Saidane, Christophe Garcia","doi":"10.1109/ICDAR.2007.222","DOIUrl":null,"url":null,"abstract":"This paper presents an automatic binarization method for color text areas in images or videos, which is robust to complex background, low resolution or video coding artefacts. Based on a specific architecture of convolutional neural networks, the proposed system automatically learns how to perform binarization, from a training set of synthesized text images and their corresponding desired binary images, without making any assumptions or using tunable parameters. The proposed method is compared to state-of-the-art binarization techniques, with respect to Gaussian noise and contrast variations, demonstrating the robustness and the efficiency of our method. Text recognition experiments on a database of images extracted from video frames and web pages, with two classical OCRs applied on the obtained binary images show a strong enhancement of the recognition rate by more than 40%.","PeriodicalId":279268,"journal":{"name":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","volume":"143 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2007-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"42","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDAR.2007.222","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 42
Abstract
This paper presents an automatic binarization method for color text areas in images or videos, which is robust to complex background, low resolution or video coding artefacts. Based on a specific architecture of convolutional neural networks, the proposed system automatically learns how to perform binarization, from a training set of synthesized text images and their corresponding desired binary images, without making any assumptions or using tunable parameters. The proposed method is compared to state-of-the-art binarization techniques, with respect to Gaussian noise and contrast variations, demonstrating the robustness and the efficiency of our method. Text recognition experiments on a database of images extracted from video frames and web pages, with two classical OCRs applied on the obtained binary images show a strong enhancement of the recognition rate by more than 40%.