A robust algorithm for text extraction from images
Najwa-Maria Chidiac, Pascal Damien, C. Yaacoub
2016 39th International Conference on Telecommunications and Signal Processing (TSP), 2016-06-27
DOI: 10.1109/TSP.2016.7760928 (https://doi.org/10.1109/TSP.2016.7760928)
Citations: 16
Abstract
A robust algorithm is proposed that detects text in natural scene images and extracts it regardless of orientation. Existing methods are all designed to operate under some constraint, such as detecting text in only one direction. The Maximally Stable Extremal Regions (MSER) detector is chosen to extract binary regions because it has proven robust to lighting conditions. An enhancement technique for MSER images is designed to obtain clear letter boundaries. The enhanced images are then fed into a Stroke Width Detector, and several heuristics are applied to remove non-text pixels. The detected text regions are then passed to an Optical Character Recognition (OCR) module and filtered according to their confidence measure. Character recognition itself is not part of the algorithm; the reported results concern text detection only. Based on both subjective and objective evaluations, the algorithm also proved effective on blurred and noisy images.
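The two filtering stages mentioned in the abstract (stroke-width heuristics followed by OCR confidence filtering) can be illustrated with a minimal sketch. The abstract does not specify the heuristics or thresholds, so everything here is an assumption made for illustration: the `Candidate` structure, the use of the coefficient of variation of stroke widths as the text-likeness heuristic, and the `max_variation` and `min_confidence` values. It is not the authors' implementation.

```python
from dataclasses import dataclass
from statistics import mean, pstdev
from typing import List, Sequence


@dataclass
class Candidate:
    """A candidate text region (hypothetical structure, not from the paper)."""
    label: str
    stroke_widths: Sequence[float]  # stroke width estimates inside the region
    ocr_confidence: float           # assumed 0..100 score reported by an OCR engine


def stroke_width_variation(widths: Sequence[float]) -> float:
    """Coefficient of variation (std / mean); text strokes tend to be uniform."""
    m = mean(widths)
    return pstdev(widths) / m if m > 0 else float("inf")


def filter_candidates(
    candidates: Sequence[Candidate],
    max_variation: float = 0.5,    # assumed threshold, not from the paper
    min_confidence: float = 60.0,  # assumed threshold, not from the paper
) -> List[Candidate]:
    """Keep regions with uniform stroke width and sufficient OCR confidence."""
    kept = []
    for c in candidates:
        if not c.stroke_widths:
            continue  # no stroke information, cannot be judged
        if stroke_width_variation(c.stroke_widths) > max_variation:
            continue  # stroke width too irregular to be text
        if c.ocr_confidence < min_confidence:
            continue  # OCR engine is not confident this is text
        kept.append(c)
    return kept


candidates = [
    Candidate("sign lettering", [3, 3, 4, 3, 3], 91.0),
    Candidate("foliage", [1, 9, 2, 14, 5], 72.0),
    Candidate("brick pattern", [4, 4, 4, 5, 4], 20.0),
]
kept_labels = [c.label for c in filter_candidates(candidates)]
print(kept_labels)  # prints ['sign lettering']
```

In this toy input, the foliage region is rejected for irregular stroke width and the brick pattern for low OCR confidence. Note that OCR is used here only as a detection filter, consistent with the abstract's statement that character recognition is not part of the reported results.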