{"title":"基于MSER和CNN的场景图像文本检测与识别","authors":"Savita Choudhary, N. Singh, Sanjay Chichadwani","doi":"10.1109/ICAECC.2018.8479419","DOIUrl":null,"url":null,"abstract":"Detection and recognition of text from natural images is very important for extracting information from images but is an extensively challenging task. This paper proposes an approach for detection of text area from natural scene images using Maximally Stable Extremal Regions (MSER) and recognizing the text using a self-trained Neural Network. Some preprocessing is applied to the image then MSER and canny edge is used to locate the smaller areas that may more likely contain text. The text is individually isolated as single characters by simple algorithms on the binary image and then passed through the recognition model specially designed for hazy and unaligned characters.","PeriodicalId":106991,"journal":{"name":"2018 Second International Conference on Advances in Electronics, Computers and Communications (ICAECC)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":"{\"title\":\"Text Detection and Recognition from Scene Images using MSER and CNN\",\"authors\":\"Savita Choudhary, N. Singh, Sanjay Chichadwani\",\"doi\":\"10.1109/ICAECC.2018.8479419\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Detection and recognition of text from natural images is very important for extracting information from images but is an extensively challenging task. This paper proposes an approach for detection of text area from natural scene images using Maximally Stable Extremal Regions (MSER) and recognizing the text using a self-trained Neural Network. Some preprocessing is applied to the image then MSER and canny edge is used to locate the smaller areas that may more likely contain text. The text is individually isolated as single characters by simple algorithms on the binary image and then passed through the recognition model specially designed for hazy and unaligned characters.\",\"PeriodicalId\":106991,\"journal\":{\"name\":\"2018 Second International Conference on Advances in Electronics, Computers and Communications (ICAECC)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-02-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"8\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 Second International Conference on Advances in Electronics, Computers and Communications (ICAECC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAECC.2018.8479419\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Second International Conference on Advances in Electronics, Computers and Communications (ICAECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAECC.2018.8479419","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Text Detection and Recognition from Scene Images using MSER and CNN
Detection and recognition of text from natural images is very important for extracting information from images but is an extensively challenging task. This paper proposes an approach for detection of text area from natural scene images using Maximally Stable Extremal Regions (MSER) and recognizing the text using a self-trained Neural Network. Some preprocessing is applied to the image then MSER and canny edge is used to locate the smaller areas that may more likely contain text. The text is individually isolated as single characters by simple algorithms on the binary image and then passed through the recognition model specially designed for hazy and unaligned characters.