{"title":"屏蔽R-CNN文本检测器","authors":"P. Duan, Jiahao Pan, Wenbi Rao","doi":"10.1109/ICAIIS49377.2020.9194911","DOIUrl":null,"url":null,"abstract":"Scene text detection and scene text recognition are important components of scene text recognition system. Scene text detection, the initial stage of scene text recognition, aims to find out text area in the picture. Recently the target detection method Mask R-CNN has been employed scene text detection and achieved good performance. In this paper, we set forth a model, MaskS R-CNN text detector, based on Mask R-CNN, which attempts to detect scene text. In this model, a network block of Mask Scoring R-CNN is introduced to learn the high quality of the predicted instance mask scores. The mask scoring mechanism correct the inconformity between mask quality and mask score, at the same time improves instance segmentation performance by attaching great importance to more accurate mask predictions. The method put forward in this paper can achieve multi-directional and multi-language natural scene text detection. Compared with some existing traditional location methods based on edge, color and texture and some location methods based on deep learning, it is a relatively innovative method.","PeriodicalId":416002,"journal":{"name":"2020 IEEE International Conference on Artificial Intelligence and Information Systems (ICAIIS)","volume":"29 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"MaskS R-CNN Text Detector\",\"authors\":\"P. Duan, Jiahao Pan, Wenbi Rao\",\"doi\":\"10.1109/ICAIIS49377.2020.9194911\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Scene text detection and scene text recognition are important components of scene text recognition system. Scene text detection, the initial stage of scene text recognition, aims to find out text area in the picture. Recently the target detection method Mask R-CNN has been employed scene text detection and achieved good performance. In this paper, we set forth a model, MaskS R-CNN text detector, based on Mask R-CNN, which attempts to detect scene text. In this model, a network block of Mask Scoring R-CNN is introduced to learn the high quality of the predicted instance mask scores. The mask scoring mechanism correct the inconformity between mask quality and mask score, at the same time improves instance segmentation performance by attaching great importance to more accurate mask predictions. The method put forward in this paper can achieve multi-directional and multi-language natural scene text detection. Compared with some existing traditional location methods based on edge, color and texture and some location methods based on deep learning, it is a relatively innovative method.\",\"PeriodicalId\":416002,\"journal\":{\"name\":\"2020 IEEE International Conference on Artificial Intelligence and Information Systems (ICAIIS)\",\"volume\":\"29 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-03-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 IEEE International Conference on Artificial Intelligence and Information Systems (ICAIIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAIIS49377.2020.9194911\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Artificial Intelligence and Information Systems (ICAIIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICAIIS49377.2020.9194911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Scene text detection and scene text recognition are important components of scene text recognition system. Scene text detection, the initial stage of scene text recognition, aims to find out text area in the picture. Recently the target detection method Mask R-CNN has been employed scene text detection and achieved good performance. In this paper, we set forth a model, MaskS R-CNN text detector, based on Mask R-CNN, which attempts to detect scene text. In this model, a network block of Mask Scoring R-CNN is introduced to learn the high quality of the predicted instance mask scores. The mask scoring mechanism correct the inconformity between mask quality and mask score, at the same time improves instance segmentation performance by attaching great importance to more accurate mask predictions. The method put forward in this paper can achieve multi-directional and multi-language natural scene text detection. Compared with some existing traditional location methods based on edge, color and texture and some location methods based on deep learning, it is a relatively innovative method.