基于高斯概率距离分布的任意形状文本检测

2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET) Pub Date : 2022-08-19 DOI:10.1109/CCET55412.2022.9906393

Li Guo, Zhongyue Chen, Xiaoping Chen

{"title":"基于高斯概率距离分布的任意形状文本检测","authors":"Li Guo, Zhongyue Chen, Xiaoping Chen","doi":"10.1109/CCET55412.2022.9906393","DOIUrl":null,"url":null,"abstract":"With the development of semantic segmentation, segmentation-based methods have yielded great success in detecting arbitrary-shaped texts. However, many existing text detection methods use binary discrete distributions to predict shrunk text instances, which cannot generate complete and accurate text bounding boxes. In this paper, we propose an arbitrary-shaped scene text detection method based on predicting Gaussian probability distance map of the complete text region, and this map can retain more text boundary information. Then, the boundary pixels are clustered into high-confidence text centers by a learnable post-processing and false positives are filtered out by pixel-level score maps. We also propose an adaptive channel enhancement module to improve the pixel-level segmentation accuracy. Experiments on three standard datasets, including CTW1500, Total-Text, and MSRA-TD500, demonstrate that the proposed method achieves great robustness and performance. The method obtains an F-measure of S2.S% on CTW1500 and S3.0% on MSRA-TD500.","PeriodicalId":329327,"journal":{"name":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","volume":"38 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Arbitrary-Shaped Text Detection with Gaussian Probability Distance Distribution\",\"authors\":\"Li Guo, Zhongyue Chen, Xiaoping Chen\",\"doi\":\"10.1109/CCET55412.2022.9906393\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the development of semantic segmentation, segmentation-based methods have yielded great success in detecting arbitrary-shaped texts. However, many existing text detection methods use binary discrete distributions to predict shrunk text instances, which cannot generate complete and accurate text bounding boxes. In this paper, we propose an arbitrary-shaped scene text detection method based on predicting Gaussian probability distance map of the complete text region, and this map can retain more text boundary information. Then, the boundary pixels are clustered into high-confidence text centers by a learnable post-processing and false positives are filtered out by pixel-level score maps. We also propose an adaptive channel enhancement module to improve the pixel-level segmentation accuracy. Experiments on three standard datasets, including CTW1500, Total-Text, and MSRA-TD500, demonstrate that the proposed method achieves great robustness and performance. The method obtains an F-measure of S2.S% on CTW1500 and S3.0% on MSRA-TD500.\",\"PeriodicalId\":329327,\"journal\":{\"name\":\"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)\",\"volume\":\"38 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCET55412.2022.9906393\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCET55412.2022.9906393","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

随着语义切分技术的发展，基于切分的方法在检测任意形状文本方面取得了巨大成功。然而，现有的许多文本检测方法使用二进制离散分布来预测收缩文本实例，无法生成完整和准确的文本边界框。本文提出了一种基于预测完整文本区域高斯概率距离图的任意形状场景文本检测方法，该图可以保留更多的文本边界信息。然后，通过可学习的后处理将边界像素聚类到高置信度的文本中心，并通过像素级分数图过滤假阳性。我们还提出了一个自适应信道增强模块来提高像素级分割的精度。在CTW1500、Total-Text和MSRA-TD500三个标准数据集上的实验表明，该方法具有良好的鲁棒性和性能。该方法得到S2的f值。CTW1500和MSRA-TD500分别为3.0%和3.0%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Arbitrary-Shaped Text Detection with Gaussian Probability Distance Distribution

With the development of semantic segmentation, segmentation-based methods have yielded great success in detecting arbitrary-shaped texts. However, many existing text detection methods use binary discrete distributions to predict shrunk text instances, which cannot generate complete and accurate text bounding boxes. In this paper, we propose an arbitrary-shaped scene text detection method based on predicting Gaussian probability distance map of the complete text region, and this map can retain more text boundary information. Then, the boundary pixels are clustered into high-confidence text centers by a learnable post-processing and false positives are filtered out by pixel-level score maps. We also propose an adaptive channel enhancement module to improve the pixel-level segmentation accuracy. Experiments on three standard datasets, including CTW1500, Total-Text, and MSRA-TD500, demonstrate that the proposed method achieves great robustness and performance. The method obtains an F-measure of S2.S% on CTW1500 and S3.0% on MSRA-TD500.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)

自引率

0.00%

发文量