ResAsapp: An Effective Convolution to Distinguish Adjacent Pixels For Scene Text Detection

Proceedings of the 2022 11th International Conference on Computing and Pattern Recognition. Pub Date: 2022-11-17. DOI: 10.1145/3581807.3581854

Kangming Weng, X. Du, Kunze Chen, Dahan Wang, Shunzhi Zhu

Abstract

Segmentation-based methods are an important direction in scene text detection: because they can detect text of arbitrary shape, including curved text, they have attracted increasing attention from researchers. However, extensive research has shown that segmentation-based methods are disturbed by adjacent pixels and cannot effectively identify text boundaries. To tackle this problem, we propose ResAsapp Conv, a convolution structure built on the PSE algorithm. This structure provides receptive fields at different scales over the object, allowing the network to recognize text boundaries effectively. The method's effectiveness is validated on three benchmark datasets: CTW1500, Total-Text, and ICDAR2015. In particular, on CTW1500, a dataset full of long curved text in a wide variety of scenes that is hard to distinguish, our network achieves an F-measure of 81.2%.
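
The abstract does not give the layer layout of ResAsapp Conv, so the following is only a minimal sketch of the general idea it describes: parallel convolutions with different dilation rates see the same pixel at several scales, and a residual connection adds the input back. The function names, the dilation rates (1, 2, 4), and the averaging of branches are all assumptions made for illustration, not the authors' design.

```python
# Illustrative sketch only: NOT the paper's ResAsapp Conv.
# Names, dilation rates, and branch averaging are assumptions.

def dilated_conv2d(image, kernel, dilation):
    """Same-size 2D cross-correlation with zero padding and a dilation rate.

    `image` is a list of rows; `kernel` is a square, odd-sized list of rows.
    """
    h, w = len(image), len(image[0])
    k = len(kernel)
    half = k // 2
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            acc = 0.0
            for i in range(k):
                for j in range(k):
                    # Dilation spreads the kernel taps apart.
                    yy = y + (i - half) * dilation
                    xx = x + (j - half) * dilation
                    if 0 <= yy < h and 0 <= xx < w:
                        acc += kernel[i][j] * image[yy][xx]
            out[y][x] = acc
    return out


def receptive_field(kernel_size, dilation):
    """Side length of the window covered by one dilated kernel."""
    return kernel_size + (kernel_size - 1) * (dilation - 1)


def residual_multiscale_block(image, kernel, dilations=(1, 2, 4)):
    """Average parallel dilated branches, then add the input (residual)."""
    h, w = len(image), len(image[0])
    branches = [dilated_conv2d(image, kernel, d) for d in dilations]
    return [
        [image[y][x] + sum(b[y][x] for b in branches) / len(branches)
         for x in range(w)]
        for y in range(h)
    ]
```

With a 3x3 kernel, `receptive_field` shows the covered window growing from 3 to 5 to 9 pixels per side as the dilation goes from 1 to 2 to 4, which is one common way to obtain the "different scale" views of a pixel's neighbourhood that the abstract refers to.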
