ACP-Net: Asymmetric Center Positioning Network for Real-Time Text Detection

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Knowledge-Based Systems Pub Date : 2024-12-03 Epub Date: 2024-10-06 DOI:10.1016/j.knosys.2024.112603
Boyuan Zhu , Fagui Liu , Xi Chen , Quan Tang , C.L. Philip Chen
{"title":"ACP-Net: Asymmetric Center Positioning Network for Real-Time Text Detection","authors":"Boyuan Zhu ,&nbsp;Fagui Liu ,&nbsp;Xi Chen ,&nbsp;Quan Tang ,&nbsp;C.L. Philip Chen","doi":"10.1016/j.knosys.2024.112603","DOIUrl":null,"url":null,"abstract":"<div><div>Scene text detection is crucial across numerous application fields. However, despite the emphasis on real-time performance in scene text detection, most existing detection models utilize the Feature Pyramid Network (FPN) for feature extraction, often disregarding its inherent limitations. Integrating high-resolution multi-channel features into FPN requires substantial computational resources. While FPN treats local and global features equally and is stable in various applications, its suitability for text-specific features is questionable. To this end, we propose the Asymmetric Center Positioning Network (ACP-Net) to replace FPN, achieving accuracy and real-time text detection in complex scenarios. ACP-Net features an asymmetric feature structure with independent branches for global and local information, along with an adaptive weighted fusion module to capture long-range dependencies effectively. In addition, a text center positioning module enhances text feature understanding by learning feature centers. Comprehensive evaluations across various terminals confirmed ACP-Net’s superior accuracy and speed.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"305 ","pages":"Article 112603"},"PeriodicalIF":7.6000,"publicationDate":"2024-12-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705124012371","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/6 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Scene text detection is crucial across numerous application fields. However, despite the emphasis on real-time performance in scene text detection, most existing detection models utilize the Feature Pyramid Network (FPN) for feature extraction, often disregarding its inherent limitations. Integrating high-resolution multi-channel features into FPN requires substantial computational resources. While FPN treats local and global features equally and is stable in various applications, its suitability for text-specific features is questionable. To this end, we propose the Asymmetric Center Positioning Network (ACP-Net) to replace FPN, achieving accuracy and real-time text detection in complex scenarios. ACP-Net features an asymmetric feature structure with independent branches for global and local information, along with an adaptive weighted fusion module to capture long-range dependencies effectively. In addition, a text center positioning module enhances text feature understanding by learning feature centers. Comprehensive evaluations across various terminals confirmed ACP-Net’s superior accuracy and speed.

Abstract Image

查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ACP-Net:用于实时文本检测的非对称中心定位网络
场景文本检测在众多应用领域都至关重要。然而,尽管场景文本检测强调实时性,但现有的大多数检测模型都使用特征金字塔网络(FPN)进行特征提取,往往忽略了其固有的局限性。将高分辨率多通道特征整合到 FPN 中需要大量的计算资源。虽然 FPN 对局部和全局特征一视同仁,而且在各种应用中都很稳定,但它是否适用于特定文本特征却值得怀疑。为此,我们提出了非对称中心定位网络(ACP-Net)来取代 FPN,从而在复杂场景中实现准确、实时的文本检测。ACP-Net 采用非对称特征结构,具有独立的全局和局部信息分支,并配有自适应加权融合模块,可有效捕捉长距离依赖关系。此外,文本中心定位模块通过学习特征中心来增强对文本特征的理解。对各种终端的综合评估证实了 ACP-Net 的卓越准确性和速度。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Knowledge-Based Systems
Knowledge-Based Systems 工程技术-计算机:人工智能
CiteScore
14.80
自引率
12.50%
发文量
1245
审稿时长
7.8 months
期刊介绍: Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.
期刊最新文献
Revisiting the role of linguistic knowledge in large language models through prompting ACO–PAL: A prior-Aware learning framework for local path planning in complex environments LLM-enabled universal traffic signal control across different intersections and traffic flows Multi-view semi-supervised classification via innovative graph construction and smoothness-aware graph convolution Galio: Defending ownership of AI-generated images against content-preserving tampering
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1