Enhancing Object Detection With Fourier Series

IF 18.6 IEEE transactions on pattern analysis and machine intelligence Pub Date : 2025-01-08 DOI:10.1109/TPAMI.2025.3526990

Jin Liu;Zhongyuan Lu;Yaorong Cen;Hui Hu;Zhenfeng Shao;Yong Hong;Ming Jiang;Miaozhong Xu

{"title":"Enhancing Object Detection With Fourier Series","authors":"Jin Liu;Zhongyuan Lu;Yaorong Cen;Hui Hu;Zhenfeng Shao;Yong Hong;Ming Jiang;Miaozhong Xu","doi":"10.1109/TPAMI.2025.3526990","DOIUrl":null,"url":null,"abstract":"Traditional object detection models often lose the detailed outline information of the object. To address this problem, we propose the Fourier Series Object Detection (FSD). It encodes the object's outline closed curve into two one-dimensional periodic Fourier series. The Fourier Series Model (FSM) is constructed to regress the Fourier series for each object in the image. Thus, during inference, the detailed outline information of each object can be retrieved. We introduce Rolling Optimization Matching for Fourier loss to ensure that the model's learning process is not affected by the sequence of the starting points of the labeled contour points, speeding up the training process. The FSM demonstrates improved feature extraction and descriptive capabilities for non-rectangular or elongated object regions. The model achieves AP50 = 73.3% on the DOTA 1.5 dataset, which surpasses the state-of-the-art (SOTA) method by 6.44% at 66.86%. On the UCAS dataset, the model achieves AP50 = 97.25%, also surpassing the performance indicators of the SOTA methods. Furthermore, we introduce the object's Fourier power spectrum to describe outline features and the Fourier vector to indicate its direction. This enhances the scene semantic representation of the object detection model and paves a new pathway for the evolution of object detection methodologies.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"47 4","pages":"2581-2596"},"PeriodicalIF":18.6000,"publicationDate":"2025-01-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10833868/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Traditional object detection models often lose the detailed outline information of the object. To address this problem, we propose the Fourier Series Object Detection (FSD). It encodes the object's outline closed curve into two one-dimensional periodic Fourier series. The Fourier Series Model (FSM) is constructed to regress the Fourier series for each object in the image. Thus, during inference, the detailed outline information of each object can be retrieved. We introduce Rolling Optimization Matching for Fourier loss to ensure that the model's learning process is not affected by the sequence of the starting points of the labeled contour points, speeding up the training process. The FSM demonstrates improved feature extraction and descriptive capabilities for non-rectangular or elongated object regions. The model achieves AP50 = 73.3% on the DOTA 1.5 dataset, which surpasses the state-of-the-art (SOTA) method by 6.44% at 66.86%. On the UCAS dataset, the model achieves AP50 = 97.25%, also surpassing the performance indicators of the SOTA methods. Furthermore, we introduce the object's Fourier power spectrum to describe outline features and the Fourier vector to indicate its direction. This enhances the scene semantic representation of the object detection model and paves a new pathway for the evolution of object detection methodologies.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

傅里叶级数增强目标检测

传统的目标检测模型往往会丢失目标的详细轮廓信息。为了解决这个问题，我们提出了傅立叶级数目标检测（FSD）。它将物体的轮廓闭合曲线编码为两个一维周期傅立叶级数。构造傅里叶级数模型（FSM）来回归图像中每个对象的傅里叶级数。因此，在推理过程中，可以检索到每个对象的详细轮廓信息。引入傅里叶损失的滚动优化匹配，确保模型的学习过程不受标记轮廓点起始点序列的影响，加快训练过程。FSM演示了改进的非矩形或细长对象区域的特征提取和描述能力。该模型在DOTA 1.5数据集上达到了AP50 = 73.3%，比最先进的SOTA方法（66.86%）高出6.44%。在UCAS数据集上，该模型的AP50 = 97.25%，也超过了SOTA方法的性能指标。此外，我们引入了物体的傅立叶功率谱来描述轮廓特征和傅立叶矢量来指示其方向。这增强了目标检测模型的场景语义表示，为目标检测方法的发展开辟了新的途径。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量

期刊最新文献

Spike Camera Optical Flow Estimation Based on Continuous Spike Streams. Bi-C²R: Bidirectional Continual Compatible Representation for Re-Indexing Free Lifelong Person Re-Identification. 2025 Reviewers List* Deep Robust Reversible Watermarking. Parameter-Efficient Fine-Tuning for Continual Learning: A Neural Tangent Kernel Perspective.