{"title":"FSENet:用于微小目标检测的特征抑制和增强网络","authors":"Heng Hu, Sibao Chen, Zhihui You, Jin Tang","doi":"10.1016/j.patcog.2025.111425","DOIUrl":null,"url":null,"abstract":"<div><div>Although feature fusion has been widely used to improve detection performance, it can also lead to the mixing of feature information from different layers, which affects detection of tiny objects. To alleviate feature mixing problem, suppress the complex background interference and further improve detection performance, a new feature suppression and enhancement network is designed in this paper. In order to suppress background information and object feature information from non-local feature layers, we propose a feature suppression and enhancement module (FSEM). In FSEM, feature suppression module (FSM) aims to suppress background information and redundant features while emphasizing features of tiny objects. This helps to mitigate blending of irrelevant features and increase focusing on tiny object features. Feature enhancement module (FEM) aims to highlight deep large object feature information by combining it with shallow features. By enhancing features at different scales, FEM helps maintain feature discrimination. FSM adopts a plug-and-play design and can be embedded into detectors with feature fusion capabilities. In addition, we propose an improved Kullback–Leibler divergence (IKLD) as loss function. Distribution shifting convolution (DSConv) is adopted instead of convolution in neck to reduce computational effort. The effectiveness of our method is validated on the AI-TOD, VisDrone and DOTA datasets.</div></div>","PeriodicalId":49713,"journal":{"name":"Pattern Recognition","volume":"162 ","pages":"Article 111425"},"PeriodicalIF":7.6000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FSENet: Feature suppression and enhancement network for tiny object detection\",\"authors\":\"Heng Hu, Sibao Chen, Zhihui You, Jin Tang\",\"doi\":\"10.1016/j.patcog.2025.111425\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Although feature fusion has been widely used to improve detection performance, it can also lead to the mixing of feature information from different layers, which affects detection of tiny objects. To alleviate feature mixing problem, suppress the complex background interference and further improve detection performance, a new feature suppression and enhancement network is designed in this paper. In order to suppress background information and object feature information from non-local feature layers, we propose a feature suppression and enhancement module (FSEM). In FSEM, feature suppression module (FSM) aims to suppress background information and redundant features while emphasizing features of tiny objects. This helps to mitigate blending of irrelevant features and increase focusing on tiny object features. Feature enhancement module (FEM) aims to highlight deep large object feature information by combining it with shallow features. By enhancing features at different scales, FEM helps maintain feature discrimination. FSM adopts a plug-and-play design and can be embedded into detectors with feature fusion capabilities. In addition, we propose an improved Kullback–Leibler divergence (IKLD) as loss function. Distribution shifting convolution (DSConv) is adopted instead of convolution in neck to reduce computational effort. The effectiveness of our method is validated on the AI-TOD, VisDrone and DOTA datasets.</div></div>\",\"PeriodicalId\":49713,\"journal\":{\"name\":\"Pattern Recognition\",\"volume\":\"162 \",\"pages\":\"Article 111425\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0031320325000858\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/2/7 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0031320325000858","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/7 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
FSENet: Feature suppression and enhancement network for tiny object detection
Although feature fusion has been widely used to improve detection performance, it can also lead to the mixing of feature information from different layers, which affects detection of tiny objects. To alleviate feature mixing problem, suppress the complex background interference and further improve detection performance, a new feature suppression and enhancement network is designed in this paper. In order to suppress background information and object feature information from non-local feature layers, we propose a feature suppression and enhancement module (FSEM). In FSEM, feature suppression module (FSM) aims to suppress background information and redundant features while emphasizing features of tiny objects. This helps to mitigate blending of irrelevant features and increase focusing on tiny object features. Feature enhancement module (FEM) aims to highlight deep large object feature information by combining it with shallow features. By enhancing features at different scales, FEM helps maintain feature discrimination. FSM adopts a plug-and-play design and can be embedded into detectors with feature fusion capabilities. In addition, we propose an improved Kullback–Leibler divergence (IKLD) as loss function. Distribution shifting convolution (DSConv) is adopted instead of convolution in neck to reduce computational effort. The effectiveness of our method is validated on the AI-TOD, VisDrone and DOTA datasets.
期刊介绍:
The field of Pattern Recognition is both mature and rapidly evolving, playing a crucial role in various related fields such as computer vision, image processing, text analysis, and neural networks. It closely intersects with machine learning and is being applied in emerging areas like biometrics, bioinformatics, multimedia data analysis, and data science. The journal Pattern Recognition, established half a century ago during the early days of computer science, has since grown significantly in scope and influence.