Yong-Qiang Mao;Zhizhuo Jiang;Yu Liu;Yiming Zhang;Kehan Qi;Hanbo Bi;You He
{"title":"FRORS: An Effective Fine-Grained Retrieval Framework for Optical Remote Sensing Images","authors":"Yong-Qiang Mao;Zhizhuo Jiang;Yu Liu;Yiming Zhang;Kehan Qi;Hanbo Bi;You He","doi":"10.1109/JSTARS.2025.3545828","DOIUrl":null,"url":null,"abstract":"Fine-grained retrieval of remote sensing images is an image interpretation task that is still in its infancy. With the rapid development of convolutional neural networks (CNN) in the field of remote sensing, it has become possible for remote sensing image retrieval tasks to move toward more fine-grained classes. However, since current methods focus on how to construct similarity metrics between sample pairs, the model ignores the learning of fine-grained intraclass heterogeneity and interclass commonality features, which poses a huge challenge to fine-grained retrieval. To solve this problem, we propose a novel fine-grained retrieval framework of optical remote sensing (FRORS) images, which aims to improve fine-grained retrieval capabilities by constructing interaction and matching between intraclass heterogeneity features, interclass commonality features, and image features. Specifically, we first construct a fine-grained prototype memory (FPM) module, and continuously update the local prototype storage unit through a lightweight CNN to achieve a refined representation of fine-grained heterogeneity features. Furthermore, to learn interclass commonality, we propose a gram learning (GraL) strategy, which is achieved by learning the correlation between feature dimensions. On this basis, we introduce a gram-based metric match (GMM) mechanism, which fuses the prototype features representing intraclass heterogeneity and the gram vector representing interclass commonality through an embedding manner, thereby achieving the purpose of fully interactive matching between image features and fine-grained class features. With FPM, GraL, and GMM, our FRORS can better learn deep features representing fine-grained classes and promote the improvement of the network's fine-grained retrieval ability. Extensive experiments conducted on a self-constructed THUFG-OPT dataset prove that the proposed FRORS achieves state-of-the-art fine-grained retrieval performance, which is 5.75% higher than the baseline method on <inline-formula><tex-math>$\\mathrm{mAP@10}$</tex-math></inline-formula>.","PeriodicalId":13116,"journal":{"name":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","volume":"18 ","pages":"7406-7419"},"PeriodicalIF":4.7000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10904305","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10904305/","RegionNum":2,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0
Abstract
Fine-grained retrieval of remote sensing images is an image interpretation task that is still in its infancy. With the rapid development of convolutional neural networks (CNN) in the field of remote sensing, it has become possible for remote sensing image retrieval tasks to move toward more fine-grained classes. However, since current methods focus on how to construct similarity metrics between sample pairs, the model ignores the learning of fine-grained intraclass heterogeneity and interclass commonality features, which poses a huge challenge to fine-grained retrieval. To solve this problem, we propose a novel fine-grained retrieval framework of optical remote sensing (FRORS) images, which aims to improve fine-grained retrieval capabilities by constructing interaction and matching between intraclass heterogeneity features, interclass commonality features, and image features. Specifically, we first construct a fine-grained prototype memory (FPM) module, and continuously update the local prototype storage unit through a lightweight CNN to achieve a refined representation of fine-grained heterogeneity features. Furthermore, to learn interclass commonality, we propose a gram learning (GraL) strategy, which is achieved by learning the correlation between feature dimensions. On this basis, we introduce a gram-based metric match (GMM) mechanism, which fuses the prototype features representing intraclass heterogeneity and the gram vector representing interclass commonality through an embedding manner, thereby achieving the purpose of fully interactive matching between image features and fine-grained class features. With FPM, GraL, and GMM, our FRORS can better learn deep features representing fine-grained classes and promote the improvement of the network's fine-grained retrieval ability. Extensive experiments conducted on a self-constructed THUFG-OPT dataset prove that the proposed FRORS achieves state-of-the-art fine-grained retrieval performance, which is 5.75% higher than the baseline method on $\mathrm{mAP@10}$.
期刊介绍:
The IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing addresses the growing field of applications in Earth observations and remote sensing, and also provides a venue for the rapidly expanding special issues that are being sponsored by the IEEE Geosciences and Remote Sensing Society. The journal draws upon the experience of the highly successful “IEEE Transactions on Geoscience and Remote Sensing” and provide a complementary medium for the wide range of topics in applied earth observations. The ‘Applications’ areas encompasses the societal benefit areas of the Global Earth Observations Systems of Systems (GEOSS) program. Through deliberations over two years, ministers from 50 countries agreed to identify nine areas where Earth observation could positively impact the quality of life and health of their respective countries. Some of these are areas not traditionally addressed in the IEEE context. These include biodiversity, health and climate. Yet it is the skill sets of IEEE members, in areas such as observations, communications, computers, signal processing, standards and ocean engineering, that form the technical underpinnings of GEOSS. Thus, the Journal attracts a broad range of interests that serves both present members in new ways and expands the IEEE visibility into new areas.