{"title":"Deep Clustering of Remote Sensing Scenes through Heterogeneous Transfer Learning","authors":"Isaac Ray, Alexei Skurikhin","doi":"arxiv-2409.03938","DOIUrl":null,"url":null,"abstract":"This paper proposes a method for unsupervised whole-image clustering of a\ntarget dataset of remote sensing scenes with no labels. The method consists of\nthree main steps: (1) finetuning a pretrained deep neural network (DINOv2) on a\nlabelled source remote sensing imagery dataset and using it to extract a\nfeature vector from each image in the target dataset, (2) reducing the\ndimension of these deep features via manifold projection into a low-dimensional\nEuclidean space, and (3) clustering the embedded features using a Bayesian\nnonparametric technique to infer the number and membership of clusters\nsimultaneously. The method takes advantage of heterogeneous transfer learning\nto cluster unseen data with different feature and label distributions. We\ndemonstrate the performance of this approach outperforming state-of-the-art\nzero-shot classification methods on several remote sensing scene classification\ndatasets.","PeriodicalId":501172,"journal":{"name":"arXiv - STAT - Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - STAT - Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03938","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This paper proposes a method for unsupervised whole-image clustering of a
target dataset of remote sensing scenes with no labels. The method consists of
three main steps: (1) finetuning a pretrained deep neural network (DINOv2) on a
labelled source remote sensing imagery dataset and using it to extract a
feature vector from each image in the target dataset, (2) reducing the
dimension of these deep features via manifold projection into a low-dimensional
Euclidean space, and (3) clustering the embedded features using a Bayesian
nonparametric technique to infer the number and membership of clusters
simultaneously. The method takes advantage of heterogeneous transfer learning
to cluster unseen data with different feature and label distributions. We
demonstrate the performance of this approach outperforming state-of-the-art
zero-shot classification methods on several remote sensing scene classification
datasets.