Hierarchical Self-Learning Knowledge Inference Based on Markov Random Field for Semantic Segmentation of Remote Sensing Images

IF 8.6 1区地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2024-09-18 DOI:10.1109/TGRS.2024.3463433

Yuncheng Chen;Leiguang Wang;Jingying Li;Chen Zheng

{"title":"Hierarchical Self-Learning Knowledge Inference Based on Markov Random Field for Semantic Segmentation of Remote Sensing Images","authors":"Yuncheng Chen;Leiguang Wang;Jingying Li;Chen Zheng","doi":"10.1109/TGRS.2024.3463433","DOIUrl":null,"url":null,"abstract":"Semantic segmentation is one of the most important tasks in the field of remote sensing. As the spatial resolution increases, the remote sensing images can capture more detailed information and make hierarchical semantic interpretation possible. However, hierarchical semantic segmentation encounters high heterogeneity not only within the intra-layer classes but also among inter-layer classes. It brings challenges to semantic segmentation methods such as the convolutional neural network (CNN). In this article, a hierarchical self-learning knowledge inference model (HSKIM) based on the Markov random field (MRF) model is proposed for hierarchical semantic segmentation of remote sensing images. The HSKIM model introduces a new framework that integrates the advantages of CNN-based data feature learning and MRF-based hierarchical semantic inference. It contains three modules: data learning module (\n<inline-formula> <tex-math>$\\boldsymbol {D}$ </tex-math></inline-formula>\n), inference units generation module (\n<inline-formula> <tex-math>$\\boldsymbol {I}$ </tex-math></inline-formula>\n), and self-learning knowledge inference module (\n<inline-formula> <tex-math>$\\boldsymbol {S}$ </tex-math></inline-formula>\n). The module \n<inline-formula> <tex-math>$\\boldsymbol {D}$ </tex-math></inline-formula>\n uses CNN to learn specific data features layer by layer and extract preliminary geographical objects as the initial results. The module I refines the geographical objects using a novel boundary-preservation trick to generate more accurate inference units with clear geographical meaning. The module S introduces a hierarchical object-based MRF model to implement semantic inference among intra-layer and inter-layer inference units, guided by the spatial interactions and geographical criteria. This module can self-learn and update the relationship between classes iteratively and provide the final result. Experiments on the GID dataset with hierarchical classes, alongside 12 state-of-the-art CNN-based methods, validate the effectiveness and robustness of the proposed HSKIM model. The code of this article is available at \n<uri>https://github.com/iichengzi/HSKIM</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"62 ","pages":"1-18"},"PeriodicalIF":8.6000,"publicationDate":"2024-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10683796/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Semantic segmentation is one of the most important tasks in the field of remote sensing. As the spatial resolution increases, the remote sensing images can capture more detailed information and make hierarchical semantic interpretation possible. However, hierarchical semantic segmentation encounters high heterogeneity not only within the intra-layer classes but also among inter-layer classes. It brings challenges to semantic segmentation methods such as the convolutional neural network (CNN). In this article, a hierarchical self-learning knowledge inference model (HSKIM) based on the Markov random field (MRF) model is proposed for hierarchical semantic segmentation of remote sensing images. The HSKIM model introduces a new framework that integrates the advantages of CNN-based data feature learning and MRF-based hierarchical semantic inference. It contains three modules: data learning module (

$\boldsymbol {D}$

), inference units generation module (

$\boldsymbol {I}$

), and self-learning knowledge inference module (

$\boldsymbol {S}$

). The module

$\boldsymbol {D}$

uses CNN to learn specific data features layer by layer and extract preliminary geographical objects as the initial results. The module I refines the geographical objects using a novel boundary-preservation trick to generate more accurate inference units with clear geographical meaning. The module S introduces a hierarchical object-based MRF model to implement semantic inference among intra-layer and inter-layer inference units, guided by the spatial interactions and geographical criteria. This module can self-learn and update the relationship between classes iteratively and provide the final result. Experiments on the GID dataset with hierarchical classes, alongside 12 state-of-the-art CNN-based methods, validate the effectiveness and robustness of the proposed HSKIM model. The code of this article is available at https://github.com/iichengzi/HSKIM .

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

基于马尔可夫随机场的分层自学知识推理用于遥感图像的语义分割

语义分割是遥感领域最重要的任务之一。随着空间分辨率的提高，遥感图像可以捕捉到更详细的信息，使分层语义解释成为可能。然而，分层语义分割不仅在层内类别之间，而且在层间类别之间都会遇到高度异质性。这给卷积神经网络（CNN）等语义分割方法带来了挑战。本文提出了一种基于马尔可夫随机场（MRF）模型的分层自学习知识推理模型（HSKIM），用于遥感图像的分层语义分割。HSKIM 模型引入了一个新框架，整合了基于 CNN 的数据特征学习和基于 MRF 的分层语义推断的优势。它包含三个模块：数据学习模块（$\boldsymbol {D}$）、推理单元生成模块（$\boldsymbol {I}$）和自学知识推理模块（$\boldsymbol {S}$）。模块 $\boldsymbol {D}$ 利用 CNN 逐层学习特定的数据特征，并提取初步的地理对象作为初始结果。模块 I 使用新颖的边界保护技巧完善地理对象，以生成具有明确地理意义的更精确推理单元。模块 S 引入基于对象的分层 MRF 模型，在空间交互和地理标准的指导下，实现层内和层间推理单元之间的语义推理。该模块可以自我学习和迭代更新类之间的关系，并提供最终结果。在具有分层类别的 GID 数据集上进行的实验，以及 12 种基于 CNN 的先进方法，验证了所提出的 HSKIM 模型的有效性和鲁棒性。本文代码见 https://github.com/iichengzi/HSKIM。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理

CiteScore

11.50

自引率

28.00%

发文量

1912

审稿时长

4.0 months

期刊介绍： IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.