RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images With Autonomous Agents

IF 8.6 1区 地球科学 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC IEEE Transactions on Geoscience and Remote Sensing Pub Date : 2025-01-22 DOI:10.1109/TGRS.2025.3532594
Zhuoran Liu;Danpei Zhao;Bo Yuan;Zhiguo Jiang
{"title":"RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images With Autonomous Agents","authors":"Zhuoran Liu;Danpei Zhao;Bo Yuan;Zhiguo Jiang","doi":"10.1109/TGRS.2025.3532594","DOIUrl":null,"url":null,"abstract":"Current methods for disaster scene interpretation in remote sensing images (RSIs) mostly focus on isolated tasks such as segmentation, detection, or visual question-answering (VQA). However, these methods often fail to provide comprehensive and actionable insights, particularly in scenarios that demand the integration of multiple perception methods and specialized tools to address complex, multilayered challenges in geophysical disaster analysis. To fill this gap, this article introduces adaptive disaster interpretation (ADI), a novel task designed to solve requests by planning and executing multiple sequentially correlative interpretation tasks to provide a comprehensive analysis of disaster scenes. To facilitate research and application in this area, we present a new dataset named RescueADI, which contains high-resolution RSIs with annotations for three connected aspects: planning, perception, and recognition. The dataset includes 4044 RSIs, 16949 semantic masks, 14483 object bounding boxes, and 13424 interpretation requests across nine challenging request types. Moreover, we propose a new disaster interpretation method employing autonomous agents driven by large language models (LLMs) for task planning and execution, proving its efficacy in handling complex disaster interpretations. The proposed agent-based method solves various complex interpretation requests such as counting, area calculation, and path finding without human intervention, which traditional single-task approaches cannot handle effectively. Experimental results on RescueADI demonstrate the feasibility of the proposed task and show that our method achieves an accuracy 9% higher than existing VQA methods, highlighting its advantages over conventional disaster interpretation approaches.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"63 ","pages":"1-14"},"PeriodicalIF":8.6000,"publicationDate":"2025-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10849780/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

Abstract

Current methods for disaster scene interpretation in remote sensing images (RSIs) mostly focus on isolated tasks such as segmentation, detection, or visual question-answering (VQA). However, these methods often fail to provide comprehensive and actionable insights, particularly in scenarios that demand the integration of multiple perception methods and specialized tools to address complex, multilayered challenges in geophysical disaster analysis. To fill this gap, this article introduces adaptive disaster interpretation (ADI), a novel task designed to solve requests by planning and executing multiple sequentially correlative interpretation tasks to provide a comprehensive analysis of disaster scenes. To facilitate research and application in this area, we present a new dataset named RescueADI, which contains high-resolution RSIs with annotations for three connected aspects: planning, perception, and recognition. The dataset includes 4044 RSIs, 16949 semantic masks, 14483 object bounding boxes, and 13424 interpretation requests across nine challenging request types. Moreover, we propose a new disaster interpretation method employing autonomous agents driven by large language models (LLMs) for task planning and execution, proving its efficacy in handling complex disaster interpretations. The proposed agent-based method solves various complex interpretation requests such as counting, area calculation, and path finding without human intervention, which traditional single-task approaches cannot handle effectively. Experimental results on RescueADI demonstrate the feasibility of the proposed task and show that our method achieves an accuracy 9% higher than existing VQA methods, highlighting its advantages over conventional disaster interpretation approaches.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于自主代理的遥感图像自适应灾害判读
目前的遥感图像灾害现场判读方法主要集中在分割、检测或视觉问答等孤立任务上。然而,这些方法往往不能提供全面和可操作的见解,特别是在需要整合多种感知方法和专业工具来解决地球物理灾害分析中复杂、多层次挑战的情况下。为了填补这一空白,本文介绍了自适应灾难解释(ADI),这是一种新的任务,旨在通过规划和执行多个顺序相关的解释任务来解决请求,从而提供对灾难场景的全面分析。为了促进这一领域的研究和应用,我们提出了一个名为RescueADI的新数据集,该数据集包含高分辨率的rsi,并在规划、感知和识别三个相关方面进行了注释。该数据集包括4044个rsi, 16949个语义掩码,14483个对象边界框,以及跨越9种具有挑战性的请求类型的13424个解释请求。此外,我们提出了一种新的灾难解释方法,利用大型语言模型(llm)驱动的自主代理进行任务规划和执行,证明了其在处理复杂灾难解释方面的有效性。提出的基于智能体的方法在没有人工干预的情况下解决了传统单任务方法无法有效处理的计数、面积计算和寻径等各种复杂的解释请求。在RescueADI上的实验结果证明了所提出任务的可行性,并表明我们的方法比现有的VQA方法准确率高出9%,突出了其相对于传统灾害解释方法的优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
IEEE Transactions on Geoscience and Remote Sensing
IEEE Transactions on Geoscience and Remote Sensing 工程技术-地球化学与地球物理
CiteScore
11.50
自引率
28.00%
发文量
1912
审稿时长
4.0 months
期刊介绍: IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.
期刊最新文献
Distribution-Aware Infrared Small Target Detection Based on Multi-Scale Convolutional Decoder and Hypergraph Attention Detecting Weak Underwater Targets in Hyperspectral Imagery via Physics-aware Residual Reasoning Faint Bottom Echo Detection for Airborne LiDAR Bathymetry Based on a Constrained Waveform Stacking Model First Cooperative Formaldehyde Monitoring with Chinese Morning and Afternoon Satellites: Revealing Global Multi-Temporal Concentration Dynamics Fast Anchor Graph Regularized Relaxation Linear Regression for Classification of Hyperspectral Images
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1