Denoising diffusion fusion network for semantic segmentation based on degradation analysis modeling with graph networks

IF 15.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE Information Fusion Pub Date : 2025-10-01 Epub Date: 2025-04-23 DOI:10.1016/j.inffus.2025.103205
Aiqing Fang , Ying Li , Jiang Long , Xiaodong Wang , Yangming Guo
{"title":"Denoising diffusion fusion network for semantic segmentation based on degradation analysis modeling with graph networks","authors":"Aiqing Fang ,&nbsp;Ying Li ,&nbsp;Jiang Long ,&nbsp;Xiaodong Wang ,&nbsp;Yangming Guo","doi":"10.1016/j.inffus.2025.103205","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate semantic segmentation is critical for autonomous driving safety, yet real-world degradations severely compromise segmentation robustness, risking safety-critical failures. Current research primarily focuses on architectural innovations and optimization strategies to improve fusion quality. However, these approaches suffer from poor interpretability and inadequate adaptability in adverse weather conditions such as extreme illumination and noise. To address these issues, we propose a denoising diffusion fusion network for semantic segmentation based on degradation modeling. We first construct a multi-scale wavelet-domain fusion network to address the distribution characteristics of different degradations. Building on this decomposition, the denoising diffusion process and wavelet-domain fusion operations are combined to enhance fusion quality. Finally, we develop a denoising diffusion-optimized fusion loss function to guide parameter optimization while suppressing degradation artifacts. Extensive experiments on public datasets show that the proposed method achieves state-of-the-art performance with measurable gains: 42.8% improvement in edge integrity, 37.6% higher spatial frequency for texture preservation, and 39.8% reduction in noise artifacts. Through graph network analysis, we also reveal the interplay mechanisms among different degradations, various fusion quality assessment metrics, and semantic segmentation performance. These advancements exhibit superior robustness to degradations and enhance safety for real-world autonomous systems.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"122 ","pages":"Article 103205"},"PeriodicalIF":15.5000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253525002787","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/4/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate semantic segmentation is critical for autonomous driving safety, yet real-world degradations severely compromise segmentation robustness, risking safety-critical failures. Current research primarily focuses on architectural innovations and optimization strategies to improve fusion quality. However, these approaches suffer from poor interpretability and inadequate adaptability in adverse weather conditions such as extreme illumination and noise. To address these issues, we propose a denoising diffusion fusion network for semantic segmentation based on degradation modeling. We first construct a multi-scale wavelet-domain fusion network to address the distribution characteristics of different degradations. Building on this decomposition, the denoising diffusion process and wavelet-domain fusion operations are combined to enhance fusion quality. Finally, we develop a denoising diffusion-optimized fusion loss function to guide parameter optimization while suppressing degradation artifacts. Extensive experiments on public datasets show that the proposed method achieves state-of-the-art performance with measurable gains: 42.8% improvement in edge integrity, 37.6% higher spatial frequency for texture preservation, and 39.8% reduction in noise artifacts. Through graph network analysis, we also reveal the interplay mechanisms among different degradations, various fusion quality assessment metrics, and semantic segmentation performance. These advancements exhibit superior robustness to degradations and enhance safety for real-world autonomous systems.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
基于图网络退化分析建模的语义分割去噪扩散融合网络
准确的语义分割对自动驾驶安全至关重要,但现实世界的退化严重损害了分割的鲁棒性,可能导致安全关键故障。目前的研究主要集中在架构创新和优化策略上,以提高融合质量。然而,这些方法在恶劣的天气条件下(如极端光照和噪音)存在解释性差和适应性不足的问题。为了解决这些问题,我们提出了一种基于退化建模的语义分割去噪扩散融合网络。我们首先构建了一个多尺度小波域融合网络来处理不同退化的分布特征。在此基础上,将去噪扩散过程与小波域融合操作相结合,提高融合质量。最后,我们开发了一个去噪扩散优化的融合损失函数来指导参数优化,同时抑制退化伪影。在公共数据集上进行的大量实验表明,该方法取得了最先进的性能和可测量的增益:边缘完整性提高42.8%,纹理保存的空间频率提高37.6%,噪声伪影降低39.8%。通过图网络分析,我们还揭示了不同退化、各种融合质量评估指标和语义分割性能之间的相互作用机制。这些进步对退化表现出卓越的鲁棒性,并提高了现实世界自主系统的安全性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
Information Fusion
Information Fusion 工程技术-计算机:理论方法
CiteScore
33.20
自引率
4.30%
发文量
161
审稿时长
7.9 months
期刊介绍: Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.
期刊最新文献
GTEE: A global timestamp encoding enhanced method for robust time series imputation in complex missing scenarios Resilient distributed Kalman filtering for cyber-physical systems via mean subsequence reduction Learning across modalities: a systematic survey of multimodal models for financial analysis MuBe4D: A mutual benefit framework for generalizable motion segmentation and geometry-first 4D reconstruction Decoding multilingual imagined speech from scalp EEG via dynamic differentiable graph hierarchical fusion network
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1