HDF-Net: Capturing Homogeny Difference Features to Localize the Tampered Image

IF 18.6 IEEE transactions on pattern analysis and machine intelligence Pub Date : 2024-07-23 DOI:10.1109/TPAMI.2024.3432551

Ruidong Han;Xiaofeng Wang;Ningning Bai;Yihang Wang;Jianpeng Hou;Jianru Xue

{"title":"HDF-Net: Capturing Homogeny Difference Features to Localize the Tampered Image","authors":"Ruidong Han;Xiaofeng Wang;Ningning Bai;Yihang Wang;Jianpeng Hou;Jianru Xue","doi":"10.1109/TPAMI.2024.3432551","DOIUrl":null,"url":null,"abstract":"Modern image editing software enables anyone to alter the content of an image to deceive the public, which can pose a security hazard to personal privacy and public safety. The detection and localization of image tampering is becoming an urgent issue to be addressed. We have revealed that the tampered region exhibits homogenous differences (the changes in metadata organization form and organization structure of the image) from the real region after manipulations such as splicing, copy-move, and removal. Therefore, we propose a novel end-to-end network named HDF-Net to extract these homogeny difference features for precise localization of tampering artifacts. The HDF-Net is composed of RGB and SRM dual-stream networks, including three complementary modules, namely the suspicious tampering-artifact prominent (STP) module, the fine tampering-artifact salient (FTS) module, and the tampering-artifact edge refined (TER) module. We utilize the fully attentional block (FLA) to enhance the characterization ability of homogeny difference features extracted by each module and preserve the specifics of tampering artifacts. These modules are gradually merged according to the strategy of “coarse-fine-finer”, which significantly improves the localization accuracy and edge refinement. Extensive experiments demonstrate that HDF-Net performs better than state-of-the-art tampering localization models on five benchmarks, achieving satisfactory generalization and robustness.","PeriodicalId":94034,"journal":{"name":"IEEE transactions on pattern analysis and machine intelligence","volume":"46 12","pages":"10005-10020"},"PeriodicalIF":18.6000,"publicationDate":"2024-07-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on pattern analysis and machine intelligence","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10607909/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Modern image editing software enables anyone to alter the content of an image to deceive the public, which can pose a security hazard to personal privacy and public safety. The detection and localization of image tampering is becoming an urgent issue to be addressed. We have revealed that the tampered region exhibits homogenous differences (the changes in metadata organization form and organization structure of the image) from the real region after manipulations such as splicing, copy-move, and removal. Therefore, we propose a novel end-to-end network named HDF-Net to extract these homogeny difference features for precise localization of tampering artifacts. The HDF-Net is composed of RGB and SRM dual-stream networks, including three complementary modules, namely the suspicious tampering-artifact prominent (STP) module, the fine tampering-artifact salient (FTS) module, and the tampering-artifact edge refined (TER) module. We utilize the fully attentional block (FLA) to enhance the characterization ability of homogeny difference features extracted by each module and preserve the specifics of tampering artifacts. These modules are gradually merged according to the strategy of “coarse-fine-finer”, which significantly improves the localization accuracy and edge refinement. Extensive experiments demonstrate that HDF-Net performs better than state-of-the-art tampering localization models on five benchmarks, achieving satisfactory generalization and robustness.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

HDF-Net：捕捉同质差异特征，定位被篡改的图像。

现代图像编辑软件使任何人都能篡改图像内容以欺骗公众，这可能对个人隐私和公共安全构成安全隐患。图像篡改的检测和定位已成为亟待解决的问题。我们发现，经过拼接、复制移动和删除等操作后，被篡改区域与真实区域呈现出同质差异（图像元数据组织形式和组织结构的变化）。因此，我们提出了一种名为 HDF-Net 的新型端到端网络来提取这些同质差异特征，从而精确定位篡改伪影。HDF 网络由 RGB 和 SRM 双流网络组成，包括三个互补模块，即可疑篡改伪影突出（STP）模块、精细篡改伪影突出（FTS）模块和篡改伪影边缘细化（TER）模块。我们利用全注意力区块（FLA）来增强各模块提取的同质性差异特征的表征能力，并保留篡改伪影的特异性。这些模块按照 "粗-细-精 "的策略逐步合并，从而显著提高了定位精度和边缘细化能力。广泛的实验证明，在五个基准测试中，HDF-Net 的表现优于最先进的篡改定位模型，达到了令人满意的泛化和鲁棒性。代码见 https://github.com/ruidonghan/HDF-Net/。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

IEEE transactions on pattern analysis and machine intelligence

自引率

0.00%

发文量

期刊最新文献

Examining the Impact of Optical Aberrations to Image Classification and Object Detection Models. Neural Eigenfunctions are Structured Representation Learners. On the Adversarial Transferability of Generalized "Skip Connections". Distribution-to-Points Matching for Image Text Retrieval. Penny-Wise and Pound-Foolish in AI-Generated Image Detection.