CGR-Net: Consistency Guided ResFormer for Two-View Correspondence Learning
Changcai Yang; Xiaojie Li; Jiayi Ma; Fengyuan Zhuang; Lifang Wei; Riqing Chen; Guodong Chen
IEEE Transactions on Circuits and Systems for Video Technology, vol. 34, no. 12, pp. 12450-12465
DOI: 10.1109/TCSVT.2024.3439348
Published: 2024-08-06
Citations: 0
Abstract
Accurately identifying correct correspondences (inliers) between two-view images is a fundamental task in computer vision. Recent studies usually adopt Graph Neural Networks or stack local graphs into global ones to establish neighborhood relations. However, the smoothing properties of Graph Convolutional Networks (GCNs) cause the model to fall into local extrema, making inliers and outliers indistinguishable. In particular, when the initial correspondences contain a large number of incorrect correspondences (outliers), these methods suffer severe performance degradation. To address these issues and refocus perspective information on distinct features, we design a Consistency Guided ResFormer Network (CGR-Net) that uses consistent correspondences to guide the model's perspective focusing, thereby avoiding the negative impact of outliers. Specifically, we design an efficient Graph Score Calculation module, which computes global graph scores by enhancing the representation of important features and comprehensively capturing the contextual relationships between correspondences. We then propose a Consistency Guided Correspondences Selection module that dynamically fuses global graph scores and consistency graphs and constructs a novel consistency matrix to accurately recognize inliers. Extensive experiments on various challenging tasks demonstrate that our CGR-Net outperforms state-of-the-art methods. Our code is released at https://github.com/XiaojieLi11/CGR-Net.
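To make the consistency-matrix idea concrete, the following is a minimal, generic sketch (not the paper's actual CGR-Net modules): inliers between two views share a coherent motion, so a pairwise consistency matrix over correspondence motion vectors assigns high scores to inliers and low scores to outliers. The function name, the Gaussian kernel, the `sigma` bandwidth, and the 0.5 threshold are all illustrative assumptions.

```python
import numpy as np

def consistency_scores(pts1, pts2, sigma=0.1):
    """Score each putative correspondence by motion-vector consistency.

    Illustrative assumption: inliers share a coherent displacement between
    views, so pairs of inliers have similar motion vectors while outliers
    do not. C[i, j] is a Gaussian similarity of motions i and j.
    """
    motion = pts2 - pts1                                # (N, 2) displacement per correspondence
    diff = motion[:, None, :] - motion[None, :, :]      # (N, N, 2) pairwise motion differences
    dist2 = np.sum(diff ** 2, axis=-1)                  # (N, N) squared motion distances
    C = np.exp(-dist2 / (2 * sigma ** 2))               # consistency matrix, entries in (0, 1]
    return C.mean(axis=1)                               # per-correspondence consistency score

# Toy example: 8 correspondences following a shared translation (inliers),
# 2 with unrelated motions (outliers).
rng = np.random.default_rng(0)
pts1 = rng.uniform(0.0, 1.0, size=(10, 2))
pts2 = pts1 + np.array([0.2, -0.1])                     # shared motion -> inliers
pts2[8] = pts1[8] + np.array([-0.5, 0.4])               # deviating motion -> outlier
pts2[9] = pts1[9] + np.array([0.6, 0.5])                # deviating motion -> outlier

scores = consistency_scores(pts1, pts2)
inliers = scores > 0.5                                  # inliers score ~0.8, outliers ~0.1
```

In CGR-Net itself the consistency matrix is learned by fusing global graph scores with consistency graphs rather than computed from a fixed kernel; this sketch only shows why pairwise consistency separates inliers from outliers.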
Journal Description:
The IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) is dedicated to covering all aspects of video technologies from a circuits and systems perspective. We encourage submissions of general, theoretical, and application-oriented papers related to image and video acquisition, representation, presentation, and display. Additionally, we welcome contributions in areas such as processing, filtering, and transforms; analysis and synthesis; learning and understanding; compression, transmission, communication, and networking; as well as storage, retrieval, indexing, and search. Furthermore, papers focusing on hardware and software design and implementation are highly valued. Join us in advancing the field of video technology through innovative research and insights.