Siamese Networks: The Tale of Two Manifolds

2019 IEEE/CVF International Conference on Computer Vision (ICCV) Pub Date : 2019-10-01 DOI:10.1109/ICCV.2019.00314

S. Roy, M. Harandi, R. Nock, R. Hartley

引用次数: 38

Abstract

Siamese networks are non-linear deep models that have found their ways into a broad set of problems in learning theory, thanks to their embedding capabilities. In this paper, we study Siamese networks from a new perspective and question the validity of their training procedure. We show that in the majority of cases, the objective of a Siamese network is endowed with an invariance property. Neglecting the invariance property leads to a hindrance in training the Siamese networks. To alleviate this issue, we propose two Riemannian structures and generalize a well-established accelerated stochastic gradient descent method to take into account the proposed Riemannian structures. Our empirical evaluations suggest that by making use of the Riemannian geometry, we achieve state-of-the-art results against several algorithms for the challenging problem of fine-grained image classification.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

暹罗网络:两个流形的故事

暹罗网络是一种非线性深度模型，由于其嵌入能力，它已经在学习理论的广泛问题中找到了自己的方法。在本文中，我们从一个新的角度来研究暹罗网络，并质疑其训练程序的有效性。我们证明，在大多数情况下，暹罗网络的目标具有不变性。忽略网络的不变性会阻碍网络的训练。为了缓解这一问题，我们提出了两种黎曼结构，并推广了一种成熟的加速随机梯度下降方法来考虑所提出的黎曼结构。我们的经验评估表明，通过使用黎曼几何，我们针对细粒度图像分类的挑战性问题的几种算法获得了最先进的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2019 IEEE/CVF International Conference on Computer Vision (ICCV)

自引率

0.00%

发文量

期刊最新文献

Very Long Natural Scenery Image Prediction by Outpainting VTNFP: An Image-Based Virtual Try-On Network With Body and Clothing Feature Preservation Towards Latent Attribute Discovery From Triplet Similarities Gaze360: Physically Unconstrained Gaze Estimation in the Wild Attention Bridging Network for Knowledge Transfer