Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance

ArXiv Pub Date : 2023-06-05 DOI:10.48550/arXiv.2306.02866

Jinwoo Kim, Tien Dat Nguyen, Ayhan Suleymanzade, Hyeokjun An, Seunghoon Hong

{"title":"Learning Probabilistic Symmetrization for Architecture Agnostic Equivariance","authors":"Jinwoo Kim, Tien Dat Nguyen, Ayhan Suleymanzade, Hyeokjun An, Seunghoon Hong","doi":"10.48550/arXiv.2306.02866","DOIUrl":null,"url":null,"abstract":"We present a novel framework to overcome the limitations of equivariant architectures in learning functions with group symmetries. In contrary to equivariant architectures, we use an arbitrary base model (such as an MLP or a transformer) and symmetrize it to be equivariant to the given group by employing a small equivariant network that parameterizes the probabilistic distribution underlying the symmetrization. The distribution is end-to-end trained with the base model which can maximize performance while reducing sample complexity of symmetrization. We show that this approach ensures not only equivariance to given group but also universal approximation capability in expectation. We implement our method on a simple patch-based transformer that can be initialized from pretrained vision transformers, and test it for a wide range of symmetry groups including permutation and Euclidean groups and their combinations. Empirical tests show competitive results against tailored equivariant architectures, suggesting the potential for learning equivariant functions for diverse groups using a non-equivariant universal base architecture. We further show evidence of enhanced learning in symmetric modalities, like graphs, when pretrained from non-symmetric modalities, like vision. Our implementation will be open-sourced at https://github.com/jw9730/lps.","PeriodicalId":93888,"journal":{"name":"ArXiv","volume":"23 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ArXiv","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2306.02866","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

Abstract

We present a novel framework to overcome the limitations of equivariant architectures in learning functions with group symmetries. In contrary to equivariant architectures, we use an arbitrary base model (such as an MLP or a transformer) and symmetrize it to be equivariant to the given group by employing a small equivariant network that parameterizes the probabilistic distribution underlying the symmetrization. The distribution is end-to-end trained with the base model which can maximize performance while reducing sample complexity of symmetrization. We show that this approach ensures not only equivariance to given group but also universal approximation capability in expectation. We implement our method on a simple patch-based transformer that can be initialized from pretrained vision transformers, and test it for a wide range of symmetry groups including permutation and Euclidean groups and their combinations. Empirical tests show competitive results against tailored equivariant architectures, suggesting the potential for learning equivariant functions for diverse groups using a non-equivariant universal base architecture. We further show evidence of enhanced learning in symmetric modalities, like graphs, when pretrained from non-symmetric modalities, like vision. Our implementation will be open-sourced at https://github.com/jw9730/lps.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

建筑不可知等方差的概率对称学习

我们提出了一种新的框架来克服等变结构在群对称函数学习中的局限性。与等变体系结构相反，我们使用任意基本模型(如MLP或变压器)，并通过使用一个小的等变网络将其对称化为给定组的等变，该网络参数化了对称基础的概率分布。使用基模型对分布进行端到端训练，可以在降低对称样本复杂度的同时实现性能最大化。我们证明了这种方法不仅保证了对给定群的等方差，而且保证了期望的普遍逼近能力。我们在一个简单的基于补丁的变压器上实现了我们的方法，该变压器可以从预训练的视觉变压器中初始化，并对它进行了广泛的对称群测试，包括排列和欧几里得群及其组合。经验测试显示了与定制等变架构的竞争结果，表明使用非等变通用基础架构可以学习不同群体的等变函数。我们进一步展示了从非对称模式(如视觉)进行预训练后，对称模式(如图)的学习增强的证据。我们的实现将在https://github.com/jw9730/lps上开源。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

ArXiv

自引率

0.00%

发文量