以对象为中心的深度主动推理模型的对称性和复杂性。

IF 3.6 3区生物学 Q1 BIOLOGY Interface Focus Pub Date : 2023-04-14 eCollection Date: 2023-06-06 DOI:10.1098/rsfs.2022.0077

Stefano Ferraro, Toon Van de Maele, Tim Verbelen, Bart Dhoedt

{"title":"以对象为中心的深度主动推理模型的对称性和复杂性。","authors":"Stefano Ferraro, Toon Van de Maele, Tim Verbelen, Bart Dhoedt","doi":"10.1098/rsfs.2022.0077","DOIUrl":null,"url":null,"abstract":"Humans perceive and interact with hundreds of objects every day. In doing so, they need to employ mental models of these objects and often exploit symmetries in the object's shape and appearance in order to learn generalizable and transferable skills. Active inference is a first principles approach to understanding and modelling sentient agents. It states that agents entertain a generative model of their environment, and learn and act by minimizing an upper bound on their surprisal, i.e. their free energy. The free energy decomposes into an accuracy and complexity term, meaning that agents favour the least complex model that can accurately explain their sensory observations. In this paper, we investigate how inherent symmetries of particular objects also emerge as symmetries in the latent state space of the generative model learnt under deep active inference. In particular, we focus on object-centric representations, which are trained from pixels to predict novel object views as the agent moves its viewpoint. First, we investigate the relation between model complexity and symmetry exploitation in the state space. Second, we do a principal component analysis to demonstrate how the model encodes the principal axis of symmetry of the object in the latent space. Finally, we also demonstrate how more symmetrical representations can be exploited for better generalization in the context of manipulation.","PeriodicalId":13795,"journal":{"name":"Interface Focus","volume":null,"pages":null},"PeriodicalIF":3.6000,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102726/pdf/","citationCount":"0","resultStr":"{\"title\":\"Symmetry and complexity in object-centric deep active inference models.\",\"authors\":\"Stefano Ferraro, Toon Van de Maele, Tim Verbelen, Bart Dhoedt\",\"doi\":\"10.1098/rsfs.2022.0077\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Humans perceive and interact with hundreds of objects every day. In doing so, they need to employ mental models of these objects and often exploit symmetries in the object's shape and appearance in order to learn generalizable and transferable skills. Active inference is a first principles approach to understanding and modelling sentient agents. It states that agents entertain a generative model of their environment, and learn and act by minimizing an upper bound on their surprisal, i.e. their free energy. The free energy decomposes into an accuracy and complexity term, meaning that agents favour the least complex model that can accurately explain their sensory observations. In this paper, we investigate how inherent symmetries of particular objects also emerge as symmetries in the latent state space of the generative model learnt under deep active inference. In particular, we focus on object-centric representations, which are trained from pixels to predict novel object views as the agent moves its viewpoint. First, we investigate the relation between model complexity and symmetry exploitation in the state space. Second, we do a principal component analysis to demonstrate how the model encodes the principal axis of symmetry of the object in the latent space. Finally, we also demonstrate how more symmetrical representations can be exploited for better generalization in the context of manipulation.\",\"PeriodicalId\":13795,\"journal\":{\"name\":\"Interface Focus\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":3.6000,\"publicationDate\":\"2023-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10102726/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Interface Focus\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1098/rsfs.2022.0077\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2023/6/6 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q1\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Interface Focus","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1098/rsfs.2022.0077","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/6/6 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

人类每天都要感知数以百计的物体并与之互动。在此过程中，他们需要使用这些物体的心智模型，并经常利用物体形状和外观的对称性来学习可通用和可迁移的技能。主动推理是理解有知觉的代理并为其建模的第一原理方法。该方法认为，代理可以利用其环境的生成模型，并通过最小化其意外上限（即自由能）来学习和行动。自由能分解为准确性和复杂性两个项，这意味着代理倾向于能准确解释其感官观察的最不复杂的模型。在本文中，我们将研究特定物体的固有对称性如何在深度主动推理下学习的生成模型的潜在状态空间中作为对称性出现。尤其是，我们将重点放在以物体为中心的表征上，这些表征是从像素中训练出来的，用于预测代理移动视角时的新物体视图。首先，我们研究了模型复杂性与状态空间对称性利用之间的关系。其次，我们进行了主成分分析，以展示模型是如何在潜在空间中对物体的对称主轴进行编码的。最后，我们还展示了如何利用更多的对称表征来更好地概括操作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Symmetry and complexity in object-centric deep active inference models.

Humans perceive and interact with hundreds of objects every day. In doing so, they need to employ mental models of these objects and often exploit symmetries in the object's shape and appearance in order to learn generalizable and transferable skills. Active inference is a first principles approach to understanding and modelling sentient agents. It states that agents entertain a generative model of their environment, and learn and act by minimizing an upper bound on their surprisal, i.e. their free energy. The free energy decomposes into an accuracy and complexity term, meaning that agents favour the least complex model that can accurately explain their sensory observations. In this paper, we investigate how inherent symmetries of particular objects also emerge as symmetries in the latent state space of the generative model learnt under deep active inference. In particular, we focus on object-centric representations, which are trained from pixels to predict novel object views as the agent moves its viewpoint. First, we investigate the relation between model complexity and symmetry exploitation in the state space. Second, we do a principal component analysis to demonstrate how the model encodes the principal axis of symmetry of the object in the latent space. Finally, we also demonstrate how more symmetrical representations can be exploited for better generalization in the context of manipulation.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Interface Focus BIOLOGY-

CiteScore

9.20

自引率

0.00%

发文量

审稿时长

6-12 weeks

期刊介绍： Each Interface Focus themed issue is devoted to a particular subject at the interface of the physical and life sciences. Formed of high-quality articles, they aim to facilitate cross-disciplinary research across this traditional divide by acting as a forum accessible to all. Topics may be newly emerging areas of research or dynamic aspects of more established fields. Organisers of each Interface Focus are strongly encouraged to contextualise the journal within their chosen subject.

期刊最新文献

Fundamental constraints to the logic of living systems. The legacy and evolvability of Pere Alberch's ideas. The logic of monsters: development and morphological diversity in stem-cell-based embryo models. Capacity building in porous materials research for sustainable energy applications. Chem4Energy: a consortium of the Royal Society Africa Capacity-Building Initiative.