A Free-Energy Principle for Representation Learning

Mach. Learn. Sci. Technol. Pub Date: 2020-02-27 DOI: 10.1088/2632-2153/ABF984

Yansong Gao, P. Chaudhari


Citations: 7

Abstract

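This paper employs a formal connection between machine learning and thermodynamics to characterize the quality of learnt representations for transfer learning. We discuss how information-theoretic functionals such as the rate, distortion, and classification loss of a model lie on a convex, so-called equilibrium surface. We prescribe dynamical processes to traverse this surface under constraints, e.g., an iso-classification process that trades off rate and distortion to keep the classification loss unchanged. We demonstrate how this process can be used for transferring representations from a source dataset to a target dataset while keeping the classification loss constant. Experimental validation of the theoretical results is provided on standard image-classification datasets.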
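
One way to read the trade-off described in the abstract is as a Lagrangian. The following is only a sketch under assumptions that this page does not state: the symbols R, D, C (rate, distortion, classification loss), the weights \lambda and \gamma, the model parameters \theta, and the smoothness of F with a unique minimizer are all illustrative choices, not notation confirmed by the source.

```latex
% Assumed notation: R = rate, D = distortion, C = classification loss,
% \theta = model parameters, \lambda, \gamma \ge 0 = trade-off weights.
\begin{align}
  F(\lambda, \gamma) &= \min_{\theta} \; R(\theta) + \lambda\, D(\theta) + \gamma\, C(\theta), \\
  % Envelope theorem (assuming a unique, differentiable minimizer):
  \frac{\partial F}{\partial \lambda} &= D, \qquad
  \frac{\partial F}{\partial \gamma} = C, \\
  % Iso-classification constraint: C stays fixed along the path.
  dC &= \frac{\partial^2 F}{\partial \gamma\, \partial \lambda}\, d\lambda
      + \frac{\partial^2 F}{\partial \gamma^2}\, d\gamma = 0
  \quad\Longrightarrow\quad
  \frac{d\gamma}{d\lambda}
    = -\,\frac{\partial^2 F / \partial \gamma\, \partial \lambda}
             {\partial^2 F / \partial \gamma^2}
  \quad \text{(when } \partial^2 F / \partial \gamma^2 \neq 0 \text{)}.
\end{align}
```

In this reading, F is concave in (\lambda, \gamma) because it is a pointwise minimum of functions that are affine in the weights, and an iso-classification process is a path in the (\lambda, \gamma) plane along which the last relation holds, so that rate and distortion change while C does not.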




Source journal

Mach. Learn. Sci. Technol.

Self-citation rate

0.00%

Articles published