Bio-Inspired, Task-Free Continual Learning through Activity Regularization

IF 16.4 1区化学 Q1 CHEMISTRY, MULTIDISCIPLINARY Accounts of Chemical Research Pub Date : 2022-12-08 DOI:10.48550/arXiv.2212.04316

Francesco Lassig, Pau Vilimelis Aceituno, M. Sorbaro, B. Grewe

{"title":"Bio-Inspired, Task-Free Continual Learning through Activity Regularization","authors":"Francesco Lassig, Pau Vilimelis Aceituno, M. Sorbaro, B. Grewe","doi":"10.48550/arXiv.2212.04316","DOIUrl":null,"url":null,"abstract":"The ability to sequentially learn multiple tasks without forgetting is a key skill of biological brains, whereas it represents a major challenge to the field of deep learning. To avoid catastrophic forgetting, various continual learning (CL) approaches have been devised. However, these usually require discrete task boundaries. This requirement seems biologically implausible and often limits the application of CL methods in the real world where tasks are not always well defined. Here, we take inspiration from neuroscience, where sparse, non-overlapping neuronal representations have been suggested to prevent catastrophic forgetting. As in the brain, we argue that these sparse representations should be chosen on the basis of feed forward (stimulus-specific) as well as top-down (context-specific) information. To implement such selective sparsity, we use a bio-plausible form of hierarchical credit assignment known as Deep Feedback Control (DFC) and combine it with a winner-take-all sparsity mechanism. In addition to sparsity, we introduce lateral recurrent connections within each layer to further protect previously learned representations. We evaluate the new sparse-recurrent version of DFC on the split-MNIST computer vision benchmark and show that only the combination of sparsity and intra-layer recurrent connections improves CL performance with respect to standard backpropagation. Our method achieves similar performance to well-known CL methods, such as Elastic Weight Consolidation and Synaptic Intelligence, without requiring information about task boundaries. Overall, we showcase the idea of adopting computational principles from the brain to derive new, task-free learning algorithms for CL.","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2022-12-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.04316","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 1

Abstract

The ability to sequentially learn multiple tasks without forgetting is a key skill of biological brains, whereas it represents a major challenge to the field of deep learning. To avoid catastrophic forgetting, various continual learning (CL) approaches have been devised. However, these usually require discrete task boundaries. This requirement seems biologically implausible and often limits the application of CL methods in the real world where tasks are not always well defined. Here, we take inspiration from neuroscience, where sparse, non-overlapping neuronal representations have been suggested to prevent catastrophic forgetting. As in the brain, we argue that these sparse representations should be chosen on the basis of feed forward (stimulus-specific) as well as top-down (context-specific) information. To implement such selective sparsity, we use a bio-plausible form of hierarchical credit assignment known as Deep Feedback Control (DFC) and combine it with a winner-take-all sparsity mechanism. In addition to sparsity, we introduce lateral recurrent connections within each layer to further protect previously learned representations. We evaluate the new sparse-recurrent version of DFC on the split-MNIST computer vision benchmark and show that only the combination of sparsity and intra-layer recurrent connections improves CL performance with respect to standard backpropagation. Our method achieves similar performance to well-known CL methods, such as Elastic Weight Consolidation and Synaptic Intelligence, without requiring information about task boundaries. Overall, we showcase the idea of adopting computational principles from the brain to derive new, task-free learning algorithms for CL.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

通过活动正则化实现生物启发、无任务的持续学习

连续学习多个任务而不忘记的能力是生物大脑的一项关键技能，而它代表了深度学习领域的主要挑战。为了避免灾难性遗忘，各种持续学习(CL)方法被设计出来。然而，这些通常需要离散的任务边界。这种要求在生物学上似乎是不合理的，并且常常限制了CL方法在任务并不总是定义良好的现实世界中的应用。在这里，我们从神经科学中获得灵感，在神经科学中，稀疏的、不重叠的神经元表征被认为可以防止灾难性的遗忘。正如在大脑中一样，我们认为这些稀疏表征应该在前馈(刺激特异性)和自上而下(上下文特异性)信息的基础上选择。为了实现这种选择性稀疏性，我们使用了一种生物合理的分层信用分配形式，称为深度反馈控制(DFC)，并将其与赢家通吃的稀疏性机制相结合。除了稀疏性，我们还在每层中引入横向循环连接，以进一步保护先前学习过的表示。我们在分裂- mnist计算机视觉基准上评估了新的稀疏-循环版本的DFC，并表明只有稀疏性和层内循环连接的组合才能提高相对于标准反向传播的CL性能。我们的方法在不需要任务边界信息的情况下，实现了与众所周知的CL方法(如Elastic Weight Consolidation和Synaptic Intelligence)相似的性能。总的来说，我们展示了采用大脑的计算原理来为CL派生新的、无任务的学习算法的想法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Accounts of Chemical Research 化学-化学综合

CiteScore

31.40

自引率

1.10%

发文量

312

审稿时长

2 months

期刊介绍： Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance. Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.