Reinforcement learning and meta-decision-making

IF 3.5 2区心理学 Q1 BEHAVIORAL SCIENCES Current Opinion in Behavioral Sciences Pub Date : 2024-03-14 DOI:10.1016/j.cobeha.2024.101374

Pieter Verbeke , Tom Verguts

引用次数: 0

Abstract

A key aspect of cognitive flexibility is to efficiently make use of earlier experience to attain one’s goals. This requires learning, but also a modular, and more specifically hierarchical, structure. We hold that both are required, but combining them leads to several computational challenges that brains and artificial agents (learn to) deal with. In a hierarchical structure, meta-decisions must be made, of which two types can be distinguished. First, a (meta-)decision may involve choosing which (lower-level) modules to select (module choice). Second, it may consist of choosing appropriate parameter settings within a module (parameter tuning). Furthermore, prediction error monitoring may allow determining the right meta-decision (module choice or parameter tuning). We discuss computational challenges and empirical evidence relative to how these two meta-decisions may be implemented to support learning for cognitive flexibility.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

强化学习和元决策

认知灵活性的一个重要方面是有效利用先前的经验来实现自己的目标。这不仅需要学习，还需要模块化结构，更具体地说就是分层结构。我们认为这两者都需要，但将两者结合起来会给大脑和人工代理（学习）带来一些计算上的挑战。在分层结构中，必须做出元决策，其中可分为两类。首先，（元）决策可能涉及选择哪些（低级）模块（模块选择）。其次，它可能包括在模块内选择适当的参数设置（参数调整）。此外，预测误差监测还可以帮助确定正确的元决策（模块选择或参数调整）。我们将讨论如何实施这两项元决策以支持认知灵活性学习所面临的计算挑战和经验证据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Current Opinion in Behavioral Sciences Neuroscience-Cognitive Neuroscience

CiteScore

10.90

自引率

2.00%

发文量

135

期刊介绍： Current Opinion in Behavioral Sciences is a systematic, integrative review journal that provides a unique and educational platform for updates on the expanding volume of information published in the field of behavioral sciences.