奖励预测误差神经元实现了奖励的高效代码

IF 21.2 1区医学 Q1 NEUROSCIENCES Nature neuroscience Pub Date : 2024-06-19 DOI:10.1038/s41593-024-01671-x

Heiko H. Schütt, Dongjae Kim, Wei Ji Ma

{"title":"奖励预测误差神经元实现了奖励的高效代码","authors":"Heiko H. Schütt, Dongjae Kim, Wei Ji Ma","doi":"10.1038/s41593-024-01671-x","DOIUrl":null,"url":null,"abstract":"We use efficient coding principles borrowed from sensory neuroscience to derive the optimal neural population to encode a reward distribution. We show that the responses of dopaminergic reward prediction error neurons in mouse and macaque are similar to those of the efficient code in the following ways: the neurons have a broad distribution of midpoints covering the reward distribution; neurons with higher thresholds have higher gains, more convex tuning functions and lower slopes; and their slope is higher when the reward distribution is narrower. Furthermore, we derive learning rules that converge to the efficient code. The learning rule for the position of the neuron on the reward axis closely resembles distributional reinforcement learning. Thus, reward prediction error neuron responses may be optimized to broadcast an efficient reward signal, forming a connection between efficient coding and reinforcement learning, two of the most successful theories in computational neuroscience. This theoretical study shows that dopaminergic reward prediction error neurons encode experienced rewards efficiently, which explains four major aspects of the neural population. This efficient code can be learned with local updates for each neuron.","PeriodicalId":19076,"journal":{"name":"Nature neuroscience","volume":null,"pages":null},"PeriodicalIF":21.2000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Reward prediction error neurons implement an efficient code for reward\",\"authors\":\"Heiko H. Schütt, Dongjae Kim, Wei Ji Ma\",\"doi\":\"10.1038/s41593-024-01671-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We use efficient coding principles borrowed from sensory neuroscience to derive the optimal neural population to encode a reward distribution. We show that the responses of dopaminergic reward prediction error neurons in mouse and macaque are similar to those of the efficient code in the following ways: the neurons have a broad distribution of midpoints covering the reward distribution; neurons with higher thresholds have higher gains, more convex tuning functions and lower slopes; and their slope is higher when the reward distribution is narrower. Furthermore, we derive learning rules that converge to the efficient code. The learning rule for the position of the neuron on the reward axis closely resembles distributional reinforcement learning. Thus, reward prediction error neuron responses may be optimized to broadcast an efficient reward signal, forming a connection between efficient coding and reinforcement learning, two of the most successful theories in computational neuroscience. This theoretical study shows that dopaminergic reward prediction error neurons encode experienced rewards efficiently, which explains four major aspects of the neural population. This efficient code can be learned with local updates for each neuron.\",\"PeriodicalId\":19076,\"journal\":{\"name\":\"Nature neuroscience\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":21.2000,\"publicationDate\":\"2024-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Nature neuroscience\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.nature.com/articles/s41593-024-01671-x\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"NEUROSCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Nature neuroscience","FirstCategoryId":"3","ListUrlMain":"https://www.nature.com/articles/s41593-024-01671-x","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"NEUROSCIENCES","Score":null,"Total":0}

引用次数: 0

摘要

我们利用从感觉神经科学中借鉴的高效编码原理，得出了对奖赏分布进行编码的最佳神经群。我们的研究表明，小鼠和猕猴的多巴胺能奖赏预测误差神经元的反应在以下方面与高效编码相似：神经元的中点分布广泛，覆盖奖赏分布；阈值越高的神经元收益越高，调谐函数越凸，斜率越低；当奖赏分布较窄时，神经元的斜率越高。此外，我们还推导出了收敛到高效代码的学习规则。神经元在奖励轴上位置的学习规则与分布强化学习非常相似。因此，奖励预测误差神经元的反应可以通过优化来广播高效奖励信号，从而在高效编码和强化学习这两个计算神经科学领域最成功的理论之间建立联系。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

摘要图片

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

Reward prediction error neurons implement an efficient code for reward

We use efficient coding principles borrowed from sensory neuroscience to derive the optimal neural population to encode a reward distribution. We show that the responses of dopaminergic reward prediction error neurons in mouse and macaque are similar to those of the efficient code in the following ways: the neurons have a broad distribution of midpoints covering the reward distribution; neurons with higher thresholds have higher gains, more convex tuning functions and lower slopes; and their slope is higher when the reward distribution is narrower. Furthermore, we derive learning rules that converge to the efficient code. The learning rule for the position of the neuron on the reward axis closely resembles distributional reinforcement learning. Thus, reward prediction error neuron responses may be optimized to broadcast an efficient reward signal, forming a connection between efficient coding and reinforcement learning, two of the most successful theories in computational neuroscience. This theoretical study shows that dopaminergic reward prediction error neurons encode experienced rewards efficiently, which explains four major aspects of the neural population. This efficient code can be learned with local updates for each neuron.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Nature neuroscience 医学-神经科学

CiteScore

38.60

自引率

1.20%

发文量

212

审稿时长

1 months

期刊介绍： Nature Neuroscience, a multidisciplinary journal, publishes papers of the utmost quality and significance across all realms of neuroscience. The editors welcome contributions spanning molecular, cellular, systems, and cognitive neuroscience, along with psychophysics, computational modeling, and nervous system disorders. While no area is off-limits, studies offering fundamental insights into nervous system function receive priority. The journal offers high visibility to both readers and authors, fostering interdisciplinary communication and accessibility to a broad audience. It maintains high standards of copy editing and production, rigorous peer review, rapid publication, and operates independently from academic societies and other vested interests. In addition to primary research, Nature Neuroscience features news and views, reviews, editorials, commentaries, perspectives, book reviews, and correspondence, aiming to serve as the voice of the global neuroscience community.

期刊最新文献

Deep RNA sequencing of human dorsal root ganglion neurons reveals somatosensory mechanisms Mapping out multiple sclerosis with spatial transcriptomics Cell type mapping reveals tissue niches and interactions in subcortical multiple sclerosis lesions Spatially resolved gene signatures of white matter lesion progression in multiple sclerosis Smelling a concept