Huaiyu Liu , Julian Quandt , Lei Zhang , Xiongbing Kang , Jens Blechert , Tjits van Lent , Rob W. Holland , Harm Veling
{"title":"Shaping food choices with actions and inactions with and without reward and punishment","authors":"Huaiyu Liu , Julian Quandt , Lei Zhang , Xiongbing Kang , Jens Blechert , Tjits van Lent , Rob W. Holland , Harm Veling","doi":"10.1016/j.appet.2025.107950","DOIUrl":null,"url":null,"abstract":"<div><div>Enabling people to reduce their consumption of unhealthy appetitive products can improve their health. Over the last decades, progress has been made by uncovering new ways to change behavior toward appetitive products without feedback incentives (e.g., reward or punishment, as in feedback-driven reinforcement learning), but instead by cueing motor responses (e.g., go vs. no go) toward these products in cognitive training tasks. However, it is unclear how this nonreinforced learning compares to reinforcement learning. Moreover, recent work on reinforcement learning has uncovered a basic learning mechanism, the action–valence asymmetry, which points to the possibility that reward and punishment learning may not always outperform learning without any external reinforcement. Here, we report two well-powered preregistered experiments (experiment 1a: N = 72; experiment 1b: N = 81) that examined when reinforcement learning outperforms nonreinforced learning in modifying people's preferences for food. Our findings show that reinforcement learning notably surpasses nonreinforced learning, but only when active responses (go) are rewarded, and inactions (no-go) are reinforced by avoiding punishments. These results shed light on interventions that combine rewards and punishments to facilitate changes in food preferences.</div></div>","PeriodicalId":242,"journal":{"name":"Appetite","volume":"208 ","pages":"Article 107950"},"PeriodicalIF":3.8000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Appetite","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0195666325001035","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/28 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Enabling people to reduce their consumption of unhealthy appetitive products can improve their health. Over the last decades, progress has been made by uncovering new ways to change behavior toward appetitive products without feedback incentives (e.g., reward or punishment, as in feedback-driven reinforcement learning), but instead by cueing motor responses (e.g., go vs. no go) toward these products in cognitive training tasks. However, it is unclear how this nonreinforced learning compares to reinforcement learning. Moreover, recent work on reinforcement learning has uncovered a basic learning mechanism, the action–valence asymmetry, which points to the possibility that reward and punishment learning may not always outperform learning without any external reinforcement. Here, we report two well-powered preregistered experiments (experiment 1a: N = 72; experiment 1b: N = 81) that examined when reinforcement learning outperforms nonreinforced learning in modifying people's preferences for food. Our findings show that reinforcement learning notably surpasses nonreinforced learning, but only when active responses (go) are rewarded, and inactions (no-go) are reinforced by avoiding punishments. These results shed light on interventions that combine rewards and punishments to facilitate changes in food preferences.
使人们减少对不健康的开胃产品的消费可以改善他们的健康。在过去的几十年里,通过发现新的方法来改变人们对食欲产品的行为,而不是通过反馈激励(例如,奖励或惩罚,如反馈驱动的强化学习),而是通过在认知训练任务中对这些产品的运动反应(例如,go vs. no go),已经取得了进展。然而,目前还不清楚这种非强化学习与强化学习相比如何。此外,最近关于强化学习的研究发现了一种基本的学习机制,即行动效价不对称,这表明奖惩学习可能并不总是优于没有任何外部强化的学习。在这里,我们报告了两个良好的预注册实验(实验1a: N = 72;实验1b: N = 81),研究了强化学习在改变人们对食物的偏好方面何时优于非强化学习。我们的研究结果表明,强化学习明显优于非强化学习,但只有在积极反应(go)得到奖励,而不作为(no-go)通过避免惩罚得到强化的情况下。这些结果阐明了将奖励和惩罚结合起来的干预措施,以促进食物偏好的改变。
期刊介绍:
Appetite is an international research journal specializing in cultural, social, psychological, sensory and physiological influences on the selection and intake of foods and drinks. It covers normal and disordered eating and drinking and welcomes studies of both human and non-human animal behaviour toward food. Appetite publishes research reports, reviews and commentaries. Thematic special issues appear regularly. From time to time the journal carries abstracts from professional meetings. Submissions to Appetite are expected to be based primarily on observations directly related to the selection and intake of foods and drinks; papers that are primarily focused on topics such as nutrition or obesity will not be considered unless they specifically make a novel scientific contribution to the understanding of appetite in line with the journal's aims and scope.