{"title":"双系统自由操作回避:理论延伸","authors":"Omar D Perez, Anthony Dickinson","doi":"10.1037/xan0000377","DOIUrl":null,"url":null,"abstract":"<p><p>Our theory of positively reinforced free-operant behavior (Perez & Dickinson, 2020) assumes that responding is controlled by two systems. One system is sensitive to the correlation between response and reinforcement rates and controls goal-directed behavior, whereas a habitual system learns by reward prediction error. We present an extension of this theory to the aversive domain that explains why free-operant avoidance responding increases with both the experienced rate of negative reinforcement and the difference between this rate and that programmed by the avoidance schedule. The theory also assumes that the habitual component is reinforced by the acquisition of aversive inhibitory properties by the feedback stimuli generated by responding, which then act as safety signals that reinforce habit performance. Our analysis suggests that the distinction between habitual and goal-directed control of rewarded behavior can also be applied to the aversive domain. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>","PeriodicalId":54259,"journal":{"name":"Journal of Experimental Psychology-Animal Learning and Cognition","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Dual-system free-operant avoidance: Extension of a theory.\",\"authors\":\"Omar D Perez, Anthony Dickinson\",\"doi\":\"10.1037/xan0000377\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Our theory of positively reinforced free-operant behavior (Perez & Dickinson, 2020) assumes that responding is controlled by two systems. One system is sensitive to the correlation between response and reinforcement rates and controls goal-directed behavior, whereas a habitual system learns by reward prediction error. We present an extension of this theory to the aversive domain that explains why free-operant avoidance responding increases with both the experienced rate of negative reinforcement and the difference between this rate and that programmed by the avoidance schedule. The theory also assumes that the habitual component is reinforced by the acquisition of aversive inhibitory properties by the feedback stimuli generated by responding, which then act as safety signals that reinforce habit performance. Our analysis suggests that the distinction between habitual and goal-directed control of rewarded behavior can also be applied to the aversive domain. (PsycInfo Database Record (c) 2024 APA, all rights reserved).</p>\",\"PeriodicalId\":54259,\"journal\":{\"name\":\"Journal of Experimental Psychology-Animal Learning and Cognition\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2024-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Experimental Psychology-Animal Learning and Cognition\",\"FirstCategoryId\":\"102\",\"ListUrlMain\":\"https://doi.org/10.1037/xan0000377\",\"RegionNum\":4,\"RegionCategory\":\"心理学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/4/25 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"BEHAVIORAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Experimental Psychology-Animal Learning and Cognition","FirstCategoryId":"102","ListUrlMain":"https://doi.org/10.1037/xan0000377","RegionNum":4,"RegionCategory":"心理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/4/25 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"BEHAVIORAL SCIENCES","Score":null,"Total":0}
引用次数: 0
摘要
我们的正强化自由操作行为理论(Perez 和 Dickinson,2020 年)假定,反应是由两个系统控制的。一个系统对反应和强化率之间的相关性很敏感,它控制着目标导向行为,而另一个习惯性系统则通过奖励预测错误来学习。我们将这一理论扩展到了厌恶领域,从而解释了为什么自由操作的回避反应会随着所经历的负强化率以及负强化率与回避时间表所设定的负强化率之间的差异而增加。该理论还假定,通过反应产生的反馈刺激获得的厌恶抑制特性会强化习惯性成分,而这些反馈刺激又是强化习惯表现的安全信号。我们的分析表明,奖赏行为的习惯性控制和目标导向性控制之间的区别也适用于厌恶领域。(PsycInfo Database Record (c) 2024 APA, 版权所有)。
Dual-system free-operant avoidance: Extension of a theory.
Our theory of positively reinforced free-operant behavior (Perez & Dickinson, 2020) assumes that responding is controlled by two systems. One system is sensitive to the correlation between response and reinforcement rates and controls goal-directed behavior, whereas a habitual system learns by reward prediction error. We present an extension of this theory to the aversive domain that explains why free-operant avoidance responding increases with both the experienced rate of negative reinforcement and the difference between this rate and that programmed by the avoidance schedule. The theory also assumes that the habitual component is reinforced by the acquisition of aversive inhibitory properties by the feedback stimuli generated by responding, which then act as safety signals that reinforce habit performance. Our analysis suggests that the distinction between habitual and goal-directed control of rewarded behavior can also be applied to the aversive domain. (PsycInfo Database Record (c) 2024 APA, all rights reserved).
期刊介绍:
The Journal of Experimental Psychology: Animal Learning and Cognition publishes experimental and theoretical studies concerning all aspects of animal behavior processes.