Learning the rational choice perspective: A reinforcement learning approach to simulating offender behaviours in criminological agent-based models

IF 7.1 1区地球科学 Q1 ENVIRONMENTAL STUDIES Computers Environment and Urban Systems Pub Date : 2024-06-27 DOI:10.1016/j.compenvurbsys.2024.102141

Sedar Olmez , Dan Birks , Alison Heppenstall , Jiaqi Ge

{"title":"Learning the rational choice perspective: A reinforcement learning approach to simulating offender behaviours in criminological agent-based models","authors":"Sedar Olmez , Dan Birks , Alison Heppenstall , Jiaqi Ge","doi":"10.1016/j.compenvurbsys.2024.102141","DOIUrl":null,"url":null,"abstract":"<div><p>Over the past 15 years, environmental criminologists have explored the application of agent-based models (ABMs) of crime events and various theoretical frameworks applied to understand them. Models have supported criminological theorising and, in some cases, been applied to make predictions about the impact of interventions devised to reduce crime. However, decision-making frameworks utilised in criminological ABMs have typically been implemented through traditional techniques such as condition-action rules. While these models have provided significant insights, they neglect a crucial component of theoretical accounts of offending, the notion that offenders are learning agents whose behavioural dynamics change over time and space. In response, this article presents an ABM of residential burglary in which offender agents utilise reinforcement learning (RL) to learn behaviours. This solution enables offender agents to learn from individual-level perceptions of the environment and, given these perceptions, develop behavioural responses that benefit themselves. The model includes conceptualisations of the Routine Activity Theory (RAT), Crime Pattern Theory (CPT) and a utility function, Target Attractiveness, which acts as a behavioural mould to nudge offender agents to learn behaviours in keeping with the Rational Choice Perspective (RCP). Trained behaviours are then tested by introducing crime prevention interventions into the model and examining the reactions of offender agents. In keeping with empirical studies of offending, experimental results demonstrate that offender agents utilising RL learn to offend at targets where rewards outweigh risks and effort, offend close to home, frequently victimise high-rewarding targets, and conversely learn to avoid offending in areas associated with high levels of risk and effort.</p></div>","PeriodicalId":48241,"journal":{"name":"Computers Environment and Urban Systems","volume":"112 ","pages":"Article 102141"},"PeriodicalIF":7.1000,"publicationDate":"2024-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S019897152400070X/pdfft?md5=5f857c2eef6527abf8640857265ab386&pid=1-s2.0-S019897152400070X-main.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers Environment and Urban Systems","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S019897152400070X","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL STUDIES","Score":null,"Total":0}

引用次数: 0

Abstract

Over the past 15 years, environmental criminologists have explored the application of agent-based models (ABMs) of crime events and various theoretical frameworks applied to understand them. Models have supported criminological theorising and, in some cases, been applied to make predictions about the impact of interventions devised to reduce crime. However, decision-making frameworks utilised in criminological ABMs have typically been implemented through traditional techniques such as condition-action rules. While these models have provided significant insights, they neglect a crucial component of theoretical accounts of offending, the notion that offenders are learning agents whose behavioural dynamics change over time and space. In response, this article presents an ABM of residential burglary in which offender agents utilise reinforcement learning (RL) to learn behaviours. This solution enables offender agents to learn from individual-level perceptions of the environment and, given these perceptions, develop behavioural responses that benefit themselves. The model includes conceptualisations of the Routine Activity Theory (RAT), Crime Pattern Theory (CPT) and a utility function, Target Attractiveness, which acts as a behavioural mould to nudge offender agents to learn behaviours in keeping with the Rational Choice Perspective (RCP). Trained behaviours are then tested by introducing crime prevention interventions into the model and examining the reactions of offender agents. In keeping with empirical studies of offending, experimental results demonstrate that offender agents utilising RL learn to offend at targets where rewards outweigh risks and effort, offend close to home, frequently victimise high-rewarding targets, and conversely learn to avoid offending in areas associated with high levels of risk and effort.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

学习理性选择观点：在犯罪学代理模型中模拟罪犯行为的强化学习方法

在过去 15 年中，环境犯罪学家探索了犯罪事件代理模型（ABMs）的应用，以及用于理解犯罪事件的各种理论框架。模型为犯罪学理论研究提供了支持，在某些情况下，还被用于预测为减少犯罪而设计的干预措施的影响。不过，犯罪学人工智能模型中使用的决策框架通常是通过条件-行动规则等传统技术实现的。虽然这些模型提供了重要的见解，但它们忽略了犯罪理论中的一个重要组成部分，即罪犯是学习主体，其行为动态会随着时间和空间的变化而变化。作为回应，本文介绍了一种住宅盗窃的人工智能模型，其中罪犯代理利用强化学习（RL）来学习行为。该解决方案使罪犯代理能够从个人层面对环境的感知中学习，并根据这些感知制定对自己有利的行为对策。该模型包括常规活动理论（RAT）、犯罪模式理论（CPT）和效用函数 "目标吸引力"（Target Attractiveness）的概念。然后，通过在模型中引入犯罪预防干预措施并检查罪犯代理人的反应，对训练行为进行测试。实验结果表明，利用 RL 的罪犯代理人学会了在回报大于风险和努力的目标处犯罪、在离家近的地方犯罪、经常使高回报目标受害，并学会了避免在与高风险和高努力相关的地区犯罪，这与犯罪方面的实证研究是一致的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Computers Environment and Urban Systems Multiple-

CiteScore

13.30

自引率

7.40%

发文量

111

审稿时长

32 days

期刊介绍： Computers, Environment and Urban Systemsis an interdisciplinary journal publishing cutting-edge and innovative computer-based research on environmental and urban systems, that privileges the geospatial perspective. The journal welcomes original high quality scholarship of a theoretical, applied or technological nature, and provides a stimulating presentation of perspectives, research developments, overviews of important new technologies and uses of major computational, information-based, and visualization innovations. Applied and theoretical contributions demonstrate the scope of computer-based analysis fostering a better understanding of environmental and urban systems, their spatial scope and their dynamics.