ReMAV: Reward Modeling of Autonomous Vehicles for Finding Likely Failure Events

IF 4.6 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE IEEE Open Journal of Intelligent Transportation Systems Pub Date : 2024-10-11 DOI:10.1109/OJITS.2024.3479098
Aizaz Sharif;Dusica Marijan
{"title":"ReMAV: Reward Modeling of Autonomous Vehicles for Finding Likely Failure Events","authors":"Aizaz Sharif;Dusica Marijan","doi":"10.1109/OJITS.2024.3479098","DOIUrl":null,"url":null,"abstract":"Autonomous vehicles are advanced driving systems that revolutionize transportation, but their vulnerability to adversarial attacks poses significant safety risks. Consider a scenario in which a slight perturbation in sensor data causes an autonomous vehicle to fail unexpectedly, potentially leading to accidents. Current testing methods often rely on computationally expensive active learning techniques to identify such vulnerabilities. Rather than actively training complex adversaries by interacting with the environment, there is a need to first intelligently find and reduce the search space to only those states where autonomous vehicles are found to be less confident. In this paper, we propose a black-box testing framework ReMAV that uses offline trajectories first to efficiently identify weaknesses of autonomous vehicles without the need for active interaction. To this end, we introduce a three-step methodology which i) uses offline state action pairs of any autonomous vehicle under test, ii) builds an abstract behavior representation using our designed reward modeling technique to analyze states with uncertain driving decisions, and iii) uses a disturbance model for minimal perturbation attacks where the driving decisions are less confident. Our reward modeling creates a behavior representation that highlights regions of likely uncertain autonomous vehicle behavior, even when performance seems adequate. This enables efficient testing without computationally expensive active adversarial learning. We evaluated ReMAV in a high-fidelity urban driving simulator across various single- and multi-agent scenarios. The results show substantial increases in failure events compared to the standard behavior of autonomous vehicles: 35% in vehicle collisions, 23% in road object collisions, 48% in pedestrian collisions, and 50% in off-road steering events. ReMAV outperforms two baselines and previous testing frameworks in effectiveness, efficiency, and speed of identifying failures. This demonstrates ReMAV’s capability to efficiently expose autonomous vehicle weaknesses using simple perturbation models.","PeriodicalId":100631,"journal":{"name":"IEEE Open Journal of Intelligent Transportation Systems","volume":"5 ","pages":"669-691"},"PeriodicalIF":4.6000,"publicationDate":"2024-10-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=10714436","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Open Journal of Intelligent Transportation Systems","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10714436/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Autonomous vehicles are advanced driving systems that revolutionize transportation, but their vulnerability to adversarial attacks poses significant safety risks. Consider a scenario in which a slight perturbation in sensor data causes an autonomous vehicle to fail unexpectedly, potentially leading to accidents. Current testing methods often rely on computationally expensive active learning techniques to identify such vulnerabilities. Rather than actively training complex adversaries by interacting with the environment, there is a need to first intelligently find and reduce the search space to only those states where autonomous vehicles are found to be less confident. In this paper, we propose a black-box testing framework ReMAV that uses offline trajectories first to efficiently identify weaknesses of autonomous vehicles without the need for active interaction. To this end, we introduce a three-step methodology which i) uses offline state action pairs of any autonomous vehicle under test, ii) builds an abstract behavior representation using our designed reward modeling technique to analyze states with uncertain driving decisions, and iii) uses a disturbance model for minimal perturbation attacks where the driving decisions are less confident. Our reward modeling creates a behavior representation that highlights regions of likely uncertain autonomous vehicle behavior, even when performance seems adequate. This enables efficient testing without computationally expensive active adversarial learning. We evaluated ReMAV in a high-fidelity urban driving simulator across various single- and multi-agent scenarios. The results show substantial increases in failure events compared to the standard behavior of autonomous vehicles: 35% in vehicle collisions, 23% in road object collisions, 48% in pedestrian collisions, and 50% in off-road steering events. ReMAV outperforms two baselines and previous testing frameworks in effectiveness, efficiency, and speed of identifying failures. This demonstrates ReMAV’s capability to efficiently expose autonomous vehicle weaknesses using simple perturbation models.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
ReMAV:为寻找可能的故障事件而建立的自动驾驶汽车奖励模型
自动驾驶汽车是一种先进的驾驶系统,它给交通运输带来了革命性的变化,但其易受对抗性攻击的弱点也带来了巨大的安全风险。考虑这样一种情况:传感器数据中的轻微扰动会导致自动驾驶汽车意外失灵,从而可能引发事故。目前的测试方法通常依赖于计算成本高昂的主动学习技术来识别此类漏洞。与其通过与环境互动来主动训练复杂的对手,不如首先智能地找到并缩小搜索空间,只搜索那些发现自主车辆信心不足的状态。在本文中,我们提出了一种黑盒测试框架 ReMAV,它首先使用离线轨迹来有效识别自动驾驶车辆的弱点,而无需主动交互。为此,我们介绍了一种分三步的方法:i) 使用任何被测自动驾驶车辆的离线状态动作对;ii) 使用我们设计的奖励建模技术建立抽象的行为表示法,以分析具有不确定驾驶决策的状态;iii) 在驾驶决策不太确定的情况下,使用干扰模型进行最小扰动攻击。我们的奖励建模创建了一种行为表示法,可突出显示可能存在不确定自动驾驶汽车行为的区域,即使在性能看起来足够好的情况下也是如此。这样,无需昂贵的主动对抗学习,就能进行高效测试。我们在一个高保真城市驾驶模拟器中对 ReMAV 进行了评估,涉及各种单一和多代理场景。结果显示,与自动驾驶车辆的标准行为相比,故障事件大幅增加:在车辆碰撞中增加了 35%,在道路物体碰撞中增加了 23%,在行人碰撞中增加了 48%,在越野转向事件中增加了 50%。在识别故障的效果、效率和速度方面,ReMAV 优于两个基线和以前的测试框架。这证明 ReMAV 能够利用简单的扰动模型有效地暴露自动驾驶汽车的弱点。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
CiteScore
5.40
自引率
0.00%
发文量
0
期刊最新文献
An Extensible Python Open-Source Simulation Platform for Developing and Benchmarking Bus Holding Strategies Enhancing V2X Security Through Combined Rule-Based and DL-Based Local Misbehavior Detection in Roadside Units A Survey on Sensor Selection and Placement for Connected and Automated Mobility ReMAV: Reward Modeling of Autonomous Vehicles for Finding Likely Failure Events Computationally Efficient Minimum-Time Motion Primitives for Vehicle Trajectory Planning
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1