Policy compression for aircraft collision avoidance systems

Kyle D. Julian, Jessica Lopez, J. Brush, Michael P. Owen, Mykel J. Kochenderfer
{"title":"Policy compression for aircraft collision avoidance systems","authors":"Kyle D. Julian, Jessica Lopez, J. Brush, Michael P. Owen, Mykel J. Kochenderfer","doi":"10.1109/DASC.2016.7778091","DOIUrl":null,"url":null,"abstract":"One approach to designing the decision making logic for an aircraft collision avoidance system is to frame the problem as Markov decision process and optimize the system using dynamic programming. The resulting strategy can be represented as a numeric table. This methodology has been used in the development of the ACAS X family of collision avoidance systems for manned and unmanned aircraft. However, due to the high dimensionality of the state space, discretizing the state variables can lead to very large tables. To improve storage efficiency, we propose two approaches for compressing the lookup table. The first approach exploits redundancy in the table. The table is decomposed into a set of lower-dimensional tables, some of which can be represented by single tables in areas where the lower-dimensional tables are identical or nearly identical with respect to a similarity metric. The second approach uses a deep neural network to learn a complex non-linear function approximation of the table. With the use of an asymmetric loss function and a gradient descent algorithm, the parameters for this network can be trained to provide very accurate estimates of values while preserving the relative preferences of the possible advisories for each state. As a result, the table can be approximately represented by only the parameters of the network, which reduces the required storage space by a factor of 1000. Simulation studies show that system performance is very similar using either compressed table representation in place of the original table. Even though the neural network was trained directly on the original table, the network surpasses the original table on the performance metrics and encounter sets evaluated here.","PeriodicalId":340472,"journal":{"name":"2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"218","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DASC.2016.7778091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 218

Abstract

One approach to designing the decision making logic for an aircraft collision avoidance system is to frame the problem as Markov decision process and optimize the system using dynamic programming. The resulting strategy can be represented as a numeric table. This methodology has been used in the development of the ACAS X family of collision avoidance systems for manned and unmanned aircraft. However, due to the high dimensionality of the state space, discretizing the state variables can lead to very large tables. To improve storage efficiency, we propose two approaches for compressing the lookup table. The first approach exploits redundancy in the table. The table is decomposed into a set of lower-dimensional tables, some of which can be represented by single tables in areas where the lower-dimensional tables are identical or nearly identical with respect to a similarity metric. The second approach uses a deep neural network to learn a complex non-linear function approximation of the table. With the use of an asymmetric loss function and a gradient descent algorithm, the parameters for this network can be trained to provide very accurate estimates of values while preserving the relative preferences of the possible advisories for each state. As a result, the table can be approximately represented by only the parameters of the network, which reduces the required storage space by a factor of 1000. Simulation studies show that system performance is very similar using either compressed table representation in place of the original table. Even though the neural network was trained directly on the original table, the network surpasses the original table on the performance metrics and encounter sets evaluated here.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
飞机避碰系统的策略压缩
设计飞机避碰系统决策逻辑的一种方法是将避碰问题构造为马尔可夫决策过程,并用动态规划方法对避碰系统进行优化。结果策略可以表示为一个数字表。该方法已用于开发ACAS X系列的有人驾驶和无人驾驶飞机避碰系统。然而,由于状态空间的高维性,离散状态变量可能导致非常大的表。为了提高存储效率,我们提出了两种压缩查找表的方法。第一种方法利用表中的冗余。表被分解成一组低维表,其中一些表可以用单个表表示,其中低维表在相似性度量方面相同或几乎相同。第二种方法使用深度神经网络来学习复杂的非线性函数逼近表。通过使用非对称损失函数和梯度下降算法,可以训练该网络的参数,以提供非常准确的值估计,同时保留每个状态可能建议的相对偏好。因此,该表可以仅由网络的参数近似表示,这将所需的存储空间减少了1000倍。仿真研究表明,使用压缩表表示代替原始表表示的系统性能非常相似。尽管神经网络是直接在原始表上进行训练的,但该网络在性能指标和这里评估的相遇集上超过了原始表。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Trajectory optimisation for avionics-based GNSS integrity augmentation system Modeling standard for distributed control systems: IEC 61499 from industrial automation to aerospace Ontological knowledge representation for avionics decision-making support Conflict resolution for wind-optimal aircraft trajectories in North Atlantic oceanic airspace with wind uncertainties Flexible open architecture for UASs integration into the airspace: Paparazzi autopilot system
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1