Policy compression for aircraft collision avoidance systems

2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC) Pub Date : 2016-09-01 DOI:10.1109/DASC.2016.7778091

Kyle D. Julian, Jessica Lopez, J. Brush, Michael P. Owen, Mykel J. Kochenderfer

{"title":"Policy compression for aircraft collision avoidance systems","authors":"Kyle D. Julian, Jessica Lopez, J. Brush, Michael P. Owen, Mykel J. Kochenderfer","doi":"10.1109/DASC.2016.7778091","DOIUrl":null,"url":null,"abstract":"One approach to designing the decision making logic for an aircraft collision avoidance system is to frame the problem as Markov decision process and optimize the system using dynamic programming. The resulting strategy can be represented as a numeric table. This methodology has been used in the development of the ACAS X family of collision avoidance systems for manned and unmanned aircraft. However, due to the high dimensionality of the state space, discretizing the state variables can lead to very large tables. To improve storage efficiency, we propose two approaches for compressing the lookup table. The first approach exploits redundancy in the table. The table is decomposed into a set of lower-dimensional tables, some of which can be represented by single tables in areas where the lower-dimensional tables are identical or nearly identical with respect to a similarity metric. The second approach uses a deep neural network to learn a complex non-linear function approximation of the table. With the use of an asymmetric loss function and a gradient descent algorithm, the parameters for this network can be trained to provide very accurate estimates of values while preserving the relative preferences of the possible advisories for each state. As a result, the table can be approximately represented by only the parameters of the network, which reduces the required storage space by a factor of 1000. Simulation studies show that system performance is very similar using either compressed table representation in place of the original table. Even though the neural network was trained directly on the original table, the network surpasses the original table on the performance metrics and encounter sets evaluated here.","PeriodicalId":340472,"journal":{"name":"2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC)","volume":"84 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"218","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DASC.2016.7778091","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 218

Abstract

One approach to designing the decision making logic for an aircraft collision avoidance system is to frame the problem as Markov decision process and optimize the system using dynamic programming. The resulting strategy can be represented as a numeric table. This methodology has been used in the development of the ACAS X family of collision avoidance systems for manned and unmanned aircraft. However, due to the high dimensionality of the state space, discretizing the state variables can lead to very large tables. To improve storage efficiency, we propose two approaches for compressing the lookup table. The first approach exploits redundancy in the table. The table is decomposed into a set of lower-dimensional tables, some of which can be represented by single tables in areas where the lower-dimensional tables are identical or nearly identical with respect to a similarity metric. The second approach uses a deep neural network to learn a complex non-linear function approximation of the table. With the use of an asymmetric loss function and a gradient descent algorithm, the parameters for this network can be trained to provide very accurate estimates of values while preserving the relative preferences of the possible advisories for each state. As a result, the table can be approximately represented by only the parameters of the network, which reduces the required storage space by a factor of 1000. Simulation studies show that system performance is very similar using either compressed table representation in place of the original table. Even though the neural network was trained directly on the original table, the network surpasses the original table on the performance metrics and encounter sets evaluated here.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

飞机避碰系统的策略压缩

设计飞机避碰系统决策逻辑的一种方法是将避碰问题构造为马尔可夫决策过程，并用动态规划方法对避碰系统进行优化。结果策略可以表示为一个数字表。该方法已用于开发ACAS X系列的有人驾驶和无人驾驶飞机避碰系统。然而，由于状态空间的高维性，离散状态变量可能导致非常大的表。为了提高存储效率，我们提出了两种压缩查找表的方法。第一种方法利用表中的冗余。表被分解成一组低维表，其中一些表可以用单个表表示，其中低维表在相似性度量方面相同或几乎相同。第二种方法使用深度神经网络来学习复杂的非线性函数逼近表。通过使用非对称损失函数和梯度下降算法，可以训练该网络的参数，以提供非常准确的值估计，同时保留每个状态可能建议的相对偏好。因此，该表可以仅由网络的参数近似表示，这将所需的存储空间减少了1000倍。仿真研究表明，使用压缩表表示代替原始表表示的系统性能非常相似。尽管神经网络是直接在原始表上进行训练的，但该网络在性能指标和这里评估的相遇集上超过了原始表。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

2016 IEEE/AIAA 35th Digital Avionics Systems Conference (DASC)

自引率

0.00%

发文量