In Search of Trees: Decision-Tree Policy Synthesis for Black-Box Systems via Search

Emir Demirović, Christian Schilling, Anna Lukina
{"title":"In Search of Trees: Decision-Tree Policy Synthesis for Black-Box Systems via Search","authors":"Emir Demirović, Christian Schilling, Anna Lukina","doi":"arxiv-2409.03260","DOIUrl":null,"url":null,"abstract":"Decision trees, owing to their interpretability, are attractive as control\npolicies for (dynamical) systems. Unfortunately, constructing, or synthesising,\nsuch policies is a challenging task. Previous approaches do so by imitating a\nneural-network policy, approximating a tabular policy obtained via formal\nsynthesis, employing reinforcement learning, or modelling the problem as a\nmixed-integer linear program. However, these works may require access to a\nhard-to-obtain accurate policy or a formal model of the environment (within\nreach of formal synthesis), and may not provide guarantees on the quality or\nsize of the final tree policy. In contrast, we present an approach to\nsynthesise optimal decision-tree policies given a black-box environment and\nspecification, and a discretisation of the tree predicates, where optimality is\ndefined with respect to the number of steps to achieve the goal. Our approach\nis a specialised search algorithm which systematically explores the\n(exponentially large) space of decision trees under the given discretisation.\nThe key component is a novel pruning mechanism that significantly reduces the\nsearch space. Our approach represents a conceptually novel way of synthesising\nsmall decision-tree policies with optimality guarantees even for black-box\nenvironments with black-box specifications.","PeriodicalId":501479,"journal":{"name":"arXiv - CS - Artificial Intelligence","volume":"54 2 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.03260","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Decision trees, owing to their interpretability, are attractive as control policies for (dynamical) systems. Unfortunately, constructing, or synthesising, such policies is a challenging task. Previous approaches do so by imitating a neural-network policy, approximating a tabular policy obtained via formal synthesis, employing reinforcement learning, or modelling the problem as a mixed-integer linear program. However, these works may require access to a hard-to-obtain accurate policy or a formal model of the environment (within reach of formal synthesis), and may not provide guarantees on the quality or size of the final tree policy. In contrast, we present an approach to synthesise optimal decision-tree policies given a black-box environment and specification, and a discretisation of the tree predicates, where optimality is defined with respect to the number of steps to achieve the goal. Our approach is a specialised search algorithm which systematically explores the (exponentially large) space of decision trees under the given discretisation. The key component is a novel pruning mechanism that significantly reduces the search space. Our approach represents a conceptually novel way of synthesising small decision-tree policies with optimality guarantees even for black-box environments with black-box specifications.
查看原文
分享 分享
微信好友 朋友圈 QQ好友 复制链接
本刊更多论文
寻找树:通过搜索为黑盒系统合成决策树策略
决策树由于其可解释性,作为(动态)系统的控制策略很有吸引力。遗憾的是,构建或合成此类策略是一项极具挑战性的任务。以往的方法包括模仿神经网络策略、近似通过形式合成获得的表格策略、采用强化学习或将问题建模为混合整数线性程序。不过,这些工作可能需要获取难以获得的精确策略或环境的正式模型(在正式合成的范围内),而且可能无法保证最终树状策略的质量或大小。与此相反,我们提出了一种合成最优决策树策略的方法,该方法给定了一个黑箱环境和规范,以及树谓词的离散化,其中最优性是根据实现目标的步骤数来定义的。我们的方法是一种专门的搜索算法,可以在给定的离散化条件下系统地探索决策树的(指数级大)空间。我们的方法代表了一种概念新颖的方法,可以合成即使是具有黑盒规范的黑盒环境也能保证最优性的小型决策树策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 去求助
来源期刊
自引率
0.00%
发文量
0
期刊最新文献
Abductive explanations of classifiers under constraints: Complexity and properties Explaining Non-monotonic Normative Reasoning using Argumentation Theory with Deontic Logic Towards Explainable Goal Recognition Using Weight of Evidence (WoE): A Human-Centered Approach A Metric Hybrid Planning Approach to Solving Pandemic Planning Problems with Simple SIR Models Neural Networks for Vehicle Routing Problem
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
现在去查看 取消
×
提示
确定
0
微信
客服QQ
Book学术公众号 扫码关注我们
反馈
×
意见反馈
请填写您的意见或建议
请填写您的手机或邮箱
已复制链接
已复制链接
快去分享给好友吧!
我知道了
×
扫码分享
扫码分享
Book学术官方微信
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术
文献互助 智能选刊 最新文献 互助须知 联系我们:info@booksci.cn
Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。
Copyright © 2023 Book学术 All rights reserved.
ghs 京公网安备 11010802042870号 京ICP备2023020795号-1