Avoiding Errors in Learned Heuristics in Bounded-Suboptimal Search

Symposium on Combinatorial Search Pub Date : 2022-07-17 DOI:10.1609/socs.v15i1.21804

M. Greco, Jorge A. Baier

引用次数: 0

Abstract

Despite being very effective, learned heuristics in bounded-suboptimal search can produce heuristic plateaus or move the search to zones of the state space that do not lead to a solution. In addition, it produces inadmissible cost-to-go estimates; therefore, it cannot be exploited with classical algorithms like WA* to produce w-optimal solutions. In this paper, we present two ways in which Focal Search can be modified to exploit a learned heuristic in a bounded suboptimal search: Focal Discrepancy Search, which, to evaluate each state, uses a discrepancy score based on the best-predicted heuristic value; and K-Focal Search, which expands more than one node from the FOCAL list in each expansion cycle. Both algorithms return w-optimal solutions and explore different zones of the state space than the ones that focal search, using the learned heuristic to sort the FOCAL list, would explore.

查看原文

微信好友朋友圈 QQ好友复制链接

本刊更多论文

有界次优搜索中学习启发式算法的避免错误

尽管有界次优搜索中的学习启发式算法非常有效，但它可能会产生启发式平台，或者将搜索移动到状态空间中无法找到解决方案的区域。此外，它还会产生令人难以接受的预估成本;因此，它不能被像WA*这样的经典算法利用来产生w-最优解。在本文中，我们提出了两种方法，可以修改焦点搜索来利用有界次优搜索中的学习启发式:焦点差异搜索，它使用基于最佳预测启发式值的差异评分来评估每个状态;以及K-Focal Search，它在每个扩展周期中从FOCAL列表中扩展多个节点。这两种算法都返回w-最优解，并且探索不同于焦点搜索的状态空间区域，使用学习的启发式对焦点列表进行排序，将探索。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文去求助

来源期刊

Symposium on Combinatorial Search

自引率

0.00%

发文量

期刊最新文献

A-A*pex: Efficient Anytime Approximate Multi-Objective Search Tunable Suboptimal Heuristic Search Hitting Set Heuristics for Overlapping Landmarks in Satisficing Planning Fools Rush in Where Angels Fear to Tread in Multi-Goal CBS Evaluating Distributional Predictions of Search Time: Put Up or Shut Up Games (Extended Abstract)