LLM-Evolved Pattern Generators for Optimal Classical Planning
LLM演化模式生成器用于最优经典规划
Windy Phung, Dominik Drexler, Arnaud Lequen, Jendrik Seipp
AI总结 提出首个通过LLM驱动的进化程序合成学习可容许启发式函数的方法,用于最优经典规划,结合饱和成本分区保证A*搜索的最优性。
详情
学习到的启发式函数最近已成为满足规划中传统领域无关启发式函数的竞争性替代方案。然而,现有方法侧重于改进搜索引导而非保证可容许性,这使得它们不适用于最优经典规划。我们提出了第一种学习领域相关启发式函数的方法,这些启发式函数在设计上是可容许的,从而保留了A*搜索的最优性保证。我们不是学习从状态到启发式值的直接映射,而是学习构建可诱导可容许启发式函数的抽象。我们使用LLM驱动的进化程序合成框架,为每个领域获得一个程序,该程序为该领域中的任何任务生成模式集合,并通过饱和成本分区以可容许的方式组合所得模式。实验表明,学习到的程序编码了可解释的领域特定见解,在测试时以可忽略的开销运行,并在多个领域上产生了与最先进的领域无关基线相匹配的覆盖范围,同时每个状态的评估速度显著更快。
Learned heuristics have recently become a competitive alternative to traditional domain-independent heuristics for satisficing planning. Existing approaches, however, focus on improving search guidance rather than guaranteeing admissibility, which makes them unsuitable for optimal classical planning. We present the first method for learning domain-dependent heuristics that are admissible by design and thus preserve the optimality guarantees of A* search. Instead of learning a direct mapping from states to heuristic values, we learn to construct abstractions that induce admissible heuristics. We use an LLM-driven evolutionary program-synthesis framework to obtain, for each domain, a program that produces a pattern collection for any task in that domain, and we combine the resulting patterns admissibly via saturated cost partitioning. Empirically, the learned programs encode interpretable domain-specific insights, run with negligible overhead at test time and yield heuristics that match the coverage of state-of-the-art domain-independent baselines on several domains while evaluating each state substantially faster.