ReVEL: Multi-Turn Reflective LLM-Guided Heuristic Evolution via Structured Performance Feedback
ReVEL:基于结构化性能反馈的多轮反思式LLM引导的启发式进化
Cuong Van Duc, Minh Nguyen Dinh Tuan, Tam Vu Duc, Tung Vu Duy, Son Nguyen Van, Hanh Nguyen Thi, Binh Huynh Thi Thanh
AI总结 针对NP-hard组合优化问题的启发式设计,提出ReVEL框架,通过行为感知分组和多轮迭代细化,利用LLM和累积性能反馈联合优化启发式,实验表明优于现有LLM引导的进化基线。
详情
为NP-hard组合优化问题设计有效的启发式仍然具有挑战性,通常需要大量的领域专业知识。最近的LLM引导的进化方法在自动启发式生成方面显示出前景,但大多数现有方法独立地或通过有限的成对反馈来细化启发式。我们提出ReVEL:基于结构化性能反馈的多轮反思式LLM引导的启发式进化,一个用于群体式多轮启发式细化的框架。ReVEL将启发式组织成行为感知的反思组,包括用于局部细化的相似性驱动组和用于探索性搜索的多样性驱动组。在每个组内,LLM使用累积的性能反馈执行迭代多轮细化,使得相关启发式能够在进化迭代中被联合分析和逐步改进。在标准组合优化基准上的实验表明,ReVEL在多种设置和LLM骨干下通常优于现有的LLM引导的进化基线。额外分析表明,行为感知分组有助于在迭代启发式进化过程中实现更一致的细化轨迹。
Designing effective heuristics for NP-hard combinatorial optimization problems remains challenging and often requires substantial domain expertise. Recent LLM-guided evolutionary methods have shown promise for automated heuristic generation, but most existing approaches refine heuristics independently or through limited pairwise feedback. We propose ReVEL: Multi-Turn Reflective LLM-Guided Heuristic Evolution via Structured Performance Feedback, a framework for group-wise multi-turn heuristic refinement. ReVEL organizes heuristics into behavior-aware reflective groups, including similarity-driven groups for localized refinement and diversity-driven groups for exploratory search. Within each group, the LLM performs iterative multi-turn refinement using accumulated performance feedback, enabling related heuristics to be jointly analyzed and progressively improved across evolutionary iterations. Experiments on standard combinatorial optimization benchmarks show that ReVEL generally improves optimization performance over existing LLM-guided evolutionary baselines across multiple settings and LLM backbones. Additional analyses suggest that behavior-aware grouping contributes to more consistent refinement trajectories during iterative heuristic evolution.