Verilog-Evolve: Feedback-Driven and Skill-Evolving Verilog Generation
Verilog-Evolve:反馈驱动与技能演进的Verilog生成
Zehua Pei, Hui-Ling Zhen, Yu Zhang, Sinno Jialin Pan, Mingxuan Yuan, Bei Yu
AI总结 提出Verilog-Evolve框架,通过反馈驱动(功能仿真、Yosys综合、ABC时序代理等)迭代优化Verilog代码,并利用跨会话技能演进提升生成质量,实验表明在VerilogEval和混合精度GEMM任务上提高了功能成功率和下游友好性。
详情
大型语言模型(LLMs)改进了从自然语言规范生成Verilog的过程,但大多数流水线仍将生成视为孤立的采样后功能检查。这对于实际的RTL设计是不够的,因为有用的Verilog必须正确、可综合、考虑时序,并对下游硬件目标友好。我们提出Verilog-Evolve,一个用于版本化Verilog细化和跨会话技能演进的反馈驱动框架。对于每个任务,Verilog-Evolve生成多样化的次要候选,通过功能仿真、Yosys综合、ABC时序代理以及可选的GEMM指标的可执行反馈进行评估,然后在可配置评分下将最佳候选提升为主要版本。为了跨任务改进,系统维护模块化技能指导,根据任务和反馈上下文检索技能,并通过创建/改进/跳过决策和验证器报告从记录的历史中演进候选技能。在VerilogEval和混合精度GEMM任务上的实验表明,Verilog-Evolve提高了最终功能成功和晋升稳定性,同时在开源综合、时序代理和网表级GEMM目标下生成更下游友好的RTL。验证门控的技能演进进一步提高了GEMM下游质量,并在评估的技能模式中实现了最佳下游分数和GEMM保留通过率。
Large language models (LLMs) have improved Verilog generation from natural-language specifications, but most pipelines still treat generation as isolated sampling followed by functional checking. This is insufficient for practical RTL design, where useful Verilog must be correct, synthesizable, timing-conscious, and friendly to downstream hardware objectives. We present Verilog-Evolve, a feedback-driven framework for versioned Verilog refinement and cross-session skill evolution. For each task, Verilog-Evolve generates diverse minor candidates, evaluates them with executable feedback from functional simulation, Yosys synthesis, ABC timing proxy, and optional GEMM metrics, then promotes the best candidate into a major version under configurable scoring. To improve across tasks, the system maintains modular skill guidance, retrieves skills according to task and feedback context, and evolves candidate skills from logged histories through create/improve/skip decisions and verifier reports. Experiments on VerilogEval and mixed-precision GEMM tasks show that Verilog-Evolve improves final functional success and promotion stability while producing more downstream-friendly RTL under open-source synthesis, timing-proxy, and netlist-level GEMM objectives. Validation-gated skill evolution further improves GEMM downstream quality and achieves the best downstream score and GEMM held-out pass rate among the evaluated skill modes.