AI中文摘要
前沿编码模型可能花费大量能力学习不仅是程序行为,还有人类代码库中的意外熵。这些代码库包含有价值的信号:测试、事件、迁移、边缘情况、产品判断和操作历史。这些信号与框架变动、命名漂移、生成代码歧义、依赖仪式、CI方言、弱证明路径和面向人类的审查习惯纠缠在一起。我们提出智能体优先的正则代码,一种证明携带的基底,将常规产品软件重写为正则行为配置文件、类型化变更代数、证明通道、受限编辑语法、语义补丁单元、运行时负记忆和证明携带的变更对象。核心假设是,在声明预言下通过行为等价商化软件,可以将等价编码折叠为带有显式证据和证明义务的受控代表。最终目标是在公共预言下,每个经过验证的正确变更的摊销成本,包括源代码、上下文、推理、工具、验证、安全性、来源、审查、失败循环、缺陷和铸造成本。报告的降低幅度是假设,而非测量的前沿结果。提出的极限是无意外视界:可移除的意外减少,直到剩余的新颖性、证据、治理、风险和未来可选性占主导。对于支持的常规产品分布,这给出了一个可辩护的规划目标,即接近100倍的全成本降低,并非对所有软件的保证。在Qwen2.5-Coder-14B上的初步QLoRA实验表明,64,088条正则轨迹是可学习的,并抑制了测试的禁用语言标记,但未确立行为保持、规模经济或验证变更成本。贡献是一个以最小功能描述长度和验证变更成本为中心的可证伪程序。
英文摘要
Frontier coding models may spend substantial capacity learning not only program behavior, but also accidental entropy in human repositories. Such repositories contain valuable signals: tests, incidents, migrations, edge cases, product judgment, and operational history. These signals are entangled with framework churn, naming drift, generated-source ambiguity, dependency rituals, CI dialects, weak proof routes, and human-oriented review customs. We propose agent-first canonical code, a proof-carrying substrate that rewrites routine product software into canonical behavior profiles, typed change algebra, proof lanes, constrained edit grammars, semantic patch cells, runtime negative memory, and proof-carrying change objects.
The core hypothesis is that quotienting software by behavior equivalence under a declared oracle can collapse equivalent encodings into governed representatives with explicit evidence and proof obligations. The endpoint is amortized cost per verified correct change, including source, context, reasoning, tools, verification, security, provenance, review, failed loops, defects, and foundry cost under a common oracle. Reported reduction bands are hypotheses, not measured frontier results. The proposed limit is a No-Accident Horizon: removable accident decreases until residual novelty, evidence, governance, risk, and future optionality dominate. For supported routine-product distributions, this gives a defensible planning target near 100-fold all-in cost reduction, not a guarantee for all software. Preliminary QLoRA experiments on Qwen2.5-Coder-14B show that 64,088 canonical trajectories are learnable and suppress tested forbidden-language markers, but do not establish behavior preservation, scaling economics, or verified-change cost. The contribution is a falsifiable program centered on minimum functional description length and verified-change cost.