arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2157
2605.18872 2026-05-20 cs.LG cs.AI cs.RO

EUPHORIA: Efficient Universal Planning via Hybrid Optimization for Robust Industrial Robotic Assembly

EUPHORIA: 通过混合优化实现高效通用规划以实现稳健的工业机器人装配

Shih-Yu Lai, Chia-Ching Yen, Yang-Ting Shen, Peter Yichen Chen, Yu-Lun Liu, Bing-Yu Chen

AI总结 本文提出EUPHORIA框架,通过混合优化策略实现通用少样本适应和动态效率,解决建筑机器人装配中规划器高度专业化和操作低效的问题,结合元几何编码器、物理引导图变压器和残差稳定性校正等方法,实现高效且鲁棒的装配规划。

详情
AI中文摘要

建筑机器人装配面临持续瓶颈:现有规划器要么高度专业化,需要每次新几何设计都进行昂贵的再训练,要么操作低效,将结构序列和运动学运动视为独立过程。我们提出了EUPHORIA,一个统一框架,通过混合优化策略实现通用少样本适应和动态效率。为克服再训练瓶颈,我们提出了基于图超网络的元几何编码器:不同于标准对比学习仅在特征级识别,我们的超网络动态从最小支持集中生成策略参数,使参数级适应复杂拓扑(如穹顶、拱门)而无需基于梯度的再训练。对于结构推理,我们引入了通过软演员-评论家(SAC)训练的物理引导图变压器,其物理偏置注意力机制通过离散元模型(DEM)模拟的接触力调节注意力分数,引导规划器朝向结构关键连接。我们进一步通过运动学感知序列确保操作效率,其中SAC目标惩罚高能转换。最后,我们通过残差稳定性校正弥合仿真到现实的差距,这是一种可微优化层,通过最小化联合能量-稳定性成本优先级来微调粗略装配动作。实验表明,EUPHORIA显著减少了与解耦基线相比的能量消耗,并在未见的非标准几何上实现了最先进的成功率,通过融合元学习、物理引导注意力和残差优化,实现一个连贯的通用规划器。

英文摘要

Robotic assembly in architectural construction faces a persistent bottleneck: existing planners are either highly specialized, requiring prohibitive retraining for every new geometric design, or operationally inefficient, treating structural sequencing and kinematic motion as disjoint processes. We present EUPHORIA, a unified framework that achieves universal few-shot adaptability and dynamic efficiency through a hybrid optimization strategy. To overcome the retraining bottleneck, we propose a Meta-Geometric Encoder based on Graph Hypernetworks: unlike standard contrastive learning, which performs only feature-level recognition, our hypernetwork dynamically generates policy parameters from a minimal support set, enabling parameter-level adaptation to complex topologies (e.g., domes, arches) without gradient-based retraining. For structural reasoning, we introduce a Physics-Informed Graph Transformer trained via Soft Actor-Critic (SAC), with a Physics-Bias Attention mechanism that modulates attention scores using contact forces from Discrete Element Model (DEM) simulations, guiding the planner toward structurally critical connections. We further ensure operational efficiency through Kinematics-Aware Sequencing, where the SAC objective penalizes high-energy transitions. Finally, we bridge the Sim2Real gap via Residual Stability Correction, a differentiable optimization layer that fine-tunes coarse assembly actions by minimizing a joint energy-stability cost prior to execution. Experiments show that EUPHORIA significantly reduces energy consumption over decoupled baselines and achieves state-of-the-art success rates on unseen, non-standard geometries with minimal few-shot examples, fusing meta-learning, physics-informed attention, and residual optimization into a cohesive, generalized planner.

2605.18871 2026-05-20 cs.LG cs.AI

Distributional Energy-Based Models for Uncertainty-Aware Structured LLM Reasoning

基于不确定性感知的结构LLM推理的分布能量模型

Shireen Kudukkil Manchingal, Abhey Kalia, Fernanda Gonçalves, Shebin Rawther

AI总结 本文提出了一种分解的能量函数,结合了学习的质量评分器和确定性分析约束惩罚,用于验证结构LLM输出。该方法通过两步推理循环触发目标再生或 abstention,能够在多个基准测试中超越单次Qwen-72B,并减少约束违反。

详情
AI中文摘要

当大型语言模型生成结构化输出如旅行计划、代码解决方案或多步证明时,个别推理步骤可能正确,但整体输出可能违反预算、失败测试用例或与先前推论矛盾。我们提出了一种分解的能量函数,结合了学习的质量评分器和确定性分析约束惩罚,用于验证结构LLM输出。质量评分器是单个冻结编码器上的异构集合,包含低秩适配器(3%可训练参数);集合均值对候选者进行排名,标准差量化epistemic不确定性,驱动一个两步推理循环,触发目标再生或 abstention。在五个基准测试(GSM8K、MuSR、TravelPlanner、TACO、Knights & Knaves)中,我们的149M参数验证器协调一个7-26B开放生成器池,在每个基准测试中均优于单次Qwen-72B,与Claude Sonnet 4.6在MuSR上匹配(67.7% vs. 68.0%),并且在TravelPlanner上将约束违反减少53%(相对于Opus 4.6,oracle 0.028,随机 0.231)。两种方法是互补的:结构验证在约束可检查时获胜(验证器捕捉信号前沿模型无法自我检测),而预训练规模先验在不可检查时获胜(叙述推理、代码语义)。跨数据集的混淆分析确认在四个推理任务上确实存在质量区分,并识别出代码中的模型身份捷径,通过最后一层重新训练得以缓解。评分器在困难数据上训练后可实现零样本转移:一个MuSR训练的评分器在没有看到数学问题的情况下在GSM8K上达到93.9%。

英文摘要

When Large Language Models produce structured outputs such as travel plans, code solutions, or multi-step proofs, individual reasoning steps may appear correct while the output as a whole violates budgets, fails test cases, or contradicts earlier deductions. We propose a decomposed energy function that combines a learned quality scorer with deterministic analytical constraint penalties for verifying structured LLM outputs. The quality scorer is a heterogeneous ensemble of low-rank adapters on a single frozen encoder (3% trainable parameters); the ensemble mean ranks candidates while the standard deviation quantifies epistemic uncertainty, driving a two-pass inference loop that triggers targeted regeneration or abstention. Across five benchmarks (GSM8K, MuSR, TravelPlanner, TACO, Knights & Knaves), our 149M-parameter verifier orchestrating a pool of 7-26B open generators outperforms single-shot Qwen-72B on every benchmark, matches Claude Sonnet 4.6 on MuSR (67.7% vs. 68.0%), and reduces constraint violations by 53% relative to Opus 4.6 on TravelPlanner (oracle 0.028, random 0.231). The two routes are complementary: structural verification wins when constraints are checkable (the verifier captures signal frontier models cannot self-detect), while pretraining-scale priors win where they are not (narrative inference, code semantics). A cross-dataset confounding analysis confirms genuine quality discrimination on four reasoning tasks and identifies a model-identity shortcut on code, mitigated via last-layer retraining. Scorers trained on difficult data transfer zero-shot: a MuSR-trained scorer achieves 93.9% on GSM8K without seeing a math problem.

2605.18869 2026-05-20 cs.LG cs.AI cs.NE

MO-CAPO: Multi-Objective Cost-Aware Prompt Optimization

MO-CAPO:多目标成本感知提示优化

Jan Büssing, Moritz Schlager, Timo Heiß, Tom Zehle, Matthias Feurer

AI总结 本文提出MO-CAPO,一种多目标提示优化算法,同时优化性能和推理成本,并通过预算分配实现高效优化,通过评估四个任务和三个LLM,证明其在噪声R2指标上优于NSGA-II基线,并在较低预算下达到竞争性性能。

详情
AI中文摘要

大型语言模型(LLMs)在广泛的任务上表现出色,但对提示设计高度敏感,促使需要自动提示优化。现有方法主要关注性能,忽略竞争目标如推理成本或延迟。同时,现有多目标提示优化工作依赖于现成的NSGA-II,忽略优化效率。为此,我们引入MO-CAPO,一种新的多目标提示优化算法,同时优化性能和推理成本,利用预算分配实现成本高效的优化。我们进一步提出一个面向部署的成本目标,捕捉LLM推理的完整计算概况。我们评估了我们的方法在四个任务和三个LLM上的表现,并将其与基于NSGA-II的多目标方法和最先进的单目标提示优化器进行比较。结果表明,MO-CAPO一致地识别出强、稳健和多样的Pareto前沿近似,同时保持成本效率。它在12种情况中的8种情况下在噪声R2指标上优于NSGA-II基线,并且在显著较低的预算下常能达到竞争性性能。发现的解决方案集涵盖了被单目标优化器遗漏的多样化性能-成本权衡,但顶级性能候选者仍与单目标解决方案竞争。此外,我们进行了首次多目标机器学习实验的评估,考虑了泛化和鲁棒性通过噪声R2和近似间隙,使解决方案质量的评估更加现实。MO-CAPO使从业者能够从高效发现的多个提示中选择,这些提示提供不同的性能和成本权衡。

英文摘要

Large language models (LLMs) achieve strong performance across a wide range of tasks but are highly sensitive to prompt design, motivating the need for automatic prompt optimization. Existing methods predominantly focus on performance alone, ignoring competing objectives such as inference cost or latency. At the same time, existing work on multi-objective prompt optimization relies on off-the-shelf NSGA-II, ignoring optimization efficiency. As a remedy, we introduce MO-CAPO, a novel multi-objective prompt optimization algorithm that jointly optimizes performance and inference cost while leveraging budget allocation for cost-efficient optimization. We further propose a deployment-oriented cost objective that captures the full computational profile of LLM inference. We evaluate our approach across four tasks and three LLMs and compare it to an NSGA-II-based multi-objective method and state-of-the-art single-objective prompt optimizers. Results show that MO-CAPO consistently identifies strong, robust, and diverse Pareto front approximations while maintaining cost-efficiency. It outperforms the NSGA-II baseline on 8 out of 12 cases in terms of the noisy R2 metric and achieves competitive performances often already at a considerably lower budget. The discovered solution sets span diverse performance-cost trade-offs that are omitted by single-objective optimizers, yet the top-performance candidates remain competitive with single-objective solutions. Additionally, we conduct the first evaluation of multi-objective machine learning experiments that considers generalization and robustness through noisy R2 and approximation gap, enabling a more realistic assessment of solution quality. MO-CAPO enables practitioners to select from an efficiently discovered set of multiple prompts offering different trade-offs between performance and cost.

2605.18867 2026-05-20 cs.LG cs.AI

EVA-0: Test-Time Model Evolution with Only Two Forward Passes per Sample

EVA-0: 仅两次前向传递的测试时间模型演化

Guohao Chen, Shuaicheng Niu, Geng Li, Yunbei Zhang, Shilin Shan, Chunyan Miao, Jianfei Yang

AI总结 本文研究了在仅两次前向传递预算下测试时间模型演化的问题,提出EVA-0框架以解决零阶优化中的三个关键障碍,实现高效部署。

详情
AI中文摘要

测试时间模型演化为部署模型提供了一种改进 unlabeled 测试时间经验的有前景方法,但大多数现有方法依赖反向传播(BP),这导致了显著的内存开销,使它们难以在边缘设备、量化模型、专用加速器或黑盒模型上部署。在本文中,我们研究了在严格两次前向预算下测试时间模型演化,这一设置推动了适应向高度高效的现实部署发展。我们揭示了零阶测试时间优化中的三个关键障碍:对捷径解的易感性、不受控的权重漂移和无效的更新方向估计。为克服这些问题,我们提出了EVA-0,一个最小的零阶适应框架,其特点包括:1)保持损失尺度不变以防止捷径解;2)设计了锚点引导的优化策略以缓解权重漂移;3)使用样本级对称双侧扰动进行更新方向估计和推理。EVA-0不需要BP,并且在每个样本上仅需两次前向传递即可完成推理和适应。在ImageNet-C和ViT-Base上的结果表明,EVA-0优于基于BP的DeYO和无BP的FOA,并在FOA上实现了14倍的速度提升。代码将被发布。

英文摘要

Test-time model evolution offers a promising way for deployed models to improve from unlabeled test-time experience, yet most existing methods depend on backpropagation (BP), which incurs substantial memory overhead and makes them difficult to deploy on edge devices, quantized models, specialized accelerators, or black-box models. In this work, we study test-time model evolution under a strict two-forward budget, a setting that pushes adaptation toward highly efficient real-world deployment. We reveal three key obstacles in zeroth-order test-time optimization: susceptibility to shortcut solutions, uncontrolled weight drift, and ineffective update direction estimation. To overcome them, we propose EVA-0, a minimal zeroth-order adaptation framework that: 1) keeps the loss scale-invariant to prevent shortcut solutions; 2) devises an anchor-guided optimization strategy to alleviate weight drift; 3) uses sample-wise symmetric two-sided perturbation for update direction estimation and inference. EVA-0 requires no BP and performs both inference and adaptation within only two forward passes per sample. Results on ImageNet-C & ViT-Base show that EVA-0 outperforms both BP-based DeYO and BP-free FOA, while achieving a 14x speed-up over FOA. Code will be released.

2605.18865 2026-05-20 cs.LG cs.AI

From Sparsity to Simplicity: Enabling Simpler Sequential Replacements via Sparse Attention Distillation

从稀疏到简单:通过稀疏注意力蒸馏实现更简单的顺序替换

Yuxin Ren, Maxwell D Collins, Miao Hu, Huanrui Yang

AI总结 本文提出通过稀疏注意力蒸馏实现更简单的顺序替换,通过分析transformer层中的稀疏模式,发现可以将复杂的token依赖分解为不同复杂度的序列到序列映射,并用更简单的顺序模块替代部分层功能,从而减少参数量和延迟。

详情
AI中文摘要

自注意力机制是大规模transformer预训练的核心基础,但其二次token交互成本使得推理过程昂贵。用更简单的顺序模块替代注意力具有吸引力,但直接替换往往导致信息丢失,尤其是在大规模情况下。本文通过稀疏性的视角重新审视注意力替换。基于对transformer各层中稀疏模式的观察,我们提出预训练transformer将复杂的token依赖分解为多种复杂度的序列到序列映射,其中某些层的功能可以被近似并用更简单的顺序模块替代而不丢失信息。我们通过插拔式层间蒸馏框架验证这一前提,以近似和替代预训练视觉transformer模型中的注意力功能。在固定训练预算下,受控组的替换结果显示:替换稀疏注意力的层比替换密集注意力的层导致的准确率下降更小。我们进一步通过AViT风格的token保留对预训练的ViT施加显式的注意力稀疏性,并进行稀疏性引导的顺序替换模型蒸馏,其中我们发现增加教师模型的稀疏性会一致减少学生模型与教师模型之间的差距。所提出的方法通过注意力稀疏性的指导实现了更小的参数量和延迟的高效注意力替换。

英文摘要

Self-attention serves as the core foundation of large-scale transformer pretraining, but its quadratic token interaction cost makes inference expensive. Replacing attention with simpler sequential modules is appealing, yet naive substitution is often lossy, especially at larger scales. This paper revisits attention replacement through the lens of sparsity. Based on the observation of diverse sparsity patterns across transformer layers, we posit that pretrained transformers decompose the complex token dependency across tokens into various sequence-to-sequence mappings of diverse complexities, where some layer functionalities can be approximated and replaced with much simpler sequential modules without loss. We evaluate this premise using a plug-and-play layer-wise distillation framework to approximate and replace attention functionalities in pretrained vision transformer models. Controlled group-wise replacements under a fixed training budget reveal a clear pattern: substituting layers with sparser attention incurs substantially smaller accuracy drops than replacing denser ones. We further impose explicit attention sparsity on the pretrained ViT via AViT-style token retention and perform sparsity-guided distillation for sequential replacing models, where we see increasing teacher sparsity consistently reduces the student-teacher gap. The proposed method achieves efficient attention replacement for reduced parameter size and latency through the guidance of attention sparsity.

2605.18864 2026-05-20 cs.LG cs.AI cs.CL

SAGE: Shaping Anchors for Guided Exploration in RLVR of LLMs

SAGE: 通过塑造锚点引导LLMs的RLVR探索

Chanuk Lee, Minki Kang, Sung Ju Hwang

AI总结 本文提出SAGE框架,通过重塑反KL锚分布来实现可控的经验支持扩展,从而在数学推理基准中提升pass@1和pass@k的表现。

Comments Preprint

详情
AI中文摘要

近期研究发现,可验证奖励的强化学习(RLVR)能够可靠地提高推理任务的pass@1指标,但往往在pass@k上未能取得类似提升,引发了关于RLVR是否真正使大语言模型获得新推理能力还是仅提高基础模型中现有推理模式采样效率的问题。先前分析大多支持后者观点,认为这种限制源于标准RLVR目标的结构特性,导致探索压力不足。在本文中,我们提出一个核心结构约束源于反KL正则化,该正则化稳定了训练但本质上将策略锚定于参考分布,从而抑制了替代推理模式的出现。然而,我们显示,去除KL项或用前向KL替代并不能提供满意的解决方案,因为两者都会通过诱导奖励黑客或将概率质量分配给非目标区域而破坏效率-覆盖权衡。为了解决这一矛盾,我们提出了SAGE,一个原理性的框架,通过引导函数q(x,y)重塑反KL锚分布本身,实现可控的经验支持扩展,从而在挑战性的数学推理基准中获得一致的pass@1和pass@k提升。我们的代码可在https://github.com/tally0818/SAGE上获得。

英文摘要

Recent studies observe that reinforcement learning with verifiable rewards (RLVR) reliably improves pass@1 on reasoning tasks, yet often fails to yield comparable gains in pass@k, raising the question of whether RLVR genuinely enables large language models to acquire novel reasoning abilities or merely enhances the efficiency of sampling reasoning modes already present in the base model. Prior analyses largely support the latter view, attributing this limitation to structural properties of standard RLVR objectives that result in insufficient exploration pressure. In this work, we argue that a central structural constraint arises from reverse-KL regularization, which stabilizes training but inherently anchors the policy to the reference distribution, thereby suppressing the emergence of alternative reasoning modes. However, we show that neither removing the KL term nor replacing it with forward-KL provides a satisfactory solution, as both disrupt the efficiency-coverage trade-off by either inducing reward hacking or allocating probability mass to off-target regions. To resolve this tension, we propose SAGE, a principled framework that enables controllable empirical support expansion by reshaping the reverse-KL anchor distribution itself through a guide function q(x,y), achieving consistent improvements in both pass@1 and pass@k across challenging mathematical reasoning benchmarks. Our code is available at https://github.com/tally0818/SAGE.

2605.18862 2026-05-20 cs.LG cs.AI cs.CR

Towards Family-Grouped Hierarchical Federated Learning on Sub-5KB Models: A Feasibility Study of Privacy-Preserving ECG Monitoring for Ultra-Resource-Constrained Wearables

面向子5KB模型的家庭分组分层联邦学习:隐私保护ECG监测在超低资源约束可穿戴设备上的可行性研究

Hangyu Wu

AI总结 本文提出家庭分组分层联邦学习(Family-FL)和轻量级Tiny CNN-LSTM架构,通过模拟评估在超低资源约束微控制器上实现隐私保护的联邦学习的可行性,展示了在MIT-BIH数据库上达到91.9%的准确率和76.7%的通信量减少。

Comments Supported by Shenzhen Coddie Technology Co., Ltd. This is a preprint and has not been peer-reviewed

详情
AI中文摘要

心血管疾病仍是全球导致死亡的主要原因,通过可穿戴设备持续ECG监测早期检测心律失常可以预防危及生命事件。联邦学习(FL)通过在设备上保留原始ECG数据实现隐私保护的协同训练,但标准FL导致通信开销过大,标准深度学习模型无法在超低功耗微控制器上运行。我们提出家庭分组分层联邦学习(Family-FL),一种三级架构,利用家庭作为隐私边界在家庭内聚合后再进行全局同步。我们进一步设计了一种硬件受限的Tiny CNN-LSTM架构,仅包含669个参数,INT8量化后仅占用4.65KB Flash和2.95KB RAM,满足STC32G12K128类微控制器的约束。在MIT-BIH心律失常数据库上的实验(5次独立运行的平均值)表明,Family-FL相比FedAvg减少了76.7%的通信量,同时保持了可比的准确性。Family-FL-Tiny在91.9±1.2%的准确率和宏F1为0.483±0.031的情况下,将总通信量减少到FedAvg的0.31%。该模型实现了可靠的室性心律失常检测(每类F1=0.80),这是家庭初步筛查中最临床关键的异常情况。这些结果通过基于模拟的评估证明了通过隐私保护联邦学习在超低资源约束微控制器上的技术可行性。我们诚实地讨论了局限性:无硬件部署、单数据集验证(MIT-BIH,47名受试者)、罕见类敏感性降低以及无正式差分隐私保证。

英文摘要

Cardiovascular disease remains the leading cause of death worldwide, and early detection of arrhythmias through continuous ECG monitoring on wearable devices can prevent life-threatening events. Federated Learning (FL) enables privacy-preserving collaborative training by keeping raw ECG data on device, yet standard FL incurs prohibitive communication overhead and standard deep learning models cannot fit on ultra-low-power microcontrollers. We propose Family-Grouped Hierarchical Federated Learning (Family-FL), a three-tier architecture that uses the family as a natural privacy boundary for intra-family aggregation before global synchronization. We further design a hardware-constrained Tiny CNN-LSTM architecture with only 669 parameters, INT8-quantized to occupy merely 4.65KB Flash and 2.95KB RAM, meeting the constraints of STC32G12K128-class microcontrollers. Experiments on the MIT-BIH Arrhythmia Database (mean of 5 independent runs with different seeds) demonstrate that Family-FL reduces communication volume by 76.7% compared to FedAvg while maintaining comparable accuracy. Family-FL-Tiny achieves 91.9 +/- 1.2% accuracy with macro-F1 of 0.483 +/- 0.031, reducing total communication to 0.31% of FedAvg. The model achieves reliable ventricular arrhythmia detection (per-class F1 = 0.80), the most clinically critical abnormality for home-based preliminary screening. These results demonstrate the technical feasibility of privacy-preserving federated learning on ultra-resource-constrained microcontrollers through simulation-based evaluation. We honestly discuss limitations: no hardware deployment, single-dataset validation (MIT-BIH, 47 subjects), reduced rare-class sensitivity, and absence of formal differential privacy guarantees.

2605.18858 2026-05-20 cs.LG cs.AI cs.GT stat.ML

When Individually Calibrated Models Become Collectively Miscalibrated

当个体校准的模型成为集体不校准的

Zhaohui Wang

AI总结 研究探讨了在多智能体环境中,即使每个模型都经过个体校准,聚合预测仍可能不校准的现象,提出通过VCG聚合方法解决这一问题,实现激励相容和近最优性能。

Comments 42 pages, 1 main figure, multiple tables. Accepted at ProbML 2026

详情
AI中文摘要

概率预测系统常常将多个模型的概率估计聚合为单一决策。一个常见假设是,如果每个模型都经过个体校准,聚合预测也将是良好的校准。我们展示了在多智能体设置中,这一假设不成立:当预测者战略性地相互作用时,即使没有刻意协调,个体校准的预测者也可能集体上不校准。这种现象自然出现在智能体在重叠数据上独立训练时。我们证明,在基于Brier分数的聚合中,当信念正相关时,每个智能体的个体最优报告系统地低估了正类概率,导致价格of anarchy大于一,只要协方差(b_i, b_j) > 0。在典型设置(n=5个智能体,成对相关性=0.5,基础率=0.3)中,经实测的PoA在假阴性率上达到7.25倍。相比之下,基于VCG的聚合通过奖励边际贡献对齐激励,实现主导策略激励相容性和近最优性能。在三个现实世界数据集(NSL-KDD、UNSW-NB15、信用卡欺诈)上的实验显示,VCG在保持可比准确性的同时表现出强鲁棒性。它在数据稀疏和对抗性设置中表现尤其出色,自适应加权进一步在分布偏移下提升了性能。

英文摘要

Probabilistic prediction systems often aggregate probability estimates from multiple models into a single decision. A common assumption is that if each model is individually calibrated, the aggregate prediction will also be well calibrated. We show that this assumption fails in multi-agent settings: individually calibrated predictors can become collectively miscalibrated when their predictions interact strategically, in the game-theoretic sense of Brier-optimal local response, even without deliberate coordination. This phenomenon arises naturally when agents are independently trained on overlapping data. We prove that under Brier-score-based aggregation with positively correlated beliefs, each agent's individually optimal report systematically underestimates the positive-class probability, yielding a Price of Anarchy greater than one whenever Cov(b_i, b_j) > 0. In a canonical setting (n = 5 agents, pairwise correlation = 0.5, base rate = 0.3), the empirically measured PoA in false-negative rate reaches 7.25x. In contrast, VCG-based aggregation aligns incentives by rewarding marginal contribution, achieving dominant-strategy incentive compatibility and near-optimal performance. Experiments on three real-world datasets (NSL-KDD, UNSW-NB15, Credit Card Fraud) show that VCG provides strong robustness while maintaining comparable accuracy. It performs particularly well in data-sparse and adversarial settings, and adaptive weighting further improves performance under distribution shift.

2605.18855 2026-05-20 cs.LG cs.CV

Delta Attention Residuals

Delta Attention Residuals

Cheng Luo, Zefan Cai, Junjie Hu

AI总结 本文提出Delta Attention Residuals,通过在残差连接中引入对每个子层引入的变化(delta)进行注意力机制,解决了传统注意力残差中因累积隐藏状态冗余导致的路由崩溃问题,从而提升模型跨层选择信息的能力。

详情
AI中文摘要

Attention Residuals将标准加性残差连接替换为在前一层输出上学习的softmax注意力,实现了选择性的跨层路由。然而,标准Attention Residuals仍然在累积的隐藏状态上进行注意力计算,这些状态高度冗余。我们发现这种冗余导致在更深的层中出现路由崩溃:注意力权重变得低对比度且接近均匀(最大权重≈0.2),限制了模型在前一层中选择信息性状态的能力。这提出了一个关键但尚未深入研究的设计问题:在Attention Residuals中应路由何种层间表示?为回答这个问题,我们提出了Delta Attention Residuals,其在delta(每个子层引入的变化(v_i = h_{i+1} - h_i))上进行注意力计算,而非累积状态。Delta表示在结构上具有多样性,产生更高对比度的注意力分布(最大权重≈0.6),从而在层间实现更选择性和有效的路由。这一原则适用于单个子层和块粒度。在所有测试的规模(220M-7.6B)中,Delta Attention Residuals始终优于标准残差和Attention Residuals,验证困惑度提升1.7-8.2%。Delta Attention Residuals还允许通过标准微调将预训练检查点转换为Delta Attention Residuals。代码可在https://github.com/wdlctc/delta-attention-residuals-code获得。

英文摘要

Attention Residuals replace standard additive residual connections with learned softmax attention over previous layer outputs, enabling selective cross-layer routing. However, standard Attention Residuals still attend over cumulative hidden states in previous layers, which are highly redundant. We show that this redundancy leads to routing collapse in deeper layers: attention weights become low-contrast and closer to uniform (max weight ${\approx}$0.2), limiting the model's ability to select informative states in previous layers. This raises a key but underexplored design question: what layer-wise representations should be routed in Attention Residuals? To answer this question, we propose Delta Attention Residuals, which attend over deltas -- the change introduced by each sublayer ($\mathbf{v}_i = \mathbf{h}_{i+1} - \mathbf{h}_i$) -- instead of cumulative states. Delta representations are structurally diverse and yield higher-contrast attention distributions (max weight ${\approx}$0.6), enabling more selective and effective routing across layers. This principle applies at both per-sublayer and block granularity. Across all tested scales (220M--7.6B), Delta Attention Residuals consistently outperform both standard residuals and Attention Residuals, with 1.7--8.2\% validation perplexity gains. Delta Attention Residuals also enables converting pretrained checkpoints into Delta Attention Residuals via standard fine-tuning. Code is available at https://github.com/wdlctc/delta-attention-residuals-code.

2605.18854 2026-05-20 cs.LG

Evaluating Memory Condensation Strategies for Coding Agents in Data-Driven Scientific Discovery

评估用于数据驱动科学发现的编码代理的记忆压缩策略

Renuka Chintalapati, Sid Raskar, Anurag Acharya, Jared Willard, Patrick Emami, Sameera Horawalavithana

AI总结 本文评估了八种记忆压缩策略在数据驱动科学发现任务中的表现,发现没有压缩器显著提升假设质量,但基于LLM的压缩器会增加24-94%的token成本,而屏蔽工具调用输出可实现8.6%的净节省,且最佳压缩器因科学领域和任务长度而异。

详情
AI中文摘要

编码代理在长时间任务中积累大量上下文,但固定的上下文窗口迫使从业者在截断和任务失败之间做出选择。尽管已提出许多记忆压缩策略,从简单的滑动窗口到LLM生成的摘要,但缺乏系统性的比较来指导策略选择,尤其是在科学发现任务中。我们使用GPT-4o对六十个DiscoveryBench任务(涵盖六个科学领域,总计480次评估)评估了八种记忆压缩策略。我们发现,没有压缩器显著改变假设质量,而基于LLM的压缩器会增加24-94%的token成本,屏蔽工具调用输出可实现8.6%的净节省。我们还观察到,数据驱动科学发现的最佳压缩器因科学领域和任务长度而异。

英文摘要

Coding agents accumulate extensive context during long-running tasks, yet fixed context windows force practitioners to choose between truncation and task failure. While numerous memory condensation strategies have been proposed, from simple sliding windows to LLM-generated summaries, no systematic comparison exists to guide strategy selection, especially in scientific discovery tasks. We evaluate eight memory condensation strategies using GPT-4o on sixty DiscoveryBench tasks spanning six scientific domains (480 total evaluations). We find that no condenser significantly alters hypothesis quality, while LLM-based condensers increase token costs by 24-94 percent, and masking tool-call outputs achieves an 8.6 percent net savings. We also observe that the optimal condenser for data-driven scientific discovery varies by scientific domain and task length.

2605.18853 2026-05-20 cs.LG cs.CV cs.DC

INAR-VL: Input-Aware Routing for Edge-Cloud Vision-Language Inference

INAR-VL:面向边缘-云视觉-语言推断的输入感知路由

Ahmed Šabanović, Paul Joe Maliakel, Ivona Brandić

AI总结 本文提出INAR-VL,一种轻量级的边缘-云路由系统,用于多模态推断的两级部署。该系统通过轻量级的图像和文本复杂度信号指导路由和模型选择,在本地执行简单查询,将复杂查询卸载到云端,从而在延迟、能耗和准确性之间取得平衡。

Comments 8 pages, 3 figures

详情
AI中文摘要

边缘部署的视觉-语言模型(VLMs)面临延迟与准确性的权衡:云端执行提供高质量预测但会带来通信延迟和能耗,而仅边缘执行则速度更快但准确性较低,因为模型容量有限。这种权衡进一步受到图像质量和推理复杂度异质性的影响,使静态部署效果不佳。我们提出了INAR-VL,一种轻量级的边缘-云路由系统,用于两级部署中的多模态推断。INAR-VL在边缘和云端维护互补的VLMs,并利用轻量级的图像和文本复杂度信号指导路由和模型选择,执行简单查询本地化,当有利时将复杂查询卸载到云端。在视觉问答任务上的评估表明,INAR-VL将36%的请求执行在边缘,延迟降低24%,能耗降低26%,并保持97%的云端准确性。

英文摘要

Edge deployment of Vision-Language Models (VLMs) faces a tradeoff between latency and accuracy: cloud execution provides high-quality predictions but incurs communication delay and energy cost, while edge-only execution is faster but less accurate due to limited model capacity. This trade-off is further complicated by heterogeneity in image quality and reasoning complexity, making static placement suboptimal. We present INAR-VL, a lightweight edge-cloud routing system for multimodal inference in a two-tier deployment. INAR-VL maintains complementary VLMs across edge and cloud and uses lightweight image and text complexity signals to guide routing and model selection, executing simple queries locally while offloading complex ones when beneficial. Evaluation on visual question answering shows that INAR-VL executes 36% of requests on the edge, reduces latency by 24%, lowers energy by 26%, and preserves 97% of cloud-level accuracy.

2605.18852 2026-05-20 cs.LG cs.AI cs.CL

Robust Checkpoint Selection for Multimodal LLMs via Agentic Evaluation and Stability-Aware Ranking

通过代理评估和稳定性感知排名实现多模态大语言模型的鲁棒检查点选择

Qinwu Xu, Zhuoheng Li, Jessie Salas

AI总结 本文提出了一种多阶段框架,结合了精心挑选的现实世界数据、结构化的LLM判断和多阶段排名协议,以解决多模态大语言模型检查点选择中的鲁棒决策问题,强调数据质量(特别是OCR可读性)对评估有效性的重要性。

详情
AI中文摘要

多模态大语言模型(MLLMs)的检查点选择在性能差异微小且评估信号易受噪声影响时面临重大挑战。现有方法依赖静态基准或逐点评分,经常与实际应用场景不一致,并缺乏对不确定性的鲁棒估计,特别是在OCR密集场景中。在本文中,我们将检查点选择建模为在评估不确定性下的稳健决策问题。我们提出了一种多阶段框架,整合了精心挑选的现实世界数据、结构化的LLM判断和多阶段排名协议。评估系统通过逐点过滤、列表排名和成对比较进行逐步细化。为了提高可靠性,我们引入基于子采样的置信度估计和基于百分位数的评分公式,以捕捉分布特征并惩罚尾部失败。此外,我们证明数据质量,特别是OCR可读性,是评估有效性的重要决定因素。

英文摘要

Checkpoint selection for multimodal large language models (MLLMs) presents significant challenges when performance differentials are marginal and evaluation signals are prone to noise. Existing methodologies rely heavily on static benchmarks or pointwise scoring, which frequently misalign with in-the-wild usage and lack robust uncertainty estimation, particularly in OCR-heavy scenarios. In this work, we formulate checkpoint selection as a robust decision problem under evaluation uncertainty. We propose a multi-stage framework that integrates curated real-world data, structured LLM-based judgment, and multi-stage ranking protocols. The evaluation system orchestrates progressive refinement via pointwise filtering, listwise ranking, and pairwise comparison. To enhance reliability, we introduce subsampling-based confidence estimation and a percentile-based scoring formulation that captures distributional characteristics while penalizing tail failures. Furthermore, we demonstrate that data quality, specifically OCR readability, is a critical determinant of evaluation validity.

2605.18851 2026-05-20 cs.LG

STRIDE: Learnable Stepwise Language Feedback for LLM Reasoning

STRIDE: 用于LLM推理的可学习分步语言反馈

Junjie Zhang, Guozheng Ma, Shunyu Liu, Zetian Hu, Yongcheng Jing, Ting-En Lin, Yongbin Li, Dacheng Tao

AI总结 本文提出STRIDE框架,通过可学习的分步语言反馈提升LLM推理能力,解决了传统方法在标注成本高、信息瓶颈等问题,实验显示其在多种推理基准上表现优异。

详情
AI中文摘要

最近强化学习(RL)的进步突显了其在激励大型语言模型(LLM)推理能力的潜力。然而,现有分步级方法面临标注成本高、领域覆盖有限的问题,而标量评分进一步引入信息瓶颈,无法提供足够的语义带宽来改进中间决策。替代的语言批评方法依赖于冻结或外部批评者,虽然提供更丰富的文本反馈,但缺乏持续政策改进所需的可扩展性。在本工作中,我们提出语言驱动的分步轨迹重定向(STRIDE),一种新颖的训练框架,将过程监督从标量奖励转移到可学习的分步语言反馈。具体来说,我们仅使用基于结果的奖励共同训练生成器和生成验证器,消除外部标注,通过联合对齐的验证器训练实现持续的政策改进。验证器的分步语言批评明确本地化并解释失败,使生成器能够在中间步骤将推理轨迹转向替代决策。轨迹重定向设计保证了即使在噪声或次优验证器反馈下也能实现无害的政策改进。在多样化的推理基准实验中,STRIDE显著优于最先进的基线,同时在零次通过率问题上取得突破,其中标量方法在消融研究中无法产生学习信号,证明了可学习分步语言反馈在增强LLM推理能力方面的有效性。

英文摘要

Recent advances in Reinforcement Learning (RL) have underscored its potential for incentivizing reasoning capabilities of Large Language Models (LLMs). However, existing step-level efforts suffer from costly annotations that limit domain coverage, while scalar scores further impose an information bottleneck, offering insufficient semantic bandwidth to improve intermediate decisions. Alternative language-critique approaches, which rely on frozen or external critics, provide richer textual feedback but lack the scalability needed for sustained policy improvement. In this work, we propose language-driven stepwise trajectory redirection, termed as STRIDE, a novel training framework that shifts process supervision from scalar rewards to learnable stepwise language feedback. Specifically, we co-train a generator and a generative verifier using only outcome-based rewards, eliminating external annotations, while delivering sustained policy improvement through jointly aligned verifier training. The verifier's stepwise language critiques explicitly localize and explain failures, enabling the generator to redirect reasoning trajectories at intermediate steps toward alternative decisions. The trajectory redirection design guarantees harmless policy improvement, even under noisy or suboptimal verifier feedback. Experiments on diverse reasoning benchmarks show that STRIDE significantly outperforms state-of-the-art baselines, as well as achieving breakthroughs on zero-pass-rate problems where scalar methods yield no learning signal in our ablation studies, demonstrating the effectiveness of learnable stepwise language feedback for enhancing LLM reasoning.

2605.18849 2026-05-20 cs.LG cs.AI

INSIGHTS: Demonstration-Based Summaries of Time Series Predictors

INSIGHTS: 时间序列预测器的基于演示的摘要

Bar Eini Porat, Rom Gutman, Uri Shalit, Ofra Amir

AI总结 本文提出INSIGHTS方法,一种模型无关、以用户为中心的方法,用于提供时间序列模型的全局解释。该方法通过生成样本摘要,平衡时间序列样本的重要性与多样性,为用户提供全面的模型行为概述。

详情
AI中文摘要

可解释性方法发展迅速,但时间序列模型的全局解释仍不完善,大多数方法集中在局部实例层面的解释上。我们介绍了INSIGHTS,一种模型无关、以用户为中心的方法,用于提供时间序列模型的全局解释。我们的方法在设计上优先考虑简单性、效率和透明性,确保利益相关者能够轻松采用其输出。尽管当前方法专注于局部解释,INSIGHTS生成样本摘要,提供模型行为的全面概述。它通过利用效用函数平衡时间序列样本的重要性与多样性,捕捉领域特定的时间序列行为特征,如超过领域规范。我们通过实验、访谈和用户研究评估INSIGHTS。我们的结果表明,INSIGHTS能够构建全面、多样的时间序列子集,生成易于个体评估的摘要。它受到领域专家的青睐,因其能够提供模型行为的稳定理解以及识别的样本质量。此外,接受INSIGHTS摘要的用户研究参与者表现出对模型整体行为的更深入理解。

英文摘要

Explainability methods have progressed rapidly, but global explanations for time-series models remain underdeveloped, with most approaches focusing on local, instance-level attributions. We introduce INSIGHTS, a model-agnostic, user-centric approach for providing global explanations of time series models. Our approach prioritizes simplicity, efficiency, and transparency in its design, ensuring that stakeholders can readily adopt its outputs. While current methods focus on local explanations, INSIGHTS generates sample summaries that offer a comprehensive overview of model behavior. It balances the importance and diversity of time series samples to create informative subsets using utility functions that capture domain-specific aspects of time series behavior, such as exceeding domain norms. We evaluate INSIGHTS through experiments, interviews, and a user study. Our results indicate INSIGHTS effectively constructs comprehensive, diverse time series subsets, producing summaries manageable for individual evaluation. It is preferred by domain experts for its ability to provide a stable understanding of model behavior and the quality of the samples identified. Moreover, user study participants presented with INSIGHTS-based summaries exhibit an enhanced understanding of the model's overall behavior.

2605.18847 2026-05-20 cs.LG cs.AI

Transformers Linearly Represent Highly Structured World Models

Transformer 通过线性方式表示高度结构化的世界模型

Roman Kniazev, Nathanaël Fijalkow

AI总结 研究探讨了Transformer在训练过程中是否能构建任务的内部模型,并发现其内部表示结构与领域结构相匹配,通过Sudoku求解轨迹训练的Transformer展示了其内部计算机制和稀疏可解释的决策电路。

详情
AI中文摘要

当Transformer被训练于顺序推理轨迹时,它们是否会构建底层任务的内部模型?如果是的话,这些内部表示的结构是否与领域结构相匹配?我们训练了一个8层的Transformer模型来解决数独问题,并对其内部计算进行了机理分析。我们得出两个结论。第一,该模型构建了一个子结构世界模型:它不按人分析员所期望的那样逐个单元格表示棋盘状态,而是围绕数独约束所作用的行、列和盒子来组织信息。第二,我们识别出一个裸单电路:在最终的MLP层中,一组专用神经元,每个神经元单独检测特定单元格中恰好只剩一个可能的数字,并可靠地促进该数字。这些发现表明,涌现世界模型的几何结构由领域约束代数决定,而非其表面表现,且所得到的决策电路是稀疏的、单义的且完全可解释的。更广泛地说,这些发现展示了机理可解释性工具能够恢复Transformer如何解决组合推理任务的端到端算法账户。

英文摘要

Do transformers, when trained on sequential reasoning traces, build internal models of the underlying task? And if so, does the structure of those internal representations mirror the structure of the domain? We train an 8-layer transformer on Sudoku solving traces and perform a mechanistic analysis of its internal computation. We establish two results. First, the model builds a substructure world model: it does not represent the board state cell by cell, as a human analyst would expect, but organizes information around the rows, columns, and boxes that Sudoku's constraints act on. Second, we identify a naked-single circuit: a small set of dedicated neurons in the final MLP layer, each individually detecting when exactly one digit remains possible for a specific cell, and reliably promoting that digit. These findings show that the geometry of an emergent world model is shaped by the constraint algebra of the domain, not its surface presentation, and that the resulting decision circuit is sparse, monosemantic, and fully interpretable. More broadly, they demonstrate that mechanistic interpretability tools can recover an end-to-end algorithmic account of how a transformer solves a combinatorial reasoning task.

2605.18846 2026-05-20 cs.LG cs.AI cs.IT math.IT

Lost and Found in Translation: Variational Diagnostics for Neural Codebook Channels

译失与找回:变分诊断用于神经码本信道

Yusuke Hayashi

AI总结 该研究提出了一种变分诊断方法,用于评估神经码本信道中解码器对编码器码本的读取情况,解决了传统VAE诊断无法判断解码器是否正确读取编码器码本的问题。

Comments 9 pages, 2 figures

详情
AI中文摘要

经典通信系统不仅因随机噪声失效,还当发射端和接收端使用不兼容的操作码本时也会失效。变分自编码器(VAEs)联合训练编码器$ q_ϕ $和解码器$ p_θ $,并将其潜在空间视为离散码用于聚类、条件生成和机制可解释性。然而,标准VAE诊断——ELBO、主动单元、互信息和码本直方图——只能验证该码是否被使用,而不能验证解码器是否在编码器的码下读取每个潜在变量。我们通过神经码本信道$ K_{e o d}(j\mid i) $,一种耦合的编码器-解码器诊断方法,填补了这一差距。该信道的非对角线质量由架构无关的伯努利-KL证书$ d_{\mathrm{bin}}(1-\mathcal{A} \,\|\, arη_p) \le arΔ $控制,该证书是经典KL链式法则在离散化到编码器-解码器不一致事件下的操作专门化,补充了构造性的边缘不可能性结果:没有任何组合的边缘直方图、熵、主动码计数或互信息决定$ K_{e o d} $。我们对四个sklearn数据集(有限网格精确、5/5种子、20/20对满足边界)、二维模型(在$ 2.71 imes $观测到的不一致处非空虚)、MNIST在重要性采样控制下以及一个VQ-VAE达到预测极限$ \hat{\mathcal{A}}=1.000 $进行了证书审计。该包$ (K_{e o d}, \mathcal{A}, R_{\mathrm{eff}}, R, \mathrm{AU}) $是一个审计准备的报告单位。更广泛地说,该框架使不匹配解码——经典通信理论数十年前所命名的失败模式——在单个深度生成模型中可见。

英文摘要

Classical communication systems fail not only through random noise but also when transmitter and receiver use incompatible operational codebooks. Variational autoencoders (VAEs) train an encoder $q_ϕ$ and decoder $p_θ$ jointly, and practitioners treat the resulting latent space as a discrete code -- for clustering, conditional generation, and mechanistic interpretability. Yet standard VAE diagnostics -- ELBO, active units, mutual information, and code histograms -- certify only whether this code is used, never whether the decoder reads each latent under the encoder's code. We close this gap with the neural codebook channel $K_{e\to d}(j\mid i)$, a coupled encoder-decoder diagnostic whose off-diagonal mass is bounded by an architecture-free Bernoulli-KL certificate $d_{\mathrm{bin}}(1-\mathcal{A} \,\|\, \barη_p) \le \barΔ$ controlled by the variational gap. The certificate is the operational specialization of the classical KL chain rule under disintegration to the encoder-decoder disagreement event, complemented by a constructive marginal-impossibility result: no combination of marginal histograms, entropies, active-code counts, or mutual information determines $K_{e\to d}$. We audit the certificate on four sklearn datasets (finite-grid exact, 5/5 seeds, 20/20 pairs satisfy the bound), a 2D model where the bound is non-vacuous at $2.71\times$ the observed disagreement and the four-term identity closes within $10^{-4}$, MNIST under importance-sampling control, and a VQ-VAE attaining the predicted limit $\hat{\mathcal{A}}=1.000$. The package $(K_{e\to d}, \mathcal{A}, R_{\mathrm{eff}}, R, \mathrm{AU})$ is an audit-ready reporting unit. More broadly, the framework makes mismatched decoding -- a failure mode classical communication theory named decades ago -- visible inside a single deep generative model.

2605.18845 2026-05-20 cs.LG cs.AI

First-Passage Prediction of Grokking Delay: ACalibrated Law under AdamW with Causal Validation

Grokking延迟的首次通过预测:AdamW下的校准定律与因果验证

Truong Xuan Khanh, Truong Quynh Hoa, Luu Duc Trung, Phan Thanh Duc

AI总结 本文提出了一种在AdamW优化器下预测grokking延迟的定量方法,通过推导闭合形式定律并结合因果验证,实现了对模型记忆延迟的准确预测。

Comments 51 pages, 7 figures, 6 tables. Preprint

详情
AI中文摘要

我们首次对AdamW下的grokking延迟进行了定量预测。将延迟视为首次通过时间,推导出闭合形式定律T_grok - T_mem = (1 / 2 kappa_LL eta lambda) log(V_mem / V_star),其中V_t = ||theta_t||^2是参数范数的平方,V_star是架构相关的阈值,kappa_LL吸收了AdamW对clean-SGD收缩率2 eta lambda的修正。在单个超参数单元上校准(kappa_LL, V_star)可对26个保留运行的grokking延迟进行预测,MAPE为17.7%(在41倍延迟范围内);该定律适用于MLP(MAPE 18.0%,N=34)但在跨任务扩展时退化为23.3%(N=46,43.5倍范围),其中存在结构残差,V_star / V_mem在架构内相对稳定(CV约为14%在1L变压器上)。首次通过V_t是必要但不充分的。定量分位数定理表明,正延迟需要同时满足范数分离V_mem > V_post和阈值alpha_star = arcsin(C / V_T_mem^(1/2))的角达性,其中C可从经验NTK特征图和验证-边距分位数中计算。在模数p=89上校准C可预测alpha_star = 47.2度(p=97时观测到47.8度,误差1.3%)作为先验跨单元预测。因果干预冻结范数或移除权重衰减在记忆化时消除grokking(0/6 vs. 3/3基线),使角位移保持在12度附近。kappa_LL是按架构经验测量而非从(beta_1, beta_2, epsilon)推导;同一架构内CV最大为15%(四个架构内),但不同架构变体之间的值差异约为2倍。经验范围是AdamW下的算法任务(模运算,稀疏奇偶性);该定律是否适用于自然语言模型尚不明确。

英文摘要

We give the first quantitative prediction of grokking delay under AdamW. Treating the delay as a first-passage time, we derive a closed-form law T_grok - T_mem = (1 / 2 kappa_LL eta lambda) log(V_mem / V_star), where V_t = ||theta_t||^2 is the squared parameter norm, V_star is an architecture-dependent threshold, and kappa_LL absorbs the AdamW correction to the clean-SGD contraction rate 2 eta lambda. Calibrating (kappa_LL, V_star) on a single hyperparameter cell predicts grokking delays on 26 held-out runs with MAPE 17.7% over a 41x delay range; the law generalises to MLPs (MAPE 18.0%, N=34) and degrades to 23.3% on cross-task extension (N=46, 43.5x range), with a structured residual in which V_star / V_mem stays comparatively stable within architecture (CV about 14% on the 1L transformer). First-passage of V_t is necessary but not sufficient. A quantile-margin theorem establishes that positive delay requires both norm separation V_mem > V_post and angular reachability of a threshold alpha_star = arcsin(C / V_T_mem^(1/2)), where C is computable from the empirical NTK feature map and the validation-margin quantile. Calibrating C on modulus p=89 predicts alpha_star = 47.2 degrees at p=97 (observed 47.8 degrees, error 1.3%) as a prior cross-cell prediction. Causal interventions that freeze the norm or remove weight decay at memorisation eliminate grokking (0/6 vs. 3/3 baseline), trapping the angular displacement near 12 degrees. kappa_LL is empirically measured per architecture rather than derived from (beta_1, beta_2, epsilon); within-architecture CV stays at most 15% across four architectures, but values differ by about 2x between architectural variants beyond depth alone. Empirical scope is algorithmic tasks (modular arithmetic, sparse parity) under AdamW; whether the law transfers to natural-language scale models is open.

2605.18844 2026-05-20 cs.LG cs.AI

Graph-Driven Cross-Industry Real-Time Monitoring Framework for Anti-Money Laundering Detection in Converged Mobility-Energy Supply Chain Networks

基于图的跨行业实时监控框架用于反洗钱检测在融合的移动-能源供应链网络

Rong Liu, Xiaojun Xiao, Zhanqing Su

AI总结 本文提出了一种基于图的跨行业实时反洗钱监控框架(GCRMF),用于整合的旅行-能源供应链网络,通过构建跨行业异构图并结合双图注意力网络,动态编码资本流动路径和时间演变特征,以提高跨行业洗钱行为的识别能力,并通过自监督在线学习机制实现实时适应和持续优化。

详情
AI中文摘要

随着旅行和能源行业的深度整合,跨行业供应链金融逐渐成为隐藏洗钱事件的高风险领域。为此,本文提出了一种基于图的跨行业实时反洗钱监控框架(GCRMF)用于整合的旅行-能源供应链网络。首先,构建了一个涵盖新能源汽车租赁平台、能源供应商、金融科技机构等的跨行业异构图(CIHG),并通过临时双图注意力网络(Temporal Dual-Graph Attention Network)整合行业语义,动态编码资本流动路径和时间演变特征。随后,为识别由合谋主体共同产生的结构性欺诈行为,提出了一种基于对比学习和分层图采样的元路径子图推理模块,以增强跨行业反复洗钱行为的识别能力。同时,采用自监督在线学习机制实现实时适应和持续优化以应对新的洗钱策略。实验结果表明,与现有跨行业场景下的图神经网络方法相比,GCRMF在F1分数上提高了超过17.8%,并显著降低了误报率。

英文摘要

With the deep integration of the travel and energy industries, cross-industry supply chain finance has gradually become a high-risk field of hidden money laundering incidents. For this reason, this work proposes a graph-driven cross-industry real-time anti-money laundering monitoring framework (GCRMF) for integrated travel - energy supply chain networks. First, a cross-industry heterogeneous graph (CIHG) covering new energy vehicle rental platforms, energy suppliers, fintech institutions, etc., is constructed, and industry semantics are integrated through temporarily Dual-GAT (Temporal Dual-Graph Attention Network), dynamically encoding capital flow paths and evolution features over time. Subsequently, in order to identify the structural fraud behavior together produced by colluding subjects, a meta-path subgraph reasoning module based on contrastive learning and hierarchical graph sampling is proposed to enhance the discrimination capability of cross-industry recurring money laundering behavior. Meanwhile, a self-supervised online learning mechanism is adopted for real-time adaptation and continuous optimization to new money laundering strategies. The experimental results show that compared with existing graph neural network methods in cross-industry scenarios, GCRMF improves the performance by more than 17.8% of F1 score and greatly reduces the false positive rate.

2605.18843 2026-05-20 cs.LG

TEMPO: Temporal Enforcement via Mode-Separated Policy Optimization for Trustworthy LLM Backtesting

TEMPO: 通过模式分离策略优化实现可信大语言模型回测的时序执行

Zeyu Zhang, Bradly C. Stadie

AI总结 本文提出TEMPO方法,通过模式分离策略优化,解决大语言模型回测中因泄露后截止日期知识导致的评估不准确问题,核心贡献是引入双模式奖励和基于GRPO的训练流程,有效减少知识泄露并提升任务性能。

Comments 9 pages in main context

详情
AI中文摘要

对大型语言模型进行历史事件回测需要仅基于截止日期之前可用的信息进行推理。然而,模型经常从预训练中泄露后截止日期的知识到推理过程中,导致看似准确度提高但破坏评估的有效性。基于提示的约束在被抑制内容与预测有因果关系时失效,而知识卸载无法解决此问题,因为时间合规性是实例特定的:同一事实可能对一个截止日期是合法证据,对另一个截止日期则为违规。而不是删除知识,模型必须学习时间纪律:选择受每个实例截止日期条件的证据。我们提出TEMPO(通过模式分离策略优化实现时序执行),通过两个贡献训练这种纪律:(1)一个双模式奖励,其中泄漏模式将后截止日期的主张驱动至零作为硬性前提,然后性能模式优化任务性能;(2)基于GRPO的训练流程,使模型能够发现时间有效的推理策略。我们证明训练单调减少泄露,收敛到无泄露最优解,并在合规后提升任务性能。在三个预测任务和两个模型上,TEMPO将泄露率从2~13%降至0.6~3.7%,在强预截止信号存在时任务性能提升6~13%,在预测任务本身困难时维持稳定。

英文摘要

Backtesting large language models on historical events requires reasoning exclusively from information available before a specified cutoff date. Yet models routinely leak post-cutoff knowledge from pre-training into their reasoning, inflating apparent accuracy and undermining evaluation validity. Prompt-based constraints fail when suppressed content is causally related to the prediction, and knowledge unlearning cannot address this problem because temporal compliance is instance-specific: the same fact may be legitimate evidence for one cutoff date and a violation for another. Rather than erasing knowledge, the model must learn temporal discipline: selecting evidence conditioned on each instance's cutoff date. We propose TEMPO (Temporal Enforcement via Mode-separated Policy Optimization), which trains this discipline via two contributions: (1) a two-mode reward where a leakage mode drives post-cutoff claims to zero as a hard prerequisite before a performance mode optimizes task performance; and (2) a GRPO-based training pipeline that enables the model to discover temporally valid reasoning strategies. We prove that training monotonically decreases leakage, converges to the leak-free optimum, and improves task performance once compliance is achieved. On three prediction tasks and two models, TEMPO reduces leakage from 2~13% to 0.6~3.7% across all conditions, with task performance improving 6~13% where strong pre-cutoff signals exist and maintained where the prediction task is inherently difficult from valid information alone.

2605.18842 2026-05-20 cs.LG

Safe Continual Reinforcement Learning under Nonstationarity via Adaptive Safety Constraints

在非平稳环境下通过自适应安全约束实现安全的持续强化学习

Timofey Tomashevskiy

AI总结 本文提出了一种结合三种自适应安全机制的框架,用于在非平稳环境下实现安全的持续强化学习,通过自适应约束机制减少分布偏移下的安全违规,同时保持任务性能。

Comments Preprint version

详情
AI中文摘要

在非平稳环境中进行安全强化学习需要能够适应环境变化的安全部件。标准的安全强化学习方法通常假设固定约束或稳定的环境条件,这在分布偏移下可能不足。我们提出了LILAC+,一个用于非平稳环境下安全持续强化学习的框架,结合了三种自适应安全机制:基于上下文的安全约束、适应速度约束和预算到状态的安全执行。基于上下文的约束通过推断和预测的环境上下文调整安全要求。适应速度约束在环境变化速率超过智能体安全适应能力时收紧安全要求。预算到状态执行将累积安全要求转换为本地状态级控制约束,可在决策时执行。这些机制共同提供了一种统一的方法,用于持续强化学习中的主动和反应性安全适应。我们在模拟驾驶环境中评估了该框架,在平稳、已见非平稳和未见非平稳条件下进行测试。结果表明,自适应安全约束在分布偏移下显著减少了安全违规,同时在与无约束和固定约束基线相比时保持了具有竞争力的任务性能。这些发现表明,安全的持续强化学习需要能够响应当前状态信息、预测的环境上下文、适应需求和剩余安全预算的自适应约束机制。

英文摘要

Safe reinforcement learning in nonstationary environments requires safety mechanisms that adapt as environmental conditions change. Standard safe reinforcement learning methods often assume fixed constraints or stable environmental conditions, which can become inadequate under distribution shift. We propose LILAC+, a framework for safe continual reinforcement learning under nonstationarity that combines three adaptive safety mechanisms: context-based safety constraints, adaptation-speed constraints, and budget-to-state safety enforcement. Context-based constraints adjust safety requirements using inferred and predicted environmental context. Adaptation-speed constraints tighten safety requirements when the rate of environmental change exceeds the agent's ability to adapt safely. Budget-to-state enforcement converts cumulative safety requirements into local state-level control constraints that can be enforced at decision time. Together, these mechanisms provide a unified approach for proactive and reactive safety adaptation in continual reinforcement learning. We evaluate the framework in simulated driving environments under stationary, seen nonstationary, and unseen nonstationary conditions. The results show that adaptive safety constraints substantially reduce safety violations under distribution shift while maintaining competitive task performance compared with unconstrained and fixed-constraint baselines. These findings suggest that safe continual reinforcement learning requires adaptive constraint mechanisms that respond not only to current state information but also to predicted environmental context, adaptation demand, and remaining safety budget.

2605.18841 2026-05-20 cs.LG

From Cumulative Constraints to Adaptive Runtime Safety Control for Nonstationary Reinforcement Learning

从累积约束到适应性运行时安全控制:非平稳强化学习

Timofey Tomashevskiy

AI总结 本文提出了一种适应性运行时安全控制机制CPSS,通过将累积安全预算转化为适应性的状态级控制约束,以应对非平稳强化学习中的安全问题,通过动态调整安全阈值来保证执行动作的安全性,同时在多个高速公路合并场景中验证了其有效性。

Comments 13 pages. Preprint version

详情
AI中文摘要

在强化学习中,安全性通常通过累积成本约束来指定,但这些轨迹级保证并不能直接防止不安全的个体决策,特别是在非平稳环境下。在连续和非平稳设置中,风险与相同动作在不同上下文中的关联性变化,而固定状态级阈值可能过于保守或过于宽松。我们提出Constraint Projection Safety Shield (CPSS),一种运行时机制,将累积安全预算转化为适应性的状态级控制约束。CPSS跟踪剩余安全预算,将其投影为时间变化的可接受风险阈值,并过滤预测安全成本超过活跃阈值的策略动作。阈值通过上下文信号在线调整,使得在更严格或快速变化的环境中执行更严格,在可用安全预算充足时则更宽松。我们分析了由此产生的保护策略,并证明该机制保证了执行动作的状态级阈值满足,诱导了有限时间累积成本界,并在干预频率和每步奖励扭曲方面给出了性能退化界。我们使用highway-env在非平稳高速公路合并场景中评估了CPSS。在多个种子下,CPSS显著减少了基于接近度的安全违规,并增加了分离边缘,同时选择性干预而不是主导学习的策略。这些结果支持了将累积安全规范转化为有效本地安全控制的适应性预算到阈值投影作为实际应用的方法。

英文摘要

Safety in reinforcement learning is often specified through cumulative cost constraints, but these trajectory-level guarantees do not directly prevent unsafe individual decisions, especially under nonstationarity. In continual and nonstationary settings, the difficulty is amplified because the risk associated with the same action can vary across contexts, while a fixed state-level threshold may be either too conservative or too weak. We propose Constraint Projection Safety Shield (CPSS), a runtime mechanism that converts a cumulative safety budget into adaptive state-level control constraints during execution. CPSS tracks the remaining safety budget, projects it into a time-varying admissible risk threshold, and filters policy actions whose predicted safety cost exceeds the active threshold. The threshold is adjusted online using contextual signals so that enforcement becomes stricter in more demanding or rapidly changing regimes and less restrictive when the available safety budget is sufficient. We analyze the resulting shielded policy and show that the mechanism guarantees per-state threshold satisfaction for executed actions, induces finite-horizon cumulative cost bounds, and yields a performance degradation bound in terms of intervention frequency and per-step reward distortion. We evaluate CPSS in nonstationary highway merging scenarios using highway-env. Across multiple seeds, CPSS substantially reduces proximity-based safety violations and increases separation margins while intervening selectively rather than dominating the learned policy. These results support adaptive budget-to-threshold projection as a practical way to transform cumulative safety specifications into effective local safety control for continual reinforcement learning systems.

2605.18839 2026-05-20 cs.LG cs.AI

An Integrated Forecasting Prototype for Emergency Department Boarding Time to Support Proactive Operational Decision Making

急诊部候诊时间集成预测原型:支持主动运营决策制定

Orhun Vural, Abdulaziz Ahmed, Ferhat Zengul, James Booth, Bunyamin Ozaydin

AI总结 本文提出了一种多时间跨度的时间序列预测框架,用于预测急诊部候诊时间,以支持主动的运营决策制定,通过整合真实世界数据和外部上下文数据源,如天气、节假日和重大本地事件,提高了预测准确性。

Comments 22 pages, including supplementary materials

详情
AI中文摘要

急诊部门(ED)的拥挤状况仍然是全球范围内持续存在的运营挑战,导致护理延误和后续拥堵。急诊部候诊时间,定义为被收治患者在等待住院床放置期间在急诊部停留的时间,是这种拥堵的关键指标。提前预测急诊部候诊时间可以实现主动的运营决策制定,防止拥堵加剧。我们开发并评估了多时间跨度的时间序列预测框架,以预测6、8、10、12和24小时的急诊部候诊时间。利用美国一所大学附属城市的大学附属医院的真实世界数据,并整合外部上下文数据源,包括天气、节假日和重大本地事件。基于分解的线性(DLinear)和基于标准化的线性(NLinear)时间序列预测深度学习模型在多个时间跨度上表现优异。模型还被评估了在极端拥堵场景下的表现,这些场景由较高的候诊时间特征化。此外,还开发了一个机器学习运维(MLOps)网页原型应用,以支持将预测框架转化为实际应用,通过整合数据摄入、预测可视化、实验和重新训练等功能。

英文摘要

Overcrowding in emergency departments (ED) remains a persistent operational challenge worldwide, causing delays in care delivery and downstream congestion. ED boarding time, defined as the duration admitted patients remain in the ED while awaiting inpatient bed placement, is a key indicator of this congestion. Predicting ED boarding time in advance enables proactive operational decision making before congestion escalates. We developed and evaluated a multi-horizon time series forecasting framework to predict ED boarding time at 6, 8, 10, 12, and 24-hour horizons. Real-world data from a university-affiliated urban hospital in the United States were utilized and integrated with external contextual data sources, including weather, holidays, and major local events. Decomposition-based Linear (DLinear) and Normalization-based Linear (NLinear) time series forecasting deep learning models showed superior performance across multiple horizons. Models were also evaluated under extreme congestion scenarios characterized by elevated boarding times. In addition, a Machine Learning Operations (MLOps) web application prototype was developed to support translation of the forecasting framework into practice through integrated data ingestion, forecast visualization, experimentation, and retraining.

2605.18837 2026-05-20 cs.LG cs.AI eess.SP

VCR: Learning Valid Contextual Representation for Incomplete Wearable Signals

VCR:学习不完整可穿戴信号的有效上下文表示

Yuxuan Weng, Wenhan Luo, Qijia Shao

AI总结 本文提出VCR框架,通过学习鲁棒于模态缺失的表示,解决可穿戴信号不完整问题,提升在多种健康监测任务中的性能和鲁棒性。

详情
AI中文摘要

可穿戴设备能够从多模态信号中实现连续健康监测,但实际部署受到有限标注数据和普遍传感器不完整性的阻碍。尽管大规模自监督预训练减少了对标签的依赖,但现有方法大多假设全模态可用性。目前处理模态缺失的方法通常重建整个缺失信号,这可能导致无法从观测传感器信号推断出的模态特定细节的幻觉,从而降低鲁棒性。我们提出VCR,一种自监督框架,学习提取对模态缺失具有鲁棒性的表示。VCR采用正交分词器,通过校正潜在流形并应用几何投影,严格分离每个模态到共享语义和模态特定残差。这种设计在保持完整信息完整性的同时,为模态缺失下的稳健学习提供了结构基础。所生成的标记由一个缺失感知的混合专家背骨处理,能够适应不同模式的模态可用性。通过将目标限制为仅重建缺失模态的共享组件,VCR有效减轻了无法推断的模态特定细节的幻觉。在多个健康监测任务中,VCR在完整、单缺失和多缺失模态设置下,相比强大的监督和自监督基线,一致提升了性能和鲁棒性。

英文摘要

Wearable devices enable continuous health monitoring from multimodal signals, but real-world deployment is hindered by limited labeled data and pervasive sensor incompleteness. While large-scale self-supervised pretraining reduces label dependence, most existing methods assume full modality availability. Current approaches for handling modality missingness often reconstruct entire absent signals, which can encourage hallucinating modality-specific details that are not inferable from the observed sensor signals and degrade robustness. We propose VCR, a self-supervised framework that learns to extract valid representations robust to modality missingness. VCR employs an orthogonal tokenizer to enforce strict orthogonal disentanglement by rectifying latent manifolds and applying a geometric projection, separating each modality into shared semantics and modality-specific residuals. This design preserves complete information integrity while serving as a structural foundation for robust learning under modality missingness. The resulting tokens are processed by a missing-aware mixture-of-experts backbone that adapts to varying patterns of modality availability. By constraining the objective to reconstruct only the shared components of missing modalities, VCR effectively mitigates hallucinations of non-inferable modality-specific details. Across multiple health monitoring tasks, VCR consistently improves performance and robustness under full, single-missing, and multiple-missing modality settings compared with strong supervised and self-supervised baselines.

2605.18836 2026-05-20 cs.LG cs.CV

Spectral Gradient Surgery for Domain-Generalizable Dataset Distillation

谱梯度手术用于领域通用化数据集蒸馏

Minyoung Oh, Najeong Chae, Jae-Young Sim

AI总结 本文提出了一种新的数据集蒸馏方法,即领域通用化数据集蒸馏(DGDD),通过谱梯度手术(SGS)来提升蒸馏数据集对超出分布(OOD)的泛化能力,同时保持与现有数据集蒸馏方法的兼容性。

Comments 17pages

详情
AI中文摘要

数据集蒸馏(DD)合成一个紧凑的合成数据集,以保留完整数据集的训练效用。然而,其标准公式假设测试数据遵循与训练数据相同的分布,这一假设在实践中很少成立。一种直接的扩展——将事后域泛化(DG)技术应用于蒸馏数据——并不合适,因为现有DG方法依赖于真实数据集的自然多样性,而压缩的合成集本质上缺乏这种多样性,同时还会带来显著的增强开销,这与数据集蒸馏的效率目标相冲突。为了解决这一限制,我们引入了领域通用化数据集蒸馏(DGDD),一种新的问题设定,明确针对蒸馏数据集的超出分布泛化。我们通过广泛采用的DD基线分布匹配(DM)来研究这一问题。我们将DM的超出分布脆弱性归因于压缩合成集中类判别信息和领域特定信息的纠缠,并提出谱梯度手术(SGS)来解纠缠。SGS的关键见解是跨域在谱域中的梯度一致性和跨域梯度组件的共享揭示了哪些梯度组件在源域之间共享——因此是类判别性的——以及哪些是领域特定的。基于这一观察,SGS在标准DM更新中添加了两个互补的梯度:一个强化跨域共享组件,另一个促进蒸馏数据集内的多样性。在多样规模基准上的广泛实验表明,SGS在提升超出分布泛化的同时,仍保持与现有DM方法的即插即用兼容性。

英文摘要

Dataset Distillation (DD) synthesizes a compact synthetic dataset that preserves the training utility of a full dataset. However, its standard formulation assumes that test data follow the same distribution as training data, an assumption that rarely holds in practice. A straightforward extension-applying post-hoc Domain Generalization (DG) techniques to distilled data-is ill-suited because existing DG methods rely on the natural diversity of real datasets, which compact synthetic sets inherently lack, while also incurring substantial augmentation overhead that conflicts with the efficiency objective of dataset distillation. To address this limitation, we introduce Domain Generalizable Dataset Distillation (DGDD), a new problem setting that explicitly targets out-of-distribution (OOD) generalization of distilled datasets. We study this problem through a widely adopted DD baseline of Distribution Matching (DM). We attribute the OOD vulnerability of DM to the entanglement of class-discriminative and domain-specific information within the compressed synthetic set, and propose Spectral Gradient Surgery (SGS) to disentangle the two. The key insight of SGS is that cross-domain agreement among domain-wise gradients in the spectral domain reveals which gradient components are shared across source domains-and are therefore class-discriminative-and which are domain-specific. Based on this observation, SGS augments the standard DM update with two complementary gradients: one that reinforces cross-domain shared components and another that explicitly promotes diversity within the distilled dataset. Extensive experiments on diverse-scale benchmarks demonstrate that SGS substantially improves OOD generalization while remaining plug-and-play compatible with existing DM methods.

2605.18835 2026-05-20 cs.LG

StampFormer: A Physics-Guided Material-Geometry-Coupled Multimodal Model for Rapid Prediction of Physical Fields in Sheet Metal Stamping

StampFormer: 一种基于物理的材料-几何耦合多模态模型,用于快速预测冲压板料的物理场

Jiajie Luo, Mohamed Mohamed, Osama Hassan, Haosu Zhou, Yingxue Zhao, Haoran Li, Xinrun Li, Zhutao Shao, Yang Long, Nan Li, Jichun Li

AI总结 本文提出StampFormer模型,通过结合材料和几何信息,实现对冲压板料物理场的快速准确预测,从而提高设计效率。

详情
AI中文摘要

传统冲压板料成型依赖于耗时且昂贵的有限元分析(FEA)进行设计验证,这一过程显著延长了设计周期。虽然代理模型提供了更快的迭代速度,但现有方法存在局限:标量方法无法捕捉全面的基于场的FEA结果,而现有基于图像的方法往往忽略了材料属性的关键作用,仅关注几何。为解决这一差距,我们开发了一种基于物理的深度学习框架,即StampFormer,该框架同时利用组件几何和材料应力-应变响应来预测FEA结果。StampFormer框架使用三个核心组件处理数据。首先,材料增强的几何网络(MAGN)融合几何和材料数据。然后,通过层次化材料嵌入注入单元(HMEIU)在不同层次上整合信息,再由主网络骨干,即改进的Swin-UNet进行处理。我们在交叉件面板冲压上评估了我们的模型,使用两个模拟数据集进行钢和铝板的冲压模拟,结果表明,StampFormer在不到一秒的时间内提供了高保真的关键物理场预测,包括薄化、主应变、次应变、塑性应变和位移。与真实FEA相比,我们的模型在四个二维场上的平均相对误差小于8.5%,在三维位移场上的均方误差小于1.2 mm²。总之,我们介绍了一种实用且高效的框架,整合了多模态信息,即几何和材料属性,以提供快速且准确的预测,使设计师能够进行实时的可制造性评估。

英文摘要

Traditional sheet metal forming relies on time-consuming and expensive Finite Element Analysis (FEA) for design validation, a process that significantly prolongs design cycles. While surrogate models offer faster iteration, current approaches have limitations: scalar-based methods cannot capture comprehensive field-based FEA results, while existing image-based models often ignore the critical role of material properties by focusing solely on geometry. To address this gap, we develop a physics-guided deep learning framework, namely StampFormer, which simultaneously uses component geometry and material stress-strain responses to predict FEA outcomes. The StampFormer framework uses three core components to process data. A Material-Augmented Geometric Network (MAGN) first fuses geometric and material data. This information is then integrated at various levels by a Hierarchical Material Embedding Injection Unit (HMEIU) before being processed by the primary network backbone, an adapted Swin-UNet. We evaluated our model on the stamping of a crossmember panel with two simulation datasets for steel and aluminium panels, and results demonstrate that StampFormer provides high-fidelity predictions of critical physical fields - including thinning, major strain, minor strain, plastic strain, and displacement - in under a second. Compared with ground truth FEA, our model achieved an average relative error of less than 8.5% on the four 2D fields and a mean squared error of less than 1.2 mm2 for the 3D displacement field. In summary, we introduce a practical and efficient framework that integrates multimodal information, namely geometry and material properties, to provide fast and accurate predictions, enabling designers to perform real-time manufacturability assessments.

2605.18832 2026-05-20 cs.LG cs.AI

Precision Tracked Transformer via Kalman Filtering, Kriging and Process Noise

通过卡尔曼滤波、克里格法和过程噪声的精确跟踪变压器

Bo Long, Deepak Agarwal, Jelena Markovic-Voronov, Yi Wang, Liuqing Li

AI总结 本文提出了一种基于贝叶斯滤波的变压器(BFT),通过引入精度权重的克里格法、自适应卡尔曼更新和动态模型,解决了传统变压器在处理不确定性方面的不足,提升了序列推荐和大语言模型在噪声环境下的鲁棒性。

详情
AI中文摘要

Transformer是现代AI的基础构建块,但其缺乏对不确定性的原则性处理,这在实际应用中普遍存在:序列推荐中的冷启动标记具有稀疏的历史,语言模型中的异质信号质量,以及由无约束softmax引起的注意力 sinks。每个token都被统一的置信度处理。我们证明这种统一性是我们的贝叶斯滤波变压器(BFT)的退化情况:注意力变为精度加权克里格法,残差连接变为具有自适应增益的卡尔曼更新,FFN变为通过雅可比矩阵加过程噪声规则传播精度的动力学模型。观测精度来自一个无参数的受限最大似然(REML)估计器,具有共轭贝叶斯先验。BFT将任何Transformer层替换为几乎无开销。在序列推荐中,BFT应用于三种主要架构,在六个基准上获得显著提升,其中在冷启动用户和稀有物品上改进最大。在具有噪声数据的监督微调中,BFT在两个领域提高了鲁棒性:噪声监督(问答中的token-标签腐败)和噪声上下文(具有真实RAG干扰项的检索增强问答)。单个原则性修改——恢复精度——在经典序列建模和现代LLM领域中释放了大量空间。

英文摘要

The Transformer is the foundational building block of modern AI, yet offers no principled handling of \emph{uncertainty}, which is prevalent in real applications: cold-start tokens with sparse histories in sequential recommendation, heterogeneous signal quality in language models, and attention sinks induced by unconstrained softmax. Every token is treated with uniform confidence. We show this uniformity is a degenerate case of our \emph{Bayesian Filtering Transformer} (BFT): attention becomes precision-weighted kriging, the residual connection becomes a Kalman update with adaptive gain, and the FFN becomes a dynamics model propagating precision via a Jacobian--plus--process-noise rule. Observation precision comes from a parameter-free Restricted Maximum Likelihood (REML) estimator with a conjugate Bayesian prior. BFT replaces any Transformer layer with negligible overhead. On sequential recommendation, BFT applied to three major architectures yields significant gains on six benchmarks, with the largest improvements on cold-start users and rare items where uncertainty is highest. On supervised fine-tuning of large language models with noisy data, BFT improves robustness in two regimes: noisy supervision (token-label corruption in question answering) and noisy context (retrieval-augmented QA with real RAG distractors). A single principled modification -- restoring precision -- unlocks substantial headroom across both classical sequence-modeling and modern LLM regimes.

2605.18830 2026-05-20 cs.LG

In-Context Learning Operates as Concept Subspace Learning

基于情境学习的概念子空间学习

Wei Tang, Xinyan Jiang, Fakhri Karray, Lijie Hu

AI总结 本文研究了结构化演示是否诱导低维概念推理,通过概念子空间视角揭示了情境学习中预测分解为概念坐标回归和子空间泄漏的机制,并通过实验验证了任务信息集中在低维任务对齐激活子空间中的结论。

详情
AI中文摘要

回归和贝叶斯对情境学习(ICL)的解释说明了演示如何诱导预测器,而机械分析通常识别出紧凑的激活方向,引导受促行为。然而,仍不清楚结构化演示是否诱导低维概念推理。我们通过概念子空间视角研究这一问题,在此视角中,任务仅沿内在概念坐标变化,尽管输入观察在高维环境空间中。对于岭回归和最小二乘ICL代理,预测精确分解为概念坐标回归和子空间泄漏。在块对角或近块对角协方差假设下,主导估计和噪声敏感项随概念子空间的维度变化,而残差效应由跨子空间耦合控制。这种分离给出了机械预测:可恢复的任务信息应集中在低维、任务对齐的激活子空间中。在CounterFact衍生的多关系提示上使用Llama-3-8B,4096维残差流的68-73维子空间恢复了78.8%的干净-受污染准确率差距,而补全互补子空间则恢复了0%。概念交换将预测引导至注入的关系,而随机和跨任务匹配排名控制效果不大。此外,在Qwen2.5-7B和受控的跨语言规则任务上的额外实验显示了相同定性模式。这些结果支持概念子空间作为紧凑、任务对齐的可恢复ICL行为在结构化任务家族中的中介,而不意味着全电路恢复。

英文摘要

Regression and Bayesian accounts of in-context learning (ICL) explain how demonstrations can induce predictors, while mechanistic analyses often identify compact activation directions that steer prompted behavior. However, it remains unclear whether structured demonstrations induce low-dimensional concept inference. We study this question through a concept-subspace view of ICL, in which tasks vary only along intrinsic concept coordinates, although inputs are observed in a high-dimensional ambient space. For ridge and least-squares ICL proxies, prediction decomposes exactly into concept-coordinate regression and off-subspace leakage. Under block-diagonal or near-block-diagonal covariance assumptions, the leading estimation and nuisance-sensitivity terms scale with the dimension of the concept subspace, while residual effects are controlled by cross-subspace coupling. This separation gives a mechanistic prediction: recoverable task information should concentrate in a low-dimensional, task-aligned activation subspace. On CounterFact-derived multi-relation prompts with Llama-3-8B, a 68--73-dimensional subspace of the 4096-dimensional residual stream restores 78.8% of the clean--corrupted accuracy gap, whereas patching the complementary subspace restores 0%. Concept swaps redirect predictions toward injected relations, while random and cross-task matched-rank controls are largely ineffective. Additional experiments on Qwen2.5-7B and a controlled cross-lingual rule task show the same qualitative pattern. These results support concept subspaces as compact, task-aligned mediators of recoverable ICL behavior in structured task families, without implying full-circuit recovery.

2605.18829 2026-05-20 cs.LG cs.CR

Lossless Anti-Distillation Sampling

无损反蒸馏采样

Zibo Diao, Jingchu Gai, Xinyue Ai, Zhang Zhang, Zhenyu He, Di He

AI总结 本文提出了一种无损反蒸馏采样方法,通过在保持良性用户体验的同时,有效对抗多账号蒸馏攻击,降低蒸馏模型的泛化能力。

详情
AI中文摘要

面向商业生成模型的前沿领域,蒸馏攻击正成为日益严峻的威胁。蒸馏者通过收集生成响应并以极低的成本训练自己的竞争模型。现有防御措施要么依赖于修改模型输出,从而牺牲良性用户的响应质量,要么依赖于行为检测方法,这些方法可以通过在多个账户上分布查询来轻易绕过。在本工作中,我们提出了无损反蒸馏采样(LADS),一种专门设计用于对抗多账号蒸馏同时保持良性用户体验的新型采样方案。具体而言,LADS从由查询的语义内容和用户查询次数决定的私有种子中推导出每种生成的随机性。通过构造,每个良性用户在每次访问时都会独立地从原始模型中采样响应,因此不会产生失真。相反,对于蒸馏者,不同账户在相同语义桶中的查询会共享潜在随机性。因此,收集的数据变得相关,可能降低样本多样性并损害泛化能力。利用统一收敛理论,我们证明LADS在无条件和条件生成设置中,能够证明降低蒸馏者泛化差距的收敛率相对于标准i.i.d.采样。在图像生成、数学推理和代码生成的实验中,证实LADS显著降低蒸馏学生的表现,同时保持对单个用户的精确统计保真度。

英文摘要

Frontier commercial generative models face a growing threat from distillation, whereby a distiller harvests generated responses and trains a competing model of its own at drastically lower cost. Existing defenses either rely on modifying the models outputs, thereby sacrificing response quality for benign users, or on behavioral detection methods, which can be readily circumvented by distributing queries across multiple accounts. In this work, we propose Lossless Anti-Distillation Sampling (LADS), a novel sampling scheme specifically designed to counter multi-account distillation while maintaining a lossless experience for benign users. Concretely, LADS derives the randomness underlying each generation from a private seed determined by the semantic content of the query and the number of times the user has queried the model. By construction, every benign user receives a response independently sampled from the original model at each visit, and thus experiences no distortion. In contrast, for a distiller, different accounts share latent randomness whenever their queries fall in the same semantic bucket. As a result, the harvested data becomes correlated, potentially reducing sample diversity and degrading generalization. Using uniform convergence theory, we show that LADS provably degrades the convergence rate of the distillers generalization gap relative to standard i.i.d. sampling in both unconditional and conditional generation settings. Experiments on image generation, mathematical reasoning, and code generation confirm that LADS substantially degrades the performance of distilled students while preserving exact statistical fidelity for individual users.

2605.18826 2026-05-20 cs.LG cs.AI

The Routing and Filtering Structure of Attention

注意力的路由和过滤结构

Shafayeth Jamil, Rehan Kapadia

AI总结 本文研究了注意力机制中的路由和过滤结构,通过分解1776个预训练Transformer的头部,发现路由在低秩状态下运行,并引入S-D注意力作为诊断参数化方法,分离路由和过滤,实现稳定训练和有效降维。

Comments 13 pages, 7 figures

详情
AI中文摘要

注意力交互矩阵$QK^{ op}$包含两个交织的计算:一个斜对称成分用于在位置间重新分配信息(路由),一个对称成分用于缩放相互相关性(过滤)。我们分解了五个预训练Transformer中的1776个头部,发现路由在低秩状态下运行,远低于权重核分配的路由能力。我们引入了S-D注意力作为诊断参数化方法,通过构造分离路由和过滤,保证稳定性($\mathrm{Re}(λ) \le 0$)并稳定训练而无需层归一化。当分离和未归一化时,路由自组织成一个谱级联,第一层的有效秩为2,随着深度扩展到六个尺度,从7M到355M参数。级联预测了注意力可以简化的位置:线性化125M S-D注意力的前七层成本低于5%的困惑度,而标准注意力在相同干预下崩溃。可线性化的区域随着深度扩大。用ELU+1线性注意力替换前四层,可在完整头部维度内达到基线的1.4%以内。级联分配的架构用注意力参数换取困惑度(47%-65%更少的注意力参数,+3.9%到+8.4% PPL)。路由-过滤分解使谱预算变得清晰;级联使其具有可操作性。

英文摘要

The attention interaction matrix $QK^{\top}$ contains two entangled computations: a skew-symmetric component that redistributes information between positions (routing) and a symmetric component that scales mutual relevance (filtering). We decompose 1776 heads across five pretrained transformers and find routing operating at low rank, well below the routing capacity allocated by the weight kernel. We introduce $S$-$D$ attention as a diagnostic parameterization that disentangles routing from filtering by construction with guaranteed stability ($\mathrm{Re}(λ) \le 0$) and trains stably without layer normalization. When disentangled and unnormalized, routing self-organizes into a spectral cascade, effective rank $2$ at the first layer, expanding with depth across six scales from 7M to 355M parameters. The cascade predicts where attention can be simplified: linearizing the first seven layers of 125M $S$-$D$ attention costs ${<}5\%$ perplexity, whereas standard attention collapses under the same intervention. The linearizable region widens with depth. Replacing the first four layers with ELU+1 linear attention reaches within $1.4\%$ of baseline at full head dimension. Cascade-allocated architectures trade attention parameters for perplexity ($47\%-65\%$ fewer attention parameters at $+3.9\%$ to $+8.4\%$ PPL). The routing-filtering decomposition makes the spectral budget legible; the cascade makes it actionable.

2605.18825 2026-05-20 cs.LG cs.DC

Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

并非所有标记都值得缓存:学习语义感知的淘汰策略用于LLM前缀缓存

Shaoke Fang, Ziang Li, Wenfei Wu, Jiatong Ji, Qingsong Liu, Ruizhi Pu

AI总结 本文提出了一种语义感知的前缀缓存淘汰策略SAECache,通过多队列架构、语义感知的标记加权机制和全适应的在线学习方案,提高了LLM服务中前缀缓存的效率,从而在不同工作负载下实现了显著的TTFT提升。

详情
AI中文摘要

前缀缓存是大型语言模型(LLM)服务中的关键优化,通过重用注意力键值(KV)状态来减少昂贵的prefill计算。然而,其效益依赖于淘汰策略,因为GPU内存有限,而现有策略如LRU通常将缓存块视为统一处理。这种观点忽略了LLM提示的一个基本属性:并非所有标记都同样值得缓存。我们显示,提示中不同的标记类型,包括系统提示、用户查询、工具输出、模型响应和推理链,其重用率可能高达756倍,但现有淘汰策略并未利用这一信号。在本文中,我们提出了SAECache(语义适应的前缀缓存淘汰策略),通过三个创新来解决这一差距:(1)一个多队列架构,将KV块路由到任务特定的队列中,使用定制的优先级指标,捕捉多轮请求中的会话重用和模板单轮请求中的结构重用;(2)一种语义感知的标记加权机制,通过淘汰反馈在线学习不同标记类型的重用价值;(3)一种完全适应的在线学习方案,用于所有参数更新,包括对数正态时间参数、位置衰减幂、队列权重和元参数,这消除了手动调优并使系统能够自动适应部署特定的工作负载特性。通过在异构工作负载上的广泛评估,我们证明SAECache在生产风格的基线之上实现了1.4x-2.7x的TTFT提升,而固定参数的替代方案在工作负载不匹配时可能会下降高达2.7x,这是我们的自适应方法完全避免的失败模式。

英文摘要

Prefix caching is a key optimization in Large Language Model (LLM) serving, reusing attention Key-Value (KV) states across requests with shared prompt prefixes to reduce expensive prefill computation. However, its benefit depends critically on the eviction policy as GPU memory is scarce, and existing policies such as LRU largely treat cached blocks uniformly. This view ignores a fundamental property of LLM prompts: not all tokens are equally worth caching. We show that different token types within a prompt, including system prompts, user queries, tool outputs, model responses, and chain-of-thought reasoning, exhibit up to 756x variation in reuse rates, yet no existing eviction policy exploits this signal. In this paper, we present SAECache (Semantic-Adaptive Eviction for prefix caches), a semantic-adaptive prefix cache eviction policy that addresses this gap through three innovations: (1) a multi-queue architecture that routes KV blocks to task-specific queues with tailored priority metrics, capturing both session reuse in multi-turn requests and structural reuse in templated single-turn requests; (2) a semantic-aware token weighting mechanism that learns the reuse value of different token types online through eviction feedback; and (3) a fully adaptive online learning schema for all parameter updates, including log-normal timing parameters, position decay power, queue weights, and meta-parameters, which eliminates manual tuning and enables automatic adaptation to deployment-specific workload characteristics. Through extensive evaluation across heterogeneous workloads, we demonstrate that SAECache achieves 1.4x-2.7x TTFT improvement over production-style baselines, while fixed-parameter alternatives can degrade by up to 2.7x under workload mismatch -- a failure mode our adaptive approach avoids entirely.