arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2154
专题追踪
2605.18841 2026-05-20 cs.LG

From Cumulative Constraints to Adaptive Runtime Safety Control for Nonstationary Reinforcement Learning

从累积约束到适应性运行时安全控制:非平稳强化学习

Timofey Tomashevskiy

发表机构 * McMaster Centre for Software Certification(麦斯特软件认证中心) Department of Computing and Software(计算与软件系) McMaster University(麦斯特大学)

AI总结 本文提出了一种适应性运行时安全控制机制CPSS,通过将累积安全预算转化为适应性的状态级控制约束,以应对非平稳强化学习中的安全问题,通过动态调整安全阈值来保证执行动作的安全性,同时在多个高速公路合并场景中验证了其有效性。

Comments 13 pages. Preprint version

详情
AI中文摘要

在强化学习中,安全性通常通过累积成本约束来指定,但这些轨迹级保证并不能直接防止不安全的个体决策,特别是在非平稳环境下。在连续和非平稳设置中,风险与相同动作在不同上下文中的关联性变化,而固定状态级阈值可能过于保守或过于宽松。我们提出Constraint Projection Safety Shield (CPSS),一种运行时机制,将累积安全预算转化为适应性的状态级控制约束。CPSS跟踪剩余安全预算,将其投影为时间变化的可接受风险阈值,并过滤预测安全成本超过活跃阈值的策略动作。阈值通过上下文信号在线调整,使得在更严格或快速变化的环境中执行更严格,在可用安全预算充足时则更宽松。我们分析了由此产生的保护策略,并证明该机制保证了执行动作的状态级阈值满足,诱导了有限时间累积成本界,并在干预频率和每步奖励扭曲方面给出了性能退化界。我们使用highway-env在非平稳高速公路合并场景中评估了CPSS。在多个种子下,CPSS显著减少了基于接近度的安全违规,并增加了分离边缘,同时选择性干预而不是主导学习的策略。这些结果支持了将累积安全规范转化为有效本地安全控制的适应性预算到阈值投影作为实际应用的方法。

英文摘要

Safety in reinforcement learning is often specified through cumulative cost constraints, but these trajectory-level guarantees do not directly prevent unsafe individual decisions, especially under nonstationarity. In continual and nonstationary settings, the difficulty is amplified because the risk associated with the same action can vary across contexts, while a fixed state-level threshold may be either too conservative or too weak. We propose Constraint Projection Safety Shield (CPSS), a runtime mechanism that converts a cumulative safety budget into adaptive state-level control constraints during execution. CPSS tracks the remaining safety budget, projects it into a time-varying admissible risk threshold, and filters policy actions whose predicted safety cost exceeds the active threshold. The threshold is adjusted online using contextual signals so that enforcement becomes stricter in more demanding or rapidly changing regimes and less restrictive when the available safety budget is sufficient. We analyze the resulting shielded policy and show that the mechanism guarantees per-state threshold satisfaction for executed actions, induces finite-horizon cumulative cost bounds, and yields a performance degradation bound in terms of intervention frequency and per-step reward distortion. We evaluate CPSS in nonstationary highway merging scenarios using highway-env. Across multiple seeds, CPSS substantially reduces proximity-based safety violations and increases separation margins while intervening selectively rather than dominating the learned policy. These results support adaptive budget-to-threshold projection as a practical way to transform cumulative safety specifications into effective local safety control for continual reinforcement learning systems.

2605.18839 2026-05-20 cs.LG cs.AI

An Integrated Forecasting Prototype for Emergency Department Boarding Time to Support Proactive Operational Decision Making

急诊部候诊时间集成预测原型:支持主动运营决策制定

Orhun Vural, Abdulaziz Ahmed, Ferhat Zengul, James Booth, Bunyamin Ozaydin

发表机构 * Department of Electrical and Computer Engineering, University of Alabama at Birmingham(阿拉巴马大学伯明翰分校电气与计算机工程系) Department of Health Services Administration, University of Alabama at Birmingham(阿拉巴马大学伯明翰分校卫生服务管理系) Department of Biomedical Informatics and Data Science, Heersink School of Medicine, University of Alabama at Birmingham(阿拉巴马大学伯明翰分校希尔斯医院医学院生物医学信息学与数据科学系) Department of Emergency Medicine, University of Alabama at Birmingham(阿拉巴马大学伯明翰分校急诊医学系)

AI总结 本文提出了一种多时间跨度的时间序列预测框架,用于预测急诊部候诊时间,以支持主动的运营决策制定,通过整合真实世界数据和外部上下文数据源,如天气、节假日和重大本地事件,提高了预测准确性。

Comments 22 pages, including supplementary materials

详情
AI中文摘要

急诊部门(ED)的拥挤状况仍然是全球范围内持续存在的运营挑战,导致护理延误和后续拥堵。急诊部候诊时间,定义为被收治患者在等待住院床放置期间在急诊部停留的时间,是这种拥堵的关键指标。提前预测急诊部候诊时间可以实现主动的运营决策制定,防止拥堵加剧。我们开发并评估了多时间跨度的时间序列预测框架,以预测6、8、10、12和24小时的急诊部候诊时间。利用美国一所大学附属城市的大学附属医院的真实世界数据,并整合外部上下文数据源,包括天气、节假日和重大本地事件。基于分解的线性(DLinear)和基于标准化的线性(NLinear)时间序列预测深度学习模型在多个时间跨度上表现优异。模型还被评估了在极端拥堵场景下的表现,这些场景由较高的候诊时间特征化。此外,还开发了一个机器学习运维(MLOps)网页原型应用,以支持将预测框架转化为实际应用,通过整合数据摄入、预测可视化、实验和重新训练等功能。

英文摘要

Overcrowding in emergency departments (ED) remains a persistent operational challenge worldwide, causing delays in care delivery and downstream congestion. ED boarding time, defined as the duration admitted patients remain in the ED while awaiting inpatient bed placement, is a key indicator of this congestion. Predicting ED boarding time in advance enables proactive operational decision making before congestion escalates. We developed and evaluated a multi-horizon time series forecasting framework to predict ED boarding time at 6, 8, 10, 12, and 24-hour horizons. Real-world data from a university-affiliated urban hospital in the United States were utilized and integrated with external contextual data sources, including weather, holidays, and major local events. Decomposition-based Linear (DLinear) and Normalization-based Linear (NLinear) time series forecasting deep learning models showed superior performance across multiple horizons. Models were also evaluated under extreme congestion scenarios characterized by elevated boarding times. In addition, a Machine Learning Operations (MLOps) web application prototype was developed to support translation of the forecasting framework into practice through integrated data ingestion, forecast visualization, experimentation, and retraining.

2605.18837 2026-05-20 cs.LG cs.AI eess.SP

VCR: Learning Valid Contextual Representation for Incomplete Wearable Signals

VCR:学习不完整可穿戴信号的有效上下文表示

Yuxuan Weng, Wenhan Luo, Qijia Shao

发表机构 * The Hong Kong University of Science and Technology(香港科学与技术大学)

AI总结 本文提出VCR框架,通过学习鲁棒于模态缺失的表示,解决可穿戴信号不完整问题,提升在多种健康监测任务中的性能和鲁棒性。

详情
AI中文摘要

可穿戴设备能够从多模态信号中实现连续健康监测,但实际部署受到有限标注数据和普遍传感器不完整性的阻碍。尽管大规模自监督预训练减少了对标签的依赖,但现有方法大多假设全模态可用性。目前处理模态缺失的方法通常重建整个缺失信号,这可能导致无法从观测传感器信号推断出的模态特定细节的幻觉,从而降低鲁棒性。我们提出VCR,一种自监督框架,学习提取对模态缺失具有鲁棒性的表示。VCR采用正交分词器,通过校正潜在流形并应用几何投影,严格分离每个模态到共享语义和模态特定残差。这种设计在保持完整信息完整性的同时,为模态缺失下的稳健学习提供了结构基础。所生成的标记由一个缺失感知的混合专家背骨处理,能够适应不同模式的模态可用性。通过将目标限制为仅重建缺失模态的共享组件,VCR有效减轻了无法推断的模态特定细节的幻觉。在多个健康监测任务中,VCR在完整、单缺失和多缺失模态设置下,相比强大的监督和自监督基线,一致提升了性能和鲁棒性。

英文摘要

Wearable devices enable continuous health monitoring from multimodal signals, but real-world deployment is hindered by limited labeled data and pervasive sensor incompleteness. While large-scale self-supervised pretraining reduces label dependence, most existing methods assume full modality availability. Current approaches for handling modality missingness often reconstruct entire absent signals, which can encourage hallucinating modality-specific details that are not inferable from the observed sensor signals and degrade robustness. We propose VCR, a self-supervised framework that learns to extract valid representations robust to modality missingness. VCR employs an orthogonal tokenizer to enforce strict orthogonal disentanglement by rectifying latent manifolds and applying a geometric projection, separating each modality into shared semantics and modality-specific residuals. This design preserves complete information integrity while serving as a structural foundation for robust learning under modality missingness. The resulting tokens are processed by a missing-aware mixture-of-experts backbone that adapts to varying patterns of modality availability. By constraining the objective to reconstruct only the shared components of missing modalities, VCR effectively mitigates hallucinations of non-inferable modality-specific details. Across multiple health monitoring tasks, VCR consistently improves performance and robustness under full, single-missing, and multiple-missing modality settings compared with strong supervised and self-supervised baselines.

2605.18836 2026-05-20 cs.LG cs.CV

Spectral Gradient Surgery for Domain-Generalizable Dataset Distillation

谱梯度手术用于领域通用化数据集蒸馏

Minyoung Oh, Najeong Chae, Jae-Young Sim

发表机构 * Graduate School of Artificial Intelligence(人工智能研究生院) Ulsan National Institute of Science and Technology (UNIST)(乌山国立科学与技术研究院(UNIST))

AI总结 本文提出了一种新的数据集蒸馏方法,即领域通用化数据集蒸馏(DGDD),通过谱梯度手术(SGS)来提升蒸馏数据集对超出分布(OOD)的泛化能力,同时保持与现有数据集蒸馏方法的兼容性。

Comments 17pages

详情
AI中文摘要

数据集蒸馏(DD)合成一个紧凑的合成数据集,以保留完整数据集的训练效用。然而,其标准公式假设测试数据遵循与训练数据相同的分布,这一假设在实践中很少成立。一种直接的扩展——将事后域泛化(DG)技术应用于蒸馏数据——并不合适,因为现有DG方法依赖于真实数据集的自然多样性,而压缩的合成集本质上缺乏这种多样性,同时还会带来显著的增强开销,这与数据集蒸馏的效率目标相冲突。为了解决这一限制,我们引入了领域通用化数据集蒸馏(DGDD),一种新的问题设定,明确针对蒸馏数据集的超出分布泛化。我们通过广泛采用的DD基线分布匹配(DM)来研究这一问题。我们将DM的超出分布脆弱性归因于压缩合成集中类判别信息和领域特定信息的纠缠,并提出谱梯度手术(SGS)来解纠缠。SGS的关键见解是跨域在谱域中的梯度一致性和跨域梯度组件的共享揭示了哪些梯度组件在源域之间共享——因此是类判别性的——以及哪些是领域特定的。基于这一观察,SGS在标准DM更新中添加了两个互补的梯度:一个强化跨域共享组件,另一个促进蒸馏数据集内的多样性。在多样规模基准上的广泛实验表明,SGS在提升超出分布泛化的同时,仍保持与现有DM方法的即插即用兼容性。

英文摘要

Dataset Distillation (DD) synthesizes a compact synthetic dataset that preserves the training utility of a full dataset. However, its standard formulation assumes that test data follow the same distribution as training data, an assumption that rarely holds in practice. A straightforward extension-applying post-hoc Domain Generalization (DG) techniques to distilled data-is ill-suited because existing DG methods rely on the natural diversity of real datasets, which compact synthetic sets inherently lack, while also incurring substantial augmentation overhead that conflicts with the efficiency objective of dataset distillation. To address this limitation, we introduce Domain Generalizable Dataset Distillation (DGDD), a new problem setting that explicitly targets out-of-distribution (OOD) generalization of distilled datasets. We study this problem through a widely adopted DD baseline of Distribution Matching (DM). We attribute the OOD vulnerability of DM to the entanglement of class-discriminative and domain-specific information within the compressed synthetic set, and propose Spectral Gradient Surgery (SGS) to disentangle the two. The key insight of SGS is that cross-domain agreement among domain-wise gradients in the spectral domain reveals which gradient components are shared across source domains-and are therefore class-discriminative-and which are domain-specific. Based on this observation, SGS augments the standard DM update with two complementary gradients: one that reinforces cross-domain shared components and another that explicitly promotes diversity within the distilled dataset. Extensive experiments on diverse-scale benchmarks demonstrate that SGS substantially improves OOD generalization while remaining plug-and-play compatible with existing DM methods.

2605.18835 2026-05-20 cs.LG

StampFormer: A Physics-Guided Material-Geometry-Coupled Multimodal Model for Rapid Prediction of Physical Fields in Sheet Metal Stamping

StampFormer: 一种基于物理的材料-几何耦合多模态模型,用于快速预测冲压板料的物理场

Jiajie Luo, Mohamed Mohamed, Osama Hassan, Haosu Zhou, Yingxue Zhao, Haoran Li, Xinrun Li, Zhutao Shao, Yang Long, Nan Li, Jichun Li

发表机构 * Dyson School of Design Engineering, Imperial College London(帝国理工学院设计工程学院) School of Computing, Newcastle University(新castle大学计算机学院) Department of Computing, Imperial College London(帝国理工学院计算系) Multi-X Solution Limited(Multi-X解决方案有限公司) Department of Computer Science, Durham University(达勒姆大学计算机科学系) Department of Mechanical Engineering, Faculty of Engineering, Helwan University(Helwan大学工程学院机械工程系)

AI总结 本文提出StampFormer模型,通过结合材料和几何信息,实现对冲压板料物理场的快速准确预测,从而提高设计效率。

详情
AI中文摘要

传统冲压板料成型依赖于耗时且昂贵的有限元分析(FEA)进行设计验证,这一过程显著延长了设计周期。虽然代理模型提供了更快的迭代速度,但现有方法存在局限:标量方法无法捕捉全面的基于场的FEA结果,而现有基于图像的方法往往忽略了材料属性的关键作用,仅关注几何。为解决这一差距,我们开发了一种基于物理的深度学习框架,即StampFormer,该框架同时利用组件几何和材料应力-应变响应来预测FEA结果。StampFormer框架使用三个核心组件处理数据。首先,材料增强的几何网络(MAGN)融合几何和材料数据。然后,通过层次化材料嵌入注入单元(HMEIU)在不同层次上整合信息,再由主网络骨干,即改进的Swin-UNet进行处理。我们在交叉件面板冲压上评估了我们的模型,使用两个模拟数据集进行钢和铝板的冲压模拟,结果表明,StampFormer在不到一秒的时间内提供了高保真的关键物理场预测,包括薄化、主应变、次应变、塑性应变和位移。与真实FEA相比,我们的模型在四个二维场上的平均相对误差小于8.5%,在三维位移场上的均方误差小于1.2 mm²。总之,我们介绍了一种实用且高效的框架,整合了多模态信息,即几何和材料属性,以提供快速且准确的预测,使设计师能够进行实时的可制造性评估。

英文摘要

Traditional sheet metal forming relies on time-consuming and expensive Finite Element Analysis (FEA) for design validation, a process that significantly prolongs design cycles. While surrogate models offer faster iteration, current approaches have limitations: scalar-based methods cannot capture comprehensive field-based FEA results, while existing image-based models often ignore the critical role of material properties by focusing solely on geometry. To address this gap, we develop a physics-guided deep learning framework, namely StampFormer, which simultaneously uses component geometry and material stress-strain responses to predict FEA outcomes. The StampFormer framework uses three core components to process data. A Material-Augmented Geometric Network (MAGN) first fuses geometric and material data. This information is then integrated at various levels by a Hierarchical Material Embedding Injection Unit (HMEIU) before being processed by the primary network backbone, an adapted Swin-UNet. We evaluated our model on the stamping of a crossmember panel with two simulation datasets for steel and aluminium panels, and results demonstrate that StampFormer provides high-fidelity predictions of critical physical fields - including thinning, major strain, minor strain, plastic strain, and displacement - in under a second. Compared with ground truth FEA, our model achieved an average relative error of less than 8.5% on the four 2D fields and a mean squared error of less than 1.2 mm2 for the 3D displacement field. In summary, we introduce a practical and efficient framework that integrates multimodal information, namely geometry and material properties, to provide fast and accurate predictions, enabling designers to perform real-time manufacturability assessments.

2605.18832 2026-05-20 cs.LG cs.AI

Precision Tracked Transformer via Kalman Filtering, Kriging and Process Noise

通过卡尔曼滤波、克里格法和过程噪声的精确跟踪变压器

Bo Long, Deepak Agarwal, Jelena Markovic-Voronov, Yi Wang, Liuqing Li

发表机构 * LinkedIn Core AI(LinkedIn核心AI)

AI总结 本文提出了一种基于贝叶斯滤波的变压器(BFT),通过引入精度权重的克里格法、自适应卡尔曼更新和动态模型,解决了传统变压器在处理不确定性方面的不足,提升了序列推荐和大语言模型在噪声环境下的鲁棒性。

详情
AI中文摘要

Transformer是现代AI的基础构建块,但其缺乏对不确定性的原则性处理,这在实际应用中普遍存在:序列推荐中的冷启动标记具有稀疏的历史,语言模型中的异质信号质量,以及由无约束softmax引起的注意力 sinks。每个token都被统一的置信度处理。我们证明这种统一性是我们的贝叶斯滤波变压器(BFT)的退化情况:注意力变为精度加权克里格法,残差连接变为具有自适应增益的卡尔曼更新,FFN变为通过雅可比矩阵加过程噪声规则传播精度的动力学模型。观测精度来自一个无参数的受限最大似然(REML)估计器,具有共轭贝叶斯先验。BFT将任何Transformer层替换为几乎无开销。在序列推荐中,BFT应用于三种主要架构,在六个基准上获得显著提升,其中在冷启动用户和稀有物品上改进最大。在具有噪声数据的监督微调中,BFT在两个领域提高了鲁棒性:噪声监督(问答中的token-标签腐败)和噪声上下文(具有真实RAG干扰项的检索增强问答)。单个原则性修改——恢复精度——在经典序列建模和现代LLM领域中释放了大量空间。

英文摘要

The Transformer is the foundational building block of modern AI, yet offers no principled handling of \emph{uncertainty}, which is prevalent in real applications: cold-start tokens with sparse histories in sequential recommendation, heterogeneous signal quality in language models, and attention sinks induced by unconstrained softmax. Every token is treated with uniform confidence. We show this uniformity is a degenerate case of our \emph{Bayesian Filtering Transformer} (BFT): attention becomes precision-weighted kriging, the residual connection becomes a Kalman update with adaptive gain, and the FFN becomes a dynamics model propagating precision via a Jacobian--plus--process-noise rule. Observation precision comes from a parameter-free Restricted Maximum Likelihood (REML) estimator with a conjugate Bayesian prior. BFT replaces any Transformer layer with negligible overhead. On sequential recommendation, BFT applied to three major architectures yields significant gains on six benchmarks, with the largest improvements on cold-start users and rare items where uncertainty is highest. On supervised fine-tuning of large language models with noisy data, BFT improves robustness in two regimes: noisy supervision (token-label corruption in question answering) and noisy context (retrieval-augmented QA with real RAG distractors). A single principled modification -- restoring precision -- unlocks substantial headroom across both classical sequence-modeling and modern LLM regimes.

2605.18830 2026-05-20 cs.LG

In-Context Learning Operates as Concept Subspace Learning

基于情境学习的概念子空间学习

Wei Tang, Xinyan Jiang, Fakhri Karray, Lijie Hu

发表机构 * Mohamed bin Zayed University of Artificial Intelligence(莫扎伊德·本·扎耶德人工智能大学) Shanghai Advanced Research Institute(上海先进研究院)

AI总结 本文研究了结构化演示是否诱导低维概念推理,通过概念子空间视角揭示了情境学习中预测分解为概念坐标回归和子空间泄漏的机制,并通过实验验证了任务信息集中在低维任务对齐激活子空间中的结论。

详情
AI中文摘要

回归和贝叶斯对情境学习(ICL)的解释说明了演示如何诱导预测器,而机械分析通常识别出紧凑的激活方向,引导受促行为。然而,仍不清楚结构化演示是否诱导低维概念推理。我们通过概念子空间视角研究这一问题,在此视角中,任务仅沿内在概念坐标变化,尽管输入观察在高维环境空间中。对于岭回归和最小二乘ICL代理,预测精确分解为概念坐标回归和子空间泄漏。在块对角或近块对角协方差假设下,主导估计和噪声敏感项随概念子空间的维度变化,而残差效应由跨子空间耦合控制。这种分离给出了机械预测:可恢复的任务信息应集中在低维、任务对齐的激活子空间中。在CounterFact衍生的多关系提示上使用Llama-3-8B,4096维残差流的68-73维子空间恢复了78.8%的干净-受污染准确率差距,而补全互补子空间则恢复了0%。概念交换将预测引导至注入的关系,而随机和跨任务匹配排名控制效果不大。此外,在Qwen2.5-7B和受控的跨语言规则任务上的额外实验显示了相同定性模式。这些结果支持概念子空间作为紧凑、任务对齐的可恢复ICL行为在结构化任务家族中的中介,而不意味着全电路恢复。

英文摘要

Regression and Bayesian accounts of in-context learning (ICL) explain how demonstrations can induce predictors, while mechanistic analyses often identify compact activation directions that steer prompted behavior. However, it remains unclear whether structured demonstrations induce low-dimensional concept inference. We study this question through a concept-subspace view of ICL, in which tasks vary only along intrinsic concept coordinates, although inputs are observed in a high-dimensional ambient space. For ridge and least-squares ICL proxies, prediction decomposes exactly into concept-coordinate regression and off-subspace leakage. Under block-diagonal or near-block-diagonal covariance assumptions, the leading estimation and nuisance-sensitivity terms scale with the dimension of the concept subspace, while residual effects are controlled by cross-subspace coupling. This separation gives a mechanistic prediction: recoverable task information should concentrate in a low-dimensional, task-aligned activation subspace. On CounterFact-derived multi-relation prompts with Llama-3-8B, a 68--73-dimensional subspace of the 4096-dimensional residual stream restores 78.8% of the clean--corrupted accuracy gap, whereas patching the complementary subspace restores 0%. Concept swaps redirect predictions toward injected relations, while random and cross-task matched-rank controls are largely ineffective. Additional experiments on Qwen2.5-7B and a controlled cross-lingual rule task show the same qualitative pattern. These results support concept subspaces as compact, task-aligned mediators of recoverable ICL behavior in structured task families, without implying full-circuit recovery.

2605.18829 2026-05-20 cs.LG cs.CR

Lossless Anti-Distillation Sampling

无损反蒸馏采样

Zibo Diao, Jingchu Gai, Xinyue Ai, Zhang Zhang, Zhenyu He, Di He

发表机构 * Peking University(北京大学) Tsinghua University(清华大学) Carnegie Mellon University(卡内基梅隆大学)

AI总结 本文提出了一种无损反蒸馏采样方法,通过在保持良性用户体验的同时,有效对抗多账号蒸馏攻击,降低蒸馏模型的泛化能力。

详情
AI中文摘要

面向商业生成模型的前沿领域,蒸馏攻击正成为日益严峻的威胁。蒸馏者通过收集生成响应并以极低的成本训练自己的竞争模型。现有防御措施要么依赖于修改模型输出,从而牺牲良性用户的响应质量,要么依赖于行为检测方法,这些方法可以通过在多个账户上分布查询来轻易绕过。在本工作中,我们提出了无损反蒸馏采样(LADS),一种专门设计用于对抗多账号蒸馏同时保持良性用户体验的新型采样方案。具体而言,LADS从由查询的语义内容和用户查询次数决定的私有种子中推导出每种生成的随机性。通过构造,每个良性用户在每次访问时都会独立地从原始模型中采样响应,因此不会产生失真。相反,对于蒸馏者,不同账户在相同语义桶中的查询会共享潜在随机性。因此,收集的数据变得相关,可能降低样本多样性并损害泛化能力。利用统一收敛理论,我们证明LADS在无条件和条件生成设置中,能够证明降低蒸馏者泛化差距的收敛率相对于标准i.i.d.采样。在图像生成、数学推理和代码生成的实验中,证实LADS显著降低蒸馏学生的表现,同时保持对单个用户的精确统计保真度。

英文摘要

Frontier commercial generative models face a growing threat from distillation, whereby a distiller harvests generated responses and trains a competing model of its own at drastically lower cost. Existing defenses either rely on modifying the models outputs, thereby sacrificing response quality for benign users, or on behavioral detection methods, which can be readily circumvented by distributing queries across multiple accounts. In this work, we propose Lossless Anti-Distillation Sampling (LADS), a novel sampling scheme specifically designed to counter multi-account distillation while maintaining a lossless experience for benign users. Concretely, LADS derives the randomness underlying each generation from a private seed determined by the semantic content of the query and the number of times the user has queried the model. By construction, every benign user receives a response independently sampled from the original model at each visit, and thus experiences no distortion. In contrast, for a distiller, different accounts share latent randomness whenever their queries fall in the same semantic bucket. As a result, the harvested data becomes correlated, potentially reducing sample diversity and degrading generalization. Using uniform convergence theory, we show that LADS provably degrades the convergence rate of the distillers generalization gap relative to standard i.i.d. sampling in both unconditional and conditional generation settings. Experiments on image generation, mathematical reasoning, and code generation confirm that LADS substantially degrades the performance of distilled students while preserving exact statistical fidelity for individual users.

2605.18826 2026-05-20 cs.LG cs.AI

The Routing and Filtering Structure of Attention

注意力的路由和过滤结构

Shafayeth Jamil, Rehan Kapadia

发表机构 * University of Southern California(南加州大学)

AI总结 本文研究了注意力机制中的路由和过滤结构,通过分解1776个预训练Transformer的头部,发现路由在低秩状态下运行,并引入S-D注意力作为诊断参数化方法,分离路由和过滤,实现稳定训练和有效降维。

Comments 13 pages, 7 figures

详情
AI中文摘要

注意力交互矩阵$QK^{ op}$包含两个交织的计算:一个斜对称成分用于在位置间重新分配信息(路由),一个对称成分用于缩放相互相关性(过滤)。我们分解了五个预训练Transformer中的1776个头部,发现路由在低秩状态下运行,远低于权重核分配的路由能力。我们引入了S-D注意力作为诊断参数化方法,通过构造分离路由和过滤,保证稳定性($\mathrm{Re}(λ) \le 0$)并稳定训练而无需层归一化。当分离和未归一化时,路由自组织成一个谱级联,第一层的有效秩为2,随着深度扩展到六个尺度,从7M到355M参数。级联预测了注意力可以简化的位置:线性化125M S-D注意力的前七层成本低于5%的困惑度,而标准注意力在相同干预下崩溃。可线性化的区域随着深度扩大。用ELU+1线性注意力替换前四层,可在完整头部维度内达到基线的1.4%以内。级联分配的架构用注意力参数换取困惑度(47%-65%更少的注意力参数,+3.9%到+8.4% PPL)。路由-过滤分解使谱预算变得清晰;级联使其具有可操作性。

英文摘要

The attention interaction matrix $QK^{\top}$ contains two entangled computations: a skew-symmetric component that redistributes information between positions (routing) and a symmetric component that scales mutual relevance (filtering). We decompose 1776 heads across five pretrained transformers and find routing operating at low rank, well below the routing capacity allocated by the weight kernel. We introduce $S$-$D$ attention as a diagnostic parameterization that disentangles routing from filtering by construction with guaranteed stability ($\mathrm{Re}(λ) \le 0$) and trains stably without layer normalization. When disentangled and unnormalized, routing self-organizes into a spectral cascade, effective rank $2$ at the first layer, expanding with depth across six scales from 7M to 355M parameters. The cascade predicts where attention can be simplified: linearizing the first seven layers of 125M $S$-$D$ attention costs ${<}5\%$ perplexity, whereas standard attention collapses under the same intervention. The linearizable region widens with depth. Replacing the first four layers with ELU+1 linear attention reaches within $1.4\%$ of baseline at full head dimension. Cascade-allocated architectures trade attention parameters for perplexity ($47\%-65\%$ fewer attention parameters at $+3.9\%$ to $+8.4\%$ PPL). The routing-filtering decomposition makes the spectral budget legible; the cascade makes it actionable.

2605.18825 2026-05-20 cs.LG cs.DC

Not All Tokens Are Worth Caching: Learning Semantic-Aware Eviction for LLM Prefix Caches

并非所有标记都值得缓存:学习语义感知的淘汰策略用于LLM前缀缓存

Shaoke Fang, Ziang Li, Wenfei Wu, Jiatong Ji, Qingsong Liu, Ruizhi Pu

发表机构 * Peking University(北京大学) FirestAI Tsinghua University(清华大学) University of Massachusetts Amherst(马萨诸塞大学阿姆赫斯特分校) Southeast University(东南大学)

AI总结 本文提出了一种语义感知的前缀缓存淘汰策略SAECache,通过多队列架构、语义感知的标记加权机制和全适应的在线学习方案,提高了LLM服务中前缀缓存的效率,从而在不同工作负载下实现了显著的TTFT提升。

详情
AI中文摘要

前缀缓存是大型语言模型(LLM)服务中的关键优化,通过重用注意力键值(KV)状态来减少昂贵的prefill计算。然而,其效益依赖于淘汰策略,因为GPU内存有限,而现有策略如LRU通常将缓存块视为统一处理。这种观点忽略了LLM提示的一个基本属性:并非所有标记都同样值得缓存。我们显示,提示中不同的标记类型,包括系统提示、用户查询、工具输出、模型响应和推理链,其重用率可能高达756倍,但现有淘汰策略并未利用这一信号。在本文中,我们提出了SAECache(语义适应的前缀缓存淘汰策略),通过三个创新来解决这一差距:(1)一个多队列架构,将KV块路由到任务特定的队列中,使用定制的优先级指标,捕捉多轮请求中的会话重用和模板单轮请求中的结构重用;(2)一种语义感知的标记加权机制,通过淘汰反馈在线学习不同标记类型的重用价值;(3)一种完全适应的在线学习方案,用于所有参数更新,包括对数正态时间参数、位置衰减幂、队列权重和元参数,这消除了手动调优并使系统能够自动适应部署特定的工作负载特性。通过在异构工作负载上的广泛评估,我们证明SAECache在生产风格的基线之上实现了1.4x-2.7x的TTFT提升,而固定参数的替代方案在工作负载不匹配时可能会下降高达2.7x,这是我们的自适应方法完全避免的失败模式。

英文摘要

Prefix caching is a key optimization in Large Language Model (LLM) serving, reusing attention Key-Value (KV) states across requests with shared prompt prefixes to reduce expensive prefill computation. However, its benefit depends critically on the eviction policy as GPU memory is scarce, and existing policies such as LRU largely treat cached blocks uniformly. This view ignores a fundamental property of LLM prompts: not all tokens are equally worth caching. We show that different token types within a prompt, including system prompts, user queries, tool outputs, model responses, and chain-of-thought reasoning, exhibit up to 756x variation in reuse rates, yet no existing eviction policy exploits this signal. In this paper, we present SAECache (Semantic-Adaptive Eviction for prefix caches), a semantic-adaptive prefix cache eviction policy that addresses this gap through three innovations: (1) a multi-queue architecture that routes KV blocks to task-specific queues with tailored priority metrics, capturing both session reuse in multi-turn requests and structural reuse in templated single-turn requests; (2) a semantic-aware token weighting mechanism that learns the reuse value of different token types online through eviction feedback; and (3) a fully adaptive online learning schema for all parameter updates, including log-normal timing parameters, position decay power, queue weights, and meta-parameters, which eliminates manual tuning and enables automatic adaptation to deployment-specific workload characteristics. Through extensive evaluation across heterogeneous workloads, we demonstrate that SAECache achieves 1.4x-2.7x TTFT improvement over production-style baselines, while fixed-parameter alternatives can degrade by up to 2.7x under workload mismatch -- a failure mode our adaptive approach avoids entirely.

2605.18824 2026-05-20 cs.LG cs.AI cs.CL

Fine-Grained Benchmark Generation for Comprehensive Evaluation of Foundation Models

细粒度基准生成用于基础模型的全面评估

Mohammed Saidul Islam, Negin Baghbanzadeh, Farnaz Kohankhaki, Afshin Cheraghi, Ali Kore, Shayaan Mehdi, Elham Dolatabadi, Arash Afkanpour

发表机构 * Vector Institute(Vector研究院) York University(约克大学)

AI总结 本文提出了一种自动化基准生成框架,用于生成覆盖广泛、元数据丰富且抗污染的评估问题,从而提升基础模型的全面评估能力。

详情
AI中文摘要

基础模型的评估通常依赖于缺乏全面覆盖和细粒度评估元数据的基准汇总分数。我们引入了一个自动化基准生成框架。该框架生成基于参考材料(如教科书)的评估问题,生成具有广泛覆盖、丰富元数据和抗污染性的基准。该流程采用多代理架构进行问题生成,并采用以解决方案图驱动的策略,显著提高了地面真实解决方案的可靠性。使用该框架,我们生成了三个基准:机器学习、公司金融和个人金融。专家审查发现,其地面真实错误率显著低于之前的基准,如MMLU和GSM8K。对12个商业和开源模型的评估显示,我们的基准实现了接近均匀的竞争力覆盖,并揭示了现有基准未能捕捉到的模型间性能差异。我们即将开源该框架和我们精心挑选的基准。

英文摘要

Evaluation of foundation models often rely on aggregate scores from benchmarks that lack comprehensive coverage and metadata for a fine-grained evaluation. We introduce a framework for automated benchmark generation. Our framework generates evaluation problems grounded in reference material, such as textbooks, producing benchmarks with broad coverage, rich metadata, and robustness to contamination. The pipeline employs a multi-agent architecture for problem generation and a solution-graph-driven strategy that significantly improves the reliability of ground truth solutions. Using the framework, we generate three benchmarks in Machine Learning, Corporate Finance, and Personal Finance. Expert review finds a significantly lower ground-truth error rate than previous benchmarks such as MMLU and GSM8K. Evaluation of 12 commercial and open-source models shows that our benchmarks achieve near-uniform competency coverage and surface performance differences across models that existing benchmarks fail to capture. We will open-source the framework and our curated benchmarks soon.

2605.18823 2026-05-20 cs.LG

Multi-Pedestrian Safety Warning at Urban Intersections Use Case of Digital Twin

城市交叉口多行人安全预警的数字孪生应用案例

Yongjie Fu, Qi Gao, Mahshid Ghasemi Dehkordi, Gil Zussman, Xuan Di

发表机构 * Department of Civil Engineering and Engineering Mechanics at Columbia University(哥伦比亚大学土木工程与工程力学系) Data Science Institute(数据科学研究院) Department of Electrical Engineering at Columbia University(哥伦比亚大学电气工程系)

AI总结 本文提出一种基于紧密耦合物理-数字孪生框架的城市交叉口多行人安全预警系统,通过COSMOS无线测试床进行实地部署和虚拟现实实验,验证了系统在提高安全预警准确性和响应效率方面的有效性。

详情
AI中文摘要

数字孪生(DTs)在城市交通系统中已获得越来越多的关注;然而,其在安全关键场景中的系统性评估仍然有限。本文提出了一种基于紧密耦合物理-数字孪生框架的城市交叉口多行人安全预警系统。该系统基于纽约市的COSMOS城市级无线测试床,整合了摄像头和超宽带(UWB)、边缘-云计算、预测轨迹建模以及基于MQTT的通信,以向易受伤害道路使用者(VRUs)提供实时安全警报。该系统通过实地部署和虚拟现实(VR)实验进行评估。结果表明,系统在不同模型配置下具有高预警生成准确率、高定位准确率、高效的端到端延迟以及在发出警告时显著减少用户响应时间。所提出的DT框架提供了一种可扩展、模块化且通用的解决方案,用于复杂城市交叉口的实时多行人安全增强。

英文摘要

Digital twins (DTs) for urban transportation systems have gained increasing attention; however, their systematic evaluation in safety-critical scenarios remains limited. This paper presents a multi-pedestrian safety warning system at urban intersections enabled by a tightly coupled physical-digital twin framework. Built upon the COSMOS city-scale wireless testbed in New York City, the proposed system integrates camera and ultra-wideband (UWB), edge-cloud computing, predictive trajectory modeling, and MQTT-based communication to deliver real-time safety alerts to vulnerable road users (VRUs). The system is evaluated through both field deployment and virtual reality (VR) experiments. Results demonstrate high warning generation accuracy, localization accuracy, efficient end-to-end latency under different model configurations, and significant reductions in user response time when warnings are issued. The proposed DT framework provides a scalable, modular, and generalizable solution for real-time multi-pedestrian safety enhancement at complex urban intersections.

2605.18822 2026-05-20 cs.LG cs.AI

Hybrid-LoRA: Bridging Full Fine-Tuning and Low-Rank Adaptation for Post-Training

Hybrid-LoRA: 桥接全微调与低秩适应以实现训练后优化

Chengqian Zhang, Wei Zhu, Kyumin Lee

发表机构 * Worcester Polytechnic Institute(沃斯特理工学院) University of Hong Kong(香港大学)

AI总结 本文提出Hybrid-LoRA框架,通过选择性地对部分模块进行全微调,其余模块使用LoRA进行适应,从而在训练后优化中实现高效性能。

详情
AI中文摘要

训练后已成为适应大型语言模型(LLMs)以实现复杂下游行为(如指令遵循、偏好对齐和多步推理)的关键方法。最近,基于可验证奖励的强化学习(RLVR)作为一种特别有效的训练后范式,通过如GRPO和GSPO等无批评算法实现了可扩展的优化。然而,使用全微调(FFT)的RLVR训练后方法需要大量GPU内存并导致高训练成本。尽管参数高效微调(PEFT)方法如低秩适应(LoRA)能有效降低计算成本,但它们在复杂推理任务的训练后性能上往往存在显著差距。在本文中,我们提出了Hybrid-LoRA,一种高效的训练后框架,该框架选择性地对一小部分不太适合低秩适应的模块进行全微调,而对其余模块使用LoRA进行适应。我们引入了一个新的Hybrid-LoRA Score,用于在固定参数预算下对候选模块按其对低秩适应的敏感性进行排序。实验表明,在10%的全微调模块预算下,Hybrid-LoRA能够接近全微调性能,其余候选模块通过LoRA进行适应, consistently outperforming four state-of-the-art PEFT post-training baselines,实现了高达5.65%和平均4.36%的改进。

英文摘要

Post-training has become essential for adapting large language models (LLMs) to complex downstream behaviors, including instruction following, preference alignment, and multi-step reasoning. Reinforcement learning with verifiable rewards (RLVR) has recently emerged as a particularly effective post-training paradigm for improving reasoning capabilities, with critic-free algorithms such as GRPO and GSPO enabling scalable optimization. However, RLVR post-training with full fine-tuning (FFT) requires substantial GPU memory and incurs high training costs. Although parameter-efficient fine-tuning (PEFT) methods, such as Low-Rank Adaptation (LoRA), effectively reduce computational costs, they often suffer from a noticeable performance gap compared to full fine-tuning in post-training for complex reasoning tasks. In this paper, we propose Hybrid-LoRA, an efficient hybrid post-training framework that selectively applies full fine-tuning to a small subset of modules less suited to low-rank adaptation, while adapting the remaining components with LoRA. We introduce a novel Hybrid-LoRA Score to rank candidate modules according to their sensitivity to low-rank adaptation under a fixed parameter budget. Experiments show that Hybrid-LoRA closely matches full fine-tuning performance under a 10% full fine-tuning module budget, with the remaining candidate modules adapted by LoRA, consistently outperforming four state-of-the-art PEFT post-training baselines, achieving improvements of up to 5.65% and on average 4.36% over the best baseline.

2605.18821 2026-05-20 cs.LG cs.CR

Quantum Adversarial Machine Learning: From Classical Adaptations to Quantum-Native Methods

量子对抗机器学习:从经典适应到量子原生方法

Roozbeh Razavi-Far, Mohammad Meymani, Erfan Mahmoudinia, Dorsa Vazirzade, Peyman Paknezhad, Fateme Ghasemi, Saeed Saravani, Somayeh Nikkhoo, Kimia Haghjooei

发表机构 * Faculty of Computer Science, University of New Brunswick(新不伦瑞克大学计算机科学学院) Department of Electrical Engineering, Amirkabir University of Technology(技术学院电子工程系) Faculty of Mathematical and Computer Science, Kharazmi University(卡扎尔米大学数学与计算机科学学院) Pázmány Péter Catholic University(帕兹曼·彼得天主教大学) Department of Computer Engineering, Amirkabir University of Technology(技术学院计算机工程系) Department of Computer Engineering, Ferdowsi University of Mashhad(马赞德兰大学计算机工程系) Department of Computer Science, Tarbiat Modares University(塔里克·莫达res大学计算机科学系)

AI总结 本文研究量子对抗机器学习中的攻击与防御策略,探讨其理论基础、发展趋势和关键挑战。

Journal ref Artif Intell Rev (2026)

详情
AI中文摘要

机器学习已革新了众多工业领域。尽管取得了近期进展,机器学习模型仍然容易受到对抗性威胁。对抗性机器学习研究这些脆弱性以构建稳健的机器学习模型。量子机器学习是连接量子计算和经典机器学习的交叉领域。虽然量子机器学习在回归、分类和生成建模等复杂任务中可能超越经典机器学习,但它仍然容易受到对抗性攻击。鉴于量子计算和机器学习的近期进展,量子对抗性机器学习领域应运而生,以研究量子机器学习的脆弱性、可能的攻击和新型量子增强的防御策略。在本文的综述中,我们提供了量子对抗性机器学习的详细概述,探讨了现有的攻击和防御措施。我们还回顾了该领域的理论基础、新兴趋势和关键挑战。

英文摘要

Machine learning has revolutionized numerous industrial domains. Despite recent advances, machine learning models remain vulnerable to adversarial threats. Adversarial machine learning is a field that studies these vulnerabilities to build robust machine learning models. Quantum machine learning is an interdisciplinary field that bridges quantum computing and classical machine learning. While quantum machine learning shows potentials to outperform classical machine learning in complex tasks such as regression, classification, and generative modeling, it remains vulnerable to adversarial attacks. Given the recent advancements in quantum computing and machine learning, the quantum adversarial machine learning field has emerged to study the vulnerabilities of quantum machine learning, possible attacks, and novel quantum-enhanced defense strategies. In this survey, we provide a detailed overview on quantum adversarial machine learning and explore the existing attacks and countermeasures. We also review the theoretical underpinnings of this area, emerging trends, and critical challenges.

2605.18820 2026-05-20 cs.LG cs.AI

Emergence of Frontier Superposition: Möbius attractor and Cascade Supervision

前沿叠加的涌现:莫比乌斯吸引子与级联监督

Hongyu Gu, Jingwen Fu

发表机构 * University of Science and Technology of China(中国科学技术大学) Zhongguancun Academy(中关村学院)

AI总结 本文研究了通过叠加实现深度推理的问题,提出莫比乌斯吸引子和级联监督方法,证明了在Erdős-Rényi图上,叠加推理的涌现是通过建筑和监督的贡献实现的。

Comments 40 pages, 3 figures

详情
AI中文摘要

叠加允许Transformer在深度推理中并行处理整个推理前沿,通过有限深度的前向传递而不是展开串行的思维链token。虽然Zhu等人(2025)在单一残差流中手工构建了一个等权重的广度优先前沿用于图可达性,但仍未确定梯度下降能否在排列对称的鞍点中找到这个目标。我们通过隔离建筑和监督的贡献,填补了在Erdős-Rényi图上通过叠加实现可达性的问题。在建筑方面,我们识别出一个莫比乌斯吸引子:在树的 regime 中,层间动态减少到一个1D莫比乌斯映射,其零集是一个共维数为一的全局最优解 manifold,包含等权重叠加状态。在监督方面,我们识别出级联监督:一个损失类别,其反向传播同时提供(A)选择性 bootstrap,(B)梯度在深度的持续性,以及(C)每一步的区分(例如L_sup和L_node)。端到端监督失败于条件(B),并被证明是不足的:在图的扇出和停滞前到达 manifold 之前,层c的内部梯度衰减为(np)^{-(D-c-2)/2}。我们的论点:莫比乌斯吸引子 + 级联监督 = 叠加推理的涌现。参数无关的衰减定律预测在深度D=3时,最终步骤余弦为0.35 vs. 0.71(端到端 vs. 级联);实验证实0.37 vs. 0.69,每一步的匹配误差在0.02以内。

英文摘要

Superposition allows Transformers to reason in depth, carrying an entire reasoning frontier in parallel through a bounded-depth forward pass instead of unrolling serial chain-of-thought tokens. While Zhu et al. (2025) hand-crafted an equal-weight breadth-first frontier in a single residual stream for graph reachability, it remained open whether gradient descent could ever find this target amidst permutation-symmetric saddles. We close this gap on Reachability-by-Superposition over Erdős-Rényi graphs by isolating architectural and supervisional contributions. Architecturally, we identify a Möbius attractor: under $S_n$-symmetry in the tree regime, layerwise dynamics reduce to a 1D Möbius map whose zero set is a codimension-one manifold of global optima containing the equal-weight superposition state. On the supervision side, we identify Cascade Supervision: a loss class whose backward pass simultaneously delivers (A) selectivity bootstrap, (B) gradient persistence across depth, and (C) per-step discrimination (e.g., \mathcal{L}_{sup} and \mathcal{L}_{node}). End-to-end supervision fails condition (B) and is provably insufficient: internal gradients at layer c decay as (np)^{-(D-c-2)/2} in the graph fan-out and stall before the manifold is reached. Our thesis: Möbius attractor + Cascade Supervision = emergence of superposition reasoning. The parameter-free decay law predicts a final-step cosine of 0.35 vs. 0.71 (end-to-end vs. cascade) at depth D=3; experiments confirm 0.37 vs. 0.69, matching within 0.02 at every step.

2605.18816 2026-05-20 cs.LG cs.AI

Symmetry in the Wild: The Role of Equivariance in Neural Fluid Surrogates

野生中的对称性:等变性在神经流体代理中的作用

Patryk Rygiel, Julian Suk, Kak Khee Yeung, Christoph Brune, Jelmer M. Wolterink

发表机构 * Department of Applied Mathematics(应用数学系) Technical Medical Centre(技术医学中心) Cardiovascular Health Technology Centre(心血管健康技术中心) University of Twente(特文特大学) Department of Computer Science(计算机科学系) Munich Center for Machine Learning(慕尼黑机器学习中心) Technical University of Munich(慕尼黑技术大学) Department of Surgery(外科系) Amsterdam UMC, Location(阿姆斯特丹大学医学中心,地点) University of Amsterdam(阿姆斯特丹大学) Amsterdam Cardiovascular Sciences(阿姆斯特丹心血管科学) Digital Society Institute(数字社会研究所)

AI总结 本文研究了等变性在神经流体代理中的作用,探讨了在不同分布对齐和真实度的任务中,等变性如何提高泛化能力,并介绍了AB-GATr模型在处理耦合表面和体积量时的效率。

详情
AI中文摘要

神经代理能够将计算流体动力学(CFD)模拟的计算速度提升几个数量级,有望改变工程和医疗流程。在现实应用中使用神经代理需要解决可扩展性问题,包括大规模、高分辨率表面和体积网格以及定制架构,并通过归纳偏置来应对有限的训练数据。群等变架构是引入此类偏置的一种系统方法,但当学习问题本身破坏对称性时,例如由于数据集中的强分布对齐,可能会产生不利影响。在本工作中,我们探讨了在具有不同分布对齐和真实度的任务中,等变性如何提高神经CFD代理的泛化能力,涵盖汽车空气动力学和血流(血动力学)。为了系统评估等变性在问题可扩展性极限处的附加价值,我们引入了Anchored-Branched Geometric Algebra Transformer(AB-GATr),一种整合了可扩展性和对称性保持的神经代理,能够以E(3)等变的方式高效建模耦合的表面和体积量。我们发现,在强对齐的空气动力学数据集上,即那些破坏对称性的数据集,强制等变性会降低分布内性能。相反,在具有不同几何形状和变化对齐的血动力学基准测试中,等变性始终有益。此外,在所有基准测试中,AB-GATr的显式等变性通过数据增强始终优于隐式对称学习。我们的发现表明,等变性并非在所有领域都有益,但在缺乏强数据规律的问题中带来了实质性的优势。

英文摘要

Neural surrogates enable orders-of-magnitude acceleration of computational fluid dynamics (CFD) simulations, with the potential to transform engineering and healthcare workflows. Neural surrogate use in real-world applications requires addressing scalability to large, high-resolution surface and volume meshes, as well as to bespoke architectures, and accounting for limited training data through the use of inductive biases. Group-equivariant architectures are a principled way to introduce such bias, yet they can be detrimental when the learning problem itself breaks symmetry, for example, due to strong distributional alignment in the dataset. In this work, we investigate under which conditions equivariance improves generalization in neural CFD surrogates across tasks with increasing levels of distributional alignment and realism, covering automotive aerodynamics and blood flow (hemodynamics). To systematically assess the added value of equivariance at the limit of problem scaling, we introduce the Anchored-Branched Geometric Algebra Transformer (AB-GATr), a neural surrogate that integrates scalability and symmetry preservation to efficiently model coupled surface and volume quantities in an $E(3)$-equivariant manner. We find that on strongly aligned aerodynamics datasets, i.e., those that break symmetry, enforcing equivariance can degrade in-distribution performance. In contrast, across hemodynamic benchmarks with diverse geometries and varying alignment, equivariance is consistently beneficial. Moreover, across all benchmarks, the explicit equivariance of AB-GATr reliably outperforms implicit symmetry learning through data augmentation. Our findings showcase that equivariance is not universally beneficial across domains, yet it brings tangible advantages in problems lacking strong data regularities.

2605.18815 2026-05-20 cs.LG cs.DC

DynaTrain: Fast Online Parallelism Switching for Elastic LLM Training

DynaTrain: 快速在线并行切换用于弹性大语言模型训练

Yuanqing Wang, Yuchen Zhang, Hao Lin, Junhao Hu, Chunyang Zhu, Quanlu Zhang, Boxun Li, Guohao Dai, Zhi Yang, Daning Cheng, Yunquan Zhang, Yu Wang

发表机构 * Institute of Computing Technology, CAS(中国科学院计算技术研究所) Peking University(北京大学) Infinigence AI Shanghai Jiao Tong University(上海交通大学) Tsinghua University(清华大学)

AI总结 本文提出DynaTrain,一种能够快速在线重新配置任意多维并行性的分布式训练系统,通过虚拟参数空间抽象统一所有分布式训练状态,实现并行配置的确定性映射,并在密集和MoE模型上展示了显著的性能提升。

Comments GitHub Repo: https://github.com/infinigence/ElasticMegatron

详情
AI中文摘要

现代大型语言模型(LLM)训练本质上是动态的:资源波动、RLHF阶段转换和集群弹性持续地改变最优并行性布局,对现有基于静态执行模型的训练框架构成重大挑战。我们提出了DynaTrain,一种支持亚秒级在线重新配置的分布式训练系统。其核心是虚拟参数空间(VPS)抽象,该抽象将所有分布式训练状态统一到一个逻辑坐标空间中,将任何并行性配置转换为确定性映射,并将复杂的转换折叠为可管理的几何交集。在VPS之上,状态路由和转换层在内存感知、无死锁的调度下执行rank-local传输,而弹性设备管理器则将新世界构建与正在进行的训练重叠,以掩盖拓扑变化成本。在密集和MoE模型上,DynaTrain能够在2秒内重新配置70B密集模型,在4.36秒内重新配置235B MoE模型,性能优于最先进的检查点基和弹性系统,提升幅度高达三个数量级,同时保持正确性。

英文摘要

Modern large language model (LLM) training is inherently dynamic: resource fluctuations, RLHF phase shifts, and cluster elasticity continually reshape the optimal parallelism layout, posing a significant challenge to existing training frameworks built around a static execution model. We present DynaTrain, a distributed training system for sub-second, online reconfiguration across arbitrary multi-dimensional parallelism. At its core, we propose a Virtual Parameter Space (VPS) abstraction that unifies all distributed training states under one logical coordinate space, turning any parallelism configuration into a deterministic mapping and collapsing complex transition into manageable geometric intersections. On top of VPS, a state routing-and-transition layer executes rank-local transfers under a memory-aware, deadlock-free schedule, and an Elastic Device Manager overlaps new-world construction with ongoing training to mask topology-change cost. On dense and MoE models up to 235B parameters, DynaTrain reconfigures a 70B dense model in under 2s and a 235B MoE model in 4.36s, outperforming state-of-the-art checkpoint-based and elastic systems by up to three orders of magnitude while preserving correctness.

2605.18814 2026-05-20 cs.LG

How Faithful Is Trajectory-Based Data Attribution? Error Sources, Remedies, and Practical Guidelines

轨迹数据归因的可信度如何?误差来源、缓解方法和实用指南

Junwei Deng, Pingbang Hu, Suliang Jin, Hao Lu, Jiachen T. Wang, Shichang Zhang, Jiaqi W. Ma

发表机构 * University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校) University of Michigan(密歇根大学) Princeton University(普林斯顿大学) Harvard University(哈佛大学)

AI总结 本文系统分析了轨迹数据归因方法的误差来源,并提出缓解方法和实用指南,通过将总误差分为配置级、算法级和系统级,改进了归因的准确性,并为数据选择提供了可行的实践指导。

详情
AI中文摘要

基于轨迹的数据归因方法通过展开训练轨迹来估计训练样本对模型预测的影响。它们被广泛应用于数据选择、数据估值和模型诊断等应用,但缺乏对这些方法的全面误差分析,引发了对方法可信度的担忧,并阻碍了可靠部署。在本文中,我们提供了轨迹数据归因方法误差来源的首次系统分析,以及具体的缓解方法和下游应用的实用指南。我们将总误差分为三类:配置级、算法级和系统级。我们做出了三个贡献。首先,我们识别出优化器不匹配是主导的配置级误差:现有方法在其归因下假设使用SGD,即使对于使用现代事实上的优化器AdamW训练的模型也是如此。我们提出了AdamW-influence,以充分考虑AdamW的优化动态,在四个设置中(MLP、CNN、GPT-2和Llama 3.2-1B)估计与真实影响之间的Spearman相关性提高了10%到超过300%。其次,我们隔离了剩余的算法级误差,源于一阶泰勒近似,识别了学习率和轨迹长度作为误差大小的决定因素,并推导出一个闭合形式的误差代理,可以在原始轨迹上评估而无需重新训练。第三,我们将这些见解转化为数据选择的实用指南,通过在K-step前瞻框架下统一离线和在线策略。在此框架下,在线选择具有短时间范围通常匹配或超过离线,且最佳时间范围可以与学习率联合调节。共同,这些结果将框架转化为从业者可操作的选择配方。

英文摘要

Trajectory-based data attribution methods estimate the influence of training samples on model predictions by unrolling the training trajectory. They are widely used in applications such as data selection, data valuation, and model diagnosis, but there is a lack of comprehensive error analysis of these methods, raising concerns about method faithfulness and hindering reliable deployment. In this work, we provide the first systematic analysis of error sources in trajectory-based data attribution, together with concrete remedies to mitigate them and practical guidelines for downstream use. We organize the total error into three categories, config-level, algorithm-level, and system-level. We make three contributions. First, we identify optimizer mismatch as the dominant config-level error: existing methods derive their attribution under the assumption of SGD, even for models trained with the modern de facto optimizer AdamW. We propose AdamW-influence to fully account for AdamW's optimization dynamics, yielding improvements from 10% to over 300% in Spearman correlation between estimated and ground-truth influence across four settings spanning MLP, CNN, GPT-2, and Llama 3.2-1B. Second, we isolate the remaining algorithm-level error arising from the first-order Taylor approximation, identify the learning rate and trajectory length as factors governing the error magnitude, and derive a closed-form error proxy that can be evaluated along the original trajectory without retraining. Third, we translate these insights into practical guidelines for data selection by unifying offline and online strategies under a K-step look-ahead framework. Under this framework, online selection with a short horizon often matches or exceeds offline, and the optimal horizon can be tuned jointly with the learning rate. Together, these results turn the framework into an actionable selection recipe for practitioners.

2605.18813 2026-05-20 cs.LG cs.AI

Composition of Memory Experts for Diffusion World Models

记忆专家的组合用于扩散世界模型

Sebastian Stapf, Pablo Acuaviva Huertos, Aram Davtyan, Paolo Favaro

发表机构 * Computer Vision Group(计算机视觉组) Department of Computer Science(计算机科学系) University of Bern(伯恩大学)

AI总结 本文提出了一种基于扩散的世界模型框架,通过组合专门化的记忆专家来解决记忆与效率之间的权衡问题,提升了时间一致性、过去观察的回忆和导航性能。

Journal ref Proceedings of the Fourteenth International Conference on Learning Representations (ICLR), 2026

详情
AI中文摘要

世界模型旨在预测与过去观察一致的合理未来,这是强化学习中规划和决策的关键能力。然而,现有架构面临根本性的记忆权衡:转换器保留局部细节但受二次注意限制,而递归和状态空间模型更高效但以牺牲保真度为代价。为克服这一权衡,我们建议将未来-过去一致性与任何单一架构解耦,并利用一组专门的专家。我们引入了一种基于扩散的框架,通过对比产品-专家公式整合异构记忆模型。我们的方法实现了三个互补的角色:短期记忆专家捕捉精细的局部动态,长期记忆专家通过轻量级测试时微调在外部扩散权重中存储事件历史,以及空间长期记忆专家强制几何和空间一致性。这种组合设计避免了模式崩溃,并在不产生二次成本的情况下扩展到长上下文。在模拟和现实世界基准测试中,我们的方法提高了时间一致性、过去观察的回忆和导航性能,建立了一种新的构建和操作记忆增强扩散世界模型的范式。

英文摘要

World models aim to predict plausible futures consistent with past observations, a capability central to planning and decision-making in reinforcement learning. Yet, existing architectures face a fundamental memory trade-off: transformers preserve local detail but are bottlenecked by quadratic attention, while recurrent and state-space models scale more efficiently but compress history at the cost of fidelity. To overcome this trade-off, we suggest decoupling future-past consistency from any single architecture and instead leveraging a set of specialized experts. We introduce a diffusion-based framework that integrates heterogeneous memory models through a contrastive product-of-experts formulation. Our approach instantiates three complementary roles: a short-term memory expert that captures fine local dynamics, a long-term memory expert that stores episodic history in external diffusion weights via lightweight test-time finetuning, and a spatial long-term memory expert that enforces geometric and spatial coherence. This compositional design avoids mode collapse and scales to long contexts without incurring a quadratic cost. Across simulated and real-world benchmarks, our method improves temporal consistency, recall of past observations, and navigation performance, establishing a novel paradigm for building and operating memory-augmented diffusion world models.

2605.18812 2026-05-20 cs.LG cs.CL cs.IR

PASC: Pipeline-Aware Conformal Prediction with Joint Coverage Guarantees for Multi-Stage NLP and LLM Pipelines

PASC:面向多阶段NLP和LLM流水线的管道感知置信区间

Varun Kotte

发表机构 * Independent Researcher(独立研究者)

AI总结 本文提出PASC,一种面向多阶段NLP和LLM流水线的管道感知置信区间方法,通过联合覆盖保证提升多阶段流水线的置信区间性能。

详情
AI中文摘要

现代NLP和LLM系统是流水线:命名实体识别(NER)->实体消歧(NED)->实体类型、检索增强生成(检索器->读者),以及代理链(规划器->工具->批评者)。错误在各阶段累积,但现有不确定性量化方法要么独立校准每个阶段(无联合覆盖),要么应用Bonferroni联合界(有联合覆盖但保守)。我们提出了PASC(Pipeline-Aware Split Conformal),将多阶段联合覆盖转换为单个标量置信区间问题,基于联合最大不一致性分数。PASC提供了一个有限样本分布无关的保证,所有K阶段同时覆盖的概率至少为1 - alpha,并且几乎紧致,误差不超过1/(n+1)。在CoNLL-2003上的三阶段NER->NED->实体类型流水线中,PASC实现了96.4%的端到端覆盖,优于Bonferroni的93.4%和独立CP的86.5%,在相同平均预测集大小(1.083)下。在分布偏移至WNUT-17推特和WikiNEuRal维基数据时,PASC在测试偏移设置中保持目标覆盖,而独立CP下降到59%。PASC只需一次分位数计算,运行速度比Bonferroni快1.7倍,并可扩展到K=6阶段,其中独立CP下降到0.53端到端覆盖。相同的联合最大分数减少直接应用于复合LLM系统和代理流水线。

英文摘要

Modern NLP and LLM systems are pipelines: named entity recognition (NER) -> entity disambiguation (NED) -> entity typing, retrieval-augmented generation (retriever -> reader), and agentic chains of planner -> tool -> critic. Errors compound across stages, but existing uncertainty quantification methods either calibrate each stage independently (no joint coverage) or apply a Bonferroni union bound (joint coverage, but conservative). We present PASC (Pipeline-Aware Split Conformal), which reduces multi-stage joint coverage to a single scalar conformal prediction problem on the joint maximum nonconformity score. PASC provides a finite-sample distribution-free guarantee that all K stages are simultaneously covered with probability at least 1 - alpha, and is nearly tight up to a 1/(n+1) factor. On a three-stage NER -> NED -> entity-typing pipeline over CoNLL-2003, PASC achieves 96.4% end-to-end coverage versus 93.4% for Bonferroni and 86.5% for independent CP, at identical average prediction set size (1.083). Under distribution shift to WNUT-17 Twitter and WikiNEuRal Wikipedia data, PASC empirically maintains the target coverage in the tested shift settings while independent CP collapses to 59%. PASC requires a single quantile computation, runs 1.7x faster than Bonferroni, and scales to K = 6 stages where independent CP drops to 0.53 end-to-end coverage. The same joint-maximum-score reduction applies directly to compound LLM systems and agent pipelines.

2605.18810 2026-05-20 cs.LG cs.AI

D-PACE: Dynamic Position-Aware Cross-Entropy for Parallel Speculative Drafting

D-PACE:动态位置感知交叉熵用于并行推测草案

Tianyu Wu, Yu Yao, Zhenting Qi, Han Zheng, Zhuohan Wang, Haoran Ma, Lawrence Liao, Himabindu Lakkaraju, Ju Li, Yilun Du

发表机构 * Harvard(哈佛大学) MIT(麻省理工学院)

AI总结 本文提出D-PACE,一种动态位置感知交叉熵,用于改进并行推测草案的训练,通过动态调整位置权重以提高生成速度和输出长度。

详情
AI中文摘要

推测解码通过让小型草案生成器并行生成token,由更大目标模型验证,从而加速LLM推理。最近的扩散式并行草案生成器如DFlash在一次前向传递中预测完整的B-token块,使深度草案生成器和更长的接受块成为可能。然而,现有多token草案生成器目标通常使用固定的位置依赖加权计划,如头部依赖权重或块位置衰减,这在训练过程中无法适应限制接受的位置变化。为此,我们从可微的替代品中推导出每位置的训练权重,使每个位置的权重与其log概率梯度贡献相匹配。所得到的损失,D-PACE(动态位置感知交叉熵),将训练信号转向当前限制接受的位置,随着草案生成器的改进。在六个基准、两个Qwen3-4B草案深度、两个解码温度和两个额外的目标模型上,D-PACE一致地提高了墙钟加速速度和平均生成长度,测量训练时间开销为2.3%,且不改变草案生成器的架构或推理过程。

英文摘要

Speculative decoding accelerates LLM inference by having a small drafter propose tokens that a larger target model verifies in parallel. Recent diffusion-based parallel drafters such as DFlash predict the full B-token block in one forward pass, enabling deeper drafters and longer accepted blocks. However, existing multi-token drafter objectives often use fixed position-dependent weighting schedules, such as head-dependent weights or block-position decays, which do not adapt as the positions limiting acceptance change during training. To address this, we derive per-position training weights from a differentiable surrogate of expected accepted draft length, matching the weight of each position to its log-probability gradient contribution. The resulting loss, D-PACE (Dynamic Position-Aware Cross-Entropy), shifts training signal toward positions that currently limit acceptance as the drafter improves. Across six benchmarks, two Qwen3-4B draft depths, two decoding temperatures, and two additional target models, D-PACE consistently improves both wall-clock speedup and average emitted length, with 2.3\% measured training-time overhead and no changes to the drafter architecture or inference procedure.

2605.18809 2026-05-20 cs.LG cs.AI

Metric-Gradient Projection for Stable Multi-Agent Policy Learning

基于度量梯度的稳定多智能体策略学习

Zuyuan Zhang, Sizhe Tang, Mahdi Imani, Tian Lan

发表机构 * The George Washington University(乔治华盛顿大学) Northeastern University(东北大学)

AI总结 本文提出HPML方法,通过将多智能体系统的联合更新场视为L²空间中的向量场,并计算其在最接近度量梯度势流上的Hodge型投影,从而提升多智能体强化学习的稳定性。

详情
AI中文摘要

一般和解的多智能体学习通常由堆叠更新场主导,其中每个智能体的策略更新会改变其他智能体面临的优化景观。这种耦合可以将可积分的集体改进组件与循环交互动力学纠缠在一起,导致多智能体学习缓慢或不稳定。现有方法,如正则化、信用分配和共识方法,通过局部或算法修改稳定MARL;HPML通过将联合更新场投影到度量梯度组件来补充它们。我们引入HPML(Hodge-Projected Multi-agent Learning),将多智能体系统的联合更新场视为L²空间中的向量场,并计算其在最接近度量梯度势流上的Hodge型投影。HPML遵循投影组件作为更新方向,从而在所选度量和采样度量下获得最接近的度量梯度场。投影通过变分定义,由泊松型方程表征,并通过基于图的和放缩神经网络实现,从样本中恢复投影方向。我们证明投影动力学具有Lyapunov势,并能产生具有显式加性非势项的平衡间隙界。受控实验验证了几何机制,CTDE基准测试显示当HPML用作MARL流水线中的插件投影层时,稳定性和归一化回报有所提高。

英文摘要

General-sum multi-agent learning is often governed by a stacked update field in which each agent's policy update changes the optimization landscape faced by the others. This coupling can entangle an integrable component of collective improvement with cyclic interaction dynamics, leading to slow or unstable multi-agent learning. Existing approaches, such as regularization, credit assignment, and consensus methods, stabilize MARL through local or algorithmic modifications; HPML complements them by projecting the joint update field onto a metric-gradient component. We introduce \textbf{HPML} (\textbf{H}odge-\textbf{P}rojected \textbf{M}ulti-agent \textbf{L}earning), which views the joint update field of a multi-agent system as an element of an $L^2$ space of vector fields and computes a Hodge-type projection onto the closest metric-gradient potential flow. HPML follows the projected component as the update direction, yielding the closest metric-gradient field under the chosen metric and sampling measure. The projection is defined variationally, characterized by a Poisson-type equation, and implemented through graph-based and amortized neural realizations that recover projected directions from samples. We show that the projected dynamics admit a Lyapunov potential and yield equilibrium-gap bounds with an explicit additive non-potentiality term. Controlled experiments validate the geometric mechanism, and CTDE benchmarks show improved stability and normalized return when HPML is used as a plug-in projection layer in MARL pipelines.

2605.18808 2026-05-20 cs.LG cs.AI cs.CL

Compositional Literary Primitives in Instruction-Tuned LLMs: Cross-Architectural SAE Features for Self, Style, and Affect

在指令微调的LLM中构建组合文学原语:跨架构SAE特征用于自我、风格和情感

Joao Paulo Cavalcante Presa, Savio Salvarino Teles de Oliveira

发表机构 * Federal University of Goias(戈亚斯联邦大学)

AI总结 本文通过稀疏自编码器研究了指令微调的LLM中组合文学原语的架构,发现四种特征类别,并通过跨架构SAE特征验证了自我、风格和情感的表达能力。

Comments 36 pages, 6 figures

详情
AI中文摘要

我们通过在中层残差流上使用稀疏自编码器,对两个指令微调的大型语言模型(Llama 3.1 8B-Instruct和Gemma 2 9B-IT)的文学原语组合架构进行了表征。四种特征类别出现:促进目标情感词的命名门,一个包含第一人称注册特征的十一自我簇,风格注册调节器(show-don't-tell和陌生化),以及仅由多特征引导产生的组合情感。在应用于27类情感分类法(Cowen-Keltner)的强制选择5-LLM判断小组中,Llama通过结合命名门、多特征食谱和单个自我特征引导实现了完全27/27覆盖;Gemma在adoration作为单一残差严格失败的情况下达到23/27。在随机判断中,每个单元格通过的概率约为$10^{-3}$,整个目录中两个种子假阳单元格的预期数量可忽略不计,因此观察到的覆盖度不一致于偶然。在严格与柔和判断对比中存在跨架构不对称性:在相同生成中,判断者在Llama输出上比在Gemma输出上更一致,因为Llama输出更直接地命名目标情感,而Gemma输出则通过场景和意象来唤起情感。两种架构都包含同时作为注册标记和情感发射器的自我特征,包括每个架构中一个最RLHF加载的自我特征,该特征在某一操作 regime 中增强机构Helper-AI人格,并在相同校准系数下产生可分类情感的输出。方法上,本文提出了一个三阶段验证流程(logit-lens,LLM-rate,5-LLM判断)并记录了文档化的反模式;总计算量为单GPU,大约每种情感特征发现循环15分钟。

英文摘要

We characterize a compositional architecture of literary primitives in two instruction-tuned large language models (Llama 3.1 8B-Instruct and Gemma 2 9B-IT) via sparse autoencoders on mid-depth residual streams. Four feature classes emerge: naming-gates that promote lexical tokens of a target affect, an eleven-self cluster of first-person register features, stylistic register modulators (show-don't-tell and defamiliarization), and compositional emotions that arise only from multi-feature steering. Under a forced-choice 5-LLM judge panel applied to a 27-category emotion taxonomy (Cowen-Keltner), Llama reaches full 27/27 coverage by combining naming-gates, multi-feature recipes, and single self-feature steering; Gemma reaches 23/27 with adoration as the single residual strict-fail. Under random judging, the per-cell pass probability is on the order of $10^{-3}$ and the expected number of two-seed false-positive cells across the catalog is negligible, so the observed coverage is not consistent with chance. A cross-architectural asymmetry sits in the strict-versus-soft judge contrast: on the same generations, judges agree more often on Llama outputs than on Gemma outputs because Llama outputs name the target affect more directly while Gemma outputs evoke it through scene and imagery. Both architectures contain self-features that serve simultaneously as register markers and as emotion emitters, including a single most-RLHF-loaded self-feature per architecture that intensifies the institutional Helper-AI persona at one operating regime and produces affect-categorizable output at the same calibrated coefficient. Methodologically, the paper presents a three-stage validation pipeline (logit-lens, LLM-rate, 5-LLM judge) with documented anti-patterns; the total compute is single-GPU and about 15 minutes per emotion-feature discovery cycle.

2605.18804 2026-05-20 cs.LG cs.AI

Adaptive Multi-Scale Goodness Aggregation for Forward-Forward Learning

自适应多尺度良度聚合用于前-前学习

Salar Beigzad, Vansh Verma

发表机构 * Computer Engineering University of St. Thomas Minnesota, USA(计算机工程 明尼苏达州圣汤姆斯大学)

AI总结 本文提出了一种自适应多尺度良度聚合(AMSGA)方法,通过改进局部学习神经网络的稳定性、鲁棒性和泛化能力,解决了原始前-前(FF)框架的局限性,实验表明在MNIST和Fashion-MNIST数据集上性能提升显著。

Comments 6 pages, 5 tables, IEEE format

详情
AI中文摘要

我们提出自适应多尺度良度聚合(AMSGA),一种新颖的前-前(FF)算法扩展,旨在提高局部学习神经网络的稳定性、鲁棒性和泛化能力。AMSGA通过引入多尺度良度聚合(局部、中间和全局表示)、自适应课程引导的困难负样本挖掘、层依赖的自适应阈值以及改进的优化稳定性warm-up余弦退火学习率调度,解决了原始FF框架的多个局限性。这些修改增强了FF范式,同时保持了其生物合理性和内存高效性。在MNIST和Fashion-MNIST上的实验表明,与基线FF算法相比,性能有显著提升,分别在MNIST和Fashion-MNIST上达到+1.45%和+1.50%的改进,而计算开销不大。我们的结果表明,当良度估计和训练动态精心设计时,局部学习方法可以变得更具竞争力。

英文摘要

We propose Adaptive Multi-Scale Goodness Aggregation (AMSGA), a novel extension of the Forward-Forward (FF) algorithm designed to improve stability, robustness, and generalization in local-learning neural networks. AMSGA addresses several limitations of the original FF framework by introducing multi-scale goodness aggregation across local, intermediate, and global representations; adaptive curriculum-guided hard negative mining; layer-dependent adaptive thresholds; and a warm-up cosine annealing learning-rate schedule for improved optimization stability. Together, these modifications strengthen the FF paradigm while preserving its biologically plausible and memory-efficient properties. Experiments on MNIST and Fashion-MNIST demonstrate consistent performance improvements over the baseline FF algorithm, achieving up to +1.45% improvement on MNIST and +1.50% improvement on Fashion-MNIST without significant computational overhead. Our results suggest that local learning methods can become substantially more competitive when goodness estimation and training dynamics are carefully designed.

2605.18801 2026-05-20 cs.AI cs.IR cs.LG

Position: Let's Develop Data Probes to Fundamentally Understand How Data Affects LLM Performance

位置:让我们开发数据探针,以根本理解数据如何影响大语言模型性能

Shiqiang Wang, Herbert Woisetschläger, Hans Arno Jacobsen, Mingyue Ji

发表机构 * Department of Computer Science, University of Exeter, UK(埃克塞特大学计算机科学系) Technical University of Munich, Germany(慕尼黑技术大学) Department of Electrical and Computer Engineering, University of Toronto, Canada(多伦多大学电气与计算机工程系) Department of Electrical and Computer Engineering, University of Florida, FL, USA(佛罗里达大学电气与计算机工程系)

AI总结 本文提出通过开发数据探针系统方法生成合成序列,以揭示数据特性对大语言模型性能、泛化能力和鲁棒性的影响,从而超越经验启发式方法。

Comments Accepted to ICML 2026 Position Paper Track

Journal ref Link to ICML record: https://icml.cc/virtual/2026/poster/67154

详情
AI中文摘要

数据对于大语言模型(LLMs)至关重要。然而,了解哪些数据对LLM工作流程的不同阶段(包括训练、微调、对齐、上下文学习等)有用,以及为什么有用,仍然是一个开放性问题。当前的方法依赖于对大型公共数据集进行大量实验来获得数据过滤和数据集构建的经验启发式方法。这些方法计算成本高,并且缺乏一种系统的方法来理解特定数据特性如何驱动LLM行为的本质。在本文的位置论文中,我们倡导开发系统方法来生成合成序列,这些序列由适当定义的随机过程生成,目的是当它们用于LLM工作流程的一个或多个阶段时,能够揭示有用的特点。我们将这些序列称为数据探针。通过观察LLM在数据探针上的行为,研究人员可以系统地研究数据特性如何影响模型性能、泛化能力和鲁棒性。探测序列表现出的统计特性可以通过理论概念(如典型集)来观察,这些概念被推广以描述LLM的行为。这种数据探针方法为揭示数据在LLM训练和推理中的基础作用提供了途径,超越了经验启发式方法。

英文摘要

Data is fundamental to large language models (LLMs). However, understanding of what makes certain data useful for different stages of an LLM workflow, including training, tuning, alignment, in-context learning, etc., and why, remains an open question. Current approaches rely heavily on extensive experimentation with large public datasets to obtain empirical heuristics for data filtering and dataset construction. These approaches are compute intensive and lack a principled way of understanding the essence of how specific data characteristics drive LLM behavior. In this position paper, we advocate for the need of developing systematic methodologies for generating synthetic sequences from appropriately defined random processes, with the goal that these sequences can reveal useful characteristics when they are used in one or multiple stages of the LLM workflow. We refer to such sequences as data probes. By observing LLM behavior on data probes, researchers can systematically conduct studies on how data characteristics influence model performance, generalization, and robustness. The probing sequences exhibit statistical properties that can be viewed using theoretical concepts, such as typical sets, which are generalized to describe the behaviors of LLMs. This data-probe approach provides a pathway for uncovering foundational insights into the role of data in LLM training and inference, beyond empirical heuristics.

2605.18800 2026-05-20 cs.LG cs.AI

Theory-optimal Quantization Based on Flatness

基于平坦度的理论最优量化

Xiusheng Huang, Zhe Li, Xuanwu Yin, Lu Wang, Yequan Wang, Dong Li, Emad Barsoum, Kang Liu

发表机构 * The Key Laboratory of Cognition and Decision Intelligence for Complex Systems, Institute of Automation, Chinese Academy of Sciences(认知与决策智能复杂系统重点实验室,自动化研究所,中国科学院) School of Artificial Intelligence, University of Chinese Academy of Sciences(中国科学院大学人工智能学院) Beijing Academy of Artificial Intelligence(北京人工智能研究院) AMD Ritzz-AI

AI总结 本文提出了一种基于平坦度的理论最优量化方法,通过分析量化误差与异常值之间的数学关系,引入了平坦度指标来量化异常值分布,并提出了双向对角量化框架BDQ,有效分散异常值模式,提升了大语言模型在低比特精度下的性能。

Comments 16 pages, 2 figures

详情
AI中文摘要

后训练量化已成为压缩和加速大型语言模型(LLMs)推理的广泛采用技术。LLMs量化的首要挑战源于激活异常值,这些异常值在低比特精度下显著降低模型性能。尽管近期方法试图通过跨特征维度的线性变换来缓解异常值,我们的分析表明,变换后的权重和激活仍然表现出持续的异常值模式,具有集中化的幅度分布。在本文中,我们首先建模量化误差与异常值之间的数学关系,然后引入一个新的指标平坦度来量化异常值的分布。基于此,我们推导出与平坦度相关的理论最优解。基于这些见解,我们提出了双向对角量化(BDQ),一种新的后训练量化框架,通过优化的矩阵变换有效分散异常值模式。BDQ通过学习的对角操作策略性地将异常值幅度分布到矩阵维度中。广泛的实验表明,BDQ建立了新的量化基准。在LLaMA-3-8B模型上,BDQ在W4A4量化中实现了小于1%的精度下降。在更具挑战性的W2A4KV16实验中,与最先进的方法相比,BDQ在DeepSeek-R1-Distill-LLaMA-70B模型上将性能差距减少了39.1%。

英文摘要

Post-training quantization has emerged as a widely adopted technique for compressing and accelerating the inference of Large Language Models (LLMs). The primary challenges in LLMs quantization stem from activation outliers, which significantly degrade model performance especially at lower bit precision. While recent approaches attempt to mitigate outliers through linear transformations across feature dimensions, our analysis reveals that the transformed weights and activations still exhibit persistent outlier patterns with concentrated magnitude distributions. In this paper, we first model the mathematical relationship between quantization error and outliers, and then introduce a new metric Flatness to quantify the distribution of outliers. Based on this, we derive the theoretical optimal solution with respect to Flatness. Building on these insights, we propose Bidirectional Diagonal Quantization (BDQ), a novel post-training quantization framework that effectively disperses outlier patterns through optimized matrix transformations. BDQ strategically distributes outlier magnitudes across matrix dimensions via learned diagonal operations. Extensive experiments demonstrate that BDQ establishes a new quantization benchmark. It achieves less than 1\% accuracy drop in W4A4 quantization on the LLaMA-3-8B model. In the more challenging W2A4KV16 experiment, compared to state-of-the-art approaches, BDQ reduces the performance gap by 39.1\% on the DeepSeek-R1-Distill-LLaMA-70B model.

2605.18799 2026-05-20 cs.LG cs.AI cs.CL

ReCrit: Transition-Aware Reinforcement Learning for Scientific Critic Reasoning

ReCrit: 基于过渡意识的强化学习用于科学批评推理

Wanghan Xu, Yuhao Zhou, Hengyuan Zhao, Shuo Li, Dianzhi Yu, Zhenfei Yin, Yaowen Hu, Fengli Xu, Wanli Ouyang, Wenlong Zhang, Lei Bai

发表机构 * Shanghai Jiao Tong University(上海交通大学) Shanghai Artificial Intelligence Laboratory(上海人工智能实验室) National University of Singapore(新加坡国立大学) Chinese University of Hong Kong(香港中文大学) University of Oxford(牛津大学) Tsinghua University(清华大学)

AI总结 该研究提出ReCrit框架,通过强化学习解决科学批评推理中的过渡意识问题,改进了批评准确性。

详情
AI中文摘要

大型语言模型在批评交互中不仅可能因回答错误而失败,还可能在用户批评后放弃最初正确的科学解答。在科学推理中,这种风险尤为突出,因为用户的批评可能将正确答案变为错误答案。我们将批评交互视为跨回合正确性过渡问题,而非最终答案准确性问题,并识别出三个挑战:过渡意识、解耦有用的修正与有害的阿谀奉承,以及可扩展的回放。我们提出了ReCrit,一个基于过渡意识的强化学习框架,将初始到批评行为分解为四个象限:修正、阿谀奉承、鲁棒性和边界。ReCrit奖励修正和鲁棒性,惩罚阿谀奉承,并将持续错误视为弱边界信号。为了使交互训练实用,ReCrit进一步使用动态异步回放与尾部自适应完成以减少回放等待。在三个科学推理基准测试(ChemBench、TRQA和EarthSE)上,ReCrit在Qwen3.5-4B上将平均批评准确性从38.15提升到51.49,在Qwen3.5-9B上从45.40提升到55.59。消融实验显示,最终答案奖励提供很少的交互层面增益,而基于过渡意识的奖励和象限加权产生更可区分的训练信号和更大的净批评阶段改进。代码可在https://github.com/black-yt/ReCrit获取。

英文摘要

Large language models can fail in critic interaction not only by answering incorrectly, but also by abandoning an initially correct scientific solution after user criticism. This is especially risky in scientific reasoning, where user criticism can turn a valid answer into an incorrect one. We frame critic interaction as an inter-turn correctness-transition problem rather than a final-answer accuracy problem, and identify three challenges: transition awareness, decoupling useful correction from harmful sycophancy, and scalable rollout. We propose ReCrit, a transition-aware reinforcement learning framework that decomposes Initial-to-Critic behavior into four quadrants: Correction, Sycophancy, Robustness, and Boundary. ReCrit rewards correction and robustness, penalizes sycophancy, and treats persistent errors as weak boundary signals. To make interaction training practical, ReCrit further uses dynamic asynchronous rollout with tail-adaptive completion to reduce rollout waiting. On three scientific reasoning benchmarks, ChemBench, TRQA, and EarthSE, ReCrit improves average Critic accuracy from 38.15 to 51.49 on Qwen3.5-4B and from 45.40 to 55.59 on Qwen3.5-9B. Ablations show that final-answer rewards provide little interaction-level gain, while transition-aware rewards and quadrant weighting produce more distinguishable training signals and larger net Critic-stage improvement. The code is available at https://github.com/black-yt/ReCrit .

2605.18798 2026-05-20 cs.LG cs.IT math.IT math.ST stat.ML stat.TH

Accurate Evaluation of Quickest Changepoint Detectors via Non-parametric Survival Analysis

通过非参数生存分析准确评估最快突变点检测器

Taiki Miyagawa, Akinori F. Ebihara

发表机构 * NEC Corporation(日本NEC公司)

AI总结 本文提出非参数估计方法用于快速突变点检测中的平均运行长度和平均检测延迟,通过将突变点检测与生存分析类比,解决了有限和不规则序列长度下的估计问题,提升了模型的鲁棒性和可解释性。

Comments Accepted to ICML 2026. GitHub: https://github.com/TaikiMiyagawa/Kaplan-Meier-Average-Run-Length

详情
AI中文摘要

我们提出非参数估计器用于在有限和不规则序列长度下快速突变点检测(QCD)中的平均运行长度(ARL)和平均检测延迟(ADD)。尽管ARL和ADD广泛用于理论和模拟研究中的最优性标准,但它们在实际数据集中的应用受到有限和不规则序列长度的限制。为了解决这个问题,我们通过将QCD与生存分析类比,提出非参数估计器ARL和ADD,称为KM-ARL和KM-ADD,以建模序列截断下的检测概率。我们推导了估计偏差界限,并证明除非需要外推,否则它们在渐近上是无偏的。在模拟和实际数据集上的实验展示了其实际用途,增强了对有限和不规则序列长度的鲁棒性,提高了可解释性,并促进了经验、直观的模型选择。我们的Python代码可在https://github.com/TaikiMiyagawa/Kaplan-Meier-Average-Run-Length提供,为从业者提供了即用型实现。

英文摘要

We propose non-parametric estimators for the average run length (ARL) and average detection delay (ADD) in quickest changepoint detection (QCD) under finite and irregular sequence lengths. Although ARL and ADD are widely used as optimality criteria in theoretical and simulation studies, their application to real-world datasets is hindered by limited and irregular sequence lengths. To address this issue, we propose non-parametric estimators for the ARL and ADD, termed KM-ARL and KM-ADD, by drawing an analogy between QCD and survival analysis to model detection probabilities under sequence truncation. We derive estimation bias bounds and prove that they are asymptotically unbiased unless extrapolation is required. Experiments on simulated and real-world datasets demonstrate their practical utility, enhancing robustness against limited and irregular sequence lengths, improving interpretability, and facilitating empirical, intuitive model selection. Our Python code is provided at https://github.com/TaikiMiyagawa/Kaplan-Meier-Average-Run-Length, offering ready-to-use implementations for practitioners.

2605.18796 2026-05-20 cs.LG cs.CL

UCCI: Calibrated Uncertainty for Cost-Optimal LLM Cascade Routing

UCCI:用于成本最优LLM级联路由的校准不确定性

Varun Kotte

发表机构 * Independent Researcher(独立研究者)

AI总结 本文提出UCCI,一种以校准为核心的路由方法,通过异质回归将token层面的边际不确定性映射到查询级误差概率,并通过约束成本最小化选择升级阈值。在三个显式假设下,阈值策略在校准分数上是成本最优的,异质校准在期望校准误差(ECE)上实现O(n^{-1/3})的样本复杂度。在75000个生产命名实体识别工作负载上,UCCI将推理成本降低了31%(95%CI:[27%, 35%]),同时将ECE从0.12降低到0.03。

Comments 9 pages, 2 figures, 4 tables. Code: https://github.com/varunkotte6/ucci

详情
AI中文摘要

LLM级联和模型路由通过将简单查询发送到小型模型并升级困难查询到大型模型来降低推理成本,但大多数部署的路由器使用未校准的置信度分数并需要每个工作负载的阈值调整。我们提出了UCCI,一种以校准为核心的路由器,通过异质回归将token层面的边际不确定性映射到查询级误差概率,并通过约束成本最小化选择升级阈值。在三个显式假设下,阈值策略在校准分数上是成本最优的,异质校准在期望校准误差(ECE)上实现O(n^{-1/3})的样本复杂度。在75000个生产命名实体识别工作负载上,UCCI将推理成本降低了31%(95%CI:[27%, 35%]),同时将ECE从0.12降低到0.03。在相同的操作点上,UCCI优于熵阈值法、分割置信路由以及FrugalGPT风格的学习阈值。所有级联结果均使用实际模型输出和测量的H100延迟进行端到端路由,而不是基于全局准确率或名义API价格的模拟路由。

英文摘要

LLM cascades and model routing promise lower inference cost by sending easy queries to a small model and escalating hard ones to a large model, but most deployed routers use uncalibrated confidence scores and require per-workload threshold tuning. We present UCCI, a calibration-first router that maps token-level margin uncertainty to a per-query error probability via isotonic regression and selects the escalation threshold by constrained cost minimization. Under three explicit assumptions, threshold policies on the calibrated score are cost-optimal, and isotonic calibration achieves O(n^{-1/3}) sample complexity for expected calibration error (ECE). On a production named entity recognition workload of 75,000 queries served by 4B and 12B instruction-tuned LLMs on H100 GPUs, UCCI cuts inference cost by 31% (95% CI: [27%, 35%]) at micro-F1 = 0.91 while reducing ECE from 0.12 to 0.03. At the same operating point, UCCI beats entropy thresholding, split-conformal routing, and a FrugalGPT-style learned threshold. All cascade results use end-to-end routing on actual model outputs and measured H100 latency, not simulated routing from global accuracies or nominal API prices.

2605.18795 2026-05-20 cs.LG cs.AI

HELLoRA: Hot Experts Layer-Level Low-Rank Adaptation for Mixture-of-Experts Models

HELLoRA: Hot Experts Layer-Level Low-Rank Adaptation for Mixture-of-Experts Models

Jia Wei, Zhonghao Zhang, Ping Chen, Qianyang li, Yancheng Pan, Shaoxun Wang, Ziyi Qiu, Longxiang Wang

发表机构 * Department of Computer Science and Technlogy(计算机科学与技术系) Tsinghua University(清华大学) School of Computer Science and Technlogy(计算机科学与技术系) Xi’an Jiaotong University(西安交通大学) The State Key Laboratory of Blockchain and Data Security, Zhejiang University(区块链与数据安全国家重点实验室,浙江大学) Hangzhou High-Tech Zone (Binjiang) Institute of Blockchain and Data Security(杭州高科技区(滨江)区块链与数据安全研究院)

AI总结 本文提出HELLoRA,一种针对混合专家模型的层级低秩适应方法,通过仅对最活跃的专家添加LoRA模块,减少可训练参数和计算量,同时提升下游任务性能。

详情
AI中文摘要

Low-Rank Adaptation (LoRA) dominates parameter-efficient fine-tuning of large language models, yet most variants target dense architectures. Mixture-of-Experts (MoE) models scale parameters at near-constant per-token compute, and their sparse activation patterns create untapped opportunities for more efficient adaptation. We propose Hot-Experts Layer-level Low-Rank Adaptation (HELLoRA), which attaches LoRA modules only to the most frequently activated experts at each layer. This simple mechanism reduces trainable parameters and adapter-induced FLOPs while improving downstream performance, an effect we attribute to a form of structured regularization that preserves pretrained expert specialization. To stress-test HELLoRA under extreme parameter budgets, we further compose it with LoRI to form HELLoRI, which freezes the up-projection and sparsifies the down-projection. Across three MoE backbones, namely OlMoE-1B-7B, Mixtral-8x7B, and DeepSeekMoE, and three task families covering mathematical reasoning, code generation, and safety alignment, HELLoRA consistently outperforms strong PEFT baselines. Relative to vanilla LoRA on OlMoE, HELLoRA uses 15.7% of the trainable parameters, reduces adapter FLOPs by 38.7%, achieves 1.9x the training throughput, and improves accuracy by 9.2%. On DeepSeekMoE, HELLoRA outperforms LoRA while using only 23.2% of its trainable parameters. These results demonstrate that activation-aware adapter placement is an effective and practical route to scaling PEFT for MoE language models.

英文摘要

Low-Rank Adaptation (LoRA) dominates parameter-efficient fine-tuning of large language models, yet most variants target dense architectures. Mixture-of-Experts (MoE) models scale parameters at near-constant per-token compute, and their sparse activation patterns create untapped opportunities for more efficient adaptation. We propose Hot-Experts Layer-level Low-Rank Adaptation (HELLoRA), which attaches LoRA modules only to the most frequently activated experts at each layer. This simple mechanism reduces trainable parameters and adapter-induced FLOPs while improving downstream performance, an effect we attribute to a form of structured regularization that preserves pretrained expert specialization. To stress-test HELLoRA under extreme parameter budgets, we further compose it with LoRI to form HELLoRI, which freezes the up-projection and sparsifies the down-projection. Across three MoE backbones, namely OlMoE-1B-7B, Mixtral-8x7B, and DeepSeekMoE, and three task families covering mathematical reasoning, code generation, and safety alignment, HELLoRA consistently outperforms strong PEFT baselines. Relative to vanilla LoRA on OlMoE, HELLoRA uses 15.7% of the trainable parameters, reduces adapter FLOPs by 38.7%, achieves 1.9x the training throughput, and improves accuracy by 9.2%. On DeepSeekMoE, HELLoRA outperforms LoRA while using only 23.2% of its trainable parameters. These results demonstrate that activation-aware adapter placement is an effective and practical route to scaling PEFT for MoE language models.