arXivDaily: arXiv daily academic digest, updated Monday to Friday

1. Deep Learning Architectures and Training Methods (17 papers)

2606.18283 2026-06-18 cs.LG New submission

Gaussian Mixture Attention: Linear-Time Sequence Mixing via Probabilistic Latent Routing

Yongchao Huang, Hassan Raza

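AI summary: Proposes Gaussian Mixture Attention (GMA), which replaces pairwise query-key comparison with latent routing through K Gaussian mixture components, giving memory that scales linearly in sequence length for fixed K; it is competitive with attention baselines on long-context classification.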

Comments 55 pages

Abstract

The dense token-to-token interaction pattern of standard dot-product attention remains a central bottleneck in scaling Transformer architectures to long contexts. We introduce Gaussian Mixture Attention (GMA), a probabilistic attention-style sequence mixer that replaces explicit pairwise query-key comparison with routing through $K$ learned Gaussian mixture components. Queries and keys are mapped to posterior responsibility vectors over a shared latent routing space; their overlap defines an implicit responsibility-space affinity, while values are written into and read from a $K$-slot latent memory. By exploiting the associativity of matrix multiplication, GMA avoids materializing the induced $N\times N$ affinity matrix and instead uses two responsibility matrices whose dominant activation storage scales as $\mathcal{O}(NK)$ rather than $\mathcal{O}(N^2)$ for fixed $K$. We formulate bidirectional and causal variants of GMA, provide an end-to-end differentiable parameterization of the Gaussian mixture components, and analyze its responsibility-modulated gradient structure, constrained non-negative low-rank affinity interpretation, and local routing stability. Empirically, GMA exhibits the intended fixed-$K$ linear memory scaling and is competitive with attention-style baselines on long-context classification, while causal GMA improves over tested linear/random-feature attention variants on WikiText-103 but remains behind optimized causal SDPA and Mamba in the current implementation. Analysis of learned responsibilities further shows broad component usage and moderate alignment with surface-form token categories, supporting GMA as a probabilistic, interpretable, fixed-$K$ linear-time attention-style alternative rather than a universal replacement for optimized softmax attention or state-space models.
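
The associativity argument in this abstract can be illustrated with a minimal NumPy sketch of a bidirectional variant. Everything beyond what the abstract states is an assumption made for illustration: isotropic Gaussian components, a single shared set of components for queries and keys, no output normalization, and the function names `responsibilities` and `gma_bidirectional`. It is not the authors' implementation.

```python
import numpy as np

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def responsibilities(x, means, log_var, log_pi):
    """Posterior p(component | x) under K isotropic Gaussians (assumed form); shape (N, K)."""
    d = x.shape[-1]
    sq_dist = ((x[:, None, :] - means[None, :, :]) ** 2).sum(-1)  # (N, K)
    # Log-density up to an additive constant that cancels in the softmax.
    log_prob = -0.5 * (sq_dist / np.exp(log_var) + d * log_var)
    return softmax(log_pi + log_prob, axis=-1)

def gma_bidirectional(q, k, v, means, log_var, log_pi):
    """Mix values through a K-slot memory without forming the N x N affinity."""
    r_q = responsibilities(q, means, log_var, log_pi)  # (N, K)
    r_k = responsibilities(k, means, log_var, log_pi)  # (N, K)
    memory = r_k.T @ v      # (K, d_v): write values into K slots
    return r_q @ memory     # (N, d_v): read from the slots

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n, n_comp, dim = 12, 4, 3
    q, k, v = (rng.normal(size=(n, dim)) for _ in range(3))
    means = rng.normal(size=(n_comp, dim))
    out = gma_bidirectional(q, k, v, means, np.zeros(n_comp), np.log(np.full(n_comp, 1 / n_comp)))
    print(out.shape)
```

Computing `r_q @ (r_k.T @ v)` instead of `(r_q @ r_k.T) @ v` gives the same result by associativity while storing only the two `(N, K)` responsibility matrices and the `(K, d_v)` memory.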

2606.18315 2026-06-18 cs.LG cs.AI New submission

Ghost Attractor Networks: Basin-Structured Dynamical Decoders for Closed-Loop Sequential Generation

Tianyu Wang, Ying Wang, Zhihao Liu, Xi Vincent Wang, Lihui Wang

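Affiliations: KTH Royal Institute of Technology; Department of Production Engineering, KTH Royal Institute of Technology; Department of Decision and Control Systems, KTH Royal Institute of Technology

AI summary: Proposes Ghost Attractor Networks, a theoretically derived dynamical decoder that produces a basin-attractor structure by construction for efficient closed-loop sequential generation. As a robotic action decoder, a 2.3M-parameter model matches the offline accuracy of a 1.07B-parameter Diffusion Transformer at 32 times lower latency.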

Abstract

Sequential output generation with large-scale Transformer and diffusion decoders pays a memory cost that grows with sequence length, plus iterative per-step computation. Replacing them with small feed-forward decoders restores efficiency but produces unstructured latent representations that limit closed-loop control: phase-conditioned action generation and cross-step latent carry-over both require a latent geometry with stable basins. This article proposes Ghost Attractor Networks, a theoretically derived dynamical decoder whose latent evolves under a learned potential with drift and produces a basin-attractor structure by construction. Three desiderata (multi-modality, decoder-level single-pass switching, and constant memory) motivate the potential-drift form, and mode transitions arise as saddle-node bifurcations with ghost-attractor escape. A hierarchical phase-space decomposition separates first-order basin convergence from second-order proprioceptive refinement. Empirically, a Ghost trained end-to-end with a behavioral-cloning and contrastive objective exhibits the predicted gradient-flow contraction in its potential, with the gradient norm decaying by 67 percent across five integration steps on 1430 held-out samples. Ghost is evaluated as a robotic action decoder. A 2.3-million-parameter Ghost matches the offline accuracy of a 1.07-billion-parameter Diffusion Transformer at 462 times fewer parameters and 32 times lower latency, and beats five alternative 2M-parameter decoders (MLP, Neural ODE, CVAE, Transformer, 1-step Diffusion) on offline mean squared error by 5.9 to 29 percent. On the LIBERO-10 closed-loop benchmark, phase conditioning on Ghost's basin-structured latent yields a 13.5 percentage-point success-rate gain over a feed-forward MLP baseline, and persistent-latent ensembling reaches a 95.7 percent final success rate.

2606.18324 2026-06-18 cs.LG cs.AI New submission

Why SWAVE May Not Be All You Need: A Concept-Evolution Retrospective on Complex-Valued Recurrent Language Models

Ramprasath Ganesaraja, Swathika N, Sahil Dilip Panse

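Affiliations: EdgeVerve Systems Limited

AI summary: A retrospective on the evolution of SWave, a complex-valued recurrent language model, that exposes flaws in its design premises and contributes a characterisation of cos-domination collapse along with transferable engineering principles.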

Abstract

SWave is a complex-valued recurrent language model (169.26M parameters, D=384, L=16, T=2048) trained on FineWeb-Edu using 2xH100 NVL. It was designed around three founding premises: that representing language as complex waves rather than real-valued numbers enables richer information encoding; that a Cayley-parameterised unitary transition provides a mathematical guarantee against state decay or explosion; and that a hidden state which rotates rather than shrinks preserves signal integrity over arbitrarily long contexts. The core of SWave evolved substantially across three development phases. The Resonance Head was found to structurally admit imaginary-channel collapse as a global loss minimum (a failure mode we term cos-domination collapse) and was superseded by an untied head with independent real and imaginary embedding tables from the Phase-Associative Memory (PAM) architecture. This resolved the degenerate minimum and enabled stable 200,000-step training (best-step PPL 22.0 at step 89,861). ComplexNorm and the Wave Propagation Scan proved load-bearing throughout all three phases and were retained to the final architecture. ProtectGatedScan was reframed as a structural prior rather than a learned behaviour. The four multi-scale retention concepts showed no measurable improvement under controlled evaluation and were found non-load-bearing. The ComplexGatedUnit was superseded by a real-valued squared-ReLU channel mixer with fewer parameters. The auxiliary training objectives showed no benefit once structural constraints were resolved. The investigation yields a formal characterisation of cos-domination collapse, a parallel scan with a log-space backward pass for numerical stability, six transferable engineering principles for complex-valued recurrent training, and a plan-to-code traceability methodology for catching structural divergences that conventional test suites miss.
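
The second founding premise, a Cayley-parameterised unitary transition, can be illustrated with the standard Cayley transform. The sketch below is an assumption-laden illustration, not SWave's code: the helper name `cayley_unitary`, the dense parameter matrix, and the ungated recurrence are all hypothetical. It only shows why such a transition neither shrinks nor inflates the state norm.

```python
import numpy as np

def cayley_unitary(params):
    """Map an arbitrary complex square matrix to a unitary one via the Cayley transform."""
    a = params - params.conj().T              # skew-Hermitian part: a^H = -a
    eye = np.eye(a.shape[0], dtype=complex)
    # (I + a) is invertible because a skew-Hermitian matrix has purely
    # imaginary eigenvalues, so I + a has eigenvalues 1 + i*t != 0.
    return np.linalg.solve(eye + a, eye - a)  # U = (I + a)^{-1} (I - a)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    params = rng.normal(size=(6, 6)) + 1j * rng.normal(size=(6, 6))
    u = cayley_unitary(params)
    h = rng.normal(size=6) + 1j * rng.normal(size=6)
    start = np.linalg.norm(h)
    for _ in range(1000):                     # ungated recurrence h <- U h
        h = u @ h
    print(start, np.linalg.norm(h))           # the two norms agree up to rounding
```

Norm preservation holds for this bare recurrence only; the abstract's point is that the surrounding components (output head, gating, normalisation) determined whether the full model trained stably.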

2606.18326 2026-06-18 cs.LG New submission

Neural Network Implementation of the Renormalization Group for Fault Diagnosis with Class Imbalance

Evgeny Nikulchev, Dmitry Ilin

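Affiliations: MIREA – Russian Technological University

AI summary: Proposes RGNet, a neural network architecture based on the renormalization group that hierarchically coarse-grains the feature space to handle class imbalance and multidimensional noise, evaluated on the AI4I dataset.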

Comments 8 pages

Abstract

The application of machine learning models in practical tasks faces challenges such as class imbalance and multidimensional noise. This paper proposes RGNet, a neural network architecture based on the concept of the renormalization group (RG), for hierarchical coarse-graining of the feature space. The model sequentially compresses the input dimensionality and concatenates all scales before classification, allowing it to capture both local details and global patterns. The notion of RG-flows is introduced - interpretable low-dimensional representations whose visualization via t-SNE reveals a discrete curvilinear structure confirming the effectiveness of coarse-graining. Experimental results are presented on the imbalanced AI4I dataset. The obtained results demonstrate that RGNet is a universal, interpretable, and competitive solution for fault prediction in applications with imbalanced classes.

2606.18388 2026-06-18 cs.LG cs.AI cs.CL cs.MA New submission

LLMZero: Discovering Adaptive Training Strategies for RL Post-Training via LLM Agents

Haoyang Fang, Wei Zhu, Boran Han, Alex Zhang, Zhenyu Pan, Shuo Yang, Shuai Zhang, Jiading Gai, Peng Tang, Cuixiong Hu, Xuan Zhu, Huzefa Rangwala, George Karypis, Bernie Wang

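Affiliations: Amazon

AI summary: Proposes LLMZero, in which LLM agents use tree search to discover adaptive strategies for multi-stage RL post-training. It finds that capacity parameters accumulate monotonically while regularization parameters oscillate, and improves over the base model by 9% to 140% relative across 4 GRPO tasks.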

Abstract

RL post-training strategies are dataset-dependent and reveal a recurring empirical pattern: capacity parameters accumulate monotonically across stages, while regularization parameters predominantly oscillate in response to shifting training dynamics. This distinction matters because fixed schedules commit all parameters to fixed trajectories and therefore cannot express the non-stationary exploration-exploitation tradeoffs that regularization must track; the principle provides actionable design rules for multi-stage training. We discover this through LLMZero, a system where LLM agents search over training trajectories via tree search, diagnosing pathologies at each checkpoint and proposing coordinated multi-parameter transitions. Across 4 diverse GRPO tasks, LLMZero discovers strategies that improve over the base model by 9% to 140% relative and over grid search by 6% to 15% relative, consistently outperforming random search and the skill-based agent. The structural principle transfers across tasks, providing an explanation for why discovered strategies take qualitatively different forms yet share similar parameter dynamics.

2606.18457 2026-06-18 cs.LG New submission

Task-Restricted Symmetries in Recurrent Weight Space

Simon Dräger

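Affiliations: Salk Institute for Biological Studies, La Jolla, CA, USA

AI summary: Analyzes one-layer tanh RNNs in ordered real Schur coordinates and finds functional redundancy in the recurrent matrix on a task distribution: in some trained solutions, selected nonnormal Schur couplings can be removed with little loss, pointing to task-restricted approximate functional invariances.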

Comments 6 pages, 2 figures. Accepted at the ICML 2026 Workshop on Weight-Space Symmetries

Abstract

Recurrent networks can contain substantial functional redundancy in weight space: changing a recurrent matrix may leave the input-output rollout nearly unchanged on a task distribution, while similar-scale changes can destroy the same behavior. We study this redundancy in one-layer tanh RNNs using ordered real Schur coordinates. The Schur form separates spectral blocks from directed nonnormal couplings, giving a diagnostic basis for structured ablations that keep the input and readout maps fixed. In a fixed-length copy task, selected nonnormal Schur couplings can be removed with little loss in some trained solutions, whereas other couplings are necessary for accurate autonomous replay. Across flip-flop, sine generation, and context-dependent integration, the loss-preserving ablation profile varies across tasks and trained solutions. These results identify candidate approximate functional invariances, not universal symmetries of recurrent weight space. Schur-coordinate ablations provide a practical diagnostic for which structured perturbations preserve a trained recurrent solution and which ones disrupt its computation.

2606.18487 2026-06-18 cs.LG cs.AI cs.CL New submission

SFT Overtraining Predicts Rank Inversion via Entropy Collapse Under RLVR

Siddharth Aphale, Kelly Liu

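Affiliations: Stanford University

AI summary: Finds that SFT overtraining lowers the entropy of the rollout distribution, so the advantage signal in GRPO vanishes and checkpoint rankings invert; proposes a two-stage entropy-based diagnostic that flags high-risk checkpoints early.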

Comments 14 pages, 6 figures. Accepted at the Deep Learning for Code (DL4C) Workshop at ICML 2026

Abstract

The standard heuristic of selecting the SFT checkpoint with the highest pass@1 for GRPO can fail when SFT compresses the rollout distribution. For binary rewards, the expected within-group advantage variance is $p(1{-}p)(g{-}1)/g$; when early GRPO drives $p$ below $p^*(g)$, most groups have identical rewards and provide no group-relative signal. We study SFT depth ladders for Qwen2.5-Coder-3B and DeepSeek-Coder-6.7B. We test Qwen2.5-Coder-3B across five depths and three seeds, and DeepSeek-Coder-6.7B across four matched depths and three seeds. On Qwen, pre-RL pass@1 rises with SFT depth, but peak GRPO pass@10 falls from $0.806$ to $0.481$ (3-seed mean, $n{=}20$); pre-RL entropy is positively associated with the GRPO outcome ($\rho{=}{+}0.69$). On DeepSeek, pass@1 remains far above $p^*(8){=}0.083$, and GRPO outcomes compress rather than invert. A two-stage diagnostic, combining pre-RL entropy triage with an early GRPO entropy monitor, flags high-risk checkpoints and can stop failing runs early. Simple KL-to-reference regularisation and label-smoothing variants do not rescue the collapsed Qwen checkpoint in our setting, suggesting the failure is not a trivial GRPO hyperparameter artefact.
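
The variance expression in this abstract can be checked numerically. The sketch below assumes that the $g$ rewards in a group are i.i.d. Bernoulli($p$) and that the variance uses $1/g$ normalization; both are assumptions, as are the helper names. The abstract does not define $p^*(g)$; one reading that matches the quoted $p^*(8) = 0.083$ is the success rate at which half of all groups fail on every rollout, $1 - 2^{-1/g}$, and the code labels it as a guess.

```python
from math import comb

def expected_within_group_variance(p, g):
    """Exact E[(1/g) * sum_i (r_i - mean(r))^2] for g i.i.d. Bernoulli(p) rewards."""
    total = 0.0
    for k in range(g + 1):                 # k = number of successes in the group
        prob = comb(g, k) * p**k * (1 - p) ** (g - k)
        frac = k / g
        total += prob * frac * (1 - frac)  # 1/g-normalized variance of a 0/1 vector
    return total

def closed_form(p, g):
    """The expression quoted in the abstract."""
    return p * (1 - p) * (g - 1) / g

def prob_uniform_group(p, g):
    """Probability that all g rewards coincide, so every group-relative advantage is zero."""
    return p**g + (1 - p) ** g

def p_star_guess(g):
    """Hypothetical reading of p*(g): the p at which P(all rollouts fail) = 1/2."""
    return 1 - 2 ** (-1 / g)

if __name__ == "__main__":
    for p in (0.02, 0.083, 0.3, 0.6):
        print(p, expected_within_group_variance(p, 8), closed_form(p, 8), prob_uniform_group(p, 8))
    print(p_star_guess(8))
```

Under this reading, once $p$ falls below the threshold more than half of the sampled groups carry no gradient signal, which is the mechanism the abstract describes.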

2606.18521 2026-06-18 cs.LG cs.AI New submission

Sparsity Curse: Understanding RLVR Model Parameter Space from Model Merging

Chenrui Wu, Zexi Li, Jiajun Bu, Jiangchuan Liu, Haishuai Wang

Affiliations: Zhejiang University; Simon Fraser University; The Chinese University of Hong Kong; Zhejiang Key Lab of Accessible Perception and Intelligent Systems

AI summary: Finds that the sparse updates of RLVR models are spread farther apart in parameter space, forming near-orthogonal shortcuts that make merging fragile, and proposes SAR-Merging to address this.

Comments Accepted by KDD 2026

Abstract

Reinforcement Learning with Verifiable Reward (RLVR) has emerged as a powerful post-training paradigm that surpasses Supervised Fine-Tuning (SFT) in eliciting reasoning intelligence and resisting catastrophic forgetting. Recent studies further reveal that RLVR induces highly sparse and off-principal parameter updates compared to SFT. This naturally raises the question: does such sparsity make RLVR models more amenable to model merging? If so, model merging would offer a scalable, training-free path to aggregate diverse reasoning capabilities from independently trained RLVR models. Surprisingly, we find the opposite, uncovering a sparsity curse: the sparse RLVR updates are spread farther apart in parameter space, forming near-orthogonal shortcuts that make aggregation inherently fragile. This is likely rooted in the stochasticity of RL optimization and the diversity of emergent reasoning patterns. Unlike SFT models that converge to shared, flat basins and merge naturally, RLVR models suffer severe degradation under standard merging methods. Through systematic empirical analysis of the update geometry, we characterize the mechanisms behind this failure and propose Sensitivity-aware Resolving Merging (SAR-Merging), a merging recipe tailored for the unique structure of RLVR parameter spaces. SAR-Merging resolves conflicts in overlapping update regions via Fisher Information-based sensitivity arbitration, followed by magnitude-aware sparsification and rescaling to preserve fragile reasoning pathways. Experiments on mathematical and coding benchmarks demonstrate that SAR-Merging substantially outperforms existing merging methods on RLVR models, enabling both single-task enhancement and multi-capability fusion.

2606.18524 2026-06-18 cs.LG New submission

On the Residual Scaling of Looped Transformers: Stability and Transferability

Shaowen Wang, Bingrui Li, Ge Zhang, Wenhao Huang, Shen Yan, Jian Li

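Affiliations: Tsinghua University

AI summary: For looped Transformers, argues that the residual scaling factor should be 1/N rather than 1/√L, and derives a factored parameterization for multi-layer blocks that lets hyperparameters transfer from small to large loop counts.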

Comments 19 pages, 9 figures

Abstract

Looped (weight-tied) Transformers apply a shared residual block $N$ times ($h \leftarrow h + \varepsilon\,f(h)$, same $f$ at each step), increasing effective depth without adding parameters. Prior depth-scaling analyses prescribe $\varepsilon = 1/\!\sqrt{L}$ for depth-$L$ residual networks. We show that this is insufficient for looped architectures: weight sharing makes residual updates correlated across iterations, requiring the stronger scaling $\varepsilon = 1/N$. For multi-layer blocks ($L$ unique layers looped $N$ times), we derive a factored parameterization $\varepsilon = \lambda/(N\!\sqrt{L})$ that separates the two sources of growth: $1/N$ controls the within-layer loop correlation, and $1/\!\sqrt{L}$ controls the across-layer variance. A key consequence is that the optimal learning rate depends only on the number of unique layers $L$, not on the loop count $N$, enabling direct hyperparameter transfer from small to large $N$ without retuning. Experiments on looped Transformers confirm that $1/N$ scaling improves trainability and yields better loss than $1/\!\sqrt{N}$ scaling across loop counts.
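
A toy example makes the correlated-update argument concrete. The sketch below is an illustration under stated assumptions, not the paper's experiment: it uses the identity map as a stand-in for the shared block $f$ (the fully correlated worst case), and the names `looped_forward` and `factored_eps` are hypothetical.

```python
import numpy as np

def looped_forward(h, f, n_loops, eps):
    """Apply the same residual block n_loops times: h <- h + eps * f(h)."""
    for _ in range(n_loops):
        h = h + eps * f(h)
    return h

def factored_eps(n_loops, n_unique_layers, lam=1.0):
    """Factored scaling eps = lam / (N * sqrt(L)) quoted in the abstract."""
    return lam / (n_loops * np.sqrt(n_unique_layers))

if __name__ == "__main__":
    h0 = np.ones(4)
    shared_block = lambda h: h  # toy stand-in: every update points the same way
    for n in (4, 64, 1024):
        growth_inv_n = np.linalg.norm(looped_forward(h0, shared_block, n, 1.0 / n)) / np.linalg.norm(h0)
        growth_inv_sqrt = np.linalg.norm(looped_forward(h0, shared_block, n, 1.0 / np.sqrt(n))) / np.linalg.norm(h0)
        print(n, growth_inv_n, growth_inv_sqrt)
```

In this toy case the growth factor is $(1+\varepsilon)^N$: with $\varepsilon = 1/N$ it stays below $e$ for every $N$, while with $\varepsilon = 1/\sqrt{N}$ it grows without bound as $N$ increases.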

2606.18525 2026-06-18 cs.LG New submission

Hierarchical Attention via Domain Decomposition

Stephan Köhler, Oliver Rheinbach

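Affiliations: Faculty of Mathematics and Computer Science

AI summary: Proposes a hierarchical attention mechanism based on two-level overlapping Schwarz domain decomposition, combining local low-rank attention blocks with a coarse attention block; in the reported experiments it trains faster and is more accurate than a global low-rank baseline while using fewer parameters.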

Comments 20 pages, 10 figures

Abstract

We propose a hierarchical attention mechanism based on two-level overlapping Schwarz domain decomposition. The method is motivated by the observation that two-level Schwarz domain decomposition methods combine local subdomain corrections with a coarse level that communicates global, long-range information. We test its usefulness in the context of finite-dimensional operator learning using a simple, one-dimensional diffusion problem with homogeneous Dirichlet boundary conditions. Although elementary, this problem provides a controlled sequence-to-sequence setting in which the exact nonlocal solution operator is known. After discretization, learning the solution operator amounts to approximating the inverse of a symmetric positive definite matrix. As a baseline, we use a global softmax-free low-rank attention operator of the form $QK^T$. The proposed construction replaces this dense global factorization by a two-level additive structure: local low-rank attention blocks on overlapping subdomains are combined with a coarse attention block. The resulting operator has the form $$M_{\theta}^{-1} = \Phi Q_0 K_0^T \Phi^T + \sum_{i=1}^{N} R_i^T D_i^{1/2} Q_i K_i^T D_i^{1/2} R_i.$$ Here $R_i$ restricts to an overlapping subdomain, $D_i$ is a partition-of-unity weight, and $\Phi$ is a coarse interpolation (or prolongation) matrix. Numerical experiments for synthetic Fourier right-hand sides indicate that the domain-decomposition attention operator is able to train faster and can give more accurate approximations than a global low-rank attention baseline while using significantly fewer parameters.
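
The two-level additive operator in this abstract can be assembled directly. The sketch below is a hedged illustration rather than the authors' code: the subdomain layout, the inverse-multiplicity choice of the weights $D_i$, and all function names are assumptions, and the factors $Q_i$, $K_i$, and $\Phi$ are taken as given arrays rather than trained.

```python
import numpy as np

def overlapping_subdomains(n, n_sub, overlap):
    """Index sets of n_sub contiguous subdomains, each extended by `overlap` points."""
    edges = np.linspace(0, n, n_sub + 1).astype(int)
    return [np.arange(max(lo - overlap, 0), min(hi + overlap, n))
            for lo, hi in zip(edges[:-1], edges[1:])]

def restriction(idx, n):
    """Boolean restriction matrix R_i selecting the entries in idx."""
    r = np.zeros((len(idx), n))
    r[np.arange(len(idx)), idx] = 1.0
    return r

def partition_of_unity(subdomains, n):
    """Diagonal weights D_i with sum_i R_i^T D_i R_i = I (inverse multiplicity, assumed)."""
    count = np.zeros(n)
    for idx in subdomains:
        count[idx] += 1
    return [np.diag(1.0 / count[idx]) for idx in subdomains]

def dd_attention_operator(phi, q0, k0, rs, ds, qs, ks):
    """Phi Q0 K0^T Phi^T + sum_i R_i^T D_i^{1/2} Q_i K_i^T D_i^{1/2} R_i."""
    m = phi @ q0 @ k0.T @ phi.T               # coarse, global block
    for r, d, q, k in zip(rs, ds, qs, ks):
        d_half = np.sqrt(d)                   # elementwise sqrt of a diagonal matrix
        m += r.T @ d_half @ q @ k.T @ d_half @ r
    return m
```

With $n$ grid points, subdomain size $n_i$, coarse dimension $n_0$, and rank $r$, the local factors hold $2 r \sum_i n_i$ entries and the coarse factors $2 r n_0$, compared with $2 r n$ for a global $QK^T$ of the same rank; how the ranks are chosen per level is not stated in the abstract.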

2606.18627 2026-06-18 cs.LG New submission

PACT: Preserving Anchored Cores in Task-vectors for Model Merging

Ningyuan Shi, Zhipeng Zhou, Hao Wang, Chunyan Miao, Peilin Zhao

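Affiliations: Shanghai Jiao Tong University; Nanyang Technological University; The Hong Kong University of Science and Technology (Guangzhou)

AI summary: Proposes PACT, which identifies Load-Bearing Wall (LBW) dimensions, task-critical knowledge that stays in the pre-trained weights, and preserves these anchored task-specific cores when processing task vectors, reducing the task conflicts and degradation seen under the task-vector paradigm and improving model merging.

Comments 33 pages, 14 figures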

Abstract

Model merging has emerged as a training-free alternative to multi-task learning, aiming to combine multiple task-specific fine-tuned models into a single multi-task model. Most existing model merging approaches follow the Task Arithmetic paradigm, which decomposes fine-tuned weights into pre-trained parameters and task vectors, and performs merging exclusively in the task-vector space. The effectiveness of this paradigm implicitly relies on the assumption that task-specific knowledge is encoded solely within task vectors. We argue that this assumption generally does not hold due to the intrinsic task preferences of pre-trained models. Specifically, we identify Load-Bearing Wall (LBW) dimensions, namely some task-critical knowledge that remains embedded in the pre-trained weights rather than being fully transferred into task vectors. We characterize LBW dimensions from both scalar-weight and subspace perspectives, thereby covering the major paradigms of existing model merging methods. Our analysis reveals that, by ignoring LBW dimensions, task-vector-based approaches fail to fully resolve task conflicts and may inadvertently damage task-specific knowledge encoded in the pre-trained model, leading to degradation. To address this issue, we propose PACT, which preserves the anchored task-specific cores (i.e., LBW dimensions) within task vectors by aligning their orthogonal complements with the subspace of the pre-trained weights. These aligned subspace components are then removed from the task vectors before applying existing model merging algorithms. Furthermore, we develop an efficient variant based on randomized SVD to improve scalability. PACT can be seamlessly integrated with existing methods. Extensive experiments across multiple benchmarks demonstrate that PACT consistently enhances mainstream model merging approaches and establishes new state-of-the-art performance.

2606.18676 2026-06-18 cs.LG cs.CV New submission

InTrain: Intrinsic Trainability for Zero-Cost Neural Architecture Search

Qinqin Zhou, Fuhai Chen, Jipeng Wu, Zhiwei Chen, Zhikai Hu, Weiwei Cai

Affiliations: School of Computer and Data Science, Fuzhou University; School of Computer and Data Science, Minjiang University; School of Artificial Intelligence, Nanchang University; Department of Computer Science, Hong Kong Baptist University; School of Interdisciplinary Medicine and Engineering, Harbin Medical University

AI summary: Proposes InTrain, a unified theoretical proxy that formalizes architectural trainability through two synergistic components, geometric capacity and optimization resilience, and reaches ranking correlations on NAS benchmarks on par with ensemble-based proxies.

Journal ref
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2026

Abstract

Training-free neural architecture search promises efficient discovery of high-performance networks without costly training. However, existing zero-cost proxies rely on fragmented heuristics that fail to capture the fundamental question: what makes an architecture trainable? This paper introduces Intrinsic Trainability (InTrain), a unified theoretical proxy that formalizes trainability as an architectural invariant emerging from two synergistic components: geometric capacity and optimization resilience. We operationalize intrinsic trainability through analysis of neural information processing. Geometric capacity is quantified via the participation ratio of activation covariance eigenspectrum, capturing the effective dimensionality of representation manifolds. Optimization resilience is measured through cumulative gradient health, assessing the robustness of backpropagation across network depth. InTrain synthesizes these dimensions through a scale-invariant multiplicative coupling, which we hypothesize is essential for capturing their synergistic, non-additive relationship. Extensive experiments on standard NAS benchmarks and search spaces demonstrate that InTrain achieves ranking correlations on par with state-of-the-art ensemble-based proxies and outperforms other single-metric methods.
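
The geometric-capacity term in this abstract relies on the participation ratio of the activation covariance eigenspectrum. The sketch below uses the standard definition $(\sum_i \lambda_i)^2 / \sum_i \lambda_i^2$; the function name and the choice of an unbiased sample covariance are assumptions, and the paper's exact normalization, its gradient-health term, and the multiplicative coupling are not specified in the abstract and are therefore not reproduced.

```python
import numpy as np

def participation_ratio(activations):
    """Effective dimensionality (sum of eigenvalues)^2 / (sum of squared eigenvalues).

    activations: array of shape (n_samples, n_features).
    """
    centered = activations - activations.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / (activations.shape[0] - 1)
    # Clip tiny negative eigenvalues that arise from rounding.
    eig = np.clip(np.linalg.eigvalsh(cov), 0.0, None)
    return eig.sum() ** 2 / (eig ** 2).sum()

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    wide = rng.normal(size=(512, 32))                               # variance spread over many directions
    narrow = rng.normal(size=(512, 2)) @ rng.normal(size=(2, 32))   # rank-2 activations
    print(participation_ratio(wide), participation_ratio(narrow))
```

The ratio ranges from 1 (all variance in one direction) to the feature dimension (isotropic variance) and is unchanged when the activations are rescaled, which is consistent with the scale-invariant coupling the abstract describes.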

2606.18694 2026-06-18 cs.LG cond-mat.dis-nn cs.CL cs.NE nlin.AO New submission

Attention as Frustrated Synchronization

Joshua Nunley

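Affiliations: Cognitive Science Program; Luddy School of Informatics, Computing, and Engineering; Indiana University Bloomington

AI summary: Proposes the Frustrated Synchronization Network (FSN), a synchronization-based attention mechanism built from a complex coupling kernel with a delay term; at the one-million-parameter scale it reaches lower validation loss than a tuned RoPE-SwiGLU transformer on character-level text and code.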

Comments 25 pages, 4 figures. Preliminary report at the 1-10M parameter scale

Abstract

A network of oscillators that synchronizes perfectly computes nothing further, so an attention architecture built from synchronization must locate its computation in structured departures from agreement. We introduce the Frustrated Synchronization Network (FSN), whose token states are phases on a torus and whose entire value pathway is one learned complex coupling kernel over harmonics and a one-step delay. Each component of the kernel is a frustration in the sense of the synchronization literature. The complex phases are static Kuramoto-Sakaguchi frustration angles, the signed harmonics are repulsive Daido components, and the delay term, which couples each token to the successors of the tokens it attends to, is algebraically identical to Kuramoto-Sakaguchi coupling whose frustration angle is the data's own transition, so next-token prediction is implemented as synchronization frustrated by the data. At matched one-million-parameter and training budgets on character-level text and code, the FSN's validation loss is below a tuned RoPE-SwiGLU transformer's at every epoch measured, and the comparison survives training the baseline to convergence: every thirty-epoch enwik8 seed finishes below the transformer's converged fifty-epoch loss of 1.611, and the FSN's completed fifty-epoch runs converge to 1.5953 +/- 0.0014. A variant with every feed-forward block replaced by mean-field coupling to learned collective modes, leaving no multilayer perceptron in the stack, tracks the transformer. On natural text the unfrustrated base layer falls behind the converged transformer at every copy depth, worst on long-range copy events; the kernel reverses the deficit at every depth of four and beyond. Headline comparisons are at the one-million-parameter scale; a scale ladder is complete through four million parameters with the advantage persisting, and remaining arms are marked as in progress.

2606.18844 2026-06-18 cs.LG New submission

Learning from Your Own Mistakes: Constructing Learnable Micro-Reflective Trajectories for Self-Distillation

Zhilin Huang, Hang Gao, Ziqiang Dong, Yuan Chen, Yifeng Luo, Chujun Qin, Jingyi Wang, Yang Yang, Guanjun Jiang

发表机构 * Qwen Business Unit of Alibaba(阿里巴巴通义千问事业部) Tsinghua University(清华大学) Peking University(北京大学)

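AI summary Proposes TAPO, which contrasts correct and incorrect rollouts to build micro-reflective corrections, moving self-distillation from implicit distributional alignment to explicit trajectory construction, and outperforms GRPO on several mathematical reasoning benchmarks.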

详情
AI中文摘要

自蒸馏通过使用模型自身的生成作为训练信号来改进大型语言模型的推理能力,通常通过隐式的logit级对齐来实现,最小化与特权目标分布的KL散度。然而,由于这种监督是通过无控制采样生成的,它无法提供关于模型特定错误的诊断性洞察,也无法针对其个体失败模式提供纠正性指导。因此,模型学习的是模仿特权分布,而不是接收精确指出其推理失败位置和原因的细粒度修正。在本文中,我们提出了轨迹增强策略优化(TAPO),将自蒸馏从隐式分布对齐推进到显式轨迹构建。在强化学习训练期间,模型对同一查询同时产生正确和错误的生成轨迹,TAPO利用这种对比结构来构建微反思修正——新的训练轨迹,保留模型在失败点之前的错误推理,然后插入自然语言诊断和由同一采样组中的正确参考引导的修正推理。由于每条轨迹都锚定在学习者自身的前缀和解决方案上,与基于KL的方法施加的位置级对齐相比,修正信号在更大程度上保留了模型的在策略分布。为了整合这些轨迹,TAPO在模型能力边界引入了难度感知的候选选择,并采用解耦优势估计以防止梯度污染。在AIME 2024、AIME 2025和HMMT 2025上的实验表明,在相同训练步数下,TAPO相比GRPO取得了一致的改进。进一步分析表明,TAPO增强了首次推理和错误纠正的有效性。

英文摘要

Self-distillation improves reasoning in large language models by using the model's own rollouts as training signal, typically through implicit logit-level alignment that minimizes KL divergence toward a privileged target distribution. However, because this supervision is generated via uncontrolled sampling, it provides no diagnostic insight into the model's specific errors or corrective guidance for its individual failure patterns. Consequently, the model learns to imitate a privileged distribution rather than receiving fine-grained corrections that pinpoint where and why its reasoning fails. In this paper, we propose Trajectory-Augmented Policy Optimization (TAPO), which advances self-distillation from implicit distributional alignment to explicit trajectory construction. During RL training, the model produces both correct and incorrect rollouts to the same query, and TAPO leverages this contrastive structure to construct micro-reflective corrections, new training trajectories that retain the model's erroneous reasoning up to the point of failure, then insert a natural-language diagnosis and corrected reasoning guided by a correct reference from the same sampling group. Since each trajectory is anchored in the learner's own prefix and solutions, the corrective signal preserves the model's on-policy distribution to a greater extent than the position-wise alignment imposed by KL-based methods. To integrate these trajectories, TAPO introduces difficulty-aware candidate selection at the model's capability boundary and decoupled advantage estimation to prevent gradient contamination. Experiments on AIME 2024, AIME 2025, and HMMT 2025 show that TAPO achieves consistent improvements over GRPO under the same number of training steps. Further analysis demonstrates that TAPO strengthens both first-pass reasoning and error-correction effectiveness.
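The trajectory construction described above can be pictured with a small data-structure sketch. Everything here is hypothetical: the names `MicroReflectiveTrajectory` and `build_micro_reflection`, the step-list representation, and the choice to keep the failed step itself in the prefix are assumptions for illustration, not the authors' implementation.

```python
from dataclasses import dataclass
from typing import List

@dataclass
class MicroReflectiveTrajectory:
    prefix: List[str]      # learner's own steps, kept through the failed step (assumption)
    diagnosis: str         # natural-language explanation of the error
    correction: List[str]  # corrected reasoning guided by a correct reference

    def as_text(self) -> str:
        # Concatenate the three parts into one training sequence.
        return "\n".join(self.prefix + [self.diagnosis] + self.correction)

def build_micro_reflection(wrong_steps, failure_index, diagnosis, corrected_steps):
    """Assemble a corrected trajectory from an erroneous rollout."""
    if not 0 <= failure_index < len(wrong_steps):
        raise ValueError("failure_index must point at a step of the wrong rollout")
    return MicroReflectiveTrajectory(
        prefix=list(wrong_steps[: failure_index + 1]),
        diagnosis=diagnosis,
        correction=list(corrected_steps),
    )

# Toy example with made-up reasoning steps.
wrong = ["Let x = 3.", "Then 2x = 5.", "So the answer is 5."]
traj = build_micro_reflection(
    wrong,
    failure_index=1,
    diagnosis="Wait: 2 * 3 is 6, not 5.",
    corrected_steps=["Then 2x = 6.", "So the answer is 6."],
)
```

Steps after the failure point are discarded, so the resulting sequence stays anchored in the learner's own prefix while ending in the corrected reasoning.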

2606.18923 2026-06-18 cs.LG 新提交

GrapNet: A Programmable Dynamic-Architecture Neural Graph Substrate

GrapNet: 一种可编程的动态架构神经图基板

Zirong Li

发表机构 * Zirong Li(李子荣)

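AI summary Proposes GrapNet, a neural substrate in which the graph is the executable architecture; a programmable interface supports structural editing, subgraph freezing, and local auditing, with accuracy gains of 12.08 and 3.81 points on Split Fashion-MNIST and Split CIFAR-10 respectively.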

Comments 8 pages, 1 figure, preprint

详情
AI中文摘要

可编程性是固定张量神经网络中缺失的一流接口:编辑关系、冻结子图、审计局部函数或更改执行后端应是对神经程序的操作,而非临时参数手术。GrapNet研究这种图即网络的设置。图是架构和可执行程序,而非输入数据图。每个计算节点拥有其下一层子节点引用和与这些引用对齐的可训练分配向量;删除关系会物理移除子节点引用和相应的分配坐标。结构规则和执行策略位于节点核心之外,因此同一子节点拥有的图可以被增长、冻结、结构编辑、分组为可训练族块、通过注意力在活动关系上路由,或在拓扑稳定后降级为密集快照。GrapNet通过向量值父接口与常规模块组合:密集层、CNN编码器、ResNet特征提取器、注意力块和Transformer表示都可以为每个坐标提供一个感知GrapNode。评估组织为可编程性压力测试套件,而非新的重放基准。在匹配的十种子Split Fashion-MNIST研究中,可塑GrapNet+ER头在相同已见类损失和重放记忆下达到63.16%的已见类准确率,而参数更大的密集MLP+ER为51.08%,配对差值为12.08点,p=1.3e-5。在Split CIFAR-10上使用冻结的ImageNet ResNet-18编码器时,相同基板将在线头比MLP-256提高3.81点,p=0.0026。这些结果支持GrapNet作为可编辑的神经图基板,其核心价值在于具有忠实执行视图的结构可编程性。

英文摘要

Programmability is a missing first-class interface in fixed-tensor neural networks: editing a relation, freezing a subgraph, auditing a local function, or changing the execution backend should be an operation on the neural program rather than ad-hoc parameter surgery. GrapNet studies this graph-as-network setting. The graph is the architecture and executable program, not an input data graph. Each compute node owns its next-layer child references and a trainable allocation vector aligned with those references; deleting a relation physically removes both the child reference and the corresponding allocation coordinate. Structural rules and execution policies live outside the node core, so the same child-owned graph can be grown, frozen, structurally edited, grouped into trainable family blocks, routed by attention over active relations, or lowered to dense snapshots after topology stabilizes. GrapNet composes with conventional modules through a vector-valued parent interface: dense layers, CNN encoders, ResNet feature extractors, attention blocks, and transformer representations can all feed one sensory GrapNode per coordinate. The evaluation is organized as a programmability stress suite rather than as a new replay benchmark. In a matched ten-seed Split Fashion-MNIST study, a plastic GrapNet+ER head reaches 63.16 percent seen-class accuracy versus 51.08 percent for a parameter-larger dense MLP+ER under the same seen-class loss and replay memory, with paired delta 12.08 points and p=1.3e-5. On Split CIFAR-10 with a frozen ImageNet ResNet-18 encoder, the same substrate improves the online head over MLP-256 by 3.81 points, with p=0.0026. These results support GrapNet as an editable neural graph substrate whose core value is structural programmability with faithful execution views.
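The ownership rule in the abstract (each node holds its child references and an allocation vector aligned with them, and deleting a relation removes both) can be sketched in a few lines. The class `Node` and its methods are hypothetical names for this example, and plain floats stand in for trainable parameters.

```python
class Node:
    """Toy compute node that owns its child references and allocation vector."""

    def __init__(self, name):
        self.name = name
        self.children = []    # next-layer child references
        self.allocation = []  # one coordinate per child, kept index-aligned

    def add_relation(self, child, weight=0.0):
        # Append the reference and its coordinate together to keep alignment.
        self.children.append(child)
        self.allocation.append(weight)

    def delete_relation(self, child):
        # Remove the child reference AND the matching allocation coordinate.
        for idx, existing in enumerate(self.children):
            if existing is child:
                del self.children[idx]
                del self.allocation[idx]
                return
        raise ValueError(f"{child.name} is not a child of {self.name}")

parent, a, b = Node("parent"), Node("a"), Node("b")
parent.add_relation(a, 0.25)
parent.add_relation(b, 0.75)
parent.delete_relation(a)
```

Because the two lists are only ever modified together, a structural edit cannot leave a dangling coordinate behind, which is the property that distinguishes this from masking a weight in a fixed tensor.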

2606.19120 2026-06-18 cs.LG cs.CV 新提交

Seeing Before Reasoning: Decoupling Perception and Reasoning for Shortcut-Resilient Multimodal On-Policy Self-Distillation

先看后思:解耦感知与推理以实现抗捷径的多模态在策略自蒸馏

Sihan Wang, Xiyao Liu, Lianqing Liu, Zhi Han

发表机构 * State Key Laboratory of Robotics and Intelligent Systems, Shenyang Institute of Automation, Chinese Academy of Sciences(中国科学院沈阳自动化研究所机器人学国家重点实验室) University of Chinese Academy of Sciences(中国科学院大学)

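AI summary Proposes ViGOS, a framework that decouples perception from reasoning to avoid text shortcuts in MLLM post-training and improve image-grounded behavior.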

Comments 29 pages, 5 figures, 8 tables

详情
AI中文摘要

在策略自蒸馏(OPSD)训练模型在其自身rollouts上,并使用冻结副本提供基于参考目标的密集token级目标。这对于LLM推理效果良好,但直接扩展到多模态大语言模型(MLLMs)可能产生捷径:特权目标可能主要基于文本参考目标而非图像来引导token。我们提出ViGOS,一种视觉引导的OPSD框架用于MLLM后训练。学生首先编写视觉描述,然后推理出最终答案。对于有效rollouts,仅图像的感知教师监督描述,而特权推理教师监督同一学生前缀上的推理和最终答案。仅对无效rollouts使用参考教师以恢复输出格式。在通用视觉-语言、专家推理、视觉数学、空间定位和视觉-语言先验基准测试中,ViGOS保持了OPSD的主要优势,并在易产生捷径的设置中改善了图像引导行为。

英文摘要

On-policy self-distillation (OPSD) trains a model on its own rollouts and uses a frozen copy to provide dense token-level targets conditioned on a reference target. This works well for LLM reasoning, but a direct extension to multimodal large language models (MLLMs) can create a shortcut: the privileged target may guide tokens mainly based on the text reference target rather than the image. We propose ViGOS, a visually grounded OPSD framework for MLLM post-training. The student first writes a visual description and then reasons toward the final answer. For valid rollouts, an image-only perception teacher supervises the description, while a privileged reasoning teacher supervises the reasoning and final answer on the same student prefix. A reference teacher is used only for invalid rollouts to recover the output format. Across general vision-language, expert reasoning, visual math, spatial grounding, and visual-language-prior benchmarks, ViGOS keeps the main benefits of OPSD and improves image-grounded behavior in shortcut-prone settings.

2606.19138 2026-06-18 cs.LG stat.ML 新提交

INDEQS: Informed Neural controlled Differential EQuationS

INDEQS: 信息引导的神经控制微分方程

Michael Detzel, Gabriel Nobis, Kristiyan Blagov, Juri Schubert, Jackie Ma, Wojciech Samek

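AI summary Proposes INDEQS, a graph-based NCDE forecasting method that injects prior knowledge of a directed graph at distinct architectural positions, combining inner and outer mixing with adaptive graph convolutions, and outperforms an uninformed NCDE on synthetic and real-world tasks.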

详情
AI中文摘要

神经控制微分方程(NCDE)为时间序列预测提供了强大的连续时间框架,但标准的基于图的扩展通常纯粹从数据中学习空间结构,即使在已知有向图结构的情况下也是如此。我们引入了信息引导的神经控制微分方程(INDEQS),这是一种基于图的NCDE预测方法,在特定的架构位置融入有向图的先验知识。INDEQS将隐藏状态在图节点上的内部混合与向量场和控制之间的外部混合分开,并提供了一种轻量级的图约束变体和一种更具表现力的变体,通过自适应图卷积从数据中学习额外的图连接。为了系统研究图信息在预测中的有益时机,我们在有向图上设计了一个连续平流模拟,生成了具有已知真实流结构的合成时空数据集。然后,我们在两个实际任务上评估INDEQS:水文网络上的河流流量预测和PeMS08上的交通流预测。在这些合成和真实基准测试中,外部信息引导在参数数量相当的情况下,持续改善了无信息NCDE的平均绝对误差,尤其是在较大图上,而内部信息引导在需要严格遵循已知邻接时提供了一种更参数高效的替代方案。离散卷积和连续时间解码器的比较进一步表明,连续解码器在实际任务中提供了更好的准确性和更大的时间灵活性。INDEQS和平流模拟的实现可在以下网址获取:https://github.com/Mitchi1/indeqs。

英文摘要

Neural Controlled Differential Equations (NCDE) provide a powerful continuous-time framework for forecasting time series, but standard graph-based extensions typically learn spatial structure purely from data, even in settings where a directed graph structure is known a priori. We introduce Informed Neural controlled Differential EQuationS (INDEQS), a graph-based NCDE forecasting method that incorporates prior knowledge of a directed graph at distinct architectural positions. INDEQS separates inner mixing of hidden states across graph nodes from outer mixing between vector field and control, and offers both a lightweight graph-constrained variant and a more expressive variant, learning additional graph connections from data via adaptive graph convolutions. To systematically study when graph informedness is beneficial in forecasting, we devise a continuous advection simulation on directed graphs, yielding synthetic spatio-temporal datasets with known ground-truth flow structure. We then evaluate INDEQS on two real-world tasks: river discharge forecasting on a hydrological network and traffic flow prediction on PeMS08. Across these synthetic and real-world benchmarks, outer informedness consistently improves mean absolute error over an uninformed NCDE with comparable parameter count, particularly on larger graphs, while inner informedness offers a more parameter-efficient alternative when strict adherence to a known adjacency is desired. A comparison of discrete convolutional and continuous-time decoders further shows that continuous decoders yield better accuracy and greater temporal flexibility on real-world tasks. An implementation of INDEQS and the advection simulation is available at https://github.com/Mitchi1/indeqs.

2. Representation Learning, Self-Supervision, and Contrastive Learning 5 papers

2606.18383 2026-06-18 cs.LG cs.CL 新提交

From Sparse Features to Trustworthy Proxies: Certifying SAE-Based Interpretability

从稀疏特征到可信代理:认证基于SAE的可解释性

Dibyanayan Bandyopadhyay, Asif Ekbal

发表机构 * Department of Computer Science and Engineering, Indian Institute of Technology Patna(印度理工学院巴特那分校计算机科学与工程系)

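AI summary Proposes a post-hoc generalization framework that certifies a language model via a sparse proxy (its SAE reconstruction), derives an upper bound on expected risk, shows the bound is non-vacuous on GPT-2 Small and other models, and finds that later layers are easier to certify and that the decomposition separates semantic alignment from mere statistical sparsity.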

详情
AI中文摘要

稀疏自编码器(SAE)越来越多地被用于从语言模型(LM)中提取可解释特征,但一个核心问题仍然存在:基于SAE的解释何时可以被视为底层冻结LM的忠实视图?我们通过一个后验泛化框架来研究这个问题,该框架通过稀疏代理来认证LM,稀疏代理是通过将原生隐藏激活替换为其预训练的SAE重建而获得的。我们的框架使用四个可测量量推导出基础模型期望风险的上界:代理风险、SAE重建差距、概念池不匹配和稀疏复杂度。我们将此证书解释为解释忠实性的操作标准。特别地,非平凡界表明提取的稀疏特征保留了有意义的预测信息,而小的重建和匹配误差表明代理在行为上接近原始模型。实验上,我们展示了在GPT-2 Small、Gemma-2B和Llama-3-8B上,该界在实际样本量下变得非平凡。对Llama-3-8B的详细逐层分析揭示了强烈的深度依赖性,较深层变得更容易认证,这与更强的局部保真度和更弱的下游误差放大相关。最后,通过特征洗牌消融,我们展示了分解区分了真正的语义对齐与单纯的统计稀疏性,为基于SAE的解释何时变得不太可靠提供了有用的诊断。

英文摘要

Sparse autoencoders (SAEs) are increasingly used to extract interpretable features from language models (LMs), yet a central question remains: when can an SAE-based explanation be treated as a faithful view of an underlying frozen LM? We study this through a post-hoc generalization framework that certifies the LM via a sparse proxy, obtained by replacing a native hidden activation with its pretrained SAE reconstruction. Our framework derives an upper bound on the base model's expected risk using four measurable quantities: proxy risk, SAE reconstruction gap, concept-pool mismatch, and sparse complexity. We interpret this certificate as an operational criterion for explanatory faithfulness. In particular, a non-vacuous bound indicates that the extracted sparse features retain meaningful predictive information, while small reconstruction and mismatch errors indicate that the proxy remains behaviorally close to the original model. Empirically, we show that the bound becomes non-vacuous on GPT-2 Small, Gemma-2B, and Llama-3-8B at practical sample sizes. A detailed layerwise analysis of Llama-3-8B reveals a strong depth dependence, with later layers becoming much easier to certify, associated with both stronger local fidelity and weaker downstream error amplification. Finally, through feature-shuffling ablations, we show that the decomposition distinguishes genuine semantic alignment from mere statistical sparsity, providing a useful diagnostic for when SAE-based explanations become less reliable.

2606.18390 2026-06-18 cs.LG q-bio.QM 新提交

MOLAR: Learning Multimodal Molecular Representations from Noisy Labels

MOLAR: 从噪声标签中学习多模态分子表示

Yingxu Wang, Kunyu Zhang, Nan Yin, Yu Li, Eran Segal

发表机构 * Mohamed bin Zayed University of Artificial Intelligence(穆罕默德·本·扎耶德人工智能大学) Zhengzhou University(郑州大学) The Education University of Hong Kong(香港教育大学) The Chinese University of Hong Kong(香港中文大学) Weizmann Institute of Science(魏茨曼科学研究所)

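AI summary Proposes MOLAR, a framework that separates clean-property inference from label observation and uses residual evidence from graph and text modalities to learn multimodal molecular representations from noisy labels, outperforming baselines on naturally noisy and label-flipping benchmarks.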

详情
AI中文摘要

动机:噪声标签是分子属性预测中的常见挑战,因为分子注释通常来自实验分析、人工整理的数据库或弱注释流程,而非直接观测到的干净生物状态。将记录标签视为可靠监督会导致模型记忆损坏的观测并学习误导性的分子证据。在多模态分子表示学习中,图-文本融合或对齐可能放大此问题,从而跨模态传播标签引起的错误。结果:我们提出MOLAR,一个从噪声标签中学习多模态分子表示的噪声感知框架。MOLAR将潜在干净属性推断与记录标签观测分离:图和文本视图为干净属性分布贡献残差证据,一个分类标签观测通道将此分布映射到记录标签用于训练。该公式从模型中推导出后验标签可靠性和模态特定的分子证据。在自然噪声分子基准和受控标签翻转基准上的实验表明,MOLAR始终优于代表性基线。可视化分析进一步表明MOLAR提供了可解释的可靠性和模态证据诊断。

英文摘要

Motivation: Noisy labels are a common challenge in molecular property prediction because molecular annotations are often obtained from assays, curated databases, or weak annotation pipelines rather than directly observed clean biological states. Treating recorded labels as reliable supervision can cause models to memorize corrupted observations and learn misleading molecular evidence. In multimodal molecular representation learning, this issue can be amplified by graph-text fusion or alignment, which may propagate label-induced errors across modalities. Results: We propose MOLAR, a noise-aware framework for learning multimodal molecular representations from noisy labels. MOLAR separates latent clean-property inference from recorded-label observation: graph and text views contribute residual evidence to a clean-property distribution, and a categorical label-observation channel maps this distribution to recorded labels for training. This formulation derives posterior label reliability and modality-specific molecular evidence from the model. Experiments on naturally noisy molecular benchmarks and controlled label-flipping benchmarks show that MOLAR consistently outperforms representative baselines. Visualization analyses further show that MOLAR provides interpretable reliability and modality-evidence diagnostics.

2606.18688 2026-06-18 cs.LG cs.AI 新提交

Dual-Channel Grounded World Modeling (DCGWM): Structural Prevention of Objective Interference Collapse via Heterogeneous External Grounding with Inward-Only Gradient Flow

双通道接地世界建模 (DCGWM):通过异构外部接地与内向梯度流结构性防止目标干扰崩溃

Akshay Hazare

发表机构 * Independent Researcher(独立研究者)

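AI summary Proposes Dual-Channel Grounded World Modeling (DCGWM), which uses a partitioned latent space and inward-only gradient flow to structurally prevent the objective interference collapse caused by multi-objective grounding in joint embedding predictive architectures.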

Comments Position paper. Experimental validation in progress

详情
AI中文摘要

联合嵌入预测架构(JEPAs)是世界模型表示学习的主要方法。我们识别出基于JEPA的世界模型在接地于两种性质不同的外部信号时存在一种失败模式:物理动力学(稀疏、高幅度、满足约束的梯度修正)和社会行为动力学(扩散、分布匹配的修正)。我们将其称为目标干扰崩溃(OIC):我们认为在共享潜空间中的联合学习会导致主导通道系统地崩溃从属通道的表示子空间,且仅通过损失加权无法解决。我们提出双通道接地世界建模(DCGWM),通过分区潜空间(物理子空间Z_p,行为子空间Z_b)和内向梯度流,从结构上防止OIC。物理接地通道通过VICReg风格的对齐到物理测量仅更新Z_p;社会行为接地通道通过对齐到涌现多智能体模拟的轨迹仅更新Z_b。通道间接口模块在任务级别耦合子空间,而不产生跨子空间梯度。非对称接地遵循损失对物理违反施加硬铰链惩罚、对行为发散施加软KL惩罚,以此惩罚rollout漂移。生成渲染层在架构上与潜世界模型隔离。我们给出三个理论结果:分区消除了与OIC相关的梯度干扰路径;每个接地子空间从其对齐目标继承抗崩溃保证;在生成目标几何形状的假设下,生成隔离是必要的。本文建立了问题表述和架构;实验验证正在进行中,将在未来修订中报告。

英文摘要

Joint Embedding Predictive Architectures (JEPAs) are a leading approach to world model representation learning. We identify a failure mode in JEPA-based world models grounded against two qualitatively distinct external signals: physical dynamics (sparse, high-magnitude, constraint-satisfying gradient corrections) and social-behavioral dynamics (diffuse, distribution-matching corrections). We term this Objective Interference Collapse (OIC): we argue that joint learning in a shared latent space causes the dominant channel to systematically collapse the subordinate channel's representational subspace, in a manner not resolvable by loss weighting alone. We propose Dual-Channel Grounded World Modeling (DCGWM), designed to structurally prevent OIC through a partitioned latent space (physical subspace Z_p, behavioral subspace Z_b) with inward-only gradient flow. A Physical Grounding Channel updates only Z_p via VICReg-style alignment to physical measurements; a Social-Behavioral Grounding Channel updates only Z_b via alignment to trajectories from an emergent multi-agent simulation. An Inter-Channel Interface Module couples the subspaces at the task level without cross-subspace gradients. An Asymmetric Grounding Adherence Loss penalizes rollout drift with a hard hinge for physical violations and a soft KL for behavioral divergence. A Generative Rendering Layer is architecturally isolated from the latent world model. We present three theoretical results: the partition removes the gradient-interference pathway implicated in OIC; each grounded subspace inherits anti-collapse guarantees from its alignment objective; and generative isolation is necessary under a stated assumption on the generative objective's geometry. This manuscript establishes the problem formulation and architecture; experimental validation is ongoing and will be reported in a future revision.

2606.18703 2026-06-18 cs.LG q-bio.QM 新提交

Contextualizing Biological Language Models across Modalities via Logit-Space Contrastive Alignment

跨模态生物学语言模型的逻辑空间对比对齐

Yanjun Shao, Yundi Chen, Yashvi Patel, Aurelien Pelissier, María Rodríguez Martínez

发表机构 * Biomedical Informatics and Data Science, Yale School of Medicine(耶鲁医学院生物医学信息学与数据科学)

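AI summary Proposes LOGICA, a framework that performs contrastive learning in output-logit space, preserves the pretrained likelihood interface through gated cross-modal adapters, enables context-conditioned prediction across models with distinct vocabularies, and outperforms prior methods on protein-ligand binding, TCR-peptide activity, and drug-resistance prediction.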

详情
AI中文摘要

预训练的生物学语言模型通过掩码标记预测暴露每个标记的概率分布,提供序列设计、变异评分和机制解释所依赖的似然接口。然而,这些分布是从广泛的无标注语料中学习得到的,并未自然地以任务特定的生物学上下文(如相互作用伙伴、细胞环境或治疗干预)为条件。现有的上下文匹配方法通常通过池化嵌入、对比潜在空间或任务特定的预测头来扭曲这一接口。我们提出了LOGICA(logit空间对比对齐),一种用于上下文条件预测的框架,直接在输出logit空间中进行对比学习。通过与每个模型的原生标记头兼容的门控跨模态适配器,LOGICA保留了预训练的似然接口,并将上下文化的标记对数似然转换为匹配分数。对齐是通过上下文敏感的标记概率来定义的,而不是共享嵌入空间中的邻近性,从而能够从具有不同词汇表的模型之间的稀疏配对数据中学习,无需共享分词器或解码器。LOGICA特别适用于突变局部变异排序,其中比较简化为扰动位点上突变标记的上下文条件似然。在蛋白质-配体结合、TCR-肽活性和药物条件耐药性预测中,LOGICA优于先前的最先进方法,包括匹配的潜在对比和条件MLM基线,同时保留了用于解释和生成的标记级接口。在保留基因的单突变药物耐药性预测中,LOGICA将AUC从接近随机的潜在空间基线约0.55提高到约0.65。

英文摘要

Pretrained biological language models expose per-token probability distributions through masked-token prediction, providing the likelihood interface central to sequence design, variant scoring, and mechanistic interpretation. Yet these distributions are learned from broad unlabeled corpora and are not naturally conditioned on task-specific biological contexts such as interaction partners, cellular environments, or therapeutic interventions. Existing contextual matching methods often distort this interface through pooled embeddings, contrastive latent spaces, or task-specific prediction heads. We introduce LOGICA (Logit-space Contrastive Alignment), a framework for context-conditioned prediction that performs contrastive learning directly in output-logit space. Using gated cross-modal adapters compatible with each model's native token head, LOGICA preserves the pretrained likelihood interface and converts contextualized token log-likelihoods into matching scores. Alignment is defined through context-sensitive token probabilities rather than proximity in a shared embedding space, enabling learning from sparse paired data across models with distinct vocabularies, without a shared tokenizer or decoder. LOGICA is particularly effective for mutation-local variant ranking, where comparisons reduce to context-conditioned likelihoods of mutant tokens at perturbed sites. Across protein--ligand binding, TCR--peptide activity, and drug-conditioned resistance prediction, LOGICA improves over prior state-of-the-art methods, including matched latent-contrastive and conditional MLM baselines, while retaining a token-level interface for interpretation and generation. On held-out-gene single-mutation drug-resistance prediction, LOGICA improves AUC from near-random latent-space baselines of $\sim$0.55 to $\sim$0.65.

2606.18961 2026-06-18 cs.LG 新提交

Be Your Own Teacher: Steering Protein Language Models via Unsupervised Reward Optimization

做自己的老师:通过无监督奖励优化引导蛋白质语言模型

Lanqing Li, Shentong Mo, Yang Yu, Pheng-Ann Heng

发表机构 * The Chinese University of Hong Kong(香港中文大学) MBZUAI Hong Kong University of Science and Technology(香港科技大学)

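AI summary Proposes an unsupervised reward optimization framework that combines model uncertainty and semantic consistency into proxy rewards and optimizes PLMs with the SRO and BRO algorithms, enabling controllable protein generation without labeled data, with performance approaching the oracle.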

Comments 24 pages, 2 figures, 13 tables

详情
AI中文摘要

蛋白质语言模型(PLMs)已成为可控生物分子设计的有力工具,但其后训练适应通常依赖于昂贵的湿实验验证或精心策划的偏好数据集。为了克服这一监督瓶颈,我们引入了PLMs的无监督奖励优化,这是一个无需真实标签即可实现可引导蛋白质生成的综合框架。我们的关键见解是,任务无关的奖励(将内在模型不确定性与由蛋白质表示模型指导的外在语义一致性相结合)在基础模型和温度设置中与可控性度量表现出强相关性。基于这一发现,我们提出了两种离线算法:软奖励优化(SRO)和二值化奖励优化(BRO),它们有效地最大化由这些代理奖励诱导的经典RLHF目标。在组合性分布外提示上的大量实验表明,两种方法均显著优于竞争基线(DPO、KTO),同时在多个采样温度、模型规模和蛋白质家族中接近理想性能。此外,使用无监督奖励微调的PLMs在pass@k评估中相比其基础模型能够实现持续更高的覆盖率。通过使PLMs能够利用自身生成的体验进行自我改进,我们的框架为在标签偏好或实验反馈稀缺或不可用的环境中实现可控生物分子设计提供了一条可扩展的途径。

英文摘要

Protein language models (PLMs) have emerged as powerful tools for controllable biomolecular design, yet their post-training adaptation typically relies on costly wet-lab validation or curated preference datasets. To overcome this supervision bottleneck, we introduce unsupervised reward optimization of PLMs, a comprehensive framework for steerable protein generation without ground-truth labels. Our key insight is that task-agnostic rewards, which combine intrinsic model uncertainty with extrinsic semantic consistency informed by protein representation models, exhibit strong correlation with controllability measures across base models and temperature regimes. Building upon this discovery, we propose two offline algorithms: Soft Reward Optimization (SRO) and Binarized Reward Optimization (BRO), which effectively maximize the classical RLHF objective induced by these proxy rewards. Extensive experiments on compositional out-of-distribution prompts demonstrate that both methods significantly outperform competitive baselines (DPO, KTO), while approaching oracle performance across multiple sampling temperatures, model scales and protein families. Moreover, PLMs fine-tuned with unsupervised rewards can achieve consistently higher coverage compared to their base model in pass@k evaluations. By enabling self-improvement of PLMs through their own generated experience, our framework provides a scalable pathway toward controllable biomolecular design in settings where labeled preferences or experimental feedback are scarce or unavailable.

3. Reinforcement Learning and Sequential Decision-Making 16 papers

2606.18284 2026-06-18 cs.LG cs.AI cs.CL 新提交

Breaking the Solver Bottleneck: Training Task Generators at the Learnable Frontier

打破求解器瓶颈:在可学习前沿训练任务生成器

Lorenz Wolf, Connor Watts, Roger Creus Castanyer, Geoffrey Bradway, Maxwill Lin, Augustine N. Mavor-Parker, Matthew Daborn-Sargent

发表机构 * Vmax Goodfire AI

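AI summary Proposes PROPEL, a framework that trains a lightweight activation probe as a solve-rate proxy so the task generator can be optimized without repeated solver rollouts, concentrating generated math, code, and software-engineering tasks at the learnable frontier.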

Comments 30 pages, 9 figures, 12 tables

详情
AI中文摘要

通过强化学习训练智能体的限制资源日益成为前沿任务供给:有效、可求解且刚好足够困难以训练当前模型的任务。随着推理和智能体模型的改进,固定任务分布趋于饱和,而天真的合成生成产生琐碎、不可能或不适定的任务。用强化学习训练任务生成器以优化有效性和可学习性可以解决这一瓶颈,但直接优化需要对每个候选任务进行重复求解器评估。对于软件工程任务,单次评估可能耗时数十分钟;求解器在环的生成器训练是不可行的。我们提出PROPEL,一个求解器摊销框架,用于在目标求解率下训练任务生成器。PROPEL在一次性标注的生成任务和求解器结果语料库上训练一个轻量级激活探针。该探针从冻结的生成器参考模型预测目标求解器的通过率,并在生成器优化期间作为求解率的代理,将生成器评估简化为单次前向传播。在多种模型规模下的数学、代码和软件工程任务中,PROPEL将生成任务转向目标求解率:对于编程,在可学习前沿生成的任务从$10.1\% \rightarrow 20.0\%$(针对Qwen2.5-3B-Instruct求解器)和从$5.3\% \rightarrow 12.6\%$(针对Qwen2.5-7B-Instruct求解器)。对于软件工程,PROPEL将目标求解率下的生成份额从$9.8\% \rightarrow 19.6\%$(针对Qwen3.5-27B在探针和生成器训练期间未见过的仓库)。

英文摘要

The limiting resource for training agents via reinforcement learning (RL) is increasingly frontier task supply: valid, solvable tasks just difficult enough to train the current model. As reasoning and agentic models improve, fixed task distributions saturate, while naive synthetic generation yields tasks that are trivial, impossible, or ill-posed. Training a task generator with RL to optimize validity and learnability can address this bottleneck, but direct optimization requires repeated solver rollouts per candidate. For software-engineering (SWE) tasks, a single rollout can take tens of minutes; solver-in-the-loop generator training is intractable. We introduce PROPEL, a solver-amortized framework for training task generators at the targeted solve rate. PROPEL trains a lightweight activation probe on a one-time labeled corpus of generated tasks and solver outcomes. The probe predicts target-solver pass rate from a frozen generator reference model and serves as a proxy for solve rate during generator optimization, reducing generator evaluation to a single forward pass. Across math, code, and software-engineering at multiple model scales, PROPEL shifts generation toward the targeted solve rate: for coding, tasks generated at the learnable frontier increase from $10.1\% \rightarrow 20.0\%$ for a Qwen2.5-3B-Instruct solver and from $5.3\% \rightarrow 12.6\%$ for a Qwen2.5-7B-Instruct solver. For SWE, PROPEL increases the share of generations at the targeted solve rate from $9.8\% \rightarrow 19.6\%$ for Qwen3.5-27B on repositories not seen during training of probe and generator.

2606.18308 2026-06-18 cs.LG cs.AI 新提交

TRIDENT: Breaking the Hybrid-Safety-Physics Coupling for Provably Safe Multi-Agent Reinforcement Learning

TRIDENT: 打破混合安全-物理耦合以实现可证明安全的多智能体强化学习

Zijie Meng, Ziwei Li, Yufei Liu, Zhiyu Li, Jiyuan Liu, Wenhua Nie, Bingcai Wei, Miao Zhang

发表机构 * Peking University(北京大学) Xiamen University(厦门大学) National Taiwan University(国立台湾大学) WHU(武汉大学) THU / Jimei University(清华大学 / 集美大学)

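AI summary Addresses the coupling among hybrid discrete-continuous actions, training-time safety constraints, and physics-governed dynamics with TRIDENT, which combines a Richardson-Romberg gradient correction, a Lyapunov-constrained sequential trust-region update, and a physics-informed residual critic to obtain provable convergence guarantees while cutting training-time violations and improving reward.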

Comments 16 pages, 4 figures

详情
AI中文摘要

网络化信息物理系统中的安全协调迫使学习算法同时处理混合离散-连续动作、严格的训练时安全约束和物理支配的动力学。我们证明这三个特征形成了一个有向偏差循环,击败了任何现成模块的朴素组合,并将其形式化为一个三向耦合引理。然后我们引入TRIDENT,这是第一个MARL框架,其三个组件被共同设计以消除每个泄漏:一个将Gumbel-Softmax偏差从O(tau)降低到O(tau^2)的Richardson-Romberg梯度校正,一个强制每次迭代可行性的Lyapunov约束顺序信任域更新,以及一个分解价值而非奖励的物理信息残差评论家。我们证明了以O~(1/sqrt(K))的收敛速率达到约束纳什均衡,以及O(sqrt(K))的累积违规界。在多无人机移动边缘计算、自主交叉口管理和混合SMAC变体上,TRIDENT相比MADDPG减少了95.5%的训练时违规,相比MACPO减少了76.3%,同时相比最强的无约束基线提高了13.5%的奖励。

英文摘要

Safe coordination in networked cyber-physical systems forces learning algorithms to simultaneously handle hybrid discrete-continuous actions, hard training-time safety constraints, and physics-governed dynamics. We show that these three features form a directed cycle of biases that defeats any naive composition of off-the-shelf modules, and formalize this as a three-way coupling lemma. We then introduce TRIDENT, the first MARL framework whose three components are co-designed to cancel each leak: a Richardson-Romberg gradient correction reducing Gumbel-Softmax bias from O(tau) to O(tau^2), a Lyapunov-constrained sequential trust-region update enforcing per-iterate feasibility, and a physics-informed residual critic that decomposes value rather than reward. We prove an O~(1/sqrt(K)) convergence rate to a constrained Nash equilibrium and an O(sqrt(K)) cumulative-violation bound. On multi-UAV mobile-edge computing, autonomous intersection management, and a hybrid SMAC variant, TRIDENT cuts training-time violations by 95.5% over MADDPG and 76.3% over MACPO, while improving reward by 13.5% over the strongest unconstrained baseline.
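The step from O(tau) to O(tau^2) bias is the signature of Richardson extrapolation, which a toy example can make concrete. The sketch below is generic and does not reproduce TRIDENT's Gumbel-Softmax correction: `biased_estimate` is a made-up scalar estimator whose bias expansion in the temperature `tau` is known exactly, chosen only so the cancellation is visible.

```python
def biased_estimate(tau):
    """Hypothetical estimator: true value 1.0 plus O(tau) and O(tau^2) bias terms."""
    true_value = 1.0
    return true_value + 3.0 * tau + 5.0 * tau ** 2

def richardson(estimator, tau):
    """Combine estimates at tau and tau/2 so the first-order bias cancels.

    If estimator(tau) = g + c1*tau + c2*tau^2, then
    2*estimator(tau/2) - estimator(tau) = g - (c2/2)*tau^2.
    """
    return 2.0 * estimator(tau / 2.0) - estimator(tau)

tau = 0.1
plain_error = biased_estimate(tau) - 1.0                  # dominated by 3*tau
corrected_error = richardson(biased_estimate, tau) - 1.0  # equals -2.5*tau^2

# Halving tau cuts the corrected error by about 4x, the mark of second-order bias.
ratio = corrected_error / (richardson(biased_estimate, tau / 2.0) - 1.0)
```

The correction costs one extra evaluation at the halved temperature, a trade that only pays off when the leading bias term really is linear in `tau`.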

2606.18327 2026-06-18 cs.LG cs.AI 新提交

Self-CTRL: Self-Consistency Training with Reinforcement Learning

Self-CTRL:基于强化学习的自一致性训练

Itamar Pres, Laura Ruis, Melat Ghebreselassie, Belinda Z. Li, Jacob Andreas

发表机构 * MIT CSAIL(麻省理工学院计算机科学与人工智能实验室)

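AI summary Proposes Self-CTRL, which uses reinforcement learning to optimize consistency between a language model's self-explanations and its behavior, improving consistency and safety on probabilistic reasoning and constitutional AI tasks.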

Comments 34 pages, 12 figures, includes appendices

详情
AI中文摘要

能够忠实描述自身行为的语言模型(LMs)更容易被用户审计、理解和信任。本文描述了基于强化学习的自一致性训练(Self-CTRL),该方法通过更新解释以更好地预测行为或更新行为以更好地匹配解释,优化LM的自我解释与相关输入行为之间的一致性。我们在两个领域应用该方法。首先,研究一个形式化概率推理任务,其中LM必须学习模仿一组有偏采样器,并评估其报告相关偏差的能力。我们发现,一致性训练将自我报告和行为测量的潜在偏差之间的相关性从$R^2=0.24$提高到$R^2=0.64$(在保留分布上),匹配直接真实标签监督的泛化能力。其次,研究一个宪法AI领域,其中LM必须描述何时拒绝或遵守用户请求。在此,Self-CTRL产生忠实描述模型在保留请求上行为的规则,将第三方审计模型的拒绝预测从$36\%$提高到$92\%$。另一方面,行为更新改善了对齐,将HarmBench失败率从$15.0\%$降低到$0.5\%$,而不会显著增加对无害提示的拒绝。通过对齐解释和行为,我们的工作为训练更安全、更透明、更可控的AI模型提供了通用方法。

英文摘要

Language models (LMs) that faithfully describe their own behavior can more easily be audited, understood, and trusted by users. This paper describes Self-Consistency Training with Reinforcement Learning (Self-CTRL), a method that optimizes for consistency between an LM's self-explanations and behavior on related inputs by updating explanations to better predict behavior or updating behavior to better match explanations. We apply our method in two domains. First, we study a formal probabilistic reasoning task in which LMs must learn to imitate a family of biased samplers and are evaluated on their ability to report the associated biases. We find that consistency training improves the correlation between self-reported and behaviorally-measured latent biases from $R^2=0.24$ to $R^2=0.64$ on a set of held-out distributions, matching the generalization of direct ground-truth supervision. Second, we study a constitutional AI domain in which LMs must describe when they will refuse or comply with user requests. Here, Self-CTRL produces rules that faithfully describe the model's behavior on held-out requests, improving the refusal predictions of a third-party auditor model from $36\%$ to $92\%$. In the other direction, behavior updates improve alignment, reducing HarmBench failure rate from $15.0\%$ to $0.5\%$ without substantially increasing refusal on harmless prompts. By aligning explanations and behavior, our work provides a general recipe for training AI models to be safer, more transparent, and more controllable.

2606.18469 2026-06-18 cs.LG cs.AI 新提交

Structured Representation Learning with Locally Linear Embeddings and Adaptive Feature Fusion

基于局部线性嵌入与自适应特征融合的结构化表示学习

Somjit Nath, Jackson J Cone, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

发表机构 * Mila – Quebec AI Institute(米拉-魁北克人工智能研究所)

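AI summary Inspired by neuroscience, proposes a reinforcement learning framework that uses locally linear embeddings to capture local state structure and an attention mechanism to adaptively fuse dynamics-specific and reward-specific features, improving learning efficiency.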

Comments Published in Transactions on Machine Learning Research (04/2026)

详情
AI中文摘要

神经科学研究揭示,大脑通过利用结构化的低维流形和自适应门控机制动态融合多源信息来编码复杂行为。受这些原理启发,我们提出了一种新颖的强化学习(RL)框架,鼓励分离动态特定和奖励特定特征,直接类比神经回路如何分离和整合信息以实现高效决策。我们的方法利用局部线性嵌入(LLE)来捕捉许多环境中固有的局部线性结构,反映神经群体活动中观察到的局部平滑性,同时通过标准RL目标推导奖励特定特征。一种类似于皮层门控的注意力机制,在逐状态基础上自适应地融合这些互补表示。在基准任务上的实验结果表明,我们的方法基于神经科学原理,相比传统RL方法提高了学习效率和整体性能,凸显了显式建模局部状态结构和自适应特征选择(如生物系统中观察到的)的优势。

英文摘要

Neuroscientific research has revealed that the brain encodes complex behaviors by leveraging structured, low-dimensional manifolds and dynamically fusing multiple sources of information through adaptive gating mechanisms. Inspired by these principles, we propose a novel reinforcement learning (RL) framework that encourages the disentanglement of dynamics-specific and reward-specific features, drawing direct parallels to how neural circuits separate and integrate information for efficient decision-making. Our approach leverages locally linear embeddings (LLEs) to capture the intrinsic, locally linear structure inherent in many environments, mirroring the local smoothness observed in neural population activity, while concurrently deriving reward-specific features through the standard RL objective. An attention mechanism, analogous to cortical gating, adaptively fuses these complementary representations on a per-state basis. Experimental results on benchmark tasks demonstrate that our method, grounded in neuroscientific principles, improves learning efficiency and overall performance compared to conventional RL approaches, highlighting the benefits of explicitly modeling local state structures and adaptive feature selection as observed in biological systems.

2606.18503 2026-06-18 cs.LG stat.ML 新提交

Quantum Annealing Enhanced Reinforcement Learning for Accurate Remaining Useful Lifetime Prediction

量子退火增强强化学习用于精确剩余使用寿命预测

Manoranjan Gandhudi, Arunkumar V., G. R. Anil, Gangadharan G. R

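Affiliations * Central University of Karnataka; University College of Engineering, Anna University; AIONOS India Pvt Ltd; National Institute of Technology Tiruchirappalli

AI summary Proposes a quantum-annealing-enhanced Q-learning framework that encodes each Q-value update as a QUBO problem and uses quantum annealing samples for stochastic action selection, addressing premature convergence in high-dimensional non-convex spaces; it significantly outperforms the baselines on C-MAPSS and an industrial dataset.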

Comments 29 pages, 6 figures, 12 tables

详情
AI中文摘要

剩余使用寿命(RUL)估计是预测性维护的核心,意外故障的成本可能远超资产本身。统计退化模型忽略了真实系统的强非线性,而数据驱动模型在高维非凸搜索空间中常收敛到次优解。我们提出量子退火增强Q学习(QAQL)框架,将量子退火的采样行为与Q学习的序列决策相结合。每个Q值更新被编码为一个小的二次无约束二元优化(QUBO)问题,其基态对应贪婪动作;退火器不是作为确定性优化器,而是在多次读取中返回一个近最优动作的分布,这种随机动作选择提供了探索,从而抑制了在非线性退化轨迹上的过早收敛。QUBO在D-Wave Advantage系统上通过小规模嵌入求解,退火器被嵌入强化学习循环中,而非训练后附加。我们在两个公开基准上验证了QAQL:NASA C-MAPSS涡扇发动机数据集和一个设备群预测性维护数据集。在多次独立运行和六个误差指标上平均,QAQL优于本研究考虑的经典和量子基线,具有统计显著性改进。结果表明,量子退火是工业预测性维护应用中强化学习循环内一个可用的(而非仅理论上的)优化器。

英文摘要

Remaining useful life (RUL) estimation is central to predictive maintenance, where an unplanned failure can cost far more than the asset itself. Statistical degradation models miss the strong nonlinearity of real systems, and data-driven models often converge to suboptimal solutions in high-dimensional, non-convex search spaces. We propose a Quantum Annealing enhanced Q-Learning (QAQL) framework that couples the sampling behaviour of quantum annealing with the sequential decision making of Q-learning. Each Q-value update is encoded as a small quadratic unconstrained binary optimization (QUBO) whose ground state is the greedy action; rather than acting as a deterministic optimizer, the annealer returns a distribution over near-optimal actions across many reads, and this stochastic action selection supplies the exploration that curbs premature convergence on nonlinear degradation trajectories. The QUBO is solved on the D-Wave Advantage system using minor embedding, with the annealer woven into the reinforcement-learning loop rather than bolted on after training. We validate QAQL on two public benchmarks: the NASA C-MAPSS turbofan engine datasets and a device-fleet predictive maintenance dataset. Averaged over many independent runs and across six error metrics, QAQL outperforms the classical and quantum baselines considered in this study, with statistically significant improvements. The results indicate that quantum annealing is a usable, not merely theoretical, optimizer inside a reinforcement-learning loop for industrial predictive-maintenance applications.
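The QUBO encoding described above can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the one-hot action encoding, the `penalty` weight, and above all the sampler, which replaces the D-Wave annealer with exact Boltzmann sampling over all bit strings (feasible only for a handful of actions).

```python
import itertools
import numpy as np

def build_action_qubo(q_values, penalty):
    """Upper-triangular QUBO whose ground state is the one-hot greedy action.

    Energy: -sum_a q[a]*x[a] + penalty*(sum_a x[a] - 1)^2, constant dropped.
    Assumes penalty > max |q[a]| so that exactly one bit is set at the minimum.
    """
    n = len(q_values)
    Q = np.zeros((n, n))
    for a in range(n):
        Q[a, a] = -q_values[a] - penalty      # linear terms (x_a^2 == x_a)
        for b in range(a + 1, n):
            Q[a, b] = 2.0 * penalty           # discourages selecting two actions
    return Q

def sample_reads(Q, num_reads, temperature, rng):
    """Classical stand-in for annealer reads (hypothetical helper).

    Enumerates every bit string, then draws reads from a Boltzmann
    distribution over QUBO energies. Returns (samples, ground_state).
    """
    n = Q.shape[0]
    states = np.array(list(itertools.product([0, 1], repeat=n)))
    energies = np.einsum("si,ij,sj->s", states, Q, states)
    weights = np.exp(-(energies - energies.min()) / temperature)
    probs = weights / weights.sum()
    idx = rng.choice(len(states), size=num_reads, p=probs)
    return states[idx], states[np.argmin(energies)]
```

With Q-values `[0.1, 0.9, 0.4]` and `penalty=2.0`, the ground state is the one-hot vector for action 1, while a nonzero `temperature` makes some reads land on the other actions, which is the exploration effect the abstract attributes to annealer sampling.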

2606.18537 2026-06-18 cs.LG 新提交

Do as the Romans Do: Learning Universal Behaviors from Heterogeneous Agents

入乡随俗:从异构智能体学习通用行为

Caleb Chang, Davin Win Kyi, Natasha Jaques, Karen Leung

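Affiliations * University of Washington; NVIDIA

AI summary Proposes GRID, which extracts a general reward from heterogeneous demonstrators pursuing different goals and trains a generalist agent on it to learn universal environmental competencies, avoiding mode-averaging bias and making fine-tuning to downstream tasks more efficient.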

详情
AI中文摘要

人类通常通过观察他人来获取新技能,因为观察到的行为隐含地揭示了如何在环境中行动。然而,从异构群体中获得的观察会引入冲突的行为信号,使得难以确定哪些行为值得模仿。我们通过通用奖励推断与解耦(GRID)来解决这一挑战,这是一种从追求不同目标的异构示范者群体中提取普遍有用行为的社会学习方法。GRID将每个智能体的奖励函数分解为通用奖励(捕捉所有智能体共享的行为)和特定奖励(捕捉个体偏好和目标)。仅基于通用奖励进行训练提供了一种通用预训练的新范式。它产生了一个通用智能体,该智能体内化了通用的环境能力,如安全性和基本任务熟练度,而不会出现困扰标准从示范学习技术的模式平均偏差。这个通用智能体作为微调到下游任务(包括训练中未见过的偏好)的优越先验。在合成基函数分解、多智能体Craftax和连续自动驾驶模拟器(Highway-Env)上的实验证实,GRID以语义上有意义的方式成功解耦了奖励结构,优于标准的从示范学习基线,并实现了更高效和稳定的特化。

英文摘要

Humans often acquire new skills by observing others, since observed behaviors implicitly reveal how to act in an environment. However, observations drawn from a heterogeneous population introduce conflicting behavioral signals, making it difficult to determine which behaviors are worth imitating. We address this challenge with General Reward Inference and Disentanglement (GRID), a social learning method that extracts universally useful behaviors from a heterogeneous population of demonstrators pursuing different goals. GRID decomposes per-agent reward functions into a general reward, capturing behaviors shared across all agents, and specific rewards, capturing individual preferences and objectives. Training exclusively on the general reward provides a new paradigm of generalist pretraining. It yields a generalist agent that internalizes universal environmental competencies, such as safety and basic task proficiency, without the mode-averaging bias that afflicts standard learning from demonstration techniques. This generalist serves as a superior prior for fine-tuning to downstream tasks, including preferences unseen during training. Experiments across a synthetic basis function decomposition, multi-agent Craftax, and a continuous autonomous driving simulator (Highway-Env) confirm that GRID successfully disentangles reward structure in a semantically meaningful way, outperforms standard learning from demonstration baselines, and enables more efficient and stable specialization.

2606.18785 2026-06-18 cs.LG cs.AI 新提交

Bayesian Anytime Pareto Set Identification for Multi-Objective Multi-Armed Bandits

贝叶斯任意时间帕累托集识别用于多目标多臂老虎机

Lennert Saerens, Bram Silue, Eleni Litsa, Peter Vrancx, Pieter Libin

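Affiliations * imec Data Science Institute, Interuniversity Institute of Biostatistics and Statistical Bioinformatics, UHasselt

AI summary Proposes Top-Two Pareto Front Thompson Sampling (TTPFTS), the first anytime multi-objective multi-armed bandit algorithm for Pareto set identification, validates it on synthetic environments and an ultra-large molecular library, and introduces an uncertainty quantification metric.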

Comments 26 pages, 13 figures

详情
AI中文摘要

识别帕累托最优解对于支持多目标决策至关重要。我们首次提出了一种用于帕累托集识别问题的任意时间多目标多臂老虎机算法,采用贝叶斯方法:Top-Two帕累托前沿汤普森采样(TTPFTS)。我们在合成环境中将TTPFTS与最先进的固定预算帕累托集识别算法进行基准测试。接下来,我们通过高效探索超大型按需合成分子库,在具有挑战性的多目标分子发现场景中展示了其实用性。此外,我们引入了一种新颖的不确定性量化指标,用于估计算法在预测帕累托集上的置信度。我们证明该指标有效代理真实性能,为监控复杂环境中的学习进度提供了一种稳健的方法。最后,我们用算法渐近正确性的理论证明补充了这些实证发现。

英文摘要

Identifying Pareto optimal solutions is critical to support multi-objective decision-making. We introduce the first anytime Multi-Objective Multi-Armed Bandit algorithm for the Pareto Set Identification problem, taking a Bayesian approach: Top-Two Pareto Front Thompson Sampling (TTPFTS). We benchmark TTPFTS against state-of-the-art fixed-budget Pareto Set Identification algorithms on synthetic environments. Next, we demonstrate its practical utility in a challenging multi-objective molecular discovery setting by efficiently exploring an ultra-large synthesis-on-demand molecular library. Furthermore, we introduce a novel uncertainty quantification metric that estimates our algorithm's confidence in the predicted Pareto set. We demonstrate that this metric effectively proxies true performance, yielding a robust methodology for monitoring learning progress in complex settings. Finally, we complement these empirical findings with a theoretical proof of the algorithm's asymptotic correctness.
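As a rough illustration of the objects involved, the sketch below computes a Pareto set from a table of mean rewards and from a single posterior sample. It is not TTPFTS itself: the independent Gaussian posterior, the maximization convention, and the function names are assumptions chosen for the example, and the top-two selection rule is not modeled.

```python
import numpy as np

def pareto_set(means):
    """Indices of arms not dominated by any other arm (all objectives maximized)."""
    means = np.asarray(means, dtype=float)
    keep = []
    for i, m in enumerate(means):
        # Arm j dominates arm i if it is >= on every objective and > on at least one.
        dominated = np.any(np.all(means >= m, axis=1) & np.any(means > m, axis=1))
        if not dominated:
            keep.append(i)
    return keep

def sampled_pareto_set(post_mean, post_std, rng):
    """Pareto set of one Thompson-style draw from an assumed Gaussian posterior."""
    theta = rng.normal(post_mean, post_std)   # shape: (arms, objectives)
    return pareto_set(theta)
```

Repeating `sampled_pareto_set` many times and counting how often each arm appears gives one simple way to gauge confidence in a predicted Pareto set; whether this matches the paper's uncertainty metric is not stated in the abstract.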

2606.18810 2026-06-18 cs.LG cs.AI 新提交

Learning from Own Solutions: Self-Conditioned Credit Assignment for Reinforcement Learning with Verifiable Rewards

从自身解中学习:面向可验证奖励强化学习的自条件化信用分配

Yingyu Shan, Yuhang Guo, Zihao Cheng, Zeming Liu, Xiangrong Zhu, Xinyi Wang, Jiashu Yao, Wei Lin, Hongru Wang, Heyan Huang

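Affiliations * Beijing Institute of Technology; Beihang University; Independent Researcher

AI summary Proposes SC-GRPO, which uses the KL divergence between the original and self-conditioned distributions as a multiplicative weight on GRPO gradients for fine-grained credit assignment, improving over GRPO by 8.1% on average across math, code, and agentic tasks.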

详情
AI中文摘要

具有可验证奖励的强化学习(RLVR)在训练LLMs进行推理任务方面取得了显著进展,但代表性方法如GRPO对所有token分配统一信用,浪费了常规token上的梯度,同时低估了关键推理步骤。现有的token级信用分配方法需要超出模型自身rollout的资源。GRPO变体依赖于过程奖励模型或真实答案。知识蒸馏通过每个token的散度分配信用,但需要外部教师(在线策略蒸馏)或特权信息(在线策略自蒸馏)。然而,这些依赖性限制了在纯RLVR设置中的适用性。我们观察到,将模型以其自身验证过的轨迹为条件,会在原始分布和条件分布之间诱导出可测量的每token KL散度,并证明当存在多个验证过的轨迹时,从由验证过的轨迹构建的自教师进行蒸馏会导致不可行的加权平均解。我们提出SC-GRPO(自条件化GRPO),它使用前述KL散度作为GRPO梯度的乘性权重。在涵盖数学、代码和智能体任务的五个基准上,SC-GRPO一致优于GRPO 8.1%,优于DAPO 5.9%,并具有更强的分布外性能。此外,SC-GRPO实现了比OPD更高的性能。

英文摘要

Reinforcement learning with verifiable rewards (RLVR) has driven substantial progress in training LLMs for reasoning tasks, but representative methods such as GRPO assign uniform credit across all tokens, wasting gradient on routine tokens while under-crediting pivotal reasoning steps. Existing token-level credit assignment methods require resources beyond the model's own rollouts. GRPO variants rely on process reward models or ground-truth answers. Knowledge distillation assigns credit through per-token divergence but requires external teachers (On-Policy Distillation, OPD) or privileged information (On-Policy Self Distillation). These dependencies limit applicability in the pure RLVR setting. We observe that conditioning the model on its own verified trajectories induces a measurable per-token KL divergence between the original and conditioned distributions, and prove that distilling from a self-teacher constructed from verified trajectories leads to infeasible weighted-average solutions when multiple verified trajectories exist. We propose SC-GRPO (Self-Conditioned GRPO), which uses this per-token KL divergence as a multiplicative weight on GRPO gradients. Across five benchmarks spanning math, code, and agentic tasks, SC-GRPO consistently outperforms GRPO by 8.1% and DAPO by 5.9%, with stronger out-of-distribution (OOD) performance. Moreover, SC-GRPO achieves higher performance than OPD.
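A minimal sketch of the weighting idea follows, under stated assumptions: the KL direction (conditioned relative to original), the mean normalization of the weights, and all function names are illustrative choices, since the abstract does not specify them.

```python
import numpy as np

def log_softmax(logits):
    """Numerically stable log-softmax over the last axis."""
    z = logits - logits.max(axis=-1, keepdims=True)
    return z - np.log(np.exp(z).sum(axis=-1, keepdims=True))

def per_token_kl(base_logits, cond_logits):
    """KL(conditioned || base) at each token position; inputs have shape (T, V)."""
    lp_c = log_softmax(cond_logits)
    lp_b = log_softmax(base_logits)
    return (np.exp(lp_c) * (lp_c - lp_b)).sum(axis=-1)

def weighted_token_advantages(advantage, base_logits, cond_logits):
    """Scale one trajectory-level advantage by normalized per-token KL weights."""
    kl = per_token_kl(base_logits, cond_logits)
    weights = kl / (kl.mean() + 1e-8)   # normalization is an assumption
    return advantage * weights
```

Tokens where conditioning on a verified trajectory changes the next-token distribution the most receive the largest share of the advantage, while tokens the model would have produced anyway receive close to none.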

2606.18812 2026-06-18 cs.LG cs.AI 新提交

Reinforcement Learning Foundation Models Should Already Be A Thing

强化学习基础模型本应已经存在

Abdelrahman Zighem, Jill-Jênn Vie

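Affiliations * École normale supérieure de Paris, PSL University, Paris, France; Soda team, Inria Saclay, Palaiseau, France

AI summary Proposes building reinforcement learning foundation models from synthetic MDPs; a fixed-size sufficient statistic makes attention-based architectures applicable, and the resulting model needs far fewer episodes than UCB-VI and tabular Q-learning online and is competitive with VI-LCB offline.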

详情
AI中文摘要

语言和视觉的基础模型由互联网规模的数据驱动,而结构化领域(表格预测、时间序列预测、图学习、强化学习)则不然。替代方案是合成数据,它将负担从收集转移到先验设计。这种先验已经存在于许多结构化任务中:TabPFN及其后续工作通过一个在合成贝叶斯先验上预训练的Transformer解决表格分类问题。我们提出两点。\textbf{首先},强化学习是明显的空白:采样一个合成MDP与采样一个合成表格数据集一样可行,然而没有上下文强化学习工作将先验设计作为主要目标。\textbf{其次},MDP允许一个固定大小的充分统计量,独立于观察到的回合且形状为表格形式,这使得它们直接适用于用于表格基础模型的基于注意力的架构,只需将策略头替换监督目标。这些共同定义了强化学习基础模型的议程。作为概念验证,我们完全在合成MDP上训练一个模型,并表明,无需任务特定的调优,它就能在上下文中解决留出的表格基准,包括在线和离线:在线时,使用比UCB-VI和表格Q-learning少得多的回合;离线时,与VI-LCB竞争。

英文摘要

Foundation models for language and vision are powered by internet-scale data, while structured domains (tabular prediction, time-series forecasting, graph learning, reinforcement learning) are not. The substitute is synthetic data, which shifts the burden from collection to prior design. Such priors already exist for many structured tasks: TabPFN and its successors solve tabular classification with a transformer pretrained on a synthetic Bayesian prior. We make two points. \textbf{First}, reinforcement learning is the conspicuous gap: sampling a synthetic MDP is as feasible as sampling a synthetic tabular dataset, yet no in-context RL work treats prior design as a primary objective. \textbf{Second}, MDPs admit a fixed-size sufficient statistic, independent of the episodes observed and tabular in shape, which makes them directly amenable to the attention-based architectures used for tabular foundation models, with a policy head replacing the supervised target. Together these define the agenda for an RL foundation model. As a proof of concept, we train one model entirely on synthetic MDPs and show that, with no task-specific tuning, it solves held-out tabular benchmarks in context, both online and offline: online, in far fewer episodes than UCB-VI and tabular Q-learning, and offline, competitively with VI-LCB.

2606.18820 2026-06-18 cs.LG cs.AI 新提交

Maturing Markov Decision Processes: Decision Making under Increasing Information and Shrinking Action Sets

成熟马尔可夫决策过程:信息增加与动作集缩小下的决策制定

Jiaxi Liu, Aiping Yang, Yuhang Yang, Shuqi Zhang, Zewei Dong, Jiangming Yang, Xuebin Chen

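Affiliations * Ant International; School of Economics, Sichuan University; School of Economics, Fudan University

AI summary Targets the asymmetry between growing information and shrinking action sets in decision processes: proposes the Maturing Markov Decision Process (MMDP) framework and a structure-aware reinforcement learning method built on an expiring-action priority principle; experiments show improved learning efficiency.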

Comments 25 pages, 9 figures

详情
AI中文摘要

序列决策问题通常表现出信息和决策灵活性的不对称演化:随着决策周期的展开,智能体获得更丰富的信息,而由于操作截止、承诺或资源约束,可行动作逐渐过期。标准的MDP公式通常将这种结构扁平化为阶段相关的状态描述和动作掩码,从而掩盖了嵌套的信息-动作不对称性,而这种不对称性决定了哪些决策是紧急的、哪些可以推迟。我们引入了成熟马尔可夫决策过程(MMDP),这是一种围绕这种信息-动作不对称性构建的公式。我们通过一个过期动作优先级原则来刻画其关键后果之一,该原则识别出必须在下一阶段之前解决的动作。受此结构启发,我们开发了一个结构感知的强化学习框架,包括阶段感知的策略设计、过期动作抽象以及带有蒸馏的搜索增强学习。在受控的多供应商补货问题、复杂度递增的简化现金管理环境以及生产级模拟器上的实验表明,显式建模这种不对称性可以提高学习效率,并且随着决策问题的规模扩大,其价值日益增加。

英文摘要

Sequential decision problems often exhibit an asymmetric evolution of information and decision flexibility: as a decision cycle unfolds, the agent receives richer information while feasible actions expire due to operational cutoffs, commitments, or resource constraints. Standard MDP formulations typically flatten this structure into stage-dependent state descriptions and action masks, thereby obscuring the nested information--action asymmetry that determines which decisions are urgent and which can be deferred. We introduce Maturing Markov Decision Processes (MMDPs), a formulation built around this information--action asymmetry. We characterize one of its key consequences through an expiring-action priority principle, which identifies the actions that must be resolved before the next stage. Motivated by this structure, we develop a structure-aware reinforcement learning framework with stage-aware policy design, expiring-action abstraction, and search-augmented learning with distillation. Experiments on a controlled multi-supplier replenishment problem, simplified cash-management environments of increasing complexity, and a production-scale simulator show that explicitly modeling this asymmetry improves learning efficiency and becomes increasingly valuable as decision problems scale.
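The shrinking action set can be pictured with a small sketch. The `last_stage` deadline field and both helper functions are hypothetical; the abstract does not give the formal MMDP definition, so this only illustrates the distinction between actions that are still feasible and actions that expire at the current stage.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Action:
    name: str
    last_stage: int   # hypothetical: final stage at which the action may be taken

def feasible(actions, stage):
    """Actions that can still be taken at this stage."""
    return [a for a in actions if a.last_stage >= stage]

def expiring(actions, stage):
    """Actions that must be decided now because they vanish at the next stage."""
    return [a for a in actions if a.last_stage == stage]
```

Under this reading, an expiring-action priority rule would resolve `expiring(actions, stage)` first and defer the remaining feasible actions until more information has arrived.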

2606.18910 2026-06-18 cs.LG cs.CL 新提交

REVES: REvision and VErification--Augmented Training for Test-Time Scaling

REVES:通过修订与验证增强的测试时扩展训练

Yuanxin Liu, Ruida Zhou, Xinyan Zhao, Amr Sharaf, Hongzhou Lin, Arijit Biswas, Mohammad Ghavamzadeh, Zhaoran Wang, Mingyi Hong

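Affiliations * Northwestern University; Amazon AGI; Qualcomm AI Research; University of Minnesota

AI summary Proposes the REVES framework, which converts intermediate "near-miss" answers into decoupled revision and verification prompts for efficient off-policy data generation, improving multi-step reasoning in large language models and scoring 6.5 points above the RL baseline on LiveCodeBench.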

详情
AI中文摘要

通过顺序修订进行测试时扩展已成为增强大语言模型(LLM)推理能力的强大范式。然而,标准的后训练方法主要优化单次目标,与多步推理动态存在根本性不匹配。虽然最近的工作将其视为多轮强化学习(RL),但传统方法直接优化多步轨迹,未能进一步利用模型可以从纠正中学习的中间步骤中的高质量错误。我们提出了一个两阶段迭代框架,交替进行在线数据/提示增强和策略优化。通过将成功恢复轨迹中的中间步骤(“接近正确”答案)转化为解耦的修订和验证提示,我们的方法将训练集中在有效的答案转换和错误识别上。与标准的多轮RL相比,这种方法实现了高效的离策略数据生成,并减少了长程采样的计算开销。在LiveCodeBench上,使用公开可用的测试用例作为反馈,我们观察到比RL基线高6.5分,比标准多轮训练高4.0分。除了编码,我们的方法在圆填充问题上达到了先前报告的SOTA结果,同时使用了最小的基础模型(4B)和远少于更大进化搜索系统的采样次数。在真实验证下的数学结果进一步证实了改进的纠正能力。该方法还泛化到分布外的约束满足谜题,如n皇后和迷你数独,其中正确性完全由问题约束定义。代码可在 https://github.com/yxliu02/REVES.git 获取。

英文摘要

Test-time scaling via sequential revision has emerged as a powerful paradigm for enhancing Large Language Model (LLM) reasoning. However, standard post-training methods primarily optimize single-shot objectives, creating a fundamental misalignment with multi-step inference dynamics. While recent work treats this as multi-turn reinforcement learning (RL), conventional approaches optimize over the multi-step trajectories directly, failing to further exploit the high-quality mistakes in intermediate steps that the model could learn from by correcting them. We propose a two-stage iterative framework that alternates between online data/prompt augmentation and policy optimization. By converting the intermediate steps ("near-miss" answers) in the successful recovery trajectories into decoupled revision and verification prompts, our approach concentrates training on both effective answer transformation and error identification. This approach enables efficient off-policy data generation and reduces the computational overhead of long-horizon sampling compared to standard multi-turn RL. On LiveCodeBench, using publicly available test cases as feedback, we observe gains of +6.5 points over the RL baseline and +4.0 points over standard multi-turn training. Beyond coding, our approach matches the previously reported SOTA result on circle packing while using the smallest base model (4B) and far fewer rollouts than the much larger evolutionary search systems. Math results under ground-truth verification further confirm improved correction ability. It also generalizes to out-of-distribution constraint-satisfaction puzzles such as n_queens and mini_sudoku, where correctness is defined entirely by problem constraints. Code is available at https://github.com/yxliu02/REVES.git.

2606.18963 2026-06-18 cs.LG 新提交

Online Reward-Punishment Learning from Fixed-Channel Perceptual Event Streams without Environment Rewards

无环境奖励的固定通道感知事件流在线奖惩学习

Zirong Li

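Affiliations * Zirong Li

AI summary Proposes the OHIRL framework for online reward-punishment learning from fixed-channel perceptual streams without scalar rewards, using an internal trajectory evaluator to infer the valence of perceptual dimensions; it reaches 0.952 balanced reward-sign accuracy on a 2x2-XOR task and is further tested on hidden-reward CartPole and Taxi controls.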

Comments 9 pages, 5 figures, 6 tables; 13-page technical supplement

详情
AI中文摘要

我们研究当环境不提供标量奖励或评估标签时的在线奖惩学习。在每一步,智能体仅接收一个固定通道的感知数据包,诸如疼痛、能量、接触、损伤或认知错误等量被视为感知维度,其效价必须从转移后果中推断。OHIRL分离了四个角色:M_psi学习下一数据包预测,D_omega建模残差动力学,C_eta是一个固定的内部转移后轨迹评估器,B_xi学习使用由此产生的价值证据进行后续策略更新和动作评分。C_eta采用恢复正性、持久/增长负性的残差调节取向;系数来源审计显示,等单元、原始等值和随机单调变体保留了超过92%的已发布顶级动作排名,而符号反转保留了0%。无奖励协议暴露观察转移,同时隐藏环境奖励、延迟外部评估器、成功标签和动作好坏标签。条件误差分解将B_xi的证据估计误差与残差策略优化误差分离。在2x2-XOR数据包任务中,药物和辣椒在视觉XOR上下文中获得相反的价值,并且相同的疼痛或辣度增加可能根据后果结构为正或负;B_xi达到0.952的平衡奖励符号准确率。在完整的在线交错审计中,M_psi达到留出R2=0.907,B_xi达到0.940的符号准确率,策略达到0.979的最优动作准确率,而即时数据包分数、预测误差奖励、打乱目标、零奖励和误差减少控制均崩溃。隐藏奖励的CartPole和Taxi控制、公共上下文无泄漏审计以及模块角色消融进一步测试了信息边界和组件必要性。

英文摘要

We study online reward-punishment learning when the environment provides no scalar reward or evaluative label. At each step the agent receives only a fixed-channel perceptual packet, and quantities such as pain, energy, contact, damage, or cognitive error are treated as perceptual dimensions whose valence must be inferred from transition consequences. OHIRL separates four roles: M_psi learns next-packet prediction, D_omega models residual dynamics, C_eta is a fixed internal post-transition trajectory evaluator, and B_xi learns to use the resulting value evidence for later policy updates and action scoring. C_eta uses a recovery-positive and persistence/growth-negative residual-regulation orientation; a coefficient-origin audit shows that equal-unit, raw-equal, and random monotone variants preserve more than 92% of the released top-action rankings, while sign inversion preserves 0%. The reward-free protocol exposes observation transitions while withholding environment rewards, delayed external evaluators, success labels, and action-goodness labels. A conditional error decomposition separates B_xi evidence-estimation error from residual policy-optimization error. In a 2x2-XOR packet task, medicine and chili acquire opposite value under visual XOR contexts, and the same pain or spice increase can be positive or negative depending on consequence structure; B_xi reaches 0.952 balanced reward-sign accuracy. In a full online-interleaved audit, M_psi reaches holdout R2=0.907, B_xi reaches 0.940 sign accuracy, and the policy reaches 0.979 optimal-action accuracy, while immediate packet scores, prediction-error rewards, shuffled targets, zero reward, and error-reduction controls collapse. Hidden-reward CartPole and Taxi controls, public-context no-leakage audits, and module-role ablations further test information boundaries and component necessity.

2606.19134 2026-06-18 cs.LG cs.AI 新提交

Pareto Q-Learning with Reward Machines

带奖励机的帕累托Q学习

Arnaud Lequen, Clément Legrand-Lixon, Léo Saulières

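AI summary Proposes PQLRM, which combines Pareto Q-learning with reward machines to approximate the Pareto front efficiently in multi-objective reinforcement learning while handling non-Markovian rewards.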

Comments Accepted at the ICAPS 2026 Workshop on Bridging the Gap Between AI Planning and (Reinforcement) Learning (PRL)

详情
AI中文摘要

我们提出了带奖励机的帕累托Q学习(PQLRM),这是一种用于任务的多目标强化学习算法,其奖励结构由一组奖励机(RMs)指定。PQLRM结合了帕累托Q学习(PQL)(该方法维护向量值Q估计的集合以逼近帕累托前沿)和带奖励机的Q学习(QRM)的增强(该方法利用奖励信号的因子化自动机结构)。这产生了一种多策略算法,在非马尔可夫、RM编码的奖励下保持样本效率。实验表明,PQLRM比应用于叉积MDP的朴素PQL基线收敛更快,并且可以合成QRM无法获得的帕累托最优策略。

英文摘要

We present Pareto Q-Learning with Reward Machines (PQLRM), a multi-objective reinforcement learning algorithm for tasks whose reward structure is specified by a set of reward machines (RMs). PQLRM combines Pareto Q-Learning (PQL), which maintains sets of vector-valued Q-estimates to approximate the Pareto front, with enhancements from Q-Learning with Reward Machines (QRM), which exploits the factored automaton structure of the reward signal. This yields a multi-policy algorithm that remains sample-efficient under non-Markovian, RM-encoded rewards. Experimental trials show that PQLRM converges faster than a naive PQL baseline applied to the cross-product MDP and can synthesize Pareto-optimal policies that QRM cannot.

2606.19199 2026-06-18 cs.LG cs.AI 新提交

Forecasting what Matters: Decision-Focused RL for Controlled EV Charging with Unknown Departure Times

预测关键因素:面向决策的强化学习用于未知离开时间的受控电动汽车充电

Giuseppe Gabriele, Fabio Pavirani, Seyed Soroush Karimi Madahi, Chris Develder

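Affiliations * Ghent University -- imec

AI summary Addresses the poor performance of reinforcement learning policies for EV charging when departure times are unknown: proposes a decision-focused RL framework that jointly trains the forecaster and controller end-to-end, improving total reward by up to 14% and reducing unsupplied energy by 55%.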

Comments ACM e-Energy 2026; 5 pages, 1 figure, 1 table

详情
AI中文摘要

近年来电动汽车的普及给电力系统带来了挑战,包括峰值需求增加和潜在的电网不稳定。基于强化学习的智能充电控制可以通过从历史数据中学习时间和上下文模式来缓解这些问题。然而,在现实场景中,关键特征(如离开时间)通常不可用。这使得强化学习智能体更难学习和执行有效的充电策略。为了减轻这种不确定性,训练好的预测器可以从可用数据中近似未知特征。然而,由于这些预测模型通常针对准确性(而非对下游智能体决策质量的影响)进行训练,它们的误差可能会传播并阻碍使用预测的控制器的整体性能。为了避免这种情况,我们提出了一种面向决策的强化学习框架,其中预测器是端到端训练的,即通过强化学习智能体采取的充电策略动作的反馈。这种预测器和控制器的联合训练最终产生了更高质量的动作:与没有离开时间预测的强化学习方法相比,我们提出的面向决策的强化学习方法产生了更优的充电决策,总奖励提高了14%,未供应能量(即由于电动汽车已离开而未能进行的充电)减少了55%。

英文摘要

The recent growth of EV adoption poses challenges for power systems, including increased peak demand and potential grid instability. Smart control of EV charging -- e.g., based on reinforcement learning (RL) -- can alleviate these issues by learning temporal and contextual patterns from historical data. Yet, in real-world scenarios, key features, such as departure time, are often unavailable. This, in turn, makes it harder for an RL agent to learn and execute an effective charging policy. To mitigate this uncertainty, a trained forecaster can approximate the unknown features from available data. However, since these forecasting models are typically trained for accuracy (rather than their impact on a downstream agent's decision quality), their errors may propagate and hinder the overall performance of a controller that uses the forecasts. To avoid this, we propose a decision-focused RL (DF-RL) framework in which the forecaster is trained end-to-end, i.e., with feedback from the charging policy actions taken by the RL agent. Such joint training of both the forecaster and controller ultimately results in higher-quality actions: our proposed DF-RL method yields superior charging decisions compared to other baselines, achieving up to a 14% improvement in total reward and a 55% reduction of unsupplied energy (i.e., charging that failed to happen because the EV already left), relative to the RL method without departure time forecasting.

2606.19236 2026-06-18 cs.LG cs.AI cs.CL 新提交

STARE: Surprisal-Guided Token-Level Advantage Reweighting for Policy Entropy Stability

STARE: 基于惊讶度的令牌级优势重加权以实现策略熵稳定性

Haipeng Luo, Qingfeng Sun, Songli Wu, Can Xu, Wenfeng Deng, Han Hu, Yansong Tang

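Affiliations * Shenzhen International Graduate School, Tsinghua University; Tencent Hunyuan

AI summary Addresses policy entropy collapse in RL algorithms such as GRPO: proposes STARE, which identifies entropy-critical tokens via surprisal quantiles and reweights their advantages, with a target-entropy closed-loop gate to stabilize entropy; training is stable for 1.5B-32B models across several task families, and AIME24/25 accuracy improves by 4%-8%.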

Comments LLM, Reinforcement Learning

详情
AI中文摘要

基于可验证奖励的强化学习算法(如GRPO)已成为LLMs复杂推理的主流后训练范式,但通常在训练中遭受策略熵崩溃。我们对GRPO下的令牌级熵动态进行一阶梯度分析,识别出令牌级信用分配不匹配:每个令牌的熵变化分解为轨迹级优势与下一个令牌分布上的熵敏感函数的乘积,产生优势-惊讶度四象限结构和近临界性质。受此启发,我们提出STARE(基于惊讶度的令牌级优势重加权以实现策略熵稳定性),该方法通过批次内惊讶度分位数识别熵关键令牌子集,选择性重加权其有效优势,并引入目标熵闭环门控以实现稳定的熵调节。在1.5B至32B的模型规模以及三个任务族(短思维链、长思维链和多轮工具使用)上,STARE在数千步内维持稳定的RL训练,同时将策略熵保持在目标带内。在AIME24和AIME25上,STARE在平均准确率上比DAPO和其他竞争基线高出4%-8%,反思令牌和响应长度同步增长,表明持续探索-利用平衡进一步释放了RL训练潜力。代码可在https://github.com/xxxx获取。

英文摘要

Reinforcement Learning with Verifiable Rewards algorithms like GRPO have emerged as the dominant post-training paradigm for complex reasoning in LLMs, yet commonly suffer from policy entropy collapse during training. We conduct a first-order gradient analysis of token-level entropy dynamics under GRPO and identify a token-level credit assignment mismatch: the per-token entropy variation decomposes into the product of the trajectory-level advantage and an entropy sensitivity function over the next-token distribution, yielding an advantage-surprisal four-quadrant structure and a near-criticality property. Motivated by this analysis, we propose STARE (Surprisal-guided Token-level Advantage Reweighting for policy Entropy stability), which identifies entropy-critical token subsets via batch-internal surprisal quantiles, selectively reweights their effective advantages, and incorporates a target-entropy closed-loop gate for stable entropy regulation. Across model scales from 1.5B to 32B and three task families (Short CoT, Long CoT, and Multi-Turn Tool Use), STARE sustains stable RL training over thousands of steps while maintaining policy entropy within the target band. On AIME24 and AIME25, STARE outperforms DAPO and other competitive baselines by 4%-8% in average accuracy, with reflection tokens and response length growing in tandem, indicating sustained exploration-exploitation balance that further unlocks RL training potential. Code is available at https://github.com/hp-luo/STARE.
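The quantile-based selection step can be sketched as follows. This is an assumption-laden illustration rather than the STARE rule: which tail of the surprisal distribution counts as entropy-critical, the `quantile` and `scale` values, and the direction of the reweighting are all hypothetical, and the target-entropy closed-loop gate is not modeled.

```python
import numpy as np

def reweight_by_surprisal(token_logprobs, advantages, quantile=0.8, scale=0.5):
    """Rescale advantages of tokens whose surprisal is in the top batch quantile.

    token_logprobs: log-probabilities of the sampled tokens across the batch.
    advantages: per-token copies of the trajectory-level advantages.
    Returns (reweighted advantages, boolean mask of selected tokens).
    """
    surprisal = -np.asarray(token_logprobs, dtype=float)
    threshold = np.quantile(surprisal, quantile)   # batch-internal quantile
    critical = surprisal >= threshold
    out = np.array(advantages, dtype=float)
    out[critical] *= scale                         # hypothetical reweighting rule
    return out, critical
```

Because the threshold is recomputed from each batch, the fraction of tokens touched stays roughly fixed even as the policy sharpens, which is one plausible reason to prefer quantiles over a fixed surprisal cutoff.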

2606.19328 2026-06-18 cs.LG cs.AI cs.RO 新提交

UBP2: Uncertainty-Balanced Preference Planning for Efficient Preference-based Reinforcement Learning

UBP2: 不确定性平衡的偏好规划用于高效基于偏好的强化学习

Mohamed Nabail, Leo Cheng, Jingmin Wang, Nicholas Rhinehart

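Affiliations * Learning, Embodied Autonomy, and Forecasting (LEAF) Lab, University of Toronto

AI summary Proposes UBP2, which actively directs exploration by jointly reasoning over uncertainties in the reward, dynamics, and value functions, substantially improving sample efficiency on the Meta-World benchmark.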

详情
AI中文摘要

基于偏好的强化学习提供了一种从行为的成对比较中学习奖励模型的方法,绕过了显式奖励设计的需求。然而,现有方法通常依赖于被动数据收集,并且样本效率低下,尤其是在学习的早期阶段。我们引入了一种基于模型的方法,通过联合推理奖励、动力学和值函数的不确定性来主动引导探索。我们的方法,不确定性平衡的偏好规划(UBP2),使用奖励、动力学和值函数模型的集成,根据结合了期望奖励、终值和认知不确定性的统一评分来评估候选轨迹。在此目标下的规划产生了利用和信息获取之间的显式权衡,无需临时的探索启发式。在标准正则性假设下,我们为有限时域和无限时域设置建立了次线性遗憾保证。在Meta-World基准上的实验表明,UBP2比无模型的基于偏好的方法和非乐观的基于模型的基线方法实现了显著更高的样本效率。

英文摘要

Preference-based RL provides an approach to learning reward models from pairwise comparisons of behaviors, bypassing the need for explicit reward design. However, existing methods typically rely on passive data collection and suffer from poor sample efficiency, especially during the early stages of learning. We introduce a model-based approach that actively directs exploration by jointly reasoning over uncertainties in the reward, dynamics, and value functions. Our method, Uncertainty-Balanced Preference Planning (UBP2), uses ensembles of reward, dynamics, and value function models to evaluate candidate trajectories according to a unified score that combines expected reward, terminal value, and epistemic uncertainty. Planning under this objective yields an explicit tradeoff between exploitation and information acquisition without requiring ad hoc exploration heuristics. Under standard regularity assumptions, we establish sublinear regret guarantees for both finite-horizon and infinite-horizon settings. Empirically, experiments on the Meta-World benchmark show UBP2 achieves substantially higher sample efficiency than model-free preference-based methods and non-optimistic model-based baselines.
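One way to read the unified score is as an optimistic estimate built from ensemble predictions. The sketch below assumes that reading: ensemble disagreement (standard deviation) stands in for epistemic uncertainty, `beta` is a hypothetical trade-off weight, and the dynamics ensemble is folded into the per-member reward predictions rather than modeled separately.

```python
import numpy as np

def trajectory_score(reward_preds, value_preds, beta=1.0):
    """Score one candidate trajectory from ensemble predictions.

    reward_preds: array (E, H), per-member predicted reward at each of H steps.
    value_preds:  array (E,), per-member predicted value at the final state.
    """
    reward_preds = np.asarray(reward_preds, dtype=float)
    value_preds = np.asarray(value_preds, dtype=float)
    returns = reward_preds.sum(axis=1) + value_preds   # per-member return estimate
    # Mean = expected reward plus terminal value; std = ensemble disagreement.
    return returns.mean() + beta * returns.std()

def best_candidate(candidates, beta=1.0):
    """Index of the highest-scoring (reward_preds, value_preds) pair."""
    scores = [trajectory_score(r, v, beta) for r, v in candidates]
    return int(np.argmax(scores))
```

With `beta=0` the planner is purely exploitative; raising `beta` favors trajectories on which the ensemble members disagree, which is where a new preference query is likely to be most informative.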

4. Generative Models and Probabilistic Modeling (6 papers)

2606.18509 2026-06-18 cs.LG stat.ML 新提交

Concept Modulation Models: A Unified Framework for Identifiability and Extrapolation

概念调制模型:可识别性与外推的统一框架

Soheun Yi, Yizhou Lu, Chandler Squires, Pradeep Ravikumar

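Affiliations * Department of Statistics and Data Science, Carnegie Mellon University; Machine Learning Department, Carnegie Mellon University

AI summary Proposes concept modulation models (CMMs), which unify identifiability and extrapolation analysis for conditional latent variable models through attribute potentials, lift transition-based identifiability to conditional settings, and yield algebraic extrapolation criteria.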

详情
AI中文摘要

条件潜变量模型中的可靠泛化需要理解可识别性和外推:观测属性间的变化如何决定潜在结构,以及该结构如何决定未见属性上的分布。然而,现有的可识别性和外推保证大多是模型特定的,在非线性ICA、因果表示学习、扰动建模及相关条件潜变量模型中分别进行分析。我们引入概念调制模型(CMMs),这是一类属性索引的条件生成模型,其结构为$A\to \Lambda \to C\to X$,其中属性选择调制器,调制器诱导潜在概念法则,概念生成观测特征。CMMs通过展示观测属性上的特征一致性诱导受CMM类约束的潜在概念转移,将基于转移的可识别性提升至条件设置。我们通过属性势(属性条件概念法则之间的对数密度比)表达这些约束,将通用提升步骤与模型特定的刚性论证分离。相同的势控制外推:当且仅当传输的属性势恒等式扩展到这些属性时,未见属性上的一致性成立。这导出了代数外推准则,识别出几个现有可识别性和外推结果背后的共同基于势的证明对象,并且当与这些工作中的模型特定刚性论证结合时,恢复了它们所述的结论。

英文摘要

Reliable generalization in conditional latent variable models requires understanding both identifiability and extrapolation: how observed variation across attributes determines latent structure, and how that structure determines distributions at unseen attributes. However, existing identifiability and extrapolation guarantees are largely model-specific, with separate analyses in nonlinear ICA, causal representation learning, perturbation modeling, and related conditional latent variable models. We introduce concept modulation models (CMMs), an attribute-indexed class of conditional generative models with structure $A\to \Lambda \to C\to X$, where attributes select modulators, modulators induce latent concept laws, and concepts generate observed features. CMMs lift transition-based identifiability to conditional settings by showing that feature agreement on observed attributes induces a latent concept transition constrained by the CMM class. We express these constraints through attribute potentials, log-density ratios between attribute-conditioned concept laws, separating the generic lifting step from model-specific rigidity arguments. The same potentials control extrapolation: agreement at unseen attributes holds exactly when the transported attribute-potential identities extend to those attributes. This yields algebraic extrapolation criteria, identifies the common potential-based proof objects behind several existing identifiability and extrapolation results, and, when combined with the model-specific rigidity arguments in those works, recovers their stated conclusions.

2606.18898 2026-06-18 cs.LG 新提交

Anomaly Detection for Sparse and Irregular Multivariate Time Series with Latent SDEs

基于潜在随机微分方程的稀疏不规则多元时间序列异常检测

Martin Uray, Dominik Geng, Florian Graf, Stefan Huber, Roland Kwitt

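Affiliations * Josef Ressel Centre for Intelligent and Secure Industrial Automation, University of Applied Sciences, Salzburg, Austria; University of Salzburg, Austria

AI summary For sparse, irregularly sampled real-world multivariate time series, proposes a generative approach based on latent stochastic differential equations that projects observations onto a continuous-time stochastic dynamical system, handles missing and irregular samples, and captures cyclic behavior; it ranks first on six benchmark datasets.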

Comments Preprint

详情
AI中文摘要

多元时间序列异常检测(MTSAD)在工业监控、网络安全或医疗保健等广泛应用领域至关重要。现实世界的数据通常是稀疏的、不规则采样的或部分观测的,但现有方法假设时间序列均匀采样。我们提出了一种基于潜在随机微分方程的生成方法,将观测到的时间序列投影到一个连续时间随机动力系统上,能够直接处理缺失观测和不规则采样,同时自然捕获许多现实世界用例固有的可能循环行为。在六个异常基准数据集上的实验表明,我们提出的方法在现有最先进基线中排名第一。我们进一步证明,在严重数据稀疏性下,我们的方法保持鲁棒性,而测试的基线方法性能显著下降。这些结果突显了潜在随机微分方程作为多元时间序列异常检测的自然归纳偏置,尤其是在存在现实世界不规则性的情况下。

英文摘要

Multivariate time series anomaly detection (MTSAD) is critical for a wide range of application areas, such as industrial monitoring, cybersecurity, or healthcare. Real-world data is often sparse, irregularly sampled, or partially observed, yet existing methods assume uniformly sampled time series. We propose a generative approach based on latent SDEs that projects the observed time series onto a continuous-time stochastic dynamical system; it handles missing observations and irregular sampling directly and naturally captures the cyclic behavior inherent in many real-world use cases. Experiments on six anomaly benchmark datasets show that our proposed method ranks first among state-of-the-art baselines. We further demonstrate that our method remains robust under severe data sparsity, while performance significantly degrades for the tested baseline methods. These results highlight latent SDEs as a natural inductive bias for anomaly detection in multivariate time series, especially in the presence of real-world irregularities.

2606.18997 2026-06-18 cs.LG 新提交

DIPHINE: Diffusion-based $\Phi$-ID Neural Estimator

DIPHINE: 基于扩散的 $\Phi$ID 神经估计器

Simon Pedro Galeano Munoz, Mustapha Bounoua, Giulio Franzese, Pietro Michiardi, Maurizio Filippone

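Affiliations * KAUST (King Abdullah University of Science and Technology); EURECOM

AI summary Proposes DIPHINE, the first diffusion-based neural estimator of Integrated Information Decomposition ($\Phi$ID) for continuous non-Gaussian dynamical systems; a single amortized network jointly estimates all mutual information terms, and Möbius inversion recovers the sixteen atoms.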

详情
AI中文摘要

揭示真实世界复杂系统的真实信息架构需要厘清其组件如何随时间独特存储、冗余共享和协同整合信息。集成信息分解($\Phi$ID)是一个框架,用于将多变量系统的信息动态分解为十六个非重叠原子,这些原子表征冗余、独特和协同的信息存储、传输和整合模式。现有的计算 $\Phi$ID 的方法仅限于高斯或离散系统,阻碍了其在连续非高斯动力系统中的应用。我们通过提出 DIPHINE(基于扩散的 $\Phi$ID 神经估计器)来解决这一限制,这是首个利用基于分数的扩散模型从单个摊销网络中联合估计 $\Phi$ID 所需的所有互信息项的神经估计器,并通过 Möbius 逆变换恢复十六个原子。我们提供了通过逆变换的误差传播的理论分析,表明从互信息到原子的映射的雅可比矩阵是整数值的,并且协同到协同原子被证明是最难估计的。我们在合成基准上展示了准确恢复真实原子,与已建立的互信息估计器相比具有优越性能,并在涉及真实数据的应用中无需任何分布假设即可提取生理上可解释的信息动态结构。

英文摘要

Uncovering the true informational architecture of real-world complex systems requires disentangling how their components uniquely store, redundantly share, and synergistically integrate information over time. Integrated Information Decomposition ($Φ$ID) is a framework for decomposing the information dynamics of multivariate systems into sixteen non-overlapping atoms that characterize redundant, unique, and synergistic modes of information storage, transfer, and integration. Existing methods to compute $Φ$ID are restricted to Gaussian or discrete systems, preventing its application to continuous non-Gaussian dynamical systems. We address this limitation by proposing DIPHINE (Diffusion-based $Φ$-ID Neural Estimator), the first neural estimator that leverages score-based diffusion models to jointly estimate all the mutual information terms required by $Φ$ID from a single amortized network, recovering the sixteen atoms through Möbius inversion. We provide a theoretical analysis of error propagation through the inversion, showing that the Jacobian of the mapping from mutual informations to atoms is integer-valued and that the synergy-to-synergy atom is provably the hardest to estimate. We demonstrate accurate recovery of ground-truth atoms on synthetic benchmarks, superior performance compared to established mutual information estimators, and the ability to extract physiologically interpretable information-dynamic structure on an application involving real data without any distributional assumptions.
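The recovery of atoms by Möbius inversion can be illustrated on a much smaller lattice than the sixteen-atom $Φ$ID one. The sketch below uses the two-source partial information decomposition as a stand-in (an assumption made for illustration, not DIPHINE's estimator): given three mutual information values and a redundancy value, the remaining atoms follow from a linear map whose coefficients are all integers. The function name `pid_atoms` and the XOR inputs are hypothetical.

```python
def pid_atoms(mi_x1, mi_x2, mi_joint, redundancy):
    """Invert the two-source PID lattice sums into atoms.

    Cumulative quantities on the lattice:
      mi_x1    = redundancy + unique_1
      mi_x2    = redundancy + unique_2
      mi_joint = redundancy + unique_1 + unique_2 + synergy
    Every coefficient of the inverse map below is +1 or -1.
    """
    unique_1 = mi_x1 - redundancy
    unique_2 = mi_x2 - redundancy
    synergy = mi_joint - mi_x1 - mi_x2 + redundancy
    return {
        "redundancy": redundancy,
        "unique_1": unique_1,
        "unique_2": unique_2,
        "synergy": synergy,
    }


# Illustrative XOR target: each source alone carries 0 bits, jointly 1 bit.
xor_atoms = pid_atoms(mi_x1=0.0, mi_x2=0.0, mi_joint=1.0, redundancy=0.0)

# Illustrative fully redundant copy: both sources carry the same 1 bit.
copy_atoms = pid_atoms(mi_x1=1.0, mi_x2=1.0, mi_joint=1.0, redundancy=1.0)
```

In this toy lattice the synergy atom depends on all four inputs, so it collects estimation error from each of them, while the unique atoms depend on only two.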

2606.19162 2026-06-18 cs.LG cs.CV 新提交

The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL

奖励一直就在你的数据中:用判别器引导的强化学习纠正流匹配

Nicolas Beltran-Velez, Felix Friedrich, Zhang Xiaofeng, Reyhane Askari-Hemmat, Xiaochuang Han, Adriana Romero-Soriano, Michal Drozdzal

发表机构 * FAIR at Meta(Meta FAIR) Columbia University(哥伦比亚大学) McGill University(麦吉尔大学) Canada CIFAR AI Chair(加拿大CIFAR人工智能主席)

AI总结 针对流匹配模型因损失函数与样本质量不匹配导致的视觉缺陷,提出判别器引导的强化学习(DRL),利用预训练空间中判别器的logit作为奖励,显著提升无引导FID和语义FD,并改善偏好对齐。

Comments 84 pages, including appendices

详情
AI中文摘要

得分匹配和流匹配模型通常依赖基于偏好的强化学习来实现两个目的:与主观偏好对齐,以及令人惊讶地恢复视觉真实性和连贯对象结构等属性——而这些属性本应通过匹配训练从数据本身学习。我们认为这反映了结构上的不匹配。匹配损失衡量训练时边缘分布下速度或得分场的$\ell_2$回归误差,这一代理指标与决定推理时样本质量的视觉和语义属性对齐不良。给定一个与这些属性对齐的奖励,强化学习通过评估模型自身生成的样本并直接遵循奖励景观来规避不匹配。挑战在于如何在不依赖人类偏好的情况下获得这样的奖励,因为人类偏好昂贵且会将数据真实性与标注者倾向混为一谈。我们提出判别器引导的强化学习(DRL)。DRL训练一个判别器,在预训练表示空间中区分数据样本和基础模型样本,并将其logit作为KL正则化强化学习中的奖励。预训练空间将判别器限制在感知有意义的方向上,而logit估计数据与模型之间的对数似然比,这是针对数据分布的最优奖励。在SiT、JiT、REPA和RAE上,DRL降低了无引导FID(例如,SiT上从9.38降至2.62)和语义空间FD(例如,SiT上DINOv3从88.2降至19.3),在所有骨干网络上均有一致提升,并且在没有经过偏好奖励训练的情况下改善了人类偏好奖励。在后续基于偏好的后训练中,DRL还在偏好奖励与图像保真度之间产生了更好的帕累托前沿,在提高对齐度的同时减少了过饱和和过亮等低级伪影。

英文摘要

Score- and flow-matching models often rely on preference-based reinforcement learning for two purposes: aligning with subjective preferences and, surprisingly, recovering properties such as visual realism and coherent object structure that matching-based training is intended to learn from the data itself. We argue that this reflects a structural mismatch. Matching losses measure $\ell_2$ regression error on the velocity or score field under training-time marginals, a proxy poorly aligned with the visual and semantic properties that determine sample quality at inference. Given a reward aligned with these properties, RL sidesteps the mismatch by evaluating the model on its own samples and following the reward landscape directly. The challenge is to obtain such a reward without relying on human preferences, which are expensive and conflate data realism with annotator inclinations. We propose Discriminator-Guided RL (DRL). DRL trains a discriminator to separate data from base-model samples in a pretrained representation space and uses its logit as the reward in KL-regularized RL. The pretrained space restricts the discriminator to perceptually meaningful directions, and the logit estimates the log-likelihood ratio between data and model, which is the optimal reward for targeting the data distribution. Across SiT, JiT, REPA, and RAE, DRL reduces guidance-free FID (e.g., $9.38 \to 2.62$ on SiT) and semantic-space FD (e.g., $88.2 \to 19.3$ on DINOv3 for SiT), with consistent gains across all backbones, and improves human-preference rewards without training on them. It also yields a better Pareto frontier between preference reward and image fidelity under subsequent preference-based post-training, increasing alignment while reducing low-level artifacts such as oversaturation and excessive brightness.
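A short derivation shows why the discriminator logit is a natural reward. It assumes a Bayes-optimal discriminator trained on balanced classes; the symbols $D^*$, $p_{\text{data}}$, $p_\theta$, and $\beta$ are notation chosen here and may differ from the paper's. In DRL the densities would be those of the pretrained representation rather than of raw pixels.

```latex
% Optimal discriminator and its logit:
D^*(x) = \frac{p_{\text{data}}(x)}{p_{\text{data}}(x) + p_\theta(x)}
\quad\Longrightarrow\quad
r(x) := \operatorname{logit} D^*(x) = \log \frac{p_{\text{data}}(x)}{p_\theta(x)}.

% KL-regularized RL around the base model p_\theta:
\max_{\pi}\; \mathbb{E}_{x \sim \pi}\,[r(x)] - \beta\, \mathrm{KL}\!\left(\pi \,\|\, p_\theta\right)
\quad\Longrightarrow\quad
\pi^*(x) \propto p_\theta(x)\, e^{r(x)/\beta}.

% With \beta = 1 the base-model density cancels:
\pi^*(x) \propto p_\theta(x)\, \frac{p_{\text{data}}(x)}{p_\theta(x)} = p_{\text{data}}(x).
```

Under these assumptions the KL-regularized optimum with unit regularization strength is the data distribution itself, which is the sense in which the log-likelihood ratio targets the data.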

2606.19264 2026-06-18 cs.LG cs.CL 新提交

Structured Inference with Large Language Gibbs

大语言吉布斯结构化推理

Sanghyeok Choi, Henry Gouk, Esmeralda S. Whitammer

AI总结 提出大语言吉布斯方法,利用大语言模型的条件分布作为转移算子进行结构化概率推理,通过迭代重采样变量避免顺序偏差,在合成分布、一致性推理和贝叶斯结构学习中验证有效性。

Comments Code: https://github.com/hyeok9855/large-language-gibbs

详情
AI中文摘要

大型语言模型(LLMs)中编码的知识可以作为描述复杂世界变量的结构化推理的基础,但以概率一致的方式访问这些知识构成了一个困难的推理问题。我们提出了大语言吉布斯,一种结构化概率推理方案,它使用LLM的条件分布作为转移算子。不是通过单次自回归生成来采样结构化对象,而是利用LLM的下一个标记条件分布,在给定其他变量的条件下迭代地重采样单个变量。这种方法避免了顺序依赖偏差,并产生一个反映所有局部条件分布之间折衷的平稳分布。我们将这种方法应用于从合成分布中采样、一致性推理任务和贝叶斯结构学习。结果表明,在通过噪声LLM条件分布可访问的世界先验下,MCMC中使用LLM条件分布是用于结构化概率推理的一次性生成的实际替代方案。

英文摘要

The knowledge encoded in large language models (LLMs) can serve as a substrate for structured reasoning over variables describing a complex world, but accessing this knowledge in a probabilistically coherent manner poses a difficult inference problem. We propose Large Language Gibbs, a scheme for structured probabilistic inference that uses conditional distributions of an LLM as transition operators. Rather than sampling structured objects through single-pass autoregressive generation, we iteratively resample individual variables conditioned on others using an LLM's next-token conditionals. This approach avoids order-dependent biases and produces a stationary distribution that reflects a compromise between all local conditionals. We apply this approach to sampling from synthetic distributions, consistent reasoning tasks, and Bayesian structure learning. The results suggest that the use of LLM conditionals in MCMC is a practical alternative to one-pass generation for structured probabilistic inference under a world prior accessible through noisy LLM conditionals.
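The idea of using conditionals as transition operators can be sketched with an ordinary two-variable Gibbs sampler. In the sketch below a fixed lookup table stands in for the LLM's next-token conditional (a hypothetical simplification; a real system would query a model), and the table is derived from the joint $p(0,0)=p(1,1)=0.4$, $p(0,1)=p(1,0)=0.1$ so that the stationary distribution is known.

```python
import random

# Hypothetical stand-in for an LLM conditional: P(variable = 1 | other variable).
# Derived from the joint p(0,0) = p(1,1) = 0.4, p(0,1) = p(1,0) = 0.1.
P_ONE_GIVEN_OTHER = {0: 0.2, 1: 0.8}


def gibbs_samples(num_sweeps, seed=0):
    """Systematic-scan Gibbs: resample a given b, then b given the new a."""
    rng = random.Random(seed)
    a, b = 0, 0
    samples = []
    for _ in range(num_sweeps):
        a = 1 if rng.random() < P_ONE_GIVEN_OTHER[b] else 0
        b = 1 if rng.random() < P_ONE_GIVEN_OTHER[a] else 0
        samples.append((a, b))
    return samples


samples = gibbs_samples(20_000)
frac_equal = sum(a == b for a, b in samples) / len(samples)   # target 0.8
frac_a_one = sum(a for a, _ in samples) / len(samples)        # target 0.5
```

Because these conditionals are mutually consistent, the chain recovers the joint they came from. Noisy LLM conditionals need not be consistent with any single joint, which is the case where the abstract describes the stationary distribution as a compromise between them.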

2606.19315 2026-06-18 cs.LG 新提交

Diffusion-Proof: Recipe for Formal Theorem Proving Beyond Auto-Regressive Generation

Diffusion-Proof:超越自回归生成的形式定理证明方案

Ruida Wang, Rui Pan, Pengcheng Wang, Shizhe Diao, Tong Zhang

发表机构 * University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校) NVIDIA(英伟达)

AI总结 提出Diffusion-Proof框架,首次将扩散语言模型应用于形式定理证明,通过全证明生成和局部校正方法,在ProofNet和MiniF2F上分别提升1.61%和6.14%,并解决了一个DeepSeek-Prover-V2-7B无法解决的IMO问题。

详情
AI中文摘要

近年来,增强大型语言模型(LLMs)的形式数学推理能力已成为数学和计算机科学社区的关键焦点。虽然在使用最先进的自回归(AR)LLMs进行形式定理证明方面取得了显著进展,但这些模型存在固有局限性。它们的下一个词预测生成方法可能因长程连贯性挑战和长序列错误累积而导致次优性能。最近,扩散LLMs(dLLMs)通过多词块的迭代去噪生成文本,提供了一种有前景的替代方案。然而,dLLMs在形式数学中的应用(其中保持长程连贯性至关重要)仍然研究不足。为解决上述挑战,我们提出了**Diffusion-Proof**,据我们所知,这是第一个训练和应用dLLMs进行形式定理证明的框架。我们的框架包含两种模型的训练和推理方法。第一个是*dLLM-Prover-7B*,它执行具有长程连贯策略使用的全证明写作。第二个是*dLLM-Corrector-7B*,这是一种新颖的大块扩散校正模型。它利用dLLMs的填充能力,使用双向信息进行局部证明校正。大量实验表明,**Diffusion-Proof**相对显著优于在同一数据集上训练的AR LLM基线。与基线相比,**Diffusion-Proof**在ProofNet-Test和MiniF2F-Test基准上分别实现了**1.61%**和**6.14%**的绝对提升。值得注意的是,**Diffusion-Proof**成功解决了一个更先进的思考模型DeepSeek-Prover-V2-7B无法解决的IMO问题,展示了dLLMs在形式定理证明中的独特优势。

英文摘要

Enhancing the formal math reasoning capabilities of Large Language Models (LLMs) has become a key focus in both mathematical and computer science communities in recent years. While significant progress has been made in using state-of-the-art Auto-Regressive (AR) LLMs for formal theorem proving, these models suffer from inherent limitations. Their next-token prediction generation methods may yield suboptimal performance due to the challenges of long-range coherence and the compounding of errors over long sequences. Recent advancements in diffusion LLMs (dLLMs), which generate text through iterative denoising of a multi-token block, offer a promising alternative. However, the application of dLLMs to formal mathematics, where maintaining long-range coherence is critical, remains largely understudied. To address the challenges above, we propose **Diffusion-Proof**, to the best of our knowledge the first framework to train and apply dLLMs for formal theorem proving. Our framework contains training and inference methods for two models. The first is *dLLM-Prover-7B*, which performs whole-proof writing with long-range coherent tactic usage. The second is *dLLM-Corrector-7B*, a novel large-block diffusion-based correction model. It leverages the in-filling capabilities of dLLMs to perform local proof correction using bi-directional information. Extensive experiments demonstrate that **Diffusion-Proof** significantly outperforms the AR LLM baseline trained on the same dataset. **Diffusion-Proof** achieves an absolute improvement of **1.61%** on ProofNet-Test and **6.14%** on MiniF2F-Test compared to the baseline. Notably, **Diffusion-Proof** successfully resolves one IMO problem that the more advanced thinking model DeepSeek-Prover-V2-7B could not solve, showcasing the unique advantage of dLLMs in formal theorem proving.

5. 优化、泛化与理论分析 12 篇

2606.18303 2026-06-18 cs.LG cs.AI 新提交

A Link between Shock-wave Theory and Symmetry-reduced Stochastic Gradient Descent for Artificial Neural Networks

冲击波理论与人工神经网络对称约化随机梯度下降之间的联系

Taiki Miyagawa

发表机构 * NEC Corporation(NEC公司)

AI总结 本文通过微分几何、李群和流体力学,建立了冲击波理论与对称商化随机梯度下降学习动力学之间的显式数学联系,并应用于多种神经网络架构。

Comments Accepted to the 35th International Conference on Artificial Neural Networks (ICANN) 2026

详情
AI中文摘要

我们利用微分几何、李群理论和流体力学,在冲击波理论与随机梯度下降的对称商化学习动力学之间建立了显式的数学联系。具体而言,在商化参数对称性并应用局部熵粗粒化后,有效动力学满足商流形上的粘性Hamilton-Jacobi方程。此外,假设原始参数动力学可由商空间上的梯度场概括,粗粒化损失函数的梯度服从Burgers型方程,且可严格建立激波形成。我们将该理论应用于多层感知机、卷积神经网络、Transformer和平均场网络,并证明它们满足Hamilton-Jacobi或Burgers型方程。我们推测该框架也能为深度学习提供实用的诊断工具。在诸如Transformer等架构中,原始参数范数常因对称冗余而失真,可能产生误导,而对称校正的商可观测量为监测、预测和控制训练阶段转变提供了原则性基础。

英文摘要

We develop a mathematically explicit link between shock-wave theory and the symmetry-quotiented learning dynamics of stochastic gradient descent, drawing on differential geometry, Lie group theory, and fluid mechanics. Specifically, after quotienting parameter symmetries and applying local-entropy coarse-graining, the effective dynamics satisfy a viscous Hamilton--Jacobi equation on the quotient manifold. Moreover, under the assumption that the raw parameter dynamics can be summarized by a gradient field on the quotiented space, the gradient of the coarse-grained loss function obeys a Burgers-type equation, and shock formation can be established rigorously. We apply our theory to multilayer perceptrons, convolutional neural networks, Transformers, and mean-field networks, and show that they obey the Hamilton--Jacobi or Burgers-type equations. We conjecture that this framework also yields practical diagnostics for deep learning. In architectures such as Transformers, raw parameter norms are often distorted by symmetry redundancy and may therefore be misleading, whereas symmetry-corrected quotient observables provide a principled basis for monitoring, forecasting, and controlling training-phase transitions.

2606.18306 2026-06-18 cs.LG stat.ML 新提交

Fisher Width: A Geometric Measure of Complexity on Statistical Manifolds

Fisher宽度:统计流形上的几何复杂度度量

Vu Khac Ky

发表机构 * Department of Mathematics, FPT University(FPT大学数学系)

AI总结 提出Fisher宽度作为统计流形上高斯宽度的类比,利用Fisher信息度量局部几何,并证明其保持高斯宽度的关键性质,应用于Fisher-Lipschitz假设类的泛化界。

Comments 48 pages, 3 figures

详情
AI中文摘要

高斯宽度是高维概率、压缩感知、凸优化和学习理论中的一个核心几何复杂度度量。它量化了集合沿随机方向的平均延伸程度,从而捕捉了约束集、假设类和下降锥的有效维度。然而,这一概念本质上是欧几里得的。统计模型则具有由Fisher信息度量诱导的自然黎曼几何,其中方向根据统计可区分性而非环境欧几里得长度进行缩放。我们引入了Fisher宽度,即统计流形上高斯宽度的Fisher几何类比。在参数点$\theta$处,Fisher宽度将欧几里得恒等替换为局部度量张量$G(\theta)^{1/2}$,测量Fisher重缩放集的高斯宽度。这使得所得量对局部统计曲率敏感,且在光滑重参数化下不变。我们发展了Fisher宽度的基本理论,表明它保留了高斯宽度的关键结构特征,包括集中性、度量扰动稳定性以及与欧几里得基线的谱比较界,同时捕捉了欧几里得度量无法察觉的各向异性几何效应。作为应用,我们证明了Fisher-Lipschitz假设类的泛化界,并提出了可计算的估计量,在MNIST上对三个模型类进行了实证评估。Fisher宽度之于统计流形,正如高斯宽度之于欧几里得凸体。这项工作为研究弯曲统计流形上的复杂性和学习奠定了基础。

英文摘要

Gaussian width is a central geometric complexity measure in high-dimensional probability, compressed sensing, convex optimization, and learning theory. It quantifies the average extent of a set along random directions, thereby capturing the effective dimension of constraint sets, hypothesis classes, and descent cones. However, this notion is intrinsically Euclidean. Statistical models instead carry a natural Riemannian geometry induced by the Fisher information metric, where directions are scaled according to statistical distinguishability rather than ambient Euclidean length. We introduce Fisher width, a Fisher-geometric analogue of Gaussian width for statistical manifolds. At a parameter point $θ$, Fisher width replaces the Euclidean identity by the local metric tensor $G(θ)^{1/2}$, measuring the Gaussian width of the Fisher-rescaled set. This makes the resulting quantity sensitive to local statistical curvature and invariant under smooth reparameterizations. We develop the basic theory of Fisher width, showing that it retains key structural features of Gaussian width, including concentration, metric perturbation stability, and spectral comparison bounds with the Euclidean baseline, while also capturing anisotropic geometric effects invisible to Euclidean measures. As an application, we prove a generalization bound for Fisher-Lipschitz hypothesis classes and propose computable estimators, which we evaluate empirically on MNIST across three model classes. Fisher width is to statistical manifolds what Gaussian width is to Euclidean convex bodies. This work lays the foundation for studying complexity and learning on curved statistical manifolds.
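A Monte Carlo sketch of the construction described above: the Gaussian width $\mathbb{E}\sup_{t\in T}\langle g, t\rangle$ of a finite set, and the same quantity after rescaling the set by $G(θ)^{1/2}$. The diagonal Fisher matrix, the two-point set, and the function names are assumptions made for illustration; the paper's exact definition and estimators may differ.

```python
import math
import random


def gaussian_width(points, num_draws=20_000, seed=0):
    """Monte Carlo estimate of E[max over points of <g, point>], g ~ N(0, I)."""
    rng = random.Random(seed)
    dim = len(points[0])
    total = 0.0
    for _ in range(num_draws):
        g = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        total += max(sum(gi * pi for gi, pi in zip(g, p)) for p in points)
    return total / num_draws


def fisher_rescale(points, fisher_diag):
    """Apply G^{1/2} to each point, assuming a diagonal Fisher matrix G."""
    scale = [math.sqrt(v) for v in fisher_diag]
    return [[s * pi for s, pi in zip(scale, p)] for p in points]


# Endpoints of a unit segment along the first coordinate axis.
segment_ends = [[1.0, 0.0], [-1.0, 0.0]]
euclid_width = gaussian_width(segment_ends)
# Hypothetical Fisher matrix: the first direction is 4x more distinguishable.
fisher_width = gaussian_width(fisher_rescale(segment_ends, [4.0, 1.0]))
```

For this set the Euclidean value estimates $\mathbb{E}|g_1| = \sqrt{2/\pi}$, and the Fisher-rescaled value is twice as large because the set lies along the direction that the assumed metric stretches by $\sqrt{4}$.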

2606.18420 2026-06-18 cs.LG q-bio.QM stat.ML 新提交

Measurement noise limits the advantage of nonlinear models over linear models in biomedical prediction

测量噪声限制了非线性模型在生物医学预测中相对于线性模型的优势

Marc-Andre Schulz, Kerstin Ritter

发表机构 * Hertie Institute for AI in Brain Health, University of Tübingen(赫蒂人工智能脑健康研究所,图宾根大学) Tübingen AI Center, University of Tübingen(图宾根人工智能中心,图宾根大学) Department of Psychiatry and Neurosciences, Charité – Universitätsmedizin Berlin(精神病学与神经科学系,柏林夏里特医学院) Bernstein Center for Computational Neuroscience, Berlin(伯恩斯坦计算神经科学中心,柏林) German Center for Mental Health (DZPG), partner site Tübingen(德国心理健康中心(DZPG),图宾根合作站点)

AI总结 本文指出,在生物医学表格数据中,测量噪声会削弱非线性结构,导致非线性模型与线性模型性能相当,并提出了一个精确的超额风险恒等式,揭示了测量可靠性、样本量和特征表示三个条件必须同时满足才能体现非线性优势。

详情
AI中文摘要

在生物医学表格数据上,诸如深度网络、梯度提升树和核方法等灵活模型,在给定相同特征的情况下,反复被线性回归和逻辑回归匹配或击败。通常的反应是将其视为模型方面的不足,需要通过更多数据、更好的架构或调参来修复,假设非线性结构存在而模型未能捕捉到。我们认为,当限制因素是测量而非模型时(这在生物医学中经常发生),这些修复无法奏效。加性噪声模糊了群体最优预测器,并且由于模糊在去除函数的广泛形状之前先去除精细、快速变化的细节,它比线性结构更快地抹去非线性结构。一个k阶交互作用被特征可靠性的k次幂衰减,而线性部分只衰减一次。在生物医学测量典型的可靠性下,即使底层生物学是强非线性的,非线性优势也可能消失,并且噪声所移除的部分无法通过更大的队列或更灵活的模型恢复,只能通过更好的测量。非线性是隐藏的,而非缺失,线性模型与灵活模型之间的平局本身并不能对生物学做出定论。这些片段是经典的,来自测量误差统计、心理测量学和高斯分析,我们将它们组合成一个精确的超额风险恒等式。测量可靠性是与样本量和特征表示并列的三个条件之一,必须对齐才能使灵活模型发挥作用,而它们共同只留下一个狭窄的窗口,大多数生物医学任务落在此窗口之外。在140个英国生物银行任务中,灵活模型与线性模型之间的差距(如果存在)带有预测的噪声特征,并且这三个条件可以通过干预而非仅通过基准测试来分离。

英文摘要

On biomedical tabular data, flexible models such as deep networks, gradient-boosted trees, and kernel methods are repeatedly matched or beaten by linear and logistic regression given the same features. The usual reaction is to treat this as a model-side shortfall, to be fixed with more data, a better architecture, or tuning, on the assumption that the nonlinear structure is there and the model has failed to capture it. We argue that these fixes cannot help when the binding limit is the measurement rather than the model, as it frequently is in biomedicine. Additive noise blurs the population-optimal predictor, and because blurring removes a function's fine, rapidly varying detail before its broad shape, it erases nonlinear structure faster than linear structure. A degree-$k$ interaction is attenuated by the $k$-th power of feature reliability, while the linear part is attenuated only once. At the reliabilities typical of biomedical measurement, the nonlinear advantage can vanish even when the underlying biology is strongly nonlinear, and what the noise removes cannot be recovered by a larger cohort or a more flexible model, only by better measurement. The nonlinearity is hidden, not absent, and a tie between linear and flexible models is not by itself a verdict on the biology. These pieces are classical, drawn from measurement-error statistics, psychometrics, and Gaussian analysis, and we assemble them into an exact excess-risk identity. Measurement reliability is one of three conditions, alongside sample size and feature representation, that must align for a flexible model to help, and together they leave only a narrow window that most biomedical tasks fall outside. Across 140 UK Biobank tasks, the gap between flexible and linear models, where it exists, carries the predicted noise signature, and the three conditions can be separated by intervention but not by a benchmark alone.
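The attenuation claim can be checked numerically in a toy setting. The sketch below assumes independent standard Gaussian features, additive Gaussian noise with reliability $r = \mathrm{Var}(x)/\mathrm{Var}(z)$, and a pure product interaction of $k$ features as the target; under those assumptions the best predictor from the noisy features is $r^k \prod_i z_i$ and its $R^2$ is $r^k$. The function name and the setting are illustrative, not the paper's experiments.

```python
import math
import random


def attenuated_r2(reliability, k, n=100_000, seed=0):
    """Empirical R^2 of the optimal predictor of y = x_1 * ... * x_k
    when only noisy copies z_i = x_i + e_i are observed."""
    rng = random.Random(seed)
    # Noise variance chosen so that Var(x) / Var(z) equals `reliability`.
    noise_sd = math.sqrt((1.0 - reliability) / reliability)
    targets, preds = [], []
    for _ in range(n):
        y, pred = 1.0, 1.0
        for _ in range(k):
            x = rng.gauss(0.0, 1.0)
            z = x + rng.gauss(0.0, noise_sd)
            y *= x
            pred *= reliability * z  # E[x | z] = reliability * z
        targets.append(y)
        preds.append(pred)
    mean_y = sum(targets) / n
    var_y = sum((y - mean_y) ** 2 for y in targets) / n
    mse = sum((y - p) ** 2 for y, p in zip(targets, preds)) / n
    return 1.0 - mse / var_y


linear_r2 = attenuated_r2(0.6, k=1)    # theory: 0.6
pairwise_r2 = attenuated_r2(0.6, k=2)  # theory: 0.6 ** 2 = 0.36
```

At a reliability of 0.6 the linear signal keeps about 60% of its explainable variance while a two-way interaction keeps about 36%, and no amount of extra samples changes those ceilings in this setup.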

2606.18465 2026-06-18 cs.LG cs.AI 新提交

What Does the Weight Norm Control in Grokking? Logit-Scale Mediation under Cross-Entropy

权重范数在Grokking中控制什么?交叉熵下的logit尺度中介作用

Truong Xuan Khanh

发表机构 * H&K Research Studio, Clevix LLC

AI总结 本文通过固定权重范数并改变输出温度,发现Grokking延迟主要由logit尺度(logit scale)决定,权重范数仅通过影响logit尺度间接起作用。

Comments 16 pages, 10 tables and 4 figures. Code and data to reproduce all numbers, tables, and figures: https://github.com/ClevixLab/grokking-logit-scale

详情
AI中文摘要

Grokking,即从记忆到泛化的延迟跳跃,通常与权重范数相关:范数越小,泛化越早。我们探究范数实际控制什么。通过钳位固定权重范数并仅改变输出温度,我们在交叉熵下将Grokking延迟滑动到其整个范数诱导范围;将有效logit尺度匹配回基线可恢复两个模数下约85%的延迟。在范数和温度的网格上,延迟仅由logit尺度决定($R^2 = 0.97$),范数仅额外贡献1-2%。该效应依赖于损失函数:在均方误差下,logit尺度被固定,范数通过不同路径起作用。记忆对照实验、float64 softmax崩溃审计和无LayerNorm的Transformer均指向同一通道。从同一状态分叉,延迟遵循钳位的范数值而非钳位操作本身,这排除了重缩放伪影。近端变量是logit尺度及其驱动的softmax饱和;权重范数仅是上游手柄。所有数字、表格和图表均可从发布的代码和数据中复现。

英文摘要

Grokking, the delayed jump from memorization to generalization, is usually tied to the weight norm: a smaller norm generalizes sooner. We ask what the norm actually controls. Holding the weight norm fixed by clamping and varying only an output temperature, we slide the grokking delay across its entire norm-induced range under cross-entropy; matching the effective logit scale back to baseline recovers about 85% of the delay at two moduli. Across a grid of norms and temperatures the delay collapses onto the logit scale alone ($R^2 = 0.97$), with the norm adding 1-2% beyond it. The effect is loss-dependent: under mean-squared error the logit scale is pinned and the norm acts through a different route. A memorization control, a float64 softmax-collapse audit, and a no-LayerNorm transformer point to the same channel. Forking arms from one identical state, the delay follows the held norm value and not the clamp operation, which closes a rescaling-artifact concern. The proximal variable is the logit scale and the softmax saturation it drives; the weight norm is only an upstream handle. All numbers, tables, and figures reproduce from released code and data.
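The interplay between weight norm and output temperature can be sketched with a single linear readout, a deliberate simplification of the networks studied in the paper. Scaling the readout weights by a factor multiplies the logits by that factor; dividing by a matching temperature restores the original logit scale and therefore the original softmax. The weights, hidden vector, and function names below are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    shift = max(logits)
    exps = [math.exp(v - shift) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def readout_probs(weights, hidden, temperature):
    """Class probabilities of a linear readout with an output temperature."""
    logits = [
        sum(w * h for w, h in zip(row, hidden)) / temperature
        for row in weights
    ]
    return softmax(logits)


weights = [[1.0, -0.5], [0.2, 0.3], [-0.7, 0.9]]
hidden = [0.8, -1.2]
tripled = [[3.0 * w for w in row] for row in weights]

baseline = readout_probs(weights, hidden, temperature=1.0)
larger_norm = readout_probs(tripled, hidden, temperature=1.0)
scale_matched = readout_probs(tripled, hidden, temperature=3.0)
```

Here `larger_norm` is more saturated than `baseline`, while `scale_matched` coincides with `baseline` even though the weight norm is three times larger, which is the toy analogue of the norm acting only through the logit scale.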

2606.18538 2026-06-18 cs.LG stat.ML 新提交

Effects of sparsity and superposition on loss in simple autoencoders

稀疏性与叠加对简单自编码器损失的影响

Mriganka Basu Roy Chowdhury, Eric McLaughlin Weiner

发表机构 * Department of Statistics, UC Berkeley(加州大学伯克利分校统计学系) Department of Materials Science, UC Berkeley(加州大学伯克利分校材料科学系)

AI总结 研究神经网络中多语义性源于叠加现象,通过数学分析稀疏输入下自编码器的L2重构损失上下界,验证并扩展了Elhage等人的实证结果。

Comments 16 pages, 3 figures

详情
AI中文摘要

神经网络机械可解释性的主要困难之一是出现多语义性,即每个神经元通常负责多个不同任务,阻碍了对其功能的清晰解释。Elhage等人(2022)的开创性论文认为,这是由于叠加现象,即神经网络将不同特征表示为低维空间中的非正交方向,这种策略可以在不牺牲保真度的情况下实现更大的数据压缩,因为输入向量具有特征稀疏性。Elhage等人(2022)在一个相当自然且简单的具有稀疏输入的自编码器中实证验证了这些假设。本文的贡献在于分析叠加现象发生和最优性的数学基础,同时严格证实了他们的一些发现。特别地,我们为幂激活函数提供了L2重构损失的上界和下界,在非常稀疏的情况下是紧的。文末还包含一个简短的开放问题列表。

英文摘要

One of the major difficulties in the mechanistic interpretability of neural networks is the occurrence of polysemanticity, which suggests that each neuron is typically responsible for multiple different tasks, impeding a clean interpretation of their function. The seminal paper of Elhage et al. (2022) argues that this occurs due to superposition, a phenomenon where the neural network represents distinct features as non-orthogonal directions in a lower-dimensional space, a strategy that allows much greater compression of the data without sacrificing fidelity due to the feature sparsity of input vectors. Elhage et al. (2022) empirically validate these hypotheses in a rather natural and simple autoencoder with sparse inputs. The contribution of the present work is to analyze the mathematical basis for the occurrence and optimality of superposition, while rigorously corroborating some of their findings. In particular, we provide upper and lower bounds for the L2 reconstruction loss, tight in the very sparse regime, for power activation functions. A short list of interesting open problems is also included at the end.

2606.18778 2026-06-18 cs.LG stat.ML 新提交

Online Distributional Prediction via Latent Cluster Geometry Under Drift and Corruption

漂移与污染下基于潜在簇几何的在线分布预测

Navyansh Mahla, Prateek Chanda, Ganesh Ramakrishnan

发表机构 * Indian Institute of Technology, Bombay(印度理工学院,孟买)

AI总结 针对非平稳流中的在线分布预测问题,提出一种基于潜在簇几何的吉布斯准后验方法,通过可逆跳跃MCMC采样变维后验,并引入重启变体应对漂移,在亚线性污染预算和运输代价下实现亚线性Wasserstein遗憾。

详情
AI中文摘要

非平稳流中的在线学习通常被表述为跟踪点估计,但许多应用需要预测完整的数据生成分布。我们研究漂移和对抗性污染下的在线分布预测。我们的方法通过潜在簇几何表示每个候选律:一个可变大小的中心配置,组织概率质量并诱导预测分布。这些配置上的吉布斯准后验通过后验平均产生在线预测器,所得变维后验可通过可逆跳跃MCMC采样。因此,该方法避免了指定参数化流律,同时保留了用于不确定性、正则化和比较的结构化潜在空间。我们以相对于时变真实律的累积Wasserstein-1遗憾来评估性能。分析分离了两种效应:污染扰动基于损失的后验更新,而漂移使长时域后验记忆过时。我们通过一个重启变体来解决后者,该变体在时间上局部化相同的准贝叶斯更新。所得的高概率界分解为PAC-Bayesian复杂度项、污染敏感的后验扰动项以及由\(A_T^{\mathrm{OT}}=\sum_{t=2}^T W_2^2(p_{t-1}^*,p_t^*)\)驱动的动态最优传输项。在有界支撑、稳定潜在几何、预测映射正则性、预言可实现性、局部化重启窗口、亚线性传输作用和亚线性污染预算下,重启预测器实现了亚线性累积Wasserstein遗憾。这些保证不需要对流、漂移机制或污染过程进行参数化建模。

英文摘要

Online learning in non-stationary streams is often formulated as tracking a point estimate, but many applications require predicting the full data-generating distribution. We study online distributional prediction under drift and adversarial corruption. Our approach represents each candidate law through a latent cluster geometry: a variable-size configuration of centers that organizes probability mass and induces a predictive distribution. A Gibbs quasi-posterior over these configurations yields an online predictor by posterior averaging, and the resulting variable-dimensional posterior can be sampled with reversible-jump MCMC. The method therefore avoids specifying a parametric streaming law while retaining a structured latent space for uncertainty, regularization, and comparison. We evaluate performance by cumulative Wasserstein-1 regret against the time-varying true law. The analysis separates two effects: corruption perturbs the loss-based posterior update, whereas drift makes long-horizon posterior memory stale. We address the latter with a restarted variant that temporally localizes the same quasi-Bayesian update. The resulting high-probability bounds decompose into a PAC-Bayesian complexity term, a corruption-sensitive posterior perturbation term, and a dynamic optimal-transport term driven by \(A_T^{\mathrm{OT}}=\sum_{t=2}^T W_2^2(p_{t-1}^*,p_t^*)\). Under bounded support, stable latent geometry, predictive-map regularity, oracle realizability, localized restart windows, sublinear transport action, and sublinear corruption budget, the restarted predictor achieves sublinear cumulative Wasserstein regret. These guarantees require no parametric model for the stream, drift mechanism, or corruption process.
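To make the evaluation metric concrete, the sketch below computes the Wasserstein-1 distance between two one-dimensional empirical distributions of equal size (the mean absolute difference of sorted samples) and sums it over rounds. Reading the cumulative regret as a plain sum of per-round distances between predicted and true laws is an assumption, as are the one-dimensional setting and the function names; the paper's definition may include a comparator term.

```python
def wasserstein1_1d(samples_p, samples_q):
    """W1 between two equal-size 1-D empirical distributions."""
    if len(samples_p) != len(samples_q):
        raise ValueError("this sketch assumes equal sample sizes")
    xs, ys = sorted(samples_p), sorted(samples_q)
    return sum(abs(x - y) for x, y in zip(xs, ys)) / len(xs)


def cumulative_w1(predicted_stream, true_stream):
    """Sum of per-round W1 distances between predicted and true laws."""
    return sum(
        wasserstein1_1d(pred, true)
        for pred, true in zip(predicted_stream, true_stream)
    )


# Hypothetical drifting stream: the true law shifts right by 1 each round,
# and the predictor simply replays the previous round's samples.
true_stream = [[0.0, 1.0], [1.0, 2.0], [2.0, 3.0]]
lagging_predictions = [[0.0, 1.0], [0.0, 1.0], [1.0, 2.0]]
total_distance = cumulative_w1(lagging_predictions, true_stream)
```

In this toy stream a predictor that lags one round behind pays a cost proportional to the drift in every round after the first, which is the kind of staleness the restarted variant is meant to limit.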

2606.18834 2026-06-18 cs.LG 新提交

Identifying Structural Biases from Causal Mechanism Shifts

从因果机制变化中识别结构性偏差

Praharsh Nanavati, Jilles Vreeken, David Kaltenpoth

发表机构 * CISPA Helmholtz Center for Information Security(CISPA赫尔姆霍茨信息安全中心)

AI总结 提出利用环境间机制变化识别隐藏混淆和选择偏差,基于互信息构建可检验准则,并设计StruBI算法,在合成和真实数据上显著优于现有方法。

详情
AI中文摘要

因果发现方法通常假设所有数据独立同分布(i.i.d.),且系统中没有未测量的变量影响。在实践中,这些假设经常被违反,导致推断不准确。在本文中,我们研究如何从因果机制变化中识别隐藏混淆和选择偏差。特别地,我们表明结构性偏差会导致依赖的机制变化。也就是说,通过考虑在不同环境下的数据中哪些变量的机制发生了变化,我们可以判断哪些变量是无偏的,哪些受到隐藏混淆的影响,哪些正在经历选择偏差。我们将此形式化为一个基于互信息的经验可检验准则,并展示在哪些条件下它能识别结构性偏差。为了判断哪些节点受到何种偏差的影响,我们引入了StruBI算法。在合成和真实数据上的实验表明,StruBI在实践中表现良好,准确恢复了受影响的变量集和偏差类型,以较大优势超越了现有技术水平。

英文摘要

Causal discovery methods commonly assume that all data is independently and identically distributed (i.i.d.) and that there are no unmeasured variables affecting the system. In practice, these assumptions are often violated, leading to inaccurate inference. In this paper, we study how to identify hidden confounding and selection biases from causal mechanism shifts. In particular, we show that structural biases lead to dependent mechanism shifts. That is, by considering for which variables the mechanisms change given data from different environments, we can tell which variables are unbiased, which are subject to hidden confounding, and which are undergoing selection bias. We formalize this into an empirically testable criterion based on mutual information, and show under which conditions it identifies structural biases. To tell which nodes are subject to what kind of bias, we introduce the StruBI algorithm. Experiments on synthetic and real-world data show that StruBI works well in practice, accurately recovering affected variable sets and types of biases, outperforming the state-of-the-art by a wide margin.

2606.18918 2026-06-18 cs.LG cs.CC 新提交

Some Complexity Results for Robustness Verification for Binarized Neural Networks

二值化神经网络鲁棒性验证的一些复杂性结果

Harshit Goyal, Sudakshina Dutta

发表机构 * Indian Institute of Technology Goa(印度理工学院Goa)

AI总结 本文通过从布尔可满足性问题归约证明二值化神经网络的可满足性是NP完全的,并利用均匀遮挡导致的网络输出分段常数结构,提出多项式时间鲁棒性检查算法。

详情
AI中文摘要

本文研究了二值化神经网络(BNNs)验证问题的计算复杂性,其中激活函数(有时权重)是二值的。我们分析了两个问题:可满足性和均匀图像遮挡下的鲁棒性。我们通过从布尔可满足性问题(SAT)归约证明BNN可满足性是NP完全的,并且均匀遮挡在网络输出中诱导出分段常数结构,从而实现了多项式时间的鲁棒性检查算法。

英文摘要

This paper studies the computational complexity of verification problems for Binarized Neural Networks (BNNs), where activations (and sometimes weights) are binary. We analyze two problems: satisfiability and robustness under uniform image occlusion. We show that BNN satisfiability is NP-complete via a reduction from the Boolean satisfiability problem (SAT), and that uniform occlusion induces a piecewise-constant structure in the network output, enabling a polynomial-time robustness-checking algorithm.

2606.19036 2026-06-18 cs.LG 新提交

Geometric and Stochastic Analysis of Discontinuities in Sparse Mixture-of-Experts

稀疏混合专家模型中不连续性的几何与随机分析

Tho Tran Huu, Huu-Tuan Nguyen, Thien-Hai Nguyen, Nhat-Tri Ho, Viet-Hoang Tran, Tho Quan, Tan Minh Nguyen

发表机构 * Department of Mathematics, National University of Singapore, Singapore(新加坡国立大学数学系) Faculty of Computer Science and Engineering, Ho Chi Minh City University of Technology (HCMUT), VNU-HCM, Ho Chi Minh City, Vietnam(胡志明市技术大学计算机科学与工程学院)

AI总结 本文对稀疏混合专家模型中的不连续性进行几何与随机分析,分类不连续阶数,建立渐近体积估计,证明随机路径几乎必然击中一阶不连续,并提出低开销平滑机制以提升性能。

Comments ICML 2026 Spotlight

详情
AI中文摘要

稀疏混合专家(SMoE)架构现已广泛应用于最先进的语言和视觉模型中,其中条件路由允许扩展到非常大的网络。然而,正是这种Top-$k$专家选择使得条件路由成为可能,同时也导致SMoE映射本质上不连续。在这些不连续曲面附近,即使任意接近的输入也可能激活截然不同的专家集,从而产生显著不同的输出。本文对这些不连续性进行了严格的几何和随机分析。首先,我们根据切换事件中并列专家的数量对不连续性进行阶数分类。利用测度论切片论证,我们建立了加厚不连续曲面的渐近体积估计,表明低阶不连续集占主导地位,而高阶不连续集占据的体积相对极小。接着,通过扩散过程对输入空间中的随机扰动建模,我们证明路径最终会遇到不连续,并且首次击中几乎必然发生在阶数为1的不连续上,同时给出了显式的有限时间概率界。我们进一步推导了占据时间界,量化了随机路径在每个不连续阶数邻域内停留的时长。这些理论结果表明输入更可能位于低阶不连续附近。受此启发,我们提出一种简单的平滑机制,可直接应用于现有SMoE,在接近不连续处软性地整合专家;我们的分析保证增加的额外计算开销很小,同时在不连续附近提供局部平滑,跨语言和视觉任务的实验表明,平滑不仅增强了SMoE映射的连续性,还提升了经验性能。

英文摘要

Sparse Mixture-of-Experts (SMoE) architectures are now widely deployed in state-of-the-art language and vision models, where conditional routing allows scaling to very large networks. However, this very Top-$k$ expert selection that enables conditional routing also renders the SMoE map inherently discontinuous. In the vicinity of these discontinuity surfaces, even inputs that are arbitrarily close may activate substantially different sets of experts, resulting in significantly different outputs. In this work we give a rigorous geometric and stochastic analysis of these discontinuities. We first classify them by order, determined by the number of tied experts at a switching event. Using measure-theoretic slicing arguments, we establish asymptotic volume estimates for the thickened discontinuity surfaces, showing that lower-order discontinuity sets dominate, whereas higher-order ones occupy a vanishingly small relative volume. Next, modeling random perturbations in the input space via a diffusion process, we prove that the path eventually encounters a discontinuity, and moreover that the first hit almost surely occurs on an order-1 discontinuity, with explicit finite-time probability bounds. We further derive occupation-time bounds that quantify the duration the random path spends in the neighborhoods of each discontinuity order. These theoretical results imply that inputs are more likely to lie near lower-order discontinuities. Motivated by this insight, we propose a simple smoothing mechanism that can be directly applied to existing SMoEs, softly incorporating experts near discontinuities; our analysis guarantees that the added computational overhead remains small while providing localized smoothing near discontinuities, and experiments across language and vision tasks show that smoothing not only enforces continuity of the SMoE map but also enhances empirical performance.
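The discontinuity described above is easy to reproduce in one dimension. The sketch below builds a two-expert layer with Top-$k$ routing and a softmax over the selected experts; the router weights and expert functions are hypothetical. With $k=1$ the output jumps when the input crosses the point where the two router scores tie, while using both experts removes the jump. It does not implement the paper's smoothing mechanism.

```python
import math


def topk_moe(x, router_weights, experts, k=1):
    """Scalar SMoE layer: pick the k highest router scores, softmax over them."""
    scores = [w * x for w in router_weights]
    chosen = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    top = max(scores[i] for i in chosen)
    gates = [math.exp(scores[i] - top) for i in chosen]
    total = sum(gates)
    return sum(g / total * experts[i](x) for g, i in zip(gates, chosen))


# Hypothetical experts and router; the two scores tie at x = 0.
experts = [lambda v: 2.0 * v + 1.0, lambda v: -1.0]
router_weights = [1.0, -1.0]
eps = 1e-9

# Top-1 routing: inputs 2e-9 apart land on different experts.
top1_jump = topk_moe(eps, router_weights, experts, k=1) - topk_moe(
    -eps, router_weights, experts, k=1
)
# Dense routing (k = 2): the same pair of inputs gives nearly equal outputs.
dense_gap = topk_moe(eps, router_weights, experts, k=2) - topk_moe(
    -eps, router_weights, experts, k=2
)
```

The tie at $x=0$ involves exactly two experts at the selection boundary, which corresponds to the lowest-order case in the classification above.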

2606.19105 2026-06-18 cs.LG stat.ML 新提交

Smoothness-Based Derandomization of PAC-Bayes Bounds

基于光滑性的PAC-Bayes界去随机化

Alexandre Lemire Paquin, Brahim Chaib-Draa, Philippe Giguère

发表机构 * Department of Computer Science and Software Engineering(计算机科学与软件工程系) Université Laval(拉瓦尔大学)

AI总结 利用损失和预测器的光滑性,将Gibbs预测器去随机化为后验均值处的确定性预测器,通过Jensen间隙类的Rademacher复杂度控制泛化界,并导出涉及参数雅可比和海森矩阵的正则化器。

详情
AI中文摘要

我们研究光滑损失函数的PAC-Bayes去随机化。我们的目标是通过利用损失和预测器类的光滑性,获得对确定性预测器以高概率成立的泛化界。我们表明,从Gibbs预测器到后验均值处的确定性预测器的转换有一个精确的代价,由Jensen间隙类的泛化间隙给出。我们通过其Rademacher复杂度控制该类,从而得到涉及以参数雅可比和得分图的海森矩阵表示的平坦度量的确定性预测器界。该框架适用于有界和无界光滑损失函数,并将结果专门应用于线性预测器和光滑神经网络。最后,理论中出现的雅可比和海森矩阵量激发了一个实用的正则化器。对于BatchNorm网络,我们通过将BatchNorm变换折叠到相邻的仿射权重中,相对于有效的BatchNorm权重计算该正则化器。在CIFAR-10上的实验说明了该正则化器在不同批量大小下的行为。

英文摘要

We study PAC-Bayes derandomization for smooth loss functions. Our goal is to obtain generalization bounds that hold with high probability for deterministic predictors by exploiting smoothness properties of both the loss and the predictor class. We show that passing from the Gibbs predictor to the deterministic predictor at the posterior mean has a precise cost, given by the generalization gap of the Jensen gap class. We control this class through its Rademacher complexity, leading to bounds for deterministic predictors that involve flatness quantities expressed in terms of parameter Jacobians and Hessians of the score map. The framework applies to both bounded and unbounded smooth loss functions, and we specialize the results to linear predictors and smooth neural networks. Finally, the Jacobian and Hessian quantities appearing in the theory motivate a practical regularizer. For BatchNorm networks, we compute this regularizer with respect to effective BatchNorm weights obtained by folding the BatchNorm transformation into the adjacent affine weights. Experiments on CIFAR-10 illustrate the behavior of this regularizer under different batch sizes.
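Folding a BatchNorm transformation into the preceding affine layer can be sketched as follows. The sketch uses the standard inference-time identity with fixed running statistics, $W_{\text{eff}} = \frac{\gamma}{\sqrt{\sigma^2+\epsilon}} W$ and $b_{\text{eff}} = \frac{\gamma (b-\mu)}{\sqrt{\sigma^2+\epsilon}} + \beta$; whether the paper folds running or per-batch statistics is not stated in the abstract, so that choice and all names below are assumptions.

```python
import math


def fold_batchnorm(weight, bias, gamma, beta, mean, var, eps=1e-5):
    """Return effective (weight, bias) of affine layer followed by BatchNorm."""
    folded_w, folded_b = [], []
    for row, b, g, bt, mu, v in zip(weight, bias, gamma, beta, mean, var):
        scale = g / math.sqrt(v + eps)
        folded_w.append([scale * w for w in row])
        folded_b.append(scale * (b - mu) + bt)
    return folded_w, folded_b


def affine(weight, bias, x):
    """Plain affine map W x + b on Python lists."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weight, bias)]


def affine_then_batchnorm(weight, bias, gamma, beta, mean, var, x, eps=1e-5):
    """Reference path: affine layer, then per-channel normalization."""
    pre = affine(weight, bias, x)
    return [
        g * (p - mu) / math.sqrt(v + eps) + bt
        for p, g, bt, mu, v in zip(pre, gamma, beta, mean, var)
    ]


# Hypothetical two-channel layer.
W = [[1.0, -2.0], [0.5, 0.25]]
b = [0.1, -0.3]
gamma, beta = [1.5, 0.8], [0.0, 0.2]
mean, var = [0.4, -0.1], [2.0, 0.5]
x = [0.7, -1.3]

reference = affine_then_batchnorm(W, b, gamma, beta, mean, var, x)
W_eff, b_eff = fold_batchnorm(W, b, gamma, beta, mean, var)
folded = affine(W_eff, b_eff, x)
```

Once the two layers are expressed as a single affine map, Jacobian- and Hessian-based quantities can be taken with respect to `W_eff` instead of the raw weights, whose scale BatchNorm makes arbitrary.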

2606.19145 2026-06-18 cs.LG cs.AI cs.SY eess.SY 新提交

OrthoReg: Orthogonal Regularization for Hybrid Symbolic-Neural Dynamical Systems

OrthoReg:混合符号-神经动力系统的正交正则化

Till Richter, Niki Kilbertus

发表机构 * Technical University of Munich(慕尼黑工业大学) Helmholtz Munich(亥姆霍兹慕尼黑中心)

AI总结 针对混合建模中神经部分可能重复学习符号结构导致模型冗余的问题,提出正交正则化方法OrthoReg,直接惩罚符号与神经组件间的重叠,实现互补分解,提升符号恢复和分布外行为。

详情
AI中文摘要

动力系统是建模自然世界的基础,然而建模过程中存在持续的权衡:手动指定的机械模型设计上可解释但通常过于简单且设定错误;相反,灵活的数据驱动神经方法缺乏物理洞察。混合建模旨在通过结合指定的或基于符号的物理组件与灵活的神经网络来兼顾两者优势。然而,一个关键挑战是神经组件可能重新学习机械部分,产生冗余且不可解释的模型,特别是当符号结构本身是从数据中发现时。基于标准$L^2$正则化的现有方法依赖于投影论证,但当符号组件通过稀疏发现学习时,该论证失效,允许神经增强与符号结构重叠。我们引入\textbf{OrthoReg}(正交正则化),直接惩罚符号与神经组件之间的重叠,防止符号结构被神经残差吸收。这产生互补分解:符号部分捕捉库能表达的内容,神经部分捕捉剩余内容。在存在部分库不匹配的基准动力系统上,OrthoReg改善了符号恢复和分布外行为。

英文摘要

Dynamical systems are fundamental to modeling the natural world, yet modeling them involves a persistent trade-off: manually prescribed mechanistic models are interpretable by design but often overly simplistic and misspecified; in contrast, flexible data-driven neural methods lack physical insight. Hybrid modeling aims for the best of both worlds by combining a prescribed or symbolic, physics-based component with a flexible neural network. A critical challenge, however, is that the neural component may relearn mechanistic parts, yielding redundant and uninterpretable models, especially when the symbolic structure itself is discovered from data. Existing methods based on standard $L^2$ regularization rely on a projection argument that breaks when the symbolic component is learned through sparse discovery, allowing the neural augmentation to overlap with symbolic structure. We introduce \textbf{OrthoReg} (Orthogonal Regularization), which directly penalizes overlap between the symbolic and neural components, preventing symbolic structure from being absorbed by the neural residual. This yields a complementary decomposition: the symbolic part captures what the library can express, and the neural part captures what remains. On benchmark dynamical systems with partial library mismatch, OrthoReg improves symbolic recovery and out-of-distribution behavior.
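One simple way to penalize overlap between two components is the squared cosine similarity of their outputs on a batch of states, sketched below. This particular penalty is an assumption chosen for illustration; the abstract does not give OrthoReg's exact functional form, and the function name is hypothetical.

```python
def overlap_penalty(symbolic_out, neural_out, tiny=1e-12):
    """Squared cosine similarity between two output vectors over a batch.

    0 means the neural residual is orthogonal to the symbolic term on this
    batch; 1 means it is a rescaled copy of it.
    """
    dot = sum(s * n for s, n in zip(symbolic_out, neural_out))
    norm_s = sum(s * s for s in symbolic_out)
    norm_n = sum(n * n for n in neural_out)
    return dot * dot / (norm_s * norm_n + tiny)


# Hypothetical outputs of the symbolic term on three sampled states.
symbolic = [1.0, 2.0, 3.0]

redundant_neural = [2.0, 4.0, 6.0]       # relearns the symbolic term
complementary_neural = [1.0, -2.0, 1.0]  # orthogonal on this batch

redundant_cost = overlap_penalty(symbolic, redundant_neural)
complementary_cost = overlap_penalty(symbolic, complementary_neural)
```

Adding such a term to the training loss discourages the neural part from duplicating what the symbolic library already expresses, without shrinking the neural part itself the way a plain $L^2$ penalty would.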

2606.19179 2026-06-18 cs.LG cs.AI math.OC stat.ML 新提交

Compute Efficiency and Serial Runtime Tradeoffs for Stochastic Momentum Methods

随机动量方法的计算效率与串行运行时间权衡

Depen Morwani, Alexandru Meterez, Pranav Nair, Sham Kakade

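Affiliations * Harvard University; Kempner Institute at Harvard University

AI summary Studies batch-size tradeoffs of stochastic momentum methods (heavy ball and accelerated SGD) in consistent linear regression, proving that heavy ball does not improve SGD's compute-efficiency frontier but lets larger batches reduce serial runtime, while the compute-efficiency versus serial-runtime tradeoff of accelerated SGD depends on spectral decay.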

详情
AI中文摘要

随机动量方法,如重球法(HB)、Nesterov动量以及加速SGD(ASGD)的变体[Kidambi等人,2018],在现代训练中被广泛使用,但其随机优势取决于两个不同的量:串行运行时间(达到目标精度所需的迭代次数)和计算效率(CE,总梯度查询或FLOP成本的倒数)。更大的批次在不损害CE的情况下减少串行运行时间,仅当收缩间隙随批次大小线性增长时。我们研究了一致线性回归(具有高斯协变量)的随机HB和ASGD,并证明了其批次大小权衡的有限维离散时间下界。我们的第一个结果表明,HB不会改善任意谱下SGD的CE前沿;相反,它在更大的批次大小窗口内保持SGD级别的CE,允许更大的批次减少串行运行时间,直到HB达到其确定性加速尺度。这个窗口可能比SGD临界批次大小大$\sqrt{\kappa}$倍。对于ASGD,情况更依赖于谱:对于快速衰减的幂律谱,ASGD改善了小批次下的CE(相对于HB/SGD),但随着批次大小增加,它牺牲了这种CE优势以换取改进的串行运行时间。合成线性回归实验验证了这些定性区域,包括慢衰减谱下ASGD和HB的近乎重叠,以及快速衰减谱下预测的CE-串行权衡。

英文摘要

Stochastic momentum methods such as heavy ball (HB), Nesterov momentum, and variants of Accelerated SGD (ASGD) [Kidambi et al., 2018] are widely used in modern training, but their stochastic benefits depend on two distinct quantities: serial runtime, the number of iterations needed to reach a target accuracy, and compute efficiency (CE), the inverse total gradient-query or FLOP cost. Larger batches reduce serial runtime without hurting CE only when the contraction gap grows linearly with batch size. We study stochastic HB and ASGD for consistent linear regression with Gaussian covariates and prove finite-dimensional, discrete-time lower bounds on their batch-size tradeoffs. Our first result shows that HB does not improve the CE frontier over SGD for arbitrary spectra; rather, it preserves SGD-level CE over a larger batch-size window, allowing larger batches to reduce serial runtime until HB reaches its deterministic accelerated scale. This window can be a factor $\sqrt{\kappa}$ larger than the SGD critical batch size. For ASGD, the picture is more spectrum-dependent: for rapidly decaying power-law spectra, ASGD improves small-batch CE over HB/SGD, but as batch size grows it trades this CE advantage for improved serial runtime. Synthetic linear-regression experiments verify these qualitative regimes, including near-overlap of ASGD and HB for slowly decaying spectra and the predicted CE--serial tradeoff for rapidly decaying spectra.
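The two quantities in the abstract, serial runtime (iterations) and total gradient queries (whose inverse is CE), can be measured on a toy problem. The sketch below runs stochastic heavy ball on a two-dimensional noiseless linear regression with Gaussian covariates. The problem size, covariate scales, step size, momentum, and tolerance are all assumed values chosen for illustration; none of them come from the paper.

```python
import random


def heavy_ball_regression(batch_size, lr=0.05, beta=0.5, tol=1e-6,
                          max_steps=20000, seed=0):
    """Return (iterations, gradient_queries) needed to reach squared error < tol."""
    rng = random.Random(seed)
    scales = [1.0, 0.3]        # per-coordinate std of the covariates (assumed spectrum)
    w_star = [1.0, -2.0]       # ground-truth weights (assumed)
    w = [0.0, 0.0]
    v = [0.0, 0.0]             # heavy-ball velocity
    for step in range(1, max_steps + 1):
        grad = [0.0, 0.0]
        for _ in range(batch_size):
            x = [rng.gauss(0.0, s) for s in scales]
            # Consistent (noiseless) labels: y = <w_star, x>, so the residual
            # is <w - w_star, x>.
            resid = sum((wi - ti) * xi for wi, ti, xi in zip(w, w_star, x))
            for j in range(2):
                grad[j] += resid * x[j] / batch_size
        v = [beta * vj - lr * gj for vj, gj in zip(v, grad)]
        w = [wj + vj for wj, vj in zip(w, v)]
        err = sum((wj - tj) ** 2 for wj, tj in zip(w, w_star))
        if err < tol:
            return step, step * batch_size
    return max_steps, max_steps * batch_size
```

Calling the function for several batch sizes and comparing the two returned numbers gives a rough picture of the tradeoff: iterations track serial runtime, and iterations times batch size track compute cost.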

6. Efficient Learning, Compression, and Deployment (9 papers)

2606.18286 2026-06-18 cs.LG 新提交

CODEBLOCK: Learning to Supervise Code at the Right Granularity

CODEBLOCK: 学习在正确的粒度上监督代码

Zhijie Deng, Ling Li, Jinlong Pang, Kaiqin Hu, Qi Xuan, Zhaowei Zhu, Jiaheng Wei

Affiliations * Hong Kong University of Science and Technology (Guangzhou); UC Santa Cruz; Ant Group; BAIA, ZJUT; D5Data.ai

AI summary Proposes CodeBlock, a framework that applies sparse supervision to structure-complete code blocks rather than isolated tokens; using only 1.9% of supervised tokens, it outperforms full-token fine-tuning on six code-generation benchmarks.

详情
AI中文摘要

代码大语言模型的监督微调通常对所有响应token应用统一的交叉熵损失,隐含假设每个token提供同等有用的学习信号。最近的token级选择方法通过仅监督高价值token挑战了自然语言SFT中的这一假设。然而,直接将token级掩码迁移到代码可能会破坏语法和语义连贯的程序单元,因为代码依赖于结构完整性和定义-使用关系。因此,我们提出CodeBlock,一个结构感知的稀疏监督框架,选择结构完整的代码证据而非孤立token。CodeBlock首先选择高质量的指令-响应对,然后将代码响应划分为语法连贯的编码项,通过聚合核心逻辑token上的广义交叉熵来估计其效用,并使用数据流可达性和桥接信号重新排序,以优先传播或连接重要程序依赖的块。在训练期间,完整响应仍作为上下文可用,但损失仅应用于选定的代码项和信息性自然语言token。在六个代码生成基准上的实验表明,CodeBlock在仅使用1.9%的监督响应token的情况下,实现了比全tokenSFT和竞争性选择基线更强的平均pass@1。

英文摘要

Supervised fine-tuning of code LLMs typically applies uniform cross-entropy loss to all response tokens, implicitly assuming that every token provides equally useful learning signal. Recent token-level selection methods challenge this assumption in natural-language SFT by supervising only high-value tokens. However, directly transferring token-level masking to code can break syntactically and semantically coherent program units, because code depends on structural completeness and definition-use relations. We therefore propose CodeBlock, a structure-aware sparse supervision framework that selects structure-complete code evidence rather than isolated tokens. CodeBlock first selects high-quality instruction-response pairs, then partitions code responses into syntactically coherent coding items, estimates their utility by aggregating generalized cross-entropy over core logic tokens, and reranks them with data-flow reach and bridge signals to prioritize blocks that propagate or connect important program dependencies. During training, the full response remains available as context, while loss is applied only to selected code items and informative natural-language tokens. Experiments on six code-generation benchmarks show that CodeBlock achieves stronger average pass@1 than full-token SFT and competitive selection baselines, while using only 1.9% of supervised response tokens.
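The training-time behavior described above, where the full response stays in context but loss is applied only to selected tokens, amounts to a masked negative log-likelihood. A minimal sketch follows, assuming per-token log-probabilities and a boolean selection mask are already available; the function name `sparse_supervised_nll` is hypothetical, and how CodeBlock actually chooses the mask is not modeled here.

```python
def sparse_supervised_nll(token_logprobs, supervise_mask):
    """Mean negative log-likelihood over the supervised tokens only.

    token_logprobs: log p(token | full context) for every response token.
    supervise_mask: True for tokens inside selected code items, False otherwise.
    Unselected tokens still condition the model but contribute no loss.
    """
    selected = [-lp for lp, keep in zip(token_logprobs, supervise_mask) if keep]
    if not selected:
        return 0.0
    return sum(selected) / len(selected)
```

For example, `sparse_supervised_nll([-1.0, -2.0, -3.0], [True, False, True])` averages only the first and third tokens.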

2606.18304 2026-06-18 cs.LG cs.AI 新提交

Attribution-Guided and Coverage-Maximized Pruning for Structural MoE Compression

基于归因引导和覆盖最大化的结构MoE剪枝

Yifu Ding, Jiacheng Wang, Ge Yang, Yongcheng Jing, Jinyang Guo, Xianglong Liu, Dacheng Tao

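Affiliations * School of Computer Science and Engineering, Beihang University; School of Artificial Intelligence, Beihang University; Nanyang Technological University

AI summary Expert-level pruning of MoE models is coarse and misses redundancy. The paper proposes an attribution-guided, coverage-maximizing structural pruning framework that casts prune-ratio allocation as a channel-score coverage optimization problem; it preserves accuracy at 50% pruning combined with 4-bit quantization and cuts memory by 5.27x.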

Comments 9 pages, 5 figures. Submitted to ICML 2026

详情
AI中文摘要

混合专家(MoE)模型在计算上高效扩展,但由于其巨大的内存占用和推理开销,部署成本仍然很高。先前的压缩方法主要在专家级别操作,要么移除整个专家,要么通过粗粒度的重要性分数对专家进行排序。然而,这种专家级别的决策通常过于粗糙,无法捕捉细粒度的冗余,导致剪枝预算分配不当和压缩效果有限。为了解决这个问题,我们观察到MoE专家内的信息高度集中在一小部分通道中,即使在被认为重要的专家中也存在大量冗余。基于这一观察,我们提出了一种针对MoE模型量身定制的结构剪枝框架。我们的方法将剪枝比例分配重新表述为通道分数覆盖最大化问题,并使用基于归因的近似方法高效求解。在DeepSeek和Qwen MoE模型上的实验表明,我们的方法在结合4位量化时,在50%或25%的结构化剪枝下仍能保持模型精度。在Qwen3-30B-A3B上,我们的方法将内存占用减少了5.27倍,并在各种基准测试中持续优于最先进的基线方法。

英文摘要

Mixture-of-Experts (MoE) models scale compute efficiently, yet remain expensive to deploy due to their substantial memory footprint and inference overhead. Prior compression methods mainly operate at the expert level, either removing entire experts or ranking experts by coarse-grained importance scores. However, such expert-wise decisions are often too coarse to capture fine-grained redundancy, leading to misallocated pruning budgets and limited compression. To address this problem, we observe that information within MoE experts is highly concentrated in a small subset of channels, leaving substantial redundancy even in experts deemed important. Based on this observation, we propose a structural pruning framework tailored for MoE models. Our method reformulates prune-ratio allocation as a channel-score coverage maximization problem and solves it efficiently using an attribution-based approximation. Experiments on DeepSeek and Qwen MoE models show that our method preserves model accuracy under 50% or 25% structured pruning when combined with 4-bit quantization. On Qwen3-30B-A3B, our approach reduces memory footprint by 5.27$\times$ and consistently outperforms state-of-the-art baselines across diverse benchmarks.
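One simple way to read "prune-ratio allocation as channel-score coverage maximization" is: given a score per channel, keep the globally highest-scoring channels under a fixed budget, and let each expert's prune ratio fall out of that choice. The sketch below does exactly this greedy selection. It is an assumed simplification: the channel scores are taken as given, and the paper's attribution-based approximation is not reproduced.

```python
def allocate_kept_channels(channel_scores, keep_budget):
    """Keep the keep_budget highest-scoring channels across all experts.

    channel_scores: {expert_name: [score of each channel]} (assumed given).
    Returns (kept, coverage) where kept maps each expert to the channel indices
    it retains and coverage is the fraction of total score that is retained.
    """
    ranked = sorted(
        ((score, expert, idx)
         for expert, scores in channel_scores.items()
         for idx, score in enumerate(scores)),
        key=lambda item: item[0],
        reverse=True,
    )
    kept = {expert: [] for expert in channel_scores}
    for _, expert, idx in ranked[:keep_budget]:
        kept[expert].append(idx)
    total = sum(sum(scores) for scores in channel_scores.values())
    covered = sum(score for score, _, _ in ranked[:keep_budget])
    return kept, covered / total
```

An expert whose score mass sits in one channel ends up heavily pruned, while an expert with evenly spread scores keeps more channels, which is the non-uniform allocation the abstract argues for.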

2606.18431 2026-06-18 cs.LG cs.DC 新提交

Beyond Prediction: Tail-Aware Scheduling for LLM Inference

超越预测:面向LLM推理的尾延迟感知调度

Yueying Li, Yuanfan Chen, Jiayang Chen, Esha Choukse, Haoran Qiu, G. Edward Suh, Rodrigo Fonseca, Ziv Scully, Udit Gupta

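Affiliations * Cornell University, Computer Science Department; Cornell University, Electrical and Computer Engineering Department; Cornell University, Operations Research and Information Engineering Department; Microsoft Azure System Research; NVIDIA Corporation

AI summary Length-prediction-based scheduling for LLM inference is fragile under distribution shift and gives weak control over tail latency. The paper proposes a prediction-free, distribution-aware scheduling framework that uses lightweight statistical signals for soft priority boosting together with cache-aware preemption, reducing P99 TTLT by up to 35-50% and TTFT by 34-47% across workloads.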

详情
Journal ref
Forty-Third International Conference on Machine Learning (2026)
AI中文摘要

LLM服务表现出极端的长度可变性,使得基于大小的调度在实践中变得困难。最近的LLM调度器使用预测的解码长度或排名来近似SJF/SRPT,并主要报告均值中心指标如TTFT和TBT。我们表明,这些预测驱动的策略在分布偏移、突发到达和GPU内存压力下可能脆弱,同时对主导用户体验的尾延迟(P90-P99)控制有限,即使拥有完美的解码长度知识。我们引入了一个分布感知、无预测的调度框架,用由轻量统计信号驱动的软优先级提升取代显式长度预测。我们的设计协同优化调度和缓存感知抢占,以考虑跨工作负载混合的内存耦合解码动态。在生产环境和开源轨迹上的评估表明,相对于具有完美长度知识的SRPT,我们的方法将P99 TTLT降低了高达35-50%,并在各种工作负载(包括推理密集型和聊天密集型任务)上将TTFT降低了34-47%。这些结果证明了在在线LLM服务中优化尾延迟的稳健替代方案。

英文摘要

LLM serving exhibits extreme length variability, making size-based scheduling difficult in practice. Recent LLM schedulers approximate SJF/SRPT using predicted decode lengths or ranks and primarily report mean-centric metrics such as TTFT and TBT. We show that these prediction-driven policies can be fragile under distribution shifts, bursty arrivals, and GPU memory pressure, while offering limited control over the tail latency (P90-P99) that dominates user experience, even with perfect decode-length knowledge. We introduce a distribution-aware, prediction-free scheduling framework that replaces explicit length prediction with soft priority boosting driven by lightweight statistical signals. Our design co-optimizes scheduling and cache-aware preemption to account for memory-coupled decode dynamics across workload mixes. Evaluated on production and open-source traces, our method reduces P99 TTLT by up to 35-50% relative to SRPT with perfect length knowledge and reduces TTFT by 34-47% across workloads, including reasoning-heavy and chat-heavy tasks. These results demonstrate a robust alternative for optimizing tail latency in online LLM serving.

2606.18650 2026-06-18 cs.LG 新提交

BLADE: Scalable Bi-level Adaptive Data Selection for LLM Training

BLADE: 面向LLM训练的可扩展双层自适应数据选择

Jiaxing Wang, Deping Xiang, Jin Xu, Zirui Liu, Zicheng Zhang, Guoqiang Gong, Jun Fang, Chao Liu, Pengzhang Liu, Tongxuan Liu, Ke Zhang, Qixia Jiang

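Affiliations * University of Oxford; Renmin University of China; University of Chinese Academy of Sciences

AI summary Proposes BLADE, which uses Lagrange multipliers to turn the bi-level optimization into a penalized single-level objective, avoiding inverse-Hessian computation and yielding a dynamic reference model, with a first-order convergence guarantee and experimental gains over existing methods.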

详情
AI中文摘要

随着大语言模型(LLM)数据集规模扩展到数万亿token,数据选择已成为过滤无信息噪声和构建自适应学习轨迹的关键前沿。除了静态启发式过滤,LLM训练的高级数据选择方法主要遵循两种范式,每种都有根本性局限。基于影响的方法提供了原则性的双层目标,但需要难以处理的逆Hessian计算,而超额损失方法计算高效但依赖静态参考模型,该模型在训练过程中与不断演化的代理模型失配。我们提出BLADE(双层自适应数据选择),一种无Hessian的数据选择框架。BLADE通过拉格朗日乘子将基于影响的方法背后的双层优化问题重新表述为惩罚单层目标,避免了逆Hessian计算,同时揭示了与基于超额损失的数据选择之间的原则性联系。所得目标恢复了超额损失形式,但用与训练同步的动态参考模型替代了静态参考模型。理论上,我们证明该惩罚公式保证一阶收敛。为了实现高效的在线批次选择,我们将BLADE实例化为一种无记忆随机块坐标Frank-Wolfe算法。大量实验表明,BLADE始终优于最先进的数据选择基线,为LLM训练提供了实用方案。

英文摘要

As Large Language Model (LLM) datasets scale to trillions of tokens, data selection has emerged as a critical frontier to filter out uninformative noise and construct adaptive learning trajectories. Beyond static heuristic filtering, advanced data selection methods for LLM training largely follow two paradigms, each with fundamental limitations. Influence-based methods provide principled bi-level objectives but require intractable inverse-Hessian computations, while excess-loss methods are computationally efficient but rely on a static reference model that becomes misaligned with the evolving proxy model during training. We propose BLADE (Bi-Level Adaptive Data sElection), a Hessian-free framework for data selection. BLADE reformulates the bi-level optimization problem underlying influence-based methods as a penalized single-level objective via Lagrange multipliers, avoiding inverse-Hessian computation while revealing a principled connection to excess-loss based data selection. The resulting objective recovers an excess-loss form but replaces the static reference model with a dynamic one that stays synchronized with training. Theoretically, we prove that this penalized formulation guarantees first-order convergence. For efficient online batch selection, we instantiate BLADE as a memoryless randomized block-coordinate Frank-Wolfe algorithm. Extensive experiments show that BLADE consistently outperforms state-of-the-art data selection baselines, providing a practical recipe for LLM training.
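The excess-loss form mentioned in the abstract scores each candidate sample by its proxy-model loss minus its reference-model loss. Maximizing a linear score over selection weights in [0, 1] that sum to k is solved by the top-k indicator, which is also what a Frank-Wolfe linear minimization step would return. The sketch below shows both pieces. The function names are hypothetical, the losses are assumed to be precomputed, and BLADE's actual block-coordinate algorithm and dynamic reference update are not reproduced.

```python
def select_batch(proxy_losses, reference_losses, k):
    """Indices of the k samples with the largest excess loss (proxy - reference)."""
    excess = [p - r for p, r in zip(proxy_losses, reference_losses)]
    order = sorted(range(len(excess)), key=lambda i: excess[i], reverse=True)
    return sorted(order[:k])


def frank_wolfe_step(weights, proxy_losses, reference_losses, k, step):
    """Move the selection weights toward the top-k vertex.

    With step = 1.0 the previous weights are discarded entirely and the result
    is the hard top-k selection.
    """
    chosen = set(select_batch(proxy_losses, reference_losses, k))
    vertex = [1.0 if i in chosen else 0.0 for i in range(len(weights))]
    return [(1.0 - step) * w + step * v for w, v in zip(weights, vertex)]
```

In a setup like the one the abstract describes, `reference_losses` would come from a reference model that is updated alongside training rather than frozen.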

2606.18691 2026-06-18 cs.LG cond-mat.mtrl-sci 新提交

Robust and Interpretable Adaptation of Equivariant Materials Foundation Models via Sparsity-promoting Fine-tuning

通过稀疏性促进微调实现等变材料基础模型的鲁棒和可解释适应

Youngwoo Cho, Seunghoon Yi, Wooil Yang, Sungmo Kang, Young-woo Son, Jaegul Choo, Joonseok Lee, Soo Kyung Kim, Hongkee Yoon

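Affiliations * KAIST; Seoul National Univ.; KIAS; Ewha Womans Univ.; Kangwon National Univ.

AI summary Proposes a sparsity-promoting fine-tuning method that exploits the structural properties of E(3)-equivariant materials foundation models to update parameters selectively; it matches or surpasses full fine-tuning on energy and force prediction while updating about 3% of parameters, and shows generalization to tasks such as magnetic moment prediction along with interpretability.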

Comments Accepted by ICLR 2026

详情
AI中文摘要

预训练的材料基础模型,或机器学习原子间势,利用通用的物理化学知识有效逼近势能面。然而,由于物理化学多样性以及实际计算设置与构建预训练数据所用设置之间的不匹配,它们通常需要特定领域的校准。为了解决这个问题,我们提出了一种稀疏性促进的微调方法,通过利用E(3)等变材料基础模型的结构特性选择性更新模型参数。在跨分子和晶体基准的能量和力预测任务上,我们的方法匹配或超越了全微调和等变低秩适应,同时仅更新约3%的参数,在某些情况下甚至低至约0.5%。除了能量和力校准,我们进一步通过将方法应用于磁矩预测和磁感知总能量建模来展示任务泛化性。最后,稀疏模式分析揭示了物理可解释的特征,例如过渡金属系统中增强的d轨道贡献。总体而言,我们的结果确立了稀疏性促进微调作为等变材料基础模型领域专业化的灵活且可解释的方法。

英文摘要

Pre-trained materials foundation models, or machine learning interatomic potentials, leverage general physicochemical knowledge to effectively approximate potential energy surfaces. However, they often require domain-specific calibration due to physicochemical diversity as well as mismatches between practical computational settings and those used in constructing the pre-training data. To address this, we propose a sparsity-promoting fine-tuning method that selectively updates model parameters by exploiting the structural properties of E(3)-equivariant materials foundation models. On energy and force prediction tasks across molecular and crystalline benchmarks, our method matches or surpasses full fine-tuning and equivariant low-rank adaptation while updating only about 3% of parameters, and in some cases as little as about 0.5%. Beyond energy and force calibration, we further demonstrate task generalizability by applying our method to magnetic moment prediction and magnetism-aware total energy modeling. Finally, analysis of sparsity patterns reveals physically interpretable signatures, such as enhanced $d$-orbital contributions in transition metal systems. Overall, our results establish sparsity-promoting fine-tuning as a flexible and interpretable method for domain specialization of equivariant materials foundation models.

2606.18967 2026-06-18 cs.LG 新提交

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

EfficientRollout: 面向强化学习推演的感知系统的自推测解码

Minseo Kim, Minjae Lee, Seunghyuk Oh, Kevin Galim, Donghoon Kim, Coleman Hooper, Harman Singh, Amir Gholami, Hyung Il Koo, Wonjun Kang

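Affiliations * FuriosaAI; University of California, Berkeley

AI summary Targets the autoregressive decoding latency bottleneck in RL rollouts with a system-aware self-speculative decoding framework; a quantized self-speculative drafter and a system-aware speculation toggle policy reduce rollout and end-to-end latency while preserving model quality.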

Comments Project Page: https://github.com/furiosa-ai/EfficientRollout

详情
AI中文摘要

强化学习(RL)已成为LLMs代表性后训练范式,赋予其强大的推理和智能体能力。然而,推演生成仍是主要的延迟瓶颈,因为自回归采样顺序解码响应,且少量长尾生成往往决定完成时间。推测解码(SD)为缓解此瓶颈提供了自然途径,它是一种用于服务固定LLMs的成熟技术,通过快速草拟令牌并通过并行验证接受它们来降低延迟,同时保持目标模型分布。但其实际加速效果无法直接迁移到RL推演:(i)不断变化的目标策略使得任何固定草拟者与策略输出分布日益不匹配;(ii)推演解码过程中活跃批次大小缩小,解码从计算受限转向内存受限,此时并行验证可利用未充分利用的计算资源。因此,加速RL推演需要草拟者在长序列、高温生成下对演化策略保持有效,以及感知系统的SD使用以避免计算受限状态。我们提出EfficientRollout,一个感知系统的自推测SD框架,旨在解决RL推演中的这一差距。EfficientRollout从目标模型诱导量化草拟者(即自推测解码),使其与演化策略保持耦合,无需单独草拟者预训练或在线适应。它进一步协调感知系统的SD切换策略与接受感知的草稿长度自适应,仅在有益状态下进行推测,同时使草拟预算与演化草拟者质量匹配。EfficientRollout在加速自回归推演基线上分别将推演和端到端延迟降低高达19.6%和12.7%,同时保持最终模型质量。

英文摘要

Reinforcement learning (RL) has become a representative post-training paradigm for LLMs, enabling strong reasoning and agentic capabilities. However, rollout generation remains a dominant latency bottleneck because autoregressive sampling decodes responses sequentially and a small number of long-tailed generations often determine completion time. Speculative decoding (SD) offers a natural way to address this bottleneck, as it is a well-established technique for serving fixed LLMs that reduces latency by rapidly drafting tokens and accepting them through parallel verification while preserving the target-model distribution. However, its practical speedups do not directly carry over to RL rollouts: (i) the evolving target policy makes any fixed drafter increasingly mismatched with the policy's output distribution; and (ii) active batch sizes shrink throughout rollout decoding, shifting decoding from compute-bound to memory-bound regimes where parallel verification can exploit underutilized compute. Therefore, accelerating RL rollouts requires both a drafter that remains effective under long, high-temperature generations from an evolving policy and system-aware use of SD that avoids compute-bound regimes. We present EfficientRollout, a system-aware self-SD framework designed to address this gap for RL rollouts. EfficientRollout induces a quantized drafter from the target model (i.e. self-speculative decoding), keeping it coupled to the evolving policy without separate drafter pretraining or online adaptation. It further coordinates a system-aware SD toggle policy with acceptance-aware draft-length adaptation, enabling speculation only in beneficial regimes while matching the drafting budget to evolving drafter quality. EfficientRollout reduces rollout and end-to-end latency by up to 19.6% and 12.7%, respectively, over an accelerated AR rollout baseline, while preserving final model quality.
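The property that speculative decoding preserves the target-model distribution comes from its verification rule. A minimal sketch of the standard single-token rule follows: accept the drafted token with probability min(1, p/q), and otherwise resample from the normalized positive part of p - q. The function name and the dictionary representation of distributions are assumptions for illustration; EfficientRollout's quantized drafter, toggle policy, and draft-length adaptation are not modeled.

```python
import random


def verify_draft_token(token, p_target, q_draft, rng=random):
    """Verify one drafted token; returns (emitted_token, was_accepted).

    p_target, q_draft: dicts mapping token -> probability under the target
    model and the drafter. The drafted token must have q_draft[token] > 0.
    """
    accept_prob = min(1.0, p_target.get(token, 0.0) / q_draft[token])
    if rng.random() < accept_prob:
        return token, True
    # Rejected: resample from the residual distribution max(0, p - q), which
    # has positive mass whenever a rejection is possible.
    residual = {t: max(0.0, p - q_draft.get(t, 0.0)) for t, p in p_target.items()}
    threshold = rng.random() * sum(residual.values())
    running = 0.0
    for t, mass in residual.items():
        running += mass
        if threshold < running:
            return t, False
    return t, False  # guard against floating-point rounding


```

When the drafter matches the target exactly, every drafted token is accepted. As the target policy drifts away from a fixed drafter during RL, acceptance drops, which is the mismatch problem the abstract describes.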

2606.19025 2026-06-18 cs.LG cs.AI cs.DC cs.SY eess.SY 新提交

FoMoE: Breaking the Full-Replica Barrier with a Federation of MoEs

FoMoE: 打破全副本壁垒的专家混合联邦系统

Lorenzo Sani, Zeyu Cao, Meghdad Kurmanji, Alex Iacob, Andrej Jovanovic, Yan Gao, Wanru Zhao, Nicholas D. Lane

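Affiliations * DeepSeek-AI

AI summary Proposes FoMoE, a system that breaks the full-replica paradigm by partitioning expert layers across workers; combined with partial expert replication and a skip-token mechanism, it reduces communication cost by up to 1.42x over efficient baselines and raises throughput by up to 1.4x.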

详情
AI中文摘要

预训练大型语言模型(LLMs)通常需要大规模基础设施,配备紧密耦合的硬件加速器。虽然增加模型和数据集规模仍是性能的主要驱动力,但专家混合(MoE)架构最近通过将参数数量与计算成本解耦,取得了最先进的结果。这种效率使得在受限计算预算下训练大规模模型成为可能,但通常需要单个数据中心的高速互连。为了克服这些物理限制,最近的方法如DiLoCo和Photon使用低通信数据并行方法,使得能够在地理分布、弱连接的数据中心之间进行扩展。然而,这些方法存在根本性的低效问题:它们需要在每个站点拥有完整的模型副本,这带来了高昂的内存约束和通信开销。在这项工作中,我们引入了FoMoE,一个通过跨工作节点分区专家层来打破全副本范式的系统。我们证明FoMoE:(I)通过部分专家复制,在所研究的场景中,相比高效基线降低了高达1.42倍的通信成本,相比DDP降低了45.44倍;(II)通过一种新颖的跳跃令牌机制,实现了高达1.4倍的经验吞吐量加速;(III)在训练代理场景中展示了稳定的路由,并通过系统建模将通信/内存优势推广到100B规模的配置。

英文摘要

Pre-training Large Language Models (LLMs) typically demands large-scale infrastructure with tightly coupled hardware accelerators. While increasing model and dataset scale remains the dominant driver of performance, Mixture-of-Experts (MoE) architectures have recently achieved state-of-the-art results by decoupling parameter count from computational cost. This efficiency enables training massive models on constrained compute budgets, yet it typically requires the high-speed interconnects of a single datacenter. To overcome these physical limits, recent approaches such as DiLoCo and Photon use low-communication data-parallel methods to enable scaling across geographically distributed, weakly connected datacenters. However, these methods suffer from a fundamental inefficiency: they require full model replicas at every site, which imposes prohibitive memory constraints and communication overheads. In this work, we introduce FoMoE, a system that breaks the full-replica paradigm by partitioning expert layers across workers. We demonstrate that FoMoE: (I) reduces communication costs by up to 1.42x over efficient baselines and 45.44x over DDP via partial expert replication in the studied regimes; (II) achieves empirical throughput speedups of up to 1.4x through a novel skip-token mechanism; and (III) shows stable routing in the trained proxy regimes and projects the communication/memory benefits to 100B-scale configurations through system modelling.

2606.19150 2026-06-18 cs.LG 新提交

Complementary Attention Head Pruning for Efficient Transformers

互补注意力头剪枝用于高效Transformer

Yaniv Livertovsky, Shahar Somin, Gonen Singer

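Affiliations * Bar-Ilan University

AI summary Proposes CAHP, a framework that models attention-head selection as a global graph-theoretic problem, preserves complementary heads through graph clustering and information-theoretic distances, determines the number of retained heads automatically, and outperforms existing methods on SST-5 and MNLI.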

Comments 9 pages, 4 figures, 3 tables. Accepted for presentation at the International Joint Conference on Neural Networks (IJCNN) 2026

详情
AI中文摘要

基于Transformer的模型在自然语言处理中的显著成功源于架构的规模化,这导致大量参数并阻碍了在资源受限环境中的部署。虽然结构化剪枝提供了一条压缩路径,但现有的最先进方法通常依赖于基于梯度的重要性排序或随机门控,这些方法存在不稳定性、结构退化以及需要大量手动超参数调整的问题。在本文中,我们引入了CAHP(互补注意力头剪枝),一种新颖的事后框架,将头选择重新定义为全局图论问题。CAHP不是孤立地评估头,而是利用基于图的聚类结合信息论距离度量来识别并保留一组拓扑多样化的互补注意力头。无需预定义稀疏度或剪枝比例,该框架通过识别递减的边际性能曲线自动确定各层中保留的注意力头数量,其中根据所选多项式次数,剪除额外头会导致性能急剧下降。在SST-5和MNLI基准上跨不同Transformer模型规模的广泛评估表明,CAHP始终优于竞争基线,特别是在高压缩率情况下。此外,我们的结构分析表明,CAHP避免了基于梯度的剪枝方法的“邻近偏差”(倾向于主要保留靠近输出层的头),而是保留了模型中间层中功能关键的注意力头集合。

英文摘要

The remarkable success of Transformer-based models in natural language processing stems from architectural scaling, which leads to a large number of parameters and hinders deployment in resource-constrained environments. While structured pruning offers a pathway to compression, existing state-of-the-art methods often rely on gradient-based importance ranking or stochastic gating, which suffer from instability, structural degeneration, and the need for extensive manual hyperparameter tuning. In this paper, we introduce CAHP (Complementary Attention Head Pruning), a novel post-hoc framework that redefines head selection as a global graph-theoretical problem. Rather than evaluating heads in isolation, CAHP utilizes graph-based clustering combined with information-theoretic distance measures to identify and preserve a topologically diverse subset of complementary attention heads. Without requiring a predefined sparsity level or pruning ratio, the framework automatically determines the number of selected attention heads across layers by identifying a diminishing marginal performance curve, where pruning additional heads leads to a sharp degradation in performance, as determined by the chosen polynomial degree. Extensive evaluations on the SST-5 and MNLI benchmarks, across different Transformer model scales, demonstrate that CAHP consistently outperforms competitive baselines, particularly in high-compression regimes. Furthermore, our structural analysis shows that CAHP avoids the "proximity bias" of gradient-based pruning methods, which tend to preserve heads mainly in layers close to the output, and instead retains a functionally critical set of attention heads in the model's intermediate layers.

2606.16290 2026-06-18 cs.LG cs.AI 新提交

An affordable hardware-aware neural architecture search for deploying convolutional neural networks on ultra-low-power computing platforms

一种经济实惠的硬件感知神经架构搜索,用于在超低功耗计算平台上部署卷积神经网络

Andrea Mattia Garavagno, Edoardo Ragusa, Antonio Frisoli, Paolo Gastaldo

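Affiliations * University of Genoa; Scuola Superiore Sant’Anna

AI summary Proposes a lightweight hardware-aware neural architecture search that generates tiny CNNs able to run on ultra-low-power microcontrollers, lowering search cost while preserving classification accuracy.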

详情
Journal ref
IEEE Sensors Letters, vol. 8, no. 5, pp. 1-4, May 2024
AI中文摘要

硬件感知神经架构搜索(HW-NAS)通过自动设计能够满足预置硬件约束的神经架构,使得卷积神经网络(CNN)能够集成到微控制器设备中。然而,最先进的HW-NAS针对的是高性能微控制器,其功耗无法满足传感节点的要求。本文提出了一种HW-NAS方法,生成可在超低功耗微控制器上运行的微型CNN,其搜索过程轻量级,甚至可以在嵌入式设备上执行。在三个著名的微型计算机视觉基准测试上的实证结果表明,所提出的HW-NAS能够在保持最先进分类精度的同时生成微型CNN。

英文摘要

Hardware-aware neural architecture search (HW-NAS) allows the integration of Convolutional Neural Networks (CNNs) in microcontroller devices by automatically designing neural architectures that fit prearranged hardware constraints. However, state-of-the-art HW-NAS methods target high-performance microcontrollers, whose power consumption does not meet the requirements of sensing nodes. This work presents an HW-NAS that generates tiny CNNs able to run on ultra-low-power microcontrollers, with a lightweight search procedure that can be executed even on embedded devices. Empirical results on three well-known benchmarks for tiny computer vision show that the proposed HW-NAS generates tiny CNNs while preserving state-of-the-art classification accuracy.

7. Federated Learning, Privacy, and Security (7 papers)

2606.18309 2026-06-18 cs.LG cs.AI 新提交

SAGE: Retain-Aware Post-Hoc Sanitization of Final Unlearning Vector

SAGE: 保留感知的最终遗忘向量事后净化

Jingyuan Zhang, Yucheng Bai, Peixi Wen, Zhehao Huang, Zhengbao He, Hanling Tian, Xinwen Cheng, Haiyin Ran, Xiaolin Huang

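Affiliations * Institute of Image Processing and Pattern Recognition, Shanghai Jiao Tong University

AI summary Proposes SAGE, which sanitizes the final update vector post hoc, easing the trade-off between forgetting and retention in LLM unlearning without rerunning the original unlearning pipeline.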

详情
AI中文摘要

大语言模型(LLM)遗忘旨在移除不良知识或行为,同时保留已有能力。当前的遗忘方法都涉及遗忘与保留之间的权衡。我们发现,保留激活偏差也可用于量化遗忘方法对保留造成的损害,而无需考虑遗忘过程的具体实现。这使得我们能够通过事后方法恢复任何遗忘方法的保留性能。因此,我们提出一种互补的事后设置,在不重新运行原始遗忘流程的情况下净化最终更新向量。在该设置中,我们设计了SAGE(光谱激活-几何净化),一种对最终遗忘更新的源无关修正。SAGE从一个小型保留代理收集真实模块输入,提取其主导激活几何结构,并求解一个闭式源锚定优化目标,该目标抑制与高能保留方向对齐的更新分量,同时保留源方法的遗忘载体。在多种遗忘方法、模型规模和基准测试中,SAGE持续缓解保留-遗忘权衡,将最终向量的事后净化识别为机器遗忘中一个实用且未被充分探索的维度。

英文摘要

Large Language Model (LLM) unlearning aims to remove undesirable knowledge or behaviors while preserving retained capabilities. Current unlearning methods all involve a trade-off between unlearning and retention. We have found that the retention activation bias can also be used to quantify the damage an unlearning method inflicts on retention, without considering the specific implementation of the unlearning process. This allows us to restore retention performance for any unlearning method using a post-hoc approach. Therefore, we propose a complementary post-hoc setting to sanitize the final update vector without rerunning the original unlearning pipeline. In this setting, we design SAGE, Spectral Activation-GEometry Sanitization, a source-agnostic correction for final unlearning updates. SAGE collects real module inputs from a small retain proxy, extracts their dominant activation geometry, and solves a source-anchored optimization objective in closed form, which suppresses update components aligned with high-energy retained directions while preserving the source method's forgetting carrier. Across multiple unlearning methods, model scales, and benchmarks, SAGE consistently relieves the retain-forget trade-off, identifying post-hoc sanitization of final vectors as a practical and underexplored axis for machine unlearning.

2606.18384 2026-06-18 cs.LG cs.DC 新提交

SCOPE-FL: A Strategy-proof Chain-based Optimal pareto efficient Federated Learning System

SCOPE-FL:一种策略证明的基于链的最优帕累托高效联邦学习系统

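Seyed Salar Ghazi, Kaiwen Zhang, Mehdi Feizi, Hans-Arno Jacobsen

Affiliations * École de Technologie Supérieure (ÉTS); Ferdowsi University of Mashhad; University of Toronto

AI summary Client selection in hierarchical federated learning lacks Pareto efficiency and strategy-proofness, which lowers overall welfare. The paper proposes SCOPE-FL, which uses the Top Trading Cycle algorithm to guarantee both properties and executes the mechanism, including reward distribution, through blockchain smart contracts.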

详情
AI中文摘要

分层联邦学习(HFL)能够在分布式设备间实现可扩展的协作模型训练,同时保护数据隐私。然而,现有的HFL客户端选择机制存在根本性的策略低效问题。通过优先考虑稳定性而非帕累托效率(PE),它们产生次优的资源分配,并且缺乏策略证明性(SP),参与者有动机歪曲其真实偏好,这两种失败在实践中都会在帕累托意义上降低系统整体福利。为解决这一问题,我们提出SCOPE-FL(策略证明的基于链的最优帕累托高效联邦学习),一种同步HFL框架,将客户端选择建模为双边学校选择问题,通过顶级交易循环(TTC)算法求解,同时保证PE和SP。对于奖励分配,SCOPE-FL采用基于一轮重建(OR)的可扩展沙普利值近似,确保补偿与每个客户端的贡献成比例。整个机制通过区块链智能合约执行,为SP保证在实践中成立提供了防篡改环境。在MNIST、Fashion-MNIST和CIFAR-10上的综合评估表明,SCOPE-FL在模型准确率、收敛速度和奖励效率方面优于现有最先进方法(包括DA、IAS等),同时通信延迟与DA相当,区块链开销在大规模下显著低于DA。

英文摘要

Hierarchical Federated Learning (HFL) enables scalable collaborative model training across distributed devices while preserving data privacy. However, existing HFL client selection mechanisms suffer from a fundamental strategic inefficiency. By prioritizing stability over Pareto efficiency (PE), they produce suboptimal resource allocations, and without strategy-proofness (SP), participants are incentivized to misrepresent their true preferences. In practice, both failures degrade overall system welfare in the Pareto sense. To address this, we propose SCOPE-FL (Strategy-proof Chain-based Optimal pareto efficient Federated Learning), a synchronous HFL framework that formulates client selection as a two-sided school choice problem solved through the Top Trading Cycle (TTC) algorithm, which simultaneously guarantees PE and SP. For reward distribution, SCOPE-FL employs a scalable Shapley value approximation based on One-Round Reconstruction (OR), ensuring compensation proportional to each client's contribution. The entire mechanism executes via blockchain smart contracts, providing the tamper-proof environment required for the SP guarantees to hold in practice. A comprehensive evaluation on MNIST, Fashion-MNIST, and CIFAR-10 demonstrates that SCOPE-FL outperforms state-of-the-art approaches, including DA, IAS, and other methods, across model accuracy, convergence rate, and reward efficiency, while achieving communication latency comparable to DA and blockchain overhead significantly lower than DA at scale.
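The Top Trading Cycle procedure for school choice can be sketched in a few lines. In the sketch below, clients play the role of students and edge servers the role of schools; that mapping, the function name, and the data layout are assumptions for illustration, and the paper's blockchain execution and Shapley-based rewards are not modeled. The sketch also assumes every server's priority list ranks every client.

```python
def top_trading_cycles(prefs, priorities, capacity):
    """Match clients to servers with Top Trading Cycles.

    prefs[c]: servers in client c's order of preference (best first).
    priorities[s]: clients in server s's priority order (best first).
    capacity[s]: number of clients server s can accept.
    """
    remaining = dict(capacity)
    unassigned = set(prefs)
    assignment = {}
    while unassigned:
        # Each unassigned client points at its favourite server with a free seat.
        client_points = {}
        for c in unassigned:
            choice = next((s for s in prefs[c] if remaining.get(s, 0) > 0), None)
            if choice is not None:
                client_points[c] = choice
        if not client_points:
            break  # nobody left can be placed

        def top_client(server):
            # A server points at its highest-priority client still in play.
            return next(c for c in priorities[server] if c in client_points)

        # Walk client -> server -> client until a client repeats: that is a cycle.
        path = []
        c = min(client_points)
        while c not in path:
            path.append(c)
            c = top_client(client_points[c])
        cycle = path[path.index(c):]
        # Every client in the cycle receives the server it points at.
        for c in cycle:
            s = client_points[c]
            assignment[c] = s
            remaining[s] -= 1
            unassigned.discard(c)
    return assignment
```

In the usual school-choice reading, a client cannot gain by misreporting its preference list under this procedure, which is the strategy-proofness property the abstract relies on.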

2606.18518 2026-06-18 cs.LG cs.AI 新提交

PSyGenTAB: A Privacy-Preserving Framework for Synthetic Clinical Tabular Data Generation via Constrained Optimization

PSyGenTAB:通过约束优化生成合成临床表格数据的隐私保护框架

Arshia Ilaty, Hossein Shirazi, Manasi Chitale, Kedar Hegde, Dhanalakshmi Ramesh, Rashmi S. Manjunath, Amir Rahmani, Hajar Homayouni

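Affiliations * San Diego State University; University of California, Irvine

AI summary Proposes PSyGenTAB, a framework that formulates synthetic healthcare data generation as a constrained optimization problem and embeds configurable privacy constraints via the Augmented Lagrangian Method, maximizing clinical data utility while enforcing privacy thresholds; experiments show that models trained on the synthetic data perform comparably to models trained on real data.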

Comments 20 pages

详情
AI中文摘要

由于机构壁垒和严格的隐私法规(如HIPAA和GDPR),医疗AI的发展受到高质量临床数据获取限制。合成数据生成提供了一种潜在解决方案,但现有方法缺乏明确管理隐私-效用权衡的原则性机制,常常退化临床有意义的模式或面临患者重识别风险。我们提出PSyGenTAB,一个隐私保护生成框架,将合成医疗数据生成建模为使用增强拉格朗日方法求解的约束优化问题。通过将可配置的隐私约束直接嵌入模型训练,PSyGenTAB在最大化临床数据效用的同时强制执行最低隐私阈值。在多个临床驱动的基准测试中,PSyGenTAB保留了可靠健康AI所需的特征间临床关系和少数类诊断模式。使用“合成训练、真实测试”和“真实训练、合成测试”协议的下游评估表明,在合成数据上训练的模型达到了与真实患者记录训练模型相当的性能。隐私审计进一步证明了精确记录复制的减少和对成员推理攻击的强大抵抗力。这些结果确立了PSyGenTAB作为平衡合成医疗数据中隐私保护和临床效用的原则性框架,支持安全的跨机构AI开发。

英文摘要

The development of medical AI is constrained by limited access to high-quality clinical data due to institutional silos and strict privacy regulations such as HIPAA and GDPR. Synthetic data generation offers a potential solution, but existing methods lack principled mechanisms to explicitly manage the privacy-utility trade-off, often degrading clinically meaningful patterns or risking patient re-identification. We present PSyGenTAB, a privacy-preserving generative framework that formulates synthetic healthcare data generation as a constrained optimization problem solved using the Augmented Lagrangian Method. By embedding configurable privacy constraints directly into model training, PSyGenTAB enforces minimum privacy thresholds while maximizing clinical data utility. Across multiple clinically motivated benchmarks, PSyGenTAB preserves inter-feature clinical relationships and minority-class diagnostic patterns essential for reliable health AI. Downstream evaluation using Train-on-Synthetic, Test-on-Real and Train-on-Real, Test-on-Synthetic protocols shows that models trained on synthetic data achieve performance comparable to those trained on real patient records. Privacy auditing further demonstrates reduced exact record reproduction and strong resilience to membership inference attacks. These results establish PSyGenTAB as a principled framework for balancing privacy protection and clinical utility in synthetic healthcare data, supporting secure cross-institutional AI development.
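The Augmented Lagrangian Method named in the abstract alternates between minimizing a penalized objective and updating a multiplier for the constraint. The sketch below applies it to a one-dimensional toy problem: minimize a "utility loss" (x - 2)^2 subject to a "privacy constraint" x - 1 <= 0. The toy objective, the constraint, and all hyperparameters are assumptions for illustration and say nothing about PSyGenTAB's actual losses or constraints.

```python
def augmented_lagrangian(f_grad, g, g_grad, x0=0.0, rho=10.0,
                         outer_iters=20, inner_iters=200, lr=0.05):
    """Minimize f(x) subject to g(x) <= 0 for scalar x.

    Inner loop: gradient descent on the augmented Lagrangian in x.
    Outer loop: multiplier update lam <- max(0, lam + rho * g(x)).
    """
    x, lam = x0, 0.0
    for _ in range(outer_iters):
        for _ in range(inner_iters):
            # Effective multiplier of the inequality-constrained augmented Lagrangian.
            mult = max(0.0, lam + rho * g(x))
            x -= lr * (f_grad(x) + mult * g_grad(x))
        lam = max(0.0, lam + rho * g(x))
    return x, lam


# Toy instance: the unconstrained optimum x = 2 violates the constraint x <= 1.
x_opt, lam_opt = augmented_lagrangian(
    f_grad=lambda x: 2.0 * (x - 2.0),
    g=lambda x: x - 1.0,
    g_grad=lambda x: 1.0,
)
```

On this toy instance the iterate settles on the constraint boundary, and the multiplier measures how much utility is being traded away to satisfy the constraint.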

2606.18773 2026-06-18 cs.LG cs.AI 新提交

Private Learning with Public Feature Conditioning

基于公共特征条件化的私有学习

Shuli Jiang, Walid Krichene, Nicolas Mayoraz

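Affiliations * Microsoft; Google Research

AI summary For regression under label differential privacy, proposes Cond-DP, which builds a conditioning matrix from the structure of the public feature matrix to accelerate optimization, provides convergence guarantees in convex, strongly convex, and non-convex settings, and converges faster than DPSGD in linear regression.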

Comments Proceedings of the 43rd International Conference on Machine Learning (ICML 2026). 26 pages, 9 figures

详情
AI中文摘要

我们研究了每个数据样本包含公共、非敏感特征的设置下的差分隐私(DP)回归问题——这在推荐和广告系统等应用中很常见。虽然这种标签DP或半敏感特征设置主要在分类背景下进行了探索,但有效的回归方法仍未被充分研究。我们提出了Cond-DP,一种DPSGD的条件化变体,它利用公共特征矩阵的结构来改善隐私约束下的优化。受这些公共特征通常表现出快速衰减谱的观察启发,Cond-DP引入了一个数据驱动的条件化矩阵来重塑优化景观并加速收敛。我们为凸、强凸和非凸设置提供了收敛保证,并将标准DPSGD作为条件化矩阵为单位矩阵时的特例。我们展示了如何直接从公共特征为Cond-DP构造有效的条件化矩阵,从而在私有线性回归中实现比DPSGD更快的收敛速度,且不增加额外的隐私成本。实验表明,在标签DP下,使用该条件化矩阵的Cond-DP在多种数据集和模型架构上持续优于最先进的基线方法,展示了强大且稳健的实际性能。

英文摘要

We study differentially private (DP) regression in settings where each data sample includes public, non-sensitive features -- common in applications such as recommendation and advertising systems. While such label-DP or semi-sensitive-feature settings have been primarily explored in the context of classification, effective approaches for regression remain underexplored. We introduce Cond-DP, a conditioned variant of DPSGD that leverages the structure of public feature matrices to improve optimization under privacy constraints. Motivated by the observation that these public features often exhibit rapidly decaying spectra, Cond-DP incorporates a data-driven conditioning matrix to reshape the optimization landscape and accelerate convergence. We provide convergence guarantees for convex, strongly convex, and non-convex settings, and recover standard DPSGD as a special case when the conditioning matrix is the identity. We show how to construct an effective conditioning matrix for Cond-DP directly from public features, enabling provably faster convergence than DPSGD in private linear regression without incurring additional privacy cost. Empirically, Cond-DP with this conditioning matrix consistently outperforms state-of-the-art baselines across a wide range of datasets and model architectures under label DP, demonstrating strong and robust performance in practice.

2606.19220 2026-06-18 cs.LG cs.AI 新提交

Machine Unlearning for the XGBoost Model with Network Intrusion Datasets

面向网络入侵数据集的XGBoost模型机器遗忘

Diana Magalhães, Eva Maia, João Vitorino, Isabel Praça

发表机构 * GECAD, ISEP, Polytechnic of Porto(波尔图理工学院工程学院GECAD研究所)

AI总结 针对XGBoost模型提出XGBoost-Forget遗忘方法,在表格型网络入侵数据集上实现高效遗忘,保持模型性能的同时显著提升遗忘速度。

Comments 12 pages, 7 tables, WorldCist'26 Conference

详情
AI中文摘要

机器遗忘(MU)已成为一种从训练模型中移除特定数据点而无需完全重新训练的重要技术。然而,现有大多数MU研究集中于深度学习和图像数据,在网络入侵检测领域存在空白,该领域严重依赖表格数据。本文引入XGBoost-Forget,一种针对XGBoost模型的遗忘方法,以填补这一空白。该方法在两个表格型网络入侵(NI)数据集IoT-23和GeNIS上进行了评估,使用多个指标衡量模型性能、遗忘效率和遗忘质量。结果表明,XGBoost-Forget在保持接近原始模型的预测性能的同时,提供了显著更快的遗忘速度,展示了其在表格型NI场景中用于MU的潜力。

英文摘要

Machine Unlearning (MU) has emerged as an important technique for removing specific data points from trained models without requiring full retraining. However, most existing MU research focuses on deep learning and image data, leaving a gap in the domain of network intrusion detection, which relies heavily on tabular data. This work introduces XGBoost-Forget, an unlearning approach for the XGBoost model, to address this gap. The approach is evaluated on two tabular Network Intrusion (NI) datasets, IoT-23 and GeNIS, using multiple metrics to assess model performance, unlearning efficiency, and forgetting quality. The results show that XGBoost-Forget maintains predictive performance close to the original model while providing significantly faster unlearning, demonstrating its potential for MU in tabular NI settings.

2606.19222 2026-06-18 cs.LG cs.AI 新提交

Mechanism-Guided Selective Unlearning for RLVR-Induced Reasoning

机制引导的选择性遗忘:针对RLVR诱导的推理

Chenyu Zhou, Qiliang Jiang, Shuning Wu, Xu Zhou

发表机构 * School of Engineering, Institute of Science Tokyo, Japan(东京科学大学工学院) College of Control Science and Engineering, Zhejiang University, China(浙江大学控制科学与工程学院) Department of Electrical and Computer Engineering, National University of Singapore, Singapore(新加坡国立大学电气与计算机工程系)

AI总结 提出MAST方法,通过机制引导选择性更新参数,在遗忘RLVR诱导的推理行为时,显著降低对保留性能的附带损害。

Comments 15 pages, 4 figures, 7 tables

详情
AI中文摘要

我们提出MAST(机制对齐选择性目标),一种机制引导的方法,用于遗忘RLVR诱导的推理,其附带损害远低于标准全参数更新。在Qwen2.5-Math-1.5B和Qwen3-1.7B-Base的匹配SFT/RLVR检查点上,SFT到RLVR的增量在token级delta-log-probability上与SFT更新显著不同,而全参数梯度上升仅通过破坏保留的MATH和GSM8K来实现遗忘。MAST根据离主能量、更新幅度和遗忘梯度耦合幅度对注意力投影张量进行排序,然后仅更新排名最高的子集。在主模型上,MAST诱导了统计上显著的目标遗忘(MATH遗忘从45/150降至37/150;McNemar p=0.0078),同时保留了GSM8K(+0.8个百分点)和MATH保留(-0.5个百分点)。该优势在不同种子、NPO/SimNPO目标以及Qwen3上均得到复现,在Qwen3上MAST保留了GSM8K,而全参数遗忘导致其崩溃。

英文摘要

We propose MAST (Mechanism-Aligned Selective Targeting), a mechanism-guided method for unlearning RLVR-induced reasoning with substantially lower collateral damage than standard full-parameter updates. In matched SFT/RLVR checkpoints on Qwen2.5-Math-1.5B and Qwen3-1.7B-Base, the SFT-to-RLVR increment differs sharply from the SFT update in token-level delta-log-probability, and full-parameter gradient ascent forgets only by damaging retain MATH and GSM8K. MAST ranks attention-projection tensors by off-principal energy, update magnitude, and forget-gradient coupling magnitude, then updates only the top-ranked subset. On the primary model, MAST induces statistically significant target forgetting (MATH forget 45/150 to 37/150; McNemar p=0.0078) while preserving GSM8K (+0.8 pp) and MATH retain (-0.5 pp). The advantage reproduces across seeds, NPO/SimNPO objectives, and Qwen3, where MAST preserves GSM8K while full-parameter unlearning collapses it.
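The reported significance level can be checked with an exact McNemar test. The abstract gives only the net change (45/150 to 37/150), so the split of discordant pairs used here (8 problems flipping from solved to unsolved, none flipping back) is an assumption chosen because it matches the reported numbers; the function name is hypothetical.

```python
from math import comb

def mcnemar_exact_p(b, c):
    """Two-sided exact McNemar p-value from the two discordant counts."""
    n = b + c
    k = min(b, c)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)

# Assumed split: 8 problems lost, 0 gained (net change 45 -> 37 out of 150).
p_value = mcnemar_exact_p(8, 0)
```

Under this assumed split the value is 2 × 0.5^8 = 0.0078125, which rounds to the reported p = 0.0078.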

2606.19262 2026-06-18 cs.LG 新提交

Detecting Hidden ML Training With Zero-Overhead Telemetry

使用零开销遥测检测隐藏的机器学习训练

Robi Rahman, Sabiha Tajdari

发表机构 * Machine Intelligence Research Institute(机器智能研究所) University of Virginia(弗吉尼亚大学)

AI总结 本文评估了仅使用零开销、隐私保护的NVML遥测(内容无关信号)对GPU工作负载分类的对抗鲁棒性,开发了一个分类器,在识别训练工作负载时达到98.2%的二元准确率,并对最具挑战性的意外工作负载达到43-87%的准确率。

Comments Technical AI Governance Research workshop at ICML 2026

详情
AI中文摘要

硬件支持的GPU工作负载监控是许多AI计算治理方案的基础,但如果开发者能够击败监控机制,这些方案将不可行。我们评估了仅使用零开销、隐私保护的NVML遥测(内容无关信号,观察计算的物理效应而不访问模型权重、训练数据或超参数)的GPU工作负载分类的对抗鲁棒性。在5轮监控-逃避迭代中,我们在跨越4代架构的9个GPU模型上评估了20种逃避策略家族。我们开发了一个分类器,在整个语料库上识别训练工作负载时达到98.2%的二元准确率,并在最具挑战性的意外工作负载上(即使它们被对抗性伪装)达到43-87%的准确率。

英文摘要

Hardware-enabled monitoring of GPU workloads underpins many proposals for AI compute governance, but if developers can defeat monitoring mechanisms, such schemes are unworkable. We evaluate the adversarial robustness of GPU workload classification using only zero-overhead, privacy-preserving NVML telemetry: content-agnostic signals that observe physical effects of computation without accessing model weights, training data, or hyperparameters. Across 5 rounds of monitor-evader iteration, we evaluate 20 evasion strategy families on 9 GPU models spanning 4 architecture generations. We develop a classifier that achieves 98.2% binary accuracy at identifying training workloads across the whole corpus, and 43-87% accuracy against the most challenging unexpected workloads even when they are adversarially disguised.

8. 鲁棒性、不确定性与可信学习 8 篇

2606.18322 2026-06-18 cs.LG cs.AI 新提交

SAE Interventions are Unreliable: Post-Intervention Recovery of Suppressed Behavior

SAE干预不可靠:干预后抑制行为的恢复

Mingyue Cui, Linghui Shen, Xingyi Yang

发表机构 * The Hong Kong Polytechnic University(香港理工大学)

AI总结 研究发现稀疏自编码器(SAE)特征干预虽能抑制行为,但存在可恢复的失败模式,通过优化残差扰动可恢复原始行为,揭示特征级控制与行为完整性之间的差距。

Comments Code: https://github.com/Mingyuee88/sae-post-intervention-recovery, Project page: https://mingyuee88.github.io/sae-post-intervention-recovery/

详情
AI中文摘要

稀疏自编码器(SAE)将残差流激活分解为可解释特征。最近的潜在空间防御越来越依赖这些分解,假设识别出的“不安全”SAE特征可作为监控和干预的可操作手柄。在这种范式下,固定特定有害特征预期能可靠地防止模型不当行为。然而,我们表明这种成功可能隐藏一种可恢复的失败模式:固定可能阻止行为的一条可见路径,但并未消除行为本身。我们将这种脆弱性形式化为干预后恢复,这是一个受约束的残差空间优化问题。从干预后的残差状态开始,我们优化残差扰动以恢复干预前的行为,同时保持目标SAE特征的干预后值。即使在干预在优化和生成过程中保持活跃的强威胁模型下,恢复仍然可能。为了排除恢复仅仅是撤销干预的可能性,我们使用编码器正交更新进行单层干预,并在跨层设置中使用相应的特征图雅可比矩阵。在TPP、遗忘、IOI和拒绝引导实验中,这种压力测试揭示了尽管特征级干预成功,行为仍可恢复。特别是在安全关键的拒绝引导设置中,我们在有效样本上实现了95.8%的恢复率,同时将防御特征的相对漂移保持在0.131,远低于基于后缀的基线。恢复路径归因分析进一步将这种恢复定位到SAE重建残差,即SAE未解释的组件。这些结果暴露了特征级控制与行为完整性之间的差距:SAE特征可以支持因果干预,但控制它们并不能保证对底层行为的控制。

英文摘要

Sparse Autoencoders (SAEs) decompose residual-stream activations into interpretable features. Recent latent-space defenses increasingly rely on these decompositions, assuming that identified "unsafe" SAE features serve as actionable handles for monitoring and intervention. In this paradigm, clamping a specific harmful feature is expected to reliably prevent model misbehavior. However, we show that this success may hide a recoverable failure mode: the clamp may block one visible route to a behavior without eliminating the behavior itself. We formulate this vulnerability as post-intervention recovery, a constrained residual-space optimization problem. Starting from the post-intervention residual state, we optimize residual perturbations to recover the pre-intervention behavior while preserving the post-intervention values of the targeted SAE features. Even under a strong threat model where the intervention remains active throughout optimization and generation, recovery remains possible. To rule out that recovery simply undoes the intervention, we use encoder-orthogonal updates for single-layer interventions and the corresponding feature-map Jacobian in the cross-layer setting. Across TPP, unlearning, IOI, and refusal steering experiments, this stress test reveals recoverable behavior despite successful feature-level intervention. Especially in the safety-critical refusal-steering setting, we achieve a 95.8% recovery rate on valid samples while keeping defended-feature relative drift to 0.131, substantially below suffix-based baselines. A recovery-path attribution analysis further localizes this recovery to the SAE reconstruction residual, the component left unexplained by the SAE. These results expose a gap between feature-level control and behavioral completeness: SAE features can support causal intervention, but controlling them does not guarantee control over the underlying behavior.
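One way to read the encoder-orthogonal update is as a projection that removes from a residual perturbation every component along the encoder directions of the defended features. The sketch below assumes a linear SAE encoder pre-activation (feature value determined by `W_enc @ x + b`), so a perturbation orthogonal to the target rows leaves those pre-activations unchanged. The function name and shapes are hypothetical, and the paper's exact construction may differ.

```python
import numpy as np

def encoder_orthogonal_projection(delta, W_target):
    """Remove from delta every component along the target encoder rows.

    W_target has shape (k, d): one encoder row per defended feature.
    """
    # Least-squares coefficients of delta on the rows of W_target.
    coeffs, *_ = np.linalg.lstsq(W_target.T, delta, rcond=None)
    return delta - W_target.T @ coeffs
```

In an optimization loop, such a projection would be applied to each candidate update before it is added to the residual state, so that the defended feature values stay at their post-intervention levels.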

2606.18418 2026-06-18 cs.LG 新提交

P$^2$CE: Model-Agnostic Plausible Pareto-Optimal Counterfactual Explanations

P$^2$CE: 模型无关的可行帕累托最优反事实解释

Arthur Hendricks Mendes de Oliveira, Giovani Valdrighi, Marcos Medeiros Raimundo

AI总结 提出P$^2$CE算法,利用隔离森林异常检测和SHAP值,生成可行且帕累托最优的反事实解释,平衡可行性、合理性和计算效率。

Comments Under review in the Machine Learning journal

详情
AI中文摘要

机器学习算法在社会应用中的日益普及引发了对公平性和透明度的担忧,从而推动了反事实解释的发展。这些解释通过提供可操作的输入特征更改,帮助个人理解并可能改变在贷款申请、工作选择等领域的不利决策。现有方法往往难以平衡可行性、合理性和计算效率。为此,我们提出了P$^2$CE,一种生成可行帕累托最优反事实解释的算法,为用户提供不同可行性概念之间的多样化最优权衡。P$^2$CE使用辅助隔离森林异常检测器确保解释符合数据分布,并利用SHAP值在短时间内获得最优结果,与底层模型无关。我们在三个数据集上进行了实证评估,结果表明,与相关技术相比,该算法在解决方案质量和计算效率方面均表现出优越性能。

英文摘要

The increasing use of machine learning algorithms in social applications has raised concerns about fairness and transparency, leading to the development of counterfactual explanations. These explanations help individuals understand and potentially alter unfavorable decisions in areas such as loan applications and job selection by providing actionable changes to input features that would lead to a desired outcome. Existing methods often struggle to balance feasibility, plausibility, and computational efficiency. To address this, we introduce P$^2$CE, an algorithm for generating plausible Pareto-optimal counterfactual explanations, offering users a diverse set of optimal trade-offs between different notions of feasibility. P$^2$CE employs an auxiliary isolation forest outlier detector to ensure that explanations are in accordance with the data distribution and leverages SHAP values to obtain optimal results with short computing times, regardless of the underlying model. Our algorithm was empirically evaluated on three datasets, demonstrating superior performance in terms of both solution quality and computational efficiency compared to related techniques.
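The Pareto-optimality requirement can be illustrated with a small non-dominated filter over candidate counterfactuals. This is only a sketch: each row holds hypothetical cost values for one candidate (for example, number of changed features and distance to the original input), lower is better, and the plausibility check with an isolation forest and the SHAP-guided search from the abstract are omitted.

```python
import numpy as np

def pareto_front(costs):
    """Return indices of rows not dominated by any other row (lower is better)."""
    costs = np.asarray(costs, dtype=float)
    keep = []
    for i, c in enumerate(costs):
        dominated = np.any(
            np.all(costs <= c, axis=1) & np.any(costs < c, axis=1)
        )
        if not dominated:
            keep.append(i)
    return keep

candidates = [[1.0, 3.0], [2.0, 2.0], [3.0, 1.0], [3.0, 3.0], [2.0, 2.0]]
front = pareto_front(candidates)
```

Candidates with identical costs do not dominate each other, so both copies of `[2.0, 2.0]` remain on the front, while `[3.0, 3.0]` is removed.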

2606.18430 2026-06-18 cs.LG cs.CR 新提交

Signature filtering: a lightweight enhancement for statistical watermark detection in large language models

签名过滤:大型语言模型中统计水印检测的轻量级增强方法

Chih-Duo Hong, Yen-Pang Chen, Fang Yu

发表机构 * National Chengchi University(国立政治大学)

AI总结 提出签名过滤模块,通过移除干扰水印检测的签名令牌,在弱信号和低熵设置下将检测率从8-31%提升至78-99%,同时保持可控的假阳性率。

详情
AI中文摘要

统计水印帮助组织归因大型语言模型(LLM)的输出,但现有检测器在水印信号弱、文本重复或水印被编辑时往往表现不佳。我们提出签名过滤,一种检测时模块,在不修改水印嵌入和文本生成的情况下增强水印检测。它学习一小部分“签名”令牌,这些令牌的存在会使水印测试不可靠,并在检测前移除这些令牌。通过在小训练集上求解混合整数线性规划获得签名,约束条件最大化真阳性率。我们还推导了在几种攻击者模型(色盲、颜色自适应和分布相关)下的有限样本和渐近界。在四个知名水印家族(Kgw、Sweet、Unigram、Exp)、四个基准语料库(C4、MBPP、HumanEval、Code-Search-Net)和六个LLM(Opt-1.3b、Opt-6.7b、Llama2-13b、Llama3.1-8b、Qwen2.5-14b、Phi-3-medium-14b)上,2-gram和3-gram签名在弱信号和低熵设置下将检测率从无过滤时的8-31%提升至78-99%,同时保持假阳性率可控且通常可忽略。在压力测试中,我们打乱句子并稀释、删除和替换25-50%的令牌,针对Kgw风格水印的2-gram过滤器保留了大部分干净文本的检测增益,通常匹配或超越先进的WinMax水印检测器。因此,签名过滤提供了一种简单、可扩展且模型无关的附加组件,以加强信息处理工作流中LLM文本基于水印的来源检查。

英文摘要

Statistical watermarks help organizations attribute large language model (LLM) outputs, yet existing detectors often struggle when watermark signals are weak, texts are repetitive, or watermarks are edited. We propose signature filtering, a detection-time module that enhances watermark detection without modifying watermark embedding and text generation. It learns a small set of "signature" tokens whose presence makes watermark tests unreliable, and removes these tokens before detection. The signatures are obtained by solving a mixed-integer linear program on a small training set, with constraints that maximize the true positive rate. We additionally derive finite-sample and asymptotic bounds under several attacker models (color-blind, color-adaptive, and distributionally correlated). On four well-known watermark families (Kgw, Sweet, Unigram, Exp), four benchmark corpora (C4, MBPP, HumanEval, Code-Search-Net), and six LLMs (Opt-1.3b, Opt-6.7b, Llama2-13b, Llama3.1-8b, Qwen2.5-14b, Phi-3-medium-14b), 2- and 3-gram signatures raise detection rates in weak-signal and low-entropy settings from 8-31% without filtering to 78-99% with filtering, while keeping false positives controllable and often negligible. In stress tests where we scramble sentences and perturb 25-50% of tokens by dilution, deletions, and substitutions, 2-gram filters for Kgw-style watermarks preserve most of the clean-text detection gains, often matching or outperforming the advanced WinMax watermark detector. Signature filtering thus provides a simple, scalable, and model-agnostic add-on to strengthen watermark-based provenance checks for LLM text in information processing workflows.
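The filter-then-detect flow can be sketched as below. The signature set is assumed to be given (the mixed-integer program that learns it is not shown), and the detector is simplified to a context-free green-token test with a one-proportion z-score, which is closer to a Unigram-style watermark than to Kgw, where the green list depends on preceding tokens. All function names are hypothetical.

```python
from math import sqrt

def filter_signatures(tokens, signatures, n=2):
    """Drop every token covered by an n-gram listed in `signatures`."""
    drop = set()
    for i in range(len(tokens) - n + 1):
        if tuple(tokens[i:i + n]) in signatures:
            drop.update(range(i, i + n))
    return [t for i, t in enumerate(tokens) if i not in drop]

def green_z_score(tokens, is_green, gamma=0.5):
    """One-proportion z-test on the fraction of green tokens (needs tokens)."""
    T = len(tokens)
    greens = sum(1 for t in tokens if is_green(t))
    return (greens - gamma * T) / sqrt(T * gamma * (1 - gamma))
```

Removing repetitive, low-entropy n-grams shortens the text but can raise the z-score when the removed tokens carried little watermark signal, which is the effect the abstract reports for weak-signal settings.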

2606.18454 2026-06-18 cs.LG cs.AI 新提交

Veriphi: Attack-Guided Neural Network Verification with Dataset-Dependent Training Methods

Veriphi: 基于攻击引导的神经网络验证与数据集依赖训练方法

Pratik Deshmukh, Kartik Arya, Vasili Savin

发表机构 * TU Wien(维也纳工业大学)

AI总结 提出Veriphi系统,结合快速对抗攻击与α,β-CROWN形式化边界验证,实验表明训练方法有效性依赖数据集特性,IBP在MNIST上有效但在CIFAR-10上失效,PGD对抗训练在小扰动下达到94%认证准确率,并实现5倍验证加速。

Comments 17 Pages, 8 Figures

详情
AI中文摘要

我们提出Veriphi,一个GPU加速的神经网络验证系统,它使用α,β-CROWN方法将快速对抗攻击与形式化边界认证相结合。通过在MNIST和CIFAR-10上使用三种训练方法(标准、对抗、认证)进行系统实验,我们证明了训练方法的有效性从根本上依赖于数据集。区间边界传播(IBP)在简单的MNIST(784维)上达到78%的认证准确率,但在更复杂的CIFAR-10数据集上提供的认证性能可忽略不计,而在小扰动下PGD对抗训练以94%的认证率占主导地位。我们通过攻击引导的证伪实现了5倍的验证加速,并将我们的方法扩展到生产规模模型(1.058亿参数),用于实际航空航天物流优化。我们的结果挑战了认证训练普遍优于对抗训练的假设,表明上下文对于验证策略选择至关重要。

英文摘要

We present Veriphi, a GPU-accelerated neural network verification system that combines fast adversarial attacks with formal bound certification using alpha,beta-CROWN methods. Through systematic experiments on MNIST and CIFAR-10 using three training methodologies (standard, adversarial, certified), we demonstrate that training method effectiveness is fundamentally dataset-dependent. Interval Bound Propagation (IBP) achieves 78% certified accuracy on simple MNIST (784 dimensions) but provides negligible certification performance on the more complex CIFAR-10 dataset, where PGD adversarial training dominates with 94% certification at small perturbations. We achieve 5x verification speedup through attack-guided falsification and scale our approach to production-size models (105.8M parameters) for real-world aerospace logistics optimization. Our results challenge the assumption that certified training universally outperforms adversarial training, showing context matters critically for verification strategy selection.
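Interval Bound Propagation, mentioned above, can be shown on a single linear layer followed by a ReLU. This is the standard textbook form of IBP written as a small NumPy sketch; it is not Veriphi's implementation, and the function names are hypothetical.

```python
import numpy as np

def ibp_linear(lower, upper, W, b):
    """Propagate the box [lower, upper] through x -> W @ x + b."""
    center = (lower + upper) / 2.0
    radius = (upper - lower) / 2.0
    out_center = W @ center + b
    out_radius = np.abs(W) @ radius
    return out_center - out_radius, out_center + out_radius

def ibp_relu(lower, upper):
    """ReLU is monotone, so it maps bounds elementwise."""
    return np.maximum(lower, 0.0), np.maximum(upper, 0.0)
```

In an attack-guided pipeline, a fast adversarial attack would typically run first, and a bound computation like this one would be needed only for inputs on which the attack finds no counterexample.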

2606.18697 2026-06-18 cs.LG cs.CR cs.RO 新提交

Stealthy World Model Manipulation via Data Poisoning

通过数据投毒进行隐蔽的世界模型操纵

Yibin Hu, Xiaolin Sun, Zizhan Zheng

发表机构 * Department of Computer Science(计算机科学系)

AI总结 提出SWAAP框架,通过两阶段数据投毒(双层级优化寻找有害目标模型+梯度匹配隐蔽实现)操纵学习到的世界模型,导致规划性能显著下降,且能规避多种防御检测。

Comments 41 pages, 8 figures, 11 tables. Submitted to NeurIPS 2026

详情
AI中文摘要

基于模型的学习智能体使用学习到的世界模型来预测未来状态、规划行动并适应新环境。然而,从收集的经验中更新世界模型的过程创造了一个训练时攻击面:对抗性投毒的微调轨迹可以操纵学习到的动力学,从而破坏下游规划。在本文中,我们提出了SWAAP,这是第一个针对学习到的世界模型的两阶段数据投毒框架。在第一阶段,SWAAP利用过渡梯度定理实现的一阶双层优化,识别出一个有害的目标世界模型,该模型在规划下诱导低回报行为,同时保持接近干净动力学。在第二阶段,SWAAP通过隐蔽约束的梯度匹配实现该目标,仅修改有限比例的微调过渡目标,使得诱导的训练梯度将受害者模型引向对抗目标,同时预测误差正则化器鼓励投毒目标保持接近世界模型的自然近似误差。为了评估攻击的隐蔽性,我们在投毒管道的三个阶段评估了防御和可检测性:投毒过渡的预训练检测、微调期间的鲁棒训练以及测试时对结果世界模型的监控。在多种连续控制任务中,SWAAP导致显著的性能下降,同时保持投毒过渡接近干净数据,并规避了评估的非自适应残差/CUSUM/TRIM风格防御。这些结果揭示了世界模型适应管道中的实际漏洞,并强调了需要保护世界模型训练数据和所学动力学的鲁棒性方法。

英文摘要

Model-based learning agents use learned world models to predict future states, plan actions, and adapt to new environments. However, the process of updating world models from collected experience creates a training-time attack surface: adversarially poisoned fine-tuning trajectories can manipulate the learned dynamics and thereby corrupt downstream planning. In this paper, we propose SWAAP, the first two-stage data poisoning framework for learned world models. In the first stage, SWAAP identifies a harmful target world model that induces low-return behavior under planning while remaining close to clean dynamics, using first-order bilevel optimization enabled by a transition-gradient theorem. In the second stage, SWAAP realizes this target through stealth-constrained gradient matching, modifying only a limited fraction of fine-tuning transition targets so that the induced training gradients steer the victim model toward the adversarial target, while a prediction-error regularizer encourages the poisoned targets to remain close to the world model's natural approximation error. To assess attack stealthiness, we evaluate defenses and detectability across three stages of the poisoning pipeline: pre-training detection of poisoned transitions, robust training during fine-tuning, and test-time monitoring of the resulting world model. Across diverse continuous-control tasks, SWAAP causes substantial performance degradation while keeping poisoned transitions close to clean data and evading the evaluated non-adaptive residual/CUSUM/TRIM-style defenses. These results reveal a practical vulnerability in world-model adaptation pipelines and highlight the need for robustness methods that protect both world-model training data and learned dynamics.

2606.18832 2026-06-18 cs.LG cs.AI 新提交

Target-confidence Recourse Using tSeTlin machines: TRUST

使用Tsetlin机器的目标置信度追索:TRUST

K. Darshana Abeyrathna, Sara El Mekkaoui, Nils Enric Canut Taugbøl, Anuja Vats

发表机构 * Group Research and Development Det Norske Veritas (DNV)(挪威船级社(DNV)集团研发部)

AI总结 提出TRUST框架,通过概率Tsetlin机器和贝叶斯优化直接搜索满足用户指定置信度目标的最小输入变化,生成更稳健和可解释的反事实解释。

详情
AI中文摘要

反事实解释被广泛用于高风险决策系统中的算法追索。大多数现有方法寻求最小化改变输入以翻转模型决策。然而,决策者通常不仅依赖预测标签,还依赖置信度阈值和风险边际。刚好越过决策边界的反事实在噪声或模型变化下可能脆弱且不稳定。本文提出使用Tsetlin机器的目标置信度追索(TRUST),一种用户明确指定追索所需预测置信度的框架。TRUST不是先生成反事实再评估置信度,而是直接搜索满足用户定义置信度目标的最小变化,从而在成本、置信度和鲁棒性方面比较追索选项。我们使用概率Tsetlin机器(PTM)结合贝叶斯优化实例化TRUST。PTM基于概率子句的结构将预测置信度与决策规则的稳定性联系起来。我们表明,满足相同规则的反事实在可靠性上可能差异很大,取决于它们满足这些规则的安全程度,揭示了决策是由稳健还是脆弱的子句激活支持的。在合成和真实数据集上的实验表明,目标置信度反事实比传统的基于边界的方法产生更稳健和可解释的追索。在多个基准测试中,TRUST实现了完美的鲁棒性,同时保持较低的追索成本,包括在Haberman数据集上以0.92置信度达到0.10的L2距离。通过显式控制置信度和暴露规则级稳定性,TRUST为高风险决策支持提供了可操作的追索。

英文摘要

Counterfactual explanations are widely used to provide algorithmic recourse in high-stakes decision-making systems. Most existing methods seek the smallest change to an input that flips a model's decision. However, decision-makers often rely not only on predicted labels but also on confidence thresholds and risk margins. Counterfactuals that barely cross a decision boundary can be fragile and unstable under noise or model variation. In this paper, we propose Target-confidence Recourse Using tSeTlin machines (TRUST), a framework in which users explicitly specify the desired prediction confidence for recourse. Rather than generating counterfactuals and evaluating confidence afterward, TRUST directly searches for minimal changes that satisfy a user-defined confidence target, enabling comparison of recourse options in terms of cost, confidence, and robustness. We instantiate TRUST using a Probabilistic Tsetlin Machine (PTM) combined with Bayesian optimization. The probabilistic clause-based structure of PTM links prediction confidence to the stability of decision rules. We show that counterfactuals satisfying the same rules can still differ substantially in reliability depending on how securely they satisfy those rules, revealing whether decisions are supported by robust or fragile clause activations. Experiments on synthetic and real-world datasets demonstrate that target-confidence counterfactuals produce more robust and interpretable recourse than conventional boundary-based approaches. Across multiple benchmarks, TRUST achieves perfect robustness while maintaining low recourse cost, including an L2 distance of 0.10 on the Haberman dataset at 0.92 confidence. By explicitly controlling confidence and exposing rule-level stability, TRUST provides actionable recourse for high-stakes decision support.

2606.18839 2026-06-18 cs.LG cs.CV 新提交

Semantic Robustness Certification for Vision-Language Models

视觉语言模型的语义鲁棒性认证

Peiyu Yang, Paul Montague, Feng Liu, Andrew C. Cullen, Amardeep Kaur, Christopher Leckie, Sarah M. Erfani

发表机构 * School of Computing & Information Systems, University of Melbourne, Australia

AI总结 提出首个无需额外数据即可认证视觉语言模型在语义层面(如形状、大小、风格)鲁棒性的框架,通过文本提示作为语义代理并量化决策边界,确保预测类别在语义变换下不变。

Comments Accepted to ICML

详情
AI中文摘要

视觉语言模型(VLM)现在被广泛用于下游任务。然而,现实世界的应用常常使VLM面临由语义变化(例如形状、大小和风格)引起的分布偏移。鲁棒性认证确定当对输入应用变换时模型的预测是否改变。虽然大多数认证框架研究输入的几何或像素级变换,但本文提出了一种新颖的框架,能够在语义级变换下认证VLM的鲁棒性。利用VLM的开放词汇能力,我们使用文本提示作为语义代理来构建由控制语义变化程度的范围参数化的变换。通过以封闭形式表征VLM决策边界,我们的框架定量地认证了在语义变换下预测类别保持不变的范围区间。我们的框架是第一个在语义级变化下认证VLM鲁棒性而无需为每种变化提供额外数据的框架,使其易于应用。在合成数据和真实数据上的实验表明,我们的框架能够在各种场景下认证针对多种语义变化的鲁棒性。

英文摘要

Vision-language models (VLMs) are now widely used in downstream tasks. However, real-world applications often expose VLMs to distribution shifts induced by semantic variation (e.g., shape, size, and style). Robustness certification determines if a model's prediction changes when transformations are applied to its input. While most certification frameworks study geometric or pixel-level transformations over inputs, this work proposes a novel framework that enables certifying VLM robustness under semantic-level transformations. Leveraging the open-vocabulary capability of VLMs, we use text prompts as semantic proxies to construct transformations parameterized by an extent that controls the degree of semantic variation. By characterizing the VLM decision boundary in closed form, our framework quantitatively certifies extent intervals for which the predicted class remains unchanged under the semantic transformation. Our framework is the first to certify VLM robustness under semantic-level variations without requiring additional data for each variation, making it practical to apply. Experiments on both synthetic and real-world data show that our framework enables certifying robustness under diverse semantic variations across scenarios.

2606.18867 2026-06-18 cs.LG cs.CY stat.ML 新提交

Strategic Feature Selection

战略特征选择

Jivat Neet Kaur, Pratik Patil, Divya Shanmugam, Emma Pierson, Michael I. Jordan, Nika Haghtalab, Meena Jagadeesan, Ahmed Alaa, Serena Wang

发表机构 * University of California, Berkeley(加州大学伯克利分校) University of Texas, Austin(德克萨斯大学奥斯汀分校) Cornell Tech(康奈尔科技) Stanford University(斯坦福大学) University of Pennsylvania(宾夕法尼亚大学) Harvard University(哈佛大学) Inria, Paris(巴黎Inria)

AI总结 研究通过特征选择和岭正则化应对战略操纵的分类问题,发现仅基于可操纵性排除特征通常次优,提出联合优化特征集与正则化水平的算法,并在医疗支付基准上验证。

详情
AI中文摘要

当算法预测器在高风险领域(如医疗)中指导资源分配时,这些预测器必须考虑输入特征的战略操纵。典型的解决方案是重新设计预测器本身以明确考虑战略互动。然而在实践中,决策者通常受限于调整现有预测管道中的粗粒度杠杆。例如,医疗组织通常根据感知的可操纵性选择排除哪些特征,同时使用标准正则化程序来收缩保留特征的系数。在这项工作中,我们通过特征选择及其与岭正则化的相互作用,启动了对战略分类的形式化研究。我们的主要发现是,仅基于可操纵性排除单个特征通常是次优的。我们提供了在最优正则化下特征子集性能的细粒度刻画,为政策设计提供了新的见解。受此刻画启发,我们开发了一种实用算法,用于联合选择特征集和岭正则化水平。通过一个关于医疗支付基准的真实世界案例研究,我们说明了我们的算法如何指导实践中粗粒度政策杠杆的设计。我们的结果为减轻算法决策系统中战略行为的影响提供了一个有原则的、实用的框架。

英文摘要

When algorithmic predictors inform resource allocation in high-stakes domains such as healthcare, these predictors must account for strategic manipulation of input features. The typical solution is to redesign the predictor itself to explicitly account for strategic interactions. In practice, however, decision makers are often constrained to adjusting coarser levers within existing prediction pipelines. For example, healthcare organizations often select which features to exclude based on perceived manipulability, while using standard regularization procedures to shrink the coefficients of retained features. In this work, we initiate a formal study of strategic classification through feature selection and its interaction with ridge regularization. Our main finding is that excluding individual features based on their manipulability alone is generally suboptimal. We provide a fine-grained characterization of the performance of a feature subset under optimal regularization, yielding new insights for policy design. Motivated by this characterization, we develop a practical algorithm for jointly choosing the feature set and the level of ridge regularization. Through a real-world case study on a healthcare payments benchmark, we illustrate how our algorithm can guide the design of coarse policy levers in practice. Our results provide a principled, practical framework for mitigating the effects of strategic behavior in algorithmic decision-making systems.

9. 图学习与结构化数据 5 篇

2606.18317 2026-06-18 cs.LG 新提交

Enhanced Graph Neural Networks using K-Hop Gaussian Diffusion

使用K跳高斯扩散增强图神经网络

Xuling Zhang, Peng Wang, Daiyan Li, Aoran Huang, Zeiwei Chen, Yongkui Yang

发表机构 * Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences(中国科学院深圳先进技术研究院) Southern University of Science and Technology(南方科技大学)

AI总结 提出K跳高斯扩散核作为预处理模块,通过多跳扩散和高斯权重平衡局部与全局信息,在噪声或结构复杂图中优于传统消息传递和现有扩散方法。

Comments 5 pages, 3 figures

详情
AI中文摘要

大多数图神经网络核心依赖于图卷积,通常实现为直接(单跳)邻居之间的消息传递。在许多现实世界的图中,边可能带有噪声或定义不明确,限制了信息传播到局部邻域。现有的扩散核,如个性化PageRank和热核,通过全局传播缓解了这个问题,但仍然难以处理复杂的局部结构和远距离节点噪声。为了解决这些限制,我们提出了一种K跳高斯扩散核作为图数据的预处理模块。KHG引入了多跳扩散,并对远程节点进行高斯加权,在应用标准GNN之前平衡局部和全局信息传播。在多个基准数据集上的实验表明,KHG显著优于传统的消息传递GNN,以及PPR和热核扩散,特别是在噪声或结构复杂的图中。

英文摘要

Most graph neural network (GNN) cores rely on graph convolutions, typically implemented as message passing between direct (single-hop) neighbors. In many real-world graphs, edges can be noisy or poorly defined, limiting information propagation to local neighborhoods. Existing diffusion kernels, such as Personalized PageRank (PPR) and Heat Kernel, alleviate this issue through global propagation, but still struggle with complex local structures and distant node noise. To address these limitations, we propose a K-Hop Gaussian (KHG) diffusion kernel as a preprocessing module for graph data. KHG introduces multi-hop diffusion with Gaussian weighting for remote nodes, balancing local and global information propagation before applying standard GNNs. Experiments on multiple benchmark datasets demonstrate that KHG significantly outperforms traditional message-passing GNNs, as well as PPR and Heat Kernel diffusion, particularly in noisy or structurally complex graphs.
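A multi-hop diffusion kernel with Gaussian weighting over hop distance might look like the sketch below. The abstract does not give the kernel's formula, so the specific form used here (powers of a row-normalized adjacency matrix with self-loops, weighted by exp(-k² / 2σ²) and normalized to sum to one) is an assumption, as are the function name and parameters.

```python
import numpy as np

def khop_gaussian_kernel(A, K=3, sigma=1.0):
    """Mix powers of the random-walk matrix with Gaussian weights over hops."""
    A = np.asarray(A, dtype=float) + np.eye(len(A))   # self-loops avoid zero rows
    T = A / A.sum(axis=1, keepdims=True)               # row-stochastic transitions
    hops = np.arange(K + 1)
    weights = np.exp(-hops ** 2 / (2.0 * sigma ** 2))
    weights /= weights.sum()
    kernel = np.zeros_like(T)
    power = np.eye(len(A))
    for w in weights:
        kernel += w * power        # add the k-hop term
        power = power @ T          # advance to the next hop
    return kernel
```

The resulting dense matrix could replace the adjacency matrix as a preprocessing step before a standard GNN; larger `sigma` gives distant hops more weight.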

2606.18444 2026-06-18 cs.LG cs.AI 新提交

TMR-GGNN: Credit Card Fraud Detection based on Time-Aware Multi-Relational Guided Graph Neural Network

TMR-GGNN:基于时间感知多关系引导图神经网络的信用卡欺诈检测

Rohit Tewari, Shubhankar Shilpi, Navin Chhibber, Devendra Singh Parmar, Sunil Khemka, Piyush Ranjan

发表机构 * Unysis Truist Banks Infinity Tech Group Technical Product(Unysis 信任银行 Infinity 技术集团技术产品) Fairfax, USA(美国费尔法克斯) Atlanta, USA(美国亚特兰大) Sunnyvale, USA(美国 Sunnyvale) Persistent Systems IEEE Vice Chair AeroSpace Chapter(Persistent 系统 IEEE 副主席航空航天分会) Discover Financial Services(Discover 金融服务) Edison, USA(美国埃迪森)

AI总结 提出TMR-GGNN框架,通过时间窗口内异构实体交互建模、动态多关系图构建、时间感知注意力机制和对比学习解码器,结合InfoNCE与Focal Loss复合损失函数,解决数据不平衡和欺诈模式演化问题。

Comments 2025 2nd International Conference on Software, Systems and Information Technology (SSITCON), Pages 7

详情
AI中文摘要

近年来,由于高度不平衡的数据、不断演变的欺诈模式以及交易实体间复杂的关联结构,信用卡欺诈检测面临重大挑战。为解决这些问题,本研究提出了一种名为时间感知多关系引导图神经网络(TMR-GGNN)的新框架。具体而言,所提出的TMR-GGNN通过建模客户、商户、设备和IP在时间窗口内的异构交互,扩展了编码器-解码器图神经网络(GNN)架构。随后,该TMR-GGNN方法构建了一个动态的多关系图,并在编码器中引入时间感知关系注意力机制,以基于时间邻近性和语义上下文自适应地权衡交易相关性。因此,解码器采用对比学习模块来区分真实和合成的交易模式,同时提高模型对罕见欺诈案例的泛化能力。此外,为有效管理严重的类别不平衡并强调判别性学习,引入了结合基于信息噪声对比估计(InfoNCE)的对比损失与Focal Loss的复合损失函数。这种集成有助于改进欺诈识别,同时减少假阴性。

英文摘要

In recent years, credit card fraud detection has faced significant challenges due to highly imbalanced data, evolving fraud patterns, and complex relational structures among transaction entities. To address these issues, this research proposes a novel framework called Time-aware Multi-Relational Guided Graph Neural Network (TMR-GGNN). In particular, the proposed TMR-GGNN extends the encoder-decoder Graph Neural Network (GNN) architecture by modeling heterogeneous interactions across customers, merchants, devices, and IPs over temporal windows. The TMR-GGNN approach then constructs a dynamic, multi-relational graph and incorporates a time-aware relational attention mechanism within the encoder to adaptively weigh transaction relevance based on temporal proximity and semantic context. The decoder employs a contrastive learning module to distinguish between real and synthesized transaction patterns, while improving the model's generalization to rare fraud cases. Additionally, to effectively manage severe class imbalance and emphasize discriminative learning, a composite loss function combining an Information Noise Contrastive Estimation (InfoNCE) based contrastive loss with Focal Loss is introduced. This integration helps improve fraud identification while reducing false negatives.
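The composite objective can be sketched from the standard definitions of the two losses. The binary focal loss and the single-anchor InfoNCE below follow their usual formulas; how the paper weights and combines them is not stated in the abstract, so the additive form with a weight `lam` and all function names are hypothetical.

```python
import numpy as np

def focal_loss(p, y, gamma=2.0, eps=1e-12):
    """Binary focal loss; p is the predicted fraud probability, y is 0 or 1."""
    p, y = np.asarray(p, dtype=float), np.asarray(y)
    p_t = np.where(y == 1, p, 1.0 - p)
    return float(np.mean(-((1.0 - p_t) ** gamma) * np.log(p_t + eps)))

def info_nce(anchor, positive, negatives, temperature=0.1):
    """InfoNCE for one anchor, using cosine similarities."""
    def unit(v):
        v = np.asarray(v, dtype=float)
        return v / np.linalg.norm(v, axis=-1, keepdims=True)
    a, pos, neg = unit(anchor), unit(positive), unit(negatives)
    logits = np.concatenate([[a @ pos], neg @ a]) / temperature
    logits -= logits.max()                       # numerical stability
    return float(-logits[0] + np.log(np.exp(logits).sum()))

def composite_loss(p, y, anchor, positive, negatives, lam=0.5):
    """Hypothetical weighting: focal loss plus lam times InfoNCE."""
    return focal_loss(p, y) + lam * info_nce(anchor, positive, negatives)
```

With `gamma=0` the focal term reduces to ordinary binary cross-entropy; larger `gamma` down-weights easy, mostly legitimate transactions so that rare fraud cases contribute more to the gradient.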

2606.18621 2026-06-18 cs.LG 新提交

Towards Anomaly Detection on Relational Data

面向关系数据的异常检测

Shiyuan Li, Yunfeng Zhao, Yue Tan, Qingfeng Chen, Yixin Liu, Shirui Pan

发表机构 * Griffith University(格里菲斯大学) Guangxi University(广西大学)

AI总结 提出RelAD框架,通过条件稀疏门控属性重建和双视图多关系边重建,有效检测关系数据中的属性异常和连接模式异常,在6个基准数据集上优于现有方法。

详情
AI中文摘要

关系数据库广泛应用于现实系统中管理结构化数据。从这类关系数据中检测异常对于识别欺诈、风险和异常行为至关重要,但尚未得到充分探索。关键挑战在于关系数据的内在复杂性:多表属性是高维且异质的,使得稀疏的异常线索容易被正常或无关信息淹没;异常还可能表现为跨不同外键关系的异常连接模式,而现有的表格和图异常检测方法难以捕捉。为解决这些问题,我们提出RelAD,一个基于重建的框架,从属性和关系边重建中捕捉异常。RelAD包含两个核心模块:条件稀疏门控属性重建,抑制冗余的多表属性并强调异常语义块;以及双视图多关系边重建,从内在和行为实体画像中检测关系特定的异常连接。得到的属性和关系信号通过轻量级融合模块整合,产生最终异常分数。我们进一步构建了6个具有系统性异常的基准数据集,大量实验表明RelAD在取得竞争性效率的同时,始终优于其他基线方法。

英文摘要

Relational databases are widely used for managing structured data in real-world systems. Detecting anomalies from such relational data is crucial for identifying fraud, risks, and abnormal behaviors, yet remains under-explored. The key challenges lie in the intrinsic complexity of relational data: multi-table attributes are high-dimensional and heterogeneous, so sparse abnormal clues are easily overwhelmed by normal or irrelevant information; and anomalies may further manifest as abnormal connection patterns across different foreign-key relations, which existing tabular and graph anomaly detection methods are ill-suited to capture. To address these challenges, we propose RelAD, a reconstruction-based framework that captures anomalies from both attribute and relational edge reconstruction. RelAD contains two core modules: conditional sparse-gated attribute reconstruction, which suppresses redundant multi-table attributes and emphasizes abnormal semantic blocks, and dual-view multi-relational edge reconstruction, which detects relation-specific abnormal connections from both intrinsic and behavioral entity profiles. The resulting attribute and relational signals are integrated through a lightweight fusion module to produce the final anomaly score. We further construct 6 benchmark datasets with systematic anomalies, on which extensive experiments show that RelAD consistently outperforms other baselines while achieving competitive efficiency.

2606.19185 2026-06-18 cs.LG 新提交

AGDN: Learning to Solve Traveling Salesman Problem with Anisotropic Graph Diffusion Network

AGDN:利用各向异性图扩散网络学习求解旅行商问题

Bolin Shen, Ziwei Huang, Zhiguang Cao, Yushun Dong

发表机构 * Florida State University(佛罗里达州立大学) Singapore Management University(新加坡管理大学)

AI总结 提出各向异性图扩散网络(AGDN),通过MixScore转移矩阵和各向异性扩散策略,有效利用图结构信息求解旅行商问题,在多种实例规模和分布上优于现有方法。

Comments Accepted at the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2026)

详情
AI中文摘要

旅行商问题(TSP)是组合优化的基石,出现在许多实际场景中。尽管基于图的学习方法已被探索用于TSP,但如何更有效地利用图结构的问题仍然悬而未决。我们提出了各向异性图扩散网络(AGDN),一种新的图神经网络框架,旨在求解TSP。我们的方法解决了两个核心难点:(1)完全连接TSP图中缺乏信息丰富的拓扑先验,以及(2)在常用的图稀疏化技术后,最优解中丢失连接节点。为了克服这些问题,我们构建了一个MixScore转移矩阵,将节点相似性与成对距离相结合,并开发了一种各向异性图扩散策略,支持跨多跳的高效信息交换。涵盖不同实例规模和节点分布的全面实验表明,AGDN在保持计算时间竞争力的同时,始终优于现有方法。此外,AGDN能够很好地泛化到训练期间未见的问题规模和分布。实现代码已公开在:https://github.com/LabRAI/AGDN。

英文摘要

The Traveling Salesman Problem (TSP) is a cornerstone of combinatorial optimization and arises in many practical scenarios. Although graph-based learning approaches have been explored for TSP, the question of how to exploit graph structure more effectively remains open. We present the Anisotropic Graph Diffusion Network (AGDN), a new Graph Neural Network framework designed to solve TSP. Our method tackles two central difficulties: (1) the lack of informative topological prior in fully connected TSP graphs, and (2) losing connected nodes in the optimal solution after the commonly used graph sparsification techniques. To overcome these issues, we construct a MixScore transition matrix that merges node similarity with pairwise distance, and we develop an anisotropic graph diffusion strategy that supports efficient information exchange across multiple hops. Comprehensive experiments spanning diverse instance sizes and node distributions show that AGDN consistently outperforms existing methods while keeping computation time competitive. Furthermore, AGDN generalizes well to problem sizes and distributions beyond those seen during training. The implementation is publicly available at: https://github.com/LabRAI/AGDN.

2606.19303 2026-06-18 cs.LG 新提交

P-K-GCN: Physics-augmented Koopman-enhanced Graph Convolutional Network for Deep Spatiotemporal Super-resolution

P-K-GCN:物理增强的Koopman图卷积网络用于深度时空超分辨率

Xizhuo Zhang, Zekai Wang, Fei Liu, Bing Yao

发表机构 * Department of Industrial & Systems Engineering, The University of Tennessee, Knoxville(田纳西大学诺克斯维尔分校工业与系统工程系) Charles F. Dolan School of Business, Fairfield University(费尔菲尔德大学查尔斯·F·多兰商学院) Department of Electrical Engineering & Computer Science, The University of Tennessee, Knoxville(田纳西大学诺克斯维尔分校电气工程与计算机科学系)

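AI Summary Proposes P-K-GCN, which combines a spline-based GCN with Koopman operator theory for spatiotemporal super-resolution on irregular geometries, adding a physics-based loss and a theoretical analysis of error reduction.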

详情
AI中文摘要

高保真时空动力学模拟计算成本高昂,因此需要高效的超分辨率技术从粗粒度输入重建高分辨率数据。传统数据驱动方法缺乏物理约束,而简单的物理信息学习难以处理不规则空间几何和复杂时间演化。为解决这些问题,我们提出了一种物理增强的Koopman图卷积网络(P-K-GCN),用于不规则几何上的时空超分辨率。具体地,首先设计了一个基于连续样条的GCN,直接从粗粒度图中提取空间依赖关系,并引入Koopman算子理论将非线性动力学投影到紧凑的潜空间,其中时间演化被线性化。其次,我们通过基于物理的损失增强优化目标,迫使数据驱动重建遵循物理定律,以提高预测保真度和鲁棒性。最后,我们提供了严格的理论分析,证明物理增强和Koopman正则化通过减少Rademacher复杂度和收紧泛化界,数学上保证了超分辨率误差的降低。我们在从稀疏低分辨率测量重建三维心脏几何上的高分辨率心脏电动力学上评估了我们的框架。数值实验表明,我们的方法相比基线模型实现了更高的精度。

英文摘要

High-fidelity simulation of spatiotemporal dynamics is computationally prohibitive, necessitating efficient super-resolution techniques to reconstruct high-resolution data from coarse-grained inputs. Traditional data-driven methods often lack physical constraints, and simple physics-informed learning struggles with irregular spatial geometries and intricately evolving temporal dynamics. To tackle these challenges, we propose a Physics-augmented Koopman-enhanced Graph Convolutional Network (P-K-GCN) for spatiotemporal super-resolution on irregular geometries. Specifically, a continuous spline-based GCN is first designed to extract spatial dependencies directly from the coarse graph, and Koopman operator theory is incorporated to project the nonlinear dynamics into a compact latent space where temporal progression is linearized. Second, we augment the optimization objective with a physics-based loss to force the data-driven reconstructions to adhere to physical laws for improving predictive fidelity and robustness. Finally, we provide a rigorous theoretical analysis, establishing that the physics augmentation and Koopman regularization mathematically guarantee a reduction in super-resolution error by diminishing Rademacher complexity and tightening generalization bounds. We evaluate our framework on reconstructing spatially high-resolution cardiac electrodynamics across a 3D heart geometry from sparse low-resolution measurements. Numerical experiments demonstrate that our method achieves superior accuracy compared to baseline models.
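As a rough illustration of how the three ingredients named in the abstract could combine into a single objective, one plausible form is written out below. All symbols are assumptions introduced here, not the paper's notation: $\phi$ is the spline-GCN encoder, $\psi$ a decoder to the fine mesh, $K$ a learned Koopman matrix, $\mathcal{R}$ a residual of the governing equations, and $\lambda_{\mathrm{K}}, \lambda_{\mathrm{phys}}$ are weights.

```latex
% Hypothetical notation: x_t is the coarse graph signal at time t,
% y_t the high-resolution target, \hat{y}_t the reconstruction.
\begin{aligned}
z_t &= \phi(x_t), \qquad z_{t+1} \approx K z_t, \qquad \hat{y}_t = \psi(z_t), \\
\mathcal{L} &= \underbrace{\sum_t \lVert \hat{y}_t - y_t \rVert^2}_{\text{data}}
  + \lambda_{\mathrm{K}} \underbrace{\sum_t \lVert \phi(x_{t+1}) - K\,\phi(x_t) \rVert^2}_{\text{Koopman consistency}}
  + \lambda_{\mathrm{phys}} \underbrace{\sum_t \lVert \mathcal{R}(\hat{y}_t) \rVert^2}_{\text{physics residual}}.
\end{aligned}
```

The middle term is what makes the latent dynamics linear: it penalizes any encoding in which one step of time evolution is not well approximated by multiplication with $K$.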

10. Transfer, Meta-Learning, and Continual Learning 1 paper

2606.19164 2026-06-18 cs.LG cs.AI 新提交

Essential Subspace Merging for Multi-Task Learning

多任务学习的本质子空间合并

Longhua Li, Lei Qi, Xin Geng, Qi Tian

发表机构 * School of Computer Science and Engineering, Southeast University(东南大学计算机科学与工程学院) Key Laboratory of New Generation Artificial Intelligence Technology and Its Interdisciplinary Applications (Southeast University), Ministry of Education(教育部新一代人工智能技术及其跨学科应用重点实验室(东南大学)) Huawei Inc.(华为公司)

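AI Summary Proposes Essential Subspace Decomposition (ESD) and Essential Subspace Merging (ESM/ESM++), which reduce interference in multi-task merging by orthogonalizing the principal components of task updates, enabling training-free multi-task learning.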

详情
AI中文摘要

模型合并旨在通过将多个从同一预训练检查点微调得到的模型的能力集成到一个单一模型中,从而实现多任务学习。其核心挑战是任务特定参数更新之间的任务间干扰。在本文中,我们分析了任务更新引起的输出偏移,并观察到它们的能量集中在少数主方向上。我们将这些方向张成的子空间称为本质子空间。相比之下,大多数剩余方向携带的任务相关能量很少,但它们在多个任务更新中的累积会在合并过程中引起严重干扰。受此观察启发,我们提出了本质子空间分解(ESD),它根据激活偏移的主成分分解每个任务更新。基于ESD,我们引入了本质子空间合并(ESM),一种无需训练的静态合并方法,它将本质成分正交化并融合成一个紧凑的多任务模型。我们进一步将ESM扩展到ESM++,一种无需训练的动态合并方法,它将任务特定残差分解为低秩专家,并在前向推理过程中通过基于原型的路由选择最相关的专家。跨多个任务集和模型规模的大量实验表明,ESM和ESM++在减少任务间干扰的同时有效保留了任务知识。

英文摘要

Model merging aims to enable multi-task learning by integrating the capabilities of multiple models fine-tuned from the same pre-trained checkpoint into a single model. Its core challenge is inter-task interference among task-specific parameter updates. In this paper, we analyze the output shifts induced by task updates and observe that their energy is concentrated in a small number of principal directions. We call the subspace spanned by these directions the essential subspace. In contrast, most remaining directions carry little task-relevant energy, but their accumulation across multiple task updates can cause severe interference during merging. Motivated by this observation, we propose Essential Subspace Decomposition (ESD), which decomposes each task update according to the principal components of its activation shift. Based on ESD, we introduce Essential Subspace Merging (ESM), a training-free static merging method that orthogonalizes and fuses essential components into one compact multi-task model. We further extend ESM to ESM++, a training-free dynamic merging method that decomposes task-specific residuals into low-rank experts and selects the most relevant expert through prototype-based routing during forward inference. Extensive experiments across multiple task sets and model scales demonstrate that ESM and ESM++ effectively preserve task knowledge while reducing inter-task interference.
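To make the decomposition step concrete, here is a minimal single-direction sketch, assuming a linear layer with inputs `X` and a task update `dW`. It finds the leading principal direction of the activation shift `X @ dW` by power iteration and splits `dW` into the part acting along that direction and a residual. The function names and the rank-1 restriction are assumptions for illustration; the paper's ESD presumably keeps several directions and adds an orthogonalization step across tasks.

```python
import math

def matmul(A, B):
    """Plain-Python matrix product of two lists of rows."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def top_direction(C, iters=200):
    """Power iteration: leading eigenvector of a symmetric PSD matrix C."""
    v = [1.0 / math.sqrt(len(C))] * len(C)
    for _ in range(iters):
        w = [sum(c * x for c, x in zip(row, v)) for row in C]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0:
            break  # C is zero (or v is in its null space); keep the current v
        v = [x / norm for x in w]
    return v

def essential_component(X, dW):
    """Split dW into its part along the top output-shift direction and the rest."""
    dY = matmul(X, dW)                 # activation shift, shape (n, d_out)
    C = matmul(transpose(dY), dY)      # (d_out, d_out) second-moment matrix
    u = top_direction(C)
    # Project every row of dW onto u: dW_ess = dW u u^T.
    ess = [[sum(r * ui for r, ui in zip(row, u)) * uj for uj in u] for row in dW]
    res = [[a - b for a, b in zip(r1, r2)] for r1, r2 in zip(dW, ess)]
    return ess, res, u
```

When the update only moves outputs along one direction, the residual vanishes and the essential component reproduces `dW`; in general the residual holds the low-energy directions that the abstract identifies as the source of interference.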

11. Datasets, Benchmarks, and Evaluation 12 papers

2606.18307 2026-06-18 cs.LG cs.AI 新提交

DRIFT: Refining Instruction Data via On-Policy Data Attribution

DRIFT: 通过在线策略数据归因优化指令数据

Zefan Wang, Lincheng Li, Tianyu Yu, Yuan Yao

发表机构 * Tsinghua University(清华大学)

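AI Summary Proposes DRIFT, which uses on-policy influence functions to address the proximity gap and gradient-norm bias that standard influence functions exhibit in instruction-tuning data attribution, using the model's own generations as validation targets and raising the performance ceiling of 7B models.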

详情
AI中文摘要

优化监督微调(SFT)的训练数据分布决定了大型语言模型(LLMs)的能力。虽然现有的数据筛选方法在有限预算下加速训练方面表现出色,但它们不太适合提升能力上限。这里的挑战不再是识别一个保持性能的较小子集,而是将数据分布优化为最能提升最终模型的实例。为了解决这个问题,我们探索了使用影响函数(IF)进行实例级数据归因。我们发现标准IF公式在此设置中存在两个结构限制:由离策略验证目标引起的近邻偏差,以及对梯度范数的严重偏向。我们提出了DRIFT(通过在线策略影响函数进行数据优化用于监督微调)。DRIFT不依赖外部参考数据,而是利用模型的在线策略生成作为验证目标,这在经验上最小化了参数近邻偏差,并更好地符合IF的局部邻域假设。它进一步基于轨迹正确性应用符号加权,并针对梯度操纵问题对影响分数进行去偏,使得少量验证查询能够作为可靠锚点来归因整个数据集。在7B参数指令和推理模型上的实验表明,DRIFT持续提升了两者的性能上限,优于现有的数据筛选基线。

英文摘要

Optimizing the training data distribution for Supervised Fine-Tuning (SFT) dictates the capability of Large Language Models (LLMs). While existing data curation methods excel at accelerating training under constrained budgets, they are less suited to elevating the capability upper bound. The challenge here is no longer to identify a smaller subset that preserves performance, but to refine the data distribution toward instances most capable of improving the final model. To address this problem, we explore instance-level data attribution using Influence Functions (IF). We identify that standard IF formulations struggle in this setting due to two structural limitations: a proximity gap caused by off-policy validation targets, and a severe bias towards gradient norm. We propose DRIFT (Data Refinement via On-Policy Influence Functions for Supervised Fine-Tuning). Instead of relying on external reference data, DRIFT utilizes the model's on-policy rollouts as validation targets, which empirically minimizes the parameter proximity gap and better aligns with the local neighborhood assumption of IF. It further applies signed weighting based on trajectory correctness and debiases influence scores against the gradient hacking issue, allowing a small set of validation queries to act as reliable anchors for attributing the full dataset. Experiments on 7B-parameter instruction and reasoning models show that DRIFT consistently raises the performance ceiling on both, outperforming existing data curation baselines.
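The scoring idea can be sketched with a first-order simplification: replace the full influence function (which involves an inverse Hessian) with gradient alignment, use cosine similarity so that large-norm training gradients do not dominate, and flip the sign for rollouts judged incorrect. This is an assumption-laden illustration rather than DRIFT's actual estimator; `score_training_examples` and its inputs are hypothetical names, and gradients are plain lists of floats.

```python
import math

def cosine(u, v):
    """Cosine similarity; removes the dependence on gradient norm."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu > 0 and nv > 0 else 0.0

def score_training_examples(train_grads, rollout_grads, rollout_correct):
    """Average signed alignment between each training gradient and the rollout gradients.

    rollout_correct[k] is True when the k-th on-policy rollout was judged correct.
    Aligning with a correct rollout counts in favour; aligning with an incorrect one counts against.
    """
    scores = []
    for g in train_grads:
        total = 0.0
        for r, ok in zip(rollout_grads, rollout_correct):
            sign = 1.0 if ok else -1.0
            total += sign * cosine(g, r)
        scores.append(total / len(rollout_grads))
    return scores
```

With this normalization, two training examples whose gradients point the same way receive the same score regardless of magnitude, which is the behaviour the abstract's "bias towards gradient norm" remark suggests is wanted.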

2606.18338 2026-06-18 cs.LG astro-ph.EP astro-ph.IM 新提交

ThousandWorlds: A benchmark for climate emulation of potentially habitable exoplanets

ThousandWorlds: 一个用于潜在宜居系外行星气候模拟的基准数据集

Edward T. Stevenson, Mei Ting Mak, Eric Wolf, Denis E. Sergeev, Tobi Hammond, N. J. Mayne, Miles Cranmer

发表机构 * University of Cambridge(剑桥大学) University of Oxford(牛津大学) University of Colorado Boulder(科罗拉多大学博尔德分校) University of Bristol(布里斯托大学) Purdue University(普渡大学) University of Exeter(埃克塞特大学)

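AI Summary To accelerate exoplanet climate simulation, introduces the ThousandWorlds benchmark dataset of roughly 1,800 simulations from five global climate models, for evaluating machine-learning emulators on low-data, multi-simulator, parameter-to-field regression tasks.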

Comments 10 pages main text, 26 pages references/appendix, plus NeurIPS checklist. Data at https://doi.org/10.57967/hf/8695. Code at https://github.com/edstevenson/ThousandWorlds

详情
AI中文摘要

寻找地球以外生命将依赖于探测潜在宜居系外行星大气中的微弱特征。解释这些特征需要了解宿主行星的气候:同一分子可能在一颗行星上标志着生命,而在另一颗行星上则是非生物化学的结果。全球气候模型(GCM)提供了这种理解,但单次运行可能需要多达数百万核心小时和大量领域专家时间。机器学习模拟器可以消除这一瓶颈,但由于缺乏经过整理的多模型系外气候数据集,进展受到限制。我们介绍了ThousandWorlds,这是一个为系外气候模拟以及更广泛的低数据、多模拟器、参数到场回归任务设计的ML就绪基准数据集。该数据集包含来自五个GCM的大约1800次模拟,将八个行星参数映射到三维大气场,包括温度、湿度、风、云和辐射。三个嵌套子集定义了逐步增加的挑战:单模拟器回归、具有完整观测的多模拟器回归以及具有结构化缺失的多模拟器回归。我们提出了两个评估协议:一个用于方法排名,另一个用于衡量相对于GCM自身分歧的性能。我们评估了七种基线方法,涵盖简单方法、深度学习和高斯过程。基于GP的方法表现最佳,表明ThousandWorlds揭示了一个现成深度学习尚未成功的领域。数据:此https URL。代码:此https URL。

英文摘要

The search for life beyond Earth will depend on detecting faint signatures in the atmospheres of potentially habitable exoplanets. Interpreting those signatures requires understanding the host planet's climate: the same molecule may signal life on one planet and abiotic chemistry on another. Global climate models (GCMs) provide this understanding, but individual runs can require up to millions of core-hours and substantial domain expert time. Machine-learning emulators could remove this bottleneck, but progress has been limited by the absence of a curated, multi-model exoclimate dataset. We introduce ThousandWorlds, an ML-ready benchmark for exoclimate emulation and for the broader regime of low-data, multi-simulator, parameter-to-field regression. The dataset contains approximately 1800 simulations from five GCMs, mapping eight planet parameters to 3D atmospheric fields including temperature, humidity, winds, clouds, and radiation. Three nested subsets define progressively harder challenges: single-simulator regression, multi-simulator regression with complete observations, and multi-simulator regression with structured missingness. We propose two evaluation protocols: one for ranking methods, and one that measures performance relative to the disagreement between GCMs themselves. We evaluate seven baselines spanning simple methods, deep learning, and Gaussian processes. GP-based methods perform best, suggesting that ThousandWorlds exposes a regime where off-the-shelf deep learning does not yet succeed. Data: https://doi.org/10.57967/hf/8695. Code: https://github.com/edstevenson/ThousandWorlds.

2606.18367 2026-06-18 cs.LG 新提交

Do Time Series Foundation Model Benchmarks Hide Regime-Dependent Failures? Evidence from Traffic Speed Forecasting

时间序列基础模型基准是否隐藏了依赖于状态的失败?来自交通速度预测的证据

Yingshuo Wang, Xian Sun, Lingdong Kong, Wei Gao, Yanhang Li, Zhichao Fan, Zexin Zhuang

发表机构 * University of California, Berkeley(加州大学伯克利分校) Duke University(杜克大学) National University of Singapore(新加坡国立大学) Northeastern University(东北大学) University of Illinois Urbana-Champaign(伊利诺伊大学厄巴纳-香槟分校) Southern Methodist University(南卫理公会大学)

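AI Summary Proposes regime-stratified evaluation, finds that the accuracy and prediction-interval coverage of time series foundation models drop sharply during traffic regime transitions, and proposes bimodal mixture augmentation to improve transition-regime coverage.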

Comments 5 pages, 2 figures. Accepted at the Workshop on Forecasting as a New Frontier of Intelligence, ICML 2026

详情
AI中文摘要

标准基准使用聚合指标评估时间序列基础模型(TSFMs),但这可能掩盖关键运行状态下的严重失败。我们引入了状态分层评估,并将其应用于两个标准交通速度基准上的三个TSFMs。交通在自由流和拥堵状态之间表现出突然的状态切换,在转换期间产生双峰速度分布。当我们按交通状态分层时,准确率和预测区间覆盖率在转换期间急剧下降:转换状态的MAE达到11 mph(而总体为3 mph),90%预测区间的经验覆盖率低至55%。这些失败在聚合指标中不可见,因为自由流观测主导了样本。一个简单的历史条件基线(从每个传感器的训练分布中采样)实现了比任何TSFM更好的转换覆盖率,但总体准确率差得多。我们提出了双峰混合增强(BMA),一种后处理方法,将TSFM预测与历史分布知识相结合,在保持TSFM准确率的同时接近历史基线的转换覆盖率。我们的结果表明,TSFM基准应纳入状态感知评估,以揭示聚合指标隐藏的失败。

英文摘要

Standard benchmarks evaluate time series foundation models (TSFMs) using aggregate metrics, but these can mask severe failures in critical operating regimes. We introduce regime-stratified evaluation and apply it to three TSFMs on two standard traffic speed benchmarks. Traffic exhibits abrupt regime switching between free-flow and congested states, producing bimodal speed distributions during transitions. When we stratify by traffic regime, both accuracy and prediction-interval coverage degrade sharply during transitions: transition-regime MAE reaches 11 mph (versus 3 mph overall), and empirical coverage of 90% prediction intervals drops as low as 55%. These failures are invisible in aggregate metrics because free-flow observations dominate the sample. A simple historical conditional baseline (sampling from per-sensor training distributions) achieves better transition coverage than any TSFM, but has far worse overall accuracy. We propose bimodal mixture augmentation (BMA), a post-hoc method that combines TSFM forecasts with historical distributional knowledge, approaching the historical baseline's transition coverage while preserving the TSFM's accuracy. Our results suggest that TSFM benchmarks should incorporate regime-aware evaluation to surface failures that aggregate metrics hide.
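Regime-stratified evaluation amounts to computing the usual metrics per regime label instead of only over the pooled sample. A minimal sketch follows, assuming each forecast record already carries a regime label; how regimes are assigned is not stated in the abstract, and the record layout and function name here are hypothetical.

```python
from collections import defaultdict

def stratified_metrics(records):
    """Compute MAE and interval coverage per regime and overall.

    records: iterable of (regime, y_true, y_pred, lower, upper) tuples,
    where [lower, upper] is the prediction interval for y_true.
    """
    buckets = defaultdict(list)
    for regime, y, y_hat, lower, upper in records:
        entry = (abs(y - y_hat), lower <= y <= upper)
        buckets[regime].append(entry)
        buckets["overall"].append(entry)
    return {
        name: {
            "mae": sum(err for err, _ in rows) / len(rows),
            "coverage": sum(1 for _, hit in rows if hit) / len(rows),
            "n": len(rows),
        }
        for name, rows in buckets.items()
    }
```

Reporting `n` alongside each metric makes the masking effect visible: when one regime supplies most of the rows, the overall figures track that regime and say little about the others.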

2606.18451 2026-06-18 cs.LG 新提交

A Cross-Model VLM-Judge Protocol for Single-Image 3D Mesh Quality (and Why Cheap Proxies Fall Short)

跨模型VLM评判协议用于单图像3D网格质量(以及为什么廉价代理方法不足)

Ali Asaria, Tony Salomone, Deep Gandhi

发表机构 * Transformer Lab

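AI Summary Proposes a reproducible VLM-judge protocol for evaluating single-image 3D mesh quality and finds that cheap proxies such as geometry validity and render-CLIP cannot substitute for the VLM judge.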

详情
AI中文摘要

单图像到3D生成器正在快速改进,但目前没有公认的、无需人工的方法来判断生成的网格是否优于另一个。从业者通常依赖廉价的自动代理方法(渲染空间的CLIP相似性和网格几何有效性统计),但这些方法在多大程度上跟踪感知质量尚未确定。我们做出两项贡献。首先,我们提出并验证了一个可重复的VLM评判评估协议:一个固定的24视角无头渲染装置、两个独立的视觉语言评判家族,以及一个强制的位置偏差校正,该校正查询两种呈现顺序并仅保留顺序一致的判决。两个评判家族彼此高度一致(Cohen's kappa = 0.66),远高于随机一致性基线。其次,以该协议为参考,我们证明廉价代理方法无法替代它。几何有效性平均而言仅是一个弱信号(因为,如我们所示,它是双峰的),且低于我们预先注册的目标,而渲染CLIP则处于随机水平。一个学习的Bradley-Terry头部坍缩到一个单一流形统计量(给渲染CLIP赋予负权重),并且与仅几何方法完全匹配,因此学习特征权重毫无收益。该代理方法也是双峰的:在具有可见几何缺陷的对比中显著高于随机水平,但在模糊对比中处于随机水平,这与几何有效性仅在缺陷视觉显著时跟踪评判者的行为一致。因此,我们推荐VLM评判协议作为在测试条件下(Google Scanned Objects上的两个前馈生成器,采用面丢失退化机制)可靠且可重复的评估器,并建议不要将几何/CLIP代理方法作为优化目标。

英文摘要

Single-image-to-3D generators are improving quickly, but there is no agreed, human-free way to tell whether one generated mesh is better than another. Practitioners commonly rely on cheap automatic proxies (render-space CLIP similarity and mesh geometry-validity statistics), yet how well these track perceived quality is unestablished. We make two contributions. First, we propose and validate a reproducible VLM-judge evaluation protocol: a fixed 24-view headless render rig, two independent vision-language judge families, and a mandatory position-bias correction that queries both presentation orders and keeps only order-consistent verdicts. The two judge families agree substantially with each other (Cohen's kappa = 0.66), well above the chance-agreement floor. Second, using this protocol as the reference, we show the cheap proxies do not substitute for it. Geometry validity is only a weak signal on average (because, as we show, it is bimodal) and stays below our pre-registered target, while render-CLIP is at chance. A learned Bradley-Terry head collapses onto a single manifoldness statistic (giving render-CLIP a negative weight) and matches geometry-only exactly, so learning the feature weights buys nothing. The proxy is also bimodal: it is significantly above chance on contrasts with visible geometric defects but at chance on ambiguous contrasts, consistent with geometry validity tracking the judge only when the defect is visually salient. We therefore recommend the VLM-judge protocol as a reliable, reproducible evaluator under the conditions tested (two feed-forward generators on Google Scanned Objects, with a face-drop degradation regime) and advise against geometry/CLIP proxies as optimization targets.
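Two pieces of the protocol are simple enough to sketch directly: the position-bias correction (ask the judge in both presentation orders and keep the verdict only if it is order-consistent) and Cohen's kappa for agreement between two judges. The `judge` callable and its `"first"`/`"second"` return values are assumptions made for this example; the kappa formula is the standard one.

```python
def order_consistent_verdict(judge, mesh_a, mesh_b):
    """Query both presentation orders; return "a", "b", or None if the judge flips.

    judge(x, y) is assumed to return "first" or "second", naming the better item.
    """
    forward = judge(mesh_a, mesh_b)
    backward = judge(mesh_b, mesh_a)
    if forward == "first" and backward == "second":
        return "a"
    if forward == "second" and backward == "first":
        return "b"
    return None  # verdict depends on position, so it is discarded

def cohens_kappa(labels_1, labels_2):
    """Cohen's kappa: (p_o - p_e) / (1 - p_e) for two equal-length label lists."""
    n = len(labels_1)
    p_o = sum(1 for x, y in zip(labels_1, labels_2) if x == y) / n
    categories = set(labels_1) | set(labels_2)
    p_e = sum((labels_1.count(c) / n) * (labels_2.count(c) / n) for c in categories)
    return 1.0 if p_e == 1.0 else (p_o - p_e) / (1.0 - p_e)
```

A judge that always prefers whichever item is shown first yields `None` for every pair, so its position bias contributes no verdicts at all.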

2606.18539 2026-06-18 cs.LG stat.ML 新提交

TS-Fault: Benchmarking Time Series Forecasters Against Structural Faults

TS-Fault: 针对结构性故障的时间序列预测器基准测试

Yuyang Zhao, Lian Xu, Hao Miao, Chenxi Liu, Hao Xue

发表机构 * Ray-zyy

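AI Summary Proposes the TS-Fault benchmark, which evaluates the robustness of time series forecasting models under parameterized fault scenarios organized along two axes (observation- vs mechanism-level, univariate vs multivariate), and finds that clean-data accuracy anti-correlates with robustness, mechanism-level faults reshuffle rankings, and foundation models are the most fragile.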

详情
AI中文摘要

时间序列预测(TSF)支撑着能源、交通、金融和医疗等领域的关键决策,然而TSF模型几乎普遍通过在干净保留数据上的单一数字(如平均误差)进行排名,隐含假设该数字能预测部署可靠性。但实际故障并非独立同分布噪声,而是具有时间形状的结构化事件、断裂的跨变量依赖、伴随缺失的机制变化以及跨传感管道的因果传播。将TSF鲁棒性视为数据质量问题,我们提出TS-Fault,一个在显式、参数化且具有可控语义难度的故障场景下评估预测模型的基准。TS-Fault将重复出现的故障沿两个正交轴(观测级 vs 机制级;单变量 vs 多变量)组织为四种模式,并通过统一重要性评分将每种故障注入最关键的预测窗口。该设计使得鲁棒性能够针对模型实际依赖的结构进行测试,而非简化为通用噪声敏感性。我们在6个数据集、4种模式和5个难度级别上,采用配对干净/损坏协议评估了21个模型。结果揭示了三个与常见排行榜直觉相悖的发现:(i)干净数据准确性与鲁棒性负相关;(ii)干净排名在观测级故障下保持不变,但在机制级故障下重新洗牌;(iii)所有灾难性故障均发生在机制级故障下,基础模型在干净数据上准确率最高但表现出最大的脆弱性。代码已公开于该URL。

英文摘要

Time series forecasting (TSF) underpins consequential decisions in energy, transportation, finance, and healthcare, yet TSF models are almost universally ranked by a single number (e.g., average error) on clean held-out data, under the implicit assumption that it predicts deployed reliability. However, real faults are not i.i.d. noise but structured events with temporal shape, broken cross-variable dependencies, regime change coupled with missingness, and causal propagation across a sensing pipeline. Treating TSF robustness as a data-quality problem, we present TS-Fault, a benchmark that evaluates forecasting models under explicit, parameterized fault scenarios with controllable semantic difficulty. TS-Fault organizes recurring failures into four modes along two orthogonal axes (observation- vs mechanism-level; univariate vs multivariate) and injects each fault into the most prediction-critical window via a unified importance score. This design enables robustness to be tested against the structures models actually rely on, rather than reduced to generic noise sensitivity. We evaluate 21 models across 6 datasets, 4 modes, and 5 difficulty levels under a paired clean/corrupt protocol. The results reveal three findings that contradict common leaderboard intuition: (i) clean-data accuracy anti-correlates with robustness; (ii) clean rankings are preserved under observation-level faults but reshuffled under mechanism-level faults; and (iii) all catastrophic failures occur under mechanism-level faults, with foundation models achieving the highest clean-data accuracy yet exhibiting the greatest fragility. The code is publicly available at https://github.com/Ray-zyy/TS-Fault.

2606.18640 2026-06-18 cs.LG q-bio.QM 新提交

MetaboNet-Bench: A Multi-modal Benchmark for Glucose Forecasting in Type 1 Diabetes

MetaboNet-Bench:1型糖尿病血糖预测的多模态基准

Nathaniel Jeffries, Miriam Wolff, Sam Royston, Elizabeth Healey, Caleb Mayer, David Klonoff, Michael Snyder, Tao Wang

发表机构 * Department of Genetics, Stanford University School of Medicine(斯坦福大学医学院遗传学系) Replica Health Boston Children’s Hospital, Harvard Medical School(哈佛医学院波士顿儿童医院) Diabetes Research Institute, Mills-Peninsula Medical Center(米尔斯半岛医学中心糖尿病研究所)

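AI Summary Addressing the lack of standardized evaluation benchmarks for glucose forecasting in type 1 diabetes, proposes MetaboNet-Bench, a multimodal benchmark that integrates glucose, insulin, and carbohydrate data and compares several models to assess how multimodal data affects performance.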

Comments main content in 10 pages with 5 figures; supplementary section with 11 more pages and 5 more figures

详情
AI中文摘要

血糖预测算法是1型糖尿病血糖控制管理的重要方面。迄今为止,研究社区已经开发了大量预测算法和模型。然而,公认的是,缺乏标准化的模型性能评估基准使得公平比较变得困难,并阻碍了进一步的创新,因此基准标准化迫在眉睫。此外,许多已发表的血糖预测算法仅限于CGM数据,忽略了其他多模态信号,如胰岛素剂量和碳水化合物摄入。在此,我们介绍MetaboNet-Bench,这是一个针对1型糖尿病患者的多模态血糖预测基准,它提供了一个可扩展的开源评估框架,用于比较利用血糖、胰岛素和碳水化合物数据的血糖预测算法。然后,我们通过基准测试几个最近发布的血糖预测模型和一个自定义的多模态时间序列模型(代表不同的模型架构)来展示其实用性。结果表明,添加数据模态的好处取决于模型的复杂性,并且纳入更多临床指标有助于识别未来研究中有意义的空白。

英文摘要

Glucose forecasting algorithms are an important aspect of glycemic control management in type 1 diabetes. So far, the research community has developed numerous algorithms and models for forecasting. However, it is well recognized that the lack of standardized model performance evaluation benchmarks makes fair comparison difficult and hinders further innovation, so benchmark standardization is urgently needed. Furthermore, many published glucose forecasting algorithms are limited to CGM data alone, ignoring other multimodal signals such as insulin dosing and carbohydrate intake. Here, we introduce MetaboNet-Bench, a benchmark for multimodal glucose forecasting for patients with type 1 diabetes that provides an extensible open-source evaluation framework for comparison of glucose forecasting algorithms that leverage glucose, insulin, and carbohydrate data. We then demonstrate its utility by benchmarking several recently published glucose forecasting models and a custom multimodal time-series model, representing different model architectures. The results show that the benefit of adding data modalities is conditioned on the complexity of the model and that incorporating more clinical metrics helps identify meaningful gaps to fill for future research.

2606.18677 2026-06-18 cs.LG cs.AI 新提交

Bounded Context Management for Tabular Foundation Models on Stream Learning

表格基础模型在流学习中的有界上下文管理

Jinmo Lee, Doyun Choi, Moongi Choi, Jaemin Yoo

发表机构 * Seoul National University(首尔大学) KAIST(韩国科学技术院)

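AI Summary For distribution shift in tabular stream learning, proposes CURE, a context management policy that combines uncertainty-gated admission with redundancy-aware eviction and achieves up to 27.0% relative improvement across seven streams.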

Comments Accepted as a spotlight oral (top 5%) at the 2nd ICML Workshop on Foundation Models for Structured Data (FMSD@ICML2026)

详情
AI中文摘要

表格流学习需要在分布漂移下对顺序到达的样本进行预测。虽然标准方法通过更新模型状态来适应,但表格基础模型(TFMs)以上下文方式基于标记上下文进行预测,使其成为流学习的自然替代方案。这便将挑战从如何更新模型转移到如何管理上下文。我们提出一种未来信息视角,为上下文管理导出三个实际需求:保留最近样本、保留不确定样本、移除冗余样本。我们将这些需求实例化为CURE(通过不确定性感知准入和冗余感知驱逐的上下文管理),一种具有熵门控准入和冗余感知驱逐的上下文管理策略。在七个流上,CURE相比经典流学习器相对提升高达27.0%,在多个TFM骨干上保持鲁棒,并在其他策略变体中排名第一。代码和数据集可在该https URL获取。

英文摘要

Tabular stream learning requires predictions on sequentially arriving examples under distribution shift. While standard methods adapt by updating model states, tabular foundation models (TFMs) make predictions conditioned on a labeled context in an in-context manner, making them a natural alternative for stream learning. This shifts the challenge from how to update the model to how to manage the context. We propose a future information view that yields three practical requirements for context management: preserve recent examples, retain uncertain examples, and remove redundant examples. We instantiate these requirements as CURE (Context management via Uncertainty-aware admission and Redundancy-aware Eviction), a context-managing policy with entropy-gated admission and redundancy-aware eviction. Across seven streams, CURE shows up to 27.0% relative improvement over classical stream learners, remains robust across multiple TFM backbones, and ranks first among other policy variants. Code and datasets are available at https://github.com/morcellinus/CURE-ICML-FMSD.
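The three requirements can be turned into a small bounded buffer. The sketch below is an illustrative reading, not the CURE implementation: admission is gated on the entropy of the model's predicted class probabilities, and when the buffer overflows it evicts the older member of the closest same-label pair (falling back to the oldest item). The class name, thresholds, and the Euclidean redundancy measure are all assumptions.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of a discrete distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)

class BoundedContext:
    """Fixed-capacity labeled context for an in-context tabular model (hypothetical)."""

    def __init__(self, capacity, entropy_threshold):
        self.capacity = capacity
        self.entropy_threshold = entropy_threshold
        self.items = []  # (features, label) pairs, oldest first

    def update(self, x, y, predicted_probs):
        """Admit (x, y) only if the prediction for x was uncertain; return True if admitted."""
        if entropy(predicted_probs) < self.entropy_threshold:
            return False  # confidently predicted: treated as carrying little new information
        self.items.append((x, y))
        if len(self.items) > self.capacity:
            self._evict()
        return True

    def _evict(self):
        """Drop the older member of the closest same-label pair, else the oldest item."""
        best = None  # (distance, index of the older item in the pair)
        for i in range(len(self.items)):
            for j in range(i + 1, len(self.items)):
                (xi, yi), (xj, yj) = self.items[i], self.items[j]
                if yi == yj:
                    d = math.dist(xi, xj)
                    if best is None or d < best[0]:
                        best = (d, i)
        victim = best[1] if best is not None else 0
        del self.items[victim]
```

Evicting the older item of a redundant pair is what keeps the buffer biased toward recent examples; the pairwise scan is quadratic in the capacity, which is acceptable only for small contexts.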

2606.18774 2026-06-18 cs.LG 新提交

RouteJudge: An Open Platform for Reproducible and Preference-Aware LLM Routing

RouteJudge: 一个可复现且偏好感知的LLM路由开放平台

Guannan Lai, Haoran Hu, Han-Jia Ye

发表机构 * School of Artificial Intelligence, Nanjing University(南京大学人工智能学院) National Key Laboratory for Novel Software Technology, Nanjing University(南京大学计算机软件新技术国家重点实验室) SinapisAI

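AI Summary Proposes the RouteJudge platform, which evaluates the decision quality of LLM routing strategies through anonymous pairwise comparisons, and releases the ORBIT toolbox to standardize routing workflows, supporting reproducible and preference-aware routing evaluation.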

Comments Accepted by Pluralistic Alignment Workshop at ICML 2026

详情
AI中文摘要

我们提出RouteJudge,一个用于LLM路由系统的在线成对偏好评估框架,并提供一个公开平台(https://...)。与模型级别的响应评估不同,RouteJudge关注路由器级别的决策质量。对于每个用户查询,多个路由策略在相同的模型池和预算约束下独立推荐候选模型。然后通过匿名成对比较将所选模型的响应呈现给用户,由此产生的用户偏好归因于比较响应背后的路由策略。每条评估记录存储查询、路由决策、模型响应、偏好标签、成本、延迟和任务元数据,从而支持对LLM路由器进行偏好感知、成本感知和任务条件分析。为了支持RouteJudge中路由方法的持续扩展,我们进一步发布了ORBIT(最优路由与预算推理工具箱),这是一个模块化且可扩展的工具箱,标准化了LLM路由的端到端工作流。ORBIT为基准加载、查询表示、路由器实现、预算感知评估和方法比较提供了统一接口,允许研究人员在一致的协议下开发和评估路由算法。它同时作为RouteJudge的提交和集成层:研究人员可以在ORBIT中实现路由方法,在现有路由基准上验证它们,并提交兼容的路由器进行在线偏好评估。ORBIT的代码可在https://...获取。

英文摘要

We present RouteJudge, an online pairwise preference evaluation framework for LLM routing systems, with a public platform available at https://routejudge.cn. Different from model-level response evaluation, RouteJudge focuses on router-level decision quality. For each user query, multiple routing strategies independently recommend candidate models under the same model pool and budget constraints. The selected model responses are then presented to users through anonymous pairwise comparisons, and the resulting user preferences are attributed back to the routing strategies behind the compared responses. Each evaluation record stores the query, routing decisions, model responses, preference labels, cost, latency, and task metadata, enabling preference-aware, cost-aware, and task-conditioned analysis of LLM routers. To support the continuous expansion of routing methods in RouteJudge, we further release ORBIT (Optimal Routing and Budgeted Inference Toolbox), a modular and extensible toolbox that standardizes the end-to-end workflow of LLM routing. ORBIT provides unified interfaces for benchmark loading, query representation, router implementation, budget-aware evaluation, and method comparison, allowing researchers to develop and evaluate routing algorithms under consistent protocols. It also serves as the submission and integration layer for RouteJudge: researchers can implement routing methods within ORBIT, validate them on existing routing benchmarks, and submit compatible routers for online preference-based evaluation. The code of ORBIT is available at https://github.com/AIGNLAI/LAMDA-ORBIT.
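The attribution step described above, in which a user's preference between two responses is credited to the routers that selected them, can be sketched with a small record type and a win-rate tally. The field names and the `router_win_rates` helper are hypothetical and are not taken from RouteJudge or ORBIT; ties are counted as half a win for each side, which is one common convention.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass
class EvaluationRecord:
    """One anonymous pairwise comparison (illustrative schema only)."""
    query: str
    router_a: str      # routing strategy behind response A
    router_b: str      # routing strategy behind response B
    model_a: str       # model chosen by router_a
    model_b: str       # model chosen by router_b
    preferred: str     # "a", "b", or "tie"
    cost_a: float = 0.0
    cost_b: float = 0.0
    task: str = "unknown"

def router_win_rates(records):
    """Credit each preference to the router behind the preferred response."""
    wins = defaultdict(float)
    games = defaultdict(int)
    for r in records:
        games[r.router_a] += 1
        games[r.router_b] += 1
        if r.preferred == "a":
            wins[r.router_a] += 1.0
        elif r.preferred == "b":
            wins[r.router_b] += 1.0
        else:  # tie: half a win each
            wins[r.router_a] += 0.5
            wins[r.router_b] += 0.5
    return {name: wins[name] / games[name] for name in games}
```

Because cost and task are stored per record, the same tally can be restricted to a task category or a cost band to obtain the task-conditioned and cost-aware views the abstract mentions.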

2606.18829 2026-06-18 cs.LG cs.CL 新提交

GateMem: Benchmarking Memory Governance in Multi-Principal Shared-Memory Agents

GateMem:多主体共享内存代理中的内存治理基准

Zhe Ren, Yibo Yang, Yimeng Chen, Zijun Zhao, Benshuo Fu, Zhihao Shu, Bingjie Zhang, Yangyang Xu, Dandan Guo, Shuicheng Yan

发表机构 * School of Artificial Intelligence, Jilin University(吉林大学人工智能学院) Shanghai Jiao Tong University(上海交通大学) King Abdullah University of Science and Technology (KAUST)(阿卜杜拉国王科技大学) Tsinghua University(清华大学) National University of Singapore(新加坡国立大学)

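AI Summary Proposes the GateMem benchmark, which evaluates governance of multi-principal shared-memory agents along three dimensions (utility, access control, and forgetting), and finds that no existing method satisfies all three at once.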

Comments 24 pages, 8 figures. Code and dataset are available at https://github.com/rzhub/GateMem and https://huggingface.co/datasets/Ray368/GateMem

详情
AI中文摘要

LLM代理的内存基准主要假设单用户设置,而医院、工作场所、校园和家庭中的共享助手研究不足。在这些部署中,多个主体写入公共内存池并根据不同角色、范围和关系进行查询,因此内存质量需要治理和召回。我们引入GateMem,一个多主体共享内存代理的基准。GateMem联合评估合法长期请求的效用(含状态更新)、跨上下文授权边界的访问控制,以及显式删除请求后的主动遗忘。它涵盖医疗、办公、教育和家庭领域,包含长形式多方情节、增量内存注入、隐藏检查点、结构化评判和泄漏目标注释。在多种基线和骨干模型上,没有方法能同时实现强效用、鲁棒访问控制和可靠遗忘。长上下文提示通常以高令牌成本获得最佳治理分数,而基于检索和外部内存的方法降低成本但仍泄漏未授权或已删除信息。这些结果表明,当前内存代理远未达到可靠的共享机构部署水平。

英文摘要

Memory benchmarks for LLM agents largely assume single-user settings, leaving shared assistants for hospitals, workplaces, campuses, and households understudied. In these deployments, multiple principals write to a common memory pool and query it under different roles, scopes, and relationships, so memory quality requires governance as well as recall. We introduce GateMem, a benchmark for multi-principal shared-memory agents. GateMem jointly evaluates utility for legitimate long-horizon requests with state updates, access control across contextual authorization boundaries, and agent-facing active forgetting after explicit deletion requests. It spans medical, office, education, and household domains, with long-form multi-party episodes, incremental memory injection, hidden checkpoints, structured judging, and leak-target annotations. Across diverse baselines and backbone models, no method simultaneously achieves strong utility, robust access control, and reliable forgetting. Long-context prompting often yields the best governance score at high token cost, while retrieval-based and external-memory methods reduce cost yet still leak unauthorized or deleted information. These results show current memory agents remain far from reliable shared institutional deployment.

2606.18833 2026-06-18 cs.LG 新提交

Seed-Guided Semi-Supervised Clustering by A-Contrario Anomaly Detection

基于A-Contrario异常检测的种子引导半监督聚类

Nassir Mohammad

发表机构 * Cyber Innovation Lab, Airbus, Newport, UK(空中客车公司网络创新实验室(英国纽波特))

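AI Summary Proposes a semi-supervised clustering framework based on a statistical duality between grouping and anomaly detection, using a-contrario reasoning and the Perception algorithm; clusters are initialized from seed labels and refined by iteratively excluding anomalies, giving robust clustering with strong performance from few seeds.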

详情
AI中文摘要

本文介绍了一种基于分组原则与异常检测之间统计对偶性的半监督聚类框架。我们解决了噪声环境中鲁棒聚类定义的挑战——在该任务中,划分算法往往过度分配离群点,而基于密度的方法仍对启发式全局参数敏感。借鉴\textit{a-contrario}统计推理和格式塔邻近原则,我们将聚类定义为相对于均匀随机性零假设不包含任何异常点的最大数据点子集。该方法的核心是感知算法,该算法利用基于期望的原则性阈值($\mathbb{E} < 1$)来识别异常点,无需手动参数调整。通过将聚类视为异常检测的对偶问题,我们采用迭代的“通过排除进行聚类”机制。该算法由种子引导,利用最少的用户提供标签来初始化鲁棒的聚类中位数并形成初始组,随后通过接纳非异常点进行扩展。这种方法自然地隔离了边缘点、孤立噪声和新兴的未知聚类。我们在合成和真实基准数据集上评估了该方法,包括通过原始、线性降维和邻域保持嵌入表示的图像和文本数据集。结果表明,在每个聚类仅使用10-30个种子的情况下,所提出的方法在实用的低调优基准测试协议下实现了具有竞争力且通常非常强的性能,同时在固定种子聚类数和迭代次数下,对观测数和维度均保持线性可扩展性。

英文摘要

This paper introduces a semi-supervised clustering framework grounded in the statistical duality between grouping principles and anomaly detection. We address the challenge of robust cluster definition in noisy environments -- a task where partitioning algorithms often over-assign outliers and density-based methods remain sensitive to heuristic global parameters. Drawing on \textit{a-contrario} statistical reasoning and Gestalt proximity principles, we define a cluster as a maximal subset of data points containing no anomalies relative to a null hypothesis of uniform randomness. Central to this approach is the Perception algorithm, which utilises a principled expectation-based threshold ($\mathbb{E} < 1$) to identify outliers without manual parameter tuning. By treating clustering as the dual of anomaly detection, we employ an iterative ``clustering-by-exclusion'' mechanism. The algorithm is seed-guided, leveraging minimal user-provided labels to initialise robust cluster medians and form initial groups, which are subsequently expanded by admitting non-anomalous points. This approach naturally isolates fringe points, isolated noise, and emerging unknown clusters. We evaluate the method on synthetic and real-world benchmarks, including image and text datasets represented through raw, linear-reduced, and neighbourhood-preserving embeddings. Results demonstrate that with as few as 10--30 seeds per cluster, the proposed method achieves competitive and often very strong performance under a practical low-tuning benchmarking protocol, while maintaining linear scalability with respect to both observations and dimensionality for a fixed number of seeded clusters and iterations.

2606.18970 2026-06-18 cs.LG cs.AI cs.CV 新提交

A Controlled Benchmark of Quantum-Latent GAN Augmentation for Brain MRI

脑MRI的量子潜GAN增强的受控基准测试

Syed Mujtaba Haider, Silvia Figini

发表机构 * Department of Mathematics(数学系) Department of Political and Social Sciences(政治与社会科学系)

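AI Summary Uses a controlled benchmark to compare quantum and classical generators for brain-MRI data augmentation, finding that neither significantly outperforms training on real data alone and that the quantum generator offers no additional advantage.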

Comments This work has been submitted to the IEEE for possible publication.

详情
AI中文摘要

医学图像分类常受限于有限的标注数据,因此生成式增强被提出;最近,量子生成模型被用于此目的,并经常报告准确率提升。然而,这些声称通常基于单次训练运行,未匹配量子与经典生成器的参数预算,也未表征任何收益出现的数据范围。我们提出了一个受控基准测试,隔离量子生成器对脑MRI增强的贡献。图像被编码到KL正则化的潜在空间中,在该空间中,使用变分量子生成器或参数数量几乎相同的经典生成器(1648 vs. 1632)训练带有梯度惩罚的条件Wasserstein GAN。合成样本被解码并用于增强预训练分类器,覆盖从5%到100%的标注数据比例,通过八个随机种子进行配对显著性检验(多重比较校正)以及集内多样性和潜在分布分析。在所有比例下,没有增强变体显著优于仅用真实数据训练,且量子与经典生成器在统计上无法区分。任何低数据优势表现为正则化而非忠实的数据扩展:合成样本分布外移,并且在数据稀缺时严重模式崩溃,而量子生成器并不比经典生成器更多样化。我们发布该协议作为医学成像中量子生成增强严格评估的测试平台。

英文摘要

Medical image classification is often constrained by limited labeled data, motivating generative augmentation; recently, quantum generative models have been proposed for this purpose, frequently reporting accuracy gains. However, such claims are typically based on single training runs, do not match the parameter budgets of the quantum and classical generators, and do not characterize the data regime in which any benefit appears. We present a controlled benchmark that isolates the contribution of a quantum generator to brain-MRI augmentation. Images are encoded into a KL-regularized latent space in which a conditional Wasserstein GAN with gradient penalty is trained using either a variational quantum generator or a classical generator of near-identical parameter count (1648 vs. 1632). Synthetic samples are decoded and used to augment a pretrained classifier across labeled data fractions from 5% to 100%, evaluated over eight random seeds with paired significance testing (with multiple-comparison correction) and with intraset diversity and latent-distribution analyses. Across all fractions, no augmentation variant significantly outperforms real-data-only training, and the quantum and classical generators are statistically indistinguishable. Any low-data benefit behaves as regularization rather than faithful data expansion: synthetic samples are off distribution and severely mode collapsed precisely where data is scarce, and the quantum generator is no more diverse than its classical counterpart. We release the protocol as a testbed for rigorous evaluation of quantum generative augmentation in medical imaging.
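The abstract does not say which paired test or which correction was used, so the following sketch picks two standard choices purely for illustration: an exact paired sign-flip permutation test (feasible because eight seeds give only 2^8 sign patterns) and the Holm step-down correction across the comparisons. The function names are made up for this example.

```python
from itertools import product

def paired_sign_flip_pvalue(a, b):
    """Exact two-sided paired permutation test on per-seed score differences.

    Under the null of no difference, each paired difference is equally likely
    to have either sign, so all 2^n sign patterns are enumerated.
    """
    diffs = [x - y for x, y in zip(a, b)]
    observed = abs(sum(diffs))
    hits = 0
    total = 0
    for signs in product((1, -1), repeat=len(diffs)):
        total += 1
        if abs(sum(s * d for s, d in zip(signs, diffs))) >= observed - 1e-12:
            hits += 1
    return hits / total

def holm_reject(pvalues, alpha=0.05):
    """Holm step-down procedure: returns a reject/keep flag per hypothesis."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    reject = [False] * m
    for rank, i in enumerate(order):
        if pvalues[i] <= alpha / (m - rank):
            reject[i] = True
        else:
            break  # once one test fails, all larger p-values are retained
    return reject
```

With eight seeds the smallest attainable two-sided p-value is 2/256, so a benchmark that compares many variants across many data fractions has limited power after correction; this is worth bearing in mind when reading a null result.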

2606.19297 2026-06-18 cs.LG cs.RO 新提交

Does VLA Even Know the Basics? Measuring Commonsense and World Knowledge Retention in Vision-Language-Action Models

VLA 甚至知道基础知识吗?衡量视觉-语言-动作模型中的常识和世界知识保留

Nikita Kachaev, Andrey Moskalenko, Matvey Skripkin, Nikita Kurlaev, Daria Pugacheva, Albina Burlova, Mikhail Kolosov, Denis Shepelev, Andrey Kuznetsov, Elena Tutubalina, Aleksandr I. Panov, Alexey K. Kovalev, Vlad Shakhuro

发表机构 * CogAI Lab(CogAI实验室) FusionBrain Lab(FusionBrain实验室) IAI MSU(莫斯科大学人工智能研究所) Lomonosov MSU(莫斯科国立罗蒙诺索夫大学) NUST MISIS(国立研究型技术大学MISIS) Applied AI Institute(应用人工智能研究所) HSE University(高等经济大学) Generalizable AI Systems(通用人工智能系统实验室) ISP RAS(俄罗斯科学院系统编程研究所) MIRAI Domain-specific NLP Group(领域特定自然语言处理组)

AI总结 提出 Act2Answer 协议,通过动作回答评估 VLA 模型的知识保留,发现模型在简单概念上表现良好,但在丰富语义类别上存在差距,且 VQA 联合训练有助于知识保留。

Comments Project page: https://tttonyalpha.github.io/act2answer/

详情
AI中文摘要

具身视觉-语言-动作(VLA)模型通常通过在机器人数据上微调强大的预训练 VLM 获得,但目前尚不清楚它们在适应后保留了多少常识和事实知识。在知识敏感任务上的失败是模糊的,混淆了知识缺失与低级控制泛化能力差。我们引入 Act2Answer,一种轻量级协议,通过要求智能体通过动作来回答,将 VLM 知识基准适配到 VLA 评估。每个问题变成一个简短的桌面场景,其中智能体执行单个物体放置动作以选择候选答案,从而产生动作基础的、减少控制混淆的成功率。我们在不同的常识和世界知识类别中策划了这样的环境测试套件,并引入逐层意图探测以定位 VLM 骨干和动作头中与答案相关的信息。在对 7 个 VLA 模型和 9 个 VLM 基线的大规模研究中,我们系统地跨类别对模型进行排名,发现 VLA 在简单概念上表现稳健,但在更丰富的语义类别上相对于其源 VLM 显示出更大的差距,VQA 联合训练与更好的知识保留相关,并且答案相关信号在 VLA 中间层达到峰值,但在上层减弱。Act2Answer 可在以下网址获取:https://tttonyalpha.github.io/act2answer/。

英文摘要

Embodied Vision-Language-Action (VLA) models are typically obtained by fine-tuning powerful pretrained VLMs on robotics data, yet it is unclear how much commonsense and factual knowledge they retain after adaptation. Failures on knowledge-sensitive tasks are ambiguous, conflating missing knowledge with poor generalization of low-level control. We introduce Act2Answer, a lightweight protocol that adapts VLM knowledge benchmarks to VLA evaluation by requiring agents to answer through action. Each question becomes a short tabletop episode where the agent performs a single object-placement action to select among candidate answers, yielding an action-grounded success rate with reduced control confounds. We curate a test suite of such environments across diverse commonsense and world-knowledge categories and introduce layerwise intent probing to localize answer-relevant information across the VLM backbone and action head. In a large-scale study of 7 VLA models and 9 VLM baselines, we systematically rank models across categories, finding that VLAs show solid performance on simple concepts while exhibiting larger gaps on richer semantic categories relative to their source VLMs, that VQA co-training is associated with better knowledge retention, and that answer-relevant signals peak in middle VLA layers but attenuate in upper layers. Act2Answer is available at https://tttonyalpha.github.io/act2answer/.

12. 机器学习应用 21 篇

2606.18287 2026-06-18 cs.LG 新提交

Artemis: Anatomy-Resolved inTervention for Eliminating Multimodal NeuroImage confounderS

Artemis: 解剖分辨的干预方法用于消除多模态神经影像混杂因素

Siyuan Dai, Yang Du, Kun Zhao, Zhusuyi Chen, Heng Huang, Paul Thompson, Chao Shi, Haoteng Tang, Liang Zhan

发表机构 * University of Pittsburgh(匹兹堡大学) University of Maryland(马里兰大学) University of Southern California(南加州大学) Binghamton University(宾汉姆顿大学) University of Texas Rio Grande Valley(德克萨斯大学里奥格兰德河谷分校)

AI总结 提出Artemis框架,通过区域级因果干预学习特定脑区的混杂因素表示,消除fMRI和DTI多模态神经影像中人口统计学混杂因素对GNN的影响,在三个基准上提升性能。

Comments 11 pages, 8 figures

详情
AI中文摘要

多模态神经影像学整合了来自fMRI的功能连接和来自DTI的结构连接,使得使用图神经网络对脑网络进行无创分析成为可能。然而,年龄和性别等人口统计学因素系统地混淆了脑连接与临床结果之间的关系,导致GNN利用虚假捷径而非学习因果不变表示。尽管最近的因果GNN方法在图建模层面引入因果关系,但其因果机制仍然是领域无关的,没有考虑临床神经影像数据中固有的真实世界混杂因素。此外,脑网络是基于图谱分区构建的,每个区域对人口统计学因素表现出不同的敏感性,因此需要区域感知的调整。我们提出了Artemis,一个区域级因果框架,通过在每个脑区域独立进行因果干预,使用轻量级参数学习区域特定的混杂因素表示,从而弥合了这一差距。我们的调整综合利用多模态功能和结构特征进行图推理,作为一个与任意GNN骨干兼容的插件模块。在三个基准(用于疾病诊断的ADNI、用于痴呆分期的OASIS和用于性别分类的HCP)上的实验表明,与代表性的基于GNN的基线相比,该方法具有一致的改进。多项支持实验进一步证明了统计显著性和神经科学可解释性。

英文摘要

Multimodal neuroimaging, integrating functional connectivity from fMRI and structural connectivity from DTI, enables non-invasive analysis of brain networks using graph neural networks. However, demographic factors such as age and sex systematically confound the relationship between brain connectivity and clinical outcomes, causing GNNs to exploit spurious shortcuts rather than learning causally invariant representations. While recent causal GNN methods introduce causality at the graph-modeling level, their causal mechanisms remain domain-agnostic without accounting for the real-world confounders inherent in clinical neuroimaging data. Moreover, brain networks are constructed from atlas-based parcellations where each region exhibits distinct sensitivity to demographic factors, necessitating region-aware adjustment. We propose Artemis, a region-level causal framework that bridges this gap with causal intervention at each brain region independently by learning region-specific confounder representations with lightweight parameters. Our adjustment comprehensively utilizes the multimodal functional and structural features for graph reasoning as a plug-in module compatible with arbitrary GNN backbones. Experiments on three benchmarks, ADNI for disease diagnosis, OASIS for dementia staging, and HCP for sex classification, demonstrate consistent improvements over representative GNN-based baselines. Multiple supporting experiments further demonstrate statistical significance and neuroscientific interpretability.

2606.18316 2026-06-18 cs.LG 新提交

A Survey on Data-Driven Models for Soil Moisture Regression and Classification

基于数据驱动的土壤湿度回归与分类模型综述

Ilektra Tsimpidi, George Georgoulas, Vidya Sumathy, George Nikolakopoulos

发表机构 * Electrical Engineering, University of Technology, Sweden(电气工程,技术大学,瑞典)

AI总结 综述了基于AI的土壤湿度建模方法,分为五类:统计时间序列、地统计、经典机器学习、深度学习和概率/贝叶斯方法,利用多源数据实现回归或分类。

Comments 14 pages, 3 figures, AIAI 2026 Conference

详情
AI中文摘要

土壤湿度(SM)建模构成一个复杂的时空学习问题,其特点是非线性环境相互作用、异构数据源和有限的地面观测。基于物理的方法,如水量平衡模型,依赖于明确的水文方程和高质量的输入,但其计算成本和可扩展性限制阻碍了大规模部署。数据驱动的人工智能(AI)方法已成为灵活的替代方案,能够以较少的建模假设提取土壤湿度与环境变量之间的经验关系。本文对基于AI的土壤湿度估计和分类模型进行了结构化综述。现有方法被组织为五类:(a)统计时间序列模型,(b)地统计方法,(c)经典机器学习(ML)模型,(d)深度学习(DL)模型和(e)概率/贝叶斯方法。这些模型利用历史土壤湿度记录、气象变量、植被指数、地形、土壤特征和地理位置数据来执行回归或分类任务。

英文摘要

Soil Moisture (SM) modelling constitutes a complex spatiotemporal learning problem characterised by nonlinear environmental interactions, heterogeneous data sources, and limited ground observations. Physics-based approaches, such as water balance models, rely on explicit hydrological equations and high-quality inputs, but their computational cost and scalability limitations restrict large-scale deployment. Data-driven artificial intelligence (AI) methods have emerged as flexible alternatives, enabling the extraction of empirical relationships between soil moisture and environmental variables with reduced modelling assumptions. This work presents a structured survey of AI-based models for soil moisture estimation and classification. Existing approaches are organized into five categories: (a) statistical time-series models, (b) geostatistical methods, (c) classical machine learning (ML) models, (d) deep learning (DL) models, and (e) probabilistic/Bayesian methods. These models leverage historical soil moisture records, meteorological variables, vegetation indices, topography, soil characteristics, and geolocation data to perform regression or classification tasks.

2606.18319 2026-06-18 cs.LG cs.AI cs.HC cs.SE 新提交

ASTRA: A Scalable Next-Generation ATCO Training Simulator with Autonomous Simpilots

ASTRA:一种具有自主模拟飞行员的可扩展下一代空中交通管制员训练模拟器

Ethan Chew, Enjia Wu, Iruss Eng Wei Yeow, Ian Weiqin Lim, Ranen Sim, Brandon Koh Ziheng, Kaleb Nim, Caden Toh Jun Yi, Wei Dong Soin, Darius Kai Keat Koh, Galen King Yu Tay, Prannaya Gupta, Jonathan Ee Fang Koong, Yong Zhi Lim

发表机构 * Air Emerging Technologies High-Speed Experimentations and Research (AETHER), RSAF Agile Innovation Digital (RAiD), Republic of Singapore Air Force(新加坡共和国空军敏捷创新数字实验室空中新兴技术高速实验与研究)

AI总结 提出ASTRA模拟器,通过微调ASR将词错误率降至23.45%,并集成AI评估框架,实现可扩展的标准化ATCO训练。

详情
AI中文摘要

空中交通管制员(ATCO)对于确保空中交通的安全、有序和高效至关重要,但培训能力受到依赖专门的人类培训师(称为模拟飞行员)的限制,这些培训师必须在模拟空域中扮演飞行员和ATCO的双重角色。现有的自动化解决方案依赖于西方中心的语音模型,这些模型在新加坡的运营环境中表现不佳,现成的系统在新加坡口音的航空语音上词错误率(WER)高达107.80%。我们引入了ASTRA,一个端到端的训练模拟器,通过一个流水线自动化这些模拟飞行员角色,该流水线转录ATCO语音、解释指令,并使用本地适应的语音模型生成适当的飞行员和ATCO响应。我们微调的自动语音识别(ASR)流水线将WER降低到23.45%,在该领域显著优于现有方法。除了交通模拟,ASTRA还集成了一个AI辅助的性能评估框架,该框架评估受训者的无线电通信的准确性、简洁性和完整性,优化后得分分别为91.7%、88.2%和86.9%。基于DSPy和Unsloth等开源基础,这种方法实现了可扩展、标准化的ATCO评估,同时减少了教师的工作量。

英文摘要

Air Traffic Control Operators (ATCOs) are vital in ensuring the safe, orderly, and efficient flow of air traffic, yet training capacity is constrained by reliance on specialized human trainers known as simpilots, who must role-play both pilots and ATCOs in a simulated airspace. Existing automated solutions rely on Western-centric speech models that perform poorly in Singaporean operational contexts, with off-the-shelf systems exhibiting Word Error Rates (WER) of up to 107.80% on Singaporean-accented aviation speech. We introduce ASTRA, an end-to-end training simulator that automates these simpilot roles through a pipeline that transcribes ATCO speech, interprets instructions, and generates appropriate pilot and ATCO responses using locally adapted voice models. Our fine-tuned Automatic Speech Recognition (ASR) pipeline reduces WER to 23.45%, substantially outperforming existing approaches in this domain. Beyond traffic simulation, ASTRA incorporates an AI-assisted performance evaluation framework that assesses trainee radiotelephony communications across accuracy, brevity, and completeness, achieving post-optimization scores of 91.7%, 88.2%, and 86.9%, respectively. Built on open-source foundations such as DSPy and Unsloth, this approach enables scalable, standardized ATCO assessment while reducing instructor workload.
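A Word Error Rate above 100%, such as the 107.80% quoted above, can look like a mistake. WER divides the number of word-level substitutions, deletions, and insertions by the number of reference words, so a transcript with many inserted words can exceed 1.0. The sketch below computes WER with a standard word-level edit distance; the function name and the example phrases are hypothetical and are not taken from the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the processed reference
    # prefix and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # substitution or exact match
            )
        prev = curr
    return prev[-1] / len(ref)
```

For example, `word_error_rate("turn left", "turn left heading two seven zero")` has four insertions against a two-word reference, giving a WER of 2.0, that is, 200%.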

2606.18479 2026-06-18 cs.LG cs.CY 新提交

The Illusion of Improvement: Reject Inference Strategies in Credit Scoring

改进的幻觉:信用评分中的拒绝推断策略

Bruno Scarone, Ricardo Baeza-Yates

发表机构 * Northeastern University(东北大学) KTH Royal Institute of Technology(瑞典皇家理工学院)

AI总结 研究揭示拒绝推断方法在信用评分中因反馈循环导致评估指标误导,提出通过少量探索打破循环并诊断问题。

Comments Accepted to ECML PKDD 2026 (Research Track)

详情
AI中文摘要

拒绝推断方法被广泛用于减轻信用评分中的生存偏差,但其有效性仍不明确。我们系统评估了几种此类方法,并发现一个结构性失败模式:在自然的再训练循环中,模型的准确率提升而召回率崩溃,造成改进的幻觉,使从业者认为系统在变好,而实际上其拒绝质量——正确筛选出违约者的能力——在恶化。然后,我们提出一种受控探索策略,无需统计假设即可打破反馈循环:贷款方故意批准一部分被拒绝的申请人,并观察他们的真实结果。我们表明,准确率和拒绝质量在是否探索上给出相反的建议:准确率倾向于不探索,而拒绝质量随探索提高,证实标准评估指标在选择性偏差下具有误导性。即使极低的探索率(2-5%)在我们的实验中也足以以近乎零成本诊断反馈循环的严重性。我们的发现在两种机器学习方法和三个真实数据集上一致,表明标准评估协议不足以评估在生存偏差下训练的模型。

英文摘要

Reject inference methods are widely used to mitigate survival bias in credit scoring, yet their effectiveness remains poorly understood. We systematically evaluate several such methods and uncover a structural failure mode: in a natural retraining cycle, models whose accuracy improves while recall collapses create an illusion of improvement that leads practitioners to believe the system is getting better when, in fact, its rejection quality (the ability to correctly screen out defaulters) is deteriorating. We then propose a controlled exploration strategy that breaks the feedback loop without statistical assumptions: the lender deliberately approves a fraction of rejected applicants and observes their true outcomes. We show that accuracy and rejection quality give opposite recommendations on whether to explore: accuracy favors no exploration, while rejection quality improves with it, confirming that standard evaluation metrics are misleading under selection bias. Even minimal exploration rates (2-5%) prove sufficient in our experiments to diagnose the severity of the feedback loop at near-zero cost. Our findings are consistent across two machine learning methods and three real-world datasets, and suggest that standard evaluation protocols are inadequate for assessing models trained under survival bias.
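The controlled exploration idea, approving a small random fraction of applicants the model would reject so that their true outcomes become observable, can be sketched in a few lines. This is an illustration under stated assumptions rather than the authors' implementation: the score is taken to be an estimated default probability, and `decide`, `observed_default_rate`, and the threshold are hypothetical names and parameters.

```python
import random


def decide(score, threshold, explore_rate, rng):
    """Return (approved, explored) for one applicant.

    Assumption: `score` is the model's estimated default probability, and
    applicants with score >= threshold would normally be rejected.
    """
    if score < threshold:
        return True, False   # regular approval
    if rng.random() < explore_rate:
        return True, True    # exploration: approve despite the model
    return False, False      # regular rejection, outcome stays unobserved


def observed_default_rate(explored_outcomes):
    """Share of explored loans that defaulted, or None if none were explored.

    `explored_outcomes` is a list of booleans (True = defaulted).
    """
    if not explored_outcomes:
        return None
    return sum(explored_outcomes) / len(explored_outcomes)
```

The default rate among explored loans is one simple signal a lender could track: if the model screens out defaulters well, this rate should sit clearly above the rate among regular approvals, and a shrinking gap across retraining cycles would suggest that rejection quality is eroding.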

2606.18506 2026-06-18 cs.LG eess.SP stat.AP 新提交

Beyond AHI: An Interpretable Causal-Discovery-Guided Framework for Sleep Recovery in Connected Health

超越AHI:一种可解释的因果发现引导的睡眠恢复框架在互联健康中的应用

Saba A. Farahani, Elahe Khatibi, Manoj Vishwanath, Amir M. Rahmani, Hung Cao

发表机构 * University of California, Irvine(加州大学尔湾分校)

AI总结 提出一种可解释的因果发现引导框架,从多模态PSG中推导层次化睡眠恢复评分(SRS),在两大队列中SRS与感知恢复的关联强度是AHI的2.5倍。

Comments 6 pages, 2 figures, 2 tables. Accepted at the 2nd Workshop on Sensing and Computing for Smart and Connected Health (SCH), co-located with IEEE/ACM CHASE 2026

详情
AI中文摘要

客观睡眠评估依赖于多导睡眠图(PSG),但临床影响通常更好地反映在患者报告结局(PROs)如嗜睡和疲劳中。现有的总结指标,包括呼吸暂停低通气指数(AHI),对功能恢复背后的多域生理学提供的洞察有限。我们提出了一种可解释的、因果发现引导的框架,用于从多模态PSG中推导层次化睡眠恢复评分(SRS)。利用两个大型人群队列(MESA: n=1540; MrOS: n=825),我们应用有向无环图(DAG)学习来识别候选生理驱动因素,涵盖呼吸负担、缺氧负担、睡眠碎片化、睡眠结构和自主神经调节。尽管源自临床PSG,这些域自然映射到互联健康技术中日益可用的传感流,包括可穿戴心电图、血氧测定和睡眠阶段估计设备。为了保持机制合理性,我们引入了一个两阶段筛选过程,结合基于生理学的约束和受约束的LLM辅助审计,以识别和消除结构混杂因素以及构造重叠变量。跨队列,这五个域作为与恢复相关的重复生理域出现,所得SRS与感知恢复的关联强度高达AHI的2.5倍。通过将多模态睡眠生理学与以患者为中心的结果通过可解释、偏差感知和域结构化的框架联系起来,这项工作为临床睡眠研究和新兴智能互联健康环境中的恢复建模提供了实用基础。

英文摘要

Objective sleep assessment relies on polysomnography (PSG), yet clinical impact is often better reflected in patient-reported outcomes (PROs) such as sleepiness and fatigue. Existing summary indices, including the Apnea-Hypopnea Index (AHI), provide limited insight into the multidomain physiology underlying functional recovery. We propose an interpretable, causal-discovery-guided framework for deriving a hierarchical Sleep Recovery Score (SRS) from multimodal PSG. Using two large population cohorts (MESA: n=1540; MrOS: n=825), we apply directed acyclic graph (DAG) learning to identify candidate physiological drivers spanning respiratory burden, hypoxic burden, sleep fragmentation, sleep architecture, and autonomic regulation. Although derived from clinical PSG, these domains map naturally to sensing streams increasingly available in connected health technologies, including wearable ECG, oximetry, and sleep-stage estimation devices. To preserve mechanistic plausibility, we introduce a two-stage screening process that combines physiology-based constraints with constrained LLM-assisted auditing to identify and remove structural confounders and construct-overlapping variables. Across cohorts, these five domains emerge as recurrent physiological domains associated with recovery, and the resulting SRS shows up to 2.5$\times$ stronger alignment with perceived recovery than AHI. By linking multimodal sleep physiology to patient-centered outcomes through an interpretable, bias-aware, and domain-structured framework, this work provides a practical foundation for recovery modeling across both clinical sleep studies and emerging smart and connected health settings.

2606.18561 2026-06-18 cs.LG cs.AI 新提交

Correcting Sensor-Induced Distribution Drift with Wasserstein Adversarial Learning

使用Wasserstein对抗学习校正传感器引起的分布漂移

Saraa Ali, Vladimir Bocharnikov, Fedor Ratnikov, Mikhail Hushchyn, Artem Ryzhikov, Denis Derkach

发表机构 * Laboratory of Methods for Big Data Analysis, HSE University(大数据分析方法实验室,高等经济大学)

AI总结 提出受Wasserstein GAN启发的无监督方法,通过可学习的校准变换将发生变化的探测器响应分布映射回参考分布,在跟踪探测器玩具模型和Geant4模拟量能器数据上验证了恢复老化系数和改善能量和分布一致性的能力。

Comments This is a preprint sent to Nuclear Science and Techniques journal

详情
AI中文摘要

记录数据的质量取决于采集数据的传感器系统的稳定性。传感器运动和老化会降低下游数据驱动方法的性能和稳定性。我们提出了一种基于Wasserstein-GAN的无监督方法,用于推断物理可解释的变换参数,这些参数将变化的检测器响应分布映射回标称参考分布。与标准生成建模不同,生成器被用作可学习的校准变换,其可训练权重代表所寻求的参数,而判别器通过Wasserstein目标提供分布距离信号。我们在具有受控层偏移的跟踪探测器玩具模型上验证了该方法,并展示了其在具有单元老化效应的高粒度Geant4模拟量能器数据上的应用。该方法恢复了单个单元的老化系数,与真实值相关,并改善了校准后和参考能量和分布之间的一致性,同时随着通道间噪声水平的增加而表现出预期的退化。这些结果表明,在退化参数的直接标签不可用的情况下,对抗性分布匹配可以作为校准策略的数据驱动组件。

英文摘要

The quality of recorded data depends on the stability of the sensor system that acquires it. Sensor motion and aging can degrade the performance and stability of downstream data-driven methods. We present a Wasserstein-GAN-inspired approach for unsupervised inference of physically interpretable transformation parameters that map a changed detector response distribution back to a nominal reference distribution. In contrast to standard generative modeling, the generator is used as a learnable calibration transformation whose trainable weights represent the sought parameters, while the critic provides a distributional distance signal via the Wasserstein objective. We validate the approach on a tracking-detector toy model with controlled layer shifts and demonstrate its application on high-granularity Geant4-simulated calorimeter data with cell-wise aging effects. The method recovers aging coefficients for individual cells with correlation to ground truth and improves agreement between calibrated and reference energy-sum distributions, while exhibiting the expected degradation at increasing channel-to-channel noise levels. These results indicate that adversarial distribution matching can serve as a data-driven component of calibration strategies in settings where direct labels for degradation parameters are unavailable.
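The core idea, treating calibration as a search for transformation parameters that bring a drifted response distribution back onto a reference distribution, can be illustrated in one dimension without any adversarial training. The sketch below deliberately swaps the learned critic for the closed-form empirical 1-Wasserstein distance between equal-size samples and swaps gradient-based training for a grid search over a single gain. The Gaussian toy data, the aging factor of 0.8, and all function names are assumptions made for illustration only.

```python
import random


def wasserstein_1d(xs, ys):
    """Empirical 1-Wasserstein distance between two equal-size 1D samples."""
    if len(xs) != len(ys):
        raise ValueError("samples must have the same size")
    pairs = zip(sorted(xs), sorted(ys))
    return sum(abs(x - y) for x, y in pairs) / len(xs)


def fit_gain(degraded, reference, candidates):
    """Pick the multiplicative gain that best maps `degraded` onto `reference`."""
    return min(
        candidates,
        key=lambda g: wasserstein_1d([g * x for x in degraded], reference),
    )


def toy_example(seed=0, n=2000, aging=0.8):
    """Recover the correction for a cell whose response is scaled by `aging`."""
    rng = random.Random(seed)
    # Reference and degraded samples are drawn independently (unpaired),
    # mirroring the unsupervised setting: only distributions are compared.
    reference = [rng.gauss(10.0, 2.0) for _ in range(n)]
    degraded = [aging * rng.gauss(10.0, 2.0) for _ in range(n)]
    candidates = [0.5 + 0.01 * k for k in range(151)]  # gains 0.50 .. 2.00
    return fit_gain(degraded, reference, candidates)
```

In this toy setting the recovered gain should land near 1 / 0.8 = 1.25. With many coupled cells and no closed-form distance, a learned critic, as described in the abstract, takes over the role that `wasserstein_1d` plays here.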

2606.18571 2026-06-18 cs.LG cs.CL cs.SD eess.AS 新提交

Fair Cognitive Impairment Detection Through Unlearning

通过去学习实现公平的认知障碍检测

William Nguyen, Jiali Cheng, Hadi Amiri

发表机构 * University of Massachusetts Lowell, USA(马萨诸塞大学洛厄尔分校)

AI总结 提出一种多模态框架,结合跨模态融合和梯度反转去学习,减少人口统计信息对轻度认知障碍检测的偏见,在跨语言数据集上缩小性能差距。

Comments Interspeech 2026

详情
AI中文摘要

轻度认知障碍(MCI)是一种以记忆、语言或思维能力显著下降为特征的医学状况。从自发语音中检测MCI对于可扩展的筛查具有前景。然而,学习模型常常利用与标签相关的人口统计线索,导致不同亚组之间存在较大的性能差距。我们提出了一种多模态框架,结合了(i)模态间(语音、文本和图像)的跨模型融合,以及(ii)使用梯度反转的去学习,该技术阻止共享嵌入编码与任务无关的人口统计属性。在多语言基准TAUKADIAL和PREPARE上的评估表明,我们的方法在MCI分类上优于最先进的多语言和多模态基线,同时显著缩小了患者亚组(性别和语言)之间的性能差距。我们进一步分析了跨数据集的迁移,表明人口统计去学习有助于学习更鲁棒的MCI检测表示。

英文摘要

Mild Cognitive Impairment (MCI) is a medical condition characterized by a noticeable decline in memory, language, or thinking abilities. MCI detection from spontaneous speech is promising for scalable screening. However, learned models often exploit demographic cues correlated with labels, resulting in a large performance gap across subgroups. We present a multimodal framework that combines (i) cross-model fusion between modalities (speech, text, and image), and (ii) unlearning using gradient reversal that discourages the shared embedding from encoding task-irrelevant demographic attributes. Evaluated on the multilingual benchmarks TAUKADIAL and PREPARE, our method outperforms the state-of-the-art multilingual and multimodal baseline in MCI classification while substantially reducing the performance gap across patient subgroups (sex and language). We further analyze transfer across datasets, showing that demographic unlearning helps learn more robust representations for MCI detection.

2606.18672 2026-06-18 cs.LG cs.AI q-bio.GN 新提交

scGTN: Deep Siamese Graph Transformer Network for Single-cell RNA Sequencing Clustering

scGTN:用于单细胞RNA测序聚类的深度孪生图变换网络

Jinke Wu, Yifan Wang, Siyu Yi, Caiyang Yu, Ziyue Qiao, Nan Yin, Jiancheng Lv, Wei Ju

发表机构 * Sichuan University(四川大学) University of International Business and Economics(对外经济贸易大学) Great Bay University(大湾区大学) The Education University of Hong Kong(香港教育大学)

AI总结 提出scGTN框架,通过孪生图变换网络整合基因表达与细胞间结构信息,利用最优传输策略进行自监督聚类,在多个数据集上优于现有方法。

Comments Accepted by Proceedings of the Thirty-Fifth International Joint Conference on Artificial Intelligence (IJCAI 2026)

详情
AI中文摘要

单细胞RNA测序(scRNA-seq)在表征细胞水平基因表达、识别细胞类型以及促进对细胞异质性的理解中起着关键作用。尽管scRNA-seq数据聚类取得了显著进展,但我们认为当前方法常常忽略scRNA-seq数据固有的稀疏性和噪声,以及复杂的细胞间结构信息。为此,本文提出了一种基于深度孪生图变换网络(称为scGTN)的新型单细胞RNA-seq聚类框架,该框架明确整合了基因表达谱和细胞间结构依赖关系以进行细胞聚类。具体而言,我们将scRNA-seq数据建模为图,并构建两个增强图视图作为双视图以捕获互补的细胞间信息。然后,采用孪生图变换网络显式整合最短路径信息和节点间距离,以捕获细胞间更丰富的结构关系。最后,我们采用最优传输策略以自监督方式指导细胞聚类。在多个基准scRNA-seq数据集上的大量实验表明,我们的scGTN始终优于现有方法。我们的代码可在以下网址获取:https://github.com/...(原文链接)。

英文摘要

Single-cell RNA sequencing (scRNA-seq) plays a pivotal role in characterizing gene expression at the cellular level, enabling the identification of cell types and advancing the understanding of cellular heterogeneity. Despite the significant progress in scRNA-seq data clustering, we argue that current methods always ignore the sparsity and noise, as well as the complex intercellular structural information inherent in scRNA-seq data. To this end, we propose a novel single-cell RNA-seq clustering framework via deep Siamese Graph Transformer Network (termed scGTN), which explicitly integrates gene expression profiles and intercellular structural dependencies for cell clustering. In particular, we formulate scRNA-seq data as a graph and construct two augmented graph views that serve as dual views to capture complementary intercellular information. Then, a Siamese graph transformer network is employed to explicitly incorporate shortest-path information and node-wise distances for capturing richer structural relationships between cells. Finally, we employ an optimal transport strategy to guide the cell clustering in a self-supervised manner. Extensive experiments on multiple benchmark scRNA-seq datasets demonstrate that our scGTN consistently outperforms existing methods. Our code is available at https://github.com/W-RMSL/scGTN.

2606.18713 2026-06-18 cs.LG physics.comp-ph 新提交

Trainable Photonic Measurement for Physics-Informed PDE Learning

可训练光子测量用于物理信息偏微分方程学习

Jiale Linghu, Hao Dong, Yangshuai Wang

发表机构 * Xidian University(西安电子科技大学) National University of Singapore(新加坡国立大学)

AI总结 提出一种光子量子神经场,将坐标编码为可训练光学相位,通过多光子Fock空间干涉混合并从光子数测量解码,作为物理信息残差最小化的可训练表示,在七种PDE基准上展示相位复杂度转变,在困难区域误差低一个数量级且参数少约四分之一。

详情
AI中文摘要

光子量子机器学习提供了一条从相位、干涉和测量构建可训练物理表示的途径。然而,其在科学机器学习中的作用仍基本未被探索。物理信息神经场提供了一个自然设置,因为微分方程需要保留相位、频率和导数结构的试验空间。这里我们引入一种光子量子神经场,其中坐标成为可训练光学相位,通过多光子Fock空间干涉混合,并从光子数测量解码。光子电路本身作为神经场表示进行优化,而非固定特征图或硬件加速器。因此,光子测量是一种可训练表示,在此基础上最小化物理信息残差。在七个椭圆、波动、非线性色散和逆PDE基准测试中,我们观察到相位复杂度转变:经典坐标和傅里叶特征网络在平滑区域足够,而光子场在残差导数放大相位失配时最准确。在最困难区域,它给出最低误差,差距达一个数量级,且可训练参数约为经典基线四分之一。冻结和打乱控制以及噪声压力测试将这一增益归因于学习到的干涉和在复合扰动下稳定的Fock概率读出。这些结果将光子量子测量识别为科学机器学习的一种表示学习原理。

英文摘要

Photonic quantum machine learning offers a route to trainable physical representations built from phase, interference and measurement. However, its role in scientific machine learning remains largely unexplored. Physics-informed neural fields provide a natural setting, because differential equations require trial spaces that preserve phase, frequency and derivative structure. Here we introduce a photonic quantum neural field in which coordinates become trainable optical phases, are mixed by multi-photon Fock-space interference and are decoded from photon-number measurements. The photonic circuit is optimized as the neural-field representation itself, not as a fixed feature map or hardware accelerator. Photonic measurement is therefore a trainable representation on which the physics-informed residual is minimized. Across seven elliptic, wave, nonlinear dispersive and inverse PDE benchmarks, we observe a phase-complexity transition: classical coordinate and Fourier-feature networks suffice in smooth regimes, whereas the photonic field is most accurate when residual derivatives amplify phase mismatch. In the hardest regimes it gives the lowest errors, with margins reaching an order of magnitude and about one quarter of the trainable parameters of classical baselines. Frozen and shuffled controls, together with noise stress tests, attribute this gain to learned interference and stable Fock-probability readout under compound perturbations. These results identify photonic quantum measurement as a representation-learning principle for scientific machine learning.

2606.18726 2026-06-18 cs.LG cs.AI 新提交

Graph Grounded Cross Attention Transformer Neural Network for Structurally Constrained Full Event Sequence Generation in Predictive Process Monitoring

基于图锚定交叉注意力Transformer神经网络的预测过程监控中结构约束完整事件序列生成

Fang Wang, Ernesto Damiani

发表机构 * Department of Computer Science, University of Milan(米兰大学计算机科学系)

AI总结 提出图锚定交叉注意力Transformer(GGATN),通过全局过程图作为结构化记忆、Transformer自注意力编码序列位置、图锚定交叉注意力注入过程拓扑,结合维特比式图约束解码,一次性生成完整事件序列,在六个基准日志上优于LLM基线。

Comments 40 pages

详情
AI中文摘要

结构约束的事件序列生成仍然具有挑战性,因为生成的路径必须保持转移可行性、时间顺序、终止和属性一致性。在预测过程监控(PPM)中,这一挑战表现为完整事件序列生成,而现有工作主要处理子任务,如下一个活动、剩余时间、结果和属性预测。本文提出了图锚定交叉注意力Transformer神经网络(GGATN)用于这一统一的PPM任务。GGATN使用全局过程图作为结构化活动记忆,通过Transformer自注意力对序列位置进行上下文化,并通过图锚定交叉注意力注入过程拓扑。与自回归解码不同,GGATN一次性生成活动、时间戳、长度以及事件级和序列级属性,随后进行维特比风格的图约束解码以获得可行路径和显式终止。在六个基准事件日志上的实验表明,其生成质量优于局部指令提示的LLM基线。GGATN在序列相似性、Damerau-Levenshtein相似性、基于二元组的控制流相似性和持续时间分布方面取得了强劲性能,同时保持零幻觉活动和零序列级属性不一致。消融分析证实了全局图编码器作为稳定的结构先验。可解释性分析展示了图结构、序列上下文、反馈细化和约束解码如何塑造生成过程。

英文摘要

Structurally constrained event sequence generation remains challenging because generated paths must preserve transition feasibility, temporal order, termination, and attribute consistency. In predictive process monitoring (PPM), this challenge appears as full event sequence generation, whereas existing work mainly addresses component tasks such as next activity, remaining time, outcome, and attribute prediction. This paper proposes the Graph Grounded Cross Attention Transformer Neural Network (GGATN) for this unified PPM task. GGATN uses a global process graph as structured activity memory, contextualizes sequence positions through Transformer self attention, and injects process topology through graph grounded cross attention. Unlike autoregressive decoding, GGATN generates activities, timestamps, length, and event level and sequence level attributes in a single pass, followed by Viterbi style graph constrained decoding for feasible paths and explicit termination. Experiments on six benchmark event logs show more reliable generation quality than local instruction prompted LLM baselines. GGATN achieves strong performance on sequence similarity, Damerau Levenshtein similarity, bigram based control flow similarity, and duration distribution, while maintaining zero hallucinated activities and zero sequence level attribute inconsistency. Ablation analyses confirm the global graph encoder as a stable structural prior. Interpretability analyses show how graph structure, sequence context, feedback refinement, and constrained decoding shape generation.
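The Viterbi-style graph-constrained decoding step can be sketched generically: given per-position activity scores from any model and a process graph of allowed transitions, dynamic programming returns the highest-scoring path that never leaves the graph and ends in a permitted final activity. The data layout and names below (`scores`, `allowed`, `start`, `end`) are assumptions for illustration and do not reproduce GGATN's actual interfaces.

```python
def constrained_viterbi(scores, allowed, start, end):
    """Best feasible activity path under transition constraints.

    scores:  list of dicts, one per position, mapping activity -> log-score
    allowed: dict mapping activity -> set of permitted successor activities
    start:   set of activities allowed at the first position
    end:     set of activities allowed at the last position
    Returns the best path as a list, or None if no feasible path exists.
    """
    # best[a] = (total score, path) of the best feasible path ending in a
    best = {a: (s, [a]) for a, s in scores[0].items() if a in start}
    for step in scores[1:]:
        nxt = {}
        for activity, score in step.items():
            candidates = [
                (prev_score + score, path + [activity])
                for prev, (prev_score, path) in best.items()
                if activity in allowed.get(prev, set())
            ]
            if candidates:
                nxt[activity] = max(candidates, key=lambda c: c[0])
        best = nxt
    finals = [value for activity, value in best.items() if activity in end]
    if not finals:
        return None
    return max(finals, key=lambda c: c[0])[1]
```

Taking the per-position argmax independently can yield a sequence with a transition the process graph forbids. The dynamic program instead trades a slightly lower local score for a path that is feasible end to end, which is consistent with the zero hallucinated activities reported above.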

2606.18732 2026-06-18 cs.LG cs.CV 新提交

Low-Cost Neuromorphic Fall Detection Using Synthetic Event Data and Hybrid SNNs

低成本神经形态跌倒检测:使用合成事件数据和混合SNN

Guillermo Rojas, Gonzalo Soto, Daniel Yunge

发表机构 * School of Electrical Engineering Pontificia Universidad Católica de Valparaíso, Chile(瓦尔帕莱索天主教大学电气工程学院)

AI总结 提出混合SNN-CNN模型,从智能手机视频合成事件相机数据,实现高效准确的跌倒检测。

Comments 4 pages, 6 figures, presented at ICONS 2025 during the Poster Session, but not published

详情
AI中文摘要

本工作提出了混合模型,将脉冲神经网络(SNN)与卷积神经网络(CNN)组件集成,以从传统智能手机视频生成的模拟事件相机数据(动态视觉传感器,DVS)中学习。主要针对人类跌倒检测,该方法通过将视频帧转换为事件数据,利用SNN的能效和时空处理能力。通过多个数据集上的模拟评估所提出的模型,并将其性能与传统机器学习模型进行比较。结果表明,在不牺牲准确性的情况下显著提高了效率,强调了将SNN和DVS技术结合用于现实环境中复杂任务的潜力。

英文摘要

This work presents the development of hybrid models that integrate spiking neural networks (SNNs) with components of convolutional neural networks (CNNs) to learn from simulated event-based camera data (Dynamic Vision Sensor, DVS) generated from conventional smartphone videos. Aimed primarily at human fall detection, the approach leverages the energy efficiency and spatio-temporal processing capabilities of SNNs by converting video frames into event-based data. The proposed models are evaluated through simulations on multiple datasets, comparing their performance to that of traditional machine learning models. Results demonstrate significant gains in efficiency without sacrificing accuracy, underscoring the potential of combining SNNs and DVS technology for complex tasks in real-world environments.

2606.18857 2026-06-18 cs.LG physics.ao-ph 新提交

Investigating Inductive Biases for Machine Learning Emulation of Sudden Stratospheric Warmings in Idealised Isca Simulations

研究理想化Isca模拟中平流层突然增温的机器学习模拟的归纳偏差

Oskar Bohn Lassen, Simon Driscoll, Stephen I. Thomson, Sebastian Schemm, Francisco C. Pereira

发表机构 * Technical University of Denmark(丹麦技术大学) University of Cambridge(剑桥大学) University of Exeter(埃克塞特大学)

AI总结 测试不同架构的归纳偏差对模拟平流层突然增温动力学的影响,发现三维垂直耦合是关键,但低预测误差不保证物理一致性。

详情
AI中文摘要

机器学习模拟器越来越多地用于天气预报,并有可能通过学习动态重要的可预测性来源,将技能扩展到次季节到季节时间尺度。一个关键挑战是模型能否利用可预测性锚点,例如平流层变率,这些锚点在超出短期超前时间时影响对流层环流。我们使用配对的理想化Isca模拟测试架构归纳偏差如何影响对平流层突然增温(SSW)动力学的模拟,这些模拟仅在施加的波-2加热扰动上有所不同。在用于一步预测的卷积、变换器和基于图的架构中,当平流层动态安静时,模型差异不大,但当类似SSW的变率活跃时,差异显著扩大。我们的结果确定显式三维垂直耦合是机器学习模拟平流层动力学的关键归纳偏差。然而,Eliassen-Palm通量诊断表明,低预测误差并不能保证物理上真实的波-平均流相互作用,平流层波驱动结构中仍存在相干误差。

英文摘要

Machine-learning emulators are increasingly used for weather prediction and have the potential to extend skill on subseasonal-to-seasonal timescales by learning dynamically important sources of predictability. A key challenge is whether the models can exploit predictability anchors, such as stratospheric variability, that influence tropospheric circulation beyond short lead times. We test how architectural inductive bias affects emulation of sudden stratospheric warming (SSW) dynamics using paired idealised Isca simulations that differ only in an imposed wave-2 heating perturbation. Across convolutional, transformer, and graph-based architectures trained for one-step prediction, model differences are modest when the stratosphere is dynamically quiet but widen substantially when SSW-like variability is active. Our results identify explicit three-dimensional vertical coupling as a key inductive bias for machine-learning emulation of stratospheric dynamics. However, Eliassen-Palm flux diagnostics show that low forecast error does not guarantee physically faithful wave-mean-flow interaction, with coherent errors remaining in stratospheric wave-driving structure.

2606.18864 2026-06-18 cs.LG cs.AI 新提交

Scaling Learning-based AEB with Massive Unlabeled Data

基于大规模无标签数据的可扩展学习型自动紧急制动

Xiangyu Wang, Yang Zhan, Mengxiang Hao, Chuanchuan Zhong, Yansong Jia, Junjie Zhang, Yu Han, Xin Jiang, Zhen Cao, Ying Wang, Yulun Song, Zhitao Xu

发表机构 * Li Auto

AI总结 提出稳定元反馈半监督学习框架,通过噪声感知解耦和运动学门控伪标签,利用大规模无标签数据提升自动紧急制动性能,实现超100:1正误触发比和35%无事故里程提升。

Comments Accepted for presentation at the 2026 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

详情
AI中文摘要

本文研究如何在生产约束下,利用大规模无标签车队数据扩展基于学习的自动紧急制动(AEB)。我们的方法基于元反馈半监督学习(MF-SSL),其中教师模型为无标签驾驶数据生成伪标签,并使用小型有标签锚定集作为安全关键反馈进行更新。在生产中,锚定歧义和有标签-无标签不匹配会放大系统性的伪标签错误,导致误触发。我们提出了一种稳定的MF-SSL框架,包括:(i) 噪声感知解耦,从教师监督更新路径中移除易产生歧义的锚定;(ii) 运动学门控伪标签,结合教师冲突惩罚,抑制无标签数据上由不匹配引起的风险幻觉,同时保持广泛覆盖。大量实验表明,随着无标签数据从1M扩展到1B窗口,模型性能持续提升,在保持舒适性的同时提高了安全性。经过1B数据训练的学生模型已部署到数十万辆车辆上,并在超过10^9公里的行驶中得到验证,实现了超过100:1的正误触发比,且相比仅基于规则的基线,无事故行驶里程提升了35%。

英文摘要

This paper studies how to scale learning-based automatic emergency braking (AEB) with massive unlabeled fleet data under production constraints. Our approach is based on meta-feedback semi-supervised learning (MF-SSL), where a teacher generates pseudo labels for unlabeled driving data and is updated using a small labeled anchor set as safety-critical feedback. In production, anchor ambiguity and labeled-unlabeled mismatch can amplify systematic pseudo-label errors, leading to spurious triggers. We propose a stabilized MF-SSL framework with (i) Noise-Aware Decoupling, which removes ambiguity-prone anchors from the teacher's supervised update path, and (ii) kinematics-gated pseudo-labeling with a teacher conflict penalty to suppress mismatch-induced risk hallucinations on unlabeled data while maintaining broad coverage. Extensive experiments show consistent gains as unlabeled data scale from 1M to 1B windows, improving safety while keeping comfort stable. The 1B-trained student model is deployed to hundreds of thousands of vehicles and validated over $10^9$ km of driving, achieving a positive-to-false activation ratio exceeding 100:1 and a 35% improvement in accident-free driving mileage over a production rule-only baseline.

2606.18882 2026-06-18 cs.LG cs.AI eess.SP 新提交

Domain-Shift Aware Neural Networks for Unbalance Characterization in Rotating Systems

面向旋转系统不平衡表征的域偏移感知神经网络

Bernardo Feijó Junqueira, Claudio Kiyoshi Umezu, Bruno Bilhar Karaziack, Tomaz Junior, Daniel Alves Castello

发表机构 * Springer Nature

AI总结 提出域偏移感知神经网络,通过最大均值差异策略对齐源域与目标域特征,解决变工况下旋转轴不平衡质量估计的回归问题,实验证明该方法在域偏移未知时显著提升预测精度。

详情
AI中文摘要

本文研究了域偏移感知神经网络在回归任务中的应用,旨在估计不同运行条件下旋转轴的不平衡质量。实验数据来自一个测试台,其中主轴上安装有带不平衡质量的法兰,在不同转速下驱动,同时可选择性地激活副轴以引入域差异。不平衡质量固定在径向距离上,使用三轴加速度计记录系统的动态响应。质量估计的逆问题在域自适应框架中提出,网络采用最大均值差异策略进行训练,以对齐源域和目标域的特征表示。结果表明,显式处理域偏移能有效提高预测精度,尤其是在系统的物理行为和域偏移来源不完全已知且超出训练条件的情况下。这些发现凸显了域偏移感知模型在结构健康监测回归任务中的潜力。

英文摘要

This work investigates the application of a domain-shift aware neural network for regression tasks aimed at estimating unbalance masses in rotating shafts under varying operating conditions. Experimental data were collected from a test rig in which a primary shaft, equipped with a flange carrying unbalanced masses, was driven at different rotational speeds, while a secondary shaft could be optionally activated to introduce domain discrepancy. The unbalance masses were positioned at a fixed radial distance, and the dynamic response of the system was recorded using triaxial accelerometers. The inverse problem of mass estimation is formulated within a domain adaptation framework, where the network is trained with a maximum mean discrepancy strategy to align feature representations across source and target distributions. The results demonstrate the effectiveness of explicitly addressing domain shift in improving prediction accuracy, especially when the system's physical behavior and sources of domain discrepancy are not fully known and fall outside the training conditions. These findings highlight the potential of domain-shift aware models for regression tasks in Structural Health Monitoring.
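
As a rough illustration of the maximum mean discrepancy (MMD) idea mentioned in the abstract above, the sketch below computes a biased squared-MMD estimate between two feature batches with an RBF kernel. It is not the authors' implementation: the kernel choice, the `bandwidth` value, the function names, and the synthetic `src`/`tgt_*` batches are all assumptions made for illustration. In a domain adaptation setup, a term like this would typically be added to the regression loss so that source and target features are pulled together.

```python
import numpy as np

def rbf_kernel(a, b, bandwidth=1.0):
    """Gaussian (RBF) kernel matrix between the rows of a and the rows of b."""
    # Pairwise squared Euclidean distances, clipped at 0 to absorb round-off.
    sq = (np.sum(a**2, axis=1)[:, None]
          + np.sum(b**2, axis=1)[None, :]
          - 2.0 * a @ b.T)
    return np.exp(-np.maximum(sq, 0.0) / (2.0 * bandwidth**2))

def mmd2_biased(source, target, bandwidth=1.0):
    """Biased estimate of squared MMD between two feature batches."""
    k_ss = rbf_kernel(source, source, bandwidth).mean()
    k_tt = rbf_kernel(target, target, bandwidth).mean()
    k_st = rbf_kernel(source, target, bandwidth).mean()
    return k_ss + k_tt - 2.0 * k_st

# Synthetic stand-ins for source/target features (assumed, not from the paper).
rng = np.random.default_rng(0)
src = rng.normal(0.0, 1.0, size=(200, 4))
tgt_same = rng.normal(0.0, 1.0, size=(200, 4))   # same distribution as src
tgt_shift = rng.normal(1.5, 1.0, size=(200, 4))  # mean-shifted "target domain"

mmd_same = mmd2_biased(src, tgt_same)
mmd_shift = mmd2_biased(src, tgt_shift)
print(f"MMD^2 (same distribution): {mmd_same:.4f}")
print(f"MMD^2 (shifted target):    {mmd_shift:.4f}")
```

The shifted batch should yield a clearly larger value than the matched batch, which is the signal an MMD-based alignment term tries to drive down.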

2606.18933 2026-06-18 cs.LG cs.IR stat.ME 新提交

Zero-Shot Active Feature Acquisition via LLM-Elicitation

基于LLM启发式的零样本主动特征获取

Binyamin Perets, Natalie Mendelson, Shiran Vainberg, Yehuda Chowers, Shai Shen-Orr, Shie Mannor

发表机构 * Faculty of EE, Technion(技术学院电子工程系) Faculty of Medicine, Technion(技术学院医学院) CytoReason NVIDIA

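AI summary: Proposes a zero-shot active feature acquisition framework that elicits from an LLM only the sufficient statistics of a Markov random field, reducing the reliance on large labeled datasets; on an IBD patient cohort it outperforms the LLM itself, and its top-$k$ acquisition policy outperforms existing methods on the hardest patients.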

AI中文摘要

主动特征获取(AFA)顺序选择要观察的特征以达成分类或排序决策。其主要局限性在于依赖大量标注数据来拟合指导获取的概率模型。大型语言模型(LLM)提供无监督的领域知识,但作为序列规划者表现不佳。要求其同时知晓和决策会混淆最好分开的能力。这里,我们通过严格的启发式方法开发了一个零样本AFA框架:仅要求LLM返回其可被信任返回的内容,即马尔可夫随机场(MRF)的充分统计量——一元偏差和成对协变。我们将该框架应用于两个场景:二分类和top-$k$识别。实践中,LLM可靠地仅返回判别性统计量,即区分类别而非孤立每个类别的统计量,这阻碍了经典AFA。我们应用最大熵闭包来解决这种规范模糊性。我们在炎症性肠病(IBD)患者队列上进行评估,这是一个活跃的临床环境,其中诊断模糊性和患者异质性阻碍了稳定的治疗策略。我们的框架在真实标签和其自身提取的信念上均优于LLM。在最关键的地方,即最困难的患者上,我们的top-$k$获取策略显著优于所有现有方法。

英文摘要

Active feature acquisition (AFA) sequentially selects which features to observe to reach a classification or ranking decision. Its central limitation is reliance on large amounts of labeled data to fit the probabilistic models that guide acquisition. Large language models (LLMs) supply unsupervised domain knowledge, but are poor sequential planners. Asking an LLM to both know and decide conflates capabilities best kept separate. Here, we develop a framework for zero-shot AFA through disciplined elicitation: asking the LLM only for what it can be trusted to return, the unary deviations and pairwise co-variations that are the sufficient statistics of a Markov random field (MRF). We apply our framework to two settings: binary classification and top-$k$ identification. In practice, the LLM reliably returns only discriminative statistics (what distinguishes the classes, rather than what characterizes each class in isolation), which precludes classical AFA. We apply a maximum-entropy closure that resolves this gauge ambiguity. We evaluate on a cohort of Inflammatory Bowel Disease (IBD) patients, an active clinical setting where diagnostic ambiguity and patient heterogeneity obstruct stable treatment strategies. Our framework outperforms the LLM both on real labels and on its own extracted beliefs. Where it matters most, on the hardest patients, our top-$k$ acquisition policy markedly outperforms all existing methods.

2606.19026 2026-06-18 cs.LG cs.AI physics.ao-ph 新提交

A Hybrid LSTM--Vision Transformer Architecture for Predicting HRRR Forecast Errors

混合LSTM-视觉Transformer架构用于预测HRRR预报误差

David Aaron Evans, Jay C. Rothenberger, Kara J. Sulia, Nick P. Bassill, Chris D. Thorncroft

发表机构 * Atmospheric Sciences Research Center, University at Albany, SUNY(纽约州立大学奥尔巴尼分校大气科学研究中心) University of Oklahoma(俄克拉荷马大学) State Weather Risk Communication Center, University at Albany, SUNY(纽约州立大学奥尔巴尼分校州天气风险沟通中心)

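AI summary: Proposes a hybrid LSTM-ViT framework that combines surface-observation time series with vertical atmospheric profiles to predict HRRR forecast errors in precipitation, wind speed, and temperature; it improves on a baseline LSTM, with roughly a twofold gain in skill for precipitation forecast error.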

Comments This manuscript is a preprint and has been submitted for peer review to the Artificial Intelligence for the Earth Systems journal. The content is subject to change based on the outcome of the peer review process and should not be considered final or definitive. Copyright in this Work may be transferred without further notice

AI中文摘要

高分辨率数值天气预报(NWP)系统中的预报误差通常与未解析的边界层(PBL)过程、对流、地形诱导环流以及其他垂直结构的大气现象有关。先前的研究表明,长短期记忆(LSTM)网络可以利用中尺度观测成功预测高分辨率快速刷新(HRRR)模型的预报误差,但我们认为性能下降与复杂垂直大气演化时期有关。为解决这一局限,我们开发了一种混合LSTM-视觉Transformer(LSTM-ViT)框架,将来自地表观测的时间序列学习与来自纽约州中尺度剖面仪网络的垂直大气廓线相结合。LSTM-ViT框架被训练用于预测单个中尺度站点上HRRR的逐时降水、10米风速和2米温度预报误差。在所有三个预测变量中,相对于基线LSTM架构,引入剖面仪导出的大气结构提高了预报误差预测技能,最大提升出现在较短的预报提前期和PBL活动增强期间。对于降水预报误差,改进尤为显著,LSTM-ViT框架相对于基线LSTM实现了约两倍的预测技能提升,同时更好地捕捉了对流驱动的误差演变并减少了与PBL过程相关的退化。这些结果表明,将时间序列学习与垂直注意力机制相结合,为改进业务NWP系统中的预报误差预测提供了一条具有物理意义的途径。我们的研究为预报员提供了关于模型偏差和预报置信度的增强指导。

英文摘要

Forecast errors in high-resolution numerical weather prediction (NWP) systems are often linked to unresolved planetary boundary layer (PBL) processes, convection, terrain-induced circulations, and other vertically structured atmospheric phenomena. Previous work demonstrated that Long Short-Term Memory (LSTM) networks can successfully predict forecast errors in the High-Resolution Rapid Refresh (HRRR) model using mesonet observations, but we believe performance degradation is linked to periods of complex vertical atmospheric evolution. To address this limitation, we develop a hybrid LSTM-Vision Transformer (LSTM-ViT) framework that combines temporal sequence learning from surface observations with atmospheric profiles from the New York State Mesonet profiler network. The LSTM-ViT framework is trained to predict HRRR hourly precipitation, 10 m wind speed, and 2 m temperature forecast errors at individual mesonet stations. Across all three target variables, incorporation of profiler-derived atmospheric structure improves forecast error prediction skill relative to the baseline LSTM architecture, with the largest gains occurring at shorter forecast lead times and during periods of enhanced PBL activity. Improvements are particularly pronounced for precipitation forecast error, where the LSTM-ViT framework achieves approximately a twofold increase in predictive skill relative to the baseline LSTM while better capturing convectively driven error evolution and reducing degradation associated with PBL processes. These results demonstrate that combining temporal sequence learning with vertically informed attention mechanisms provides a physically meaningful pathway for improving forecast error prediction in operational NWP systems. Our research offers forecasters enhanced guidance regarding model bias and forecast confidence.

2606.19108 2026-06-18 cs.LG 新提交

JourneyFormer: Encoding Airbnb Guest Journey with Sequence Modeling

JourneyFormer: 使用序列建模编码Airbnb客人旅程

Daochen Zha, Chun How Tan, Xin Liu, Bin Xu, Han Zhao, Xiaowei Liu, Tracy Yu, Hui Gao, Huiji Gao, Liwei He, Stephanie Moyerman, Sanjeev Katariya

发表机构 * Airbnb

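AI summary: To handle Airbnb guest sequences that are long and exploratory with sparse booking labels, presents JourneyFormer, a sequence modeling solution covering guest event selection, ID embeddings, model architecture, and label attribution, validated by online A/B tests on two production surfaces.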

Comments Accepted by KDD 2026

AI中文摘要

序列建模因其能够建模用户历史行为并推断用户意图,在推荐和排序算法中越来越受欢迎。尽管理论简单,但由于序列的复杂性和稀疏标签,序列模型在生产中的实际部署并非易事。例如,在Airbnb中,客人序列通常较长、具有探索性且复杂,我们关注的是稀疏的预订标签。因此,我们经常需要在数据和建模方面做出各种设计决策,以在有效性和可扩展性之间取得平衡。本文深入探讨了这些生产挑战,并部署了JourneyFormer,一种用于Airbnb搜索排序的序列建模解决方案。我们详细介绍了关键的设计考虑,涵盖客人事件选择、ID嵌入、模型架构和标签归因等方面。此外,我们描述了几种加速模型训练和推理的定制策略。JourneyFormer已成功部署在Airbnb的生产环境中,其有效性和影响不仅通过改进的离线排序指标得到证明,而且通过两个生产面上的在线A/B测试在关键业务指标上取得了显著提升。

英文摘要

Sequence modeling has become increasingly popular in recommendation and ranking algorithms, owing to its capacity to model users' historical behaviors and infer user intentions. Despite its theoretical simplicity, deploying a sequence model in production is non-trivial because the sequences are complex and the labels are sparse. For example, at Airbnb, guest sequences are often long, exploratory, and complex, and we focus on booking labels, which are sparse. As such, we are often required to make various design decisions regarding data and modeling to strike a balance between effectiveness and scalability. This work delves into these production challenges and presents the deployment of JourneyFormer, a sequence modeling solution for search ranking at Airbnb. We detail crucial design considerations, covering aspects such as guest event selection, ID embeddings, model architecture, and label attribution. Additionally, we describe several tailored strategies to accelerate model training and inference. JourneyFormer has been successfully deployed in production at Airbnb, where its effectiveness and impact have been evidenced not only by improved offline ranking metrics but also by significant gains in key business metrics through online A/B testing across two production surfaces.

2606.19140 2026-06-18 cs.LG 新提交

ChronoSurv: A Clinical Pathway-Guided Graph Framework for Multimodal Survival Analysis

ChronoSurv:一种临床路径引导的多模态生存分析图框架

Hugo Miccinilli, Theo Di Piazza

发表机构 * Université Paris-Saclay, CentraleSupélec, MICS, France(巴黎-萨克雷大学,中央理工-高等电力学院,MICS,法国) University of Lyon, INSA Lyon, CREATIS, France(里昂大学,INSA里昂,CREATIS,法国)

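AI summary: Proposes ChronoSurv, a directed-graph framework for multimodal survival analysis that models clinical trajectories with a hierarchical topology and heterogeneous message passing, reporting state-of-the-art discriminative performance with reliable calibration on head and neck cancer datasets.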

Comments Accepted at MICCAI 2026. Submitted version due to embargo

AI中文摘要

准确的生存预测对于头颈癌的个性化治疗计划至关重要,但由于多模态临床数据的异质性和高维性,这仍然具有挑战性。虽然深度生存模型在预测性能上优于经典统计方法,但现有方法通常依赖于静态融合策略或时间无关建模,限制了其捕捉结构化临床工作流程的能力。在这项工作中,我们提出了ChronoSurv,一种用于多模态生存分析的异质层次有向图框架。ChronoSurv使用与关键诊断步骤对齐的有向图,将患者护理表示为进展感知的临床轨迹。层次拓扑包含细粒度、粗粒度和全局表示,进一步支持对缺失模态的灵活适应,而异质消息传递则建模了跨模态和临床步骤的复杂非对称关系。在两个公共数据集上的实验结果表明,ChronoSurv在保持统计可靠校准的同时,实现了最先进的判别性能。全面的消融研究进一步证实了每个架构组件的贡献,突出了轨迹感知图建模在多模态生存预测中的潜力。

英文摘要

Accurate survival prediction is essential for personalized treatment planning in head and neck cancer, yet remains challenging due to the heterogeneous and high-dimensional nature of multimodal clinical data. While deep survival models have improved predictive performance over classical statistical approaches, existing methods typically rely on static fusion strategies or temporally agnostic modeling, limiting their ability to capture structured clinical workflows. In this work, we propose ChronoSurv, a heterogeneous hierarchical directed graph framework for multimodal survival analysis. ChronoSurv represents patient care as a progression-aware clinical trajectory using directed graphs aligned with key diagnostic steps. A hierarchical topology incorporates fine-grained, coarse, and global representations, further supporting flexible adaptation to missing modalities, while heterogeneous message passing models complex and asymmetric relationships across modalities and clinical steps. Experimental results on two public datasets demonstrate that ChronoSurv achieves state-of-the-art discriminative performance while maintaining statistically reliable calibration. Comprehensive ablation studies further confirm the contribution of each architectural component, highlighting the potential of trajectory-aware graph modeling for multimodal survival prediction.

2606.19230 2026-06-18 cs.LG cs.HC stat.ML 新提交

A Human-in-the-Loop Bayesian Optimization Framework for Constraint-Aware Bioprocess Development

一种面向约束感知的生物过程开发的人机协同贝叶斯优化框架

Samuel Stricker, Claus Wirnsperger, Alessandro Butté, Laura Helleckes, Gonzalo Guillén Gosálbez, Antonio del Rio Chanona, Mehmet Mercangöz

发表机构 * Imperial College London(伦敦帝国理工学院) DataHow AG ETH Zurich(苏黎世联邦理工学院)

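AI summary: Extends Pareto Front Guided Sampling by treating the probability of constraint satisfaction and robustness, both derived from a Gaussian process surrogate, as objectives of a multi-objective problem, and pairs this with an interactive dashboard for human-in-the-loop, constraint-aware bioprocess optimization.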

AI中文摘要

本文提出了帕累托前沿引导采样(PFGS)的一种扩展,这是一种人机协同(HitL)贝叶斯优化(BO)框架,其中高斯过程(GP)代理导出的量被重新表述为多目标优化问题的目标,得到的帕累托前沿暴露给领域专家进行交互式候选选择,而不是返回单一的自动推荐。该框架在两个方向上进行了扩展:约束优化通过将满足输出规格限的后验概率作为显式的帕累托目标来处理,该概率从GP后验分布解析计算得到;鲁棒优化通过蒙特卡洛采样策略来处理,该策略估计在用户定义的输入扰动变异性下的期望下置信性能,捕捉在可能的实现偏差下的性能退化。由此产生的多维帕累托表示通过交互式仪表盘上的成对二维投影同时显示预测性能、模型不确定性、概率约束满足和输入鲁棒性之间的权衡,使得选择标准能够随着代理模型的改进和开发目标的演变而迭代细化。该框架在一个八维的补料分批中国仓鼠卵巢(CHO)细胞培养模拟器上进行了展示,证明了系统性地识别高性能、满足可行性且对扰动具有鲁棒性的操作条件,并说明了专家定义的需求如何提供原则性的停止标准并支持实验资源的明智分配。

英文摘要

This work presents an extension to Pareto Front Guided Sampling (PFGS), a Human-in-the-Loop (HitL) Bayesian Optimization (BO) framework in which Gaussian process (GP) surrogate-derived quantities are reformulated as objectives of a multi-objective optimization problem, and the resulting Pareto front is exposed to a domain expert for interactive candidate selection rather than returning a single automated recommendation. The framework is extended in two directions: constrained optimization is addressed by incorporating the posterior probability of satisfying output specification limits as an explicit Pareto objective, computed analytically from the GP posterior distribution; robust optimization is addressed by a Monte Carlo sampling strategy that estimates expected lower-confidence performance over a user-defined variability of input perturbations, capturing performance degradation under likely implementation deviations. The resulting multi-dimensional Pareto representation renders trade-offs between predicted performance, model uncertainty, probabilistic constraint satisfaction, and input robustness simultaneously visible through pairwise two-dimensional projections on an interactive dashboard, enabling selection criteria to be iteratively refined as the surrogate model improves and development objectives evolve. The framework is showcased on an eight-dimensional fed-batch Chinese Hamster Ovary (CHO) cell culture simulator demonstrating systematic identification of high-performing, feasibility-compliant, and perturbation-resilient operating conditions, and illustrating how expert-defined requirements provide a principled stopping criterion and support informed allocation of experimental resources.
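
The abstract above notes that the probability of satisfying output specification limits can be computed analytically from the GP posterior. A minimal sketch of that calculation follows, assuming the posterior at a candidate point is a univariate Gaussian with mean `mean` and standard deviation `std`, so that $P(l \le y \le u) = \Phi((u-\mu)/\sigma) - \Phi((l-\mu)/\sigma)$. The function names and the example numbers are hypothetical and are not taken from the paper.

```python
import math

def normal_cdf(z):
    """Standard normal CDF, written with the error function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def prob_within_spec(mean, std, lower=-math.inf, upper=math.inf):
    """P(lower <= y <= upper) for y ~ N(mean, std^2).

    Here mean and std stand in for a GP posterior mean and standard
    deviation at one candidate operating point (an assumption of this sketch).
    """
    if std <= 0.0:
        raise ValueError("std must be positive")
    return normal_cdf((upper - mean) / std) - normal_cdf((lower - mean) / std)

# Hypothetical candidate: predicted output 4.2 with uncertainty 0.5,
# and a specification window of [3.5, 5.0].
p_feasible = prob_within_spec(mean=4.2, std=0.5, lower=3.5, upper=5.0)
print(f"Probability of meeting the specification: {p_feasible:.3f}")
```

Used as a Pareto objective, a quantity like `p_feasible` lets an expert trade predicted performance against the chance of violating a specification, rather than applying a hard feasibility cut.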

2606.19255 2026-06-18 cs.LG 新提交

SCAN: Enhance Time Series Anomaly Detection via Multi-Scale Neighborhood-Centered Clustering

SCAN: 通过多尺度邻域中心聚类增强时间序列异常检测

Xingze Zheng, Hanyin Cheng, Siyuan Wang, Yiting Hao, Peng Chen, Yuan Jun, Yang Shu

发表机构 * East China Normal University(华东师范大学) APPLab, Huawei(华为2012应用实验室) Huawei(华为)

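AI summary: Proposes SCAN, which enhances reconstruction-based anomaly detection with multi-scale clustering: cluster-center representations of normal patterns constrain reconstruction at the representation level, cluster membership probability is combined with reconstruction error at the anomaly-criterion level, and neighborhood-centered representations improve clustering; it reports state-of-the-art results on multiple real-world datasets.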

AI中文摘要

时间序列异常检测在广泛的现实应用中扮演着关键角色。基于重建的方法已成为主流范式,但它们面临过度泛化和欠泛化问题,且难以平衡。为了解决这一问题,我们引入多尺度聚类来增强基于重建的方法。在表示层面,我们整合正常模式的聚类中心表示,以约束模型针对代表性正常模式进行重建,防止强大能力和表示能力的主导。在异常判据层面,我们基于聚类成员概率推导异常置信度分数,并将其与重建误差结合,提供双重检测标准。此外,聚类中心表示和异常置信度分数的有效性取决于聚类性能。因此,我们提取邻域中心表示用于多视图聚类,以提高聚类性能。在来自不同应用领域的多个真实数据集上的大量实验表明,SCAN达到了最先进的性能。

英文摘要

Time series anomaly detection plays a crucial role in a wide range of real-world applications. Reconstruction-based methods have become the mainstream paradigm, but they suffer from over-generalization and under-generalization problems, which are challenging to balance. To address this, we introduce multi-scale clustering to enhance reconstruction-based methods. At the representation level, we integrate the cluster center representations of normal patterns to constrain the model to reconstruct representative normal patterns, preventing the model's strong capacity and representation capability from dominating. At the anomaly criterion level, we derive an anomaly confidence score based on cluster membership probability and combine it with the reconstruction error, providing dual criteria for detection. Furthermore, the effectiveness of the cluster center representations and the anomaly confidence score depends on the clustering performance. Accordingly, we extract neighborhood-centered representations for multi-view clustering to improve clustering performance. Extensive experiments on multiple real-world datasets from diverse application domains demonstrate the state-of-the-art performance of SCAN.

2606.19292 2026-06-18 cs.LG 新提交

Risk Stratification for ICU Delirium using Pervasive Ambient Sensing Information

使用普适环境感知信息进行ICU谵妄风险分层

Jiaqing Zhang, Sabyasachi Bandyopadhyay, Miguel Contreras, Jessica Sena, Yuanfang Ren, Andrea Davidson, Ziyuan Guan, Tezcan Ozrazgat-Baslanti, Subhash Nerella, Azra Bihorac, Parisa Rashidi

发表机构 * University of Florida(佛罗里达大学) Stanford University(斯坦福大学)

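AI summary: Uses ambient sound and light intensity data with efficient sequential neural network models to predict delirium risk in ICU patients; sound is the dominant predictor, adding light improves short-term prediction, and the best AUC is 0.80.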

AI中文摘要

谵妄是重症监护室(ICU)中常见且严重的并发症,与发病率增加、住院时间延长和医疗成本升高相关。尽管其普遍存在,早期预测和预防仍具挑战性。环境因素如环境声音和光照可能影响谵妄的发生,但在风险评估中常被忽视。在本研究中,我们检验了光照强度和声压级是否能在多个预测时间窗口内独立预测谵妄。我们评估了四种高效的序列神经网络模型,这些模型基于来自9个ICU的309名患者的数据,用于预测10种预测窗口大小的谵妄。我们使用Shapley Additive Explanations分析报告了特征重要性和影响方向。卷积模型实现了最强的区分能力,在声音数据和组合数据上的AUC均为0.80。声音特征是整体上的主要预测因子。将声音与光照结合改善了短期(<1周)预测,组合模型在感知期后立即分配最高风险。这些发现表明,被动环境感知,尤其是声音,可以为谵妄风险评估增加临床上有意义、可解释的信号,并为丰富多模态ICU预测和预防策略提供实用途径。

英文摘要

Delirium is a common and serious complication in the Intensive Care Unit (ICU), associated with increased morbidity, prolonged hospital stays, and higher healthcare costs. Despite its prevalence, early prediction and prevention remain challenging. Environmental factors such as ambient sound and light may influence the onset of delirium, yet they are often overlooked in risk assessments. In this study, we examined whether light intensity and sound pressure levels can independently predict delirium across multiple prediction horizons. We evaluated four efficient sequential neural network models on data collected from 9 ICUs across 309 patients to predict delirium for 10 prediction-window sizes. We reported feature importance and direction of influence using Shapley Additive Explanations analysis. The convolutional model achieved the strongest discrimination, with AUC = 0.80 on sound data and on combined data. Sound features were the dominant predictors overall. Integrating sound with light improved short-term ($<1$ week) prediction, with the combined model assigning the highest risk immediately after the sensing period. These findings suggest that passive ambient sensing, especially sound, can add a clinically meaningful, interpretable signal for delirium risk estimation and offer a practical pathway to enrich multimodal ICU prediction and prevention strategies.

13. Other / General Machine Learning (1 paper)

2606.19317 2026-06-18 cs.LG cs.AI 新提交

Explaining Attention with Program Synthesis

用程序合成解释注意力机制

Amiri Hayes, Belinda Li, Jacob Andreas

发表机构 * NJIT(新泽西理工学院) MIT EECS(麻省理工学院电气工程与计算机科学系) MIT CSAIL(麻省理工学院计算机科学与人工智能实验室)

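AI summary: Proposes approximating the behavior of deep network components with executable programs; for Transformer attention heads, generated Python programs reproduce the attention patterns, making the heads interpretable.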

AI中文摘要

可解释深度学习研究的一个长期目标是,用人类可理解的符号描述取代不透明的神经计算。本文提出了一种用可执行程序近似深度网络组件行为的方法。我们专注于Transformer语言模型中的注意力头。对于给定的注意力头,我们首先在一组随机选择的训练样本上计算其关联的注意力矩阵。接着,我们向预训练语言模型提供这些矩阵的摘要,并指示它生成一组Python程序,这些程序仅根据输入句子中的文本即可再现相关的注意力模式。最后,我们根据最终程序集在保留输入上预测行为的效果对程序进行重新排序。我们证明,少于1000个这样的生成程序即可再现GPT-2、TinyLlama-1.1B和Llama-3B中注意力头的注意力模式,在TinyStories上平均交并比相似度超过75%。此外,最佳匹配程序可以替代神经注意力头而不会显著影响模型行为:在三个模型中用程序替代25%的注意力头仅导致平均困惑度增加16%,同时在各种下游问答基准上保持性能。这项工作为使用人类可读、可执行的代码逆向工程Transformer模型中的注意力头提供了一个可扩展的流程,推动了神经模型向符号透明性的发展。

英文摘要

A longstanding goal of research on interpretable deep learning is to replace opaque neural computations with human-meaningful symbolic descriptions. In this paper, we propose an approach for approximating the behavior of components of deep networks with executable programs. We focus on attention heads in transformer language models. For a given head, we first compute its associated attention matrices on a collection of randomly selected training examples. Next, we prompt a pre-trained language model with a summary of these matrices, and instruct it to generate a set of Python programs that can reproduce the associated attention patterns given only text from the input sentence. Finally, we re-rank programs according to how well our final set of programs predicts behavior on held-out inputs. We demonstrate that a set of fewer than 1,000 such generated programs can reproduce the attention patterns of heads in GPT-2, TinyLlama-1.1B, and Llama-3B, achieving an average Intersection-over-Union similarity above 75% on TinyStories. Moreover, the best-fit programs can replace neural attention heads without substantially affecting model behavior: replacing 25% of attention heads with programmatic surrogates across the three models incurs only a 16% average perplexity increase, while maintaining performance on a variety of downstream question answering benchmarks. This work contributes a scalable pipeline for reverse-engineering attention heads in transformer models using human-readable, executable code, advancing a path toward symbolic transparency in neural models.
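
To make the idea of scoring a program against an attention head more concrete, the sketch below pairs a toy "attend to the previous token" program with a soft Intersection-over-Union score between two attention matrices. Everything here is an assumption for illustration: the abstract does not specify how the IoU similarity is computed, so the `soft_iou` definition (sum of elementwise minima over sum of elementwise maxima), the `previous_token_program` function, and the `reference` matrix are hypothetical stand-ins rather than the paper's method or data.

```python
import numpy as np

def soft_iou(a, b):
    """Soft IoU of two non-negative matrices: sum(min) / sum(max)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    union = np.maximum(a, b).sum()
    if union == 0.0:
        return 1.0  # two all-zero patterns are treated as identical
    return float(np.minimum(a, b).sum() / union)

def previous_token_program(tokens):
    """Toy program: each token attends to the token immediately before it.

    The first token has no predecessor, so it attends to itself.
    """
    n = len(tokens)
    attn = np.zeros((n, n))
    attn[0, 0] = 1.0
    for i in range(1, n):
        attn[i, i - 1] = 1.0
    return attn

tokens = ["the", "cat", "sat", "down"]
predicted = previous_token_program(tokens)

# Hypothetical "observed" head: 0.8 on the previous token, 0.2 on itself.
reference = np.zeros((4, 4))
reference[0, 0] = 1.0
for i in range(1, 4):
    reference[i, i - 1] = 0.8
    reference[i, i] = 0.2

score = soft_iou(predicted, reference)
print(f"soft IoU between program and reference head: {score:.3f}")
```

A re-ranking step along the lines described in the abstract could then sort candidate programs by such a score on held-out sentences.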