2605.30336 2026-05-29 cs.LG 版本更新

Fairness-Aware Federated Learning with Trajectory Shapley Value

基于轨迹Shapley值的公平感知联邦学习

Daniel Kuznetsov, Ziqi Wang

发表机构 * Faculty of Mathematics, Ecole Normale Supérieure Paris-Saclay（巴黎-萨克雷大学数学系）； Chair for Dynamics, Control, Machine Learning and Numerics – Alexander von Humboldt Professorship, Department of Mathematics, Friedrich-Alexander-Universität Erlangen-Nürnberg（埃尔兰根-纽伦堡弗里德里希-亚历山大大学数学系）

AI总结提出轨迹Shapley值作为贡献度量，并设计FedTSV自适应聚合方法，以解决联邦学习中固定权重导致的偏倚和不稳定问题，实现公平、鲁棒且高效的协作学习。

Comments Accepted for publication at the 24th European Control Conference (ECC 2026)

详情

AI中文摘要

联邦学习是一种新兴的分布式范式，解决了由异构、隐私敏感数据带来的挑战。它允许多个客户端通过聚合其在服务器上的本地更新来协作训练模型。然而，传统的聚合方案通常使用固定权重，无法反映客户端贡献的不平等和时变特性，导致学习过程偏倚且不稳定。为了提高公平性和稳定性，我们提出了轨迹Shapley值（TSV），这是一种贡献度量，通过基于验证的、时间一致的效用评估每个客户端如何影响全局模型的优化轨迹。基于TSV，我们设计了FedTSV，一种自适应聚合方法，将每轮评估转换为动态客户端权重，使服务器能够实时响应异构和对抗性参与。在基准数据集上的实验表明，FedTSV加速了收敛，提高了鲁棒性，并产生了更公平的贡献评估，从而为公平感知的联邦优化提供了原则性基础。

英文摘要

Federated learning is an emerging distributed paradigm that addresses the challenges posed by heterogeneous, privacy-sensitive data. It enables multiple clients to train a model collaboratively by aggregating their local updates at a server. However, conventional aggregation schemes typically use fixed weights that fail to reflect unequal and time-varying client contributions, leading to biased and unstable learning. To improve fairness and stability, we propose the Trajectory Shapley Value (TSV), a contribution metric that evaluates how each client influences the optimization trajectory of the global model using a validation-based, temporally consistent utility. Building on TSV, we design FedTSV, an adaptive aggregation method that converts per-round evaluations into dynamic client weights, allowing the server to respond to heterogeneous and adversarial participation in real time. Experiments on benchmark datasets show that FedTSV accelerates convergence, improves robustness, and yields more equitable contribution assessments, thereby providing a principled foundation for fairness-aware federated optimization.

URL PDF HTML ☆

赞 0 踩 0

2605.30330 2026-05-29 cs.LG 版本更新

When, why, and how do diffusion posterior samplers fail? A finite-sample lens

何时、为何以及如何扩散后验采样器失败？一个有限样本视角

Benjamin A. Burns, Sara Fridovich-Keil

发表机构 * Georgia Institute of Technology（佐治亚理工学院）

AI总结本文从有限样本视角分析扩散后验采样器中似然近似误差导致后验分布偏差的原因，发现中间时间步的后验扩散估计不准确会导致模式加权错误和幻觉，并提出一种与近似类型无关的诊断方法。

Comments All code for experiments is available at: https://github.com/voilalab/diagnosing-posterior-sampling

详情

AI中文摘要

扩散模型具有对自然数据复杂分布进行建模的出色能力，这使其成为成像逆问题中后验采样的流行且有效的选择。现有方法可以在推理时融入任何测量模型，但为了计算可行性，必须在中间时间步使用不精确的似然近似。尽管这些近似通常在经验上效果良好，但它们对采样后验的下游影响尚不清楚，并可能导致无法解释的失败。为了理解这些似然近似何时、为何以及如何传播到错误的后验分布，我们引入了一个有限样本视角的后验采样，该视角在训练集大小趋于无穷时，对于任何前向模型和先验分布，都能以任意精度逼近后验。使用这个有限样本透镜，我们观察到流行的后验采样近似倾向于在中间时间步低估或高估后验的扩散，导致下游后果，包括对早期停止时间的敏感性、后验模式的相对权重不准确以及幻觉，既包括后验中不存在的先验模式，也包括先验不支持的似然模式。此外，我们发现这些后验误差的原因既不需要非线性测量模型也不需要多模态后验，而可能仅仅由于多模态先验和中间采样时间步的后验扩散不准确而产生。我们的有限样本后验采样方法对似然近似的类型和（线性或非线性）前向模型的类型不可知，因此可以作为即插即用的诊断工具，用于评估现有和未来后验采样器的准确性和失败模式。

英文摘要

Diffusion models have excellent capacity to model complex distributions of natural data, which has made them a popular and effective choice for posterior sampling in imaging inverse problems. Existing methods can incorporate any measurement model at inference time but must use an inexact approximation for the likelihood at intermediate timesteps for computational tractability. Although these approximations can often work well empirically, their downstream effect on the sampled posterior is poorly understood and can result in unexplained failures. To understand when, why, and how these likelihood approximations propagate to erroneous posterior distributions, we introduce a finite-sample perspective on posterior sampling that approximates the posterior to arbitrary precision as training set size tends towards infinity, for any forward model and prior distribution. Using this finite-sample lens, we observe that popular posterior sampling approximations tend to under- or over-estimate the spread of the posterior at intermediate timesteps, causing downstream consequences including sensitivity to early stopping time, inaccurate relative weighting of posterior modes, and hallucination, both of prior modes that are not in the posterior and likelihood modes that are not supported by the prior. Moreover, we find that the cause of these posterior errors requires neither a nonlinear measurement model nor a multimodal posterior, but can arise solely due to a multimodal prior and inaccurate posterior spread at intermediate sampling times. Our finite-sample posterior sampling approach is agnostic to the type of likelihood approximation and the type of (linear or nonlinear) forward model, and can thus serve as a drop-in diagnostic to evaluate the accuracy and failure modes of existing and future posterior samplers.

URL PDF HTML ☆

赞 0 踩 0

2605.30329 2026-05-29 cs.LG 版本更新

SoundnessBench: Can Your AI Scientist Really Tell Good Research Ideas from Bad Ones?

SoundnessBench：你的AI科学家真的能区分好的研究想法和坏的吗？

Sy-Tuyen Ho, Minghui Liu, Huy Nghiem, Furong Huang

发表机构 * University of Maryland College Park（马里兰大学College Park分校）

AI总结提出SoundnessBench基准，通过ICLR提交的1099个机器学习研究提案评估LLM判断研究想法方法论合理性的能力，发现当前LLM存在普遍乐观偏差，无法可靠作为科学严谨性的独立初筛评估者。

Comments Project Page: https://hosytuyen.github.io/projects/SoundnessBench

详情

AI中文摘要

自主AI研究智能体旨在通过自动化研究流程（从假设生成到同行评审）加速科学发现。然而，现有基准很少测试一个基本瓶颈：大型语言模型在投入时间和计算资源之前，能否判断研究想法的方法论可行性。我们引入了SoundnessBench，一个从ICLR提交中重建的1099个机器学习研究提案的精选基准，标注了评审者的合理性子分数，并对照源论文进行了审计。SoundnessBench应被解释为可恢复的提案阶段合理性基准，而非对完整论文评审结果的精确预测。在12个前沿LLM中，我们发现了一个普遍的乐观偏差：在标准提示下，模型经常将低合理性的提案评为合理，而激进提示则主要将错误从假阳性转移到假阴性。对公共语料污染、论文识别短语、表面特征和人工审计质量的额外控制表明，这种行为不能由单一混杂因素解释。我们的结果表明，当前LLM作为科学严谨性的独立初筛评估者尚不可靠。

英文摘要

Autonomous AI research agents aim to accelerate scientific discovery by automating the research pipeline, from hypothesis generation to peer review. However, existing benchmarks rarely test a fundamental bottleneck: whether Large Language Models can judge the methodological viability of a research idea before expending time and computational resources. We introduce SoundnessBench, a curated benchmark of 1,099 machine-learning research proposals reconstructed from ICLR submissions, labeled with reviewer soundness sub-scores, and audited against source papers. SoundnessBench should be interpreted as a benchmark for recoverable proposal-stage soundness rather than exact prediction of full-paper review outcomes. Across 12 frontier LLMs, we find a pervasive optimism bias: under standard prompting, models frequently rate low-soundness proposals as sound, while aggressive prompting largely shifts errors from false positives to false negatives. Additional controls for public-corpus contamination, paper-identifying phrases, surface features, and human audit quality suggest that this behavior is not explained by a single confounder. Our results indicate that current LLMs are not yet reliable as standalone first-gate evaluators for scientific rigor.

URL PDF HTML ☆

赞 0 踩 0

2605.30327 2026-05-29 cs.LG cs.AI cs.CL math.ST stat.ML stat.TH 版本更新

Reasoning with Sampling: Cutting at Decision Points

基于采样的推理：在决策点进行裁剪

Felix Zhou, Anay Mehrotra, Quanquan C. Liu

发表机构 * Yale University（耶鲁大学）； Stanford University（斯坦福大学）

AI总结提出Entropy-Cut Metropolis-Hastings算法，利用基础模型的下一词元熵作为代理识别关键决策点并重新采样，从而高效地从幂分布中采样以增强推理能力，在多个基准上超越基线和RL训练模型。

详情

AI中文摘要

前沿推理模型是通过对基础语言模型进行强化学习后训练而产生的。最近的研究对此提出了挑战，表明从基础模型分布的锐化版本（即所谓的幂分布）中采样，无需额外训练、精心策划的数据集或验证器，就能产生可比的推理能力。然而，使这种方法实用化需要高效地从幂分布中采样。采样器需要“混合”到幂分布，这需要在目标分布的模态之间移动；直观地说，例如尝试不同的推理策略。先前工作中提出的采样器反复在当前推理轨迹中均匀随机选择一个“裁剪”位置，并从该位置开始重新采样后缀。然而，推理轨迹通常包含少数关键决策（例如，证明策略或算法的选择），我们观察到均匀选择的裁剪往往重写局部细节，而不是重新审视决策点。我们引入了一种算法（Entropy-Cut Metropolis-Hastings），该算法使用基础模型的下一词元熵作为代理来识别关键决策点，并从这些位置重新采样。我们通过实验验证了熵跳变是决策点的有用代理，并在一个风格化的推理模型中证明了我们的方法的混合时间与轨迹中的决策数量成比例，而不是与可能大得多的词元数量成比例。在MATH500、HumanEval、GPQA Diamond和AIME26上，我们的方法始终优于基线和RL训练模型。

英文摘要

Frontier reasoning models are produced by posttraining base language models with reinforcement learning. Recent work has challenged this by showing that sampling from a sharpened version of the base model's distribution, a so-called power distribution, elicits comparable reasoning without additional training, curated datasets, or verifiers. However, making this method practical requires efficiently sampling from the power distribution. A sampler needs to "mix" to the power distribution, which necessitates moving between modes of the target distribution; intuitively, e.g., trying different reasoning strategies. The samplers proposed in prior works repeatedly select a "cut" position in the current reasoning trace uniformly at random and resample the suffix from that position onward. However, reasoning traces typically contain a few consequential decisions (e.g., the choice of proof strategy or algorithm), and we observe that a uniformly chosen cut tends to rewrite local details rather than revisit decision points. We introduce an algorithm (Entropy-Cut Metropolis-Hastings) that uses the base model's next-token entropy as a proxy to identify key decision points and resample from those positions. We empirically verify that entropy jumps are a useful proxy for decision points and, in a stylized model of reasoning, prove that our method's mixing time scales with the number of decisions in a trace rather than with the number of tokens, which can be much larger. Across MATH500, HumanEval, GPQA Diamond, and AIME26, our method consistently improves over baselines and RL-trained models.

URL PDF HTML ☆

赞 0 踩 0

2605.30324 2026-05-29 cs.DS cs.AI cs.CL cs.LG stat.ML 版本更新

On Language Generation in the Limit with Bounded Memory

有界记忆下的极限语言生成

Jon Kleinberg, Anay Mehrotra, Amin Saberi, Grigoris Velegkas

发表机构 * Cornell University（康奈尔大学）； Stanford University（斯坦福大学）； Google Research（谷歌研究）

AI总结研究有界记忆下语言生成的极限问题，通过组合界和滑动窗口分析记忆约束对可生成性、密度和识别的影响。

Comments The abstract has been shortened to fit within the arXiv limit

详情

AI中文摘要

我们研究有界记忆下的极限语言生成。在该任务中，学习器每次观察来自未知目标语言的一个示例，并且必须最终只输出新的有效示例。先前的工作假设可以访问整个历史，这是一个强假设，因为实际算法只保留有限的过去信息。学习理论中的经典工作表明，记忆约束会显著改变可学习性；我们将此扩展到语言生成。首先，我们研究无记忆生成器。在温和的枚举限制下，每个可数无限语言集合仍然可以在没有记忆的情况下生成。没有这个限制，我们精确刻画了何时无记忆生成是可能的。对于有限集合，我们刻画了无记忆生成器可实现的最优极小极大密度——针对任何给定大小的集合所能保证的最佳密度。这个组合界依赖于Sperner定理和对称链分解。我们进一步表明，最后$W$个示例的滑动窗口不会改善这种最坏情况密度，而允许存储$b$个自适应选择的过去示例则会改善每个$b \geq 1$的可实现密度。最后，我们重新审视极限识别，其中学习器必须收敛到目标语言的单个正确假设。我们关注其增量变体，其中学习器只记住其之前的猜测。在这里，尽管精确识别在仅包含三种语言的集合上失败，但一个温和的松弛——要求收敛到目标的“近似”版本——对于每个有限集合都是可实现的。这些结果表明，有界记忆对这些任务的影响不同：生成对于每个可数集合仍然可实现，而密度和识别仅限于有限集合，且随着集合增长保证减弱。

英文摘要

We study language generation in the limit under bounded memory. In this task, a learner observes examples from an unknown target language one at a time and must eventually output only new valid examples. Prior work assumes access to the entire history, a strong assumption since realistic algorithms retain limited past information. Classical work in learning theory shows memory constraints dramatically alter learnability; we extend this to language generation. First, we study memoryless generators. Under a mild enumeration restriction, every countable collection of infinite languages remains generable without memory. Without this restriction, we exactly characterize when memoryless generation is possible. For finite collections, we characterize the optimal minimax density achievable by memoryless generators -- the best density guaranteed against any collection of a given size. This combinatorial bound relies on Sperner's theorem and symmetric chain decompositions. We further show that a sliding window of the last $W$ examples does not improve this worst-case density, whereas allowing it to store $b$ adaptively chosen past examples improves the achievable density for every $b \geq 1$. Finally, we revisit identification in the limit, where the learner must converge to a single correct hypothesis for the target language. We focus on its incremental variant, where the learner remembers only its previous guess. Here, although exact identification fails on a collection of just three languages, a mild relaxation requiring convergence to an ``approximate'' version of the target is achievable for every finite collection. These results show bounded memory affects these tasks differently: generation remains achievable for every countable collection, while density and identification are confined to finite collections, with guarantees weakening as the collection grows.

URL PDF HTML ☆

赞 0 踩 0

2605.30323 2026-05-29 cs.LG cs.AI 版本更新

In-Context Reward Adaptation for Robust Preference Modeling

上下文奖励自适应用于鲁棒偏好建模

Zhenyu Sun, Zheng Xu, Ermin Wei

发表机构 * Northwestern University（西北大学）； Meta Superintelligence Labs（Meta超智能实验室）

AI总结提出基于Transformer的上下文奖励自适应框架，通过少量偏好示例和人类反应时间辅助信号，在线建模多样且未见的人类偏好，实现鲁棒的偏好建模和分布偏移适应。

详情

AI中文摘要

TriSearch：通过双星翻转学习优化三角剖分

Yiran Wang, Guido Montúfar

发表机构 * UCLA（加州大学洛杉矶分校）； MPI MiS（马克斯·普朗克研究所（MiS））

AI总结提出基于强化学习的框架TriSearch，利用电路支撑的子三角剖分动作表示，通过双星翻转优化多面体三角剖分目标，实现零样本泛化到更大实例。

详情

AI中文摘要

我们引入了TriSearch，这是一个强化学习框架，用于通过双星翻转优化多面体三角剖分上的目标。关键思想是一种电路支撑的子三角剖分动作表示：可行的翻转由其支撑电路和实现的局部子三角剖分编码，使得学习策略能够利用局部几何和组合特征对它们进行排序。这产生了一个维度无关的接口，并能够在不显式枚举整个三角剖分空间的情况下高效遍历翻转图。在3D和4D中实例化后，TriSearch从小的训练实例零样本泛化到具有指数级更大搜索空间的大型多面体。它在3D中的度量目标上达到了顶级性能，并且在4D中，在固定预算下，发现了比现有采样器更多的自反多面体的不同精细、正则、星形三角剖分，对应于Calabi-Yau三维流形。

英文摘要

We introduce TriSearch, a reinforcement learning framework for optimizing objectives over triangulations of a polytope via bistellar flips. The key idea is a circuit-supported subtriangulation action representation: feasible flips are encoded by their supporting circuit and realized local subtriangulation, enabling a learned policy to rank them using local geometric and combinatorial features. This yields a dimension-agnostic interface and enables efficient traversal of the flip graph without explicit enumeration of the full triangulation space. Instantiated in 3D and 4D, TriSearch generalizes zero-shot from small training instances to larger polytopes with exponentially larger search spaces. It achieves top performance on metric objectives in 3D and, in 4D, discovers more distinct Fine, Regular, Star triangulations of reflexive polytopes, corresponding to Calabi-Yau threefolds, than existing samplers under a fixed budget.

URL PDF HTML ☆

赞 0 踩 0

2605.30219 2026-05-29 cs.AI cs.CL cs.LG 版本更新

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

模型何时应改变想法？大语言模型中的上下文信念管理

Haoming Xu, Weihong Xu, Zongrui Li, Mengru Wang, Yunzhi Yao, Chiyu Wu, Jin Shang, Yu Gong, Shumin Deng

发表机构 * Zhejiang University（浙江大学）； HomologyAI

AI总结提出上下文信念管理（CBM）框架，通过引入BeliefTrack基准和信念状态奖励的强化学习，将大语言模型在长程交互中的信念更新失败率平均降低70.9%。

Comments Work in progress

详情

AI中文摘要

长程交互要求语言模型管理累积信息：何时更新状态、何时保持状态、以及忽略什么。我们将这一挑战研究为 extbf{上下文信念管理（CBM）}：在隔离任务无关噪声的同时，维护与形式证据对齐的预测信念状态。为了使CBM可测量，我们引入了BeliefTrack，一个涵盖规则发现和电路诊断的封闭世界基准，其中有限的信念空间和符号验证器支持精确的逐轮评估。BeliefTrack诊断三种失败：保持失败、更新失败和隔离失败。在多个大语言模型中，原始模型表现出严重的CBM失败，而显式的信念跟踪提示提供的改进有限。相比之下，使用信念状态奖励的强化学习平均将失败率降低了70.9%。进一步的探测揭示了这些失败背后的潜在信念状态动态，而表示级引导在两个任务上将失败率降低了46.1% ootnote{代码即将在https://github.com/zjunlp/CBM发布。}

英文摘要

Long-horizon interactions require language models to manage accumulating information: when to update their state, when to preserve their state, and what to ignore. We study this challenge as \textbf{Contextual Belief Management (CBM)}: maintaining a predicted belief state aligned with formal evidence while isolating task-irrelevant noise. To make CBM measurable, we introduce BeliefTrack, a closed-world benchmark spanning Rule Discovery and Circuit Diagnosis, where a finite belief space and symbolic verifiers enable exact turn-level evaluation. BeliefTrack diagnoses three failures: Failed Stay, Failed Update, and Failed Isolation. Across multiple LLMs, vanilla models exhibit severe CBM failures, while explicit belief-tracking prompts provide limited gains. In contrast, reinforcement learning with belief-state rewards reduces failure rates by 70.9\% on average. Further probing reveals latent belief-state dynamics behind these failures, and representation-level steering reduces failure rates by 46.1\% across two tasks\footnote{Code is coming soon at https://github.com/zjunlp/CBM.

URL PDF HTML ☆

赞 0 踩 0

2605.30218 2026-05-29 cs.LG cs.PF 版本更新

MarginGate: Sparse Margin-Triggered Verification for Batch-Invariant LLM Inference

MarginGate: 用于批量不变LLM推理的稀疏边际触发验证

Kexin Chu, Yang Zhou, Wei Zhang

发表机构 * University of Connecticut（康涅狄格大学）； UC Davis（加州大学戴维斯分校）

AI总结提出MarginGate方法，利用logit边际稀疏触发验证，仅对低边际步骤进行验证并修复，以低成本实现批量不变LLM推理的确定性解码。

Comments 13 pages, 5 figures, 11 tables

详情

AI中文摘要

零温度BF16 LLM推理通常被认为是可重现的，但同一请求在单独解码或位于较大批次内时可能产生不同的token。现有修复方法使用批量不变算子或LLM-42的逐token验证，即使在大多数步骤稳定时也会产生成本。我们询问验证是否可以仅应用于翻转的token。在五个模型上，批次诱导的token翻转在翻转率基准上是稀疏的：在MATH500上，Llama-3.1-8B在$0.48\%$的同步解码步骤中翻转，所有测试模型在MATH500、GSM8K和HumanEval上的翻转率保持在0.3-1.3%范围内。翻转前K/V扰动保持平坦，而低top-1/top-2 logit边际暴露了大部分翻转风险。MarginGate将这些观察转化为验证器策略：它在高边际步骤上保持BF16解码，仅验证低边际步骤，并通过替换当前K/V列修复确认的不匹配。我们在四个数据集上评估，在MATH500上校准并迁移到GSM8K、SharedGPT和HumanEval。MarginGate在Llama-3.1-8B和Qwen2.5-14B上以18.56%/15.05%的验证器触发率恢复100%序列级确定性解码，相对于始终验证，将LLM-42的延迟增量降低2.23倍/1.99倍。在DSR1-Distill-Qwen-7B上，相同策略在更困难的条件下以49.50%的触发率达到确定性。

英文摘要

Temperature-zero BF16 LLM inference is often treated as reproducible, yet the same request can emit different tokens when decoded alone or inside a larger batch. Existing fixes use batch-invariant operators or LLM-42's per-token verification, incurring cost even when most steps are stable. We ask whether verification can be applied exclusively to flipped tokens. Across five models, batch-induced token flips are sparse on the flip-rate benchmarks: on MATH500, Llama-3.1-8B flips on $0.48\%$ of synchronous decode steps, and all tested models stay within the 0.3-1.3% range on MATH500, GSM8K, and HumanEval. K/V perturbations remain flat before flips, while low top-1/top-2 logit margins expose much of the flip risk. MarginGate turns these observations into a verifier policy: it keeps BF16 decoding on high-margin steps, verifies only low-margin steps, and repairs confirmed mismatches by replacing the current K/V column. We evaluate on four datasets, calibrating on MATH500 and transferring to GSM8K, SharedGPT, and HumanEval. MarginGate restores 100% sequence-level deterministic decoding on Llama-3.1-8B and Qwen2.5-14B with 18.56%/15.05% verifier trigger rates, reducing LLM-42's latency increment by 2.23x/1.99x relative to always-on verification. On DSR1-Distill-Qwen-7B, the same policy reaches determinism in a harder regime at 49.50% triggers.

URL PDF HTML ☆

赞 0 踩 0

2605.30213 2026-05-29 cs.LG 版本更新

Faithful Embeddings of Irregular and Asynchronous Data for Online Log-NCDEs

不规则和异步数据的忠实嵌入用于在线Log-NCDEs

Benjamin Walker, Alexandre Bloch, Lingyi Yang, Sam Morley, Terry Lyons

发表机构 * Mathematical Institute, University of Oxford（牛津大学数学研究所）； Department of Mathematics, Imperial College London（伦敦帝国学院数学系）

AI总结针对不规则和异步数据，提出一种连续且单射的嵌入方法，基于Log-NCDEs实现无需插值的在线计算，并证明其通用性。

Comments 34 pages, 16 figures

详情

AI中文摘要

连续时间模型是不规则和异步数据的自然选择。一个核心设计选择是如何将离散观测嵌入到连续时间中。基于插值和插补的嵌入重构了连续的观测路径，使得模型对重构的选择敏感。我们表明这种重构步骤是不必要的；在温和条件下，只要从数据到输入的嵌入是连续且单射的，模型输入空间上的紧集通用性就会转移到数据空间。受此结果指导，并基于神经控制微分方程（NCDEs）的直线控制路径，我们为Log-NCDEs（一类通用的连续时间模型）引入了一种连续且单射的嵌入。我们的方法将观测记录为增量，并在任意查询区间上组合它们，直接形成对数签名。这提供了区间级别的摘要，而无需先对观测变量进行插值，同时支持在线计算。在合成控制动力学和真实世界时间序列数据集上的实验表明，该表示准确、高效，并且对不规则、异步和稀疏观测具有鲁棒性。

英文摘要

Continuous-time models are a natural choice for irregular and asynchronous data. A central design choice is how to embed discrete observations into continuous time. Interpolation- and imputation-based embeddings reconstruct a continuous observation path, making the model sensitive to the choice of reconstruction. We show that this reconstruction step is unnecessary; under mild conditions, compact-set universality on the model input space transfers to the data space whenever the embedding from data to input is continuous and injective. Guided by this result, and building on the rectilinear control path for Neural Controlled Differential Equations (NCDEs), we introduce a continuous and injective embedding for Log-NCDEs, a universal class of continuous-time models. Our approach records observations as increments and composes them over arbitrary query intervals to directly form log-signatures. This provides interval-level summaries without first interpolating the observed variables, while supporting online computation. Experiments on synthetic controlled dynamics and real-world time-series datasets show that the representation is accurate, efficient, and robust to irregular, asynchronous, and sparse observations.

URL PDF HTML ☆

赞 0 踩 0

2605.30201 2026-05-29 cs.LG cs.AI 版本更新

HPO: Hysteretic Policy Optimization for Stable and Efficient Training under Sparse-Reward Regime

HPO: 稀疏奖励机制下稳定高效训练的滞后策略优化

Mohamed Sana, Nicola Piovesan, Antonio De Domenico, Fadhel Ayed, Haozhe Zhang

发表机构 * Paris Research Center, Huawei Technologies（华为技术有限公司巴黎研究中心）

AI总结针对GRPO在稀疏验证奖励下的失败模式，提出HPO通过降低负优势更新权重和均值长度归一化改进训练，并引入自适应版本A-HPO，在TeleLogs和Countdown实验中显著提升奖励。

详情

AI中文摘要

我们研究了GRPO风格的强化学习在稀疏可验证奖励背景下的一种狭窄但常见的失败模式：早期更新中包含更多具有负优势的响应，而非正优势的响应，而响应级长度归一化将更新幅度与输出长度挂钩。我们提出滞后策略优化（HPO），这是对GRPO的最小修改，它降低了负优势更新的权重，并用均值长度归一化替代了每个响应的长度归一化。我们进一步引入自适应HPO（A-HPO），它基于批次级优势符号统计设置滞后权重，从而消除了调整固定滞后权重的需要。在我们的TeleLogs和Countdown实验中，与GRPO相比，A-HPO提高了每次更新的奖励，在早期稀疏奖励机制中增益最大。在TeleLogs上，A-HPO实现了0.84的最终奖励，比SAPO高5%，比GSPO高11%，比GRPO高15%，同时保持了可比较的响应长度。在Countdown上，A-HPO在1.5B-7B模型的初始和最困难配置中实现了最大增益。关于滞后权重的消融研究表明，A-HPO的增益来自于比仅正更新或完全对称更新更好地平衡正负优势的贡献。

英文摘要

We investigate a narrow but common failure mode of GRPO-style reinforcement learning in the context of sparse verifiable rewards: early updates contain more responses with negative advantages than those with positive advantages, while response-level length normalization ties the magnitude of the update to the length of the output. We propose Hysteretic Policy Optimization (HPO), a minimal modification of GRPO that reduces the weight of negative-advantage updates and replaces per-response length normalization with mean-length normalization. We further introduce Adaptive HPO (A-HPO), which sets the hysteretic weight based on batch-level advantage-sign statistics, thereby removing the need for tuning a fixed hysteretic weight. In our TeleLogs and Countdown experiments, A-HPO improves the reward per update compared to GRPO, with the largest gains in early sparse reward regimes. On TeleLogs, A-HPO achieves a final reward of 0.84, outperforming SAPO by 5%, GSPO by 11%, and GRPO by 15%, while maintaining a comparable response-length. On Countdown, A-HPO achieves the largest gains in initial and most difficult configurations across 1.5B-7B models. Ablation studies on the hysteretic weight show that the gains of A-HPO come from better balancing the contributions of positive and negative advantages compared to positive-only or fully symmetric updates.

URL PDF HTML ☆

赞 0 踩 0

2605.30198 2026-05-29 cs.LG 版本更新

Active Continual Learning with Metaplastic Binary Bayesian Neural Networks

具有可塑性二值贝叶斯神经网络的主动持续学习

Kellian Cottart, Théo Ballet, Djohan Bonnet, Damien Querlioz

发表机构 * Universit \'e Paris-Saclay, CNRS, Centre de Nanosciences et de Nanotechnologies, Palaiseau, France

AI总结针对边缘系统持续学习中的后验饱和与可塑性冻结问题，提出基于有界记忆变分目标的BiMU方法，通过不确定性依赖步长和先验松弛维持非退化后验，实现无缓冲主动查询，在Permuted-MNIST和OpenLORIS-Object上显著减少标签与更新次数。

Comments Accepted at ICML 2026

详情

AI中文摘要

始终在线的边缘系统必须在严格的计算预算下随着条件变化持续学习，并检测不可靠的预测。贝叶斯二值神经网络在此场景中具有吸引力，但均值场伯努利后验可能在长非平稳流上饱和，消除认知不确定性并冻结可塑性。我们提出BiMU，它源于一个有界记忆变分目标，平衡了稳定性、可塑性和遗忘。BiMU结合了数据项与受控松弛向先验，以及不确定性依赖的步长，防止饱和并维持信息性不确定性。这种非退化后验通过蒙特卡洛分歧实现完全在线、无缓冲的主动查询，在类别不平衡下减少标签查询和反向传播更新。BiMU在1000任务Permuted-MNIST上维持学习和强OOD检测，在OpenLORIS-Object上在类别不平衡和特征压缩下，以匹配的准确率实现高达32倍的标签/更新节省。

英文摘要

Always-on edge systems must keep learning as conditions change under tight compute budgets and must detect unreliable predictions. Bayesian binary neural networks are attractive in this setting, but mean-field Bernoulli posteriors can saturate on long non-stationary streams, wiping out epistemic uncertainty and freezing plasticity. We propose BiMU, derived from a bounded-memory variational objective that balances stability, plasticity, and forgetting. BiMU combines a data term with controlled relaxation toward the prior and an uncertainty-dependent step size that prevents saturation and sustains informative uncertainty. This non-degenerate posterior enables fully online, buffer-free active querying via Monte Carlo disagreement, reducing label queries and backpropagation updates under imbalance. BiMU sustains learning and strong OOD detection on 1000-tasks Permuted-MNIST, and on OpenLORIS-Object achieves up to 32$\times$ label/update savings at matched accuracy under class imbalance and feature compression.

URL PDF HTML ☆

赞 0 踩 0

2605.30195 2026-05-29 cond-mat.mtrl-sci cs.AI cs.LG 版本更新

一种用于BATSE伽马射线暴无监督分类的全新无参数聚类算法

Soumita Modak

发表机构 * Department of Statistics, Presidency University（统计系，普雷斯顿大学）

AI总结提出一种完全无参数的聚类算法，对BATSE伽马射线暴样本进行分类，支持双群（短暴与长暴）的合并-坍缩星理论。

详情

AI中文摘要

聚类分析是一种广泛应用的机器学习技术，用于理解伽马射线暴（GRB）群体中存在的模式，以探索其物理来源。目前，尽管采用了最先进的聚类程序进行了多次尝试，但对应可区分群组的聚类数量仍存在争议。这一关键未知参数需要通过直接或间接方式（以其他调优参数的形式）评估，以便通过实施合适的聚类算法在GRB中产生聚类。虽然大多数应用的算法得出了两个物理上可解释的群组（分别以短暴和长暴为主的合并与坍缩星），但其他统计方法违反了这种二元划分。然而，任何额外聚类的物理建立尚未得到确认。因此，我们提出一种新算法，来自一种称为“完全无参数”的不同聚类流派，它以迄今未尝试过的方式对GRB进行分类。该算法从BATSE样本中指示出两个主要群组，即短持续时间和长持续时间爆发，与合并-坍缩星理论兼容。

英文摘要

Cluster analysis is a widely applied machine learning technique to understand the existing patterns in the population of gamma-ray bursts (GRBs), in order to explore their physical sources. In the present scenario, the number of clusters corresponding to differentiable groups is still under conflict, in spite of numerous attempts with the state-of-the-art clustering procedures. This crucial unknown parameter needs to be evaluated, either directly or indirectly in terms of other tuning parameters, to produce the clusters in GRBs through implementation of an appropriate clustering algorithm. While most of the applied algorithms reached two physically explained groups of merger and collapsar predominated by the short and long bursts respectively, other statistical approaches violated this binary partition. However, physical establishment of any additional cluster(s) is not yet confirmed. Therefore, we propose a new algorithm, from a different stream of clustering referred to as `completely parameter-free', which carries out the classification of GRBs in a manner that has not been tried so far. It indicates two main groups, of short and long duration bursts from the BATSE sample, compatible with the merger-collapsar theory.

URL PDF HTML ☆

赞 0 踩 0

2605.30170 2026-05-29 cs.MM cs.CV cs.LG 版本更新

Unveiling the Visual Counting Bottleneck in Vision-Language Models

揭示视觉语言模型中的视觉计数瓶颈

Xingzhou Pang, Yifan Hou, Junling Wang, Mrinmaya Sachan

发表机构 * Department of Computer Science, ETH Zürich（苏黎世联邦理工学院计算机科学系）

AI总结通过分解视觉计数为三个认知阶段，发现视觉语言模型在符号映射阶段失败，提出断裂数量假说：模型学习到分离的模态特定统计流形，无法实现跨模态对齐。

Comments ICML 2026

详情

AI中文摘要

尽管大型视觉语言模型（VLM）在插值任务上表现出色，但在系统泛化方面，尤其是视觉计数任务中，会遭遇灾难性失败。本文通过将视觉计数分解为三个认知阶段：视觉个体化、数量感知和符号映射，来研究这一外推瓶颈。利用合成围棋棋盘和线性探针，我们证明视觉骨干网络在进入外推区域后仍能保持稳健、线性可分离的数量表示，排除了感知失败的可能性。此外，模型保留了潜在的数量感知能力，能够成功对无法枚举的数量进行比较推理。我们将崩溃定位在符号映射阶段，即模型无法将有效的视觉数量投影到符号标记上。我们的发现支持断裂数量假说：VLM未能获得通用数字空间，而是学习了不相交的、模态特定的统计流形，这阻止了对未见数量的跨模态对齐。在最新基础模型上的验证结果表明，弥合这一差距需要引入强制统一表示的归纳先验，因为仅靠数据扩展是不够的。

英文摘要

While Large Vision-Language Models (VLMs) excel at interpolation, they suffer catastrophic failures in systematic generalization, most notably in visual counting. In this work, we investigate this extrapolation bottleneck by deconstructing visual counting into three cognitive stages: visual individuation, magnitude awareness, and symbolic mapping. Using synthetic Go boards and linear probes, we demonstrate that visual backbones maintain robust, linearly separable representations of quantity well into the extrapolation regime, ruling out perceptual failure. Furthermore, models retain latent magnitude awareness, successfully performing comparative reasoning on quantities they fail to enumerate. We pinpoint the collapse to the symbolic mapping stage, where the model fails to project valid visual magnitudes onto symbolic tokens. Our findings support a frac tured magnitude hypothesis: VLMs fail to acquire a universal number space, instead learning disjoint, modality-specific statistical manifolds that prevent cross-modal grounding for unseen quantities. Validated on the state-of-the-art foundation model, our results suggest that bridging this gap requires inductive priors enforcing unified representations, as data scaling alone is insufficient.

URL PDF HTML ☆

赞 0 踩 0

2605.30167 2026-05-29 stat.ML cs.CV cs.LG stat.AP 版本更新

Visual Spatial Learning: Single-Field Spatial Interpolation Using Convolutional Neural Networks

视觉空间学习：使用卷积神经网络的单场空间插值

Daniel Tinoco, Raquel Menezes, Carlos Baquero, Alexandra Silva

发表机构 * Centro de Matemática (CMAT), Universidade do Minho（数学中心（CMAT），明霍大学）； DEI-FEUP & INESC TEC, Universidade do Porto（FEUP-DEI与INESC TEC，波尔图大学）； Instituto Português do Mar e da Atmosfera, I. P. (IPMA, I. P.), Lisboa, Portugal（葡萄牙海洋与大气研究所（IPMA, I. P.），里斯本，葡萄牙）； Centro de Ciências do Mar e do Ambiente (MARE), Évora, Portugal（海洋与环境科学中心（MARE），埃维拉，葡萄牙）

AI总结提出基于卷积神经网络（CNN）的架构，直接从单次部分观测场学习空间插值，无需外部数据或先验场，作为克里金法的替代方案。

Comments 53 pages, 10 figures

详情

AI中文摘要

从稀疏观测中预测完整的空间相关场是空间统计和环境建模中的一个基本挑战。经典的插值方法如克里金法依赖于高斯过程假设和变异函数分析，这可能会限制其在非平稳环境中的有效性，并且需要大量的领域专业知识。在这项工作中，我们利用基于卷积神经网络（CNN）的架构进行空间插值，该架构在单个部分观测场上进行训练和应用，无需访问外部数据或先验场。模型直接在观测位置进行监督，并学习在用户定义的网格上预测未观测点的值。与克里金法不同，我们的方法不需要显式的协方差建模或变异函数估计，并且可以以数据驱动的方式灵活捕捉局部空间模式。这项工作展示了CNN在稀疏监督下进行单实例空间插值的潜力，为经典地统计方法提供了实用的替代方案，并将CNN的应用扩展到新的问题领域。

英文摘要

Predicting a complete spatially correlated field from sparse observations is a fundamental challenge in spatial statistics and environmental modelling. Classical interpolation methods such as Kriging rely on Gaussian process assumptions and variography, which can limit their effectiveness in non-stationary settings and require substantial domain expertise. In this work, we leverage an architecture based on convolutional neural networks (CNNs) for spatial interpolation that is trained and applied on a single partially observed field, without access to external data or prior fields. The model is supervised directly on the observed locations and learns to predict values at unobserved points on the user defined grid. Unlike Kriging, our method does not require explicit covariance modelling or variogram estimation, and it can flexibly capture local spatial patterns in a data-driven manner. This work demonstrates the potential of CNNs for single-instance spatial interpolation under sparse supervision, offering a practical alternative to classical geostatistical methods, and extending the use of CNNs to a new problem domain.

URL PDF HTML ☆

赞 0 踩 0

2605.30162 2026-05-29 cs.AI cs.CR cs.LG 版本更新

BioRefusalAudit: Auditing Biosecurity Refusal Depth Using General and Domain-Fine-Tuned Sparse Autoencoders

BioRefusalAudit: 使用通用和领域微调稀疏自编码器审计生物安全拒绝深度

Caleb DeLeeuw

发表机构 * Independent researcher（独立研究者）； Apart Research AIxBio Sprint

AI总结本文提出BioRefusalAudit方法，通过行为测试和内部稀疏自编码器特征分析，评估语言模型在生物安全场景下的拒绝一致性，发现模型存在结构脆弱性、过度拒绝和架构差异。

Comments 21 pages, 2 figures, 3 tables. Apart Research AIxBio Sprint hackathon paper, April 2026 (Track 3: AI Biosecurity Tools). Code, eval set, and SAEs: github.com/SolshineCode/Deleeuw-AI-x-Bio-hackathon. Reviewer feedback: apartresearch.com/project/biorefusalaudit-auditing-biosecurity-refusal-depth-using-general-and-domainfinetuned-sparse-autoencoders-1fyk

详情

AI中文摘要

语言模型的生物安全评估通常询问模型是否产生危险输出。本文提出一个补充性问题：当模型拒绝时，该拒绝在结构上是否稳健，还是在提示框架、格式或输出长度的适度变化下消失？在五种架构中，没有模型能清晰区分良性查询和危险查询。Gemma 2 2B-IT 在75个提示中从未真正拒绝，对每个接近危险的查询都含糊其辞。Gemma 4 E2B-IT 在聊天模板格式下拒绝了65/75个提示，无格式时拒绝了0/75。两个Gemma模型在80词限制下都降至0%拒绝率。Qwen 2.5 1.5B 和 Phi-3-mini 过度拒绝，将83-87%的良性生物学标记为危险。Llama 3.2 1B 显示出唯一有意义的层级梯度（61点跨度）。为了探究过度拒绝的驱动因素，我们测试了一组附表I但无生物毒性的化合物（特别是裸盖菇素栽培，具有FDA突破性疗法资格）。一些模型对这些化合物的拒绝率超过了真正危险的生物学，表明拒绝追踪法律和文化显著性而非CBRN危险。为了测量内部层面，我们引入了一个分歧分数D，比较模型的表面响应标签与其内部稀疏自编码器（SAE）特征激活。在Gemma 2 2B-IT（Gemma Scope 1）和Gemma 4 E2B-IT（作者训练的bio SAE）上计算了完整的D。发布了两个微调的Gemma 2领域SAE。在Gemma 4上，服从和拒绝响应之间差距为0.647点，零重叠（n=75），尽管这是初步的，目录狭窄，样本内校准，且仅覆盖Gemma家族的SAE。在一个黑客马拉松周末使用消费级硬件（GTX 1650 Ti Max-Q，以及用于SAE训练的Colab T4）构建，这一初步证据表明，激活级审计可能揭示行为评估无法发现的失败模式，且各架构间存在显著差异。

英文摘要

Biosecurity evaluations of language models typically ask whether models produce hazardous output. This paper asks a complementary question: when a model refuses, is that refusal structurally sound, or does it disappear under modest changes to prompt framing, formatting, or output length? Across five architectures, no model cleanly discriminated benign from hazard. Gemma 2 2B-IT never genuinely refused across 75 prompts, hedging on every hazard-adjacent query. Gemma 4 E2B-IT refused 65/75 prompts with chat-template formatting and 0/75 without it. Both Gemma models collapsed to 0% under an 80-token cap. Qwen 2.5 1.5B and Phi-3-mini over-refused, flagging 83-87% of benign biology as hazardous. Llama 3.2 1B showed the only meaningful tier gradient (61-point spread). To probe what drives such over-refusal, we tested a panel of Schedule I but biologically non-toxic compounds (notably psilocybin cultivation, with FDA Breakthrough Therapy status). Some models refused these at rates exceeding genuinely hazardous biology, suggesting refusal tracks legality and cultural salience over CBRN hazard. To measure the internal side, we introduce a divergence score D comparing a model's surface response label to its internal sparse autoencoder (SAE) feature activations. Full D was computed on Gemma 2 2B-IT (Gemma Scope 1) and Gemma 4 E2B-IT (author-trained bio SAE). Two fine-tuned Gemma 2 domain SAEs were released. On Gemma 4, comply and refuse responses separated by a 0.647-point gap with zero overlap (n=75), though this is preliminary, with a narrow catalog, within-sample calibration, and Gemma-family-only SAE coverage. Built over one hackathon weekend on consumer hardware (GTX 1650 Ti Max-Q, plus Colab T4 for SAE training), this preliminary evidence suggests activation-level auditing may surface failure modes invisible to behavioral evaluation, with substantial variation across architectures.

URL PDF HTML ☆

赞 0 踩 0

2605.30160 2026-05-29 cs.LG cs.AI 版本更新

On Distributional Reinforcement Learning in Chaotic Dynamical Systems

混沌动力系统中的分布强化学习

James Rudd-Jones, Mirco Musolesi, María Pérez-Ortiz

发表机构 * Centre for Artificial Intelligence（人工智能中心）； Department of Computer Science（计算机科学系）； University College London（伦敦大学学院）； University of Bologna（博洛尼亚大学）

AI总结针对混沌动力系统中强化学习面临的高方差和梯度病态问题，提出分布强化学习通过1-Wasserstein度量下的分布贝尔曼目标实现更稳定的优化。

详情

AI中文摘要

混沌动力系统对强化学习（RL）提出了根本性挑战：对初始条件的指数敏感性导致高方差的引导目标和病态的梯度更新。混沌动力学出现在科学和工程领域的各个方面，从流体流动和气候系统到多智能体系统，在这些领域中，可靠的学习是非常可取的。标准RL方法通过标量值函数优化期望回报，隐式地对发散轨迹进行平均，并将轨迹层面的不稳定性与学习目标纠缠在一起。我们证明，在温和的统计稳定性假设下，当在$1$-Wasserstein度量下测量时，回报分布比单个轨迹更规则地演化，从而产生更平滑的分布贝尔曼目标。通过将优化与该度量层面结构对齐，分布RL提供了更好的条件学习。我们为混沌系统中分布方法的优势以及混沌下RL目标的几何结构提供了原则性的解释。

英文摘要

Chaotic dynamical systems pose a fundamental challenge for Reinforcement Learning (RL): exponential sensitivity to initial conditions induces high-variance bootstrap targets and poorly conditioned gradient updates. Chaotic dynamics arise across scientific and engineering domains, from fluid flows and climate systems to multi-agent systems, where reliable learning is highly desirable. Standard RL methods optimise expected returns through scalar value functions, implicitly averaging over diverging trajectories and entangling trajectory level instability with the learning objective. We show that under mild statistical stability assumptions, the return distribution evolves more regularly than individual trajectories when measured under the $1$-Wasserstein metric, yielding a smoother distributional Bellman objective. By aligning optimisation with this measure level structure, distributional RL provides better conditioned learning. We offer a principled explanation for the advantages of distributional methods in chaotic systems and the geometries of RL objectives under chaos.

URL PDF HTML ☆

赞 0 踩 0

2605.30154 2026-05-29 cs.LG 版本更新

RL2ML: Finite-Rollout Surrogate Objectives from Reinforcement Learning to Maximum Likelihood

RL2ML: 从强化学习到最大似然的有限rollout替代目标

Yifu Zheng

发表机构 * University of Southern California（南加州大学）

AI总结本文提出RL2ML系列有限rollout替代目标，具有闭式无偏梯度估计，连接标准强化学习、类最大似然训练及超越最大似然目标，并揭示群体级更新尺度相变，将剩余自由度转化为一维优化问题。

详情

AI中文摘要

基于正确性的可验证奖励强化学习（RLVR）通过采样输出的二元反馈训练语言模型，但期望优化的目标与有限rollout组引起的随机更新几何常被混淆。本文开发了RL2ML，一系列具有闭式、精确无偏梯度估计的有限rollout替代目标。该系列在固定rollout预算下连续连接标准强化学习、类最大似然训练及超越最大似然目标，同时保持估计器-目标对齐。我们引入群体级更新尺度来表征rollout组在观察到经验成功计数后如何重新加权，揭示了仅通过总体级目标符号隐藏的亚临界-超临界更新尺度相变。基于这一区分，校准的度量增益分析和精确方差分解表明，最佳替代目标的选择既不由接近最大似然决定，也不仅由总体级权重决定，而是取决于评估度量、局部敏感性和估计器方差。因此，替代目标系列中的剩余自由度可以表述为一维优化问题，而非视为无约束超参数。

英文摘要

Correctness-based Reinforcement Learning with Verifiable Rewards (RLVR) trains language models from binary feedback on sampled outputs, but the objective optimized in expectation and the stochastic update geometry induced by finite rollout groups are often conflated. This paper develops RL2ML, a family of finite-rollout surrogate objectives with a closed-form, exactly unbiased gradient estimator. The family continuously connects standard reinforcement learning, maximum-likelihood-like training, and beyond-maximum-likelihood objectives while preserving estimator-objective alignment under a fixed rollout budget. We introduce the group-level update scale to characterize how a rollout group is reweighted after its empirical success count is observed, revealing a subcritical-supercritical update-scale transition that is hidden by population-level objective notation alone. Building on this distinction, calibrated metric-gain analysis and exact variance decomposition show that the best choice of surrogate objective is determined neither by proximity to maximum likelihood nor by the population-level weight alone. Instead, it depends jointly on the evaluation metric, local sensitivity, and estimator variance. The remaining degree of freedom in the surrogate objective family can therefore be formulated as a one-dimensional optimization problem rather than treated as an unconstrained hyperparameter.

URL PDF HTML ☆

赞 0 踩 0

2605.30153 2026-05-29 stat.ML cs.IT cs.LG math.IT math.ST stat.TH 版本更新

Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions

扩散模型在学习低维多模态分布时具有统计最优性

Jingda Wu, Changxiao Cai

发表机构 * Department of Industrial and Operations Engineering, University of Michigan, Ann Arbor, USA（工业与运营工程系，密歇根大学，安娜堡，美国）

AI总结本文证明扩散模型在学习支撑在低维子空间并集上的分布时，样本复杂度仅依赖于内在维度，达到近最优的1-Wasserstein误差率，无需光滑性或有界密度假设。

Comments accepted to ICML 2026

详情

AI中文摘要

基于分数的扩散模型在学习高维分布，特别是那些具有低维和多模态结构的分布方面，已经展现出显著的实证成功。然而，对其统计效率的理论理解仍然有限。现有理论通常依赖于强正则性假设，例如一致有界密度或全局光滑的分数函数，这些假设无法捕捉此类内在结构。在这项工作中，我们研究了扩散模型在学习支撑在低维子空间并集上的分布时的样本复杂度。假设每个子空间内的数据分布是次高斯的，我们证明扩散模型最多需要$\widetilde{O}(\varepsilon^{-k \vee 2})$个样本即可在1-Wasserstein距离上达到$\varepsilon$误差，其中$k$是内在维度。这一近最优的收敛速率仅依赖于内在维度，并显著改进了先前遭受维度灾难的理论保证。值得注意的是，我们的分析适用于广泛的分布，无需施加光滑性、有界密度或对数凹性假设。总体而言，我们的结果表明，扩散模型能够统计适应内在低维结构，同时自然容纳多模态数据，为其在复杂高维学习任务中的成功提供了严格的理论依据。

英文摘要

Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of their statistical efficiency remains limited. Existing theories typically rely on strong regularity assumptions, such as uniformly bounded densities or globally smooth score functions, which fail to capture such intrinsic structures. In this work, we study the sample complexity of diffusion models for learning distributions supported on a union of low-dimensional subspaces. Assuming that the data distribution within each subspace is subgaussian, we show that diffusion models require at most $\widetilde{O}(\varepsilon^{-k \vee 2})$ samples to achieve $\varepsilon$ error in 1-Wasserstein distance, where $k$ is the intrinsic dimension. This near-optimal convergence rate depends only on the intrinsic dimension and significantly improves upon prior theoretical guarantees that suffer from the curse of dimensionality. Notably, our analysis applies to a broad collection of distributions without imposing smoothness, bounded-density, or log-concavity assumptions. Overall, our results show that diffusion models can statistically adapt to intrinsic low-dimensional structure while naturally accommodating multi-modal data, offering a rigorous theoretical justification for their success in complex high-dimensional learning tasks.

URL PDF HTML ☆

赞 0 踩 0

2605.30148 2026-05-29 cs.LG cs.AI 版本更新

Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies

克服LLM微调中的遗忘：进化策略方法

Kajetan Schweighofer, Conor F. Hayes, Roberto Dailey, Risto Miikkulainen, Xin Qiu

发表机构 * Cognizant AI Lab（Cognizant AI实验室）； UT Austin（得克萨斯大学奥斯汀分校）

AI总结本文发现进化策略微调中的先前任务遗忘实为性能漂移且可恢复，并引入锚定权重衰减（AWD）正则化技术有效稳定先前任务性能，表明遗忘可避免，使ES成为LLM持续学习的可行方法。

详情

AI中文摘要

进化策略（ES）最近作为强化学习（RL）在大语言模型（LLM）微调中的竞争性替代方案出现，通过简单性、可扩展性和仅推理训练提供优势。然而，近期研究表明，在新任务上进行ES微调可能导致对先前任务的遗忘。首先，本文表明先前任务遗忘（1）更好地被描述为性能漂移而非不可逆遗忘，在ES训练过程中先前任务性能通常会恢复；（2）并非ES特有的失败模式，使用RL方法微调时也可能出现。其次，本文分析了这种漂移何时以及为何出现，强调了其对ES训练动态的依赖性，特别是权重空间中弱约束方向上的随机游走行为。第三，基于这些见解，本文引入了锚定权重衰减（AWD）作为一种参数空间正则化技术，将优化约束向初始模型参数。AWD在保持目标任务性能的同时有效稳定了先前任务性能，以更低的计算成本实现了与大型ES种群规模相当的优势。因此，与先前观点相反，本文表明ES下的先前任务遗忘在很大程度上是可以避免的，使ES成为LLM持续学习中一种有前景的方法。

英文摘要

Evolution Strategies (ES) has recently emerged as a competitive alternative to reinforcement learning (RL) for large language model (LLM) fine-tuning, offering advantages through simplicity, scalability, and inference-only training. However, recent work suggests that ES fine-tuning on new tasks may induce forgetting of prior tasks. First, this paper shows that prior task forgetting (1) is better characterized as performance drift rather than irreversible forgetting, with prior-task performance often recovering during ES training; and (2) is not a specific failure mode of ES, but can also arise for fine-tuning with RL methods. Second, it analyzes when and why such drift arises, highlighting its dependence on ES training dynamics, particularly random walk behavior in weakly constrained directions of the weight space. Third, based on these insights, it introduces Anchored Weight Decay (AWD) as a parameter-space regularization technique that constrains optimization toward the initial model parameters. AWD effectively stabilizes prior-task performance while preserving target-task performance, achieving benefits comparable to large ES population sizes at much lower computational cost. Thus, contrary to previous beliefs, the paper shows that prior-task forgetting under ES is largely avoidable, positioning ES as a promising approach for continual learning in LLMs.

URL PDF HTML ☆

赞 0 踩 0

2605.30135 2026-05-29 cs.LG cs.AI 版本更新

Q-ANCHOR: 基于ZNE引导校正的量子联邦学习

Hoang M. Ngo, Quan Nguyen, Wanli Xing, My T. Thai

发表机构 * Department of Computer & Information Science & Engineering（计算机与信息科学与工程系）； University of Florida（佛罗里达大学）； Frost Institute for Data Science and Computing（数据科学与计算弗罗斯特研究所）； University of Miami（迈阿密大学）

AI总结针对量子联邦学习中非独立同分布数据导致的客户端漂移和量子硬件噪声导致的硬件偏差，提出Q-ANCHOR聚合架构，通过零噪声外推锚定服务器更新并应用有状态客户端校正，理论证明可同时减轻两类漂移，实验显示训练更稳定。

详情

AI中文摘要

量子联邦学习（QFL）提供了一个有前景的框架，可以在保持数据严格本地化的同时，跨分布式客户端训练量子模型。由于其简单性和低通信开销，联邦平均（FedAvg）是QFL文献中的标准聚合选择。然而，在实际硬件上部署QFL会暴露出严重的双重漂移现象：全局模型同时受到来自非独立同分布数据的客户端漂移和来自噪声量子梯度估计的硬件偏差的干扰。在这项工作中，我们首先分析了FedAvg在这些现实条件下的收敛性，数学上证明了量子硬件偏差会产生标准平均无法纠正的持久误差下限。为了克服这一限制，我们提出了Q-ANCHOR，一种量子感知的联邦聚合架构，该架构通过零噪声外推锚定服务器更新，同时应用有状态客户端校正来抑制客户端漂移和硬件引起的偏差。我们的收敛理论证明，Q-ANCHOR成功减轻了经典客户端漂移，同时积极降低了硬件偏差下限。实验结果表明，Q-ANCHOR实现了比传统FL基线显著更稳定的训练。

英文摘要

Quantum Federated Learning (QFL) offers a promising framework to train quantum models across distributed clients while keeping data strictly local. Due to its simplicity and low communication overhead, Federated Averaging (FedAvg) is the standard aggregation choice in QFL literature. However, deploying QFL on practical hardware exposes a severe double-drift phenomenon: the global model is simultaneously derailed by client drift from non-IID data and hardware bias from noisy quantum gradient estimates. In this work, we first analyze the convergence of FedAvg under these realistic conditions, mathematically demonstrating that quantum hardware bias creates a persistent error floor that standard averaging cannot correct. To overcome this limitation, we propose Q-ANCHOR, a quantum-aware federated aggregation architecture that anchors server updates with zero-noise extrapolation while applying stateful client correction to suppress both client drift and hardware-induced bias. Our convergence theory proves that Q-ANCHOR successfully mitigates classical client drift while actively reducing the hardware-bias floor. Experimental results demonstrate that Q-ANCHOR achieves significantly more stable training than conventional FL baselines.

URL PDF HTML ☆

赞 0 踩 0

2605.30070 2026-05-29 cs.LG cs.AI 版本更新

A Predictive Law for On-Policy Self-Distillation From World Feedback

基于世界反馈的在线自蒸馏预测定律

Tommy He, Jerome Sieber, Matteo Saponati

发表机构 * Open-source models（开源模型）； LiveCodeBench

AI总结本文发现在线自蒸馏（OPSD）中初始师生性能差距与最终性能改进之间存在线性关系，并提出一种预测定律，用于在训练前预测OPSD配置的效果。

详情

AI中文摘要

超越简单的标量奖励，向更丰富的世界反馈迈进，是实现更可扩展的RL后训练的自然路径。在线自蒸馏（OPSD）是一种有前景的最新方法，它使用任意反馈作为学习信号，但其与GRPO等成熟方法相比的可靠性仍不清楚。我们发现了OPSD中初始学生-教师性能差距与最终性能改进之间存在惊人的一致线性相关性。这种关系在不同上下文类型和模型家族中均成立，为预测OPSD配置的结果提供了一种强大的预测定律，而无需运行完整的训练过程。有趣的是，我们表明这种线性可预测性随模型规模成立，这为具有更强上下文学习能力的大型模型上新的经验缩放定律提供了潜在基础。本质上，我们的发现表明，OPSD性能可以在训练前进行预测和调整，为将世界反馈作为后训练流水线的一等组件提供了一种原则性方法。

英文摘要

Moving beyond simple scalar rewards toward richer world feedback is a natural path to more scalable RL post-training. On-policy self-distillation (OPSD) is a promising recent approach that uses arbitrary feedback as learning signal, yet its reliability compared to established methods, such as GRPO, remains unclear. We identify a strikingly consistent linear correlation between the initial student-self-teacher performance gap and the final performance improvement in OPSD. This relationship holds across context types and model families, providing a powerful predictive law for anticipating the outcome of an OPSD configuration without running the full training procedure. Interestingly, we show that this linear predictability holds with model scale, suggesting a potential basis for new empirical scaling laws on larger models with stronger in-context learning capabilities. In essence, our findings show that OPSD performance can be predicted and tuned before training, offering a principled way to incorporate world feedback as a first-class component of the post-training pipeline.

URL PDF HTML ☆

赞 0 踩 0

2605.30059 2026-05-29 cs.LG cond-mat.stat-mech stat.ML 版本更新

Ridge Regression from Poisson Resetting: A Renewal Perspective on Spectral Regularization

泊松重置的岭回归：谱正则化的更新视角

Petar Jolakoski

发表机构 * manu.edu.mk

AI总结通过非平衡统计物理中的随机重置与统计学习中的岭正则化建立联系，证明线性梯度流下以速率r重置到原点产生的稳态均值即为岭估计，并推广到一般更新重置律以生成替代谱滤波器。

详情

AI中文摘要

我们将非平衡统计物理中的随机重置与统计学习中的岭正则化联系起来。对于线性梯度流，以速率$r$重置到原点产生稳态均值$(X^\top X+rI)^{-1}X^\top y$，这正是惩罚项$\lambda=r$的岭估计。这利用了岭回归与梯度流指数时间平均之间已知的拉普拉斯变换关系，其中指数时间现在被解释为与泊松重置相关的稳态年龄。然后我们将这一恒等式推广到一般更新重置律：指数重置时间分布是唯一的更新律，其稳态均值在每个特征方向上作为精确的滤波器恒等式对每个正曲率重现标量岭，而非指数更新律则生成替代的谱滤波器。在波动层面，我们研究了一个具有恒定扩散的独立加性奥恩斯坦-乌伦贝克扩展，解释为一种风格化的SGD近似。在这种设定下，等式仅在均值层面成立，因为重置过程由于累积的OU噪声和重置时序方差具有非零稳态协方差，而确定性岭是一个具有相同中心的固定估计量。风格化实验直接比较了确定性更新诱导的滤波器，并说明了非指数重置时间律诱导的滤波器何时可能在预测上与岭不同。关于稳态均值和诱导谱滤波器的结果是在二次目标上具有各向同性重置的连续时间梯度流下建立的；协方差和风险公式额外假设具有状态独立协方差的加性噪声。

英文摘要

We connect stochastic resetting from non-equilibrium statistical physics with ridge regularization in statistical learning. For linear gradient flow, resetting to the origin at rate $r$ produces stationary mean $(X^\top X+rI)^{-1}X^\top y$, exactly the ridge estimator with penalty $λ=r$. This uses the known Laplace-transform relationship between ridge regression and exponential-time averaging of gradient flow, with the exponential time now interpreted as the stationary age associated with Poisson resetting. We then extend this identity to general renewal reset laws: the exponential reset time distribution is the unique renewal law whose stationary mean reproduces scalar ridge in every eigendirection as an exact filter identity for every positive curvature, while non-exponential renewal laws generate alternative spectral filters. At the fluctuation level, we study a separate additive Ornstein-Uhlenbeck extension with constant diffusion, interpreted as a stylized SGD approximation. In this setting, the equality holds only at the level of the mean, since the reset process has a nonzero stationary covariance from accumulated OU noise and reset-timing variance, whereas deterministic ridge is a fixed estimator with the same center. Stylized experiments compare the deterministic renewal-induced filters directly and illustrate when filters induced by non-exponential reset-time laws can differ predictively from ridge. The results for the stationary mean and the induced spectral filters are established for continuous-time gradient flow with isotropic resetting on quadratic objectives; the covariance and risk formulas additionally assume additive noise with state-independent covariance.

URL PDF HTML ☆

赞 0 踩 0

2605.30056 2026-05-29 cs.RO cs.LG 版本更新

Sample-Efficient Diffusion-based Reinforcement Learning with Critic Guidance

基于评论家引导的样本高效扩散强化学习

Shutong Ding, Zejia Zhong, Zhongyi Wang, Ke Hu, Bikang Pan, Jingya Wang, Ye Shi

发表机构 * ShanghaiTech University（上海科技大学）

AI总结针对扩散策略在强化学习中探索与利用不平衡的问题，提出评论家引导的扩散策略优化（CGPO），通过无训练引导技术平衡探索与利用，在MuJoCo和Franka机器人任务上取得最优性能。

Comments accepted by ICML2026

详情

AI中文摘要

近年来，强化学习（RL）通过利用扩散策略的多模态性和探索能力取得了巨大成功。在这些方法中，一个代表性分支专注于基于采样的策略优化。这种设计使得扩散模型在训练初期具有更好的探索能力，但在Q值信息的利用上不足，导致策略收敛缓慢。另一个分支关注基于梯度的策略优化，该方法充分利用Q函数的梯度，但容易退化为低多样性的单峰策略。为了解决这个问题，我们提出了CGPO（评论家引导的扩散策略优化），通过将无训练引导技术集成到扩散策略的去噪过程中，有效平衡探索与利用。具体而言，CGPO将动作生成引导至评论家网络定义的高价值区域，并将引导后的动作作为回归目标。通过这种方式，CGPO减少了获取高质量动作所需的时间，并通过更好的探索-利用权衡提高了最终性能。我们在5个MuJoCo运动任务上验证了CGPO的有效性，与现有的基于扩散的RL方法相比，CGPO达到了最先进的性能。值得注意的是，CGPO是首次成功将扩散策略应用于真实世界RL的方法，在Franka机器人臂抓取任务上表现出优越性能。我们的官方页面发布在https://dingsht.tech/cgpo-webpage。

英文摘要

Recent advances in reinforcement learning (RL) have achieved great successes by leveraging the multimodality and exploration capability of diffusion policies. Among these approaches, one representative branch focuses on the sampling-based policy optimization. This design enables better exploration capability of the diffusion model, particularly at the beginning of training, but suffer from low exploitation in Q-value information, resulting in a slow policy convergence. Another branch pays attention to gradient-based policy optimization, which sufficiently exploits the gradient of the Q function yet tends to collapse into a unimodal policy with low diversity. To address this issue, we propose CGPO, \textbf{C}ritic-\textbf{G}uided diffusion \textbf{P}olicy \textbf{O}ptimization, which effectively balances exploration and exploitation with the training-free guidance technique integrated into the denoising process of diffusion policy. Concretely, CGPO steers action generation toward high-value regions defined by the critic network and uses the guided actions as regression objectives. In this manner, CGPO reduces the time required to obtain high-quality actions and improves final performance with better balance between the exploration-exploitation tradeoff. We validate the effectiveness of CGPO on 5 MuJoCo locomotion tasks, and CGPO achieves state-of-the-art performance compared with existing diffusion-based RL methods. Notably, CGPO is the first success to incorporate diffusion policy into real-world RL, with its superior performance on Franka robot arm grasping tasks. Our official page is released at https://dingsht.tech/cgpo-webpage.

URL PDF HTML ☆

赞 0 踩 0

2605.30046 2026-05-29 cs.LG cs.AI 版本更新

Masked Diffusion Modeling for Anomaly Detection

掩码扩散建模用于异常检测

Lixing Zhang, Yuchen Liang, Liyan Xie

发表机构 * University of Minnesota（明尼苏达大学）； Ohio State University（俄亥俄州立大学）

AI总结提出基于掩码扩散模型的MaskDiff-AD方法，通过重建随机掩码坐标的难度构建异常分数，在分类、混合类型和离散序列数据上实现高效异常检测。

详情

AI中文摘要

异常检测旨在识别偏离名义数据分布的样本，是许多安全关键应用的核心。然而，针对分类、混合类型和离散序列数据开发有效的异常检测方法仍然具有挑战性且相对未被充分探索。掩码扩散模型通过学习从剩余可见上下文中恢复掩码值，为建模此类数据提供了一种自然的方式。在本文中，我们提出了用于异常检测的掩码扩散（MaskDiff-AD），一种基于掩码扩散模型的前向方法，仅在名义数据上训练。给定测试样本，MaskDiff-AD从随机掩码坐标的重建难度构建异常分数，产生一个直接作用于离散状态空间且避免反向时间采样的内容敏感分数。我们还开发了MaskDiff-AD的非参数变体，并通过在固定检测阈值下表征I型和II型错误提供了理论保证。在来自ADBench和UADAD的十四个分类和混合类型表格数据集，以及来自NLP-ADBench的四个文本异常检测数据集上的实验表明，MaskDiff-AD相对于经典、基于扩散以及最近的表格/文本异常检测基线取得了有竞争力的性能。值得注意的是，MaskDiff-AD达到了最佳总体平均排名，优于所有十二种表格基线方法。

英文摘要

Anomaly detection aims to identify samples that deviate from the nominal data distribution and is central to many safety-critical applications. However, developing effective anomaly detection methods for categorical, mixed-type, and discrete sequence data remains challenging and relatively underexplored. Masked diffusion models provide a natural way to model such data by learning to recover masked values from the remaining visible context. In this paper, we propose Masked Diffusion for Anomaly Detection (MaskDiff-AD), a forward-only method based on masked diffusion models trained only on nominal data. Given a test sample, MaskDiff-AD constructs anomaly scores from the difficulty of reconstructing randomly masked coordinates, yielding a content-sensitive score that operates directly on discrete state spaces while avoiding reverse-time sampling. We also develop a non-parametric variant of MaskDiff-AD and provide theoretical guarantees by characterizing Type-I and Type-II errors under a fixed detection threshold. Experiments on fourteen categorical and mixed-type tabular datasets from ADBench and UADAD, as well as four text anomaly detection datasets from NLP-ADBench, show that MaskDiff-AD achieves competitive performance against classical, diffusion-based, and recent tabular/text anomaly detection baselines. Notably, MaskDiff-AD achieves the best overall average rank, outperforming all twelve tabular baseline methods.

URL PDF HTML ☆

赞 0 踩 0

2605.30038 2026-05-29 cs.LG cs.AI cs.CV 版本更新

Alignment-Guided Score Matching for Text-to-Image Alignment in Diffusion Models

对齐引导的分数匹配用于扩散模型中的文本到图像对齐

Jaa-Yeon Lee, Yeobin Hong, Taesung Kwon, Jong Chul Ye

发表机构 * Graduate School of AI, KAIST, South Korea（韩国高级人工智能研究生院）

AI总结提出一种轻量级、无奖励的后训练方法，通过将对比对齐引导直接整合到扩散模型的分数匹配目标中，以解决文本-图像对齐中的过度惩罚和计数错误问题。

Comments ICML 2026, Project page: https://jaayeon.github.io/AGSM

详情

AI中文摘要

扩散模型生成高度逼真的图像，但通常难以实现精确的文本-图像对齐。虽然最近的后训练方法使用外部奖励或人类偏好信号改善对齐，但其性能严重依赖奖励质量，且不直接解决扩散过程中的对齐问题。最近的无奖励方法如SoftREPA表明，通过对比学习优化软文本令牌可以有效改善文本-图像表示对齐，优于标准参数高效微调基线。然而，对比公式可能过度惩罚负对，表现为典型的失败案例，如过度计数和重复。为解决此问题，我们提出一种轻量级、无奖励的后训练方法，通过将对比对齐引导直接整合到扩散模型的分数匹配目标中来细化软令牌。通过在分数级别分配对齐方向，我们的方法缓解了这些限制，并产生更连贯和语义忠实的生成。实验表明，我们的方法与SoftREPA相当，同时显著改善了其失败案例，在GenEval基准上计数准确性提高了超过35%。我们的方法可无缝应用于现有扩散骨干网络（SD1.5、SDXL和SD3），并与现有的基于RL的扩散后训练方法互补。项目页面：https://jaayeon.github.io/AGSM

英文摘要

Diffusion models generate highly realistic images but often struggle with precise text-image alignment. While recent post-training methods improve alignment using external rewards or human preference signals, their performance heavily depends on reward quality and does not directly address alignment within the diffusion process itself. Recent reward-free approaches such as SoftREPA demonstrate that optimizing soft text tokens via contrastive learning can effectively improve text-image representation alignment, outperforming standard parameter-efficient fine-tuning baselines. However, the contrastive formulation can excessively penalize negative pairs, which manifests as characteristic failure cases such as over-counting and repetition. To address this issue, we propose a lightweight, reward-free post-training method that refines soft tokens by integrating contrastive alignment guidance directly into the score-matching objective of diffusion models. By assigning alignment directions at the score level, our approach mitigates these limitations and yields more coherent and semantically faithful generations. Experiments show that our method matches SoftREPA while substantially improving its failure cases, achieving over 35% improvement in counting accuracy on the GenEval benchmark. Our method is seamlessly applicable to existing diffusion backbones (SD1.5, SDXL, and SD3), and is complementary to existing RL-based diffusion post-training methods. Project page: https://jaayeon.github.io/AGSM

URL PDF HTML ☆

赞 0 踩 0

2605.30015 2026-05-29 cs.LG cs.AI 版本更新

大型语言模型的推理系统指纹识别

Anna Wimbauer, Jonas Möller, Erik Imgrund, Konrad Rieck

发表机构 * BIFOLD & TU Berlin（BIFOLD与柏林技术大学）

AI总结本文提出一种通过分析LLM的提示-响应行为来识别推理系统组件（如推理引擎、注意力后端和硬件平台）的指纹方法，并论证了防御该指纹识别的根本困难性。

详情

AI中文摘要

LLM的行为不仅仅取决于模型本身。推理系统的组件，如推理引擎、注意力后端和硬件平台，微妙地影响输入的处理方式。这些组件在实现上存在差异，因此在运行相同模型时，不同系统之间会产生微小的数值偏差。虽然先前的工作已经建立了这种偏差的理论存在性，但其安全影响尚未被探索。在本文中，我们表明这些偏差是特定组件的特征，并传播到可观察的文本输出中，从而将推理系统暴露给任何能够查询模型的方。基于这一观察，我们引入了一种指纹识别方法，通过分析LLM的提示-响应行为来识别推理系统的组件。我们的实证评估表明，即使在LLM以非零温度运行时，推理引擎、注意力后端和底层硬件平台也能被可靠地识别。我们证明，防止指纹识别从根本上来说是困难的，因为它需要消除硬件和软件堆栈之间的数值差异。因此，我们提出了部分缓解措施并讨论了它们的影响。

英文摘要

The behavior of LLMs does not depend solely on the model itself. Components of the inference system, such as the inference engine, attention backend, and hardware platform, subtly influence how inputs are processed. These components differ in their implementations and thereby induce small numerical deviations across systems when running the same model. While prior work has established the theoretical existence of such deviations, their security implications have remained unexplored. In this paper, we show that these deviations are characteristic of specific components and propagate to observable textual outputs, exposing the inference system to any party that can query the model. Building on this observation, we introduce a fingerprinting method that analyzes the prompt-response behavior of LLMs to identify components of the inference system. Our empirical evaluation demonstrates that the inference engine, attention backend, and underlying hardware platform can be identified reliably, even when the LLM is operated at non-zero temperature. We show that preventing fingerprinting is fundamentally hard, as it would require eliminating numerical differences between hardware and software stacks. We therefore propose partial mitigations and discuss their impact.

URL PDF HTML ☆

赞 0 踩 0

2605.29975 2026-05-29 cs.LG eess.SP 版本更新

A Fully Convolutional Approach to Denoising Structural Dynamics Data from X-Ray Photon Correlation Spectroscopy

一种全卷积方法用于X射线光子相关光谱中结构动力学数据的去噪

Nisar Nellikunnummel, Andi Barbour, Lutz Wiegart, Tatiana Konstantinova, Anthony DeGennaro

发表机构 * Amazon（亚马逊）； GE Aerospace Research（通用电气航空航天研究）

AI总结提出全卷积去噪自编码器（FC-DAE），用于去噪X射线光子相关光谱中的双时间强度-强度相关函数，支持任意输入尺寸，在低信噪比条件下恢复复杂动力学特征并保持结构保真度。

详情

AI中文摘要

我们提出了一种全卷积去噪自编码器（FC-DAE），用于去噪X射线光子相关光谱（XPCS）中的双时间强度-强度相关函数（$C_2$）。与通常限制为固定输入尺寸的传统去噪自编码器不同，FC-DAE接受任意维度的输入，同时保留不同动力学范围内的相关结构。该模型使用在NSLS-II光束线收集的实验$C_2$数据进行训练，并应用数据增强来扩展数据集的多样性并减少过拟合。FC-DAE在低信噪比条件下成功恢复复杂的动力学特征，同时保持结构保真度。为了评估重建可靠性，我们采用定量指标来评估结构保真度并识别潜在的模型引入偏差。我们的结果表明，FC-DAE提供了具有高计算效率的鲁棒去噪性能，使得在光子受限和低剂量测量条件下恢复XPCS动力学成为可能。

英文摘要

We present a fully convolutional denoising autoencoder (FC-DAE) for denoising two-time intensity-intensity correlation functions ($C_2$) in X-ray photon correlation spectroscopy (XPCS). Unlike conventional denoising autoencoders that are typically restricted to fixed input sizes, the FC-DAE accepts inputs of arbitrary dimensions while preserving correlation structures across diverse dynamical regimes. The model is trained using experimentally derived $C_2$ data collected at NSLS-II beamlines, with data augmentation applied to expand the diversity of the dataset and reduce overfitting. The FC-DAE successfully recovers intricate dynamical features in low signal-to-noise conditions while maintaining structural fidelity. To assess reconstruction reliability, we employ quantitative metrics to evaluate structural fidelity and identify potential model-induced bias. Our results demonstrate that the FC-DAE provides robust denoising performance with high computational efficiency, enabling recovery of XPCS dynamics under photon-limited and low-dose measurement conditions.

URL PDF HTML ☆

赞 0 踩 0

2605.29963 2026-05-29 cs.CR cs.AI cs.LG 版本更新

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

Honeyval: 基于LLM的HTTP蜜罐综合评估框架

Mark Vero, Fabian Kaczmarczyck, Ivan Petrov, Ilia Shumailov, Jamie Hayes, Niels Heinen, Tianqi Fan, Luca Invernizzi, Martin Vechev

发表机构 * ETH Zurich（苏黎世联邦理工学院）； Google（谷歌）； Google DeepMind（谷歌深Mind）； AI Sequrity Company（AI安全公司）； Independent（独立）

AI总结提出Honeyval评估框架，通过16个后端应用、AI攻击代理、控制任务和可验证利用目标，系统评估LLM驱动的HTTP蜜罐，发现其相比规则基线能显著延长攻击交互、降低被前沿模型检测率，且保持成本优势。

详情

AI中文摘要

蜜罐是模拟真实系统组件的诱饵系统，旨在防御网络攻击。最近，LLM越来越多地作为蜜罐的模拟骨干。它们使防御者能够构建高交互蜜罐，同时降低系统安全风险。然而，基于LLM的蜜罐开发缺乏统一的评估框架。大多数评估包括测量固定命令上的响应相似性、手动测试或实际部署。这些方法通常不可扩展用于开发、不可跨评估复现、不能代表实际攻击，或不能适应各种攻击者和蜜罐配置。在这项工作中，我们弥补了这一差距，提出了Honeyval，一个针对LLM驱动的HTTP蜜罐的综合评估框架。我们通过将蜜罐基于16个后端应用程序、使用AI黑客代理作为攻击者、采用两个控制任务来监控代理和蜜罐在定制化方面的能力，以及为攻击者定义清晰且可验证的利用目标，解决了先前评估的局限性。使用Honeyval，我们对近期成本高效的LLM作为HTTP蜜罐进行了广泛评估。我们的实验突出了LLM驱动的蜜罐的前景；它们与基于规则的基线蜜罐相比，导致与攻击者的交互时间显著延长，并且即使被前沿模型检测到的频率也远低得多，同时平均而言，保持了针对代理攻击者的运行成本优势。此外，我们实验了不同的反攻蜜罐配置，并观察到了独特的权衡，例如以增加检测为代价获得更长的交互。

英文摘要

Honeypots are decoy systems mimicking real system components designed to defend against cyber attacks. Recently, LLMs increasingly serve as simulation backbones for honeypots. They enable defenders to construct high-interaction honeypots with low system security risks. However, LLM-powered honeypot development lacks a unified evaluation framework. Most evaluations consist of measuring response similarity on fixed commands, manual testing, or real-world deployment. These methods are often not scalable for development, reproducible across evaluations, representative of practical attacks, or adaptable to various attacker and honeypot configurations. In this work, we bridge this gap and propose Honeyval, a comprehensive evaluation framework for LLM-powered HTTP honeypots. We address the limitations of prior evaluations by grounding the honeypots in 16 backend applications, using AI hacking agents as attackers, employing two control tasks to monitor agent and honeypot capabilities across customizations, and defining clear and verifiable exploit goals for the attacker. Using Honeyval, we conduct an extensive evaluation of recent cost-efficient LLMs as HTTP honeypots. Our experiments highlight the promise of LLM-powered honeypots; they lead to substantially longer interactions with the attacker than rule-based baseline honeypots and are far less frequently detected even by frontier models, all while, on average, preserving a running cost advantage against agentic attackers. Further, we experiment with different counter-offensive honeypots configurations, and observe unique trade-offs, such as longer interactions at the cost of increased detection.

URL PDF HTML ☆

赞 0 踩 0

2605.29952 2026-05-29 cs.LG 版本更新

From Short Histories to Long Futures: Horizon-Aware Graph Neural Networks for Long Horizon Forecasting

从短历史到长未来：面向长时域预测的视界感知图神经网络

Zesheng Liu, Maryam Rahnemoonfar

发表机构 * Department of Computer Science and Engineering, Lehigh University（计算机科学与工程系，莱维大学）； Department of Civil and Environmental Engineering, Lehigh University（土木与环境工程系，莱维大学）

AI总结提出一种多视界图神经网络模拟器，通过共享图骨干网络和增量预测策略，联合优化多步超前预测，实现长时域稳定且准确的地球物理系统模拟。

Comments Accepted for International Conference on Pattern Recognition (ICPR) 2026

详情

AI中文摘要

由于强非线性动力学、全物理模拟的高计算成本以及单步自回归代理在数十年滚动中产生的误差累积，地球物理系统的精确长期预测十分困难。深度神经网络可作为高效模拟器，但大多数仅训练用于下一步预测，且随着预测视界增长常出现漂移或不稳定。我们提出一种多视界图神经网络模拟器，在统一模型中学习从单个当前时间到多个未来超前时间的状态到状态转换。物理域表示为图，其中节点对应具有时变地球物理属性的空间位置，边编码局部空间相互作用。给定当前图状态，模型预测关键场（冰厚度和冰速度）在所有节点上的未来演化，使用共享图骨干网络和每个目标变量的独立输出分支。为提高稳定性，网络预测相对于当前状态的状态增量，然后将其加回以重建未来状态。训练联合优化所有超前时间，使用统一回归目标，推理采用从粗到细的滚动方式，以较大步长推进并有选择地以较短步长细化，以减少漂移并避免冗余计算。在数十年期松岛冰川模拟上的实验表明，我们的方法在长期精度和稳定性上均优于（i）直接从初始状态预测每个未来时间的基线模型和（ii）标准单步自回归滚动，为下游气候和海平面研究提供了更可靠的模拟器。

英文摘要

Accurate long-range prediction of geophysical systems is difficult due to strongly nonlinear dynamics, the high computational cost of full-physics simulations, and the error accumulation that arise when one-step autoregressive surrogates are rolled out over decades. Deep neural network can serve as efficient emulators, but most are trained only for next-step prediction and often drift or become unstable as the forecast horizon grows. We propose a multi-horizon graph neural network emulator that learns state-to-state transitions from a single current time to multiple future lead times within one unified model. The physical domain is represented as a graph, where nodes correspond to spatial locations with time-varying geophysical attributes and edges encode local spatial interactions. Given the current graph state, the model predicts the future evolution of key fields, ice thickness and ice velocities at all nodes, using a shared graph backbone with separate output branches for each target variable. To improve stability, the network predicts state increments relative to the current state, which are then added back to reconstruct future states. Training jointly optimizes all lead times with a unified regression objective, and inference uses a coarse-to-fine rollout that advances with larger jumps and selectively refines with shorter jumps to reduce drift and avoid redundant computation. Experiments on multi-decadal Pine Island Glacier simulations show that our approach achieves higher long-range accuracy and improved stability than both (i) an initial-state baseline that predicts each future time directly from the starting state and (ii) a standard single-step autoregressive rollout, producing a more reliable emulator for downstream climate and sea-level studies.

URL PDF HTML ☆

赞 0 踩 0

2605.29951 2026-05-29 cs.AI cs.CL cs.LG cs.MM 版本更新

MuPHI: Learning Implicit Multimodal Harm Reasoning via Semantically Grounded Reward Optimization

MuPHI: 通过语义基础奖励优化学习隐式多模态有害推理

Anisha Saha, Varsha Suresh, Teodora Kamova, Sophia Wiedmann, Timothy Hospedales, Vera Demberg

发表机构 * Max Planck Institute for Informatics（马克斯·普朗克院信息研究所）； Saarland Informatics Campus（萨尔兰州信息校园）； Saarland University（萨尔兰州大学）； The University of Edinburgh（爱丁堡大学）； Samsung AI Center, Cambridge（三星AI中心，剑桥）

AI总结针对视觉语言模型在隐式跨模态有害语义推理上的不足，提出MuPHI数据集和MuPHIRM训练框架，通过多视角奖励优化联合语义学习，提升有害检测与推理质量及分布外鲁棒性。

详情

AI中文摘要

理解看似良性的图像-文本对之间交互如何产生危害，需要超越表面特征的意图感知跨模态推理。现有的视觉语言模型（VLM）擅长对感知线索进行字面推理，但往往无法推导出依赖于隐式、上下文相关推理的有害语义。为了评估VLM在组合性有害检测和推理方面的能力，我们引入了多模态语用有害解释（MuPHI）数据集，其中包含有害编码在微妙多模态线索中的图像-文本对。MuPHI涵盖多种有害类别，并包含用于评估VLM推理链的注释有害理由。为了改进VLM的检测和推理能力，我们提出了MuPHIRM，一种推理增强的训练框架，通过优化多视角奖励来学习联合语义。MuPHIRM提高了VLM的有害检测和推理质量，同时与训练和推理时基线相比，表现出优越的分布外鲁棒性。我们的发现表明，面向推理的奖励优化为构建超越基准特定捷径进行泛化的多模态系统提供了一个有前景的方向。

英文摘要

Understanding how harm emerges from interaction between otherwise benign image-text pairs requires intent-aware cross-modal reasoning beyond surface-level features. Existing vision-language models (VLMs) excel at literal reasoning over perceptual cues but often fail to derive harmful semantics that rely on implicit, context-dependent reasoning. To evaluate VLMs on compositional harm detection and reasoning, we introduce Multimodal Pragmatic Harm Interpretation (MuPHI), a dataset containing image-text pairs where harm is encoded in subtle multimodal cues. MuPHI spans diverse harm categories and includes annotated harm rationales for assessing VLM reasoning chains. To improve both detection and reasoning in VLMs, we propose MuPHIRM, a reasoning-augmented training framework which learns joint semantics by optimizing multi-perspective rewards. MuPHIRM improves both harm detection and reasoning quality of VLMs while demonstrating superior out-of-distribution robustness compared to both trained and inference-time baselines. Our findings suggest that reasoning-oriented reward optimization offers a promising direction towards building multimodal systems that generalize beyond benchmark-specific shortcuts.

URL PDF HTML ☆

赞 0 踩 0

2605.29943 2026-05-29 cs.HC cs.ET cs.LG 版本更新

A Domain-Informed Multi-Objective Framework for EEG Channel Selection in Motor Imagery BCIs

一种领域信息驱动的多目标框架用于运动想象脑机接口中的EEG通道选择

Dekka Muni Kumar, Dhruba Jyoti Kalita, Yogesh Kumar Meena

发表机构 * Human-AI Interaction (HAIx) Lab, IIT Gandhinagar（人机交互（HAIx）实验室，印度冈达恩加尔理工学院）

AI总结提出一种基于多目标优化（NSGA-II、MOPSO、MOEA/D）的EEG通道选择框架，通过高斯核评估空间相关性、任务相关去同步评估功能区分性，在四个数据集上优于单目标方法，实现紧凑通道子集和高分类性能。

Comments This work has been submitted to the IEEE for possible publication

详情

AI中文摘要

使用脑电图（EEG）信号进行运动想象（MI）分类对于推进脑机接口（BCI）至关重要。传统的EEG通道选择方法通常面临局限性，例如依赖单目标标准和易陷入局部最优。为了解决这些挑战，本文提出了一种多目标优化框架，采用非支配排序遗传算法、多目标粒子群优化和基于分解的多目标进化算法。我们的方法有效平衡了空间相关性（使用高斯核）和功能区分性（评估试验内任务相关去同步），从而提高了性能。我们在四个EEG数据集（Physionet、OpenBMI、HighGamma和BCIIV-2A）上评估了该框架。所提出的方法成功识别出紧凑且相关的通道子集，这些子集集中在与MI活动相关的感觉运动皮层区域，解决了传统技术中普遍存在的维度和复杂性挑战。此外，该框架在Physionet、OpenBMI、HighGamma和BCIIV-2A数据集上分别达到了87%、71%、75%和65%的分类性能。通过优于现有的单目标和基于准确率的方法以及依赖固定子集的方法，这些发现表明，这种新的多目标优化框架可以增强基于MI的BCI性能，同时促进紧凑的通道配置，降低计算复杂度，使其更适合可穿戴、便携式和实时BCI应用。

英文摘要

Motor imagery (MI) classification using electroencephalography (EEG) signals is essential for advancing brain-computer interfaces (BCIs). Traditional EEG channel selection methods often face limitations, such as dependency on single-objective criteria and susceptibility to local optima. To address these challenges, this work proposes a multi-objective optimisation framework that employs non-dominated sorting genetic algorithm, multiple-objective particle swarm optimisation, and a multi-objective evolutionary algorithm based on decomposition. Our approach effectively balances spatial relevance, using a Gaussian kernel, and functional discriminability, which assesses intratrial task-related desynchronisation, thereby improving performance. We evaluated this framework on four EEG datasets: Physionet, OpenBMI, HighGamma, and BCIIV-2A. The proposed approach successfully identifies compact, relevant channel subsets concentrated around sensorimotor cortex regions linked to MI activity, addressing the prevalent challenges of dimensionality and complexity inherent to traditional techniques. Furthermore, the framework achieved classification performance of 87%, 71%, 75%, and 65% on the Physionet, OpenBMI, HighGamma, and BCIIV-2A datasets, respectively. By outperforming existing single-objective and accuracy-based methods, and those relying on fixed subsets, these findings demonstrate that this new multi-objective optimisation framework can enhance MI-based BCI performance while facilitating compact channel configurations with reduced computational complexity, making them better suited for wearable, portable, and real-time BCI applications.

URL PDF HTML ☆

赞 0 踩 0

2605.29941 2026-05-29 cs.NI cs.LG 版本更新

通过逐像素生成图像插值减少空间推进薄膜冷却分析中的实验测试

Adam T. Müller, Philipp J. Teuffel, Konstantin Manassis, Nicolaj C. Stache

发表机构 * Heilbronn University of Applied Sciences（海德堡应用科学大学）； Center for Machine Learning（机器学习中心）； Max-Planck-Str. 39（马克斯-普朗克街39号）； German Aerospace Center (DLR)（德国航空航天中心（DLR））； Institute of Space Propulsion（空间推进研究所）

AI总结提出一种基于轻量级前馈神经网络和位置编码的机器学习方法，从稀疏实验测量中进行图像回归，以减少推进系统薄膜冷却研究中的物理测试需求。

Comments Presented at the 11th European Conference for Aeronautics and Aerospace Sciences (EUCASS), 2025, DOI: 10.13009/EUCASS2025-285

详情

DOI: 10.13009/EUCASS2025-285

AI中文摘要

我们提出了一种从稀疏实验测量中进行图像回归的机器学习方法。我们展示了该方法在推进系统开发中薄膜冷却研究中的应用，旨在减少对大量物理测试的需求。我们的方法采用带有位置编码的轻量级前馈神经网络，根据输入参数生成图像。在真实和合成数据上的验证表明，该方法在减少30%测量量的同时，实现了高图像相似度（RMSE < 8%，SSIM > 93%）。我们进一步提出了一种知识驱动的扩展，用于生成图像的局部适应性。该方法显著减少了所需测试次数，同时保持了高质量数据，从而能够高效优化冷却剂喷射器配置，其应用范围超越航空航天领域。

英文摘要

We propose a machine learning approach for image regression from sparse experimental measurements. We show the application of the proposed method on film cooling studies in propulsion system development, aiming to reduce the need for extensive physical testing. Our method employs a lightweight feed-forward neural network with positional encoding to generate images conditioned by input parameters. Validated on real and synthetic data, it achieves high image similarity (RMSE < 8 %, SSIM > 93 %) while maintaining accuracy with a 30 \% reduction of measurements. We further propose a knowledge-informed extension for local adaptability of the generated images. This approach significantly reduces required tests while preserving high-quality data, enabling efficient optimization of coolant injector configurations with applications beyond aerospace.

URL PDF HTML ☆

赞 0 踩 0

2605.29908 2026-05-29 stat.ML cs.LG 版本更新

Joint Model and Data Sparsification via the Marginal Likelihood

通过边际似然进行联合模型与数据稀疏化

Alexander Timans, Thomas Möllenhoff, Christian A. Naesseth, Mohammad Emtiyaz Khan, Eric Nalisnick

发表机构 * RIKEN Center for AI Project, Tokyo, Japan（日本东京RIKEN人工智能项目中心）； Department of Computer Science, Johns Hopkins University（约翰霍普金斯大学计算机科学系）

AI总结提出通过边际似然联合学习特征和样本相关性，实现同时模型与数据稀疏化的贝叶斯方法，在保持共轭性和闭式更新的同时提升鲁棒性。

Comments 36 pages, 8 figures, 12 tables (incl. appendix); published at ICML 2026

详情

AI中文摘要

线性系统中的稀疏恢复支撑着从信号处理到高维回归的应用。基于自动相关性确定（ARD）原理的稀疏贝叶斯学习，通过边际似然优化为特征稀疏性提供了一种实用的贝叶斯机制。然而，其对同方差噪声模型的依赖使其对数据污染（如异常值或错误指定的噪声）敏感，损害了模型拟合和预测。相反，我们提出联合学习个体特征和样本相关性，通过单一贝叶斯目标实现同时模型与数据稀疏化。这种模型和数据的对称剪枝提供了一种自然扩展，保持了共轭性，允许标准优化过程的闭式更新，并与鲁棒回归和影响函数的观点一致。跨多种回归任务的实证结果证实，联合ARD方法一致地产生稀疏且鲁棒的预测模型。

英文摘要

Sparse recovery in linear systems underpins applications from signal processing to high-dimensional regression. Sparse Bayesian Learning, grounded in the principle of automatic relevance determination (ARD), offers a practical Bayesian mechanism for feature sparsity via marginal likelihood optimization. Yet, its reliance on a homoscedastic noise model renders it sensitive to data contaminations such as outliers or misspecified noise, harming model fit and predictions. Instead, we propose jointly learning individual feature and sample relevancies, enabling simultaneous model and data sparsification via a single Bayesian objective. This symmetric pruning of model and data offers a natural extension that preserves conjugacy, admits closed-form updates for standard optimization procedures, and aligns with perspectives from robust regression and influence functions. Empirical results across diverse regression tasks affirm that a joint ARD approach consistently yields both sparse and robust prediction models.

URL PDF HTML ☆

赞 0 踩 0

2605.29901 2026-05-29 cs.CR cs.LG 版本更新

Dissecting the Black Box: Circuit-Level Analysis of LLM Vulnerability Detection

剖析黑箱：LLM 漏洞检测的电路级分析

Syafiq Al Atiiq, Chun Zhou, Christian Gehrmann

发表机构 * Lund University（隆德大学）

AI总结通过机械可解释性分析 Gemma-2-2b 模型在 C/C++ 漏洞检测中的内部计算，发现模型主要依赖安全检测器（识别安全编码模式的注意力头）而非直接检测漏洞特征，并识别出关键神经组件（早期层注意力头和 MLP 神经元），通过消融实验验证其因果作用。

Comments 11 pages, 6 figures. Supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP)

详情

AI中文摘要

大型语言模型（LLM）能够检测软件漏洞，但它们实际上是如何识别易受攻击的代码的呢？我们利用机械可解释性来回答这个问题；分析神经网络的内部计算以理解其推理过程。通过在 Gemma-2-2b 上使用 Circuit Tracer，我们追踪了模型将 472 个 C/C++ 代码样本分类为易受攻击或安全时所激活的计算路径。我们的分析揭示了一个令人惊讶的发现：模型主要依赖安全检测器（即识别安全编码模式的注意力头），而不是直接检测漏洞特征。当这些安全检测器未能激活时，模型将代码分类为易受攻击。我们识别出了关键的神经组件：早期层（L5、L7）中专注于安全模式的特定注意力头，以及第 7 层中编码漏洞相关特征的多层感知器（MLP）神经元。消融实验证实了它们的因果作用；移除第 11 层会使漏洞检测准确率从 100% 降至 6%，而仅消融第 7 层中的 20 个神经元就会使其降低 50%。我们的发现表明，LLM 漏洞检测使用了稀疏、可解释的电路（仅占模型容量的 16%），从而能够为安全预测提供电路级解释，并有针对性地改进检测系统。

英文摘要

Large language models (LLMs) can detect software vulnerabilities, but how do they actually identify vulnerable code? We address this question using mechanistic interpretability; analyzing the internal computations of a neural network to understand its reasoning process.Using Circuit Tracer on Gemma-2-2b, we trace the computational pathways activated when the model classifies 472 C/C++ code samples as vulnerable or safe. Our analysis reveals a surprising finding: the model primarily relies on safety detectors, attention heads that recognize safe coding patterns, rather than directly detecting vulnerability signatures. When these safety detectors fail to activate, the model classifies code as vulnerable. We identify the critical neural components: specific attention heads in early layers (L5, L7) that focus on safety patterns, and Multilayer Perceptron (MLP) neurons in Layer 7 that encode vulnerability-related features. Ablation experiments confirm their causal role; removing Layer 11 drops vulnerability detection accuracy from 100% to 6%, while ablating just 20 neurons in Layer 7 reduces it by 50%.Our findings show that LLM vulnerability detection uses sparse, interpretable circuits (only 16% of model capacity), enabling circuit-level explanations for security predictions and targeted improvements to detection systems.

URL PDF HTML ☆

赞 0 踩 0

2605.29900 2026-05-29 cs.LG cs.IT math.IT 版本更新

ESPO：早期停止的近端策略优化

Zihang Li, Rui Zhou, Yingcheng Shi, Wenhan Yu, Zhewen Tan, Zixiang Liu, Zeming Li, Binhua Li, Yongbin Li, Tong Yang, Jieping Ye

发表机构 * Tongyi Lab（通义实验室）； Alibaba Group（阿里巴巴集团）； Peking University（北京大学）

AI总结提出ESPO算法，通过在强化学习训练大语言模型时在线检测轨迹失败并提前终止，节省计算资源并提升数学推理性能。

详情

AI中文摘要

当大语言模型在强化学习过程中，在轨迹早期出现错误的推理步骤时，标准算法会强制其继续生成直到最大步长，从而在从未获得正奖励的令牌上浪费计算资源，并用失败后的噪声污染优势估计。我们提出ESPO（早期停止的近端策略优化），该算法能够在线检测轨迹失败并提前终止轨迹生成。在每个生成步骤中，ESPO仅利用采样过程中已计算出的logits计算一个替代遗憾值，并在平滑累积遗憾值显著超过其估计值时终止。截断轨迹被视为具有终止奖励的吸收失败状态，将负的时间差分误差集中在检测到的失败步骤附近，无需任何额外的奖励模型或人工标注。在基于DeepSeek-R1-Distill-Qwen-7B训练的数学推理任务上，ESPO在AIME 2024（46.28% vs. 45.25%）、AMC 2023（85.83% vs. 82.94%）和MATH-500（87.42% vs. 85.43%）上超越了PPO，同时累计节省了超过20%的轨迹生成令牌。

英文摘要

When a large language model under reinforcement learning commits a wrong reasoning step early in a trajectory, standard algorithms force it to keep generating until the maximum horizon, spending compute on tokens that never receive positive reward and polluting advantage estimates with post-failure noise. We propose ESPO (Early-Stopping Proximal Policy Optimization), which detects trajectory failure on-the-fly and terminates rollouts early. At each generation step, ESPO computes a surrogate regret using only the logits already computed during sampling, and terminates when the smoothed cumulative regret significantly exceeds its estimated values. Truncated trajectories are treated as absorbing failure states with a terminal reward, concentrating negative temporal-difference (TD) errors near the detected failure step without any additional reward model or human annotation. On DeepSeek-R1-Distill-Qwen-7B trained for mathematical reasoning, ESPO surpasses PPO on AIME~2024 (46.28% vs. 45.25%), AMC~2023 (85.83% vs. 82.94%), and MATH-500 (87.42% vs. 85.43%), while saving more than 20% rollout tokens cumulatively.

URL PDF HTML ☆

赞 0 踩 0

2605.29857 2026-05-29 cs.LG 版本更新

Feedback-to-Rubrics: Can We Learn Expert Criteria from Inline Comments?

从内联评论到评分标准：我们能从内联评论中学习专家标准吗？

Kotaro Yoshida, So Kuroki, Yuki Imajuku, Taishi Nakamura, Ryunosuke Iwai, Haruki Goda, Takuya Akiba

发表机构 * Sakana AI ； Institute of Science Tokyo（东京科学研究所）

AI总结提出从内联评论中学习可复用的自然语言评分标准的方法，通过迭代优化评分标准来预测评论并支持自动修订。

2605.29850 2026-05-29 cs.LG 版本更新

MIRAGE: Adaptive Multimodal Gating for Whole-Brain fMRI Encoding

MIRAGE：用于全脑fMRI编码的自适应多模态门控

Abdulkadir Gokce, Badr AlKhamissi, Martin Schrimpf

发表机构 * Qwen3-Omni-30B-A3B-Thinking（通义千问3- Omni-30B-A3B-Thinking）

AI总结提出MIRAGE框架，通过原生多模态骨干网络和自适应特征门控，实现全脑fMRI对自然视听刺激的高精度编码，并证明原生多模态特征优于后期融合的单模态特征。

Comments Preprint. First two author contributed equally

详情

AI中文摘要

近期任务优化神经网络的进展已将编码模型确立为预测大脑对自然刺激反应的有力工具，然而现有方法大多依赖单模态表示。全模态基础模型和丰富的多模态神经数据集的出现，使得能够联合整合跨被试的视觉、听觉和语言信息的编码模型成为可能。我们提出MIRAGE，一个用于预测全脑fMRI对自然视听刺激反应的脑编码框架。MIRAGE通过原生多模态骨干网络和跨层自适应特征门控实现了最先进的性能。这些表示随后与基于transformer的脑编码器和跨皮层分区的被试特定线性头相结合。控制比较表明，原生多模态特征在架构层次和骨干网络上始终优于独立单模态特征的事后聚合。除了预测准确性，学习的注意力权重可直接检查以解释骨干网络上的模态特定门控分布，每种模态在皮层上描绘出不同的解剖模式。综合这些结果，提出了原生多模态特征的自适应逐层聚合作为全脑编码的一种可泛化、可解释且准确的方法。

英文摘要

Recent progress in task-optimized neural networks has established encoding models as a powerful tool for predicting brain responses to naturalistic stimuli, yet most existing approaches rely on unimodal representations. The emergence of omni-modal foundation models and rich multimodal neural datasets enables encoding models that jointly integrate visual, auditory, and linguistic information across subjects. We introduce MIRAGE, a brain encoding framework for predicting whole-brain fMRI responses to naturalistic audiovisual stimuli. MIRAGE achieves state-of-the-art performance via a native multimodal backbone and adaptive feature gating across layers. These representations are then combined with a transformer-based brain encoder and a subject-specific linear head over the cortical parcels. Controlled comparisons show that natively multimodal features consistently outperform post-hoc aggregation of independent unimodal features, across architectural levels and backbones. Beyond predictive accuracy, the learned attention weights are directly inspectable to interpret the modality-specific gating profile over the backbone, and each modality traces a distinct anatomical pattern across cortex. Together, these results propose adaptive layer-wise aggregation of natively multimodal features as a generalizable, interpretable, and accurate approach for whole-brain encoding.

URL PDF HTML ☆

赞 0 踩 0

2605.29849 2026-05-29 eess.SY cs.LG cs.SY 版本更新

BuilDyn: Excitation-Driven Data Generation for Building Thermal Dynamics Modeling and Control

BuilDyn: 面向建筑热动力学建模与控制的激励驱动数据生成

Felix Koch, Thomas Krug, Fabian Raisch, Benjamin Schäfer, Benjamin Tischler

发表机构 * Technical University of Applied Sciences Rosenheim（应用技术大学罗森海姆）； Technical University of Munich（慕尼黑技术大学）； Karlsruhe Institute of Technology（卡尔斯鲁厄理工学院）

AI总结本文提出BuilDyn包，通过可定制的激励策略生成控制导向的建筑数据，提升机器学习模型对未见工况的鲁棒性。

详情

AI中文摘要

机器学习越来越多地用于建筑的数据驱动建模，以实现故障检测与诊断、节能控制等下游任务。虽然最近的工作改善了跨建筑特性、天气和占用率的泛化能力，但泛化也依赖于对控制驱动系统状态空间的充分探索。现有的真实世界数据集和仿真环境主要反映固定控制策略下的稳态运行，导致激励有限，对未见工况的鲁棒性降低。本文介绍了基于BuilDa的BuilDyn包，该包支持可定制的激励策略用于控制导向的数据生成。BuilDyn还支持从代表性建筑分布中采样，并提供Python接口以便轻松集成到机器学习流水线中。我们通过比较在非激励和激励数据上训练的数据驱动ML模型在一栋建筑上的性能，展示了BuilDyn的优势。借助BuilDyn，我们希望推进可扩展的控制导向建模，并支持迁移学习和建筑特定基础模型等未来方向。

英文摘要

Machine learning (ML) is increasingly used for data-driven modeling of buildings to enable downstream tasks such as fault detection and diagnosis, and energy-efficient control. While recent work improves generalization across building characteristics, weather, and occupancy, generalization also depends on sufficient exploration of the control-driven system state space. Existing real-world datasets and simulation environments predominantly reflect stationary operation under fixed control policies, resulting in limited excitation and reduced robustness to unseen operating conditions. This paper introduces BuilDyn, a package based on BuilDa that enables customizable excitation strategies for control-oriented data generation. BuilDyn further supports sampling from representative building distributions and provides a Python interface for easy integration into machine learning pipelines. We demonstrate the benefits of BuilDyn by comparing the performance of data-driven ML models trained on non-excited and excited data for one building. With BuilDyn, we hope to advance scalable control-oriented modeling and support future directions such as transfer learning and building-specific foundation models.

URL PDF HTML ☆

赞 0 踩 0

2605.29843 2026-05-29 cs.LG cs.AI 版本更新

HARP: Hadamard-Preconditioned Adaptive Rotation Processor for Extreme LLM Quantization

HARP: 哈达玛预条件自适应旋转处理器用于极端LLM量化

Artur Zagitov, Gleb Molodtsov, Aleksandr Beznosikov

发表机构 * BRAIn Lab（BRAIn实验室）

AI总结提出HARP，一种可学习的结构化双正交处理器，替代固定随机哈达玛变换，通过自适应旋转基来改善极端低位量化中的激活异常值和各向异性权重曲率问题，在2-4比特设置下提升困惑度和零样本准确率，并保持部署效率。

详情

AI中文摘要

后训练量化（PTQ）对于在内存和带宽约束下部署LLM至关重要。然而，极端低位量化仍然对激活异常值和各向异性权重曲率高度敏感。现有的基于非相干性的PTQ方法通过固定的随机哈达玛变换（RHT）缓解了这一问题，这提高了量化鲁棒性，但无法将旋转基适应于层、校准分布或量化器。我们引入了HARP（哈达玛预条件自适应旋转处理器），一种可学习的结构化双正交处理器，它替代了固定的哈达玛混合，同时保留了精确的全精度等价性。HARP将每个旋转表示为稀疏蝶形类块正交阶段的乘积，通过混合基数调度支持非2的幂次维度，并初始化为RHT处理器（最多一个固定排列）。仅在校准数据上拟合，HARP将量化基适应于每一层和后端。在从1B到70B参数的模型的2-4比特设置中，HARP在困惑度和零样本准确率上优于固定RHT。重要的是，HARP保持了部署效率，达到128 tok/s，而FP16为61 tok/s。

英文摘要

Post-training quantization (PTQ) is essential for deploying LLMs under memory and bandwidth constraints. However, extreme low-bit quantization remains highly sensitive to activation outliers and anisotropic weight curvature. Existing incoherence-based PTQ methods mitigate this issue with fixed randomized Hadamard transforms (RHTs), which improve quantization robustness but cannot adapt the rotated basis to the layer, calibration distribution, or quantizer. We introduce HARP (Hadamard-preconditioned Adaptive Rotation Processor), a learnable structured two-sided orthogonal processor that replaces fixed Hadamard mixing while preserving exact full-precision equivalence. HARP represents each rotation as a product of sparse butterfly-like block-orthogonal stages, supports non-power-of-two dimensions via Mixed-Radix schedules, and initializes to the RHT processor up to a fixed permutation. Fitted only on calibration data, HARP adapts the quantization basis to each layer and backend. Across 2-4 bit settings on models ranging from 1B to 70B parameters, HARP improves perplexity and zero-shot accuracy over fixed RHT. Importantly, HARP preserves deployment efficiency, reaching 128 tok/s versus 61 tok/s for FP16.

URL PDF HTML ☆

赞 0 踩 0

2605.29836 2026-05-29 cs.LG cs.AI stat.ML 版本更新

CB-SLICE: Concept-Based Interpretable Error Slice Discovery

CB-SLICE: 基于概念的可解释错误切片发现

Yael Konforti, Mateo Espinosa Zarlenga, Elaf Almahmoud, Mateja Jamnik

发表机构 * Department of Computer Science and Technology, University of Cambridge, Cambridge, UK（计算机科学与技术系，剑桥大学，剑桥，英国）； Trinity College, University of Oxford, Oxford, UK（牛津大学三一学院，牛津，英国）； Cambridge Institute for Technology and Humanity, Cambridge, UK（剑桥技术与人类研究所，剑桥，英国）

AI总结提出CB-SLICE方法，利用概念瓶颈模型的概念预测失败来发现错误切片，并通过关键词概念解释失败模式，优于现有方法。

Comments 20 pages, 7 figures, 12 tables, to be published at Proceedings of the 43rd International Conference on Machine Learning (ICML 2026)

详情

回归中插值与聚合的相互作用：最优样本复杂度

Mikael Møller Høgsgaard, Kasper Green Larsen, Liang-Yu Zou

发表机构 * Department of Computer Science, Aarhus University（奥胡斯大学计算机科学系）； Department of Statistics, University of Oxford（牛津大学统计学系）

AI总结本文从理论上研究回归中插值与聚合的相互作用，证明γ-图维度刻画了广泛自然聚合过程的可学习性，并发现通过中位数聚合三个插值假设的简单过程在所有聚合过程中最优，且严格强于恰当学习。

2605.29809 2026-05-29 cs.CR cs.CV cs.GR cs.LG cs.MM 版本更新

Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing

Cert-LAS：通过层自适应平滑实现文本到图像扩散模型的认证模型所有权验证

Leyi Qi, Yiming Li, Siyuan Liang, Zhengzhong Tu, Dacheng Tao

发表机构 * Generative AI Lab, College of Computing ； Data Science, Nanyang Technological University, Singapore ； Department of Computer Science ； Engineering, Texas A\&M University, USA

AI总结提出Cert-LAS方法，基于层自适应平滑和扩散分类器嵌入水印，通过假设检验验证模型所有权，并证明在恶意移除攻击下仍能可靠验证。

Comments This paper has been accepted to the International Conference on Machine Learning (ICML) 2026. 26 pages

详情

AI中文摘要

大规模文本到图像（T2I）扩散模型实现了前所未有的创意应用，但其未经授权的使用引发了严重的知识产权问题，使得模型所有权验证（MOV）日益关键。我们发现现有的基于后门的扩散水印方法通常（隐式地）假设一个“忠实”的验证过程，即验证者可以查询可疑模型并获得忠实的水印响应以完成MOV。然而，在实践中，攻击者可能有意或无意地破坏潜在的水印信号，显著降低验证可靠性。为解决此问题，我们提出Cert-LAS，首个基于层自适应平滑的T2I模型认证MOV方法。通常，Cert-LAS使用扩散分类器和LFS引导的层自适应噪声嵌入指定水印，并通过假设检验检查可疑模型是否表现出比无水印参考显著更强的水印响应来验证所有权。我们进一步证明，在特定条件下，即使存在恶意移除攻击，我们的Cert-LAS仍能实现可靠验证。大量实验验证了Cert-LAS的有效性及其对自适应攻击的抵抗力。我们的代码可在https://github.com/Leyi-Qi/Cert-LAS获取。

英文摘要

Large-scale text-to-image (T2I) diffusion models have enabled unprecedented creative applications, but their unauthorized use has raised serious intellectual property concerns, making model ownership verification (MOV) increasingly critical. We find that existing backdoor-based diffusion watermarking methods often (implicitly) assume a "faithful" verification process, namely, that the verifier can query a suspicious model and obtain the faithful watermark response to complete MOV. However, in practice, adversaries may intentionally or unintentionally damage potential watermark signals, significantly degrading verification reliability. To address this issue, we propose Cert-LAS, the first certified MOV method for T2I models based on layer-adaptive smoothing. In general, Cert-LAS embeds specified watermarks using diffusion classifiers and an LFS-guided layer-adaptive noise, and verifies ownership by examining whether the suspected model exhibits significantly stronger watermark responses compared to unwatermarked references through hypothesis testing. We further prove that, under certain conditions, our Cert-LAS can still achieve reliable verification even in the presence of malicious removal attacks. Extensive experiments validate the effectiveness of Cert-LAS and its resistance to adaptive attacks. Our code is available at https://github.com/Leyi-Qi/Cert-LAS.

URL PDF HTML ☆

赞 0 踩 0

2605.29807 2026-05-29 cs.CL cs.AI cs.LG 版本更新

Data filtering methods for training language models

训练语言模型的数据过滤方法

Egor Shevchenko, Elena Bruches

发表机构 * Novosibirsk State University（新西伯利亚国立大学）； A. P. Ershov Institute of Informatics Systems SB RAS（A. P. Ershov 信息系统研究所）

AI总结本文比较了Confident Learning和Dataset Cartography两种自动标签错误检测方法在俄语文本分类任务中的效果，发现其有效性依赖于数据集特性，在小规模高噪声数据集上Confident Learning显著提升F1-macro。

Comments AINL-2026

详情

AI中文摘要

数据质量是机器学习模型有效性的关键因素。即使广泛使用的基准数据集中也存在标签错误，这些错误会引入训练数据噪声并降低模型泛化能力。在本工作中，我们对两种自动标签错误检测方法——Confident Learning和Dataset Cartography——在三个俄语文本分类语料库上进行了比较分析，这些语料库在规模、类别数量和领域上各不相同：ru_emotion_e-culture（49,123个样本，情感分类）、RuCoLA（8,524个样本，语言可接受性）和TERRa（2,337个样本，文本蕴含识别）。我们使用在每个语料库上微调的预训练rubert-base-cased模型。为了验证过滤的意义，我们进行了控制实验，随机移除等量样本。结果表明，两种方法的有效性强烈依赖于数据集特征：在噪声水平低的大规模语料库上，过滤并未提升性能，而在噪声高的小规模数据集上，Confident Learning实现了显著的F1-macro提升。Dataset Cartography表现出更保守的行为，移除的样本更少。在所有语料库中，两种方法的目标性移除均优于随机移除，证实了这些方法的意义。

英文摘要

Data quality is a critical factor in the effectiveness of machine learning models. Label errors, present even in widely used benchmarks, introduce noise into training data and reduce model generalization. In this work, we conduct a comparative analysis of two automatic label error detection methods - Confident Learning and Dataset Cartography - on three Russian text classification corpora of varying size, number of classes, and domain: ru_emotion_e-culture (49,123 examples, emotion classification), RuCoLA (8,524 examples, linguistic acceptability), and TERRa (2,337 examples, textual entailment recognition). We use the pre-trained rubert-base-cased model fine-tuned on each corpus. To verify the meaningfulness of filtering, we conduct control experiments with random removal of an equivalent number of examples. Results show that the effectiveness of both methods depends strongly on dataset characteristics: on large corpora with low noise levels, filtering does not improve performance, while on small datasets with high noise, Confident Learning achieves a significant F1-macro improvement. Dataset Cartography demonstrates more conservative behavior, removing fewer examples. Across all corpora, targeted removal by both methods outperforms random removal, confirming the meaningfulness of the approaches.

URL PDF HTML ☆

赞 0 踩 0

2605.29803 2026-05-29 cs.LG 版本更新

Gated Graph Attention Networks with Learnable Temperature

具有可学习温度的门控图注意力网络

Zhongtian Ma, Hao Wu, Yexin Zhang, Qiaosheng Zhang, Zhen Wang

发表机构 * School of Cybersecurity, Northwestern Polytechnical University（网络安全学院，西北工业大学）； Shanghai Artificial Intelligence Laboratory（上海人工智能实验室）

AI总结提出门控图注意力和可学习温度机制，通过过滤不可靠特征维度并动态调整注意力系数分布的锐度，提升图注意力网络在均匀和异质异配基准上的性能。

详情

AI中文摘要

图注意力网络通过数据相关的系数学习邻居的重要性，但标准层缺乏对不可靠特征维度的显式控制，并且使用固定的注意力系数分布锐度。本文针对常见的图注意力机制提出了门控图注意力和可学习温度。门控图注意力过滤特征或消息响应以减少不可靠维度的影响，而可学习温度动态调整注意力系数分布的锐度。在均匀和异质异配基准上的实验表明，所提出的变体一致地改进了相应的图注意力骨干网络，受控噪声研究进一步验证了它们在特征扰动下的行为。理论分析解释了这些结果，表明当只有部分特征坐标可靠时，门控提高了鲁棒性，而当全局噪声削弱节点特征的可区分性时，温度是有益的。

英文摘要

Graph attention networks learn neighbor importance through data-dependent coefficients, but standard layers lack explicit control over unreliable feature dimensions and use fixed sharpness of attention coefficient distributions. This paper proposes gated graph attention and learnable temperature for common graph attention mechanisms. Gated graph attention filters feature or message responses to reduce the influence of unreliable dimensions, while learnable temperature dynamically adjusts the sharpness of the attention coefficient distribution. Experiments on homogeneous and heterophilic heterogeneous benchmarks show that the proposed variants consistently improve the corresponding graph attention backbones, and controlled noise studies further verify their behavior under feature perturbations. Theoretical analysis explains these results by showing that gating improves robustness when only part of the feature coordinates are reliable, while temperature is beneficial when global noise weakens the discriminability of node features.

URL PDF HTML ☆

赞 0 踩 0

2605.29801 2026-05-29 cs.AI cs.CL cs.CR cs.CV cs.LG 版本更新

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security

AgentDoG 1.5：一种轻量级且可扩展的AI智能体安全与安保对齐框架

Dongrui Liu, Yu Li, Zhonghao Yang, Peng Wang, Guanxu Chen, Yuejin Xie, Qinghua Mao, Wanying Qu, Yanxu Zhu, Tianyi Zhou, Leitao Yuan, Zhijie Zheng, Qihao Lin, Yimin Wang, Haoyu Luo, Shuai Shao, Chen Qian, Qingyu Liu, Ling Tang, Ruiyang Qin, Qihan Ren, Junxiao Yang, Kun Wang, Zhiheng Xi, Linfeng Zhang, Ranjie Duan, Bo Zhang, Wenjie Wang, Wen Shen, Qiaosheng Zhang, Yan Teng, Chaochao Lu, Rui Mei, Man Li, Jialing Tao, Xi Lin, Tianhang Zheng, Yong Liu, Quanshi Zhang, Lei Zhu, Xingjun Ma, Junhua Liu, Hui Xue, Xiaoxiang Zuo, Xiangnan He, Chao Shen, Xianglong Liu, Minlie Huang, Jing Shao, Xia Hu

发表机构 * Shanghai Artificial Intelligence Laboratory（上海人工智能实验室）

AI总结针对开放世界智能体的新兴安全风险，提出一种轻量级可扩展的安全对齐框架，通过更新安全分类法、构建数据引擎并训练小模型（0.8B-8B参数），实现与闭源模型相当的性能，并降低部署开销两个数量级。

Comments 44 pages, 12 Figures, 9 Tables

详情

AI中文摘要

现代开放世界智能体（如OpenClaw）展现出强大的跨环境执行能力，但同时也引入了广泛的新安全风险源。同时，先进的前沿AI模型大幅降低了攻击门槛，使得当前的智能体对齐框架不足以应对实际部署。为了应对这些新兴威胁，我们提出了一种轻量级且可扩展的智能体安全对齐框架。具体而言，我们更新了智能体安全分类法，以涵盖来自Codex和OpenClaw执行场景的新兴风险。我们进一步构建了一个基于分类法指导的数据引擎，并采用影响函数净化，仅使用约1k样本训练轻量级AgentDoG 1.5变体（0.8B、2B、4B和8B参数），达到了与领先闭源模型（如GPT-5.4）相当的性能。基于AgentDoG 1.5，我们构建了一个高效的智能体安全SFT和RL训练环境，将Docker级环境的部署开销降低了两个数量级。最后，我们将AgentDoG 1.5部署为无需训练的在线护栏，用于实时安全审核。大量实验结果表明，AgentDoG 1.5在多样且复杂的交互式智能体场景中达到了最先进的性能。所有模型和数据集均已公开发布。

英文摘要

Modern open-world agents such as OpenClaw exhibit powerful cross-environment execution capabilities yet introduce broad new safety risk sources. Meanwhile, advanced frontier AI models drastically lower attack barriers, rendering current agent alignment frameworks inadequate for real-world deployment. To tackle these emerging threats, we propose a lightweight and scalable agent safety alignment framework. Specifically, we update the agent safety taxonomy to accommodate emergent risks from Codex and OpenClaw execution scenarios. We further build a taxonomy-guided data engine with influence-function purification to train lightweight AgentDoG 1.5 variants (0.8B, 2B, 4B, and 8B parameters) using only around 1k samples, achieving comparable performance with leading closed-source models (e.g., GPT-5.4). Based on AgentDoG 1.5, we construct a highly efficient agentic safety SFT and RL training environment, which reduces deployment overhead in Docker-level environments by two orders of magnitude. Finally, we deploy AgentDoG 1.5 as a training-free online guardrail for real-time safety moderation. Extensive experimental results indicate that AgentDoG 1.5 achieves state-of-the-art performance in diverse and complex interactive agentic scenarios. All models and datasets are openly released.

URL PDF HTML ☆

赞 0 踩 0

2605.29788 2026-05-29 cs.AI cs.LG 版本更新

Certified Policy Optimisation for Nested Causal Bandits via PAC-Bayes Risk

嵌套因果赌博机的认证策略优化：基于PAC-Bayes风险

Tim Woydt, Paul-David Zuercher

发表机构 * ProdAxon

AI总结本文提出嵌套因果汤普森采样（NCTS）算法，通过PAC-Bayes超额风险界对历史数据进行离线、任意时刻的部署策略认证，解决分层因果赌博机中的跨时间尺度因果耦合问题。

详情

AI中文摘要

关键序列决策很少是单时间尺度的：一个战略决策因果地塑造了每个后续战术选择所处的环境；标准赌博机和强化学习理论并未捕捉时间尺度之间的这种因果耦合。我们将问题类别形式化为嵌套上下文因果赌博机（NCCBs），这是一个分层SCM，其中每个层次的动作设置下一层次的上下文分布，并提出了嵌套因果汤普森采样（NCTS），该算法每轮抽取一个机制因子化的信念，并在其下递归地行动。我们的主要理论结果是一个因果PAC-Bayesian超额风险界，它仅从历史数据中认证任何候选部署策略，离线且任意时刻，回答了部署问题：我们能否在此处信任该智能体，风险如何？在分层SCM上的实验表明，相对于同一函数类上的匹配RFF-GP联合回归，因子化的SCM机制后验在外生分布偏移下零样本迁移显著更好，递归的元到内层提交在分布上显著优于联合提交替代方案，并且随着离线数据积累，认证显著收缩。结合这些结果，我们建立了渐进式认证交接，一种安全部署方法：每个时间尺度在收益可被认证时从传统控制器切换到NCTS，独立于其他时间尺度。

英文摘要

Critical sequential decisions are rarely single-timescale: a strategic decision causally shapes the context in which every subsequent tactical choice is made; standard bandit and reinforcement-learning theory does not capture this causal coupling between timescales. We formalise the problem class as Nested Contextual Causal Bandits (NCCBs), a hierarchical SCM where each level's action sets the next level's context distribution, and propose Nested Causal Thompson Sampling (NCTS), which draws one mechanism-factorised belief per episode and acts recursively under it. Our main theoretical result is a causal PAC-Bayesian excess-risk bound that certifies any candidate deployment policy from historic data alone, off-policy and anytime, answering the deployment question: can we trust this agent here, and at what risk? Experiments on a hierarchical SCM show that, against a matched RFF-GP joint regression on the same function class, the factorised SCM-mechanism posterior transfers significantly better zero-shot under exogenous distribution shifts, the recursive meta-to-inner commit significantly dominates the joint-commit alternative in distribution, and the certificate significantly contracts as offline data accumulates. Combining these results, we establish progressive certified handover, a safe-deployment method: each timescale flips from a legacy controller to NCTS when gains can be certified, independently of the others.

URL PDF HTML ☆

赞 0 踩 0

2605.29782 2026-05-29 cs.LG cs.AI cs.CL 版本更新

Hista and Numca: Estimate State Value Effectively for LLM Reinforcement Learning

Hista 和 Numca：为 LLM 强化学习有效估计状态值

Zizhe Chen, Jiqian Dong, Yizhou Tian, Garry Yang, Yongqiang Chen, Zhitang Chen, James Cheng

发表机构 * Department of Computer Science and Engineering, The Chinese University of Hong Kong（香港中文大学计算机科学与工程系）； Huawei Technologies Ltd（华为技术有限公司）

AI总结针对 LLM 强化学习中状态值估计不准确的问题，提出 Numca（利用数值跨度作为可分级里程碑）和 Hista（利用隐藏状态加权平均不连续轨迹及其回报）两种方法，显著提升估计精度和训练性能。

Comments Accepted at ICML 2026

详情

AI中文摘要

强化学习（RL）通过奖励信号直接优化模型行为来改进大型语言模型（LLMs）。虽然在经典RL中准确的状态值估计对于稳定训练至关重要，但在LLM后训练中这仍是一个未被充分探索的挑战。在这项工作中，我们引入了状态值估计基准（SVEB）来评估现有RL框架中的状态估计，并展示了像PPO这样的标准方法中的评论家会退化为粗糙的组平均基线。为了解决这个问题，我们提出了两种技术：Numca，它利用数值跨度作为可分级里程碑进行状态值估计；以及Hista，一个使用LLM的隐藏状态作为表示来加权平均不连续轨迹及其回报的框架。大量实验表明，这两种方法都能产生更准确的状态值估计，并在不同的RL算法和模型大小上提升训练性能，而不会产生显著的计算开销。

英文摘要

Reinforcement learning (RL) refines large language models (LLMs) by directly optimizing model behavior through reward signals. While accurate state value estimation is critical for stable training in classical RL, it remains an underexplored challenge in LLM post-training. In this work, we introduce the State Value Estimation Benchmark (SVEB) to assess state estimation within existing RL frameworks and show that critics in standard approaches like PPO collapse to a coarse group-average baseline. To address this, we propose two techniques: Numca, which leverages numerical spans as gradable milestones for state value estimation, and Hista, a framework that uses LLM's hidden states as representation to weighted average disjoint rollouts and their return. Extensive experiments demonstrate that both methods yield more accurate state value estimates and enhance training performance across different RL algorithms and model sizes without incurring significant computational overhead.

URL PDF HTML ☆

赞 0 踩 0

2605.29765 2026-05-29 cs.LG 版本更新

MMTM: Tri-Modal Topic Modeling for Long-Form Video via Similarity-Gated Fusion

MMTM: 基于相似性门控融合的长视频三模态主题建模

Ali Abusaleh, Bhuvanesh Verma, Alexander Mehler

发表机构 * Text Technology Lab (TTLab), Goethe University Frankfurt（文本技术实验室（TTLab），法兰克福歌德大学）

AI总结提出MMTM模块化流水线，通过相似性门控融合集成语音识别、音频和视觉嵌入及BERTopic聚类，在长视频主题发现中显著提升主题质量。

Comments Submitted to EMNLP 2026

2605.29748 2026-05-29 stat.ML cs.LG 版本更新

Instance-dependent Stochastic Lipschitz bandit

实例依赖的随机Lipschitz bandit

Marius Potfer, Vianney Perchet

发表机构 * Crest (Fairplay joint team)（Crest（Fairplay联合团队））； EDF R&D（EDF研发）； Criteo AI Lab（Criteo人工智能实验室）

AI总结针对Lipschitz bandit问题，提出一种基于水平集次优性间隙积分的算法，实现比传统缩放维度更优的实例依赖遗憾界。

详情

AI中文摘要

我们研究Lipschitz bandit问题，其中学习器通过带噪声的点评估在域$\mathcal{X} \subset [0,1]^d$上顺序最大化未知的Lipschitz函数$f$。现有的遗憾界要么是最坏情况的，缩放为$\tilde\Theta \left ( T^{d+1/d+2}\right )$，要么通过缩放维度$d_z$自适应，得到$\tilde\Theta \left ( T^{d_z+1/d_z+2}\right )$。然而，这种基于缩放的保证仅是部分实例依赖的，因为它们仅依赖于近最优水平集的渐近增长，未能捕捉$f$的更精细结构性质。我们提供了一种分析和算法，通过$f$在其水平集上的次优性间隙的积分来刻画遗憾。这产生了适应水平集局部增长（而不仅仅是其渐近行为）的遗憾界。作为推论，当最大化者集合的维度$d^\star>0$时，我们获得了阶为$\tilde{\mathcal{O}} \left ( T^{d_z+1 / \max(d_z,d^\star)+2}\right )$的改进自适应速率，在该情况下严格优于经典的缩放界。最后，我们将分析扩展到完全信息设置（Lipschitz专家），并展示了如何放宽一些正则性假设。

英文摘要

We study the Lipschitz bandit problem, where a learner sequentially maximizes an unknown Lipschitz function $f$ over a domain $\mathcal{X} \subset [0,1]^d$ using noisy pointwise evaluations. Existing regret bounds are either worst-case, scaling as $\tildeΘ \left ( T^{d+1/d+2}\right )$, or adaptive via the zooming dimension $d_z$, yielding $\tildeΘ \left ( T^{d_z+1/d_z+2}\right )$. However, such zooming-based guarantees are only partially instance-dependent, as they depend solely on the asymptotic growth of near-optimal level sets and fail to capture finer structural properties of $f$. We provide an analysis and an algorithm that characterizes the regret through integrals of the suboptimality gap of $f$ over its level sets. This yields regret bounds that adapt to the local growth of level sets, rather than only their asymptotic behavior. As a corollary, when the set of maximizers has dimension $d^\star>0$, we obtain improved adaptive rates of order $\tilde{\mathcal{O}} \left ( T^{d_z+1 / \max(d_z,d^\star)+2}\right )$ strictly improving over classical zooming bounds in this regime. Finally, we extend our analysis to the full-information setting (Lipschitz experts) and show how some of the regularity assumptions can be relaxed.

URL PDF HTML ☆

赞 0 踩 0

2605.29744 2026-05-29 cs.AI cs.CL cs.LG cs.MA 版本更新

Why Specialist Models Still Matter: A Heterogeneous Multi-Agent Paradigm for Medical Artificial Intelligence

为什么专家模型仍然重要：面向医学人工智能的异构多智能体范式

Yanan Wang, Shuaicong Hu, Jian Liu, Guohui Zhou, Aiguo Wang, Cuiwei Yang

发表机构 * Anthropic AI

AI总结提出HetMedAgent异构多智能体框架，通过冲突感知证据融合、不确定性驱动的临床医生干预触发和自适应阈值校准，实现通用大语言模型与领域专家模型的协同，在三个临床决策任务中验证了专家模型在模态特定分析中的不可替代价值。

Comments Accepted at ICML 2026. 12 pages main text, 16 pages appendix

详情

AI中文摘要

GPT和Claude等通用大语言模型在医疗保健领域的出色表现引发了一个关键问题：特定领域的医学专家模型是否会变得过时？我们认为，医学人工智能的未来不在于构建单一的医学基础模型，也不在于取代人类专业知识，而在于协调通用大语言模型、领域特定专家模型和临床医生之间的协作。我们提出HetMedAgent，一个异构医学多智能体框架，能够实现冲突感知证据融合、基于不确定性的临床医生干预触发和自适应阈值校准。在三个真实世界临床决策任务上的实验表明，通用大语言模型与领域特定专家模型之间的协同显著优于单独使用任一类型模型，验证了专家模型在模态特定分析中的不可替代价值。HetMedAgent代表了从构建医学大语言模型或基础模型向多智能体协作的转变，实现了通用推理能力与领域特定精度之间的平衡。

英文摘要

The impressive performance of generalist large language models (LLMs) such as GPT and Claude in healthcare raises a critical question: will domain-specific medical specialist models become obsolete? We argue that the future of medical artificial intelligence (AI) lies not in building monolithic medical foundation models, nor in replacing human expertise, but in orchestrating collaboration among generalist LLMs, domain-specific specialist models, and clinicians. We propose HetMedAgent, a heterogeneous medical multi-agent framework that enables conflict-aware evidence fusion, uncertainty-based clinician intervention triggering, and adaptive threshold calibration. Experiments on three real-world clinical decision-making tasks demonstrate that the synergy between generalist LLMs and domain-specific specialist models significantly outperforms using either type of model alone, validating the irreplaceable value of specialist models in modality-specific analysis. HetMedAgent represents a shift from building medical LLMs or foundation models to multi-agent collaboration, achieving a balance between general reasoning capabilities and domain-specific precision.

URL PDF HTML ☆

赞 0 踩 0

2605.29731 2026-05-29 cs.LG 版本更新

EMAG: Differentiable 4D Gaussian Mixture Splatting for EEG Spatial Super-Resolution

EMAG: 可微分的4D高斯混合喷溅用于EEG空间超分辨率

Alex Lazarovich, Ofir Itzhak Shahar, Gur Elkin, Ohad Ben-Shahar

AI总结提出EMAG框架，通过可微分的各向异性4D时空高斯混合模型，从稀疏低密度电极重建高密度EEG信号，实现空间超分辨率，并在三个基准上超越现有方法。

详情

AI中文摘要

高密度脑电图（HD-EEG）能够精细测量皮层活动，但需要昂贵的硬件和较长的设置时间，限制了其在临床和研究中的可及性。我们提出EMAG（EEG各向异性高斯混合），一个可微分的框架，通过将脑电源表示为各向异性4D时空高斯的混合，从稀疏的低密度（LD）电极子集重建HD-EEG信号。EMAG在球形脑网格的每个点上放置多个高斯的混合，每个高斯由完整的4x4精度矩阵参数化，从而实现各向异性的空间扩散以及空间和时间维度之间的显式耦合。前向模型通过电极位置处的可微分高斯场贡献渲染头皮EEG，从而无需显式源定位监督即可进行端到端训练。我们在三个公共EEG基准（Localize-MI、SEED和SEED-IV）上以2倍到8/16倍的超分辨率因子评估EMAG。在大多数超分辨率因子下，EMAG在三个标准基准（Localize-MI、SEED、SEED-IV）上优于当前最先进的EEG超分辨率方法。显式高斯参数化进一步实现了学习到的脑源配置的直接可视化和可解释性，可能为临床和神经科学应用（如源定位或生物标志物发现）开辟途径。

英文摘要

High-density electroencephalography (HD-EEG) enables fine-grained measurement of cortical activity but requires expensive hardware and lengthy setup times, limiting its clinical and research accessibility. We propose EMAG (EEG Mixture of Anisotropic Gaussians), a differentiable framework that reconstructs HD-EEG signals from a sparse subset of low-density (LD) electrodes by representing brain electrical sources as a mixture of anisotropic 4D space-time Gaussians. EMAG places a mixture of multiple Gaussians at each point of a spherical brain grid, each parameterized by a full 4 x 4 precision matrix, enabling anisotropic spatial spreads and explicit coupling between spatial and temporal dimensions. The forward model renders scalp EEG via differentiable Gaussian field contributions at electrode locations, enabling end-to-end training without explicit source localization supervision. We evaluate EMAG on three public EEG benchmarks (Localize-MI, SEED, and SEED-IV) at super-resolution factors of 2x through 8/16x. EMAG outperforms the current state-of-the-art EEG super-resolution method at most super-resolution factors on three standard benchmarks (Localize-MI, SEED, SEED-IV). The explicit Gaussian parameterization further enables direct visualization and interpretability of learned brain source configurations, potentially opening avenues for clinical and neuroscientific applications, such as source localization or biomarker discovery.

URL PDF HTML ☆

赞 0 踩 0

2605.29729 2026-05-29 cs.LG 版本更新

Realistic honeypot evaluations for scheming propensity

针对策划倾向的逼真蜜罐评估

Victoria Krakovna, David Lindner, Lewis Ho, Sebastian Farquhar, Rohin Shah

发表机构 * Google DeepMind（谷歌深Mind）

AI总结提出一种框架，通过在Google对齐研究代码库中设置编码任务作为蜜罐，测试模型在有机会时是否会追求工具性目标，实验表明Gemini模型在真实部署中不会主动策划，但在特定提示下会表现出策划或破坏行为。

2605.29727 2026-05-29 cs.LG 版本更新

Bastion: Budget-Aware Speculative Decoding with Tree-structured Block Diffusion Drafting

Bastion: 预算感知的树结构块扩散草稿投机解码

Soowon Oh, Nam Cao, Yujin Kim, Hojung Jung, Huzama Ahmad, Sangmin Bae, Se-Young Yun

发表机构 * KAIST AI（韩国科学技术院人工智能研究所）； Samsung Advanced Institute of Technology（三星先进技术研究所）

AI总结提出BASTION框架，通过动态构建查询相关的树结构平衡草稿质量与硬件约束，实现预算感知的投机解码，无需训练且保持目标模型分布，速度提升达6.61倍。

详情

AI中文摘要

块扩散草稿者最近作为投机解码的强大替代方案出现，通过在单个并行步骤中预测多个未来令牌分布。然而，由于这些并行预测是从位置边缘分布而非完全条件序列中采样，承诺单一贪婪路径往往无法捕捉目标模型的偏好轨迹。为解决此问题，我们提出BASTION，一种基于树的扩散草稿的预算感知投机解码框架。与依赖静态树拓扑的现有方法不同，BASTION通过平衡草稿质量与硬件约束动态构建查询相关的树。我们的框架整合了三个协同组件：(1) 接受代理，通过路径置信度估计期望接受长度；(2) 在线延迟估计器，校准硬件感知的屋顶线模型；(3) 自适应最佳优先扩展，在边际增益不再证明增量验证成本合理时停止树生长。BASTION无需训练，保持目标模型分布，且无需逐设置调优。在多种基准和GPU架构上，BASTION相比标准自回归解码实现高达6.61倍加速，优于最先进的块扩散基线39%。

英文摘要

Block-diffusion drafters have recently emerged as a powerful alternative for speculative decoding by predicting multiple future-token distributions in a single parallel step. However, since these parallel predictions are sampled from position-wise marginals rather than fully conditioned sequences, committing to a single greedy path often fails to capture the target model's preferred trajectory. To address this, we propose BASTION, a budget-aware speculative decoding framework with tree-based diffusion drafting. Unlike existing methods that rely on static tree topologies, BASTION dynamically constructs query-dependent trees by balancing draft quality against hardware constraints. Our framework integrates three synergistic components: (1) an acceptance surrogate that estimates expected accepted length via path confidence, (2) an online latency estimator that calibrates a hardware-aware roofline model, and (3) an adaptive best-first expansion that grows the tree until marginal gains no longer justify incremental verification costs. BASTION is training-free, preserves the target model's distribution, and requires no per-setting tuning. Across diverse benchmarks and GPU architectures, BASTION achieves up to a 6.61x speedup over standard autoregressive decoding, outperforming state-of-the-art block-diffusion baselines by 39%.

URL PDF HTML ☆

赞 0 踩 0

2605.29720 2026-05-29 cs.CV cs.LG 版本更新

Efficient, Validation-Free Intrinsic Quality Estimation for Large-Scale Face Recognition Datasets

面向大规模人脸识别数据集的高效、免验证的内在质量评估

Zhichao Chen, Yongle Zhao, Kaicheng Yang, Meng Yang, Yin Xie, Ziyong Feng

发表机构 * School of Cyber Science and Technology, University of Science and Technology of China（中国科学技术大学网络科学与技术学院）

AI总结提出一种无需训练的内在质量（IQ）指标，通过邻域一致性得分和全局表示子空间复杂度来估计人脸识别数据集生成高性能模型的潜力，实现快速数据集诊断与筛选。

Comments ICML 2026

2605.29713 2026-05-29 cs.LG cs.AI 版本更新

The Little Book of Generative AI Foundations: An Intuitive Mathematical Primer

生成式AI基础小书：直观数学入门

Tianhua Chen

发表机构 * School of Computing and Engineering（计算与工程学院）

AI总结本书通过推导导向的方式，从PCA到能量模型，系统介绍现代生成式人工智能的数学基础，旨在使生成建模结构更易理解。

Comments Preprint version, 178 pages. Comments and corrections are welcome

2605.29698 2026-05-29 cs.LG physics.chem-ph 版本更新

A Systematic Evaluation of Molecular Mixture Behavior Prediction

分子混合物行为预测的系统评估

Roel J. Leenhouts, Nathan K. Morgan, William Green, Jan G. Rittig, Florence H. Vermeire

发表机构 * KU Leuven（卢森堡大学）； MIT（麻省理工学院）； RWTH Aachen University（亚琛工业大学）

AI总结提出一个将混合物性质误差分解为纯组分和相互作用成分的评估框架，并基于七个匹配数据集发现绝对精度可能掩盖非理想混合行为的恢复能力。

详情

AI中文摘要

分子性质预测的机器学习主要集中在纯化合物上，尽管许多实际应用依赖于具有分子间相互作用的混合物。最近的工作扩大了混合物数据集的可用性，但评估仍然主要关注绝对精度。然而，混合物中的绝对误差将纯组分贡献与理想混合的偏差混为一谈。我们提出了一个评估框架，将混合物性质误差分解为纯化合物和相互作用（非理想）成分。该框架结合了泄漏感知分割协议、理想混合物基线和过量性质指标。为了支持可重复的基准测试，我们整理了七个匹配的纯和混合物物理化学性质数据集。在多个混合物性质任务和模型家族中，我们发现强绝对精度可能掩盖对非理想混合物行为的恢复能力，并且在严格分子分割下性能显著下降。这些结果将向未见分子的迁移识别为分子混合物机器学习中的核心挑战，并推动超越绝对精度的评估。

英文摘要

Machine learning for molecular property prediction has focused largely on pure compounds, even though many practical applications depend on mixtures with intermolecular interactions. Recent work has expanded the availability of mixture datasets, but evaluation still focuses mainly on absolute accuracy. However, absolute errors in mixtures conflate pure-component contributions with deviations from ideal mixing. We propose an evaluation framework that decomposes mixture-property error into pure-compound and interaction (non-ideal) components. The framework combines leakage-aware split protocols, ideal-mixture baselines, and excess-property metrics. To support reproducible benchmarking, we curate seven matched pure and mixture physicochemical property datasets. Across multiple mixture-property tasks and model families, we find that strong absolute accuracy can mask poor recovery of non-ideal mixture behavior, and that performance drops substantially under strict molecule splits. These results identify transfer to unseen molecules as a central challenge in molecular mixture machine learning and motivate evaluation beyond absolute accuracy alone.

URL PDF HTML ☆

赞 0 踩 0

2605.29695 2026-05-29 cs.AI cs.CE cs.LG math.PR 版本更新

FHRFormer: A Self-Supervised Masked Transformer Framework for Fetal Heart Rate Time-Series Inpainting and Forecasting

FHRFormer: 一种用于胎儿心率时间序列修复和预测的自监督掩码Transformer框架

Kjersti Engan, Neel Kanwal, Anita Yeconia, Ladislaus Blacy, Yuda Munyaw, Estomih Mduma, Hege Ersdal

发表机构 * University of Stavanger（斯塔万格大学）； Haydom Lutheran Hospital（海多姆路德医院）； Stavanger University Hospital（斯塔万格大学医院）

AI总结针对胎儿心率监测中信号丢失问题，提出基于掩码Transformer的自监督自编码器方法，通过捕获局部时间和频率成分来修复和预测缺失信号，具有鲁棒性并支持AI风险算法开发。

Comments Submitted to Frontiers in Digital Health. arXiv admin note: substantial text overlap with arXiv:2509.20852

详情

AI中文摘要

大约10%的新生儿出生时需要帮助才能开始呼吸，约5%需要通气支持。胎儿心率（FHR）监测在产前护理中评估胎儿健康状况方面起着关键作用，能够检测异常模式并支持及时产科干预以减轻分娩期间的胎儿风险。应用人工智能（AI）方法分析具有不同结局的连续FHR监测大数据集，可能为预测需要呼吸辅助或干预的风险提供新见解。可穿戴FHR监测仪的最新进展实现了在不影响母亲活动能力的情况下进行连续胎儿监测。然而，母亲运动期间的传感器移位以及胎儿或母亲位置的变化常常导致信号丢失，造成记录的FHR数据出现缺口。这种缺失数据限制了有意义信息的提取，并使基于AI的自动化分析复杂化。传统的缺失数据处理方法，如简单插值技术，往往无法保留信号的频谱特性。在本文中，我们提出了一种基于掩码Transformer的自编码器方法，通过捕获数据的局部时间和频率成分来重建缺失的FHR信号。所提出的方法在不同缺失数据时长下表现出鲁棒性，可用于信号修复和预测。该方法可回顾性地应用于研究数据集，以支持基于AI的风险算法开发。未来，该方法可集成到可穿戴FHR监测设备中，实现更早、更稳健的风险检测。

英文摘要

Approximately 10% of newborns require assistance to initiate breathing at birth, and around 5% need ventilation support. Fetal heart rate (FHR) monitoring plays a crucial role in assessing fetal well-being during prenatal care, enabling the detection of abnormal patterns and supporting timely obstetric interventions to mitigate fetal risks during labor. Applying artificial intelligence (AI) methods to analyze large datasets of continuous FHR monitoring episodes with diverse outcomes may offer novel insights into predicting the risk of needing breathing assistance or interventions. Recent advances in wearable FHR monitors have enabled continuous fetal monitoring without compromising maternal mobility. However, sensor displacement during maternal movement, as well as changes in fetal or maternal position, often lead to signal dropout, resulting in gaps in recorded FHR data. Such missing data limits the extraction of meaningful insights and complicates automated (AI-based) analysis. Traditional approaches to handling missing data, such as simple interpolation techniques, often fail to preserve the spectral characteristics of the signals. In this paper, we propose a masked transformer-based autoencoder approach to reconstruct missing FHR signals by capturing both local temporal and frequency components of the data. The proposed method demonstrates robustness across varying durations of missing data and can be used for signal inpainting and forecasting. The proposed approach can be applied retrospectively to research datasets to support the development of AI-based risk algorithms. In the future, the proposed method could be integrated into wearable FHR monitoring devices to achieve earlier and more robust risk detection.

URL PDF HTML ☆

赞 0 踩 0

2605.29693 2026-05-29 cs.LG cs.RO 版本更新

SRC的几何视角：学习用于稳定残差推理的表示

Vangelis P. Oikonomou

AI总结本文从几何角度分析稀疏表示分类（SRC）的残差排序稳定性，提出几何塑造目标以改善表示学习，并在多个数据集上验证了效果。

Comments 37 pages

详情

AI中文摘要

基于重构的推理通过比较类重构残差来分配类别；稀疏表示分类（SRC）是一个典型实例，其可靠性取决于学习表示的几何结构。我们采用严格的训练-推理分离：SRC仅作为固定的测试时规则使用，在训练过程中从不进行微分、展开或优化。在基于类条件张成子空间及其相关投影残差的张成子空间理想化中，我们通过残差间隔形式化残差排序稳定性，并刻画了可能在最坏方向破坏该间隔的几何障碍——张成子空间重叠、支配以及通过小主角产生的近重叠。这一张成子空间理论是首要的：它指定了理想化残差族何时良好分离，并为实际残差近似（如OMP）提供了条件性的求解器级解释，只要它们接近张成子空间级别的残差排序。在显式的覆盖和分离假设下，我们推导了（理想化）残差间隔的定量下界。在这些目标的指导下，我们提出了几何塑造目标，这些目标促进掩蔽的类内自表达性，抑制跨类重构路径和类间张成子空间对齐，并防止坍塌——而在训练过程中不调用SRC残差或预测。在图像（COIL-100）、文本（TREC）和EEG连接性上的实验，在相同的固定SRC/OMP推理下评估所有表示，并报告残差间隔和几何诊断；交叉熵仅作为相同评估协议下的参考几何包含在内。

英文摘要

Reconstruction-based inference assigns a class by comparing class-wise reconstruction residuals; Sparse Representation Classification (SRC) is a canonical instance whose reliability depends on the geometry of the learned representation. We adopt a strict training-inference separation: SRC is used only as a fixed test-time rule and is never differentiated, unrolled, or optimized during training. In a span-level idealization based on class-conditional spans and their associated projection residuals, we formalize residual-ordering stability through a residual margin and characterize geometric obstructions -- span overlap, dominance, and near-overlap via small principal angles -- that can collapse this margin in worst-case directions. This span-level theory is primary: it specifies when the idealized residual family is well-separated, and it provides a conditional solver-level interpretation for practical residual approximations (e.g., OMP) insofar as they remain close to the span-level residual ordering. Under explicit coverage and separation assumptions, we derive a quantitative lower bound on the (idealized) residual margin. Guided by these targets, we propose geometry-shaping objectives that promote masked within-class self-expressiveness, discourage cross-class reconstruction pathways and inter-class span alignment, and prevent collapse -- without invoking SRC residuals or predictions during training. Experiments on images (COIL-100), text (TREC), and EEG connectivity evaluate all representations under identical fixed SRC/OMP inference and report residual margins and geometric diagnostics; cross-entropy is included only as a reference geometry under the same evaluation protocol.

URL PDF HTML ☆

赞 0 踩 0

2605.29664 2026-05-29 cs.DC cs.LG 版本更新

AMDP: Asynchronous Multi-Directional Pipeline Parallelism for Large-Scale Models Training

AMDP：面向大规模模型训练的异步多方向流水线并行

Ling Chen, Houming Wu, Wenjie Yu

发表机构 * State Key Laboratory of Blockchain and Data Security, Zhejiang University, Hangzhou, China（区块链与数据安全国家重点实验室，浙江大学，杭州，中国）； College of Computer Science and Technology, Zhejiang University, Hangzhou, China（计算机科学与技术学院，浙江大学，杭州，中国）

AI总结针对异步流水线并行中参数不匹配导致收敛退化的问题，提出AMDP方法，通过限制流水线第一阶段处理小批量数量、启动多条并发流水线并自适应调整数量、以及跨小批量累积梯度后单次更新，在保持高利用率的同时加速训练并保证收敛。

Comments Accepted by ICML 2026, 9 pages, and 8 figures

详情

AI中文摘要

流水线并行对于大规模模型训练至关重要，但现有的异步方法常因前向和反向传播之间的参数不匹配而损害收敛性。我们提出异步多方向流水线并行（AMDP）来缓解此问题，同时保持高利用率。AMDP限制每个流水线的第一阶段在反向传播前最多处理两个小批量，从而限制了前向和反向传播之间的参数更新次数。为减轻由此产生的流水线气泡，AMDP启动多条并发流水线，并根据流水线深度自适应调整其数量。此外，AMDP跨小批量累积梯度并在一次更新中应用，确保只有有限数量的小批量经历参数不匹配，且限制在一个优化步骤内。在GPT和BERT风格模型上的实验表明，AMDP在保持收敛的同时显著加速了训练。

英文摘要

Pipeline parallelism is essential for large-scale model training, but existing asynchronous approaches often degrade convergence due to parameter mismatch between forward and backward passes. We propose Asynchronous Multi-Directional Pipeline parallelism (AMDP) to mitigate this issue while sustaining high utilization. AMDP limits the first stage of each pipeline to process at most two minibatches before backpropagation, bounding the number of parameter updates between forward and backward passes. To alleviate the resulting pipeline bubbles, AMDP launches multiple concurrent pipelines and adapts their number according to pipeline depth. In addition, AMDP accumulates gradients across minibatches and applies them in a single update, ensuring that only a bounded number of minibatches experience parameter mismatch, limited to within one optimization step. Experiments on GPT- and BERT-style models demonstrate that AMDP significantly accelerates training while preserving convergence.

URL PDF HTML ☆

赞 0 踩 0

2605.29659 2026-05-29 cs.LG cs.AI cs.CL 版本更新

Opir: Efficient Multi-Task Safety Classification for Toxicity, Jailbreaks, Hate Speech, and Harmful Content

Opir：针对毒性、越狱、仇恨言论和有害内容的高效多任务安全分类

Ihor Stepanov, Aleksandr Smechov

发表机构 * Knowledgator ； Wordcab

AI总结本文提出基于GLiClass架构的Opir系列编码器护栏模型，通过多任务学习实现二进制安全/不安全分类、多标签毒性分类、越狱分类和零样本不安全提示与响应分类，在12项安全分类任务和17项类别任务上与现有护栏系统竞争，同时部署开销更小。

Comments 23 pages, 4 figures, 9 tables

详情

AI中文摘要

大型语言模型（LLM）应用的实时安全过滤需要能够检测不安全提示、有毒语言、越狱尝试和不安全响应的分类器，且不能像大型护栏模型那样成本高昂，同时要能区分良性的敏感文本与真正隐蔽的有害内容。在本文中，我们介绍了Opir，一个基于GLiClass架构的编码器护栏模型系列。Opir包括用于二进制安全/不安全分类、多标签毒性分类、越狱分类以及零样本不安全提示和响应分类的多任务模型。我们还发布了专门用于二进制安全/不安全分类的边缘变体，参数少于1亿。这些模型在一个三级分类体系上训练，该体系包含16个顶层标签、126个中层标签和854个叶标签，共996个类别。Opir的训练数据结合了基于分类体系的不安全提示、对抗性挖掘的难负例、良性安全保持示例、生成的响应示例、多语言翻译以及Aegis2和WildGuard训练子集的部分内容。我们还开源了一个评估工具，支持GLiClass和GLiNER2后端以及基于解码器的模型，涵盖二进制安全分类、多标签分类、毒性、越狱检测、提示安全、响应安全、响应拒绝以及跨公共基准系列的提示子类别视图。在与八个当代护栏系统（包括基于GLiNER2和生成式护栏模型）的扩展比较中，涵盖12项安全分类任务和17项类别任务，Opir变体在大多数基准数据集上与最强的开源基线模型竞争或领先，同时部署规模显著更小。

英文摘要

Real-time safety filtering for large language model (LLM) applications requires classifiers that can detect unsafe prompts, toxic language, jailbreak attempts, and unsafe responses without the cost profile of large guardrail models, and that can distinguish benign sensitive text from genuinely covert harmful content. In this paper, we introduce Opir, a family of encoder-based guardrail models built on the GLiClass architecture. Opir includes multi-task models for binary safe/unsafe classification, multi-label toxicity classification, jailbreak classification, and zero-shot unsafe prompt and response categorization. We also release edge variants with fewer than 100M parameters dedicated to binary safe/unsafe categorization. The models are trained on a three-level taxonomy containing 996 categories across 16 top-level labels, 126 mid-level labels, and 854 leaf labels. Opir's training data combines taxonomy-grounded unsafe prompts, adversarially mined hard negatives, benign safety-preserving examples, generated response examples, multilingual translations, and portions of the Aegis2 and WildGuard training subsets. We also open-sourced an evaluation harness that supports GLiClass and GLiNER2 backends as well as decoder-based models, and covers binary safety classification, multi-label categorization, toxicity, jailbreak detection, prompt safety, response safety, response refusal, and prompt subcategory views across public benchmark families. Across an expanded comparison spanning 12 safety-classification tasks and 17 category tasks against eight contemporary guardrail systems -- including both GLiNER2-based and generative guardrail models -- Opir variants are competitive on or ahead of the strongest open-weight baselines on the majority of benchmark datasets while operating with a substantially smaller deployment footprint.

URL PDF HTML ☆

赞 0 踩 0

2605.29645 2026-05-29 cs.LG cs.AI stat.ML 版本更新

The Sample Complexity of Multiclass and Sparse Contextual Bandits

多类别和稀疏上下文赌博机的样本复杂度

Liad Erez, Fan Chen, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran, Alexander Rakhlin

发表机构 * Tel Aviv University（特拉维夫大学）； Massachusetts Institute of Technology（麻省理工学院）； Google Research Tel Aviv（谷歌研究特拉维夫）； Technion—Israel Institute of Technology（技术学院—以色列理工学院）

AI总结针对随机i.i.d.上下文赌博机，提出基于决策估计系数和低方差探索的算法，在稀疏奖励下实现接近最优的样本复杂度，并匹配下界。

详情

AI中文摘要

我们研究随机i.i.d.设置下的上下文赌博机，其中学习器观察来自未知分布的上下文，从有限集合$A$中选择动作，并旨在基于赌博机反馈从给定类别中识别近似最优策略。受零一奖励的赌博机多类别分类启发，我们关注\emph{$s$-稀疏}设置，其中对于每个上下文，奖励向量的$L_1$范数至多为$s \ll |A|$。我们的主要结果是设计算法，以高概率输出一个相对于策略类$Π$的$ε$-最优策略，使用$ ilde{O} ((s/ε^2 + |A|/ε)\log |Π|/δ)$个样本。我们将此界推广到一般Natarajan类，并补充了匹配的下界（对数因子内），从而缩小了先前工作（Erez等人，2024, 2025）留下的巨大差距，后者额外增加了$Θ(|A|^9)$依赖。我们通过两种互补方法获得这些结果。首先，我们从具有结构化观测的上下文决策角度分析上下文赌博机，设计了一种探索-优化算法，其样本复杂度由\emph{决策估计系数}（DEC；Foster等人，2021, 2022）控制。我们证明，在$s$-稀疏奖励下，诱导的模型类具有随$s$缩放的尖锐DEC界，直接产生最优速率。由于这种方法主要是信息论性的，并涉及求解复杂的min-max优化问题，我们还开发了第二种更专门的算法方法，基于低方差探索技术。这种方法产生了具体、易处理的算法，并自然地扩展到上下文组合半赌博机，为赌博机多类别列表分类提供了改进的样本复杂度保证。

英文摘要

We study contextual bandits in the stochastic i.i.d.\ setting, where a learner observes contexts drawn from an unknown distribution, selects actions from a finite set $A$, and aims to identify an approximately optimal policy from a given class based on bandit feedback. Motivated by bandit multiclass classification with zero-one rewards, we focus on the \emph{$s$-sparse} setting in which, for every context, the reward vector has $L_1$-norm at most $s \ll |A|$. Our main result is the design of algorithms that, with high probability, output an $ε$-optimal policy compared to policy class $Π$ using $\tilde{O} ((s/ε^2 + |A|/ε)\log |Π|/δ)$ samples. We extend this bound to general Natarajan classes and complement it with a matching lower bound (up to logarithmic factors), thereby closing a substantial gap left by prior work (Erez et al., 2024, 2025), which incurred an additional $Θ(|A|^9)$ dependence. We obtain these results via two complementary approaches. First, we analyze contextual bandits through the lens of contextual decision making with structured observations, designing an exploration-by-optimization algorithm whose sample complexity is governed by the \emph{decision-estimation coefficient} (DEC; Foster et al., 2021, 2022). We show that, with $s$-sparse rewards, the induced model class admits a sharp DEC bound that scales with $s$ and directly yields the optimal rate. Since this approach is largely information-theoretic and involves solving complex min-max optimization problems, we also develop a second, more specialized algorithmic method based on a low-variance exploration technique. This approach leads to concrete, tractable algorithms and naturally extends to contextual combinatorial semi-bandits, leading to improved sample complexity guarantees for bandit multiclass list classification.

URL PDF HTML ☆

赞 0 踩 0

2605.29642 2026-05-29 stat.ML cs.IT cs.LG math.IT 版本更新

COMET：音频-文本多模态对比嵌入中模态间隙的概念空间剖析

Yonggang Zhu, Liting Gao, Aidong Men, Wenwu Wang

发表机构 * School of Artificial Intelligence, Beijing University of Posts and Telecommunications（北京邮电大学人工智能学院）； Centre for Vision, Speech, and Signal Processing (CVSSP), University of Surrey（Surrey 大学视觉、语音和信号处理中心）

AI总结提出COMET框架，通过PLS-SVD分解揭示CLAP模型中模态间隙主要由少数共享概念轴贡献，并基于谱截断方法无训练地缓解间隙，实现零样本音频字幕接近全监督性能。

详情

AI中文摘要

对比语言-音频预训练（CLAP）模型广泛用于音频理解，并在许多零样本应用中支持模态无关的条件交换。然而，其性能受到音频和文本嵌入之间模态间隙的严重影响。现有解释主要将此间隙归因于锥体效应，将其视为均值嵌入之间的偏移，但仅纠正均值只能带来有限的改进。其他假设，如信息不平衡和维度坍缩，也被提出，但仍未得到充分验证，并且在音频领域尚未被深入研究。同时，一些工作尝试将多模态对比嵌入分解为可解释的概念，但没有任何工作从概念分解的角度显式分析模态间隙。在这项工作中，我们引入了COMET（基于PLS-SVD变换的概念空间组织与模态间隙解释），这是一个新颖的用于CLAP的偏最小二乘奇异值分解（PLS-SVD）框架，揭示了模态间隙的更广泛视角。我们的框架揭示，只有一小部分可解释的轴（捕捉共享概念）对相似度计算有显著贡献，并且均值分量仅部分代表模态间隙。基于这一见解，我们提出了一种简单的谱截断方法，以无训练的方式缓解模态间隙。该方法使得零样本音频字幕通过条件交换接近全监督性能，无需大型辅助记忆库或昂贵计算。同时，它在保持检索和音频字幕任务强性能的同时，实现了显著的嵌入维度缩减。

英文摘要

Contrastive Language-Audio Pretraining (CLAP) models are widely used for audio understanding and support modality-agnostic condition swapping in many zero-shot applications. However, their performance is heavily affected by the modality gap between audio and text embeddings. Existing explanations mainly attribute this gap to the cone effect, treating it as a shift between mean embeddings, yet correcting the mean alone yields only limited improvements. Alternative hypotheses, such as information imbalance and dimensionality collapse, have also been proposed, but they remain insufficiently verified and have not been thoroughly studied in the audio domain. Meanwhile, several works attempt to decompose multimodal contrastive embeddings into interpretable concepts, but none explicitly analyze the modality gap from the perspective of concept decomposition. In this work, we introduce COMET (Concept space Organization and Modality gap Explanation with PLS-SVD Transformation), a novel partial least squares singular value decomposition (PLS-SVD) framework for CLAP that unveils a broader perspective of the modality gap. Our framework reveals that only a small, interpretable subset of axes, which captures shared concepts, contributes substantially to similarity computation, and that the mean component represents only partially the modality gap. Building on this insight, we propose a simple spectral truncation method that mitigates the modality gap in a training-free manner. The method enables zero-shot audio captioning with condition swapping to approach fully supervised performance, without requiring large auxiliary memory banks or expensive computation. At the same time, it achieves substantial embedding dimensionality reduction while preserving strong performance on retrieval and audio captioning tasks.

URL PDF HTML ☆

赞 0 踩 0

2605.29622 2026-05-29 cs.LG physics.chem-ph 版本更新

MōLe-Λ: Learning the Coupled-Cluster Response State for Energies, Gradients, and Properties

MōLe-Λ: 学习耦合簇响应态以获取能量、梯度和性质

Andreas Burger, Luca Thiede, Abdulrahman Aldossary, Jorge A. Campos-Gonzalez-Angulo, Alex Zook, Jérôme Florian Gonthier, Alán Aspuru-Guzik

发表机构 * University of Toronto（多伦多大学）； Vector Institute for Artificial Intelligence（人工智能向量研究所）； NVIDIA（英伟达）； Canadian Institute for Advanced Research (CIFAR)（加拿大高级研究研究院）

AI总结提出MōLe-Λ模型，通过联合学习左右手振幅预测耦合簇响应态，高效计算能量、梯度及多类分子性质。

Comments ICML 2026 AI4Physics

详情

AI中文摘要

耦合簇理论常被视为量子化学的金标准，但其高计算成本限制了准确能量、力和响应性质的常规获取。虽然右手$T$-振幅决定了相关波函数，但许多实际重要的可观测量还需要左手$Λ$-振幅。我们引入MōLe-$Λ$，它是分子轨道学习（MōLe）的扩展，通过从局域化的Hartree-Fock分子轨道联合学习右手振幅$(T_1,T_2)$和左手振幅$(Λ_1,Λ_2)$，预测完整的基态耦合簇单双激发（CCSD）响应态。在架构上，MōLe-$Λ$扩展了MōLe，增加了$Λ_1$和$Λ_2$读出模块，这些模块镜像了$T_1$和$T_2$头的对称性约束，同时保留了原始的等变轨道编码器、奇符号等变解码、局域性和大小广延性。所得模型能够提供准确的CC级能量和力，同时恢复偶极矩、四极矩、极化率、电子密度以及双电子可观测量如对密度。我们表明，MōLe-$Λ$进一步扩展了MōLe相对于完整CCSD的速度优势，同时大幅扩展了可访问的性质，为相关量子化学的波函数级替代模型提供了途径。

英文摘要

Coupled-cluster (CC) theory is often considered the gold standard of quantum chemistry, but its high computational cost limits routine access to accurate energies, forces and response properties. While the right-hand $T$-amplitudes determine the correlated wavefunction, many practically important observables additionally require the left-hand $Λ$-amplitudes. We introduce MōLe-$Λ$, an extension of Molecular Orbital Learning (MōLe) that predicts the full ground-state coupled-cluster singles and doubles (CCSD) response state by jointly learning right-hand amplitudes $(T_1,T_2)$ and left-hand amplitudes $(Λ_1,Λ_2)$ from localized Hartree--Fock molecular orbitals. Architecturally, MōLe-$Λ$ extends MōLe with $Λ_1$ and $Λ_2$ readouts that mirror the symmetry constraints of the $T_1$ and $T_2$ heads, while preserving the original equivariant orbital encoder, odd sign-equivariant decoding, locality and size-extensivity. The resulting model yields accurate CC-quality energies and forces, while simultaneously recovering dipoles, quadrupoles, polarizabilities, the electron density, and 2-electron observables such as the pair density. We show that MōLe-$Λ$ further extends the speed advantage of MōLe over full CCSD while substantially expanding the accessible properties, providing a route to wavefunction-level surrogate models for correlated quantum chemistry.

URL PDF HTML ☆

赞 0 踩 0

2605.29610 2026-05-29 cs.CV cs.AI cs.LG 版本更新

Learning Context-Conditioned Predicate Semantics via Prototype Feedback

通过原型反馈学习上下文条件谓词语义

NamGyu Jung, Chang Choi

发表机构 * Department of Computer Engineering, Gachon University, Seongnam, Republic of Korea（韩国成仁市加德满都大学计算机工程系）

AI总结提出AlignG方法，利用原型反馈从图像关系候选中推断上下文条件谓词语义并调整关系表示，在VG-150和GQA-200上分别提升SGDet的F@100指标1.4和2.7。

Comments Accepted at ICML 2026. Code: https://github.com/Namgyu97/AlignG-SGG.pytorch

详情

AI中文摘要

在场景图生成中，一个核心挑战是建模多义谓词，其含义随上下文变化。先前的方法通过将谓词分解为多个静态原型或检索语义相似的示例来解决此问题。然而，这些策略保持谓词表示静态，无法重新组织语义以反映图像特定的证据，导致在模糊上下文中出现系统性混淆。我们提出AlignG，通过原型反馈学习上下文条件谓词语义。AlignG从每幅图像中的关系候选中推断上下文条件谓词语义，并将调整后的语义反馈回来以重新校准关系表示。学习目标将此适应锚定到全局语义中心，防止语义漂移，同时当场景提供一致的关系线索时仍允许选择性重组。在VG-150和GQA-200上的实验表明，在SGDet下，F@100指标分别提升了+1.4和+2.7，优于最先进的基线。我们进一步可视化每幅图像的原型相似性变化，并观察到一致的上下文相关重组，其中原型根据场景证据选择性地合并或分离谓词。代码可在https://github.com/Namgyu97/AlignG-SGG.pytorch获取。

英文摘要

In scene graph generation, a central challenge is modeling polysemous predicates whose meanings shift across contexts. Prior approaches address this issue by decomposing predicates into multiple static prototypes or retrieving semantically similar exemplars. However, these strategies keep predicate representations static and cannot reorganize semantics to reflect image-specific evidence, leading to systematic confusions in ambiguous contexts. We propose AlignG, which learns context-conditioned predicate semantics via prototype feedback. AlignG infers context-conditioned predicate semantics from the relation candidates within each image and feeds the adapted semantics back to recalibrate relation representations. The learning objective anchors this adaptation to global semantic centers, preventing semantic drift while still allowing selective reorganization when the scene provides consistent relational cues. Experiments on VG-150 and GQA-200 show consistent improvements over state-of-the-art baselines, with F@100 improvements of +1.4 on VG-150 and +2.7 on GQA-200 under SGDet. We further visualize per-image prototype similarity shifts and observe coherent context-dependent reorganization where prototypes selectively merge or separate predicates according to scene evidence. The code is available at https://github.com/Namgyu97/AlignG-SGG.pytorch.

URL PDF HTML ☆

赞 0 踩 0

2605.29607 2026-05-29 cs.LG 版本更新

Cluster-Level Attention-Guided Parallel Decoding for Masked Diffusion Language Models

掩码扩散语言模型的簇级注意力引导并行解码

Heqiang Qi, Wei Huang, Mingyuan Bai, Xiangming Meng

发表机构 * Zhejiang University（浙江大学）； RIKEN Center for Advanced Intelligence Project（日本理化学研究院先进智能项目中心）； The Institute of Statistical Mathematics（统计数学研究所）； Agency for Science, Technology and Research (A ⋆ \star STAR)（科技研究局（A ⋆ STAR））

AI总结提出CLAD方法，通过将相邻高置信度token聚合成簇，并利用自注意力图估计簇间依赖，实现掩码扩散语言模型的训练无关簇级并行解码，在保持任务精度的同时获得1.77-8.47倍加速。

详情

AI中文摘要

掩码扩散语言模型（MDLMs）通过在每个去噪步骤预测所有掩码位置来实现并行解码，然而现有的无训练采样器通常以token级粒度决定哪些位置被提交。我们重新审视这一粒度，并观察到可靠预测通常表现为连续的置信度跨度，这表明并行提交的单位可以大于单个token。我们首先将相邻的高置信度候选分组为置信度诱导簇（CICs），作为跨度级更新单元。然后，我们利用同一前向传递的自注意力图来估计簇间依赖关系，从而实现对相互兼容的CICs进行冲突感知选择以进行并行提交。这产生了CLAD（簇级注意力引导解码），一种用于MDLMs的无训练簇级解码器。在LLaDA和Dream模型系列上的四个推理和代码生成基准测试中，CLAD在大多数设置下实现了1.77倍至8.47倍的速度提升，同时保持广泛可比的任务精度。

英文摘要

Masked diffusion language models (MDLMs) enable parallel decoding by predicting all masked positions at each denoising step, yet existing training-free samplers usually decide which positions to commit at token-level granularity. We revisit this granularity and observe that reliable predictions often emerge as contiguous high-confidence spans, suggesting that the unit of parallel commitment can be larger than a single token. We first group adjacent high-confidence candidates into confidence-induced clusters (CICs) as span-level update units. We then use self-attention maps from the same forward pass to estimate inter-cluster dependencies, enabling conflict-aware selection of mutually compatible CICs for parallel commitment. This yields CLAD (Cluster-Level Attention-Guided Decoding), a training-free cluster-level decoder for MDLMs. Experiments on LLaDA and Dream model families across four reasoning and code-generation benchmarks show that CLAD achieves 1.77x--8.47x speedups over Vanilla decoding while maintaining broadly comparable task accuracy in most settings.

URL PDF HTML ☆

赞 0 踩 0

2605.29601 2026-05-29 cs.CL cs.AI cs.LG 版本更新

Training Deliberative Monitors for Black-Box Scheming Detection

训练审慎监控器用于黑箱策划检测

Aditya Sinha, Akshat Naik, Victor Gillioz, Simon Storf, Kilian Merkelbach, Rich Barton-Cooper, Axel Højmark, Marius Hobbhahn

发表机构 * Independent（独立）； MATS Research（MATS研究）； Astra Fellowship ； Apollo Research（Apollo研究）

AI总结提出一种基于行动轨迹的审慎监控方法，通过蒸馏前沿模型的推理过程训练开源模型，以低成本高精度检测智能体的策划与破坏行为。

详情

AI中文摘要

随着自主智能体在执行现实任务方面变得愈发强大，区分策划行为与良性任务追求可能成为AI控制的核心问题。现有监控器通常依赖思维链访问或内部激活，或使用提示的前沿模型，这些在部署中可能不可用、不可靠或成本高昂。在本工作中，我们研究仅基于行动的审慎监控器：较小的开源模型，经过训练可从智能体轨迹中检测策划与破坏行为，而无需访问被监控智能体的推理或模型内部。我们的方法受审慎对齐启发，使用策划规范从前沿教师模型中引出结构化推理，通过独立的评判器进行过滤，并通过监督微调和强化学习将最高质量的推理蒸馏到开源监控器中。我们在五个数据集上训练，并在六个分布外智能体失调基准上评估。我们表明，将我们的方法应用于Qwen3.5-27B，其性能优于所有低成本前沿模型作为提示监控器（Gemini 3.1 Flash-Lite、GPT-5.4 Nano和Claude Haiku 4.5）以及Gemini 2.5 Pro，同时实现了更低的边际推理成本（每1000次评估的token计费美元）。更强的提示前沿监控器（Gemini 3.1 Pro、GPT-5.4、Claude Sonnet 4.6和Claude Opus 4.6）实现了更高的性能，但边际推理成本大约高出16-34倍。我们训练的多个监控器在我们评估的监控器中位于经验成本-性能帕累托前沿，为提示前沿模型提供了实用的低成本、低误报率替代方案。

英文摘要

As autonomous agents become more capable of performing real-world tasks, distinguishing scheming behavior from benign task pursuit may become a central AI control problem. Existing monitors often rely on chain-of-thought access or internal activations, or use prompted frontier models, all of which can be unavailable, unreliable or expensive in deployment. In this work, we study action-only deliberative monitors: smaller open-weight models trained to detect scheming and sabotage from agentic trajectories without accessing the monitored agent's reasoning or model internals. Our method, inspired by deliberative alignment, uses a scheming specification to elicit structured rationales from a frontier teacher, filters them with a separate judge, and distills the highest-quality rationales into open-weight monitors with supervised fine-tuning and reinforcement learning. We train on five datasets, and evaluate across six out-of-distribution agentic misalignment benchmarks. We show that applying our method to Qwen3.5-27B yields higher performance than all low-cost frontier models as prompted monitors (Gemini 3.1 Flash-Lite, GPT-5.4 Nano, and Claude Haiku 4.5) and than Gemini 2.5 Pro, while also achieving lower marginal inference cost (token-metered USD per 1,000 evaluations). Stronger prompted frontier monitors (Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6, and Claude Opus 4.6) achieve higher performance but at roughly $16$--$34\times$ higher marginal inference cost. Several of our trained monitors are positioned on the empirical cost--performance Pareto frontier among the monitors we evaluate, providing practical low-cost, low-FPR alternatives to prompted frontier models.

URL PDF HTML ☆

赞 0 踩 0

2605.29587 2026-05-29 q-bio.QM cs.LG 版本更新

FPLIER: Federated Pathway-Level Information Extractor

FPLIER：联邦通路级信息提取器

Daniele Malpetti, Christian Berchtold, Francesco Gualdi, Marco Scutari, Laura Azzimonti, Francesca Mangili

发表机构 * Dalle Molle Institute for Artificial Intelligence (IDSIA)（达勒莫尔人工智能研究所（IDSIA））； USI-SUPSI ； Swiss Institute of Bioinformatics (SIB)（瑞士生物信息学研究所（SIB））

AI总结提出联邦学习框架FPLIER，通过安全聚合实现分布式基因表达数据上的通路级因子分解，并证明隐私风险由训练表达矩阵的秩决定。

Comments Accepted for publication at the ACM BCB '26 conference

详情

DOI: 10.1145/3807503.3819364

AI中文摘要

在转录组学中，通路级信息提取器（PLIER）等基因集感知因子分解方法在大型异质性表达数据集上训练时效果最佳。然而，由于隐私和治理限制，许多临床相关队列无法合并为单个数据集。我们提出FPLIER，这是PLIER的联邦扩展，能够在多个数据持有者之间进行分布式训练，同时整合公开可用数据集。通过安全聚合，FPLIER产生的训练更新在代数上等价于集中式池化数据方法，同时保持表达数据的本地性。我们在两个模拟联盟（来自K-CLIER和MultiPLIER研究）的多个场景中评估FPLIER，并展示其稳定收敛。我们进一步对针对中间训练统计量和发布模型的成员推断攻击进行了系统分析。结果表明，隐私风险由训练表达矩阵的秩决定。整合公开数据或降低数据维度会增加该秩，使系统趋向满秩状态，在此状态下训练样本与非训练样本对攻击者而言难以区分，成员推断性能接近随机猜测。

英文摘要

In transcriptomics, gene-set-aware factorization methods such as the Pathway Level Information Extractor (PLIER) are most effective when trained on large, heterogeneous expression compendia. Yet, many clinically relevant cohorts cannot be pooled into a single dataset due to privacy and governance constraints. We present FPLIER, a federated extension of PLIER that enables distributed training across multiple data holders while incorporating publicly available datasets. Through secure aggregation, FPLIER produces training updates algebraically equivalent to those of a centralized pooled-data approach while keeping expression data local. We evaluate FPLIER across multiple scenarios in two simulated consortia (from the K-CLIER and MultiPLIER studies) and demonstrate stable convergence. We further conduct a systematic analysis of membership inference attacks targeting both intermediate training statistics and the released model. Our results show that privacy risk is governed by the rank of the training expression matrix. Incorporating public data or reducing data dimensionality increases this rank, moving the system toward a full-rank regime in which training and non-training samples become indistinguishable to the attacker, and membership-inference performance approaches random guessing.

URL PDF HTML ☆

赞 0 踩 0

2605.28711 2026-05-29 cs.LG 版本更新

Stage-wise Distortion-Perception Traversal in Zero-shot Inverse Problems with Diffusion Models

基于扩散模型的零样本逆问题中的逐阶段失真-感知遍历

Jiawei Zhang, Ziyuan Liu, Leon Yan, Zhenyu Xiao, Yuantao Gu

发表机构 * Shenzhen International Graduate School, Tsinghua University, Shenzhen, China（清华大学深圳国际研究生院）； Department of Electronic Engineering, Tsinghua University , Beijing, China（清华大学电子工程系）

AI总结提出一种逐阶段框架MAP-RPS，通过MAP估计和重噪声后验采样实现单扩散模型下的失真-感知权衡遍历，并扩展至潜空间LMAP-RPS以提升适用性。

Comments Accepted by ICML 2026

详情

AI中文摘要

失真-感知（D-P）权衡是贝叶斯逆问题的一个基本现象，它刻画了失真性能与感知质量之间的内在矛盾。在推理时实现D-P权衡的灵活遍历对实际应用至关重要。尽管扩散模型在零样本逆问题求解中取得了近期成功，但在基于扩散的逆算法中实现D-P遍历的高效且原则性策略仍缺乏充分刻画。本文提出一种逐阶段框架，利用单个扩散模型在零样本逆问题中实现D-P遍历。我们提出的方法称为MAP-RPS，首先进行MAP估计阶段，近似MMSE解并提供低失真初始化，随后进行重噪声后验采样阶段，逐步提升感知质量。我们对两个阶段进行了理论分析，验证了所提设计的有效性和正确性。此外，我们将MAP-RPS扩展到潜空间，得到LMAP-RPS，通过利用大规模预训练潜扩散骨干网络，具有更广泛的适用性。大量实验表明，MAP-RPS和LMAP-RPS在各种任务上实现了更有效的D-P遍历，同时作为实际逆问题的高效求解器也表现出强劲性能。

英文摘要

The distortion-perception (D-P) tradeoff is a fundamental phenomenon of Bayesian inverse problems, which characterizes the inherent tension between distortion performance and perceptual quality. Enabling flexible traversal of the D-P tradeoff at inference time is crucial for practical applications. Despite the recent success of diffusion models in zero-shot inverse problem solving, efficient and principled strategies for D-P traversal in diffusion-based inverse algorithms remain inadequately characterized. In this paper, we propose a stage-wise framework for realizing D-P traversal using a single diffusion model in zero-shot inverse problems. Our proposed method, termed MAP-RPS, starts with an MAP estimation stage that approximates the MMSE solution and provides a low-distortion initialization, followed by a re-noised posterior sampling stage that progressively improves perceptual quality. We provide theoretical analyses for both stages, establishing the validity and effectiveness of the proposed design. Furthermore, we extend MAP-RPS to the latent space, yielding LMAP-RPS, which enjoys broader applicability by leveraging large-scale pre-trained latent diffusion backbones. Extensive experiments demonstrate that MAP-RPS and LMAP-RPS enable more effective D-P traversal on various tasks, while also exhibiting strong performance as efficient solvers for real-world inverse problems.

URL PDF HTML ☆

赞 0 踩 0

2605.28418 2026-05-29 cs.LG 版本更新

Revisiting Metafeatures to Explain Model Differences on Tabular Data

重新审视元特征以解释表格数据上的模型差异

Markus Herre, Andrej Tschalzev, Sascha Marton, Christian Bartelt

发表机构 * Clausthal University of Technology, Clausthal-Zellerfeld, Germany（Clausthal技术大学，Clausthal-Zellerfeld，德国）； University of Mannheim, Mannheim, Germany（曼海姆大学，曼海姆，德国）

AI总结研究通过严格统计检验和留一法分析，发现数据集元特征无法稳健解释表格数据上不同模型族（如神经网络与树模型、非基础模型与基础模型）之间的性能差异。

详情

AI中文摘要

随着表格基础模型的兴起以及传统模型在许多任务上仍表现良好，为表格数据集选择合适模型仍然困难。我们研究数据集元特征是否能解释表格预测任务中模型族之间的性能差距。利用TabArena基准结果，我们分析数据集级别的性能差距，并将其与模型无关的数据集描述符相关联。经过严格统计检验并控制错误发现率后，我们发现：(1) 对于神经网络与树模型的差距，没有元特征能通过错误发现率控制；(2) 对于非基础模型与基础模型的差距，一个关联是稳健的，但在留一数据集预测测试中不能泛化；(3) 对于TabICLv2与TabPFN-2.6，一个稳健关联也改善了留出预测。此外，我们进行了留一数据集分析，发现元特征预测器未能比简单基线有实质性改进。总体而言，我们的结果显示了表格数据集的异质性，并且全局元特征方法不够稳健，无法对51个TabArena数据集提供解释。

英文摘要

With the rise of tabular foundation models alongside traditional models still performing well on many tasks, choosing the right model for a tabular dataset remains difficult. We investigate whether dataset meta-features can explain performance gaps between model families on tabular prediction tasks. Using the TabArena benchmark results, we analyze dataset-level performance gaps and relate them to model-agnostic dataset descriptors. After strict statistical tests with false discovery control, we find that (1) for neural network vs. tree gaps, no meta-feature survives false discovery control, (2) for non-foundation vs. foundation model gaps, one association is robust but does not generalize when tested in leave-one-dataset-out prediction, and (3) for TabICLv2 vs. TabPFN-2.6, one robust association also improves held-out prediction. Furthermore, we conduct a leave-one-dataset-out analysis and find that meta-feature predictors fail to improve meaningfully over a simple baseline. Overall, our results show the heterogeneity of tabular datasets and that global meta-feature approaches are not robust enough to offer explanations on the 51 TabArena datasets.

URL PDF HTML ☆

赞 0 踩 0

2605.28368 2026-05-29 cs.LG cond-mat.mtrl-sci physics.app-ph 版本更新

LEIA: Learned Environment for Interactive Architected Materials

LEIA: 用于交互式架构材料的学习环境

Haiqian Yang, Yuan Cao, Markus J. Buehler

发表机构 * Unreasonable Labs

AI总结提出LEIA世界模型，通过逐步施加边界条件并实时观察变形和应力场，支持工程师交互式探索架构材料，并实现快速代理引导的候选生成与排序。

Comments 22 pages, 10 figures

详情

AI中文摘要

世界模型已经实现了游戏环境和机器人操作的交互式探索，但物理工程仍然超出其能力范围：真实材料表现出非线性本构定律、携带历史依赖的内部状态、经历惯性动力学，并且可能具有跨越多个长度尺度的层次结构。我们提出了LEIA（用于交互式架构材料的学习环境），这是一个世界模型，允许工程师逐步施加边界条件并实时观察由此产生的变形和应力场。LEIA处理大型三维非结构化网格，并对用户指定的加载生成自回归响应。我们引入了MicroPlate，这是一个架构板的基准测试，涵盖微观结构建模的两种模式：通过三维几何显式解析微观结构的架构晶格，以及通过内部自由度隐式建模微观结构变化的均质板。MicroPlate用于评估LEIA以及两种模式下的四种基线方法。最后，我们证明LEIA能够实现高效的候选生成和排序，用于快速代理引导的架构材料新设计搜索，并通过有限元地面实况验证了应力准确的候选排序。

英文摘要

World models have enabled interactive exploration of game environments and robotic manipulation, but physical engineering remains beyond their reach: real materials exhibit nonlinear constitutive laws, carry history-dependent internal state, undergo inertial dynamics, and may possess hierarchical structures spanning multiple length scales. We present LEIA (Learned Environment for Interactive Architected materials), a world model that lets engineers apply boundary conditions step by step and observe the resulting deformation and stress fields in real time. LEIA handles large three-dimensional unstructured meshes and generates autoregressive responses to user-specified loading. We introduce MicroPlate, a benchmark of architected plates spanning two regimes of microstructure modeling: architected lattices that resolve microstructure explicitly through three-dimensional geometry, and a homogeneous plate where microstructural change is modeled implicitly through internal degrees of freedom. MicroPlate is used to assess LEIA alongside four baseline methods across both regimes. Finally, we demonstrate that LEIA enables efficient candidate generation and ranking for fast surrogate-guided search for de novo designs of architected materials, with stress-accurate candidate ranking validated by finite element ground truth.

URL PDF HTML ☆

赞 0 踩 0

2605.28327 2026-05-29 stat.ML cs.LG q-fin.RM stat.AP 版本更新

Insurance Pricing Optimization via Off-Policy Evaluation

通过离线策略评估进行保险定价优化

Sascha Günther, Dimitri Semenovich, Mario V. Wüthrich

发表机构 * Department of Mathematics, ETH Zurich（苏黎世联邦理工学院数学系）

AI总结本文提出基于离线策略评估和随机控制的保险定价方法，利用核化逆倾向得分估计器降低方差，并通过数据共享Lasso和神经网络两种策略优化方法实现最优定价。

详情

AI中文摘要

传统保险定价依赖于基于风险的原则，确保精算公平和偿付能力，但未明确考虑投保人的价格敏感性。我们将保险定价表述为一个决策问题，并使用离线策略评估和随机控制的工具进行研究。我们提出了一种核化逆倾向得分估计器，该估计器利用动作空间中的局部结构，与经典逆倾向得分估计器相比实现了方差减少。基于这些价值估计，我们研究了策略优化，并提出了两种计算最优定价规则的实用方法：一种可解释的数据共享Lasso公式和一种基于神经网络的灵活策略参数化。通过使用受控的合成旅行保险环境，我们实证验证了理论结果，并表明神经网络在策略优化方面优于现有技术。

英文摘要

Traditional insurance pricing relies on risk-based principles that ensure actuarial fairness and solvency but do not explicitly account for policyholders' price sensitivity. We formulate insurance pricing as a decision-making problem and study it using tools from off-policy evaluation and stochastic control. We propose a kernelized inverse propensity score estimator that exploits local structure in the action space and yields variance reduction compared to the classical inverse propensity score estimator. Building on these value estimates, we investigate policy optimization and present two practical approaches for computing optimal pricing rules: an interpretable data-shared Lasso formulation and a flexible policy parameterization based on neural networks. Using a controlled synthetic travel insurance environment, we empirically confirm the theoretical results and show that neural networks outperform existing techniques for policy optimization.

URL PDF HTML ☆

赞 0 踩 0

2605.27809 2026-05-29 cs.LG cs.CR 版本更新

Density-aware Sample-specific Attack

密度感知的样本特定攻击

Qiyuan Wang, Yao Li, Raymond K. W. Wong

发表机构 * Texas A&M University（德克萨斯A&M大学）； University of North Carolina at Chapel Hill（北卡罗来纳大学教堂山分校）

AI总结提出一种通过将触发样本引导至干净数据分布的低密度区域来优化后门攻击的双层优化方法，在微调和剪枝防御下均保持高攻击成功率。

详情

AI中文摘要

尽管后门攻击近期取得进展，现有方法仍易受到训练后防御（如微调或剪枝）的影响，这些防御会擦除后门。我们重新审视后门攻击的核心目标，并在受害者训练的贝叶斯最优模型下推导出刻画最优样本特定触发器构建的原则性准则。我们的分析表明，当触发样本被引导至干净数据分布的低密度区域时，攻击成功率和干净准确率保持同时达到最优，这种分布条件一次性控制中毒分布的所有矩，而非少量输入空间汇总统计量。我们引入一个双层优化框架，通过条件时间分数匹配估计密度比，并优化混合模型目标以将触发样本放置在这些稀疏区域。在MNIST、CIFAR-10、GTSRB和TinyImageNet上的广泛评估表明，我们的方法在防御前达到99%以上的攻击成功率，并且在微调防御下，防御后的ASR比最强基线高出50-85个百分点。针对神经元剪枝防御，该方法表现出完全免疫性，在所有剪枝阈值下均未识别出任何需要移除的神经元。这些结果暴露了当前防御范式的根本缺陷，并强调了需要超越干净分布支持域进行防御的必要性。

英文摘要

Despite recent progress in backdoor attacks, existing methods remain susceptible to post-training defenses that erase the backdoor through fine-tuning or pruning. We revisit the core objectives of backdoor attacks and derive principled criteria characterizing optimal sample-specific trigger construction under a Bayes-optimal model of the victim's training. Our analysis reveals that both attack success and clean-accuracy preservation are simultaneously optimized when triggered samples are steered into low-density regions of the clean data distribution, a distributional condition that controls all moments of the poisoned distribution at once rather than a handful of input-space summary statistics. We introduce a bilevel optimization framework that estimates density ratios via conditional time-score matching and optimizes a mixture-model objective to place triggered samples in these sparse regions. Extensive evaluations on MNIST, CIFAR-10, GTSRB, and TinyImageNet demonstrate that our method achieves above 99\% attack success rate before defense and retains 50--85 percentage points higher post-defense ASR than the strongest baselines under fine-tuning defenses. Against neuron-pruning defenses, the method exhibits complete immunity, with zero neurons identified for removal across all pruning thresholds. These results expose a fundamental gap in current defense paradigms and underscore the need for defenses that operate beyond the support of the clean distribution.

URL PDF HTML ☆

赞 0 踩 0

2605.27696 2026-05-29 cs.CV cs.LG 版本更新

Structure over Pixels: Learning Variable-Length Visual Programs

结构优于像素：学习可变长度视觉程序

Piotr Wyrwiński, Kacper Dobek, Krzysztof Krawiec

发表机构 * Institute of Computing Science（计算科学研究所）； Poznan University of Technology（波兹南技术大学）

AI总结提出STROP离散视觉分词器架构，通过基于DINOv3特征的局部率失真监督学习可变长度视觉程序，以结构表示替代像素重建。

详情

AI中文摘要

离散视觉分词器将图像转换为有序的代码序列，为场景的结构描述提供了自然表示。然而，现有的自适应分词器要么需要事后搜索，要么在预训练速率的离散集合中进行选择，而不是学习与模型和场景耦合的连续每图像序列长度，并且它们通常针对像素重建进行训练，强调纹理而非结构。我们提出STROP，一种离散视觉分词器架构，形成结构场景表示并同时学习图像的视觉程序应该有多长。使用由冻结的DINOv3特征的局部率失真探针监督的四阶段课程，STROP优化了一个专门的长度头，在单次前向传递中估计活动前缀长度。通过绕过像素级重建梯度，码本完全由高层潜在表示的质量塑造。程序长度随场景复杂性增长，组合结构的迹象出现在下游密集预测迁移和对学习代码词汇的直接检查中。

英文摘要

Discrete visual tokenizers translate images into ordered sequences of codes, providing a natural representation for structural description of scenes. Yet existing adaptive tokenizers either require post-hoc search or select among a discrete set of pre-trained rates, rather than learning a continuous per-image sequence length coupled to the model and scene, and they typically train against pixel reconstruction, emphasizing texture rather than structure. We propose STROP, a discrete visual tokenizer architecture that forms structural scene representations and simultaneously learns how long an image's visual program should be. Using a four-phase curriculum supervised by local rate--distortion probes against frozen DINOv3 features, STROP optimizes a dedicated length head that estimates the active prefix length in a single forward pass. By bypassing pixel-level reconstruction gradients, the codebook is shaped entirely by the quality of higher-level latent representations. Program length grows with scene complexity, and signs of compositional structure emerge both in downstream dense-prediction transfer and in direct inspection of the learned code vocabulary.

URL PDF HTML ☆

赞 0 踩 0

2605.27078 2026-05-29 cs.LG cs.AI 版本更新

Paris 2.0: 一种去中心化的视频生成扩散模型

Ali Rouzbayani, Bidhan Roy, Marcos Villagra, Zhiying Jiang

AI总结本文提出Paris 2.0，首个通过去中心化计算预训练的视频生成模型，基于Paris 1.0的扩散模型框架，在低分辨率文本到视频任务中相比集中式模型将FVD从561.04降至279.01，提升约2倍，并提高了CLIP文本-视频相似度和美学评分。

Comments 6 pages, 5 figures

2605.25303 2026-05-29 cs.DS cs.LG math.ST stat.ML stat.TH 版本更新

Algorithms with Polynomially-Improved Approximation Factors for the $2 \rightarrow q$ Norm, and Applications

具有多项式改进近似因子的 $2 \rightarrow q$ 范数算法及其应用

Samuel B. Hopkins, Stefan Tiegel

发表机构 * MIT（麻省理工学院）

AI总结本文针对 $q>2$ 时的 $2 \rightarrow q$ 范数，提出了首个多项式时间近似算法，其近似因子在多项式级别上优于基线 $d^{1/4}$，例如 $q=4$ 时达到 $d^{1/8}$，并构造了平方和证书，从而改进了鲁棒均值估计、协方差估计、回归和聚类等问题的算法。

Comments v2 corrected minor typos

详情

AI中文摘要

矩阵 $X \in \mathbb{R}^{n \times d}$ 的 $2 \rightarrow q$ 范数定义为 $\lVert X \rVert_{2 \rightarrow q} = \sup_{\lVert v \rVert_2 = 1} \lVert Xv \rVert_q$。我们针对 $q > 2$（即超收缩设置）给出了该范数的多项式时间乘法近似算法。该问题要么直接对应，要么与组合优化和近似难度（例如小集扩张）、量子信息（例如最佳可分态）以及算法统计学中长期存在的开放问题密切相关。关于在多项式时间内能为此问题达到何种近似因子，我们所知甚少，尽管此类近似具有重要的下游影响。Barak、Brandão、Harrow、Kelner、Steurer 和 Zhou 表明，假设指数时间假设（FOCS'12），没有多项式时间算法能实现优于 $2^{\sqrt{\log n}}$ 的近似因子。另一方面，一个简单的谱算法给出了 $d^{1/4}$ 的基线近似。据我们所知，我们给出了首个在多项式因子内超越该基线的多项式时间近似算法。对于重要的特例 $q = 4$，它实现了 $d^{1/8}$ 的近似。所有先前的算法要么需要对 $X$ 附加假设，要么仅在 $n$ 较小时才能超越基线。此外，我们为 $2 \rightarrow q$ 范数构造了平方和证书。这直接改进了当数据仅满足 $q$ 阶矩有界时的鲁棒均值和协方差估计、鲁棒回归以及聚类算法。

英文摘要

The $2 \rightarrow q$ norm of a matrix $X \in \mathbb{R}^{n \times d}$ is defined as $\lVert X \rVert_{2 \rightarrow q} = \sup_{\lVert v \rVert_2 = 1} \lVert Xv \rVert_q$. We give polynomial-time multiplicative approximation algorithms for this norm when $q > 2$ (i.e. in the hypercontractive setting). This problem either directly captures or is closely related to long-standing open problems in combinatorial optimization and hardness of approximation (e.g. Small Set Expansion), quantum information (e.g. Best Separable State), and algorithmic statistics. Very little is known about what approximation factors we can achieve for this problem in polynomial time, even though such approximations have significant downstream consequences. Barak, Brandão, Harrow, Kelner, Steurer, and Zhou showed that no polynomial-time algorithm can achieve an approximation factor better than $2^{\sqrt{\log n}}$, assuming the Exponential Time Hypothesis (FOCS'12). On the other hand, a simple spectral algorithm gives a $d^{1/4}$-approximation as a baseline. We give, to the best of our knowledge, the first polynomial-time approximation algorithm beating this baseline by polynomial factors. For the important special case of $q = 4$ it achieves a $d^{1/8}$-approximation. All previous algorithms required additional assumptions on $X$, or only surpassed the baseline for small values of $n$. Moreover, we construct sum-of-squares certificates for the $2 \rightarrow q$ norm. This directly implies improved algorithms for robust mean and covariance estimation, robust regression, and clustering, when the data only satisfies a bound on its $q$-th moment.

URL PDF HTML ☆

赞 0 踩 0

2605.24934 2026-05-29 cs.RO cs.AI cs.CV cs.LG 版本更新

HumanEgo: Zero-Shot Robot Learning from Minutes of Human Egocentric Videos

HumanEgo：从几分钟的人类自我中心视频中零样本学习机器人

Zhi Wang, Botao He, Kelin Yu, Seungjae Lee, Ruohan Gao, Furong Huang, Yiannis Aloimonos

发表机构 * University of Maryland（马里兰大学）

AI总结提出HumanEgo框架，通过将人类演示提升为手-物体交互的实体级表示，并训练具有密集辅助目标的流匹配策略，实现从人类自我中心视频到机器人的零样本、无机器人数据、硬件无关的技能迁移。

Comments Project page: https://humanego-ai.github.io

详情

AI中文摘要

人类自我中心视频捕捉了丰富的操作演示，无需任何机器人硬件，但由于人类和机器人在视觉外观和运动学上的具身差距，将这些技能迁移到机器人仍然具有挑战性。我们提出了HumanEgo，一个通过将每个人类演示提升为手-物体交互的实体级表示，并训练具有密集辅助目标的流匹配策略来弥合具身差距的框架，该策略放大了每个轨迹的监督信号。HumanEgo无需机器人数据、硬件无关、数据高效且可零样本地从人类迁移到机器人。每个任务仅需30分钟的人类视频，HumanEgo在四个真实世界任务中实现了92.5%的平均成功率（仅15分钟即可达到75%），比匹配时间的机器人遥操作高出41%，并且能够稳健地零样本迁移到新的机器人、相机和环境。我们发布了HumanEgo作为一个易于使用的开源框架，用于直接从人类数据学习机器人策略：https://github.com/TX-Leo/HumanEgo

英文摘要

Human egocentric video captures rich manipulation demonstrations without any robot hardware, yet transferring these skills to robots remains challenging due to the embodiment gap between human and robot in both visual appearance and kinematics. We present HumanEgo, a framework that bridges the embodiment gap by lifting each human demonstration to an entity-level representation of hand-object interaction, and training a flow matching policy with dense auxiliary objectives that amplify supervision from every trajectory. HumanEgo is robot-data-free, hardware-agnostic, data-efficient, and zero-shot human-to-robot transferable. With only 30 minutes of human videos per task, HumanEgo achieves 92.5% average success across four real-world tasks (75% with just 15 minutes), outperforms matched-time robot teleoperation by 41%, and robustly transfers zero-shot across novel robots, cameras, and environments. We release HumanEgo as an easy-to-use, open-source framework for learning robot policies directly from human data: https://github.com/TX-Leo/HumanEgo

URL PDF HTML ☆

赞 0 踩 0

2605.23239 2026-05-29 cs.LG 版本更新

Self-supervised Adversarial Purification for Graph Neural Networks

自监督对抗净化用于图神经网络

Woohyun Lee, Hogun Park

发表机构 * Department of Computer Science and Engineering（计算机科学与工程系）； Sungkyunkwan University（全州大学）； Suwon, South Korea（韩国水原）

AI总结提出自监督对抗净化框架，通过专用净化器GPR-GAE（基于广义PageRank滤波器的图自编码器）在分类前净化输入数据，实现鲁棒性与分类器分离，达到最先进的防御性能。

Comments Accepted at ICML 2025. 21 pages. Code is available at: https://github.com/woodavid31/GPR-GAE

详情

Journal ref: Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:33715-33735, 2025

AI中文摘要

防御图神经网络（GNN）免受对抗攻击需要在准确性和鲁棒性之间取得平衡，而传统方法（如对抗训练）将这两个冲突目标交织在单个分类器中，往往处理不当。为克服这一局限，我们提出一种自监督对抗净化框架。通过引入专用净化器，在分类前净化输入数据，将鲁棒性与分类器分离。与先前的对抗净化方法不同，我们提出GPR-GAE，一种新颖的图自编码器（GAE），作为专用净化器，采用自监督策略训练，以数据驱动方式适应多样化的图结构。利用多个广义PageRank（GPR）滤波器，GPR-GAE捕获多样化的结构表示，实现鲁棒且有效的净化。我们的多步净化过程进一步促进GPR-GAE实现精确的图恢复和对结构扰动的鲁棒防御。跨不同数据集和攻击场景的实验表明，GPR-GAE具有最先进的鲁棒性，可作为GNN分类器的独立即插即用净化器。

英文摘要

Defending Graph Neural Networks (GNNs) against adversarial attacks requires balancing accuracy and robustness, a trade-off often mishandled by traditional methods like adversarial training that intertwine these conflicting objectives within a single classifier. To overcome this limitation, we propose a self-supervised adversarial purification framework. We separate robustness from the classifier by introducing a dedicated purifier, which cleanses the input data before classification. In contrast to prior adversarial purification methods, we propose GPR-GAE, a novel graph auto-encoder (GAE), as a specialized purifier trained with a self-supervised strategy, adapting to diverse graph structures in a data-driven manner. Utilizing multiple Generalized PageRank (GPR) filters, GPR-GAE captures diverse structural representations for robust and effective purification. Our multi-step purification process further facilitates GPR-GAE to achieve precise graph recovery and robust defense against structural perturbations. Experiments across diverse datasets and attack scenarios demonstrate the state-of-the-art robustness of GPR-GAE, showcasing it as an independent plug-and-play purifier for GNN classifiers.

URL PDF HTML ☆

赞 0 踩 0

2605.20612 2026-05-29 cs.LG 版本更新

Matryoshka Concept Bottleneck Models

Matryoshka 概念瓶颈模型

Ziye Chen, Hongbin Lin, Jie Li, Lijie Hu

发表机构 * Mohamed bin Zayed University of Artificial Intelligence（莫扎德·本·扎耶德人工智能大学）； The Hong Kong University of Science and Technology (Guangzhou)（香港科学与技术大学（广州））

AI总结提出 Matryoshka 概念瓶颈模型 (MCBM)，通过嵌套层次结构实现自适应概念利用，将预期干预成本从线性降低到对数阶 O(log K)，同时保证单调性能提升。

详情

AI中文摘要

概念瓶颈模型 (CBMs) 已成为可解释深度学习的一种重要范式，通过将预测基于人类可理解的概念来学习。然而，它们的实际部署受到测试时干预成本高昂的阻碍，因为纠正模型错误通常需要人类专家手动检查和验证大量预测概念。现有方法存在根本性的结构限制：它们要么采用单一静态概念集，迫使专家详尽地标注概念，导致高昂的干预成本；要么训练多个针对不同概念预算的模型，导致大量的计算和维护开销。为了解决这一挑战，我们提出了 Matryoshka 概念瓶颈模型 (MCBM)，这是一种统一的架构，能够在单个模型中实现自适应概念利用。受 Matryoshka 表示学习的启发，MCBM 基于最大相关性和最小冗余性将概念组织成嵌套层次结构，允许在不重新训练的情况下在多个概念粒度级别进行推理。理论上，我们证明 MCBM 将预期干预成本从线性降低到对数阶 $O(\log K)$，同时保证单调性能提升。实验上，大量实验表明，MCBM 在实现动态且高效的专家交互的同时，与独立训练的模型性能相当。

英文摘要

Concept Bottleneck Models (CBMs) have emerged as a prominent paradigm for interpretable deep learning, learning by grounding predictions in human-understandable concepts. However, their practical deployment is hindered by the high cost of test-time intervention, as correcting model errors typically requires human experts to manually inspect and verify a large set of predicted concepts. Existing approaches suffer from a fundamental structural limitation: they either adopt a single static concept set, forcing experts to exhaustively annotate concepts and incurring prohibitive intervention costs, or train multiple models tailored to different concept budgets, resulting in substantial computational and maintenance overhead. To address this challenge, we propose the Matryoshka Concept Bottleneck Model (MCBM), a unified architecture that enables adaptive concept utilization within a single model. Inspired by Matryoshka Representation Learning, MCBM organizes concepts into a nested hierarchy based on maximum relevance and minimum redundancy, allowing inference at multiple levels of conceptual granularity without retraining. Theoretically, we show that MCBM reduces the expected intervention costs from linear to logarithmic order, $O(\log K)$, while guaranteeing monotonic performance improvement. Empirically, extensive experiments demonstrate that MCBM matches the performance of independently trained models while enabling dynamic and efficient expert interaction.

URL PDF HTML ☆

赞 0 踩 0

2605.16608 2026-05-29 cs.LG cs.CL 版本更新

TabPFN-3: 技术报告

Léo Grinsztajn, Klemens Flöge, Oscar Key, Felix Birkel, Philipp Jund, Brendan Roof, Mihir Manium, Shi Bin Hoo, Magnus Bühler, Anurag Garg, Dominik Safaric, Jake Robertson, Benjamin Jäger, Simone Alessi, Adrian Hayler, Vladyslav Moroshan, Lennart Purucker, Philipp Singer, Alan Arazi, Julien Siems, Jan Hendrik Metzen, Georg Grab, Nick Erickson, Siyuan Guo, Eliott Kalfon, Simon Bing, David Salinas, Clara Cornu, Lilly Charlotte Wehrhahn, Diana Kriuchkova, Kursat Kaya, Lydia Sidhoum, Marie Salmon, Jerry Chen, Madelon Hulsebos, Yann LeCun, Samuel Müller, Bernhard Schölkopf, Sauraj Gambhir, Noah Hollmann, Frank Hutter

发表机构 * Prior Labs

AI总结本文提出TabPFN-3，通过扩展训练数据和优化推理，在表格数据上实现最先进性能，并支持时间序列、关系数据和表格文本数据。

详情

AI中文摘要

表格数据支撑着科学和工业中大多数高价值预测问题，而TabPFN推动了该模态的基础模型革命。根据用户反馈设计，TabPFN-3在此基础上将最先进性能扩展到具有100万训练行的数据集，并大幅减少训练和推理时间。TabPFN-3完全基于我们先验的合成数据进行预训练，极大地推动了表格预测的前沿，并在时间序列、关系数据和表格文本数据上带来了实质性收益。在标准表格基准TabArena上，TabPFN-3的前向传播以显著优势优于所有其他模型（包括调优和集成基线），并在速度/性能前沿上占据帕累托优势。在更多样化的数据集上，TabPFN-3在多类数据集上排名第一，并在多达100万训练行和200个特征的数据集上击败了经过8小时调优的梯度提升树基线。TabPFN-3将测试时计算缩放引入表格基础模型。我们的API产品TabPFN-3-Plus（思考版）利用这一点，在TabArena上以超过200 Elo的优势击败所有非TabPFN模型，在最大数据子集上达到420 Elo，并且比AutoGluon 1.5 extreme快10倍，同时不使用LLM、真实数据、互联网搜索或除TabPFN之外的任何其他模型。TabPFN-3扩展了我们模型的能力，实现了对关系数据（在RelBenchV1上新的最先进基础模型）和表格文本数据（通过TabPFN-3-Plus在TabSTAR上达到最先进）的最先进预测；并改进了现有集成：专用检查点TabPFN-TS-3在时间序列基准fev-bench上排名第二，SHAP值计算速度提升高达120倍。TabPFN-3在实现这一性能的同时，比TabPFN-2.5快20倍。此外，减少的KV缓存和行分块技术使得在单个H100上以快速推理速度扩展到100万行。

英文摘要

Tabular data underpins most high-value prediction problems in science and industry, and TabPFN has driven the foundation model revolution for this modality. Designed with feedback from our users, TabPFN-3 builds on this foundation to scale state-of-the-art performance to datasets with 1M training rows and substantially reduce training and inference time. Pretrained exclusively on synthetic data from our prior, TabPFN-3 dramatically pushes the frontier of tabular prediction and brings substantial gains on time series, relational, and tabular-text data. On the standard tabular benchmark TabArena, a forward pass of TabPFN-3 outperforms all other models, including tuned and ensembled baselines, by a significant margin, and pareto-dominates the speed/performance frontier. On more diverse datasets, TabPFN-3 ranks first on datasets with many classes, and beats 8-hour-tuned gradient-boosted-tree baselines on datasets up to 1M training rows and 200 features. TabPFN-3 introduces test-time compute scaling to tabular foundation models. Our API offering TabPFN-3-Plus (Thinking) exploits this to beat all non-TabPFN models by over 200 Elo on TabArena, rising to 420 Elo on the largest data subset, and outperforms AutoGluon 1.5 extreme while being 10x faster, without using LLMs, real data, internet search or any other model besides TabPFN. TabPFN-3 extends the capabilities of our models, enabling SOTA prediction on relational data (new SOTA foundation model on RelBenchV1) and tabular-text data (SOTA on TabSTAR via TabPFN-3-Plus); and improves existing integrations: a specialized checkpoint, TabPFN-TS-3, ranks 2nd on the time-series benchmark fev-bench, and SHAP-value computation is up to 120x faster. TabPFN-3 achieves this performance while being up to 20x faster than TabPFN-2.5. In addition, a reduced KV cache and row-chunking scale to 1M rows on one H100 with fast inference speed.

URL PDF HTML ☆

赞 0 踩 0

2605.13230 2026-05-29 cs.LG cs.AI 版本更新

Teacher-Guided Policy Optimization for On-Policy Reasoning Distillation under Large Policy Divergence

教师引导的策略优化：大策略差异下的在线推理蒸馏

Xinyu Liu, Kechen Jiao, Chunyang Xiao, Runsong Zhao, Junhao Ruan, Bei Li, Jiahao Liu, Qifan Wang, Xin Chen, Jingang Wang, Chenglong Wang, Tong Xiao, JingBo Zhu

发表机构 * School of Computer Science and Engineering, Northeastern University, China（东北大学计算机科学与工程学院）； Tsinghua University（清华大学）； Meituan（美团）； Meta AI ； NiuTrans Research, Shenyang, China（新译研究院，沈阳，中国）

AI总结针对在线蒸馏中教师与学生策略差异大时反向KL监督失效的问题，提出教师引导策略优化（TGPO），通过教师直接指导学生上下文的token级生成并结合RLVR奖励，在推理基准上优于现有方法。

详情

AI中文摘要

在线蒸馏（OPD）已成为面向推理的大型语言模型（LLM）后训练的一种有前景的范式，特别是与可验证奖励的强化学习（RLVR）结合时。现有的OPD方法依赖于基于反向KL（RKL）的教师监督，对学生策略采样的轨迹进行监督。然而，我们识别出一个关键限制：在教师-学生策略差异大的情况下，RL驱动的探索常常产生教师分布之外的轨迹，导致无信息的负面反馈。为了解决这个问题，我们提出教师引导策略优化（TGPO），一种在策略差异大设置下仍然有效的在线推理蒸馏方法。TGPO不依赖于单纯的评估监督，而是利用教师直接指导基于学生生成上下文的token级生成；结合RLVR风格的轨迹级奖励，TGPO引导探索朝向改进的延续。在推理基准上的实验表明，TGPO始终优于现有的基于RKL的OPD方法，并且在不同教师模型下保持鲁棒性。

极端多类监督对比表示学习的精细泛化分析

Nong Minh Hieu, Antoine Ledent

发表机构 * School of Computing and Information Systems, Singapore Management University（新加坡国立管理学院计算机与信息系统学院）

AI总结针对对比表示学习在有限标注数据中构造元组导致依赖性的问题，提出改进的U-统计量分析，得到与类别数R同阶的样本复杂度，并设计新估计器在长尾分布下实现O(k)的样本复杂度。

Comments Accepted at ICML 2026

详情

AI中文摘要

对比表示学习（CRL）在多个机器学习领域取得了强大的实证成功，但其理论样本复杂度仍然知之甚少。现有分析通常假设输入元组是独立同分布的，这一假设在大多数实际设置中被违反，因为对比元组是从有限标注数据池中构造的，导致元组之间存在依赖性。虽然最近有一项工作使用U-统计量分析这种学习设置以估计总体风险，但其中使用的技术要求每个类别的风险均匀集中，使得超额风险界限的规模为$ρ_{\min}^{-{1}/{2}}$，其中$ρ_{\min}$表示最稀有类别的概率。这种依赖在极端多类设置中可能过于悲观，因为存在许多尾部类别，它们对总体风险的贡献极小。我们的贡献有两方面。首先，我们改进了先前的工作，证明了一个样本复杂度与类别数$R$同阶的界限，无论类别分布如何。此外，我们制定了一个不同的估计器，捕捉风险 extit{跨类别}的集中性，从而在极端多类学习场景中实现更尖锐的界限，特别是在类别分布为长尾的情况下。在类别分布的温和假设下，得到的样本复杂度为$\mathcal{O}(k)$，其中$k$是每个元组的样本数。

英文摘要

Contrastive Representation Learning (CRL) has achieved strong empirical success in multiple machine learning disciplines, yet its theoretical sample complexity remains poorly understood. Existing analyses usually assume that input tuples are identically and independently distributed, an assumption violated in most practical settings where contrastive tuples are constructed from a finite pool of labeled data, inducing dependencies among tuples. While one recent work analyzed this learning setting using U-Statistics to estimate the population risk, the techniques used therein require the risk of each class to concentrate uniformly, making excess risk bounds scale in the order of $ρ_{\min}^{-{1}/{2}}$ where $ρ_{\min}$ denotes the probability of the rarest class. Such a dependency can be overly pessimistic in the extreme multiclass settings where there are many tail classes which contribute minimally to the overall population risk. Our contributions are two-fold. Firstly, we improve upon the previous work and prove a bound with a sample complexity of the same order as the number of classes $R$, regardless of the distribution over classes. Furthermore, we formulate a different estimator that captures the concentration of the risk \textit{across classes}, enabling sharper bounds in extreme multi-class learning scenarios, especially where class distributions are long-tailed. Under mild assumptions on the class distributions, the resulting sample complexity is $\mathcal{O}(k)$ where $k$ is the number of samples per tuple.

URL PDF HTML ☆

赞 0 踩 0

2605.06355 2026-05-29 cs.LG stat.ML 版本更新

CompleteRXN：迈向完整开放化学反应数据库

Gabriel Vogel, Minouk Noordsij, Evgeny Pidko, Jana M. Weber

发表机构 * Department of Intelligent Systems（智能系统系）； Delft University of Technology（代尔夫特理工大学）； Department of Chemical Engineering（化学工程系）

AI总结针对化学反应数据库（如USPTO）普遍存在的不完整问题，提出CompleteRXN基准和约束反应平衡器（CRB）模型，通过监督学习和约束解码实现高精度的反应补全。

详情

AI中文摘要

诸如USPTO等化学反应数据集存在严重的不完整性，经常缺失副产物、共反应物和化学计量系数。这限制了它们在下游应用中的适用性和可靠性。在此，我们介绍CompleteRXN，一个在现实缺失数据条件下用于反应补全的大规模监督基准。通过将USPTO记录映射到精心整理的机理反应，我们构建了一个对齐的不完整和原子平衡反应数据集。我们评估了代表性基线方法，包括一种新颖的具有约束解码的编码器-解码器反应补全模型——约束反应平衡器（CRB），以及最近的算法方法SynRBL。在我们的CompleteRXN基准上，CRB在难度递增的划分上实现了高性能，在随机划分上达到99.20%的等价准确率，在极端分布外划分上达到91.12%。SynRBL生成了许多平衡且化学上合理的补全结果，但在基准测试划分上的准确率较低。在所有方法中，性能随着不完整程度的增加而下降。当在基准之外（完整的未整理USPTO）评估反应时，我们观察到性能大幅下降，这突显了基准性能与实际鲁棒性之间的差距，并激励了未来的工作。

英文摘要

Chemical reaction datasets such as USPTO suffer from substantial incompleteness, frequently missing byproducts, co-reactants, and stoichiometric coefficients. This limits their applicability and reliability in downstream applications. Here, we introduce CompleteRXN, a large-scale supervised benchmark for reaction completion under realistic missing-data conditions. We construct a dataset of aligned incomplete and atom-balanced reactions by mapping USPTO records to curated mechanistic reactions. We evaluate representative baselines, including a novel encoder-decoder reaction completion model with constrained decoding, the Constrained Reaction Balancer (CRB), and a recent algorithmic method, SynRBL. On our CompleteRXN benchmark, the CRB achieves high performance across splits of increasing difficulty, reaching 99.20% equivalence accuracy on the random split and 91.12% on the extreme out-of-distribution split. SynRBL produces many balanced and chemically plausible completions, but with lower accuracy on the benchmark test splits. Across all methods, performance degrades with increasing incompleteness. We observe a substantial drop when evaluating on reactions outside the benchmark (full uncurated USPTO), highlighting the gap between benchmark performance and practical robustness and motivating future work.

URL PDF HTML ☆

赞 0 踩 0

2604.27272 2026-05-29 cs.CL cs.AI cs.LG 版本更新

When 2D Tasks Meet 1D Serialization: On Serialization Friction in Structured Tasks

当2D任务遇到1D序列化：结构化任务中的序列化摩擦

Chung-Hsiang Lo, Lu Li, Diji Yang, Tianyu Zhang, Yunkai Zhang, Yoshua Bengio, Yi Zhang

发表机构 * Northeastern University（东北大学）； University of Pennsylvania（宾夕法尼亚大学）； UC Santa Cruz（加州大学圣克鲁兹分校）； Mila - Quebec AI Institute（魁北克人工智能研究所）； University of Montreal（蒙特利尔大学）； BAIR, UC Berkeley（伯克利大学BAIR实验室）

AI总结研究通过矩阵转置、康威生命游戏和LU分解三个任务，发现将二维布局任务序列化为一维文本会因表示不匹配导致性能下降，且错误呈现空间结构模式。

详情

AI中文摘要

在LLM时代，许多符号化和结构化问题通过一维文本序列化呈现给模型。然而，其中一些问题本质上是二维的：它们的相关关系，如行列对应或空间邻接，由二维布局中的位置定义，而非顺序。这引发了一个表示问题：在一维序列中保留相同的符号条目是否也保留了计算所需的关系结构？我们通过序列化摩擦的视角研究这一问题：即相同底层任务实例和条目仍然存在，但依赖于布局的关系在一维序列化下变得隐式的表示不匹配。本研究使用三个受控合成测试任务：矩阵转置、康威生命游戏和LU分解。在每个任务中，相同的实例要么作为一维文本序列化呈现，要么作为其原生二维布局渲染为图像呈现。在整个测试集中，随着任务规模增长，一维序列化的性能下降更显著，且序列化下的错误呈现空间结构模式，表明这种呈现选择在我们的测试集中具有重要影响。为了进一步解释这些结果，我们添加了补充分析，包括视觉内探针以及混合训练转置设置下两种输入呈现的额外比较。这些发现表明，对于布局定义的任务，将输入简化为1D序列化并非中性的表示选择。

英文摘要

In the LLM era, many symbolic and structured problems are presented to models through 1D text serialization. Yet some such problems are natively two-dimensional: their relevant relations, such as row--column correspondence or spatial adjacency, are defined by position in a 2D layout rather than by sequential order. This raises a representational question: does preserving the same symbolic entries in a 1D sequence also preserve the relational structure needed for computation? We study this issue through the lens of serialization friction: the representational mismatch in which the same underlying task instances and entries are still present, but relations that depend on layout become implicit under 1D serialization. The study uses a controlled synthetic testbed of three tasks: matrix transpose, Conway's Game of Life, and LU decomposition. In each task, the same instances are presented either as 1D text serialization or as their native 2D layout rendered as an image. Across this testbed, 1D serialization degrades more sharply as task size grows, and errors under serialization exhibit spatially structured patterns, suggesting that this presentation choice is consequential within our testbed. To further interpret these results, we add supplementary analyses that include a within-visual probe and an additional comparison of the two input presentations under the mixed-training transpose setting. These findings suggest that, for layout-defined tasks, reducing inputs to 1D serialization is not a neutral choice of representation.

URL PDF HTML ☆

赞 0 踩 0

2604.26645 2026-05-29 cs.AI cs.LG 版本更新

SciHorizon-DataEVA: An Agentic System for AI-Readiness Evaluation of Heterogeneous Scientific Data

SciHorizon-DataEVA：面向异构科学数据AI就绪性评估的智能体系统

Dianyu Liu, Chuan Qin, Xi Chen, Xiaohan Li, Wenxi Xu, Yuyang Wang, Xin Chen, Yuanchun Zhou, Hengshu Zhu

发表机构 * SciHorizon Team, Computer Network Information Center, Chinese Academy of Sciences（科学前沿团队，计算机网络信息中心，中国科学院）

AI总结提出SciHorizon-DataEVA智能体系统，基于Sci-TQA2原则和层次化多智能体评估方法，实现对异构科学数据的可扩展AI就绪性评估。

详情

AI中文摘要

AI-for-Science (AI4Science) 正通过将机器学习模型嵌入跨领域的预测、模拟和假设生成工作流程，日益变革科学发现。然而，这些模型的有效性从根本上受到科学数据AI就绪性的限制，目前尚不存在可扩展且系统的评估机制。在这项工作中，我们提出了SciHorizon-DataEVA，一种新颖的智能体系统，用于对异构科学数据进行可扩展的AI就绪性评估。在评估标准层面，我们引入了Sci-TQA2原则，将AI就绪性组织为四个互补维度：治理可信度、数据质量、AI兼容性和科学适应性。每个维度被分解为可测量的原子元素，以实现细粒度且可执行的评估。为了大规模实施这些原则，我们开发了Sci-TQA2-Eval，一种通过有向循环工作流编排的层次化多智能体评估方法。我们的Sci-TQA2-Eval通过结合轻量级数据集分析、适用性感知的度量激活以及基于领域约束和数据集-论文信号的知识增强规划，动态构建数据集感知的评估规范。这些规范通过自适应的、以工具为中心的评估机制执行，该机制具有内置的验证和自我修正能力，从而实现对异构科学数据的可扩展且可靠的评估。在跨多个领域的科学数据集上的广泛实验证明了SciHorizon-DataEVA在原则性AI就绪性评估方面的有效性和通用性。

英文摘要

AI-for-Science (AI4Science) is increasingly transforming scientific discovery by embedding machine learning models into prediction, simulation, and hypothesis generation workflows across domains. However, the effectiveness of these models is fundamentally constrained by the AI-readiness of scientific data, for which no scalable and systematic evaluation mechanism currently exists. In this work, we propose SciHorizon-DataEVA, a novel agentic system to scalable AI-readiness evaluation of heterogeneous scientific data. At the evaluation-criteria level, we introduce the Sci-TQA2 principles, which organize AI-readiness into four complementary dimensions: Governance Trustworthiness, Data Quality, AI Compatibility, and Scientific Adaptability. Each dimension is decomposed into measurable atomic elements that enable fine-grained and executable assessment. To operationalize these principles at scale, we develop Sci-TQA2-Eval, a hierarchical multi-agent evaluation approach orchestrated through a directed, cyclic workflow. Our Sci-TQA2-Eval dynamically constructs dataset-aware evaluation specifications by combining lightweight dataset profiling, applicability-aware metric activation, and knowledge-augmented planning grounded in domain constraints and dataset-paper signals. These specifications are executed through an adaptive, tool-centric evaluation mechanism with built-in verification and self-correction, enabling scalable and reliable assessment across heterogeneous scientific data. Extensive experiments on scientific datasets spanning multiple domains demonstrate the effectiveness and generality of SciHorizon-DataEVA for principled AI-readiness evaluation.

URL PDF HTML ☆

赞 0 踩 0

2604.23862 2026-05-29 cs.LG cs.AI cs.CL 版本更新

Graph Memory Transformer (GMT)

图记忆Transformer (GMT)

Nicola Zanarini, Niccolò Ferrari, Evelina Lamma

发表机构 * Bonfiglioli Engineering s.r.l.（博尼菲利工程公司）； Department of Engineering, University of Ferrara（费拉拉大学工程学院）； NAIS s.r.l.（NAIS公司）

AI总结提出用显式学习的记忆图替换解码器-only Transformer中的前馈网络子层，保留自回归架构，实现可解释的记忆导航。

Comments 65 pages, 10 figures, 5 tables. Author list updated in arXiv metadata; no technical changes. Code available at https://github.com/Nemesis533/GMT-GraphMemoryTransformer

详情

AI中文摘要

我们研究是否可以在解码器-only Transformer中，用显式学习的记忆图替换前馈网络（FFN）子层，同时保留周围的自回归架构。所提出的图记忆Transformer（GMT）保持因果自注意力不变，但将通常的逐token FFN变换替换为一个记忆单元，该单元通过一个由学习的有向转移矩阵连接的质心库来路由token表示。在此处研究的基础GMT v7实例中，16个Transformer块中的每个块包含128个质心、一个128*128的边矩阵、引力源路由、token条件目标选择以及门控位移读出。因此，该单元返回从估计的源记忆状态到目标记忆状态的移动，而不是检索到的值。由此产生的模型是一个完全解码器-only的语言模型，具有82.2M可训练参数且没有密集的FFN子层，而评估中使用的密集GPT风格基线有103.0M参数。基础v7模型训练稳定，并将质心使用、转移结构和源到目标移动作为前向计算中可直接检查的量。在验证损失和困惑度方面，它落后于较大的密集基线（3.5995/36.58 vs. 3.2903/26.85），但在评估设置下显示出接近的零样本基准表现。这些结果并非旨在声称最先进性能；它们支持用图介导的记忆导航替换密集的token内变换的可行性和结构可解释性。更广泛的扩展、优化的内核以及更广泛的基准评估留待后续工作。

英文摘要

We investigate whether the Feed-Forward Network (FFN) sublayer in a decoder-only transformer can be replaced by an explicit learned memory graph while preserving the surrounding autoregressive architecture. The proposed Graph Memory Transformer (GMT) keeps causal self-attention intact, but replaces the usual per-token FFN transformation with a memory cell that routes token representations over a learned bank of centroids connected by a learned directed transition matrix. In the base GMT v7 instantiation studied here, each of 16 transformer blocks contains 128 centroids, a 128 * 128 edge matrix, gravitational source routing, token-conditioned target selection, and a gated displacement readout. The cell therefore returns movement from an estimated source memory state toward a target memory state, rather than a retrieved value. The resulting model is a fully decoder-only language model with 82.2M trainable parameters and no dense FFN sublayers, compared with a 103.0M-parameter dense GPT-style baseline used in the evaluation. The base v7 model trains stably and exposes centroid usage, transition structure, and source-to-target movement as directly inspectable quantities of the forward computation. It remains behind the larger dense baseline in validation loss and perplexity (3.5995/36.58 vs. 3.2903/26.85), while showing close zero-shot benchmark behavior under the evaluated setting. These results are not intended as a state-of-the-art claim; they support the viability and structural interpretability of replacing dense within-token transformation with graph-mediated memory navigation. Broader scaling, optimized kernels, and more extensive benchmark evaluation are left for subsequent work.

URL PDF HTML ☆

赞 0 踩 0

2604.19011 2026-05-29 cs.LG cs.RO 版本更新

Accelerating trajectory optimization with Sobolev-trained diffusion policies

基于Sobolev训练的扩散策略加速轨迹优化

Théotime Le Hellard, Franki Nguimatsia Tiofack, Quentin Le Lidec, Justin Carpentier

发表机构 * Inria - Département d’Informatique de l’École normale supérieure, PSL Research University（法国国家科学研究中心-巴黎高等师范学院计算机系，PSL研究大学）； Courant Institute, New York University（纽约大学Courant研究所）

AI总结针对梯度型轨迹优化求解器，提出利用Sobolev学习训练扩散策略以提供初始猜测，通过利用轨迹和反馈增益的一阶损失避免复合误差，实现求解时间减少2至20倍。

详情

AI中文摘要

轨迹优化求解器利用已知系统动力学通过迭代改进计算局部最优轨迹。其缺点是每个新问题实例独立求解，因此收敛速度和求解质量依赖于初始轨迹。为提高效率，一种自然的方法是用学习策略生成的初始猜测对轨迹优化进行热启动，该策略在求解器先前生成的轨迹上训练。基于扩散的策略最近成为表达性模仿学习模型，使其成为这一角色的有前途候选者。然而，一个反直觉的挑战来自轨迹优化示范的局部最优性：当策略展开时，小的非最优偏差可能将其推入训练数据中未表示的情况，从而在长时域上引发复合误差。在这项工作中，我们专注于基于学习的热启动，用于同时提供反馈增益的梯度型轨迹优化求解器。利用这一特性，我们推导出一阶损失，用于使用轨迹和反馈增益对基于扩散的策略进行Sobolev学习。通过全面实验，我们证明所得策略避免了复合误差，因此可以从非常少的轨迹中学习，提供初始猜测，将求解时间减少2倍到20倍。结合一阶信息使得用更少的扩散步骤进行预测成为可能，从而降低推理延迟。

英文摘要

Trajectory Optimization (TO) solvers exploit known system dynamics to compute locally optimal trajectories through iterative improvements. A downside is that each new problem instance is solved independently; therefore, convergence speed and quality of the solution found depend on the initial trajectory proposed. To improve efficiency, a natural approach is to warm-start TO with initial guesses produced by a learned policy trained on trajectories previously generated by the solver. Diffusion-based policies have recently emerged as expressive imitation learning models, making them promising candidates for this role. Yet, a counterintuitive challenge comes from the local optimality of TO demonstrations: when a policy is rolled out, small non-optimal deviations may push it into situations not represented in the training data, triggering compounding errors over long horizons. In this work, we focus on learning-based warm-starting for gradient-based TO solvers that also provide feedback gains. Exploiting this specificity, we derive a first-order loss for Sobolev learning of diffusion-based policies using both trajectories and feedback gains. Through comprehensive experiments, we demonstrate that the resulting policy avoids compounding errors, and so can learn from very few trajectories to provide initial guesses reducing solving time by $2\times$ to $20 \times$. Incorporating first-order information enables predictions with fewer diffusion steps, reducing inference latency.

URL PDF HTML ☆

赞 0 踩 0

2603.23234 2026-05-29 cs.AI cs.LG 版本更新

MemCollab: Cross-Model Memory Collaboration via Contrastive Trajectory Distillation

MemCollab：通过对比轨迹蒸馏实现跨模型记忆协作

Yurui Chang, Yiran Wu, Qingyun Wu, Lu Lin

发表机构 * Pennsylvania State University（宾夕法尼亚州立大学）； AG2AI

AI总结针对不同骨干模型代理间共享记忆性能下降的问题，提出MemCollab框架，通过对比同一任务上不同模型生成的推理轨迹来蒸馏共享的抽象推理约束，并引入任务感知检索机制，提升异构代理的准确性和推理效率。

详情

AI中文摘要

LLM代理越来越依赖记忆机制来重用过去问题解决经验中的知识。然而，现有方法通常为单个代理构建记忆，并与同一底层模型重用，将存储的知识紧密耦合到特定模型的推理风格。在异构部署中，代理可能使用不同大小、架构或专业化的骨干模型实例化，这引发了一个关键问题：一个单一的记忆系统能否在不同骨干模型的代理之间共享？我们发现，简单的跨模型记忆传输可能会降低性能，因为存储的记忆常常将任务相关知识纠缠到模型特定的偏见中。为了解决这一挑战，我们提出了MemCollab，一个协作记忆框架，通过对比不同基于模型的代理在同一任务上生成的推理轨迹来构建共享的跨模型记忆。通过这一对比过程，MemCollab蒸馏出捕获共享任务级不变量的抽象推理约束，同时抑制模型特定的伪影。我们进一步引入了一种任务感知检索机制，根据任务类别调节记忆访问，确保在推理时只检索相关的约束。在数学推理和代码生成基准上的实验表明，MemCollab在不同代理（包括不同模型族设置）上一致地提高了准确性和推理效率。这些结果表明，协作构建的跨模型记忆可以作为异构基于LLM的代理的共享推理资源。

英文摘要

LLM agents increasingly rely on memory mechanisms to reuse knowledge from past problem-solving experiences. However, existing methods typically construct memory for a single agent and reuse it with the same underlying model, tightly coupling stored knowledge to model-specific reasoning styles. In heterogeneous deployments, where agents may be instantiated with backbone models of different sizes, architectures, or specializations, this raises a key question: can a single memory system be shared across agents with different backbone models? We find that naive cross-model memory transfer can degrade performance, because stored memories often entangle task-relevant knowledge with model-specific biases. To address this challenge, we propose MemCollab, a collaborative memory framework that builds shared cross-model memory by contrasting reasoning trajectories generated by different model-based agents on the same task. Through this contrastive process, MemCollab distills abstract reasoning constraints that capture shared task-level invariants while suppressing model-specific artifacts. We further introduce a task-aware retrieval mechanism that conditions memory access on task category, ensuring that only relevant constraints are retrieved at inference time. Experiments on mathematical reasoning and code generation benchmarks show that MemCollab consistently improves both accuracy and inference-time efficiency across diverse agents, including settings with different model families. These results demonstrate that collaboratively constructed cross-model memory can serve as a shared reasoning resource for heterogeneous LLM-based agents.

URL PDF HTML ☆

赞 0 踩 0

2603.21621 2026-05-29 cs.LG 版本更新

Path-Space Mirror Descent for On-Policy Reinforcement Learning under the Generalized Schrödinger Bridge

广义薛定谔桥下在线强化学习的路径空间镜像下降

Yuehu Gong, Zeyuan Wang, Yulin Chen, Shutong Ding, Qingyuan Zhou, Yanwei Fu

发表机构 * School of Data Science, Fudan University（复旦大学数据科学学院）； Laboratory for Big Data and Decision, National University of Defense Technology（国防科技大学大数据与决策实验室）； ShanghaiTech University（上海科技大学）； College of Computer Science and Artificial Intelligence, Fudan University（复旦大学计算机科学与人工智能学院）； Shanghai Innovation Institute（上海创新研究院）

AI总结针对生成式策略在在线策略优化中终端动作密度难处理的问题，提出GSB-MDPO方法，通过将策略优化建模为广义薛定谔桥问题并利用路径空间KL散度作为近端项，实现了无需显式终端似然评估的稳定更新。

详情

AI中文摘要

经典的在线策略算法如PPO和镜像下降策略优化通过可处理的动作似然提供稳定的近端策略更新，但通常使用简单的Gaussian策略，其在复杂连续控制任务中的表达能力有限。基于扩散和流模型的生成式策略提供了更具表达力的动作分布，但它们自然地定义了多步去噪路径上的分布，其终端动作密度通常是难以处理的，这造成了与基于似然的在线策略近端更新的不匹配。为了解决这种不匹配，我们引入了GSB-MDPO（广义薛定谔桥镜像下降策略优化），它将在线策略生成式策略优化表述为状态条件生成路径上的广义薛定谔桥问题，并通过镜像下降策略优化实例化得到的路径测度更新。关键洞察是GSB路径空间KL散度在MDPO中扮演了近端项的角色，同时上界了终端动作KL散度，从而无需显式终端动作似然评估即可直接控制执行的动作分布。在Playground和Gym-MuJoCo上的14个连续控制任务上的实验证明了GSB-MDPO的经验有效性，并支持路径空间正则化作为多步生成式策略的原则性近端更新。

英文摘要

Classical on-policy algorithms such as PPO and mirror descent policy optimization provide stable proximal policy updates through tractable action likelihoods, but are typically instantiated with simple Gaussian policies whose expressiveness can be limited in complex continuous-control tasks. Generative policies based on diffusion and flow models provide more expressive action distributions, but they naturally define distributions over multi-step denoising paths whose terminal action density is often intractable, creating a mismatch with likelihood-based on-policy proximal updates. To address this mismatch, we introduce \textbf{GSB-MDPO} (\emph{Generalized Schrödinger Bridge Mirror Descent Policy Optimization}), which formulates on-policy generative policy optimization as a Generalized Schrödinger Bridge problem over state-conditioned generation paths and instantiates the resulting path-measure update through mirror descent policy optimization. The key insight is that the GSB path-space KL plays the role of the proximal term in MDPO while upper-bounding the terminal action KL, enabling direct control of the executed action distribution without explicit terminal action likelihood evaluation. Experiments on 14 continuous-control tasks across Playground and Gym-MuJoCo demonstrate the empirical effectiveness of GSB-MDPO and support path-space regularization as a principled proximal update for multi-step generative policies.

URL PDF HTML ☆

赞 0 踩 0

2603.20329 2026-05-29 stat.ML cs.LG math.PR 版本更新

Measure flow path recovery in Bayes Hilbert spaces

贝叶斯希尔伯特空间中的测度流路径恢复

S. David Mis, Maarten V. de Hoop

发表机构 * Rice University（里士大学）

AI总结针对有限移动局部传感器恢复概率测度流的不适定问题，提出基于贝叶斯希尔伯特框架的变分理论，通过构造最小能量传输实现和线性化观测算子，分析可恢复性条件，并发展有限维约化方法实现稳定重建。

详情

AI中文摘要

我们研究使用贝叶斯希尔伯特框架从有限个移动局部传感器恢复概率测度流的不适定问题。相对于固定的参考概率测度，概率律由其中心化对数比坐标表示，因此演化律成为希尔伯特函数空间中的一条路径。对于足够正则的贝叶斯希尔伯特路径，我们通过在每个时间点求解加权纽曼问题，构造路径的规范最小能量传输实现，得到切方向上的内在传输形式。然后，我们直接在贝叶斯希尔伯特路径空间上制定逆问题。观测算子的线性化产生可观测性形式，可恢复性由其与传输几何通过联合传输-可观测性形式的相互作用决定。在无穷维环境中，我们发展了正则化变分理论，并识别了局部传感器的局限性：移动传感器可以使联合形式单射，但通常不能在整个状态空间上产生强制稳定性估计。这一障碍自然导致有限维贝叶斯希尔伯特约化。在那里，传输形式成为动能张量，线性化观测成为约化感知矩阵，因此可恢复性可以通过显式的格拉姆条件表达。我们证明局部凸起传感器检测每个固定的约化方向，有限个适当放置的静态传感器产生均匀的约化可观测性，并且存在依赖于路径的传感器轨迹，使得即使单个移动传感器也能恢复约化路径。最后，我们证明这些约化恢复结果可以提升到对由所选有限维子空间良好近似的路径的近似环境恢复，从而实现稳定重建至投影误差。

英文摘要

We study the ill-posed problem of recovering a probability measure flow from finitely many moving localized sensors using a Bayes Hilbert framework. Relative to a fixed reference probability measure, a probability law is represented by its centered log-ratio coordinates, so that an evolving law becomes a path in a Hilbert space of functions. For sufficiently regular Bayes Hilbert paths, we construct a canonical minimum-energy transport realization of the path by solving a weighted Neumann problem at each time, yielding an intrinsic transport form on tangent directions. We then formulate an inverse problem directly on Bayes Hilbert path space. Linearization of an observation operator yields an observability form, and recoverability is governed by its interaction with the transport geometry through a joint transport--observability form. In the ambient infinite-dimensional setting, we develop a regularized variational theory and identify limitations of localized sensing: mobile sensors can make the joint form injective, but they do not in general yield a coercive stability estimate on the full state space. This obstruction leads naturally to finite-dimensional Bayes Hilbert reductions. There the transport form becomes a kinetic tensor and the linearized observations become reduced sensing matrices, so recoverability can be expressed through explicit Gramian conditions. We show that localized bump sensors detect every fixed reduced direction, that finitely many suitably placed static sensors yield uniform reduced observability, and there exist path-dependent sensor trajectories such that even a single moving sensor can recover the reduced path. Finally, we show that these reduced recovery results lift to approximate ambient recovery for paths that are well approximated by the chosen finite-dimensional subspaces, yielding stable reconstruction up to projection error.

URL PDF HTML ☆

赞 0 踩 0

2603.14315 2026-05-29 cs.LG math.OC 版本更新

Enhancing LLM Training via Spectral Clipping

通过谱裁剪增强大语言模型训练

Xiaowen Jiang, Andrei Semenov, Sebastian U. Stich

发表机构 * CISPA Helmholtz Center for Information Security（信息安全研究中心）； EPFL（瑞士联邦理工学院）

AI总结提出SPECTRA框架，通过对优化器更新进行谱裁剪以约束谱范数、对梯度进行预谱裁剪以抑制噪声尖峰，从而提升多种优化器在大语言模型预训练中的性能。

Comments v2: ICML 2026

详情

AI中文摘要

虽然基于谱的优化器（如Muon）直接对更新的谱进行操作，但标准自适应方法（如AdamW）没有考虑权重和梯度的谱结构，使它们容易受到大语言模型（LLM）训练中两个经验问题的影响：（i）优化器更新可能具有较大的谱范数，可能破坏训练稳定性并降低泛化能力；（ii）随机梯度噪声可能表现出稀疏的谱尖峰，少数主导奇异值远大于其余值。我们提出SPECTRA，一个通用框架，通过（i）对更新进行后谱裁剪以施加谱范数约束，（ii）可选地对梯度进行预谱裁剪以抑制谱噪声尖峰，来解决这些问题。我们证明后谱裁剪构成了一种具有谱范数约束和权重正则化的复合Frank-Wolfe方法。我们进一步分析了预谱裁剪如何缓解稀疏谱尖峰。我们通过Newton-Schulz迭代提出了高效的软谱裁剪，避免了昂贵的SVD。在LLM预训练上的实验表明，SPECTRA对各种优化器（包括AdamW、Signum、Mars和AdEMAMix）一致地改善了验证损失，其中表现最佳的变体达到了最先进的结果。使用SPECTRA训练的模型表现出更小的权重范数，证实了谱裁剪与正则化之间的联系。

英文摘要

While spectral-based optimizers like Muon operate directly on the spectrum of updates, standard adaptive methods such as AdamW do not account for the spectral structure of weights and gradients, leaving them vulnerable to two empirical issues in large language model (LLM) training: (i) the optimizer updates can have large spectral norms, potentially destabilizing training and degrading generalization; (ii) stochastic gradient noise can exhibit sparse spectral spikes, with a few dominant singular values much larger than the rest. We propose SPECTRA, a general framework addressing these by (i) post-spectral clipping of updates to enforce spectral-norm constraints (ii) optional pre-spectral clipping of gradients to suppress spectral noise spikes. We prove that post-clipping constitutes a Composite Frank-Wolfe method with spectral-norm constraints and weight regularization. We further analyze how pre-clipping mitigates sparse spectral spikes. We propose efficient soft spectral clipping via Newton-Schulz iterations, avoiding expensive SVD. Experiments on LLM pretraining show SPECTRA uniformly improves validation loss for various optimizers, including AdamW, Signum, Mars, and AdEMAMix, with the best-performing variants achieving state-of-the-art results. Models trained with SPECTRA exhibit smaller weight norms, confirming the link between spectral clipping and regularization.

URL PDF HTML ☆

赞 0 踩 0

2603.11331 2026-05-29 cs.LG cs.AI 版本更新

Jailbreak Scaling Laws for Large Language Models: Polynomial-Exponential Crossover

大型语言模型的越狱缩放定律：多项式-指数交叉

Indranil Halder, Annesya Banerjee, Cengiz Pehlevan

发表机构 * John A. Paulson School of Engineering And Applied Sciences, Harvard University（哈佛大学约翰·A·保罗森工程与应用科学学院）； Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology（麻省理工学院脑科学与认知科学系）； Speech and Hearing Bioscience and Technology, Harvard Medical School（哈佛医学院语音与听力生物科学与技术系）； Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University（哈佛大学自然与人工智能研究学院）； Center for Brain Science, Harvard University（哈佛大学脑科学中心）

AI总结研究发现对抗性提示注入攻击可使攻击成功率从无注入时的缓慢多项式增长变为随推理样本数指数增长，并通过自旋玻璃模型从理论上解释了这一现象。

详情

AI中文摘要

对抗性攻击可以可靠地将安全对齐的大型语言模型引导至不安全行为。经验上，我们发现对抗性提示注入攻击可以将攻击成功率从无注入时观察到的缓慢多项式增长放大为随推理样本数指数增长。我们首先通过一组关于上下文安全生成分布的最小假设，确定了这两种机制的统计基础，并推导出两种缩放定律。为了进一步解释这一现象，我们提出了一个基于自旋玻璃系统的代理语言理论生成模型，该系统处于复制对称破缺状态，生成样本来自相关的吉布斯测度，并将低能、有偏大小的子集标记为不安全。我们分析展示了该模型如何自然实现最小假设。短注入提示对应于指向不安全簇中心的弱磁场，导致攻击成功率随推理样本数呈幂律缩放；而长注入提示（即强磁场）则导致指数缩放。我们在参数规模从3B到70B的广泛大型语言模型中观察到了定性一致的行为。特别是，主要趋势在多种攻击方法（如GCG和AutoDAN）以及基准数据集（如AdvBench和HarmBench）中保持稳定。

英文摘要

Adversarial attacks can reliably steer safety-aligned large language models toward unsafe behavior. Empirically, we find that adversarial prompt-injection attacks can amplify attack success rate from the slow polynomial growth observed without injection to exponential growth with the number of inference-time samples. We first identify a minimal statistical mechanism for these two regimes by giving a small set of assumptions on the distribution of safe generation across contexts under which both scaling laws follow. To explain this phenomenon further, we propose a theoretical generative model of proxy language in terms of a spin-glass system operating in a replica-symmetry-breaking regime, where generations are drawn from the associated Gibbs measure and a subset of low-energy, size-biased clusters is designated unsafe. We analytically show how this model naturally realizes the minimal assumptions. Short injected prompts correspond to a weak magnetic field aligned towards unsafe cluster centers and yield a power-law scaling of attack success rate with the number of inference-time samples, while long injected prompts, i.e., strong magnetic field, yield exponential scaling. We observe qualitatively consistent behavior across a broad range of large language models, spanning parameter scales from 3B to 70B. In particular, the main trends remain stable across multiple attack methods, such as GCG and AutoDAN, as well as across benchmark datasets such as AdvBench and HarmBench.

URL PDF HTML ☆

赞 0 踩 0

2603.10474 2026-05-29 cs.LG cs.NE cs.RO 版本更新

Muscle Synergy Priors Enhance Biomechanical Fidelity in Predictive Musculoskeletal Locomotion Simulation

肌肉协同先验增强预测性肌肉骨骼运动模拟的生物力学保真度

Ilseung Park, Eunsik Choi, Jangwhan Ahn, Jooeun Ahn

发表机构 * Department of Mechanical Engineering（机械工程系）； Carnegie Mellon University（卡内基梅隆大学）； Department of Physical Education（体育系）； Seoul National University（首尔国立大学）； Lampe Joint Department of Biomedical Engineering（生物医学工程联合部门）； UNC-Chapel Hill and NC State University（北卡罗来纳大学教堂山分校和北卡罗来纳州立大学）

AI总结提出一种生理学启发的强化学习框架，通过肌肉协同约束控制，在有限实验数据下提高了预测性人体运动模拟的生物力学保真度和泛化能力。

Comments Added a manuscript footnote stating "Project page with supplementary videos: https://ces40320.github.io/WebHomepage__Walk-RL ."

详情

AI中文摘要

人类运动源于高维神经肌肉控制，这使得预测性肌肉骨骼模拟具有挑战性。我们提出了一种生理学启发的强化学习框架，利用肌肉协同约束控制。我们从少量地面行走试验的逆肌肉骨骼分析中提取了低维协同基，并将其作为动作空间，用于训练一个肌肉驱动的三维模型，该模型在可变速度、坡度和不平坦地形上进行训练。由此产生的控制器在0.7-1.8 m/s的速度和±6°的坡度上生成了稳定的步态，并再现了关节角度、关节力矩和地面反作用力的条件依赖性调节。与无约束控制器相比，协同约束控制减少了非生理性膝关节运动学，并将膝关节力矩曲线保持在实验包络内。在各种条件下，模拟的垂直地面反作用力与人体测量值强相关，肌肉激活时间大多落在受试者间变异范围内。这些结果表明，将神经生理结构嵌入强化学习可以在有限实验数据下提高预测性人体运动模拟的生物力学保真度和泛化能力。

英文摘要

Human locomotion emerges from high-dimensional neuromuscular control, making predictive musculoskeletal simulation challenging. We present a physiology-informed reinforcement-learning framework that constrains control using muscle synergies. We extracted a low-dimensional synergy basis from inverse musculoskeletal analyses of a small set of overground walking trials and used it as the action space for a muscle-driven three-dimensional model trained across variable speeds, slopes and uneven terrain. The resulting controller generated stable gait from 0.7-1.8 m/s and on $\pm$ 6$^{\circ}$ grades and reproduced condition-dependent modulation of joint angles, joint moments and ground reaction forces. Compared with an unconstrained controller, synergy-constrained control reduced non-physiological knee kinematics and kept knee moment profiles within the experimental envelope. Across conditions, simulated vertical ground reaction forces correlated strongly with human measurements, and muscle-activation timing largely fell within inter-subject variability. These results show that embedding neurophysiological structure into reinforcement learning can improve biomechanical fidelity and generalization in predictive human locomotion simulation with limited experimental data.

URL PDF HTML ☆

赞 0 踩 0

2603.07916 2026-05-29 cs.AI cs.DB cs.LG 版本更新

Rel-MOSS: Towards Imbalanced Relational Deep Learning on Relational Databases

Rel-MOSS：面向关系数据库中不平衡关系深度学习的解决方案

Jun Yin, Peng Huo, Bangguo Zhu, Hao Yan, Senzhang Wang, Shirui Pan, Chengqi Zhang

发表机构 * Department of Data Science and Artificial Intelligence（数据科学与人工智能系）； Hong Kong Polytechnic University（香港理工大学）； School of Computer Science and Engineering（计算机科学与工程学院）； Central South University（中南大学）； School of Information and Communication Technology（信息与通信技术学院）； Griffith University（格里菲斯大学）； National Super Computing Center（国家超级计算中心）

AI总结针对关系数据库中实体分类的类别不平衡问题，提出关系中心少数类合成过采样GNN（Rel-MOSS），通过关系门控控制器和关系引导的少数类合成器提升少数类表示，在12个数据集上平均平衡准确率提升2.46%，G-Mean提升4.00%。

详情

AI中文摘要

在最近的进展中，为了实现关系数据库（RDB）上完全数据驱动的学习范式，提出了关系深度学习（RDL），将RDB结构化为异构实体图，并采用图神经网络（GNN）作为预测模型。然而，现有的RDL方法忽略了RDB中关系数据的不平衡问题，可能导致少数实体表示不足，从而在实践中产生不可用的模型。在这项工作中，我们首次研究了RDB实体分类中的类别不平衡问题，并设计了以关系为中心的少数类合成过采样GNN（Rel-MOSS），以填补当前文献中的关键空白。具体来说，为了缓解少数类相关信息被多数类信息淹没的问题，我们设计了关系门控控制器来调节来自每个单独关系类型的邻域消息。基于关系门控表示，我们进一步提出了用于过采样的关系引导的少数类合成器，该合成器整合了实体关系签名以保持关系一致性。在12个实体分类数据集上的大量实验为Rel-MOSS的优越性提供了令人信服的证据，与最先进的RDL方法和处理类别不平衡的经典方法相比，在平衡准确率和G-Mean上分别平均提高了2.46%和4.00%。

英文摘要

In recent advances, to enable a fully data-driven learning paradigm on relational databases (RDB), relational deep learning (RDL) is proposed to structure the RDB as a heterogeneous entity graph and adopt the graph neural network (GNN) as the predictive model. However, existing RDL methods neglect the imbalance problem of relational data in RDBs and risk under-representing the minority entities, leading to an unusable model in practice. In this work, we investigate, for the first time, class imbalance problem in RDB entity classification and design the relation-centric minority synthetic over-sampling GNN (Rel-MOSS), in order to fill a critical void in the current literature. Specifically, to mitigate the issue of minority-related information being submerged by majority counterparts, we design the relation-wise gating controller to modulate neighborhood messages from each individual relation type. Based on the relational-gated representations, we further propose the relation-guided minority synthesizer for over-sampling, which integrates the entity relational signatures to maintain relational consistency. Extensive experiments on 12 entity classification datasets provide compelling evidence for the superiority of Rel-MOSS, yielding an average improvement of up to 2.46% and 4.00% in terms of Balanced Accuracy and G-Mean, compared with SOTA RDL methods and classic methods for handling class imbalance.

URL PDF HTML ☆

赞 0 踩 0

2603.07860 2026-05-29 cs.LG 版本更新

Sparse Scheduled Diffusion Guidance for Inverse Problems

稀疏调度扩散引导用于逆问题

Abduragim Shtanchaev, Albina Ilina, Yazid Janati, Arip Asadulaev, Martin Takac, Eric Moulines

发表机构 * MBZUAI（穆扎伊人工智能研究院）； Institute of Foundation Models（基础模型研究所）； EPITA

AI总结提出Spin方法，通过从中间时间步开始后验采样并仅在调度步骤应用轻量级校正，实现高效逆问题求解，在FFHQ和ImageNet上速度提升2-50倍且内存更低。

详情

AI中文摘要

预训练扩散模型是贝叶斯逆问题的有效先验，但使用这些先验进行后验采样通常成本高昂，因为数据一致性引导应用于整个反向轨迹。现有方法表明，有时可以避免通过去噪器的向量-雅可比乘积，但它们通常仍然依赖于整个轨迹的密集引导或昂贵的内部求解。我们提出了稀疏调度扩散引导用于逆问题（Spin），这是一种避免从纯噪声开始后验采样的求解器。Spin首先在中间时间步$t_*$从后验时间边际采样，然后将该状态作为引导反向扩散过程的热启动。在引导时间，Spin不是在每个去噪步骤强制执行测量约束，而是仅在调度的时间步应用轻量级校正，此时去噪器仍能清理伪影。由此产生的过程将先验细化与数据一致性解耦：先验提供去噪，而轻量级像素空间优化强制执行测量约束，无需通过去噪器或解码器进行反向传播。在FFHQ和ImageNet上的线性和非线性逆问题中，Spin以显著更好的运行时-内存曲线实现了有竞争力的重建质量，在像素空间模型上运行速度提高2倍，在潜在扩散模型上运行速度提高50倍，且内存成本更低。

英文摘要

Pretrained diffusion models are effective priors for Bayesian inverse problems, but posterior sampling with these priors is often costly because data-consistency guidance is applied throughout the full reverse trajectory. Existing methods have shown that vector-Jacobian products through the denoiser can sometimes be avoided, yet they typically still rely on dense guidance through the full trajectory or expensive inner solves. We introduce Sparse Scheduled Diffusion Guidance for Inverse Problems (Spin), a solver that avoids starting posterior sampling from pure noise. Spin first samples from a posterior time-marginal at an intermediate timestep $t_*$, and then uses that state as a warm start for a guided reverse diffusion process. At guidance time, instead of enforcing the measurement constraint at every denoising step, Spin applies lightweight corrections only at scheduled timesteps where the denoiser can still clean up artifacts. The resulting procedure decouples prior refinement from data consistency: the prior supplies denoising, while lightweight pixel-space optimization enforces the measurement constraint without backpropagation through the denoiser or decoder. Across linear and nonlinear inverse problems on FFHQ and ImageNet, Spin achieves competitive reconstruction quality with a substantially better runtime--memory profile, running 2x faster on pixel-space models and up to 50x faster on latent diffusion models, with lower memory costs.

URL PDF HTML ☆

赞 0 踩 0

2603.05488 2026-05-29 cs.CL cs.AI cs.LG 版本更新

Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought

推理剧场：从思维链中分离模型信念

Siddharth Boppana, Annabel Ma, Max Loeffler, Raphael Sarfati, Eric Bigelow, Atticus Geiger, Owen Lewis, Jack Merullo

发表机构 * Harvard University, Cambridge, MA（哈佛大学，马萨诸塞州剑桥）

AI总结通过激活探针、早期强制回答和思维链监控器分析，发现推理模型存在表演性思维链现象，并利用探针引导的早期退出实现高效计算。

详情

AI中文摘要

我们提供了推理模型中表演性思维链（CoT）的证据，即模型对其最终答案变得非常自信，但继续生成令牌而不揭示其内部信念。我们的分析比较了两个大型模型（DeepSeek-R1 671B 和 GPT-OSS 120B）中的激活探针、早期强制回答和思维链监控器，并发现了任务难度特定的差异：模型的最终答案可以从思维链中远早于监控器能够判断的激活中解码，特别是对于基于回忆的简单MMLU问题。我们将此与困难的多跳GPQA-Diamond问题中的真正推理进行对比。尽管如此，转折点（例如回溯、“啊哈”时刻）几乎只出现在探针显示大信念转变的响应中，表明这些行为追踪的是真正的不确定性，而不是学到的“推理剧场”。最后，探针引导的早期退出在MMLU上减少了高达80%的令牌，在GPQA-Diamond上减少了30%，且准确率相似，将注意力探针定位为检测表演性推理和实现自适应计算的高效工具。

英文摘要

We provide evidence of performative chain-of-thought (CoT) in reasoning models, where a model becomes strongly confident in its final answer, but continues generating tokens without revealing its internal belief. Our analysis compares activation probing, early forced answering, and a CoT monitor across two large models (DeepSeek-R1 671B & GPT-OSS 120B) and find task difficulty-specific differences: The model's final answer is decodable from activations far earlier in CoT than a monitor is able to say, especially for easy recall-based MMLU questions. We contrast this with genuine reasoning in difficult multihop GPQA-Diamond questions. Despite this, inflection points (e.g., backtracking, 'aha' moments) occur almost exclusively in responses where probes show large belief shifts, suggesting these behaviors track genuine uncertainty rather than learned "reasoning theater." Finally, probe-guided early exit reduces tokens by up to 80% on MMLU and 30% on GPQA-Diamond with similar accuracy, positioning attention probing as an efficient tool for detecting performative reasoning and enabling adaptive computation.

URL PDF HTML ☆

赞 0 踩 0

2603.05002 2026-05-29 cs.LG math.OC stat.ML 版本更新

Non-Euclidean Gradient Descent Operates at the Edge of Stability

非欧几里得梯度下降在稳定性边缘运行

Rustem Islamov, Michael Crawshaw, Jeremy Cohen, Robert Gower

发表机构 * University of Basel（巴塞尔大学）； George Mason University（乔治·马歇尔大学）； Flatiron Institute（Flatiron研究所）

AI总结本文通过方向光滑性解释梯度下降中的稳定性边缘现象，并将其推广到非欧几里得范数，定义广义尖锐度，实验表明非欧几里得梯度下降也表现出渐进尖锐化和阈值振荡。

详情

AI中文摘要

稳定性边缘（EoS）是一种现象，其中Hessian矩阵的尖锐度（最大特征值）在梯度下降（GD）中接近并徘徊在稳定性阈值$2/η$附近（步长为$η$）。尽管（表面上）违反了经典光滑性假设，但EoS在深度学习中已被广泛观察到，其理论基础仍不完整。我们通过方向光滑性[Mishkin et al., 2024]的视角提供了对EoS的解释。这种解释自然地扩展到非欧几里得范数，我们用它来定义任意范数下的广义尖锐度。我们的广义尖锐度度量包括先前研究的普通GD和预处理GD作为特例，以及尚未研究EoS的方法，例如$\ell_{\infty}$下降、块坐标下降、谱GD及其归一化版本。通过在神经网络上的实验，我们表明具有广义尖锐度的非欧几里得GD也表现出渐进尖锐化，随后在阈值$2/η$附近或之上振荡。在实践中，我们的框架提供了一种几何感知的谱诊断方法，可应用于广泛的非欧几里得梯度方法类别。

英文摘要

The Edge of Stability (EoS) is a phenomenon where the sharpness (largest eigenvalue) of the Hessian approaches and then hovers near the stability threshold $2/η$ during gradient descent (GD) with step size $η$. Despite (apparently) violating classical smoothness assumptions, EoS has been widely observed in deep learning, but its theoretical foundations remain incomplete. We provide an interpretation of EoS through the lens of Directional Smoothness [Mishkin et al., 2024]. This interpretation naturally extends to non-Euclidean norms, which we use to define generalized sharpness under an arbitrary norm. Our generalized sharpness measure includes previously studied vanilla GD and preconditioned GD as special cases, as well as methods for which EoS has not been studied, such as $\ell_{\infty}$-descent, Block CD, Spectral GD, and their normalized versions. Through experiments on neural networks, we show that non-Euclidean GD with our generalized sharpness also exhibits progressive sharpening followed by oscillations around or above the threshold $2/η$. Practically, our framework provides a geometry-aware spectral diagnostic that can be applied across a broad class of non-Euclidean gradient methods.

URL PDF HTML ☆

赞 0 踩 0

2603.03805 2026-05-29 cs.LG cs.AI cs.DB 版本更新

Relational In-Context Learning via Synthetic Pre-training with Structural Prior

通过结构先验的合成预训练实现关系上下文学习

Yanbo Wang, Jiaxuan You, Chuan Shi, Muhan Zhang

发表机构 * Institute for Artificial Intelligence, Peking University（北京大学人工智能研究院）； University of Illinois at Urbana-Champaign（伊利诺伊大学香槟分校）； Institute of Computing Technology, Beijing University of Post（北京邮电大学计算机学院）； State Key Laboratory of General Artificial Intelligence（通用人工智能国家重点实验室）

AI总结提出RDB-PFN，首个仅通过合成数据训练的关系基础模型，利用结构因果模型生成多样关系数据库，实现对新数据库的即时上下文学习，在19个真实关系预测任务上优于现有表格基础模型。

详情

AI中文摘要

关系数据库是现代业务的支柱，但它们缺乏与文本或视觉领域相当的基础模型。一个关键障碍是高质量的关系数据库是私有的、稀缺的且结构异构，使得互联网规模的预训练不可行。为了克服这种数据稀缺性，我们引入了RDB-PFN，这是第一个完全通过合成数据训练的关系基础模型。受先验数据拟合网络的启发，其中从结构因果模型生成的合成数据能够实现单表推理，我们设计了一个关系先验生成器，从零开始创建无限多样的关系数据库流。在超过200万个合成单表和关系任务上进行预训练后，RDB-PFN通过真正的上下文学习学会即时适应任何新数据库。实验表明，RDB-PFN在19个真实世界的关系预测任务上实现了强大的少样本性能，优于在相同DFS线性化输入上评估的最先进的表格基础模型，同时使用轻量级架构和快速推理。代码可在https://github.com/MuLabPKU/RDBPFN获取。

英文摘要

Relational Databases (RDBs) are the backbone of modern business, yet they lack foundation models comparable to those in text or vision. A key obstacle is that high-quality RDBs are private, scarce, and structurally heterogeneous, making internet-scale pre-training infeasible. To overcome this data scarcity, we introduce RDB-PFN, the first relational foundation model trained purely via synthetic data. Inspired by Prior-Data Fitted Networks (PFNs), where synthetic data generated from Structural Causal Models (SCMs) enables reasoning on single tables, we design a Relational Prior Generator to create an infinite stream of diverse RDBs from scratch. Pre-training on over 2 million synthetic single-table and relational tasks, RDB-PFN learns to adapt to any new database instantly via genuine in-context learning. Experiments show that RDB-PFN achieves strong few-shot performance on 19 real-world relational prediction tasks, outperforming state-of-the-art tabular foundation models evaluated on the same DFS-linearized inputs, while using a lightweight architecture and fast inference. The code is available at https://github.com/MuLabPKU/RDBPFN.

URL PDF HTML ☆

赞 0 踩 0

2603.03503 2026-05-29 cs.CV cs.LG 版本更新

Geographically-Weighted Weakly Supervised Bayesian High-Resolution Transformer for 200m Resolution Pan-Arctic Sea Ice Concentration Mapping and Uncertainty Estimation using Sentinel-1, RCM, and AMSR2 Data

地理加权弱监督贝叶斯高分辨率Transformer：利用Sentinel-1、RCM和AMSR2数据实现200米分辨率泛北极海冰密集度制图与不确定性估计

Mabel Heffring, Lincoln Linlin Xu

发表机构 * Department of Geomatics Engineering, Schulich School of Engineering, University of Calgary（地质工程系，Schulich 工程学院，卡尔加里大学）

AI总结提出一种贝叶斯高分辨率Transformer模型，结合地理加权弱监督损失函数和决策级数据融合，利用Sentinel-1、RCM和AMSR2数据实现200米分辨率泛北极海冰密集度制图与不确定性量化。

Comments 23 pages, 20 figures

详情

DOI: 10.1016/j.isprsjprs.2026.05.032

AI中文摘要

尽管具有可靠对应不确定性的泛北极海冰高分辨率制图对于业务化海冰密集度（SIC）制图至关重要，但由于冰特征信号的细微性、SIC标签的不精确性、模型不确定性和数据异质性等关键挑战，这是一项艰巨的任务。本研究提出了一种新颖的贝叶斯高分辨率Transformer方法，利用Sentinel-1、RADARSAT星座任务（RCM）和先进微波扫描辐射计2（AMSR2）数据，实现200米分辨率泛北极SIC制图和不确定性量化。首先，为了改进微小和细微海冰特征（例如裂缝/水道、融池和浮冰）的提取，我们设计了一种新颖的高分辨率Transformer模型，该模型具有全局和局部模块，能够更好地区分海冰模式的细微差异。其次，为了解决低分辨率和非精确SIC标签的问题，我们设计了一种地理加权弱监督损失函数，在区域级别而非像素级别监督模型，并优先考虑纯开阔水和冰盖特征，同时减轻边缘冰区（MIZ）中模糊性的影响。第三，为了改进不确定性量化，我们设计了所提Transformer模型的贝叶斯扩展，将其参数视为随机变量，以更有效地捕获不确定性。第四，为了解决数据异质性，我们在决策级融合三种不同类型的数据（Sentinel-1、RCM和AMSR2），以改进SIC制图和不确定性量化。所提方法在2021年和2025年泛北极最小范围条件下进行了评估。结果表明，所提模型在使用Sentinel-1数据时实现了0.70的总体特征检测精度，同时保留了泛北极SIC模式（相对于ARTIST海冰产品，Sentinel-1 R² = 0.90）。

英文摘要

Although high-resolution mapping of pan-Arctic sea ice with reliable corresponding uncertainty is essential for operational sea ice concentration (SIC) charting, it is a difficult task due to key challenges, such as the subtle nature of ice signature features, inexact SIC labels, model uncertainty, and data heterogeneity. This study presents a novel Bayesian High-Resolution Transformer approach for 200 meter resolution pan-Arctic SIC mapping and uncertainty quantification using Sentinel-1, RADARSAT Constellation Mission (RCM), and Advanced Microwave Scanning Radiometer 2 (AMSR2) data. First, to improve small and subtle sea ice feature (e.g., cracks/leads, ponds, and ice floes) extraction, we design a novel high-resolution Transformer model with both global and local modules that can better discern the subtle differences in sea ice patterns. Second, to address low-resolution and inexact SIC labels, we design a geographically-weighted weakly supervised loss function to supervise the model at region level instead of pixel level, and to prioritize pure open water and ice pack signatures while mitigating the impact of ambiguity in the marginal ice zone (MIZ). Third, to improve uncertainty quantification, we design a Bayesian extension of the proposed Transformer model, treating its parameters as random variables to more effectively capture uncertainties. Fourth, to address data heterogeneity, we fuse three different data types (Sentinel-1, RCM, and AMSR2) at decision-level to improve both SIC mapping and uncertainty quantification. The proposed approach is evaluated under pan-Arctic minimum-extent conditions in 2021 and 2025. Results demonstrate that the proposed model achieves 0.70 overall feature detection accuracy using Sentinel-1 data, while also preserving pan-Arctic SIC patterns (Sentinel-1 R\textsuperscript{2} = 0.90 relative to the ARTIST Sea Ice product).

URL PDF HTML ☆

赞 0 踩 0

2602.21565 2026-05-29 cs.LG 版本更新

Routing by Reaching: Composition of Pre-trained GFlowNets for Multi-Objective Generation

通过到达进行路由：预训练GFlowNets的组合用于多目标生成

Seokwon Yoon, Youngbin Choi, Seunghyuk Cho, Seungbeom Lee, MoonJeong Park, Dongwoo Kim

发表机构 * Department of Computer Science \& Engineering, POSTECH, South Korea ； Graduate School of Artificial Intelligence, POSTECH, South Korea

AI总结提出一个在推理时组合预训练GFlowNets的框架，无需微调或重新训练即可快速适应多目标生成任务，并证明在线性标量化下精确恢复目标分布，对非线性算子通过畸变因子量化近似质量。

Comments Appears in the 43rd International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

生成流网络（GFlowNets）学习按照奖励函数比例采样多样化的候选，使其非常适合科学发现，其中探索多个有希望的解决方案至关重要。进一步将GFlowNets扩展到多目标设置已引起越来越多的兴趣，因为现实世界的应用通常涉及多个相互冲突的目标。然而，现有方法需要对每个目标组合进行联合训练，这意味着目标集的任何变化都需要从头开始重新训练。我们提出了一个在推理时组合预训练GFlowNets的框架，无需微调或重新训练即可实现快速适应。重要的是，我们的框架是灵活的，能够处理从线性标量化到复杂非线性算子的多种奖励组合，这些在以前的文献中通常分开处理。我们证明，我们的方法在线性标量化下精确恢复目标分布，并通过畸变因子量化非线性算子的近似质量。在合成二维网格和真实分子生成任务上的实验表明，我们的方法达到了与基线相当的性能。

英文摘要

Generative Flow Networks (GFlowNets) learn to sample diverse candidates in proportion to a reward function, making them well-suited for scientific discovery, where exploring multiple promising solutions is crucial. Further extending GFlowNets to multi-objective settings has attracted growing interest as real-world applications often involve multiple, conflicting objectives. However, existing approaches require joint training for each combination of objectives, meaning that any change in the objective set necessitates retraining from scratch. We propose a framework that composes pre-trained GFlowNets at inference time, enabling rapid adaptation without fine-tuning or retraining. Importantly, our framework is flexible, capable of handling diverse reward combinations ranging from linear scalarization to complex nonlinear operators, which are often handled separately in previous literature. We prove that our method exactly recovers the target distribution for linear scalarization, and quantify the approximation quality for nonlinear operators through a distortion factor. Experiments on a synthetic 2D grid and real-world molecule generation tasks demonstrate that our approach achieves performance comparable to baselines.

URL PDF HTML ☆

赞 0 踩 0

2602.18196 2026-05-29 cs.LG 版本更新

RAT+: Train Dense, Infer Sparse -- Recurrence Augmented Attention for Dilated Inference

RAT+：密集训练，稀疏推理——用于扩张推理的循环增强注意力

Xiuying Wei, Caglar Gulcehre

发表机构 * CLAIRE lab at EPFL, Lausanne, Switzerland（EPFL 拉沃斯实验室）

AI总结提出RAT+架构，通过密集预训练和循环增强注意力，使模型在推理时可灵活切换为稀疏扩张注意力，大幅降低计算和缓存开销，同时保持高精度。

Comments Accepted by ICML2026

详情

AI中文摘要

结构化扩张注意力具有吸引人的推理效率调节旋钮：它将注意力的FLOPs和KV缓存大小减少扩张大小D的倍数，同时保持长程连接。虽然先前的工作通过从头训练每个配置来研究它，但直接将预训练注意力模型稀疏化为扩张模式会导致严重的精度下降，阻碍跨推理场景的灵活重用。我们引入RAT+，一种密集预训练架构，通过全序列循环和主动循环学习增强注意力。单个RAT+模型密集预训练一次，然后可以在推理时灵活切换到扩张注意力（可选局部窗口）或混合层/头组合，仅需短期的10亿token分辨率适应，而无需重新训练单独的稀疏模型。在100B token上训练的1.5B参数模型中，RAT+在D=16时紧密匹配密集精度，在D=64时在常识推理和LongBench任务上下降约2-3个点。我们进一步扩展到2.6B和7.6B参数，观察到更有希望的性能（例如，在注意力FLOPs和KV缓存大小减少64倍的情况下，平均精度损失1个点）。代码可在https://github.com/wimh966/rat-plus获取。

英文摘要

Structured dilated attention has an appealing inference-time efficiency knob: it reduces the FLOPs of attention and the KV cache size by a factor of the dilation size D, while preserving long-range connectivity. While prior work studies it by training each configuration from scratch, directly sparsifying a pretrained attention model into a dilated pattern leads to severe accuracy degradation, preventing flexible reuse across inference scenarios. We introduce RAT+, a dense-pretraining architecture that augments attention with full-sequence recurrence and active recurrence learning. A single RAT+ model is pretrained densely once and can then be flexibly switched at inference time to dilated attention (optionally with local windows) or hybrid layer/head compositions, requiring only a short 1B-token resolution adaptation rather than retraining separate sparse models. At 1.5B parameters trained on 100B tokens, RAT+ closely matches dense accuracy at D = 16, and drops by about 2-3 points at D = 64 on commonsense reasoning and LongBench tasks. We further scale to 2.6B and 7.6B parameters and observe even more promising performance (e.g., a 1-point average accuracy loss with a 64x reduction in attention FLOPs and KV cache size). Code is available at https://github.com/wimh966/rat-plus.

URL PDF HTML ☆

赞 0 踩 0

2602.16610 2026-05-29 cs.CL cs.AI cs.LG 版本更新

Who can we trust? LLM-as-a-jury for Comparative Assessment

我们该信任谁？LLM作为陪审团进行比较评估

Mengjie Qian, Guangzhi Sun, Mark J. F. Gales, Kate M. Knill

发表机构 * Department of Engineering, University of Cambridge, UK（剑桥大学工程系）

AI总结针对LLM作为评估者时判断不一致和可靠性差异的问题，提出BT-sigma模型，通过引入判别参数联合推断项目排名和法官可靠性，优于平均聚合方法。

Comments Accepted to ICML 2026

详情

AI中文摘要

大型语言模型（LLMs）越来越多地被用作自动评估器，用于自然语言生成评估，通常采用成对比较判断。现有方法通常依赖单一法官或聚合多个法官并假设其可靠性相同。在实践中，LLM法官在不同任务和评估方面的表现差异很大，其判断概率可能存在偏差和不一致。此外，用于法官校准的人工标注监督可能不可用。我们首先通过实验证明LLM比较概率的不一致性存在，并表明这限制了直接基于概率排名的有效性。为解决此问题，我们研究了LLM作为陪审团的设置，并提出了BT-sigma，这是Bradley-Terry模型的一种法官感知扩展，为每个法官引入一个判别参数，仅从成对比较中联合推断项目排名和法官可靠性。在基准NLG评估数据集上的实验表明，BT-sigma始终优于基于平均的聚合方法，并且学习到的判别参数与LLM判断的循环一致性的独立度量高度相关。进一步分析揭示，BT-sigma可以解释为一种无监督校准机制，通过建模法官可靠性来改进聚合。

英文摘要

Large language models (LLMs) are increasingly applied as automatic evaluators for natural language generation assessment often using pairwise comparative judgements. Existing approaches typically rely on single judges or aggregate multiple judges assuming equal reliability. In practice, LLM judges vary substantially in performance across tasks and evaluation aspects, and their judgment probabilities may be biased and inconsistent. Furthermore, human-labelled supervision for judge calibration may be unavailable. We first empirically demonstrate that inconsistencies in LLM comparison probabilities exist and show that it limits the effectiveness of direct probability-based ranking. To address this, we study the LLM-asa-jury setting and propose BT-sigma, a judge-aware extension of the Bradley-Terry model that introduces a discriminator parameter for each judge to jointly infer item rankings and judge reliability from pairwise comparisons alone. Experiments on benchmark NLG evaluation datasets show that BT-sigma consistently outperforms averaging-based aggregation methods, and that the learned discriminators strongly correlate with independent measures of the cycle consistency of LLM judgments. Further analysis reveals that BT-sigma can be interpreted as an unsupervised calibration mechanism that improves aggregation by modelling judge reliability.

URL PDF HTML ☆

赞 0 踩 0

2602.16449 2026-05-29 cs.LG cs.AI stat.ML 版本更新

GICDM: Mitigating Hubness for Reliable Distance-Based Generative Model Evaluation

GICDM: 缓解枢纽性以实现可靠的基于距离的生成模型评估

Nicolas Salvy, Hugues Talbot, Bertrand Thirion

发表机构 * Inria, Palaiseau, France（法国帕莱索研究所）

AI总结针对生成模型评估中高维嵌入空间的枢纽性现象，提出GICDM方法（基于迭代上下文不相似度度量），通过多尺度扩展校正邻域估计，恢复可靠度量并与人类评估对齐。

Comments Forty-third International Conference on Machine Learning, 2026

2602.15382 2026-05-29 cs.CL cs.CV cs.LG 版本更新

The Vision Wormhole: Latent-Space Communication in Heterogeneous Multi-Agent Systems

视觉虫洞：异构多智能体系统中的潜在空间通信

Xiaoze Liu, Ruowang Zhang, Weichen Yu, Siheng Xiong, Liu He, Feijie Wu, Hoin Jung, Matt Fredrikson, Xiaoqian Wang, Jing Gao

发表机构 * Purdue University（普渡大学）； Contextual AI（情境人工智能）； Carnegie Mellon University（卡内基梅隆大学）； Georgia Institute of Technology（佐治亚理工学院）

AI总结提出Vision Wormhole框架，通过通用视觉编解码器将推理轨迹映射到共享连续空间，实现异构VLM间的潜在状态传输，无需配对翻译器，降低对齐复杂度并提升效率。

Comments Preprint. Work in progress

详情

AI中文摘要

由大型语言模型驱动的多智能体系统（MAS）实现了先进的协作推理，但仍受限于离散文本通信，这带来了运行时开销和信息量化损失。虽然潜在状态传输提供了一种替代方案，但现有方法要么假设同构的发送器-接收器架构，要么依赖于特定配对的学得翻译器，限制了跨具有不连续流形的不同模型族的可扩展性。我们将为自然图像训练的视觉-语言模型（VLM）的视觉界面重新概念化为异构智能体之间的连续通信通道，并将这一思想实例化为 extbf{视觉虫洞}：一种通用视觉编解码器，将推理轨迹映射到共享的连续参考空间，并将其注入接收器的视觉通路，实现无需配对翻译器的跨架构潜在状态传输。该框架采用中心辐射拓扑，将对齐复杂度从$O(N^2)$降低到$O(N)$，并通过无标签的教师-学生蒸馏针对文本通道进行训练，无需并行隐藏状态监督。在异构VLM族（Qwen-VL、Gemma、SmolVLM2、LFM2.5-VL）和九个推理基准上的大量实验表明，视觉虫洞在大多数评估设置中减少了端到端挂钟时间，并产生了正的平均宏$Δ$-准确率。

英文摘要

Multi-Agent Systems (MAS) powered by Large Language Models have unlocked advanced collaborative reasoning, yet they remain bottlenecked by discrete text communication, which imposes runtime overhead and information quantization loss. While latent state transfer offers an alternative, existing approaches either assume homogeneous sender--receiver architectures or rely on pair-specific learned translators, limiting scalability across diverse model families with disjoint manifolds. We reconceptualize the visual interface of Vision-Language Models (VLMs), trained for natural images, as a continuous communication channel between heterogeneous agents, and instantiate this idea as the \textbf{Vision Wormhole}: a Universal Visual Codec maps reasoning traces into a shared continuous reference space and injects them into the receiver's visual pathway, yielding cross-architecture latent state transfer without per-pair translators. The framework adopts a hub-and-spoke topology that reduces alignment complexity from $O(N^2)$ to $O(N)$, and is trained by label-free teacher--student distillation against the text channel, requiring no parallel hidden-state supervision. Extensive experiments across heterogeneous VLM families (Qwen-VL, Gemma, SmolVLM2, LFM2.5-VL) and nine reasoning benchmarks show that the Vision Wormhole reduces end-to-end wall-clock time across most evaluated settings and yields positive macro-average $Δ$-accuracy.

URL PDF HTML ☆

赞 0 踩 0

2602.15239 2026-05-29 cs.LG 版本更新

大型语言模型的罕见事件分析

Jake McAllister Dorman, Edward Gillman, Dominic C. Rose, Jamie F. Mair, Juan P. Garrahan

发表机构 * School of Physics and Astronomy, University of Nottingham（物理与天文学学院，诺丁汉大学）

AI总结本文提出一个端到端框架，用于系统分析大型语言模型中的罕见事件，涵盖理论、高效生成策略、概率估计和误差分析，并通过实例展示其应用。

Comments ICML 2026 Oral Spotlight

2602.06361 2026-05-29 cs.GT cs.IT cs.LG math.IT stat.ML 版本更新

Envy-Free Allocation of Indivisible Goods via Noisy Queries

通过噪声查询实现不可分割物品的无嫉妒分配

Zihan Li, Yan Hao Ling, Jonathan Scarlett, Warut Suksompong

发表机构 * Meta（Meta公司）； National University of Singapore（国立新加坡大学）； Nanyang Technological University（南洋理工大学）

AI总结针对不可直接观测估值、仅能通过噪声查询获取信息的不可分割物品分配问题，在双智能体高斯噪声和有界估值设定下，推导了实现无嫉妒分配所需查询次数的上下界，并证明了当最优分配负嫉妒值Δ不太小时最优查询次数与m^{2.5}/Δ^2成比例。

Comments ICML 2026

2602.05961 2026-05-29 cs.LG stat.ML 版本更新

Discrete diffusion samplers and bridges: Off-policy algorithms and applications in latent spaces

离散扩散采样器与桥：离策略算法及其在潜在空间中的应用

Arran Carter, Sanghyeok Choi, Kirill Tamogashev, Víctor Elvira, Esmeralda S. Whitammer

发表机构 * University of Edinburgh（爱丁堡大学）； CIFAR Fellow（卡尔·弗里德里希·列文森研究员）

AI总结提出离策略训练技术改进离散扩散采样器性能，并首次引入离散域的数据到能量薛定谔桥训练，应用于图像生成模型的离散潜在空间中的无数据后验采样。

Comments ICML 2026. Code: https://github.com/mmacosha/offpolicy-discrete-diffusion-samplers-and-bridges

详情

AI中文摘要

从已知归一化常数的分布 $p(x) \propto e^{-\mathcal{E}(x)}$ 中采样是统计学中一个重要且具有挑战性的问题。近年来，出现了一类新的摊销采样算法，通常称为扩散采样器，能够从未归一化的密度中快速高效地采样。这类算法在连续空间采样任务中已被广泛研究；然而，它们在离散空间问题中的应用仍 largely 未被探索。尽管该领域已取得一些进展，但离散扩散采样器并未充分利用连续空间采样中常用的思想。在本文中，我们提出通过引入离散扩散采样器的离策略训练技术来弥合这一差距。我们证明这些技术在已有和新颖的合成基准上提高了离散采样器的性能。接下来，我们将离散扩散采样器推广到两个任意分布之间的桥接任务，首次为离散域引入了数据到能量薛定谔桥训练。最后，我们展示了所提出的扩散采样器在图像生成模型的离散潜在空间中进行无数据后验采样的应用。

英文摘要

Sampling from a distribution $p(x) \propto e^{-\mathcal{E}(x)}$ known up to a normalising constant is an important and challenging problem in statistics. Recent years have seen the rise of a new family of amortised sampling algorithms, commonly referred to as diffusion samplers, that enable fast and efficient sampling from an unnormalised density. Such algorithms have been widely studied for continuous-space sampling tasks; however, their application to problems in discrete space remains largely unexplored. Although some progress has been made in this area, discrete diffusion samplers do not take full advantage of ideas commonly used for continuous-space sampling. In this paper, we propose to bridge this gap by introducing off-policy training techniques for discrete diffusion samplers. We show that these techniques improve the performance of discrete samplers on both established and new synthetic benchmarks. Next, we generalise discrete diffusion samplers to the task of bridging between two arbitrary distributions, introducing data-to-energy Schrödinger bridge training for the discrete domain for the first time. Lastly, we showcase the application of the proposed diffusion samplers to data-free posterior sampling in the discrete latent spaces of image generative models.

URL PDF HTML ☆

赞 0 踩 0

2602.03582 2026-05-29 cs.LG 版本更新

Optimization and Generation in Aerodynamics Inverse Design

气动逆设计中的优化与生成

Huaguan Chen, Ning Lin, Luxi Chen, Jiacheng Cen, Rui Zhang, Wenbing Huang, Chongxuan Li, Hao Sun

AI总结本文提出一个概率框架，将视觉特征保持与气动性能优化统一为目标，通过重加权学习分布实现优化和引导生成，实验表明在车辆和飞机设计中显著降低阻力同时保持视觉一致性。

详情

AI中文摘要

通过可用信息桥接功能相似性与表征相似性

Antonio Almudévar, Alfonso Ortega

发表机构 * ViVoLab, Aragón Institute for Engineering Research (I3A), University of Zaragoza, Zaragoza, Spain（ViVoLab，阿拉贡工程研究院（I3A），萨拉戈萨大学，西班牙萨拉戈萨）

AI总结提出一个基于可用信息的统一框架，从功能相似性、表征相似性及其关系三个维度进行理论和实证综合，揭示表征相似性是功能相似性的充分非必要条件。

详情

AI中文摘要

我们提出了一个通过可用信息量化表征之间相似性的统一框架，在三个关键维度上提供了严格的理论和实证综合。首先，针对功能相似性，我们建立了拼接性能与条件互信息之间的形式化联系。我们进一步揭示拼接本质上是非对称的，证明稳健的功能比较需要双向分析而非单向映射。其次，关于表征相似性，我们发现基于重构的指标和标准工具（如CKA、RSA）在特定约束下充当可用信息的估计量。关键的是，我们表明相似性是相对于预测族的能力而言的：对刚性观察者而言不同的表征，对更具表达力的观察者可能是相同的。第三，我们证明表征相似性是功能相似性的充分非必要条件。我们通过任务粒度层次统一这些概念：复杂任务上的相似性保证了任何更粗粒度衍生任务上的相似性，将表征相似性确立为最大粒度的极限：输入重构。

英文摘要

We present a unified framework for quantifying the similarity between representations through the lens of \textit{usable} information, offering a rigorous theoretical and empirical synthesis across three key dimensions. First, addressing functional similarity, we establish a formal link between stitching performance and conditional mutual information. We further reveal that stitching is inherently asymmetric, demonstrating that robust functional comparison necessitates a bidirectional analysis rather than a unidirectional mapping. Second, concerning representational similarity, we find that reconstruction-based metrics and standard tools (e.g., CKA, RSA) act as estimators of usable information under specific constraints. Crucially, we show that similarity is relative to the capacity of the predictive family: representations that appear distinct to a rigid observer may be identical to a more expressive one. Third, we demonstrate that representational similarity is sufficient but not necessary for functional similarity. We unify these concepts through a task-granularity hierarchy: similarity on a complex task guarantees similarity on any coarser derivative, establishing representational similarity as the limit of maximum granularity: input reconstruction.

URL PDF HTML ☆

赞 0 踩 0

2601.21564 2026-05-29 cs.LG 版本更新

从自回归到掩码扩散语言模型的后训练中的机制转变

Injin Kong, Hyoungjoon Lee, Yohan Jo

发表机构 * Graduate School of Data Science, Seoul National University（首尔国立大学数据科学研究生院）； Department of Biosystems & Biomaterials Science and Engineering, Seoul National University（首尔国立大学生物系统与生物材料科学与工程系）

AI总结通过比较电路分析，发现后训练得到的掩码扩散模型在结构上根据任务保留或重组自回归电路，在语义上从局部专业化转向分布式整合，表明扩散后训练是内部计算的深度重组。

详情

AI中文摘要

将预训练的自回归模型（ARMs）后训练为掩码扩散模型（MDMs）已成为一种克服顺序生成局限性的经济有效方法。然而，后训练的MDMs是否获得了真正的新计算机制，还是仅仅以非自回归形式重新表达了自回归计算，仍不清楚。通过对ARMs及其从相同骨干网络后训练得到的MDM对应物进行电路比较分析，我们揭示了两个互补的重组轴。在结构上，转变是任务依赖的：MDMs在局部因果任务上保留自回归电路，但在全局任务上放弃继承的路径并将计算前置到早期层。在语义上，转变在不同机制间是一致的：ARMs中尖锐的局部专业化让位于MDMs中的分布式整合。这些发现共同表明，扩散后训练并非生成过程的表面变化，而是内部计算的重组，其深度取决于任务。

英文摘要

Post-training pretrained autoregressive models (ARMs) into masked diffusion models (MDMs) has emerged as a cost-effective way to overcome the limitations of sequential generation. Yet it remains unclear whether post-trained MDMs acquire genuinely new computational mechanisms or merely re-express autoregressive computation in a non-autoregressive form. Through a comparative circuit analysis of ARMs and their MDM counterparts post-trained from the same backbones, we uncover two complementary axes of reorganization. Structurally, the shift is task-dependent: MDMs preserve autoregressive circuitry on locally causal tasks but abandon inherited pathways and front-load computation into early layers on global tasks. Semantically, the shift is consistent across regimes: sharp, localized specialization in ARMs gives way to distributed integration in MDMs. Together, these findings show that diffusion post-training is not a surface-level change in the generation procedure but a reorganization of internal computation whose depth depends on the task.

URL PDF HTML ☆

赞 0 踩 0

2601.04765 2026-05-29 cs.CL cs.AI cs.LG physics.comp-ph 版本更新

Differential syntactic and semantic encoding in LLMs

大型语言模型中句法与语义的差异编码

Santiago Acevedo, Alessandro Laio, Marco Baroni

发表机构 * Catalan Institute of Research and Advanced Studies (ICREA) and Universitat Pompeu Fabra (UPF)（加泰罗尼亚研究与高级科学研究所（ICREA）和庞培法华大学（UPF））

AI总结本研究通过平均共享句法结构或语义的句子隐藏表示向量，发现大型语言模型（以DeepSeek-V3为例）的内部层表示中句法和语义信息至少部分线性编码，且两者编码轮廓不同，可一定程度解耦。

Comments Published as conference paper at ICML 2026

2601.00065 2026-05-29 cs.LG cs.CL cs.CR 版本更新

When the Same Coefficients Reach Different Places: Asymmetric Realizability in Transplanting Tokenizers across Large Language Models

当相同系数到达不同位置：跨大型语言模型移植分词器中的非对称可实现性

Xiaoze Liu, Weichen Yu, Matt Fredrikson, Xiaoqian Wang, Jing Gao

发表机构 * Purdue University（普渡大学）； Carnegie Mellon University（卡内基梅隆大学）

AI总结本文发现跨词汇模型组合中分词器移植的几何结构非对称性，并构造了“破坏令牌”以利用该漏洞，通过实验验证其在多个模型对中的存在性及对微调、谱滤波等防御措施的鲁棒性。

详情

AI中文摘要

跨词汇模型组合中的分词器移植将仅存在于捐赠者的嵌入行重构为基于共享词汇锚点的加权组合，并在基础模型上重用这些系数。我们识别出这种重构的一个结构几何特性：相同的系数向量在捐赠者和基础锚点跨度中到达不同的集合，即一个\emph{非对称可实现性}差距。在OMP下的65个捐赠者-基础对中，通过CLP、WECHSEL和FOCUS的跨算子验证，我们构造了\emph{破坏令牌}：在捐赠者锚点跨度中保持统计惰性，同时在基础中产生高显著性重构的单一系数向量。相同的Gemma-2-2B捐赠者检查点允许针对来自五个模型家族的13个不同下游基础进行此构造。植入的方向与未改变的干净参考权重合并。在部署者案例研究中，标准LoRA微调主要抑制了其提示分布与训练语料匹配的破坏者，并且在我们设置中不足以缓解此类攻击家族。测试的谱滤波器未能捕捉到非对称性。我们讨论了在开放权重组合供应链中的潜在滥用。

英文摘要

Tokenizer transplant in cross-vocabulary model composition reconstructs donor-only embedding rows as weighted combinations over shared lexical anchors and reuses those coefficients on the base. We identify a structural geometric property of this reconstruction: the same coefficient vector reaches different sets in the donor and base anchor spans, an \emph{asymmetric realizability} gap. Across 65 donor-base pairs under OMP, with cross-operator validation on CLP, WECHSEL, and FOCUS, we construct \textit{breaker tokens}: single coefficient vectors that remain statistically inert in the donor anchor span while producing a high-salience reconstruction in the base. The same Gemma-2-2B donor checkpoint admits this construction against 13 different downstream bases drawn from five model families. The planted direction passes weight-merging with a clean reference unchanged. In a deployer case study, standard LoRA fine-tuning suppresses the breaker primarily on prompts whose distribution matches the training corpus and is not a sufficient mitigation against this attack family in our setting. The tested spectral filters miss the asymmetry. We discuss potential misuse in the open-weight composition supply chain.

URL PDF HTML ☆

赞 0 踩 0

2512.21311 2026-05-29 cs.LG 版本更新

Learning to Solve PDEs on Neural Shape Representations

在神经形状表示上学习求解偏微分方程

Lilian Welschinger, Yilin Liu, Zican Wang, Niloy Mitra

发表机构 * University College London（伦敦大学学院）； Adobe Research（Adobe研究）

AI总结提出一种无网格公式，学习基于神经局部形状属性的局部更新算子，直接在神经表示上求解表面偏微分方程，无需显式网格或逐实例优化，且保持可微性。

Comments Accepted at CVPR 2026. Project page: https://welschinger.github.io/Learning-to-Solve-PDEs-on-Neural-Shape-Representations/

详情

AI中文摘要

在形状上求解偏微分方程支撑着许多形状分析和工程任务；然而，主流的偏微分方程求解器在多边形/三角形网格上运行，而现代3D资产越来越多地以神经表示的形式存在。这种不匹配导致没有合适的方法直接在神经域内求解表面偏微分方程，迫使进行显式网格提取或逐实例残差训练，阻碍了端到端的工作流程。我们提出了一种新颖的无网格公式，学习一个基于神经（局部）形状属性条件化的局部更新算子，使得表面偏微分方程能够直接在神经数据所在处求解。该算子自然地与流行的神经表面表示集成，仅在单个代表性形状上训练一次，并能在形状和拓扑变化中泛化，实现准确、快速的推理，无需显式网格划分或逐实例优化，同时保持可微性。在解析基准测试（球面上的热扩散方程和泊松方程）以及各种形状和神经表面表示上，我们的方法达到了与经典求解器相当的精度，同时实现了跨神经和传统表面表示的统一端到端流水线。我们的源代码和项目页面：https://welschinger.github.io/Learning-to-Solve-PDEs-on-Neural-Shape-Representations/。

英文摘要

Solving partial differential equations (PDEs) on shapes underpins many shape analysis and engineering tasks; yet, prevailing PDE solvers operate on polygonal/triangle meshes while modern 3D assets increasingly live as neural representations. This mismatch leaves no suitable method to solve surface PDEs directly within the neural domain, forcing explicit mesh extraction or per-instance residual training, preventing end-to-end workflows. We present a novel, meshfree formulation that learns a local update operator conditioned on neural (local) shape attributes, enabling surface PDEs to be solved directly where the (neural) data lives. The operator integrates naturally with prevalent neural surface representations, is trained once on a single representative shape, and generalizes across shape and topology variations, enabling accurate, fast inference without explicit meshing or per-instance optimization while preserving differentiability. Across analytic benchmarks (heat diffusion and Poisson equations on the sphere) and on diverse shapes and neural surface representations, our method achieves accuracy comparable to classical solvers while enabling a unified, end-to-end pipeline across neural and traditional surface representations. Our source code and project page: https://welschinger.github.io/Learning-to-Solve-PDEs-on-Neural-Shape-Representations/.

URL PDF HTML ☆

赞 0 踩 0

2512.19199 2026-05-29 cs.LG cs.AI 版本更新

On the Koopman-Based Generalization Bounds for Multi-Task Deep Learning

基于Koopman的多任务深度学习泛化界

Mahdi Mohammadigohari, Giuseppe Di Fatta, Giuseppe Nicosia, Panos M. Pardalos

发表机构 * Free University of Bozen-Bolzano（博兹纳-博尔扎诺自由大学）； University of Catania（卡塔尼亚大学）； University of Florida（佛罗里达大学）

AI总结本文利用算子理论技术建立多任务深度神经网络的泛化界，通过利用权重矩阵的小条件数并引入定制的Sobolev空间作为扩展假设空间，提出比传统范数方法更紧的界，该界在单输出设置下仍有效且优于现有Koopman界。

Comments Accepted at the 11th International Conference on Machine Learning, Optimization, and Data Science (LOD), Castiglione della Pescaia, Italy, September 21-24, 2025. To appear in Lecture Notes in Computer Science (LNCS), volume 16467

2512.19184 2026-05-29 cs.LG cs.AI 版本更新

Operator-Based Generalization Bound for Deep Learning: Insights on Multi-Task Learning

基于算子的深度学习泛化界：多任务学习的洞见

Mahdi Mohammadigohari, Giuseppe Di Fatta, Giuseppe Nicosia, Panos M. Pardalos

发表机构 * Free University of Bozen-Bolzano（博兹纳-博尔扎诺自由大学）； University of Catania（卡塔尼亚大学）； University of Florida（佛罗里达大学）

AI总结本文通过算子理论框架，结合Koopman方法与现有技术，为向量值神经网络和深度核方法提出了更紧的泛化界，并引入草图技术降低计算成本，同时提出深度向量值再生核希尔伯特空间框架，利用Perron-Frobenius算子增强深度核方法，推导了新的Rademacher泛化界，解决了欠拟合和过拟合问题。

Comments Accepted at the 11th International Conference on Machine Learning, Optimization, and Data Science (LOD), Castiglione della Pescaia, Italy, September 21-24, 2025. To appear in Lecture Notes in Computer Science (LNCS), volume 16467

详情

DOI: 10.1007/978-3-032-21480-5_9
Journal ref: Machine Learning, Optimization, and Data Science (LOD 2025), Lecture Notes in Computer Science (LNCS), vol. 16468, Springer, 2026, pp. 120--137

AI中文摘要

本文提出了向量值神经网络和深度核方法的新型泛化界，通过算子理论框架聚焦多任务学习。我们的关键发展在于策略性地将基于Koopman的方法与现有技术相结合，实现了比传统基于范数的界更紧的泛化保证。为缓解基于Koopman方法的计算挑战，我们引入了适用于向量值神经网络的草图技术。这些技术在一般Lipschitz损失下给出了超额风险界，为包括鲁棒回归和多重分位数回归在内的应用提供了性能保证。此外，我们提出了一个新的深度学习框架——深度向量值再生核希尔伯特空间（vvRKHS），利用Perron-Frobenius（PF）算子增强深度核方法。我们为该框架推导了新的Rademacher泛化界，通过核精炼策略明确处理欠拟合和过拟合。这项工作为深度学习架构下的多任务学习泛化性质提供了新颖洞见，该领域直到最近才有所发展。

英文摘要

This paper presents novel generalization bounds for vector-valued neural networks and deep kernel methods, focusing on multi-task learning through an operator-theoretic framework. Our key development lies in strategically combining a Koopman based approach with existing techniques, achieving tighter generalization guarantees compared to traditional norm-based bounds. To mitigate computational challenges associated with Koopman-based methods, we introduce sketching techniques applicable to vector valued neural networks. These techniques yield excess risk bounds under generic Lipschitz losses, providing performance guarantees for applications including robust and multiple quantile regression. Furthermore, we propose a novel deep learning framework, deep vector-valued reproducing kernel Hilbert spaces (vvRKHS), leveraging Perron Frobenius (PF) operators to enhance deep kernel methods. We derive a new Rademacher generalization bound for this framework, explicitly addressing underfitting and overfitting through kernel refinement strategies. This work offers novel insights into the generalization properties of multitask learning with deep learning architectures, an area that has been relatively unexplored until recent developments.

URL PDF HTML ☆

赞 0 踩 0

2512.13517 2026-05-29 q-bio.NC cs.LG 版本更新

A Deep Learning Model of Mental Rotation Informed by Interactive VR Experiments

基于交互式VR实验的心理旋转深度学习模型

Raymond Khazoum, Daniela Fernandes, Aleksandr Krylov, Qin Li, Stephane Deny

发表机构 * Department of Computer Science, Aalto University, Espoo, Finland（奥卢大学计算机科学系，芬兰埃斯波）； Department of Neuroscience and Biomedical Engineering, Aalto University, Espoo, Finland（奥卢大学神经科学与生物医学工程系，芬兰埃斯波）

AI总结提出一个由等变编码器、神经符号对象编码器和神经决策代理组成的深度学习模型，通过VR实验验证，准确模拟人类心理旋转的性能、响应时间和行为。

Comments Version accepted at ICML 2026

详情

AI中文摘要

心理旋转——比较从不同视角观察到的物体的能力——是人类心理模拟和空间世界建模的基本示例。在这里，我们利用深度、等变和神经符号学习的最新进展，提出了一个人类心理旋转的机制模型。我们的模型由三个堆叠的组件组成：(1) 等变神经编码器，从图像中生成物体的3D空间表示；(2) 神经符号对象编码器，从这些空间表示中推导出符号对象描述；(3) 神经决策代理，通过循环路径比较这些符号描述，以在3D潜在空间中规定旋转模拟。我们的模型设计受到现有心理旋转实验文献的指导，并辅以VR实验，其中参与者有时可以操作物体进行比较。我们的模型很好地捕捉了参与者在我们和其他人的实验中的表现、反应时间和行为，并通过消融研究证明了每个组件的必要性。我们的工作为最近一系列人类空间推理的深度神经模型增添了新的内容，进一步证明了整合深度、等变和符号表示来模拟人类思维的效力。

英文摘要

Mental rotation -- the ability to compare objects seen from different viewpoints -- is a fundamental example of mental simulation and spatial world modeling in humans. Here we propose a mechanistic model of human mental rotation, leveraging recent advances in deep, equivariant, and neuro-symbolic learning. Our model consists of three stacked components: (1) an equivariant neural encoder, producing 3D spatial representations of objects from images, (2) a neuro-symbolic object encoder, deriving symbolic objects descriptions from these spatial representations, and (3) a neural decision agent, comparing these symbolic descriptions to prescribe rotation simulations in 3D latent space via a recurrent pathway. Our model design is guided by the existing experimental literature on mental rotation, which we complemented with experiments in VR where participants could at times manipulate the objects to compare. Our model captures well the performance, response times and behavior of participants in our and others' experiments, and through ablation studies we demonstrate the necessity of each component. Our work adds to a recent collection of deep neural models of human spatial reasoning, further demonstrating the potency of integrating deep, equivariant, and symbolic representations to model the human mind.

URL PDF HTML ☆

赞 0 踩 0

2512.10659 2026-05-29 cs.LG 版本更新

DCFO: Density-Based Counterfactuals for Outliers -- Additional Material

DCFO: 基于密度的离群点反事实解释——补充材料

Tommaso Amico, Pernille Matthews, Lena Krieger, Arthur Zimek, Ira Assent

发表机构 * Department of Computer Science（计算机科学系）； Department of Computer Science and Mathematics（计算机科学与数学系）

AI总结针对局部离群因子（LOF）缺乏可解释性的问题，提出基于密度的离群点反事实解释方法（DCFO），通过将数据空间划分为LOF平滑区域实现高效梯度优化，在50个OpenML数据集上优于现有方法。

详情

AI中文摘要

离群点检测识别显著偏离大多数数据分布的数据点。解释离群点对于理解导致其检测的潜在因素、验证其重要性以及识别潜在偏差或错误至关重要。有效的解释提供可操作的见解，有助于采取预防措施以避免未来出现类似的离群点。反事实解释通过识别改变预测所需的最小变化，阐明特定数据点为何被分类为离群点。尽管有价值，但大多数现有的反事实解释方法忽略了离群点检测带来的独特挑战，并且未能针对经典、广泛采用的离群点检测算法。局部离群因子（LOF）是最流行的无监督离群点检测方法之一，通过相对局部密度量化离群程度。尽管LOF在多种应用中广泛使用，但它缺乏可解释性。为解决这一局限性，我们提出了基于密度的离群点反事实解释（DCFO），这是一种专门为LOF生成反事实解释的新方法。DCFO将数据空间划分为LOF行为平滑的区域，从而实现高效的基于梯度的优化。在50个OpenML数据集上的广泛实验验证表明，DCFO始终优于基准竞争对手，在生成的反事实的邻近性和有效性方面表现更优。

英文摘要

Outlier detection identifies data points that significantly deviate from the majority of the data distribution. Explaining outliers is crucial for understanding the underlying factors that contribute to their detection, validating their significance, and identifying potential biases or errors. Effective explanations provide actionable insights, facilitating preventive measures to avoid similar outliers in the future. Counterfactual explanations clarify why specific data points are classified as outliers by identifying minimal changes required to alter their prediction. Although valuable, most existing counterfactual explanation methods overlook the unique challenges posed by outlier detection, and fail to target classical, widely adopted outlier detection algorithms. Local Outlier Factor (LOF) is one the most popular unsupervised outlier detection methods, quantifying outlierness through relative local density. Despite LOF's widespread use across diverse applications, it lacks interpretability. To address this limitation, we introduce Density-based Counterfactuals for Outliers (DCFO), a novel method specifically designed to generate counterfactual explanations for LOF. DCFO partitions the data space into regions where LOF behaves smoothly, enabling efficient gradient-based optimisation. Extensive experimental validation on 50 OpenML datasets demonstrates that DCFO consistently outperforms benchmarked competitors, offering superior proximity and validity of generated counterfactuals.

URL PDF HTML ☆

赞 0 踩 0

2512.10401 2026-05-29 stat.ML cs.LG math.ST stat.TH 版本更新

Diffusion differentiable resampling

扩散可微重采样

Jennifer Rosina Andersson, Zheng Zhao

发表机构 * Department of Information Technology, Uppsala University, Sweden（乌普萨拉大学信息科技系，瑞典）

AI总结针对序贯蒙特卡洛中的可微重采样问题，提出一种基于无训练扩散模型代理的信息性且即时可微的重采样方法，理论证明其一致性，并在多个滤波和参数估计基准上优于现有方法。

Comments In ICML 2026

2512.03109 2026-05-29 cs.LG cs.AI stat.AP stat.ML 版本更新

E-valuator: Reliable Agent Verifiers with Sequential Hypothesis Testing

E-valuator: 基于序贯假设检验的可靠智能体验证器

Shuvom Sadhuka, Drew Prinster, Clara Fannjiang, Gabriele Scalia, Bonnie Berger, Aviv Regev, Hanchen Wang

发表机构 * Genentech（基因泰克）； MIT（麻省理工学院）； Johns Hopkins（约翰霍普金斯大学）； Stanford（斯坦福大学）

AI总结提出E-valuator方法，将任意黑盒验证器分数转化为具有可控虚警率的决策规则，通过序贯假设检验实现对智能体轨迹的在线监控，提升统计功效并节省令牌。

详情

AI中文摘要

智能体AI系统根据用户提示执行一系列动作，如推理步骤或工具调用。为了评估其轨迹的成功性，研究人员开发了验证器（如LLM评判器和过程奖励模型）来对智能体轨迹中每个动作的质量进行评分。尽管这些启发式评分可能提供信息，但在用于决定智能体是否会产生成功输出时，无法保证正确性。在此，我们引入e-valuator，一种将任意黑盒验证器分数转化为具有可证明虚警率控制的决策规则的方法。我们将区分成功轨迹（即会导致对用户提示正确响应的动作序列）与不成功轨迹的问题构建为序贯假设检验问题。E-valuator基于e-过程工具开发了一个序贯假设检验，该检验在智能体轨迹的每一步都保持统计有效性，从而能够对任意长动作序列的智能体进行在线监控。实验表明，在六个数据集和三个智能体上，e-valuator相比其他策略提供了更高的统计功效和更好的虚警率控制。我们还展示了e-valuator可用于快速终止有问题的轨迹并节省令牌。总之，e-valuator提供了一个轻量级、模型无关的框架，将验证器启发式转化为具有统计保证的决策规则，从而支持部署更可靠的智能体系统。

英文摘要

Agentic AI systems execute a sequence of actions, such as reasoning steps or tool calls, in response to a user prompt. To evaluate the success of their trajectories, researchers have developed verifiers, such as LLM judges and process-reward models, to score the quality of each action in an agent's trajectory. Although these heuristic scores can be informative, there are no guarantees of correctness when used to decide whether an agent will yield a successful output. Here, we introduce e-valuator, a method to convert any black-box verifier score into a decision rule with provable control of false alarm rates. We frame the problem of distinguishing successful trajectories (that is, a sequence of actions that will lead to a correct response to the user's prompt) and unsuccessful trajectories as a sequential hypothesis testing problem. E-valuator builds on tools from e-processes to develop a sequential hypothesis test that remains statistically valid at every step of an agent's trajectory, enabling online monitoring of agents over arbitrarily long sequences of actions. Empirically, we demonstrate that e-valuator provides greater statistical power and better false alarm rate control than other strategies across six datasets and three agents. We additionally show that e-valuator can be used for to quickly terminate problematic trajectories and save tokens. Together, e-valuator provides a lightweight, model-agnostic framework that converts verifier heuristics into decisions rules with statistical guarantees, enabling the deployment of more reliable agentic systems.

URL PDF HTML ☆

赞 0 踩 0

2511.11118 2026-05-29 cs.LG 版本更新

Improving Continual Learning of Knowledge Graph Embeddings via Informed Initialization

通过信息初始化改进知识图谱嵌入的持续学习

Gerard Pons, Besim Bilalli, Anna Queralt

AI总结提出一种基于知识图谱模式与已有嵌入的信息初始化策略，提升持续学习中新知识的获取并减少灾难性遗忘，同时加速训练。

详情

DOI: 10.1016/j.neucom.2026.134045

AI中文摘要

许多知识图谱（KG）会频繁更新，迫使知识图谱嵌入（KGE）适应这些变化。为了解决这个问题，KGE的持续学习技术在更新旧嵌入的同时纳入新实体的嵌入。这些方法中的一个必要步骤是嵌入的初始化，作为KGE学习过程的输入，它对最终嵌入的准确性以及训练所需的时间有重要影响。这对于相对较小且频繁的更新尤其重要。我们提出了一种新颖的信息嵌入初始化策略，可以无缝集成到现有的KGE持续学习方法中，该策略在减少灾难性遗忘的同时增强新知识的获取。具体地，利用KG模式以及先前学习的嵌入，基于新实体所属的类别来获得其初始表示。我们广泛的实验分析表明，所提出的初始化策略提高了所得KGE的预测性能，同时增强了知识保留。此外，我们的方法加速了知识获取，减少了增量学习新嵌入所需的周期数，从而减少了时间。最后，其在不同类型的KGE学习模型中的优势也得到了证明。

英文摘要

Many Knowledege Graphs (KGs) are frequently updated, forcing their Knowledge Graph Embeddings (KGEs) to adapt to these changes. To address this problem, continual learning techniques for KGEs incorporate embeddings for new entities while updating the old ones. One necessary step in these methods is the initialization of the embeddings, as an input to the KGE learning process, which can have an important impact in the accuracy of the final embeddings, as well as in the time required to train them. This is especially relevant for relatively small and frequent updates. We propose a novel informed embedding initialization strategy, which can be seamlessly integrated into existing continual learning methods for KGE, that enhances the acquisition of new knowledge while reducing catastrophic forgetting. Specifically, the KG schema and the previously learned embeddings are utilized to obtain initial representations for the new entities, based on the classes the entities belong to. Our extensive experimental analysis shows that the proposed initialization strategy improves the predictive performance of the resulting KGEs, while also enhancing knowledge retention. Furthermore, our approach accelerates knowledge acquisition, reducing the number of epochs, and therefore time, required to incrementally learn new embeddings. Finally, its benefits across various types of KGE learning models are demonstrated.

URL PDF HTML ☆

赞 0 踩 0

2510.27663 2026-05-29 eess.IV cs.LG stat.ME stat.ML 版本更新

Bayesian model selection and misspecification testing in imaging inverse problems only from noisy and partial measurements

仅从噪声和部分测量中进行成像逆问题的贝叶斯模型选择与误设定检验

Tom Sprunck, Marcelo Pereyra, Tobias Liaudat

发表机构 * Heriot-Watt University, MACS \& Maxwell Institute for Mathematical Sciences, EH14 4AS, Edinburgh, United Kingdom

AI总结提出一种结合贝叶斯交叉验证与数据分裂的通用方法，用于在无真实数据情况下对成像逆模型进行选择与误设定检测，兼容扩散采样器等贝叶斯成像采样器，计算成本低且准确率高。

详情

AI中文摘要

现代成像技术严重依赖贝叶斯统计模型来解决困难的图像重建和恢复任务。本文针对无真实数据的情况，研究此类模型的客观评估，重点关注模型选择和误设定诊断。现有的无监督模型评估方法通常因计算成本高且与通过机器学习模型隐式定义的现代图像先验不兼容，而不适用于计算成像。本文提出一种基于贝叶斯交叉验证与数据分裂（一种随机测量分裂技术）的新型组合方法，用于贝叶斯成像科学中的无监督模型选择和误设定检测。该方法与任何贝叶斯成像采样器兼容，包括扩散采样器和即插即用采样器。我们通过涉及多种评分规则和模型误设定类型的实验证明了该方法的有效性，在低计算成本下实现了出色的选择和检测精度。

英文摘要

Modern imaging techniques heavily rely on Bayesian statistical models to address difficult image reconstruction and restoration tasks. This paper addresses the objective evaluation of such models in settings where ground truth is unavailable, with a focus on model selection and misspecification diagnosis. Existing unsupervised model evaluation methods are often unsuitable for computational imaging due to their high computational cost and incompatibility with modern image priors defined implicitly via machine learning models. We herein propose a general methodology for unsupervised model selection and misspecification detection in Bayesian imaging sciences, based on a novel combination of Bayesian cross-validation and data fission, a randomized measurement splitting technique. The approach is compatible with any Bayesian imaging sampler, including diffusion and plug-and-play samplers. We demonstrate the methodology through experiments involving various scoring rules and types of model misspecification, where we achieve excellent selection and detection accuracy with a low computational cost.

URL PDF HTML ☆

赞 0 踩 0

2510.27391 2026-05-29 cs.CV cs.LG 版本更新

Modality Alignment across Trees on Heterogeneous Hyperbolic Manifolds

异质双曲流形上的树间模态对齐

Wei Wu, Xiaomeng Fan, Yuwei Wu, Zhi Gao, Pengxiang Li, Yunde Jia, Mehrtash Harandi

发表机构 * Beijing Key Laboratory of Intelligent Information Technology, School of Computer Science & Technology, Beijing Institute of Technology（北京智能信息科技重点实验室，计算机科学与技术学院，北京理工大学）； Guangdong Laboratory of Machine Perception and Intelligent Computing, Shenzhen MSU-BIT University（广东机器感知与智能计算实验室，深圳MSU-BIT大学）； Department of Electrical and Computer System Engineering, Monash University（电子与计算机系统工程系，墨尔本大学）

AI总结提出一种在异质双曲流形上对齐图像和文本树状层次特征的方法，通过交叉注意力提取视觉层次特征、异质流形嵌入及KL距离度量学习中间流形，在开放集分类任务中优于基线。

Comments Published as a conference paper at ICLR 2026

详情

Journal ref: The Fourteenth International Conference on Learning Representations (ICLR 2026), Rio de Janeiro, Brazil, 2026

AI中文摘要

模态对齐对于视觉-语言模型（VLM）有效整合跨模态信息至关重要。然而，现有方法在提取文本层次特征的同时，对每个图像仅用单一特征表示，导致不对称和次优的对齐。为解决此问题，我们提出树间对齐（Alignment across Trees）方法，该方法为图像和文本模态构建并对齐树状层次特征。具体而言，我们引入一个语义感知的视觉特征提取框架，该框架对来自中间Transformer层的视觉类别标记应用交叉注意力机制，由文本线索引导以提取具有从粗到细语义的视觉特征。然后，我们将两种模态的特征树嵌入到具有不同曲率的双曲流形中，以有效建模其层次结构。为了在不同曲率的异质双曲流形之间进行对齐，我们推导了异质流形上分布之间的KL距离度量，并通过最小化该距离学习一个用于流形对齐的中间流形。我们证明了最优中间流形的存在性和唯一性。在多个图像数据集上的分类学开放集分类任务实验表明，我们的方法在少样本和跨域设置下持续优于强基线。

英文摘要

Modality alignment is critical for vision-language models (VLMs) to effectively integrate information across modalities. However, existing methods extract hierarchical features from text while representing each image with a single feature, leading to asymmetric and suboptimal alignment. To address this, we propose Alignment across Trees, a method that constructs and aligns tree-like hierarchical features for both image and text modalities. Specifically, we introduce a semantic-aware visual feature extraction framework that applies a cross-attention mechanism to visual class tokens from intermediate Transformer layers, guided by textual cues to extract visual features with coarse-to-fine semantics. We then embed the feature trees of the two modalities into hyperbolic manifolds with distinct curvatures to effectively model their hierarchical structures. To align across the heterogeneous hyperbolic manifolds with different curvatures, we formulate a KL distance measure between distributions on heterogeneous manifolds, and learn an intermediate manifold for manifold alignment by minimizing the distance. We prove the existence and uniqueness of the optimal intermediate manifold. Experiments on taxonomic open-set classification tasks across multiple image datasets demonstrate that our method consistently outperforms strong baselines under few-shot and cross-domain settings.

URL PDF HTML ☆

赞 0 踩 0

2510.14150 2026-05-29 cs.AI cs.LG cs.NE 版本更新

CodeEvolve: an open source evolutionary coding agent for algorithmic discovery and optimization

CodeEvolve：用于算法发现和优化的开源进化编码智能体

Henrique Assumpção, Diego Ferreira, Leandro Campos, Fabricio Murai

发表机构 * Inter Science - Inter&Co ； Federal University of Minas Gerais（联邦大学伯南迪斯）； Worcester Polytechnic Institute（沃思彻斯特理工大学）

AI总结提出CodeEvolve开源框架，结合大语言模型与岛屿进化搜索，通过灵感交叉、元提示和深度细化，在AlphaEvolve基准上匹配或超越5/9问题，并在匹配条件下优于OpenEvolve和ShinkaEvolve，以更低成本超越前沿闭源集成。

Comments 21 pages, 16 figures, 8 tables

详情

AI中文摘要

我们介绍了CodeEvolve，一个开源框架，它将大语言模型与基于岛屿的进化搜索相结合，用于端到端的算法发现。CodeEvolve在CVT-MAP-Elites存档和加权LLM集成之上集成了基于灵感的交叉、元提示和深度细化，为复杂问题生成优化解决方案。在AlphaEvolve基准套件上，CodeEvolve在9个问题中的5个上匹配或超过了报告的AlphaEvolve结果，并且在匹配条件下，在9个问题中的6个上优于开源框架OpenEvolve和ShinkaEvolve。使用开放权重的Qwen3-Coder-30B骨干网络，它在CirclePackingSquare的两个实例上均超过了报告的AlphaEvolve分数，成本大约比前沿闭源集成低一个数量级，并且在无需重新调整的情况下，在启发式设计任务上与EoH保持竞争力。消融实验表明，CodeEvolve组件之间的相互作用（而非任何单一算子）驱动了这些结果。我们在https://github.com/inter-co/science-codeevolve 发布了该框架、实验数据和实用的超参数指南。

英文摘要

We introduce CodeEvolve, an open-source framework that couples large language models with island-based evolutionary search for end-to-end algorithmic discovery. CodeEvolve integrates inspiration-based crossover, meta-prompting, and depth-based refinement on top of a CVT-MAP-Elites archive and a weighted LLM ensemble to generate optimized solutions for complex problems. On the AlphaEvolve benchmark suite, CodeEvolve matches or surpasses the reported AlphaEvolve results on 5 of 9 problems and, under matched conditions, outperforms the open-source frameworks OpenEvolve and ShinkaEvolve on 6 of 9. With the open-weight Qwen3-Coder-30B backbone, it surpasses the reported AlphaEvolve score on both CirclePackingSquare instances at roughly an order of magnitude lower cost than a frontier closed-source ensemble, and remains competitive with EoH on heuristic-design tasks without retuning. Ablations show that the interaction between CodeEvolve's components, rather than any single operator, drives these results. We release the framework, experimental data, and practical hyperparameter guidelines at https://github.com/inter-co/science-codeevolve.

URL PDF HTML ☆

赞 0 踩 0

2510.11499 2026-05-29 cs.LG cs.AI 版本更新

Offline Reinforcement Learning with Generative Trajectory Policies

基于生成轨迹策略的离线强化学习

Xinsong Feng, Leshu Tang, Chenan Wang, Haipeng Chen

发表机构 * School of Computing, Data Sciences ； Computer Engineering Department, UCLA, Los Angeles, USA

AI总结本文提出生成轨迹策略（GTP），通过统一扩散、流匹配和一致性模型为常微分方程驱动的连续时间生成轨迹，并引入两种理论自适应方法，在D4RL基准上达到最先进性能。

Comments ICML 2026

详情

AI中文摘要

生成模型因其捕获复杂多模态行为的能力，已成为离线强化学习中一类强大的策略。然而，现有方法面临明显的权衡：扩散策略等慢速迭代模型计算成本高，而一致性策略等快速单步模型性能往往下降。在本文中，我们证明弥合这一差距是可能的。我们认为，超越个体方法局限的关键在于一个统一视角，该视角将现代生成模型（包括扩散、流匹配和一致性模型）视为学习由常微分方程驱动的连续时间生成轨迹的具体实例。这一原则性基础为强化学习中的生成策略提供了更清晰的设计空间，并使我们能够提出生成轨迹策略（GTP），一种新的、更通用的策略范式，学习底层ODE的完整解映射。为使该范式适用于离线强化学习，我们进一步引入了两种理论上原则性的自适应方法。实验结果表明，GTP在D4RL基准上达到了最先进的性能——它显著优于先前的生成策略，在多个以困难著称的AntMaze任务上取得了完美分数。

英文摘要

Generative models have emerged as a powerful class of policies for offline reinforcement learning (RL) due to their ability to capture complex, multi-modal behaviors. However, existing methods face a stark trade-off: slow, iterative models like diffusion policies are computationally expensive, while fast, single-step models like consistency policies often suffer from degraded performance. In this paper, we demonstrate that it is possible to bridge this gap. The key to moving beyond the limitations of individual methods, we argue, lies in a unifying perspective that views modern generative models, including diffusion, flow matching, and consistency models, as specific instances of learning a continuous-time generative trajectory governed by an Ordinary Differential Equation (ODE). This principled foundation provides a clearer design space for generative policies in RL and allows us to propose Generative Trajectory Policies (GTPs), a new and more general policy paradigm that learns the entire solution map of the underlying ODE. To make this paradigm practical for offline RL, we further introduce two key theoretically principled adaptations. Empirical results demonstrate that GTP achieves state-of-the-art performance on D4RL benchmarks - it significantly outperforms prior generative policies, achieving perfect scores on several notoriously hard AntMaze tasks.

URL PDF HTML ☆

赞 0 踩 0

2510.08722 2026-05-29 cs.LG cs.AI 版本更新

The Impact of Semantic Pairs on Self-Supervised Representation Learning

语义对自监督表示学习的影响

Mohammad Alkhalefi, Georgios Leontidis, Mingjun Zhong

AI总结通过控制实验研究语义正对（不同同类实例）相比增强正对在自监督学习中的效果，发现语义对能提升泛化性能，尤其对比学习受益最大。

Comments 19 pages, 7 figures, 5 tables

详情

AI中文摘要

实例判别通过将同一图像的不同增强视图视为正对来学习视觉表示。虽然这鼓励对手工变换的不变性，但同图像正对可能保留背景、纹理、光照和对象特定细节等干扰相关性。语义正对，即不同的同类实例，通过在不同上下文中呈现对象可能减少这些相关性。然而，先前的研究通常将语义对与增强正对或错误邻居（即错误映射的语义对）结合，使得难以隔离语义配对的效果。我们提出了一个关于语义正对用于自监督表示学习的受控实证研究。从ImageNet-1K中，我们构建了两个匹配的子集：一个增强对基线和一个手动策划的语义对数据集，具有相同的类别组成和训练对数量。我们使用这些数据集在匹配的训练条件下比较代表性的对比和非对比SSL方法。在迁移学习和目标检测评估中，语义对预训练始终优于增强对预训练。额外的消融实验表明，语义对诱导了超出标准变换管道的不变性。在评估的方法中，对比学习从语义对中受益最大，其中SimCLR显示出最大的相对改进。这些结果阐明了语义正对在SSL中的作用，并为选择和设计能够有效利用语义对信息的框架提供了指导。

英文摘要

Instance discrimination learns visual representations by treating different augmented views of the same image as positive pairs. While this encourages invariance to handcrafted transformations, same-image positives can preserve nuisance correlations such as background, texture, illumination, and object-specific details. Semantic positive pairs, i.e., different same-class instances, may reduce these correlations by presenting objects across diverse contexts. However, previous studies often combine semantic pairs with augmented positives or false neighbors (i.e., incorrectly mapped semantic pairs), making it difficult to isolate the effect of semantic pairing. We present a controlled empirical study of semantic positive pairs for self-supervised representation learning. From ImageNet-1K, we construct two matched subsets: an augmented-pair baseline and a manually curated semantic-pair dataset with the same class composition and training-pair count. We use these datasets to compare representative contrastive and non-contrastive SSL methods under matched training conditions. Across transfer learning and object detection evaluations, semantic-pair pretraining consistently improves generalisation over augmented-pair pretraining. Additional ablations show that semantic pairs induce invariances beyond the standard transformation pipeline. Among the evaluated methods, contrastive learning benefits most strongly from semantic pairs, with SimCLR showing the largest relative improvement. These results clarify the role of semantic positive pairs in SSL and provide guidance for selecting and designing frameworks that can exploit semantic pair information effectively

URL PDF HTML ☆

赞 0 踩 0

2509.24895 2026-05-29 cs.LG 版本更新

Towards Understanding the Shape of Representations in Protein Language Models

理解蛋白质语言模型中表示的形状

Kosio Beshkov, Anders Malthe-Sørenssen

发表机构 * Department of Physics（物理系）； University of Oslo（奥斯陆大学）

AI总结本研究通过平方根速度表示和图过滤分析蛋白质语言模型（PLM）的表示空间，发现ESM2模型中Karcher均值和有效维度随层数非线性变化，且PLM优先编码残基的局部关系，最忠实于结构的表示出现在模型倒数第二层附近。

Comments Accepted as a poster at ICLR 2026. OpenReview: https://openreview.net/forum?id=Dnn8SSBJaY

详情

Journal ref: International Conference on Learning Representations (ICLR), 2026

AI中文摘要

虽然蛋白质语言模型（PLM）是未来从头蛋白质设计最有前途的研究途径之一，但它们将序列转换为隐藏表示的方式以及这些表示中编码的信息尚未完全理解。一些工作试图提出PLM的可解释性工具，但侧重于理解单个序列如何被这些模型转换。因此，PLM如何转换整个序列空间及其关系仍然未知。在这项工作中，我们尝试通过将蛋白质结构和表示与平方根速度（SRV）表示和图过滤联系起来，来理解这个转换后的序列空间。这两种方法自然地导出一个度量空间，在该空间中，可以比较成对的蛋白质或蛋白质表示。我们分析了来自SCOP数据集的不同类型蛋白质，并表明Karcher均值和SRV形状空间的有效维度作为不同大小ESM2模型中层数的函数遵循非线性模式。此外，我们使用图过滤作为工具来研究模型编码蛋白质结构特征的上下文长度。我们发现PLM优先编码残基之间的直接和局部关系，但对于较大的上下文长度开始退化。最忠实于结构的编码往往出现在模型最后一层附近但之前，表明在这些层之上训练折叠模型可能会提高折叠性能。

英文摘要

While protein language models (PLMs) are one of the most promising avenues of research for future de novo protein design, the way in which they transform sequences to hidden representations, as well as the information encoded in such representations is yet to be fully understood. Several works have attempted to propose interpretability tools for PLMs, but they have focused on understanding how individual sequences are transformed by such models. Therefore, the way in which PLMs transform the whole space of sequences along with their relations is still unknown. In this work we attempt to understand this transformed space of sequences by identifying protein structure and representation with square-root velocity (SRV) representations and graph filtrations. Both approaches naturally lead to a metric space in which pairs of proteins or protein representations can be compared with each other. We analyze different types of proteins from the SCOP dataset and show that the Karcher mean and effective dimension of the SRV shape space follow a non-linear pattern as a function of the layers in ESM2 models of different sizes. Furthermore, we use graph filtrations as a tool to study the context lengths at which models encode the structural features of proteins. We find that PLMs preferentially encode immediate as well as local relations between residues, but start to degrade for larger context lengths. The most structurally faithful encoding tends to occur close to, but before the last layer of the models, indicating that training a folding model ontop of these layers might lead to improved folding performance.

URL PDF HTML ☆

赞 0 踩 0

2509.24100 2026-05-29 stat.ME cs.LG 版本更新

驯服基于ML的安全任务中的数据挑战：使用生成式AI

Shravya Kanchi, Neal Mangaokar, Aravind Cheruvu, Sifat Muhammad Abdullah, Shirin Nilizadeh, Atul Prakash, Bimal Viswanath

发表机构 * University of Michigan, Ann Arbor（密歇根大学安娜堡分校）； University of Texas at Arlington（德克萨斯理工大学）

AI总结提出使用生成式AI（GenAI）生成的合成数据增强训练集，以改善机器学习安全分类器的泛化性能，在7个任务上实现最高32.6%的提升。

Comments Accepted at the 2026 ACM Asia Conference on Computer and Communications Security (AsiaCCS 2026)

详情

DOI: 10.1145/3779208.3785264
Journal ref: In Proc. ACM AsiaCCS 2026, Bangalore, India, June 1-5, 2026. ACM, 2026

AI中文摘要

基于机器学习的监督分类器广泛用于安全任务，其改进主要集中在算法进步上。我们认为，对分类器性能产生负面影响的数据挑战受到的关注有限。我们解决以下研究问题：生成式AI（GenAI）的发展能否应对这些数据挑战并提高分类器性能？我们提出使用GenAI技术生成的合成数据增强训练数据集，以改善分类器的泛化能力。我们使用6种最先进的GenAI方法在7个不同的安全任务上评估了这种方法，并引入了一种名为Nimai的新型GenAI方案，该方案能够实现高度可控的数据合成。我们发现，GenAI技术可以显著提高安全分类器的性能，即使在数据严重受限的情况下（仅约180个训练样本），也能实现高达32.6%的提升。此外，我们证明GenAI可以促进部署后对概念漂移的快速适应，在调整过程中只需最少的标注。尽管取得了成功，但我们的研究发现，一些GenAI方案在某些安全任务上难以初始化（训练和生成数据）。我们还识别了特定任务的特征，如噪声标签、重叠的类别分布和稀疏特征向量，这些特征阻碍了使用GenAI提升性能。我们相信，我们的研究将推动未来针对安全任务的GenAI工具的开发。

英文摘要

Machine learning-based supervised classifiers are widely used for security tasks, and their improvement has been largely focused on algorithmic advancements. We argue that data challenges that negatively impact the performance of these classifiers have received limited attention. We address the following research question: Can developments in Generative AI (GenAI) address these data challenges and improve classifier performance? We propose augmenting training datasets with synthetic data generated using GenAI techniques to improve classifier generalization. We evaluate this approach across 7 diverse security tasks using 6 state-of-the-art GenAI methods and introduce a novel GenAI scheme called Nimai that enables highly controlled data synthesis. We find that GenAI techniques can significantly improve the performance of security classifiers, achieving improvements of up to 32.6% even in severely data-constrained settings (only ~180 training samples). Furthermore, we demonstrate that GenAI can facilitate rapid adaptation to concept drift post-deployment, requiring minimal labeling in the adjustment process. Despite successes, our study finds that some GenAI schemes struggle to initialize (train and produce data) on certain security tasks. We also identify characteristics of specific tasks, such as noisy labels, overlapping class distributions, and sparse feature vectors, which hinder performance boost using GenAI. We believe that our study will drive the development of future GenAI tools designed for security tasks.

URL PDF HTML ☆

赞 0 踩 0

2507.00037 2026-05-29 cs.LG cs.AI 版本更新

Model Fusion via Retrofitting

通过回溯改造的模型融合

Phoomraphee Luenam, Andreas Spanopoulos, Amit Sant, Thomas Hofmann, Sotiris Anagnostidis, Sidak Pal Singh

发表机构 * ETH Z\"urich

AI总结提出一种以神经元为中心的融合算法，通过将父模型中间神经元分组为目标表示并训练融合模型子网络逼近，结合神经元归因分数进行显著特征对齐，适用于任意可模块化为有向无环图结构的架构，在零样本和非独立同分布场景下表现最佳。

Comments 5 figures, 15 tables, 23 pages

详情

AI中文摘要

模型融合旨在将独立训练的神经网络组合成一个单一模型而无需重新训练，但由于排列不变性、随机初始化和异构训练数据导致的表示差异，这一过程变得复杂。现有方法在非独立同分布数据分布下的零样本设置中尤其困难，并且通常局限于特定架构或成对融合。我们引入了一类以神经元为中心的融合算法，将融合视为一个原则性的表示匹配问题：父模型中的中间神经元被分组为目标表示，然后训练融合模型的相应子网络来逼近这些表示。与先前工作不同，我们的方法结合了神经元归因分数以偏向于显著特征的对齐，并且可以应用于任何可模块化为有向无环图层次的架构——在VGG、ResNet和ViT上进行了实证验证。在标准基准上的实验显示，与现有融合方法相比，我们的方法取得了一致的改进，在零样本和非独立同分布场景中增益最大。代码可在https://github.com/AndrewSpano/model-fusion-via-retrofitting获取。

英文摘要

Model fusion seeks to combine independently trained neural networks into a single model without retraining, but is complicated by representational divergence arising from permutation invariance, random initialization, and heterogeneous training data. Existing methods struggle particularly in zero-shot settings under non-IID data distributions, and are often limited to specific architectures or pairwise fusion. We introduce a neuron-centric family of fusion algorithms that frames fusion as a principled representation-matching problem: intermediate neurons across parent models are grouped into target representations, which the fused model's corresponding sub-networks are then trained to approximate. Unlike prior work, our approach incorporates neuron attribution scores to bias alignment toward salient features, and can be applied to any architecture modularizable as a DAG of levels -- empirically validated on VGGs, ResNets, and ViTs. Experiments across standard benchmarks show consistent improvements over existing fusion methods, with the largest gains in zero-shot and non-IID scenarios. Code is available at https://github.com/AndrewSpano/model-fusion-via-retrofitting.

URL PDF HTML ☆

赞 0 踩 0

2506.20344 2026-05-29 math.OC cs.LG 版本更新

A Complete Loss Landscape Analysis of Regularized Deep Matrix Factorization

正则化深度矩阵分解的完整损失景观分析

Po Chen, Rujun Jiang, Peng Wang

发表机构 * School of Data Science, Fudan University, Shanghai, China（复旦大学数据科学学院，上海，中国）； Department of Computer and Information Science, University of Macau, Macau SAR, China（澳门大学计算机与信息科学系，澳门特别行政区，中国）

AI总结本文通过闭式表征所有临界点并分类其类型，揭示了正则化深度矩阵分解的损失景观，解释了梯度方法几乎总是收敛到局部极小值的原因。

Comments 30 pages, 2 figures

2506.12815 2026-05-29 cs.LG 版本更新

TrojanTO: Action-Level Backdoor Attacks against Trajectory Optimization Models

TrojanTO：针对轨迹优化模型的行动级后门攻击

Yang Dai, Oubo Ma, Longfei Zhang, Xingxing Liang, Xiaochun Cao, Shouling Ji, Jiaheng Zhang, Jincai Huang, Li Shen

发表机构 * Laboratory for Big Data and Decision, National University of Defense Technology（大数据与决策实验室，国防科技大学）； Zhejiang University（浙江大学）； Shenzhen Campus of Sun Yat-sen University（中山大学深圳校区）； National University of Singapore（新加坡国立大学）

AI总结提出TrojanTO，首个针对轨迹优化模型的行动级后门攻击方法，通过交替训练增强触发与目标动作关联，并利用轨迹过滤和批量投毒实现高隐蔽性，在低攻击预算下有效植入后门。

Comments 23 pages, 6 figures

详情

Journal ref: International Conference on Learning Representations (ICLR), 2026

AI中文摘要

轨迹优化（TO）模型的最新进展在离线强化学习中取得了显著成功。然而，它们对后门攻击的脆弱性尚不清楚。我们发现，现有的强化学习后门攻击基于奖励操纵，由于TO模型固有的序列建模特性，这些攻击对其基本无效。此外，高维动作空间带来的复杂性进一步加剧了动作操纵的挑战。为解决这些问题，我们提出了TrojanTO，这是首个针对TO模型的行动级后门攻击。TrojanTO采用交替训练来增强触发器与目标动作之间的关联，以提高攻击有效性。为提高攻击隐蔽性，它通过轨迹过滤进行精确投毒以保持正常性能，并通过批量投毒确保触发器一致性。大量评估表明，TrojanTO能够在低攻击预算（0.3%的轨迹）下，跨不同任务和攻击目标有效植入后门攻击。此外，TrojanTO对DT、GDT和DC具有广泛的适用性，突显了其跨多种TO模型架构的可扩展性。

英文摘要

Recent advances in Trajectory Optimization (TO) models have achieved remarkable success in offline reinforcement learning. However, their vulnerabilities against backdoor attacks are poorly understood. We find that existing backdoor attacks in reinforcement learning are based on reward manipulation, which are largely ineffective against the TO model due to its inherent sequence modeling nature. Moreover, the complexities introduced by high-dimensional action spaces further compound the challenge of action manipulation. To address these gaps, we propose TrojanTO, the first action-level backdoor attack against TO models. TrojanTO employs alternating training to enhance the connection between triggers and target actions for attack effectiveness. To improve attack stealth, it utilizes precise poisoning via trajectory filtering for normal performance and batch poisoning for trigger consistency. Extensive evaluations demonstrate that TrojanTO effectively implants backdoor attacks across diverse tasks and attack objectives with a low attack budget (0.3\% of trajectories). Furthermore, TrojanTO exhibits broad applicability to DT, GDT, and DC, underscoring its scalability across diverse TO model architectures.

URL PDF HTML ☆

赞 0 踩 0

2505.21627 2026-05-29 cs.GT cs.AI cs.CY cs.LG 版本更新

Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives

你的大语言模型是否在过度收费？分词、透明度与激励

Ander Artola Velasco, Stratis Tsirtsis, Nastaran Okati, Manuel Gomez-Rodriguez

发表机构 * Ander Artola Velasco（1. 阿德纳·阿尔托拉·韦拉斯科）； Stratis Tsirtsis（2. 斯特拉蒂斯·蒂尔蒂斯）； Nastaran Okati（3. 纳斯塔兰·奥卡蒂）

AI总结研究当前按token计费机制下，服务提供商可能通过策略性报告token数量来过度收费，并提出按字符线性定价的激励相容机制以消除该财务激励。

Comments Selected as an oral presentation at ICML 2026

详情

AI中文摘要

最先进的大语言模型需要专门的硬件和大量能源来运行。因此，提供大语言模型访问的基于云的服务变得非常流行。在这些服务中，用户为模型生成的输出支付的价格取决于模型用于生成该输出的token数量：他们为每个token支付固定价格。在这项工作中，我们表明这种定价机制为提供商创造了财务激励，使其策略性地虚报模型用于生成输出的token数量，而用户无法证明甚至不知道提供商是否在过度收费。然而，我们也表明，如果不诚实的提供商被强制要求透明地说明模型使用的生成过程，那么在不引起怀疑的情况下最优地虚报是困难的。尽管如此，作为概念验证，我们开发了一种高效的启发式算法，使提供商能够在不引起怀疑的情况下显著过度收费用户。关键的是，我们证明运行该算法的成本低于从过度收费用户中获得的额外收入，突显了当前按token计费机制下用户的脆弱性。此外，我们表明，为了消除策略性行为的财务激励，定价机制必须根据token的字符数线性定价。虽然这会使提供商的利润率因token而异，但我们引入了一个简单的方案，采用这种激励相容定价机制的提供商可以维持他们在按token计费机制下的平均利润率。在此过程中，为了说明和补充我们的理论结果，我们使用来自$ exttt{Llama}$、$ exttt{Gemma}$和$ exttt{Ministral}$系列的几个大语言模型以及来自LMSYS Chatbot Arena平台的输入提示进行了实验。

英文摘要

State-of-the-art large language models require specialized hardware and substantial energy to operate. As a consequence, cloud-based services that provide access to large language models have become very popular. In these services, the price users pay for an output provided by a model depends on the number of tokens the model uses to generate it: they pay a fixed price per token. In this work, we show that this pricing mechanism creates a financial incentive for providers to strategize and misreport the (number of) tokens a model used to generate an output, and users cannot prove, or even know, whether a provider is overcharging them. However, we also show that, if an unfaithful provider is obliged to be transparent about the generative process used by the model, misreporting optimally without raising suspicion is hard. Nevertheless, as a proof-of-concept, we develop an efficient heuristic algorithm that allows providers to significantly overcharge users without raising suspicion. Crucially, we demonstrate that the cost of running the algorithm is lower than the additional revenue from overcharging users, highlighting the vulnerability of users under the current pay-per-token pricing mechanism. Further, we show that, to eliminate the financial incentive to strategize, a pricing mechanism must price tokens linearly on their character count. While this makes a provider's profit margin vary across tokens, we introduce a simple prescription under which the provider who adopts such an incentive-compatible pricing mechanism can maintain the average profit margin they had under the pay-per-token pricing mechanism. Along the way, to illustrate and complement our theoretical results, we conduct experiments with several large language models from the $\texttt{Llama}$, $\texttt{Gemma}$ and $\texttt{Ministral}$ families, and input prompts from the LMSYS Chatbot Arena platform.

URL PDF HTML ☆

赞 0 踩 0

2505.20955 2026-05-29 cs.CR cs.LG 版本更新

Enhancing Membership Inference Attacks on Diffusion Models from a Frequency-Domain Perspective

从频域角度增强扩散模型的成员推理攻击

Puwei Lian, Yujun Cai, Songze Li, Bingkun Bao

发表机构 * Southeast University（东南大学）； The University of Queensland（昆士兰大学）； Hefei University of Technology（合肥工业大学）； Engineering Research Center of Blockchain Application, Supervision and Management (Southeast University), Ministry of Education（区块链应用、监督与管理工程研究中心（东南大学），教育部）

AI总结本文从频域角度揭示扩散模型处理高频信息的缺陷导致成员推理攻击误分类，并提出即插即用的高频滤波模块以提升攻击性能。

Comments Accepted to Forty-Third International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

扩散模型在图像生成方面取得了巨大成功，但也引发了关于隐私和版权的重要担忧。成员推理攻击（MIAs）旨在确定特定数据是否在模型训练阶段被使用。由于当前针对扩散模型的MIAs通常利用模型的图像预测能力，我们将其形式化为一个统一的一般范式，通过计算成员分数进行成员识别。在该范式下，我们通过实验发现现有攻击忽略了扩散模型处理高频信息时的固有缺陷。因此，该缺陷导致包含更多高频内容的成员数据被误分类为留出数据，而高频内容较少的留出数据则倾向于被误分类为成员数据。此外，我们从理论上证明该缺陷降低了攻击的成员优势，从而干扰了对成员数据和留出数据的有效区分。基于这一发现，我们提出了一种即插即用的高频滤波模块，以减轻该缺陷的不利影响，该模块可以无缝集成到一般范式中的任何攻击中，且无需额外时间成本。大量实验证实，该模块在不同数据集和模型上显著提升了基线攻击的性能。代码可在 https://github.com/poetic2/FreMIA 获取。

英文摘要

Diffusion models have achieved tremendous success in image generation, but they also raise significant concerns regarding privacy and copyright issues. Membership Inference Attacks (MIAs) are designed to ascertain whether specific data was utilized during a model's training phase. As current MIAs for diffusion models typically exploit the model's image prediction ability, we formalize them into a unified general paradigm that computes the membership score for membership identification. Under this paradigm, we empirically find that existing attacks overlook the inherent deficiency in how diffusion models process high-frequency information. Consequently, this deficiency leads to member data with more high-frequency content being misclassified as hold-out data, and hold-out data with less high-frequency content tends to be misclassified as member data. Moreover, we theoretically demonstrate that this deficiency reduces the membership advantage of attacks, thereby interfering with the effective discrimination of member data and hold-out data. Based on this insight, we propose a plug-and-play high-frequency filter module to mitigate the adverse effects of the deficiency, which can be seamlessly integrated into any attacks within the general paradigm without additional time costs. Extensive experiments corroborate that this module significantly improves the performance of baseline attacks across different datasets and models. Code is available at https://github.com/poetic2/FreMIA.

URL PDF HTML ☆

赞 0 踩 0

2505.13745 2026-05-29 cs.LG stat.ML 版本更新

Synthetic Non-stationary Data Streams for Recognition of the Unknown

用于未知识别的合成非平稳数据流

Joanna Komorniczak

发表机构 * Wrocław University of Science and Technology（沃拉夫大学科学与技术学院）

AI总结提出一种同时包含概念漂移和新类出现的合成数据流生成策略，并评估无监督漂移检测器在开放集识别任务中的表现。

详情

DOI: 10.1007/978-3-032-19102-1_9

AI中文摘要

数据非平稳性问题在数据流处理中常被讨论。在动态环境中，方法应持续准备分析时变数据——因此，它们应支持增量训练并应对概念漂移。非平稳数据流环境中另一个同样重要的变化是新的、先前未知类别的出现。通常，方法专注于这两种现象之一——检测概念漂移或检测新类别——而数据流中可能同时出现这两种困难。此外，关于先前未知的观测，开放类别集的话题近年来变得尤为重要，方法的目标是在已知类别内高效分类，并识别模型能力范围外的对象。本文提出一种合成数据流生成策略，其中同时出现概念漂移和代表未知对象的新类别。所呈现的研究展示了无监督漂移检测器如何处理检测新类别和概念漂移的任务，并演示了生成的数据流如何用于开放集识别任务。

英文摘要

The problem of data non-stationarity is commonly addressed in data stream processing. In a dynamic environment, methods should continuously be ready to analyze time-varying data -- hence, they should enable incremental training and respond to concept drifts. An equally important variability typical for non-stationary data stream environments is the emergence of new, previously unknown classes. Often, methods focus on one of these two phenomena -- detection of concept drifts or detection of novel classes -- while both difficulties can be observed in data streams. Additionally, concerning previously unknown observations, the topic of open set of classes has become particularly important in recent years, where the goal of methods is to efficiently classify within known classes and recognize objects outside the model competence. This article presents a strategy for synthetic data stream generation in which both concept drifts and the emergence of new classes representing unknown objects occur. The presented research shows how unsupervised drift detectors address the task of detecting novelty and concept drifts and demonstrates how the generated data streams can be utilized in the open set recognition task.

URL PDF HTML ☆

赞 0 踩 0

2505.02604 2026-05-29 cs.LG 版本更新

Connecting Independently Trained Modes via Layer-Wise Connectivity

通过逐层连接性连接独立训练的模态

Yongding Tian, Zaid Al-Ars, Maksim Kitsak, Peter Hofstee

发表机构 * Computer Engineering Lab, Delft University of Technology, Delft, NL（代尔夫特理工大学计算机工程实验室）； Network and Architecture Service, Delft University of Technology, Delft, NL（代尔夫特理工大学网络与架构服务）； IBM Infrastructure, TX, USA（IBM基础设施）

AI总结提出一种新的经验算法，通过逐层连接性构建独立训练神经网络模型之间的连续低损失路径，在多种现代架构上实现更一致的模式连接。

Comments 28 pages, 22 figures, accepted in ICML 2026: https://openreview.net/forum?id=4VOTzpH9MO

详情

AI中文摘要

实证研究表明，可以在独立训练的神经网络模型之间构建连续的低损失路径。这种现象称为模式连接性，指的是在参数空间中不同模式（即训练良好的解）之间存在这样的路径。然而，现有的经验方法不能可靠地连接独立训练的模态，并且主要在一组狭窄的架构（例如，基本的CNN、VGG和ResNet）上进行了评估，使得它们在新模型上的有效性尚不清楚。在这项工作中，我们提出了一种新的经验算法，用于连接独立训练的模态，该算法超越了传统架构，支持更广泛的网络，包括MobileNet、ShuffleNet、EfficientNet、RegNet、深度层聚合（DLA）和紧凑卷积变换器（CCT）。除了更广泛的适用性外，所提出的方法在独立训练的模态对之间产生更一致的连接路径，并支持连接使用不同训练超参数获得的模态。

英文摘要

Empirical studies have shown that continuous low-loss paths can be constructed between independently trained neural network models. This phenomenon, known as mode connectivity, refers to the existence of such paths between distinct modes-i.e., well-trained solutions in parameter space. However, existing empirical methods do not reliably connect independently trained modes and have been evaluated mainly on a narrow set of architectures (e.g., basic CNNs, VGG, and ResNet), leaving their effectiveness on newer models unclear. In this work, we propose a new empirical algorithm for connecting independently trained modes that generalizes beyond traditional architectures and supports a broader range of networks, including MobileNet, ShuffleNet, EfficientNet, RegNet, Deep Layer Aggregation (DLA), and Compact Convolutional Transformers (CCT). In addition to broader applicability, the proposed method yields more consistent connectivity paths across independently trained mode pairs and supports connecting modes obtained with different training hyperparameters.

URL PDF HTML ☆

赞 0 踩 0

2505.02069 2026-05-29 cs.LG stat.ML 版本更新

Neural Logistic Bandits

神经逻辑老虎机

Seoungbin Bae, Dabeen Lee

发表机构 * Department of Industrial \& Systems Engineering, KAIST ； Department of Mathematical Sciences \& Research Institute of Mathematics, Seoul National University ； Interdisciplinary Program in Artificial Intelligence, Seoul National University

AI总结针对神经逻辑老虎机问题，利用一种新型的自归一化向量值鞅的Bernstein型不等式，提出两种算法NeuralLog-UCB-1和NeuralLog-UCB-2，分别实现与有效维度相关的遗憾上界，改进了现有结果。

详情

AI中文摘要

我们研究了神经逻辑老虎机问题，其主要任务是通过神经网络学习逻辑链接函数内的未知奖励函数。现有方法要么对$κ$（其中$1/κ$表示奖励分布的最小方差）有不利的依赖，要么直接依赖于特征维度$d$，而在基于神经网络的设置中$d$可能非常大。在这项工作中，我们引入了一种新型的自归一化向量值鞅的Bernstein型不等式，旨在绕过对环境维度的直接依赖。这使我们能够推导出一个遗憾上界，该上界随有效维度$\widetilde{d}$增长，而不是特征维度，同时保持对$κ$的最小依赖。基于该集中不等式，我们提出了两种算法NeuralLog-UCB-1和NeuralLog-UCB-2，它们分别保证了$\widetilde{O}(\widetilde{d}\sqrt{κT})$和$\widetilde{O}(\widetilde{d}\sqrt{T/κ})$阶的遗憾上界，改进了现有结果。最后，我们在合成数据集和真实数据集上报告了数值结果，以验证我们的理论发现。

英文摘要

We study the problem of neural logistic bandits, where the main task is to learn an unknown reward function within a logistic link function using a neural network. Existing approaches either exhibit unfavorable dependencies on $κ$, where $1/κ$ represents the minimum variance of reward distributions, or suffer from direct dependence on the feature dimension $d$, which can be huge in neural network-based settings. In this work, we introduce a novel Bernstein-type inequality for self-normalized vector-valued martingales that is designed to bypass a direct dependence on the ambient dimension. This lets us deduce a regret upper bound that grows with the effective dimension $\widetilde{d}$, not the feature dimension, while keeping a minimal dependence on $κ$. Based on the concentration inequality, we propose two algorithms, NeuralLog-UCB-1 and NeuralLog-UCB-2, that guarantee regret upper bounds of order $\widetilde{O}(\widetilde{d}\sqrt{κT})$ and $\widetilde{O}(\widetilde{d}\sqrt{T/κ})$, respectively, improving on the existing results. Lastly, we report numerical results on both synthetic and real datasets to validate our theoretical findings.

URL PDF HTML ☆

赞 0 踩 0

2502.20954 2026-05-29 cs.LG 版本更新

Robust and Efficient Writer-Independent IMU-Based Handwriting Recognition

鲁棒且高效的独立于书写者的基于IMU的手写识别

Jindong Li, Tim Hamann, Jens Barth, Peter Kämpf, Dario Zanca, Björn Eskofier

发表机构 * Machine Learning and Data Analytics Lab, Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen, Germany（机器学习与数据分析实验室，埃朗根-纽伦堡大学，埃朗根，德国）； STABILO International GmbH, Heroldsberg, Germany（STABILO国际有限公司，赫尔兹堡，德国）； Translational Digital Health Group, Institute of AI for Health, Helmholtz Zentrum München - German Research Center for Environmental Health, Neuherberg, Germany（转化数字健康组，健康人工智能研究所，慕尼黑-德国环境健康研究中心，纽赫堡，德国）

AI总结提出一种结合CNN编码器和BiLSTM解码器的模型，在IMU数据上实现独立于书写者的手写识别，在OnHW数据集和自建数据集上分别达到7.37%和9.44%的字符错误率，并展现出对未见书写风格的鲁棒性。

Comments Accepted at iWOAR 2025. Published in Springer LNCS, 2026. Code available at https://github.com/jindongli24/REWI

详情

DOI: 10.1007/978-3-032-13312-0_16
Journal ref: Sensor-Based Activity Recognition and Artificial Intelligence (iWOAR 2025), Lecture Notes in Computer Science, pp. 261-286, Springer, Cham, 2026

AI中文摘要

使用惯性测量单元（IMU）数据进行手写识别（HWR）由于书写风格的多样性和数据集的有限性仍然具有挑战性。以往的方法往往难以处理未见过的书写者的手写，使得独立于书写者（WI）的识别成为一个关键但困难的问题。本文提出了一种模型，旨在提高基于IMU数据的WI HWR性能，该模型使用CNN编码器和基于BiLSTM的解码器。我们的方法对未见过的书写风格表现出强大的鲁棒性，在公共OnHW数据集和我们基于单词的数据集的WI划分上均优于现有方法，分别实现了7.37%和9.44%的字符错误率（CER），以及15.12%和32.17%的词错误率（WER）。鲁棒性评估表明，我们的模型在不同年龄组中保持优越性能，并且从一个组学到的知识相比其他方法能更好地泛化到另一个组。在我们基于句子的数据集上的评估进一步展示了识别完整句子的潜力。通过全面的消融研究，我们表明我们的设计选择在性能和效率之间实现了良好的平衡。这些发现支持开发更适应和可扩展的HWR系统用于实际应用。

英文摘要

Handwriting recognition (HWR) using inertial measurement unit (IMU) data remains challenging due to variations in writing styles and the limited availability of datasets. Previous approaches often struggle with handwriting from unseen writers, making writer-independent (WI) recognition a crucial yet difficult problem. This paper presents a model designed to improve WI HWR on IMU data, using a CNN encoder and BiLSTM-based decoder. Our approach demonstrates strong robustness to unseen handwriting styles, outperforming existing methods on the WI splits of both the public OnHW dataset and our word-based dataset, achieving character error rates (CERs) of 7.37% and 9.44%, and word error rates (WERs) of 15.12% and 32.17%, respectively. Robustness evaluation shows that our model maintains superior performance across different age groups, with knowledge learned from one group generalizing better to another compared to other approaches. Evaluation on our sentence-based dataset further demonstrates the potential for recognizing full sentences. Through comprehensive ablation studies, we show that our design choices achieve a strong balance between performance and efficiency. These findings support the development of more adaptable and scalable HWR systems for real-world applications.

URL PDF HTML ☆

赞 0 踩 0

2502.20838 2026-05-29 cs.SD cs.AI cs.LG eess.AS 版本更新

Weakly Supervised Detection and Temporal Localization of Whale Calls in Long-Duration Bioacoustic Data

弱监督检测与长时间生物声学数据中鲸叫声的时间定位

Ragib Amin Nihal, Benjamin Yen, Runwu Shi, Takeshi Ashizawa, Kazuhiro Nakadai

发表机构 * Systems and Control Engineering, School of Engineering, Institute of Science Tokyo, Japan（东京科学研究院工程学院系统与控制工程系）

AI总结提出DSMIL-LocNet框架，利用弱监督多实例学习仅使用录音级标签实现鲸叫声的分类和时间定位，在长录音上优于全监督基线。

Comments Accepted in European Signal Processing Conference (EUSIPCO) 2026

详情

AI中文摘要

被动声学监测（PAM）系统生成持续数月连续录音，但自动化生物声学分析鲸叫声需要两种独立的标注工作：用于分类的二元存在标签和用于定位的精确时间边界。一个多分钟录音的二元标签可以在几秒钟内分配，但对其中的每个叫声打时间戳需要数小时的专家努力。在操作规模上同时提供两者是不可行的。我们提出DSMIL-LocNet，一个弱监督多实例学习（MIL）框架，仅使用录音级存在/缺失标签执行分类和时间定位。我们的双流架构整合频谱和时间特征，处理2-30分钟的录音，而无需现有CNN方法在长输入上退化的时间压缩。在AcousticTrends BlueFinLibrary上，DSMIL-LocNet在300-1800秒录音上达到F1分数0.88-0.91，而全监督CNN基线退化为0.19-0.64。它还提供这些基线在没有帧级标注的情况下无法产生的时间定位。代码：https://github.com/Ragib-Amin-Nihal/DSMIL-LocNet

英文摘要

Passive acoustic monitoring (PAM) systems generate continuous recordings spanning months, yet automated bioacoustic analysis of whale calls requires two separate annotation efforts: binary presence labels for classification and precise temporal boundaries for localization. A binary label for a multi-minute recording can be assigned in seconds, but timestamping every call within it requires hours of expert effort. Providing both is infeasible at operational scale. We present DSMIL-LocNet, a weakly supervised multiple instance learning (MIL) framework that performs both classification and temporal localization using only recording-level presence/absence labels. Our dual-stream architecture integrates spectral and temporal features to process recordings of 2--30 minutes without the temporal compression that degrades existing CNN methods on long inputs. On the AcousticTrends BlueFinLibrary, DSMIL-LocNet achieves F1 scores of 0.88--0.91 on recordings of 300--1800s, where fully supervised CNN baselines degrade to 0.19--0.64. It also provides temporal localization that these baselines cannot produce without frame-level annotation. Code: https://github.com/Ragib-Amin-Nihal/DSMIL-Loc

URL PDF HTML ☆

赞 0 踩 0

2502.10330 2026-05-29 cs.LG 版本更新

Diffusion-based learning framework for Constrained Nonconvex Optimization with Weighted Bootstrapped Refinement

基于扩散的约束非凸优化学习框架与加权自举细化

Shutong Ding, Yimiao Zhou, Ke Hu, Xi Yao, Junchi Yan, Xiaoying Tang, Ye Shi

发表机构 * ShanghaiTech University（上海科技大学）； MoE Key Laboratory of Intelligent Perception（智能感知MoE重点实验室）； Human Machine Collaboration（人机协同）； Shanghai Jiao Tong University（上海交通大学）； China Mobile Communications Company Limited Research Institute（中国移动通信公司有限研究院）； The Chinese University of Hong Kong, Shenzhen（香港中文大学（深圳））

AI总结提出DiOpt框架，通过监督预热和自举训练两阶段学习噪声到约束区域的映射，解决扩散模型在约束非凸优化中的分布错位问题，实现高约束满足和最优性。

Comments accepted by ICML2026

详情

AI中文摘要

扩散模型的最新进展显示出通过利用其多模态性来加速非凸问题求解的潜力。然而，现有的大多数基于扩散的优化方法依赖于监督学习，并且缺乏强制执行约束满足的机制，而这在现实应用中是需要满足的。在这种情况下，我们研究并理论分析了监督扩散求解器的固有问题，并识别出分布错位问题，即生成的解分布在可行区域上的概率质量通常较低。为了解决这个问题，我们提出了DiOpt，一种新的基于扩散的约束非凸优化学习框架，它有效地学习了从噪声到约束区域的映射。具体来说，该框架在两个不同的阶段运行：初始的预热阶段，通过监督学习实现，随后是自举训练阶段。这种双阶段架构旨在迭代地细化解，从而在高度满足约束的情况下改进目标函数。最后，我们还在推理中采用解选择技术以获得更好的最优性。值得注意的是，DiOpt是首次成功将扩散求解器集成到约束非凸优化中。在多样化的非凸任务上的评估显示了DiOpt在最优性和约束满足方面的优越性。我们的官方页面发布在https://dingsht.tech/diopt-webpage。

英文摘要

Recent advances in diffusion models show promising potential to accelerate nonconvex problem solving by leveraging their multimodality. However, most existing diffusion-based optimization approaches rely on supervised learning and lack a mechanism to enforce constraint satisfaction, which is required in real-world applications. In that case, we investigate and theoretically analyze the inherent problem of supervised diffusion solvers and identify the distributional misalignment problem, i.e., the generated solution distribution often exhibits low probability mass on the feasible region. To resolve this issue, we propose DiOpt, a new diffusion-based learning framework for constrained nonconvex optimization, which effectively learns the mapping from noise to the constraint region. Specifically, this framework operates in two distinct phases: an initial warm-start phase, implemented via supervised learning, followed by a bootstrapping training phase. This dual-phase architecture is designed to iteratively refine solutions, thereby improving the objective function with high constraint satisfaction. Finally, we also employ a solution selection technique in inference for better optimality. Notably, DiOpt is the first successful integration of the diffusion solver in constrained nonconvex optimization. Evaluations on diverse nonconvex tasks demonstrate the superiority of DiOpt in both optimality and constraint satisfaction. Our official page is released at https://dingsht.tech/diopt-webpage.

URL PDF HTML ☆

赞 0 踩 0

2502.10205 2026-05-29 cs.LG 版本更新

Looking around you: external information enhances representations for event sequences

环顾四周：外部信息增强事件序列的表示

Petr Sokerin, Maria Kovaleva, Ekaterina Boyarina, Pavel Tikhomirov, Denis Vorobiyov, Alexey Zaytsev

发表机构 * LARSS Laboratory, AI Center, Skoltech（LARSS实验室、人工智能中心、Skoltech）

AI总结针对事件序列表示学习中忽略同时发生序列上下文的问题，提出通过聚合多个用户表示来增强特定用户表示的方法，其中可学习注意力机制在多个数据集上显著提升指标。

详情

AI中文摘要

表示学习在不同领域产生模型，例如商店购买、客户交易和一般人的行为。然而，这类用于事件序列的模型通常孤立地处理每个序列，忽略了那些在时间上同时发生的序列的上下文。这种限制在金融和电子商务等条件快速变化的领域，或当某些序列缺乏近期事件时尤其成问题。我们开发了一种方法，从多个用户表示中聚合信息，在多个同时发生的事件序列的设置中增强特定用户的表示，实现了比独立处理每个序列更好的质量。我们的研究考虑了多种聚合方法，从简单的池化技术到可学习注意力聚合，后者可以突出其他用户之间更复杂的信息流。所提出的方法在现有编码器之上运行，并支持其高效微调。在九个多样化的事件序列数据集（金融、电子商务、娱乐等）和下游任务中，可学习注意力在有无微调的情况下均改善了指标分数，而均值池化虽然增益较小但仍然显著。

英文摘要

Representation learning produces models in different domains, such as store purchases, client transactions, and general people's behavior. However, such models for event sequences usually process each sequence in isolation, ignoring context from those that co-occur in time. This limitation is particularly problematic in domains with fast-evolving conditions, like finance and e-commerce, or when certain sequences lack recent events. We develop a method that aggregates information from multiple user representations, augmenting a specific user's representation in a setting with multiple co-occurring event sequences, achieving better quality than processing each sequence independently. Our study considers diverse aggregation approaches, ranging from simple pooling techniques to Learnable attention aggregation, that can highlight more complex information flow among other users. The proposed methods operate on top of an existing encoder and support its efficient fine-tuning. Across nine diverse event sequence datasets (finance, e-commerce, entertainment, etc.) and downstream tasks, Learnable attention improves metric scores, both with and without fine-tuning, while mean pooling yields a smaller but still significant gain.

URL PDF HTML ☆

赞 0 踩 0

2502.01360 2026-05-29 cs.LG math.AT q-bio.NC 版本更新

A Quotient Homology Theory of Representation in Neural Networks

神经网络表示的商同调理论

Kosio Beshkov

发表机构 * Department of Physics, University of Oslo（奥斯陆大学物理系）

AI总结利用ReLU神经网络的分片线性性质，定义输入数据集上的等价关系并构造商空间，证明在凸性条件下神经表示的同调群与商同调群同构，从而无需外部度量即可计算Betti数。

详情

Journal ref: Transactions on Machine Learning Research, 05/2026, https://openreview.net/forum?id=RluspxztzS

AI中文摘要

先前的研究已经证明，使用ReLU激活函数的神经网络所实现的映射集合与分片线性连续映射的集合相同。此外，这类网络诱导一个超平面排列，将网络的输入域分割成凸多面体$G_J$，网络$Φ$在这些多面体上以仿射方式运行。在本文中，我们利用这些性质在输入数据集上定义一个等价关系$\sim_Φ$，该关系定义了一个商空间，该商空间可被分割成两个集合，分别与$Φ_J$的局部秩以及交集$\cap ext{Im}Φ_{J_i}$相关。我们将后者称为 extit{重叠分解}$\mathcal{O}_Φ$，并证明如果每个多面体与输入流形之间的交集是凸的，则神经表示的同调群与商同调群$H_k(Φ(\mathcal{M})) \simeq H_k(\mathcal{M}/\mathcal{O}_Φ)$同构。这使我们能够在不选择外部度量的情况下内在地计算神经表示的Betti数。我们开发了通过线性规划和并查集算法数值计算重叠分解的方法。利用这一框架，我们在玩具数据集上进行了若干实验，表明与标准持续同调相比，基于重叠同调的Betti数计算追踪的是纯拓扑特征而非几何特征。最后，我们研究了几个分类问题中训练过程中重叠分解的演化，并讨论了该方法的一些缺点。

英文摘要

Previous research has proven that the set of maps implemented by neural networks with a ReLU activation function is identical to the set of piecewise linear continuous maps. Furthermore, such networks induce a hyperplane arrangement splitting the input domain of the network into convex polyhedra $G_J$ over which a network $Φ$ operates in an affine manner. In this work, we leverage these properties to define an equivalence relation $\sim_Φ$ on top of an input dataset, which defines a quotient space that can be split into two sets related to the local rank of $Φ_J$ and the intersections $\cap \text{Im}Φ_{J_i}$. We refer to the latter as the \textit{overlap decomposition} $\mathcal{O}_Φ$ and prove that if the intersections between each polyhedron and an input manifold are convex, the homology groups of neural representations are isomorphic to quotient homology groups $H_k(Φ(\mathcal{M})) \simeq H_k(\mathcal{M}/\mathcal{O}_Φ)$. This lets us intrinsically calculate the Betti numbers of neural representations without the choice of an external metric. We develop methods to numerically compute the overlap decomposition through linear programming and a union-find algorithm. Using this framework, we perform several experiments on toy datasets showing that, compared to standard persistent homology, our overlap homology-based computation of Betti numbers tracks purely topological rather than geometric features. Finally, we study the evolution of the overlap decomposition during training on several classification problems and discuss some shortcomings of our method.

URL PDF HTML ☆

赞 0 踩 0

2412.00452 2026-05-29 cs.LG cs.CV 版本更新

Learning Locally, Revising Globally: Global Reviser for Federated Learning with Noisy Labels

局部学习，全局修正：面向含噪标签联邦学习的全局修正器

Yuxin Tian, Mouxing Yang, Yuhao Zhou, Jian Wang, Qing Ye, Tongliang Liu, Gang Niu, Jiancheng Lv

发表机构 * College of Computer Science, Sichuan University, Chengdu, China（四川大学计算机学院，中国成都）； Engineering Research Center of Machine Learning（机器学习工程研究中心）； University of Sydney, Sydney, Australia（悉尼大学，澳大利亚悉尼）； Southeast University, Nanjing, China（东南大学，中国南京）

AI总结针对联邦学习中标签噪声与数据异质性共存的问题，提出一种利用全局模型慢记忆特性的联邦全局修正器（FedGR），通过三个模块协同修正噪声标签并正则化局部训练，在三个基准上优于八种基线方法。

Comments ICML 2026 Camera Ready

详情

AI中文摘要

传统的联邦学习（FL）严重依赖高质量标签，这在实际应用中往往不现实，导致联邦标签噪声（F-LN）问题。更糟糕的是，FL的异质性加剧了F-LN问题，因为客户端经历不同的标签噪声类型、比率和数据分布。在本研究中，我们首先观察到FL的全局模型表现出对噪声标签的缓慢记忆现象，这表明其在FL中能够维持可靠的预测和鲁棒的表示。受此启发，我们提出了一种名为联邦全局修正器（FedGR）的新方法，这是一种直接而有效的方法，包含三个模块，协同修正噪声标签并正则化局部训练。通过利用这一固有属性，FedGR以自包含的方式提高了FL对标签噪声的鲁棒性。在三个广泛使用的F-LN基准上的大量实验表明，即使在严重的标签噪声和数据异质性下，FedGR也表现出优越的性能，始终优于八个最先进的基线。代码：https://github.com/cs-yuxintian/FedGR-ICML26

英文摘要

Conventional federated learning (FL) heavily depends on high-quality labels, which are often impractical in the real world, leading to the federated label-noise (F-LN) problem. Worse still, the F-LN problem is exacerbated by the heterogeneity of FL, whereas clients experience different label-noise types, ratios, and data distribution. In this study, we first observe an intriguing phenomenon that the global model of FL exhibits a slow memorization of noisy labels, suggesting its ability to maintain reliable predictions and robust representations in FL. Motivated by this, we propose a novel method termed Federated Global Reviser (\method), a straightforward yet effective method comprising three modules that collaboratively rectify noisy labels and regularize local training. By exploiting this inherent property, \method\ improves the label-noise robustness of FL in a self-contained manner. Extensive experiments on three widely used F-LN benchmarks demonstrate the superior performance of FedGR, consistently outperforming eight state-of-the-art baselines even in severe label-noise and data heterogeneity. Code: https://github.com/cs-yuxintian/FedGR-ICML26

URL PDF HTML ☆

赞 0 踩 0

2411.03006 2026-05-29 math.CO cs.CC cs.DM cs.LG math.OC 版本更新

Neural Networks and (Virtual) Extended Formulations

神经网络与（虚拟）扩展公式

Christoph Hertrich, Georg Loho

发表机构 * Georg Loho\ Universität Berlin \& University of Twente

AI总结通过将神经网络表示能力与多面体的扩展复杂度关联，证明单调或输入凸神经网络规模的下界，并引入虚拟扩展复杂度以推广到一般神经网络。

详情

AI中文摘要

具有分段线性激活函数（如修正线性单元（ReLU）或maxout）的神经网络是现代机器学习中最基础的模型之一。我们通过将其表示能力与多面体$P$的扩展复杂度$\mathrm{xc}(P)$联系起来，向证明此类神经网络规模的下界迈出了一步。$\mathrm{xc}(P)$是组合优化和多面体几何中一个被充分研究的概念，描述了将$P$建模为线性规划所需的不等式数量。我们证明，$\mathrm{xc}(P)$是任何解决$P$上线性优化问题的单调或输入凸神经网络规模的下界。这暗示了此类神经网络在多种问题（包括多项式可解的最大权匹配问题）上的指数级下界。为了尝试对一般神经网络也证明类似的下界，我们引入了虚拟扩展复杂度$\mathrm{vxc}(P)$的概念，它推广了$\mathrm{xc}(P)$，描述了将$P$上的线性优化问题表示为两个线性规划之差所需的不等式数量。我们证明$\mathrm{vxc}(P)$是任何在$P$上进行优化的神经网络规模的下界。虽然推导$\mathrm{vxc}(P)$的有用下界仍是一个开放问题，但我们通过证明给定具有小编码大小的虚拟扩展公式可以高效优化多面体$P$，论证了这一概念值得独立于神经网络进行研究。

英文摘要

Neural networks with piecewise linear activation functions, such as rectified linear units (ReLU) or maxout, are among the most fundamental models in modern machine learning. We make a step towards proving lower bounds on the size of such neural networks by linking their representative capabilities to the notion of the extension complexity $\mathrm{xc}(P)$ of a polytope $P$. This is a well-studied quantity in combinatorial optimization and polyhedral geometry describing the number of inequalities needed to model $P$ as a linear program. We show that $\mathrm{xc}(P)$ is a lower bound on the size of any monotone or input-convex neural network that solves the linear optimization problem over $P$. This implies exponential lower bounds on such neural networks for a variety of problems, including the polynomially solvable maximum weight matching problem. In an attempt to prove similar bounds also for general neural networks, we introduce the notion of virtual extension complexity $\mathrm{vxc}(P)$, which generalizes $\mathrm{xc}(P)$ and describes the number of inequalities needed to represent the linear optimization problem over $P$ as a difference of two linear programs. We prove that $\mathrm{vxc}(P)$ is a lower bound on the size of any neural network that optimizes over $P$. While it remains an open question to derive useful lower bounds on $\mathrm{vxc}(P)$, we argue that this quantity deserves to be studied independently from neural networks by proving that one can efficiently optimize over a polytope $P$ given a virtual extended formulation with small encoding size.

URL PDF HTML ☆

赞 0 踩 0

2410.23222 2026-05-29 cs.LG cs.AI stat.ML 版本更新

Dataset-Driven Channel Masks in Transformers for Multivariate Time Series

数据集驱动的Transformer通道掩码用于多变量时间序列

Seunghan Lee, Taeyoung Park, Kibok Lee

发表机构 * Department of Statistics and Data Science, Yonsei University（延世大学统计与数据科学系）； LG AI Research（LG人工智能研究）

AI总结提出部分通道依赖（PCD）概念，通过数据集特定的通道掩码（CMs）改进Transformer中的通道依赖建模，并在多种任务和数据集上验证有效性。

Comments ICASSP 2026. Preliminary version: NeurIPS Workshop on Time Series in the Age of Large Models 2024 (Oral presentation)

详情

AI中文摘要

最近基础模型的进展已成功扩展到时间序列（TS）领域，这得益于大规模TS数据集的出现。然而，先前的努力主要集中于捕获通道依赖（CD），这对于建模多变量时间序列至关重要，并且基于注意力的方法已被广泛用于此目的。尽管如此，这些方法主要关注修改架构，往往忽略了数据集特定特征的重要性。在这项工作中，我们引入了部分通道依赖（PCD）的概念，通过利用数据集特定信息来增强基于Transformer的模型中的CD建模，从而细化模型捕获的CD。为了实现PCD，我们提出了通道掩码（CMs），通过逐元素乘法将其集成到Transformer的注意力矩阵中。CMs由两个组件组成：1）捕获通道之间关系的相似性矩阵，以及2）数据集特定且可学习的领域参数，用于细化相似性矩阵。我们在多种任务和数据集上使用不同的骨干网络验证了PCD的有效性。代码可在此存储库获取：https://github.com/YonseiML/pcd。

英文摘要

Recent advancements in foundation models have been successfully extended to the time series (TS) domain, facilitated by the emergence of large-scale TS datasets. However, previous efforts have primarily Capturing channel dependency (CD) is essential for modeling multivariate time series (TS), and attention-based methods have been widely employed for this purpose. Nonetheless, these methods primarily focus on modifying the architecture, often neglecting the importance of dataset-specific characteristics. In this work, we introduce the concept of partial channel dependence (PCD) to enhance CD modeling in Transformer-based models by leveraging dataset-specific information to refine the CD captured by the model. To achieve PCD, we propose channel masks (CMs), which are integrated into the attention matrices of Transformers via element-wise multiplication. CMs consist of two components: 1) a similarity matrix that captures relationships between the channels, and 2) dataset-specific and learnable domain parameters that refine the similarity matrix. We validate the effectiveness of PCD across diverse tasks and datasets with various backbones. Code is available at this repository: https://github.com/YonseiML/pcd.

URL PDF HTML ☆

赞 0 踩 0

2409.06439 2026-05-29 cs.LG stat.CO stat.ML 版本更新

Extending Explainable Ensemble Trees (E2Tree) to regression contexts

将可解释集成树（E2Tree）扩展到回归场景

Massimo Aria, Agostino Gnasso, Carmela Iorio, Marjolein Fokkema

发表机构 * Department of Economics and Statistics, University of Naples Federico II（那不勒斯费德里科二世大学经济学与统计学系）； Institute of Psychology, Leiden University（莱顿大学心理学研究所）

AI总结本文通过引入新的不相似度度量，将可解释集成树方法从分类扩展到回归，并在真实数据集上验证其解释能力。

详情

DOI: 10.1002/asmb.70064
Journal ref: Applied Stochastic Models in Business and Industry, Vol. 42, No. 1, e70064 (2026)

AI中文摘要

集成方法如随机森林通过聚合多个弱学习器提供了高精度的预测，改变了监督学习的格局。然而，尽管它们有效，这些方法往往缺乏透明度，阻碍了用户理解随机森林模型如何得出预测。可解释集成树（E2Tree）是一种解释随机森林的新方法，提供了响应变量与预测变量之间关系的图形表示。E2Tree的一个显著特点是它不仅考虑预测变量对响应的影响，还通过计算和使用不相似度度量来考虑预测变量之间的关联。E2Tree方法最初是为分类任务提出的。在本文中，我们将该方法扩展到回归场景。为了展示所提算法的解释能力，我们在真实数据集上进行了演示。

英文摘要

Ensemble methods such as random forests have transformed the landscape of supervised learning, offering highly accurate prediction through the aggregation of multiple weak learners. However, despite their effectiveness, these methods often lack transparency, impeding users' comprehension of how RF models arrive at their predictions. Explainable ensemble trees (E2Tree) is a novel methodology for explaining random forests, that provides a graphical representation of the relationship between response variables and predictors. A striking characteristic of E2Tree is that it not only accounts for the effects of predictor variables on the response but also accounts for associations between the predictor variables through the computation and use of dissimilarity measures. The E2Tree methodology was initially proposed for use in classification tasks. In this paper, we extend the methodology to encompass regression contexts. To demonstrate the explanatory power of the proposed algorithm, we illustrate its use on real-world datasets.

URL PDF HTML ☆

赞 0 踩 0

2406.10238 2026-05-29 cs.CL cs.LG cs.SI 版本更新

基于LoRA的贝叶斯推理中低损失谷的构造与启示

Daniel Dold, Emanuel Sommer, Julius Kobialka, Oliver Dürr, David Rügamer

发表机构 * HTWG Konstanz（康斯坦茨应用科学大学）； LMU Munich（慕尼黑大学）； Munich Center for Machine Learning (MCML)（慕尼黑机器学习中心）

AI总结本文提出LoRA-Curve方法，通过分段贝塞尔曲线参数化在LoRA空间中连接独立最优解，形成连续低损失谷，并结合平坦极小扰动和JS散度正则化，在不牺牲性能的前提下提高预测分布的互信息，实现功能多样性。

详情

AI中文摘要

虽然低秩适应（LoRA）等参数高效微调方法已成为大型语言模型的标准方法，但对认知不确定性的原则性估计仍然具有挑战性。最近在LoRA机制下的结果表明，深度集成等离散多模态方法相比单模态方法几乎没有优势。这与深度学习中的更广泛观察相矛盾，在深度学习中，集成独立最优解通常能改善泛化，而通过连续低损失谷连接这些模态能进一步增强贝叶斯模型平均（BMA）。LoRA空间中是否存在这种结构，以及它是否能产生局部或离散方法所遗漏的功能多样性，尚未被研究。我们引入了LoRA-Curve，一种在LoRA空间中的分段贝塞尔曲线参数化，包含两种变体：一种自由配置，联合优化所有控制点；另一种锚定配置，连接独立微调的LoRA最优解。我们证明了损失沿曲线的路径连续性和Lipschitz正则性，并通过Qwen2.5 7B在推理和分类基准上的实验表明，线性插值会遇到损失障碍，而我们的锚定多段曲线通过连续低损失谷连接独立最优解。结合平坦极小扰动和詹森-香农散度正则化，LoRA-Curve在不牺牲性能的情况下，可测量地提高了预测分布的互信息，并将连续参数空间遍历与功能多样性联系起来。

英文摘要

While parameter-efficient fine-tuning methods like low-rank adaptation (LoRA) are standard for large language models, principled estimation of epistemic uncertainty remains challenging. Recent results in the LoRA regime suggest that discrete multi-mode approaches such as deep ensembles offer little benefit over single-mode methods. This contradicts broader observations in deep learning, where ensembling independent optima typically improves generalization, and linking these modes through continuous low-loss valleys further enhances Bayesian model averaging (BMA). Whether such structure exists in the LoRA space and whether it yields functional diversity missed by local or discrete methods has not been studied. We introduce LoRA-Curve, a segmented Bézier curve parameterization in the LoRA space, with two variants: a free configuration that jointly optimizes all control points, and an anchored configuration that connects independently fine-tuned LoRA optima. We prove pathwise continuity and Lipschitz regularity of the loss along the curve and empirically show, across reasoning and classification benchmarks with Qwen2.5 7B, that linear interpolation encounters loss barriers, while our anchored multi-segment curves connect independent optima through continuous low-loss valleys. Combined with flat-minima perturbations and a Jensen-Shannon divergence regularizer, LoRA-Curve yields measurably higher mutual information of the predictive distribution without sacrificing performance, and links continuous parameter-space traversal to functional diversity.

URL PDF HTML ☆

赞 0 踩 0

2605.29547 2026-05-29 cs.LG cs.AI math.OC 版本更新

Singularity-aware Optimization via Randomized Geometric Probing: Towards Stable Non-smooth Optimization

基于随机几何探测的奇异性感知优化：迈向稳定的非光滑优化

Ruoran Xu, Borong She, Xiaobo Jin, Qiufeng Wang

发表机构 * Xi'an Jiaotong-Liverpool University（西安交通大学利物浦大学）

AI总结针对非光滑优化中Adam优化器的梯度抖动问题，提出奇异性感知Adam（S-Adam），通过局部几何不稳定性（LGI）度量动态调整步长，实现稳定训练并提升泛化性能。

Comments International Conference on Machine Learning (ICML), 2026

详情

AI中文摘要

深度学习优化严重依赖于损失景观平滑的假设，而现代架构由于ReLU激活和量化算子等非光滑组件系统性地违反了这一条件。在这种非光滑情况下，Adam等自适应优化器会出现梯度抖动，即由Clarke次微分内冲突信号引起的剧烈振荡，导致收敛性差和泛化能力欠佳。为解决此问题，我们引入了奇异性感知Adam（S-Adam），一种通过基于局部几何不稳定性动态调整步长来稳定训练的新型优化器。我们的关键贡献是局部几何不稳定性（LGI）度量，一种从随机方向导数方差导出的Clarke次微分直径的计算高效估计量。S-Adam采用自适应阻尼机制exp(-$λ$$ρ$)，在高不稳定性区域减缓更新，同时在平滑盆地保持快速收敛。我们使用微分包含提供了严格的收敛性分析，证明S-Adam以最优的O(1/$\sqrt(T)$)速率几乎必然收敛到($δ$,$ε$)-Clarke稳定点。在量化感知训练（QAT）和高噪声小批量学习上的实证评估表明，S-Adam持续优于AdamW和Prox-SGD，在CIFAR-100上实现高达6%的准确率提升，在TinyImageNet上实现3%的提升，同时有效缓解梯度振荡。

英文摘要

Deep learning optimization relies heavily on the assumption of smooth loss landscapes, a condition systematically violated by modern architectures due to non-smooth components such as ReLU activations and quantization operators. In such non-smooth regimes, adaptive optimizers such as Adam suffer from gradient chattering, violent oscillations caused by conflicting signals within the Clarke subdifferential, leading to poor convergence and suboptimal generalization. To address this, we introduce Singularity-aware Adam (S-Adam), a novel optimizer that stabilizes training by dynamically modulating step sizes based on local geometric instability. Our key contribution is the Local Geometric Instability (LGI) metric, a computationally efficient estimator of the Clarke subdifferential diameter derived from the variance of randomized directional derivatives. S-Adam incorporates an adaptive damping mechanism exp(-$λ$$ρ$) that decelerates updates in high-instability regions while preserving fast convergence in smooth basins. We provide a rigorous convergence analysis using differential inclusions, proving that S-Adam converges almost surely to ($δ$,$ε$)-Clarke stationary points at the optimal O(1/$\sqrt(T)$) rate. Empirical evaluations on Quantization-Aware Training (QAT) and high-noise small-batch learning demonstrate that S-Adam consistently outperforms AdamW and Prox-SGD, achieving accuracy gains of up to 6 percent on CIFAR-100 and 3 percent on TinyImageNet while effectively mitigating gradient oscillations.

URL PDF HTML ☆

赞 0 踩 0

2605.29543 2026-05-29 cs.LG cs.AI cs.CL cs.HC cs.IR 版本更新

SCOPE: A Lightweight-training LLM Framework for Air Traffic Control Readback Monitoring

SCOPE：一种用于空中交通管制复诵监控的轻量训练LLM框架

Qihan Deng, Minghua Zhang, Yang Yang, Zhenyu Gao

发表机构 * Department of Mechanical and Aerospace Engineering, The Hong Kong University of Science and Technology（香港科学与技术大学机械与航空航天工程系）； School of Electronic and Information Engineering, Beihang University（北航电子与信息工程学院）； State Key Laboratory of CNS/ATM（国家空管自动化系统实验室）

AI总结提出SCOPE框架，通过冻结LLM结合插件式开放集分类器和上下文学习机制，实现高效准确的空管复诵监控，在少样本设置下开放集检测准确率达91.05%，异常纠正率96.63%。

详情

AI中文摘要

飞行员对空中交通管制（ATC）语音指令的复诵是航空运输中防止沟通失误的主要保障。然而，复诵异常仍与约80%的航空事故相关。这一脆弱性因交通量增加和认知负荷升高而进一步加剧，从而推动了机器自动化复诵监控的需求。传统的基于规则和机器学习的方法难以在高度可变且不断演变的空管-飞行员通信术语中泛化。尽管大语言模型（LLM）凭借其强大的推理和泛化能力开辟了新途径，但现有方法在实践中仍面临部署和计算障碍。在这项工作中，我们提出了SCOPE（Semantic reasoning for Communication via Open-set Plug-in with Examples），一种新颖的轻量训练LLM框架，提升了基于机器的ATC复诵监控的效率和准确性。核心思想是在冻结的LLM之上，将插件式开放集分类器与精心设计的上下文学习机制相结合。在半合成通信数据集上的大量实验表明，SCOPE在实现运行环境所需的低延迟响应的同时，达到了优越的准确性。在少样本设置下，SCOPE在开放集检测中达到91.05%的准确率，并纠正了96.63%的异常复诵，从而在提供决策解释的同时优于现有最强基线。这些发现证明了我们的框架作为通向可解释和可控的ATC复诵监控的实用途径的潜力。

英文摘要

Pilot readback of Air Traffic Control (ATC) voice instructions is a primary safeguard against miscommunication in air transportation. However, readback anomalies remain implicated in approximately 80% of aviation incidents. This vulnerability is further exacerbated by rising traffic volume and elevated cognitive workload, thereby motivating automated readback monitoring by machine. Traditional rule-based and machine learning approaches struggle to generalize across the highly variable and evolving phraseology of air traffic controller-pilot communications. While Large Language Models (LLMs) have opened a new avenue through their strong reasoning and generalization capabilities, existing approaches still face deployment and computational barriers in practice. In this work, we propose Semantic reasoning for Communication via Open-set Plug-in with Examples (SCOPE), a novel lightweight-training LLM framework that advances both the efficiency and accuracy of machine-based ATC readback monitoring. The core idea is to couple a plug-in open-set classifier with a carefully designed in-context learning mechanism on top of a frozen LLM. Extensive experiments on the semi-synthetic communication dataset show that SCOPE attains superior accuracy while delivering the low-latency response required for operational environments. Under a few-shot setting, SCOPE achieves 91.05% accuracy in open-set detection and corrects 96.63% of anomalous readbacks, thereby outperforming the strongest available baselines while providing explanations for its decisions. These findings demonstrate the potential of our framework as a practical pathway toward interpretable and controllable ATC readback monitoring.

URL PDF HTML ☆

赞 0 踩 0

2605.29537 2026-05-29 cs.CC cs.LG cs.LO 版本更新

The Complexity of Verifying Feedforward Neural Networks in Quantised Settings

量化设置下前馈神经网络验证的复杂性

Eric Alsmann, Martin Lange, Marco Sälzer

发表机构 * University of Kassel（卡塞尔大学）； RPTU University Kaiserslautern-Landau（科布伦茨-劳埃希斯大学）

AI总结研究量化设置下前馈神经网络验证的计算复杂性，区分三类网络并分析线性规划和位向量规范下的复杂性，证明量化网络验证仍为NP完全，并为动态量化网络建立上界。

详情

AI中文摘要

我们研究了量化设置下神经网络验证的计算复杂性。我们区分了三类前馈神经网络（FNNs）：具有精确有理权重的有理FNNs、权重来自有限宽度算术的量化FNNs，以及根据给定有限宽度算术评估有理网络的动态量化FNNs。我们考虑了文献中使用的两种规范类型。线性规划（LP）规范是线性约束的合取，而位向量（BV）规范允许在位级别进行推理，并能表达非线性约束。我们的结果给出了这些验证问题的复杂性全景。对于具有固定算术精度的量化FNNs，我们证明在LP和BV规范下的验证仍然是NP完全的，与有理情况下的复杂性相匹配。对于具有BV规范的动态量化FNNs，我们建立了上界，补充了先前已知的PSPACE-hard结果。

英文摘要

We investigate the computational complexity of neural network verification in quantised settings. We distinguish three classes of Feedforward Neural Networks (FNNs): rational FNNs with exact rational weights, quantised FNNs whose weights come from a finite-width arithmetic, and dynamically quantised FNNs in which rational networks are evaluated with respect to a given finite-width arithmetic. We consider two types of specifications used in the literature. Linear programming (LP) specifications are conjunctions of linear constraints, while bit-vector (BV) specifications allow reasoning at the bit level and can express non-linear constraints. Our results give a complexity landscape of these verification problems. For quantised FNNs with fixed arithmetic precision, we show that verification under both LP and BV specifications remains NP-complete, matching the complexity of the rational case. For dynamically quantised FNNs with BV specifications, we establish upper bounds, complementing a previously known PSPACE-hardness result.

URL PDF HTML ☆

赞 0 踩 0

2605.29535 2026-05-29 cs.LG 版本更新

AsymVLM: Asymmetric Token Pruning for Efficient Vision-Language Model Inference

AsymVLM：面向高效视觉-语言模型推理的非对称令牌剪枝

Yilin Feng, Ahmed Burak Gulhan, Mahmut Taylan Kandemir

发表机构 * The Pennsylvania State University（宾夕法尼亚州立大学）

AI总结针对视觉和文本令牌在预填充与解码阶段的不同特性，提出非对称剪枝方法AsymVLM，通过视觉令牌的激进剪枝和文本令牌的基于阈值的驱逐，实现高达54%的FLOPs节省并在文档和图表理解任务上提升2-3%的准确率。

详情

AI中文摘要

视觉-语言模型（VLM）每张图像处理数千个视觉令牌，而文本令牌相对较少，但现有压缩方法对两种模态一视同仁。我们观察到两种模态具有根本不同的特性：视觉令牌在空间上冗余且主导预填充阶段，而文本令牌具有因果依赖性并在解码过程中累积。基于这种非对称性，我们提出并实证评估了AsymVLM，该方法在预填充前使用学习的重要性评分器结合每样本自适应预算对视觉令牌进行激进剪枝，并仅在文本令牌超过固定预算时执行基于时间阈值的驱逐。实验表明，AsymVLM在现有方法中实现了最高的FLOPs节省（高达54%），同时在视觉信息空间局部化且与查询相关的文档和图表理解任务上，比现有方法提升2-3%的准确率，并在整体基准上保持竞争性精度。在文本主导的场景中，我们的驱逐策略通过适应VLM的短上下文特性，显著优于标准的LLM缓存压缩方法。

英文摘要

Vision-Language Models (VLMs) process thousands of visual tokens per image alongside comparatively few text tokens, yet existing compression methods treat both modalities uniformly. We observe that the two modalities have fundamentally different properties: vision tokens are spatially redundant and dominate prefill, while text tokens are causally dependent and accumulate during decoding. Based on this asymmetry, we propose and empirically evaluate AsymVLM, which applies aggressive pruning to vision tokens before prefill using a learned importance scorer with per-sample adaptive budgeting, and temporal threshold-based eviction to text tokens only when they exceed a fixed budget. Our experiments indicate that AsymVLM achieves the highest FLOPs savings (up to 54%) among state-of-the-art methods while outperforming existing approaches by 2--3% on document and chart understanding tasks where visual information is spatially localized and query-specific, and maintaining competitive accuracy on holistic benchmarks. In text-dominated scenarios, our eviction strategy substantially outperforms standard LLM cache compression methods by adapting to the short-context nature of VLM.

URL PDF HTML ☆

赞 0 踩 0

2605.29531 2026-05-29 cs.SD cs.CV cs.LG 版本更新

Audio Deepfake Detection with Half-Truth Localisation Using Cross-Attentive Feature Fusion

使用交叉注意力特征融合的半真音频深度伪造检测与定位

S. Sutharya, Remya K. Sasi

发表机构 * Department of Computer Science（计算机科学系）

AI总结提出CAFNet模型，通过三元分类和边界回归联合检测部分伪造音频，在MLADDC数据集上达到92.71%准确率和0.075s定位误差。

Comments 13 pages, 5 figures, 11 tables

详情

AI中文摘要

音频深度伪造检测通常作为二分类问题研究，但部分篡改语音（其中一段短合成片段被拼接进真实语音）构成了更困难且更现实的威胁。检测此类半真音频不仅需要区分真实和完全伪造语音，还需要定位篡改发生的位置。我们提出了CAFNet，一个576k参数的架构，联合处理这两个任务：它在单次前向传播中执行三元分类（真实、完全伪造或半真）并回归合成区域的时间边界。CAFNet通过并行深度可分离卷积分支和交叉注意力融合梅尔频率倒谱系数（MFCC）、线性频率倒谱系数（LFCC）和色度短时傅里叶变换（Chroma-STFT）特征，随后使用双向长短期记忆（BiLSTM）回归头进行边界预测。在组合的多语言音频深度伪造检测语料库（MLADDC）T2+T3测试集上，CAFNet达到92.71%的准确率和0.9910的宏观曲线下面积（AUC），边界定位平均绝对误差（MAE）为0.075秒，中位误差为0.052秒。在二分类检测中，它达到96.76%的准确率和3.20%的等错误率（EER），以超过500倍的参数减少优于微调的XLS-R 300M（78.31%）和AST 87M（93.03%）。跨数据集研究进一步表明，即使在降低骨干学习率的情况下，标准微调也会破坏跨域表示。

英文摘要

Audio deepfake detection is well-studied as a binary problem, but partially manipulated speech, where a short synthesised segment is spliced into an otherwise genuine utterance, poses a harder and more realistic threat. Detecting such half-truth audio requires not only distinguishing it from real and fully fake speech, but also localising where the manipulation occurs. We present CAFNet, a 576k-parameter architecture that addresses both tasks jointly: it performs ternary classification (real, fully-fake, or half-truth) and regresses the temporal boundaries of the synthesised region in a single forward pass. CAFNet fuses Mel-Frequency Cepstral Coefficient (MFCC), Linear-Frequency Cepstral Coefficient (LFCC), and Chroma Short-Time Fourier Transform (Chroma-STFT) features through parallel depthwise-separable convolution branches with cross-attention, followed by a Bidirectional Long Short-Term Memory (BiLSTM) regression head for boundary prediction. On the combined Multi-Lingual Audio Deepfake Detection Corpus (MLADDC) T2+T3 test set, CAFNet achieves 92.71% accuracy and macro Area Under the Curve (AUC) of 0.9910, with boundary localisation Mean Absolute Error (MAE) of 0.075s and a median error of 0.052s. On binary detection, it achieves 96.76% accuracy and 3.20% Equal Error Rate (EER), outperforming fine-tuned XLS-R 300M (78.31%) and AST 87M (93.03%) at over 500 times fewer parameters. A cross-dataset study further shows that standard fine-tuning collapses cross-domain representations even under reduced backbone learning rates.

URL PDF HTML ☆

赞 0 踩 0

2605.29525 2026-05-29 cs.LG 版本更新

Learning to Perturb Hidden Representations for Generalizable Deep Learning

学习扰动隐藏表示以实现可泛化深度学习

Hua Li

发表机构 * Henan University（河南大学）

AI总结提出学习扰动激活（LPA）方法，通过自适应地扰动隐藏层激活并利用PGD学习类别级扰动，提升模型泛化能力，在平衡分类、长尾分类和域泛化任务上优于现有方法。

详情

AI中文摘要

深度神经网络通过级联表示处理数据：输入特征、隐藏激活、logits和损失。虽然输入、logit和标签层面的扰动已被系统研究，但构成网络大部分计算的中间隐藏激活尚未得到统一的扰动分析。本文建立了隐藏激活扰动的统一框架，揭示了Dropout、Manifold Mixup、对抗特征扰动及相关方法都施加了特定形式的激活扰动，但采用类别无关或随机策略。我们推测扩张性扰动（增加激活范数）起到正增强作用，而收缩性扰动（减少激活范数）起到负增强作用，并且扰动层决定了效果类似于输入级增强（浅层）还是logit级操作（深层）。我们提出学习扰动激活（LPA），该方法在选定的隐藏层自适应地扰动激活，并通过PGD学习类别级扰动。我们进一步提供了将激活扰动与平坦最小值和通过层的扰动放大联系起来的理论分析。在平衡分类、长尾分类和域泛化上的实验表明，LPA一致优于现有方法，并为logit扰动方法（如LPL）提供互补优势。

英文摘要

Deep neural networks process data through a cascade of representations: input features, hidden activations, logits, and loss. While perturbations at the input, logit, and label levels have been systematically studied, the intermediate hidden activations, which constitute the bulk of the network's computation, have received no unified perturbation analysis. In this paper, we establish a unified framework for hidden activation perturbation, revealing that Dropout, Manifold Mixup, adversarial feature perturbation, and related methods all impose specific forms of activation perturbation but with class-agnostic or random strategies. We conjecture that expansive perturbation (increasing activation norm) acts as positive augmentation, while contractive perturbation (decreasing activation norm) acts as negative augmentation, and that the perturbation layer determines whether the effect resembles input-level augmentation (shallow layers) or logit-level manipulation (deep layers). We propose Learning to Perturb Activations (LPA), which adaptively perturbs activations at a selected hidden layer with class-level perturbations learned via PGD. We further provide theoretical analysis connecting activation perturbation to flat minima and perturbation amplification through layers. Experiments on balanced classification, long-tail classification, and domain generalization demonstrate that LPA consistently outperforms existing methods and provides complementary benefits to logit perturbation methods such as LPL.

URL PDF HTML ☆

赞 0 踩 0

2605.29523 2026-05-29 cs.LG 版本更新

面向持续监督微调的在策略重放

Yan Chen, Taojie Zhu, Meng Zhang, Xin Chen, Jiaqi Huang, Dongyang Xu, Yizhi Wang

发表机构 * Tsinghua University（清华大学）； Alibaba Group（阿里巴巴集团）

AI总结提出在策略重放（OPR）方法，通过重放模型自身生成的高质量响应来缓解持续监督微调中的灾难性遗忘，在多个大语言模型上显著降低遗忘。

详情

AI中文摘要

持续监督微调（SFT）是将大型语言模型（LLMs）适配到连续下游任务的事实标准，但它会遭受早期能力的灾难性遗忘。最近的研究表明，在策略信号——在模型自身输出上训练——比离策略监督更可靠地减少遗忘。现有的在策略方法通过新的训练目标（例如，带有教师副本的自蒸馏损失）路由该信号，从而继承了额外的前向传播、调度敏感性和来自教师的风格漂移。我们改为通过训练数据源路由在策略信号。我们的方法，在策略重放（OPR），在少量历史提示上展开最新检查点，通过任务奖励过滤生成结果，并将幸存（提示，模型响应）对作为普通SFT示例重放。没有教师，没有辅助损失，也没有即时蒸馏。在三个7-8B指令微调骨干（Qwen2.5-7B-Instruct、Qwen3-8B、Llama3.1-8B-Instruct）上，在TRACE持续学习基准测试中，OPR一致地减少了遗忘；在最尖锐的压力测试（Qwen2.5-7B-Instruct，顺序SFT BWT -13.93）中，OPR在10%重放预算下将BWT提升至-0.65，在1%预算下提升至-2.29——与调优的普通重放基线相比，|BWT|减少了46%，在所有三个骨干上观察到42-46%的减少。我们给出了一个KL收缩解释，将OPR和先前的在策略蒸馏方法置于单一轴上，并提出了一个反直觉的发现，解释了为什么普通重放已经是一个强基线：低分重放一致地比普通重放更差，表明OPR中的有效成分是在策略分布，而不是单独的响应质量。我们的代码可在https://github.com/Yancey2024/OnPolicyReplay获取。

英文摘要

Continual supervised fine-tuning (SFT) is the de facto recipe for adapting large language models (LLMs) to a stream of downstream tasks, but it suffers from catastrophic forgetting of earlier capabilities. Recent work shows that on-policy signals -- training on the model's own outputs -- reduce forgetting more reliably than off-policy supervision. Existing on-policy methods route this signal through a new training objective (e.g., self-distillation losses with a teacher copy), inheriting an extra forward pass, schedule sensitivity, and stylistic drift from the teacher.We instead route the on-policy signal through the training data source. Our method, On-Policy Replay (OPR), rolls out the most recent checkpoint on a small budget of historical prompts, filters the generations by a task reward, and replays the surviving (prompt, model response) pairs as ordinary SFT examples. There is no teacher, no auxiliary loss, and no on-the-fly distillation. Across three 7--8B instruction-tuned backbones (Qwen2.5-7B-Instruct, Qwen3-8B, Llama3.1-8B-Instruct) on the TRACE continual-learning benchmark, OPR consistently reduces forgetting; on the sharpest stress test (Qwen2.5-7B-Instruct, Sequential SFT BWT -13.93), OPR lifts BWT to -0.65 at a 10% replay budget and to -2.29 at a 1% budget -- a 46% reduction in |BWT| over a tuned Vanilla Replay baseline, with 42--46% reductions observed across all three backbones. We give a KL-shrinkage interpretation that places OPR and prior on-policy distillation methods on a single axis, and we present a counterintuitive finding that explains why Vanilla Replay is already a strong baseline: low-score replay is uniformly worse than Vanilla Replay, demonstrating that the active ingredient in OPR is the on-policy distribution, not the response quality alone.Our code is available at https://github.com/Yancey2024/OnPolicyReplay.

URL PDF HTML ☆

赞 0 踩 0

2605.29494 2026-05-29 cs.LG 版本更新

非共轭因子图的闭式变分推断组合

Mykola Lukashchuk, Kyrylo Yemets, Wouter M. Kouw, Dmitry Bagaev, İsmail Şenöz, Jeff Beck, Bert de Vries

发表机构 * Eindhoven University of Technology, the Netherlands（埃因霍温理工大学，荷兰）； Lviv Polytechnic National University, Lviv, Ukraine（利沃夫国立理工大学，利沃夫，乌克兰）； Lazy Dynamics, Utrecht, the Netherlands（Lazy Dynamics，乌得勒支，荷兰）

AI总结提出五种因子图原语，证明任意组合均支持闭式变分消息传递，并通过堆叠路由层实现通用函数逼近，应用于时间序列预测。

详情

AI中文摘要

将概率构建块堆叠成更深层次的架构通常会破坏闭式推断。我们证明闭式推断是可以保持的。我们识别了五种因子图原语：双线性因子、指数链接、Gamma先验、高斯似然和等式节点，并证明任何由它们组成的模型都允许闭式变分消息传递。这种构造之所以有效，是因为每个原语都保留了一小部分消息族：在平均场分解下，高斯变量上的消息保持高斯分布，精度变量上的消息保持Gamma分布，而唯一的非共轭接口——指数链接——通过高斯矩生成函数和Gamma族的充分统计量保持可处理性。我们展示了从静态集成到输入依赖门控再到分裂分支路由的递增深度组合，并表明堆叠路由层编码任意决策树，建立了具有闭式推断的通用函数逼近。应用于集成时间序列预测时，该框架产生了一个贝叶斯专家混合模型，其中门控函数是推断而非学习得到的，在五个基准数据集上提供了对专家选择的校准不确定性。

英文摘要

Stacking probabilistic building blocks into deeper architectures typically breaks closed-form inference. We show that closed-form inference can be preserved. We identify five factor-graph primitives: a bilinear factor, an exponential link, a Gamma prior, a Gaussian likelihood, and an equality node, and prove that any model composed from them admits closed-form variational message passing. The construction works because each primitive preserves a small set of message families: under mean-field factorization, messages on Gaussian variables remain Gaussian and messages on precision variables remain Gamma, while the only non-conjugate interface, the exponential link, remains tractable through the Gaussian moment-generating function and the sufficient statistics of the Gamma family. We demonstrate composition at increasing depth, from static ensembles through input-dependent gating to split-branch routing, and show that stacking routing layers encodes arbitrary decision trees, establishing universal function approximation with closed-form inference. Applied to ensemble time-series forecasting, the framework yields a Bayesian mixture of experts in which gating functions are inferred rather than learned, providing calibrated uncertainty over expert selection across five benchmark datasets.

URL PDF HTML ☆

赞 0 踩 0

2605.29464 2026-05-29 stat.ML cs.LG 版本更新

Deep Optimal Individualized Treatment Rules for Bivariate Survival Outcomes via Adaptive Prediction-Powered Learning

双变量生存结局的深度最优个体化治疗规则：基于自适应预测驱动学习

Kun Ren, Yifan Cui, Wen Su

发表机构 * Department of Biostatistics, City University of Hong Kong（香港城市大学生物统计学系）； Center for Data Science, Zhejiang University（浙江大学数据科学中心）

AI总结针对随机试验中的双变量生存结局，提出一种基于深度神经网络的自适应预测驱动方法，通过随机策略建模治疗规则并耦合边际加速失效时间模型，以最大化联合生存概率。

2605.29459 2026-05-29 cs.CL cs.LG 版本更新

Kronecker Embeddings: Byte-Level Structured Token Representations for Parameter-Efficient Language Models

Kronecker嵌入：用于参数高效语言模型的字节级结构化词元表示

Rohan Shravan

发表机构 * The School of AI（人工智能学院）

AI总结提出Kronecker嵌入，通过字节级字符-位置确定性分解替代标准嵌入表，消除91-94%输入侧可训练参数，在多个实验中实现更低验证损失、更强拼写鲁棒性和运行时效率。

Comments 28 pages, 16 tables. Reference implementation: https://github.com/theschoolofai/kronecker-embeddings

详情

AI中文摘要

大型语言模型通过一个形状为|V| x d_model的可学习嵌入表路由每个输入，在前沿规模下消耗数亿到数十亿的可训练参数。我们引入Kronecker嵌入，一种确定性的字节级字符-位置分解，用固定编码器和单个可学习投影替换该表，与标准BPE分词器兼容，在前沿规模下消除91-94%的输入侧可训练参数。我们提供五项贡献。第一，跨六个LM（135M-671B参数）的模型探针显示，训练后的输入嵌入将探针词的印刷变体聚类程度远高于形态学相关词；Kronecker在嵌入层避免了这种聚类。第二，在FineWeb-Edu上对nanoGPT GPT-2 124M进行2.5B词元的三种子受控比较显示，Kronecker达到比BPE绑定基线低2.5±0.2%的验证损失（差距0.083±0.007 nats，约9%更低的困惑度），达到BPE收敛损失所需的步数减少约1.43倍。第三，在110个干净/拼写错误对上的拼写鲁棒性探针显示，Kronecker在55.5%的对上保持top-1预测，而BPE为47.3%（+8.2个百分点），并将KL降低7.6%，在11个类别中赢得或平局10个；生成探针显示Kronecker在生成中回显字节新颖字符串和拼写错误，而BPE则遗忘它们。第四，BPE嵌入范数在训练期间漂移，而Kronecker投影范数保持在1.0附近，与稳定的表示目标一致。第五，一种即时运行时变体从4.5 MB的字节缓冲区重建嵌入，而不是从词汇量为131,072的2.15 GB表中重建，步长时间开销为0.01-0.24%。字节级局部性存在权衡：字节相似但语义距离远的对（compute/commute, nation/notion）聚类在一起，将消歧转移到早期注意力层。

英文摘要

Large language models route every input through a learned embedding table of shape |V| x d_model, consuming hundreds of millions to billions of trainable parameters at frontier scale. We introduce Kronecker Embeddings, a deterministic byte-level character-position factorization that replaces this table with a fixed encoder and a single learned projection, compatible with standard BPE tokenizers, eliminating 91--94% of input-side trainable parameters at frontier scale. We provide five contributions. First, a cross-model probe across six LMs (135M-671B parameters) shows trained input embeddings cluster typographic variants of the probe word far more than morphological relatives; Kronecker escapes this clustering at the embedding layer. Second, a controlled three-seed comparison on nanoGPT GPT-2 124M over 2.5B tokens of FineWeb-Edu shows Kronecker reaching 2.5 +- 0.2% lower validation loss than the BPE-tied baseline (gap 0.083 +- 0.007 nats, ~9% lower perplexity), needing ~1.43x fewer steps to reach BPE's converged loss. Third, a spelling-robustness probe over 110 clean/typo pairs shows Kronecker preserves the top-1 prediction on 55.5% of pairs vs. 47.3% for BPE (+8.2 pp) and lowers KL by 7.6%, winning or tying in 10 of 11 categories; a generation probe shows Kronecker echoes byte-novel strings and typos through generation where BPE forgets them. Fourth, BPE embedding norm drifts during training while Kronecker projection norm stays near 1.0, consistent with a stable representational target. Fifth, an on-the-fly runtime variant reconstructs embeddings from a 4.5 MB byte buffer rather than a 2.15 GB table at vocabulary 131,072, with 0.01--0.24% step-time overhead. Byte-level locality has a tradeoff: byte-similar but semantically distant pairs (compute/commute, nation/notion) cluster together, shifting disambiguation to early attention layers.

URL PDF HTML ☆

赞 0 踩 0

2605.29454 2026-05-29 cs.LG 版本更新

A Full-Pipeline Framework for Evaluating Membership Inference Attacks in Machine Learning

用于评估机器学习中成员推断攻击的全流程框架

Ding Chen, Xinwen Cheng, Xuyang Zhong, Xinping Chen, Xiaolin Huang, Chen Liu

发表机构 * City University of Hong Kong（香港城市大学）； Shanghai Jiao Tong University（上海交通大学）

AI总结提出一个涵盖数据、架构、算法和后训练模块的全流程评估框架，系统分析不同上下文对成员推断攻击效果的影响，并通过标准化威胁模型和互补指标提供实用指南。

详情

AI中文摘要

虽然成员推断攻击（MIAs）是识别训练数据的主流方法，但其应用已扩展到隐私审计和机器遗忘。然而，该领域缺乏一个系统性的框架来评估不同上下文如何影响MIA的效果。没有这样的特征描述，实践者可能会部署在基准测试中表现良好但在面对特定真实世界数据集的细微差别时变得统计上无关的算法。为了弥合这一差距并提供可操作的见解，我们引入了一个全面的评估框架，该框架系统地描述了整个机器学习流程（包括数据、架构、算法和后训练模块）中的隐私风险。我们的框架旨在固有地捕捉多样化的操作上下文，严格评估了在广泛训练配置下的最先进MIA。为了考虑真实世界部署中不同的误分类成本，我们采用了三个互补指标：对称成本下的平衡准确率，以及低FPR下的TPR（或低FNR下的TNR）用于严格惩罚误报或漏检的非对称场景。此外，认识到现有MIA假设不同的对手能力，我们形式化了两种标准化的威胁模型，并将这些攻击调整为相应的变体，以确保公平的基准测试。大量的实证评估表明，特定MIA方法的效果高度依赖于假设的威胁模型和选择的评估指标。最终，我们将这些发现提炼为可操作的指南，并提供一个即用的审计工具包，使实践者能够进行更好的隐私评估。

英文摘要

While Membership Inference Attacks (MIAs) are the prevailing method for identifying training data, their application has expanded into privacy auditing and machine unlearning. Nevertheless, the field lacks a systematic framework for evaluating how different contexts affect MIA efficacy. Without such a characterization, practitioners risk deploying algorithms that perform well on benchmarks but become statistically irrelevant when faced with the nuances of specific, real-world datasets. To bridge this gap and provide actionable insights, we introduce a comprehensive evaluation framework that systematically characterizes privacy risks across the entire machine learning pipeline, spanning data, architectures, algorithms, and post-training modules. Designed to inherently capture diverse operational contexts, our framework rigorously evaluates state-of-the-art MIAs across a broad spectrum of training configurations. To account for varying misclassification costs in real-world deployments, we employ three complementary metrics: Balanced Accuracy for symmetric costs, alongside TPR at low FPR (or TNR at low FNR) for asymmetric scenarios where false alarms or missed detections are strictly penalized. Furthermore, recognizing that existing MIAs assume divergent adversary capabilities, we formalize two standardized threat models and adapt these attacks into corresponding variants to ensure an equitable benchmark. Extensive empirical evaluations demonstrate that the efficacy of specific MIA methodologies is highly sensitive to the assumed threat models and chosen evaluation metrics. Ultimately, we distill these findings into actionable guidelines and provide a ready-to-use auditing toolkit, empowering practitioners to conduct better privacy assessments.

URL PDF HTML ☆

赞 0 踩 0

2605.29453 2026-05-29 cs.LG cs.AI 版本更新

使用共轭梯度法构建理想观察者的高效通道

Weimin Zhou

发表机构 * University of Arizona, Wyant College of Optical Sciences（亚利桑那大学光学科学学院）； University of Arizona, Department of Radiology & Imaging Sciences（亚利桑那大学放射科与成像科学系）

AI总结针对医学成像系统图像质量的任务评估，提出基于共轭梯度（CG）的方法构建高效通道，以近似贝叶斯理想观察者（IO）和霍特林观察者（HO）的性能。

Comments Submitted to the Journal of Medical Imaging (JMI) Special Issue Honoring Dr. Harrison H. Barrett

2605.29412 2026-05-29 eess.SY cs.LG cs.SY 版本更新

Real-Time Retargeting Using Controllability Boundary for Chandrayaan-3 Lunar Landing

基于可控边界的月船三号月球着陆实时重定向

Suraj Kumar, Debjyoti Chakrabarti, Aditya Rallapalli, Bharat Kumar GVP, Ashok Kumar Kakula

发表机构 * Controls and Digital Area, U R Rao Satellite Center, Indian Space Research Organization（控制与数字部门，U R Rao卫星中心，印度空间研究组织）

AI总结针对月船三号月球着陆任务，提出一种利用可控边界凸表示实现实时重定向的制导策略，通过数据驱动框架首次在运行任务中验证其有效性。

Comments 8 pages, 6 figures, Accepted for publication in American Control Conference 2026

2605.29411 2026-05-29 cs.LG cs.AI stat.ME stat.ML 版本更新

The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction

马尔可夫边界在表格预测中的好、坏与丑

Shu Wan, Abhinav Gorantla, Huan Liu, K. Selçuk Candan

发表机构 * Arizona State University（亚利桑那州立大学）

AI总结研究马尔可夫边界在表格预测中的实际效用，发现理论上最优的边界在实践中有条件地提升预测性能，但因果发现方法难以实现其潜力。

Comments 11 pages, 9 figures, 2 tables. Preprint

详情

AI中文摘要

在标准图形假设下，目标变量的马尔可夫边界是使所有其他特征冗余的最小特征集。一旦观察到边界，目标变量与表格的其余部分条件独立。这对于表格预测来说是一个诱人的对象，因为它恰好指出了模型所需的列。然而，现代回归器仍然在完整特征集上训练。我们询问马尔可夫边界是否在SCM3K（一个包含3450个任务的合成SCM基准，特征数量从40到1000，涵盖六个SCM家族）上对预测真正有用，并使用六个回归器进行评估。答案比理论所暗示的要微妙得多。将回归器限制在oracle边界上通常会显著改善预测，并且随着特征空间变得更大更稀疏，改善程度增加。但是，通过因果发现恢复边界并在恢复的掩码上训练的自然流程并不奏效。现有的估计器在达到边界最有帮助的区域之前就耗尽了计算预算，即使它们运行，也很少能击败完整特征集。我们将此归因于三个原因。发现优化的是结构恢复而非预测。假阴性和假阳性具有高度不对称的预测成本。精确边界只是众多击败所有特征的特征集之一。然后，我们阐述了这些事实对于预测对齐的特征选择以及学习使用因果结构的表格模型的意义。

英文摘要

Under standard graphical assumptions, the Markov boundary of a target variable is the smallest set of features that renders every other feature redundant. Once the boundary is observed, the target is conditionally independent of the rest of the table. This is a tempting object for tabular prediction, since it names exactly the columns a model should need. Yet modern regressors are still trained on the full feature set. We ask whether the Markov boundary is genuinely useful for prediction on SCM3K, a 3,450-task synthetic SCM benchmark with feature counts from 40 to 1000 and six SCM families, evaluated with six regressors. The answer is more nuanced than the theory suggests. Restricting a regressor to the oracle boundary often improves prediction substantially, and the improvement grows as the feature space becomes larger and sparser. But the natural pipeline of recovering the boundary with causal discovery and training on the recovered mask does not deliver. Existing estimators exhaust the compute budget before reaching the regime where the boundary helps most, and even where they run they rarely beat the full feature set. We trace this to three causes. Discovery optimizes structural recovery rather than prediction. False negatives and false positives carry sharply asymmetric predictive cost. The exact boundary is only one of many feature sets that beat all features. We then develop what these facts imply for prediction-aligned feature selection and for tabular models that learn to use causal structure.

URL PDF HTML ☆

赞 0 踩 0

2605.29405 2026-05-29 cs.LG 版本更新

Information-Directed Offline-to-Online Reinforcement Learning

信息导向的离线到在线强化学习

Keru Chen

发表机构 * School of Electrical, Computer and Energy Engineering, Arizona State University（电气、计算机与能源工程学院，亚利桑那州立大学）

AI总结本文提出信息导向采样（IDS）方法，通过条件互信息量化离线数据后的残余不确定性，在离线到在线强化学习中平衡即时遗憾与信息增益，并证明其贝叶斯遗憾界及在偏置残余不确定性场景下的优势。

详情

AI中文摘要

基于离线数据集的决策通常从固定离线数据中预热策略或评分模型，然后通过有限的在线交互进行优化。离线数据减少了不确定性，但并未消除探索需求；它改变了仍需探索的内容。我们通过学习目标 $χ$ 与在线轨迹在给定离线数据集条件下的条件互信息 $I(χ;τ_{1:T}\\mid\\mathcal{D}_N)$ 来形式化这种残余不确定性。这一观点自然地引出了信息导向采样（IDS），一个由参数 $η\\\ge 0$ 参数化的家族，通过权衡即时遗憾与信息增益来选择动作。我们通过比率证书证明了 IDS 的通用离线到在线贝叶斯遗憾界：任何由参考汤普森采样策略在同一随机策略类上满足的信息比率界都会被 IDS 继承。在已知动力学的贝叶斯线性奖励模型中，条件互信息具有对数行列式形式，且普通 IDS（$η=0$）满足 $\\widetilde O\\\!\\\left(Hd\\\min\\\left\\\{\\\sqrt T,\\\,T\\\sqrt{C^\\\dagger_{β,\\\mathrm{IDS}_0}(N,T)/N}\\right\\\}\\right)$，其中覆盖系数与普通 IDS 自身诱导的访问分布相关。我们还识别出一个预热阶段，其中存在一个主导但信息丰富的探测动作，普通 IDS 会选择该探测动作而汤普森采样从不选择，从而产生常数因子的贝叶斯遗憾分离。受控的赌博机实验和 D4RL 离线到在线强化学习实验验证了这一机制：当离线数据信息丰富但留下偏置或低概率的残余不确定性，且目标在线动作可以解决这些不确定性时，IDS 最为有益，这种情形在离线强化学习、离线黑箱优化和贝叶斯优化中普遍存在。

英文摘要

Decision-making from offline datasets typically warm-starts a policy or score model from fixed offline data and then refines it with limited online interaction. Offline data reduces uncertainty, but it does not remove the need for exploration; it changes what remains to be explored. We formalise this residual uncertainty by the conditional mutual information $I(χ;τ_{1:T}\mid\mathcal{D}_N)$ between a learning target $χ$ and the online trajectories after conditioning on the offline dataset. This view leads naturally to information-directed sampling (IDS), a family parameterised by $η\ge 0$ that selects actions by trading off instantaneous regret against information gain. We prove a generic offline-to-online Bayesian regret bound for IDS through a ratio certificate: any information-ratio bound satisfied by a reference Thompson-sampling policy over the same randomised policy class is inherited by IDS. In a known-dynamics Bayesian linear-reward model, the conditional mutual information has a log-determinant form, and vanilla IDS ($η=0$) satisfies $\widetilde O\!\left(Hd\min\left\{\sqrt T,\,T\sqrt{C^\dagger_{β,\mathrm{IDS}_0}(N,T)/N}\right\}\right),$ where the coverage coefficient is tied to the visitation distribution induced by vanilla IDS itself. We also identify a warm-start regime with a dominated but informative probe in which vanilla IDS selects the probe while Thompson sampling never does, giving a constant-factor Bayesian regret separation. Controlled bandit experiments and D4RL offline-to-online RL experiments validate this mechanism: IDS is most beneficial when offline data is informative but leaves biased or low-probability residual uncertainty that targeted online actions can resolve, a regime shared by offline RL, offline black-box optimization, and Bayesian optimization.

URL PDF HTML ☆

赞 0 踩 0

2605.29401 2026-05-29 cs.LG 版本更新

Rethinking Post-Training Recipes for Multimodal Time-Series Forecasting

重新思考多模态时间序列预测的后训练方法

Haoxin Liu, Yichen Zhou, Rajat Sen, B. Aditya Prakash, Abhimanyu Das

发表机构 * Georgia Institute of Technology（佐治亚理工学院）； Google Research（谷歌研究）

AI总结提出PostTime后训练方法，结合监督微调和基于可验证奖励的强化学习，利用大语言模型根据多模态上下文修正数值时间序列基础模型的预测，显著提升多模态时间序列预测性能。

详情

AI中文摘要

时间序列基础模型（TSFMs）在使用数值数据进行零样本单模态预测方面表现出色，但与LLMs不同，它们无法处理通常影响现实世界轨迹的多模态、非数值上下文。在这项工作中，我们弥合了这一差距，并主张一种多模态时间序列预测方法，该方法对LLMs进行后训练，使其作为上下文引导的修正器，作用于强大的数值TSFM先验。我们引入了PostTime，一种结合监督微调（SFT）和基于可验证奖励的强化学习（RLVR）的后训练方案，以及一种生成预测修正的自动推理轨迹的方法。PostTime教会LLM生成上下文条件的预测干预——基于多模态上下文决定修正、保留或忽略TSFM先验。我们在TimesX多模态预测基准上，使用Gemma-3-4B LLM和TimesFM-2.5 TSFM评估了该方法，结果表明它显著优于单独的TSFM、仅LLM的基线以及现有的多模态预测方法。

英文摘要

Time-Series Foundation Models (TSFMs) excel at zero-shot unimodal forecasting using numerical data, but unlike LLMs they cannot consume multimodal, non-numerical context that often shape real-world trajectories. In this work, we bridge this gap and argue for a multimodal time-series forecasting approach that post-trains LLMs to act as context-guided revisors over strong numerical TSFM priors. We introduce PostTime, a post-training recipe combining Supervised Fine-Tuning (SFT) and Reinforcement Learning with Verifiable Rewards (RLVR), along with a methodology to generate automated reasoning traces for forecast revisions. PostTime teaches an LLM to generate context-conditioned forecast interventions -- decisions to revise, preserve, or ignore the TSFM prior based on the multimodal context. We evaluate this approach on the TimesX multimodal forecasting benchmark using a Gemma-3-4B LLM and TimesFM-2.5 TSFM, and show that it significantly outperforms standalone TSFMs, LLM-only baselines, and existing multimodal forecasting approaches.

URL PDF HTML ☆

赞 0 踩 0

2605.29398 2026-05-29 cs.LG cs.AI 版本更新

GDSD: Reinforcement Learning as Guided Denoiser Self-Distillation for Diffusion Language Models

GDSD：强化学习作为扩散语言模型的引导去噪器自蒸馏

Xiaohang Tang, Keyue Jiang, Che Liu, Qifang Zhao, Xiaoxiao Xu, Sangwoong Yoon, Ilija Bogunovic

发表机构 * UCL Dept. of Statistical Science（伦敦大学学院统计科学系）； UCL Centre for AI（伦敦大学学院人工智能中心）； Alibaba Group（阿里巴巴集团）； Dept. of EEE（电子工程系）； Imperial College London（伦敦帝国理工学院）； UNIST（全南大学）； University of Basel（巴塞尔大学）

AI总结提出引导去噪器自蒸馏（GDSD）方法，通过从逆KL正则化强化学习的闭式最优解中导出的优势引导自教师直接蒸馏扩散语言模型的去噪器，避免了ELBO似然代理带来的训练-推理不匹配偏差，在规划、数学和代码基准上显著优于现有方法。

Comments Preprint

详情

AI中文摘要

强化学习（RL）可用于改进扩散大语言模型（dLLMs）的策略（去噪器），但受到策略似然难以处理的阻碍。一类主流且高效的方法将标准RL中的似然替换为其证据下界（ELBO），该下界从随机掩码序列中估计。尽管与预训练高度一致，但这些方法通过使用ELBO作为似然代理引入了训练-推理不匹配（TIM）偏差，可能降低性能。在这项工作中，我们提出了引导去噪器自蒸馏（GDSD），直接从优势引导的自教师中蒸馏dLLMs的去噪器，该自教师源自逆KL正则化RL的闭式最优解。GDSD通过无归一化目标将dLLM的去噪器logits与教师匹配，将RL简化为无似然自蒸馏，从而绕过了TIM偏差。最近的基于ELBO的方法表现为应用不同蒸馏散度的实例，但存在GDSD避免的可诊断病态。在LLaDA-8B和Dream-7B的规划、数学和代码基准上，GDSD以更稳定的训练奖励动态持续优于先前最先进的基于ELBO的方法，测试准确率提升高达+19.6%。这些结果表明，直接的去噪器自蒸馏，无需依赖ELBO似然代理，可以为dLLMs提供更稳定有效的RL过程。代码可在https://github.com/GaryBall/GDSD获取。

英文摘要

Reinforcement learning (RL) can be used to improve the policy (denoiser) of diffusion large language models (dLLMs), while being hindered by the intractability of the policy likelihood. A dominant and efficient family of methods replaces the likelihood in standard RL with its evidence lower bound (ELBO), estimated from randomly masked sequences. Despite being well aligned with pre-training, these approaches introduce bias through training--inference mismatch by using the ELBO as a likelihood surrogate, which can degrade performance. In this work, we propose Guided Denoiser Self-Distillation (GDSD) to directly distill the denoiser of dLLMs from an advantage-guided self-teacher, derived from the closed-form optimum of reverse-KL regularized RL. GDSD matches the dLLM's denoiser logits to the teacher's via a normalization-free objective, which reduces RL to likelihood-free self-distillation and thus bypasses the TIM biases. Recent ELBO-based methods emerge as instances of applying different distillation divergences, but with diagnosable pathologies that GDSD avoids. On planning, math, and coding benchmarks with LLaDA-8B and Dream-7B, GDSD consistently outperforms prior state-of-the-art ELBO-based methods with a more stable training reward dynamics, achieving test-accuracy improvements of up to $+19.6\%$. These results suggest that direct denoiser self-distillation, without relying on an ELBO likelihood surrogate, can provide a more stable and effective RL procedure for dLLMs. Code is available at https://github.com/GaryBall/GDSD.

URL PDF HTML ☆

赞 0 踩 0

2605.29387 2026-05-29 cs.LG cs.AI stat.ML 版本更新

基于混合整数线性规划的共聚物推断的混合向量模型

Jianshen Zhu, Raveena Rai, Taiyo Sohkawa, Naveed Ahmed Azam, Kazuya Haraguchi, Liang Zhao, Tatsuya Akutsu

发表机构 * Department of Information Science and Technology, Tokyo University of Science（东京科学大学信息科学与技术系）； Discrete Mathematics and Computational Intelligence Laboratory, Department of Mathematics, Quaid-i-Azam University（夸齐-阿扎姆大学数学系离散数学与计算智能实验室）； Graduate School of Informatics, Kyoto University（京都大学信息研究生院）； Graduate School of Advanced Integrated Studies in Human Survivability (Shishu-Kan), Kyoto University（京都大学人类生存高级综合研究研究生院（Shishu-Kan））； Bioinformatics Center, Institute for Chemical Research, Kyoto University（京都大学化学研究所生物信息中心）

AI总结提出混合向量模型，通过混合整数线性规划实现共聚物的逆设计，在多个物化数据集上取得高预测精度并保持可解性。

详情

AI中文摘要

最近开发了一种新颖的两阶段分子推断框架mol-infer，通过两层模型下的混合整数线性规划（MILP），在给定学习预测函数和结构约束的条件下，以最优性和精确性推断具有规定抽象结构和期望性质值的化学图。在本研究中，我们通过引入一种称为混合向量（MV）模型的简单特征表示，将该框架扩展到共聚物。在所提出的模型中，共聚物特征向量表示为MILT可处理单体描述符的凸组合，加权系数为组成单体的混合比例。这种表示不需要明确的序列类别信息，因此自然兼容基于MILP的逆设计。在该模型下，我们使用人工神经网络、简化二次多元线性回归和随机森林为多个共聚物性质数据集构建预测函数。所提出的表示在多个物理化学性质数据集上实现了实际有用的预测性能；特别地，十个数据集中有九个的最佳测试R²分数超过0.7，六个数据集超过0.9。我们还制定了在MV表示下具有规定混合比例的多单体逆设计问题，并表明即使在三单体设置下，生成的MILP实例仍然可解。最后，我们通过重新评估推断的候选物并将重新计算的性质值与学习模型预测的值进行比较，进行外部一致性检查。总体而言，所提出的框架为在两层模型下实现共聚物的模型级精确逆设计提供了可处理的第一步。

英文摘要

A novel two-phase molecule inference framework, mol-infer, has recently been developed to infer chemical graphs with prescribed abstract structures and desired property values through mixed integer linear programming (MILP) under the two-layered model, with guaranteed optimality and exactness relative to the given learned prediction function and structural constraints. In this study, we extend this framework to copolymers by introducing a simple feature representation, called the mixing vector (MV) model. In the proposed model, a copolymer feature vector is represented as a convex combination of MILP-tractable monomer descriptors weighted by the mixing ratio of the constituent monomers. This representation does not require explicit sequence-class information and is therefore naturally compatible with MILP-based inverse design. Under this model, we construct prediction functions for several copolymer property datasets using artificial neural networks, reduced quadratic multiple linear regression, and random forests. The proposed representation achieves practically useful predictive performance across multiple physicochemical property datasets; in particular, the best test R^2 score exceeds 0.7 for nine of the ten datasets and exceeds 0.9 for six datasets. We also formulate a multi-monomer inverse-design problem under the MV representation with a prescribed mixing ratio and show that the resulting MILP instances remain tractable, even for three-monomer settings. Finally, we perform an external consistency check by re-evaluating the inferred candidates and comparing the re-computed property values with those predicted by the learned model. Overall, the proposed framework gives a tractable first step toward model-level exact inverse design of copolymers under the two-layered model.

URL PDF HTML ☆

赞 0 踩 0

2605.29327 2026-05-29 cs.CL cs.LG 版本更新

Reasoning-preserved Efficient Distillation of Large Language Models via Activation-aware Initialization

保留推理能力的大语言模型高效蒸馏：基于激活感知初始化

Junlin He, Yihong Tang, Tong Nie, Guilong Li, Binyu Yang, Jinxiao Du, Lijun Sun, Wei Ma

发表机构 * The Hong Kong Polytechnic University, Hong Kong SAR, China（香港理工大学）； McGill University, Montreal, QC, Canada（麦吉尔大学）

AI总结针对高效蒸馏导致的多步推理能力严重下降（推理崩溃），提出RED方法，通过激活感知初始化投影矩阵为通道选择矩阵，理论缓解有效秩崩溃，恢复推理能力并保持高效训练与通用性能。

详情

AI中文摘要

高效蒸馏（EDistill）通过结构化剪枝参数和调优轻量模块以高训练效率压缩大语言模型（LLM）。尽管这些EDistill LLM在通用能力基准上相对于类似大小的LLM取得了最先进的（SOTA）性能，但我们发现其多步推理能力严重下降，我们称之为推理崩溃。我们系统分析了推理崩溃的几何起源，并表明基于宽度缩减投影矩阵的SOTA EDistill方法遭受有效秩（eRank）崩溃，即隐藏表示的有效秩下降。我们从理论上解释了随机初始化投影矩阵的奇异值如何变得分布不均，导致eRank崩溃，进而导致token不可区分性。为解决此问题，我们提出了RED（保留推理能力的高效蒸馏）方法，该方法引入激活感知初始化，将投影矩阵初始化为通道选择矩阵，从而在理论上缓解eRank崩溃。在Llama和Qwen系列上的实验表明，RED在保持高训练效率和SOTA通用能力的同时，显著恢复了推理能力。

英文摘要

Efficient Distillation (EDistill) compresses large language models (LLMs) by structured pruning parameters and tuning lightweight modules with high training efficiency. Although these EDistilled LLMs achieve state-of-the-art (SOTA) performance on general ability benchmarks relative to similarly sized LLMs, we identify a severe degradation in their multi-step reasoning ability, which we term reasoning collapse. We systematically analyze the geometric origins of reasoning collapse and show that the SOTA EDistill method based on width-reducing projection matrices suffers from eRank collapse, in which the effective rank (eRank) of hidden representations drops. We theoretically explain how singular values of randomly initialized projection matrices become unevenly distributed, leading to eRank collapse and thus token indistinguishability. To address this issue, we propose RED (Reasoning-preserved Efficient Distillation) for LLMs, which introduces activation-aware initialization to initialize projection matrices as channel-selection matrices, thus theoretically mitigating eRank collapse. Experiments on Llama and Qwen series demonstrate that RED substantially recovers reasoning while maintaining high training efficiency and SOTA general ability.

URL PDF HTML ☆

赞 0 踩 0

2605.29326 2026-05-29 cs.LG 版本更新

NeuroEdge: Real-Time Hand Gesture Recognition with High-Density EMG Using Deep Learning at the Edge

NeuroEdge：基于边缘深度学习的密集肌电实时手势识别

Peter Chudinov, Zhenyu Lin, Jay Motamarry, Srihita Panati, Xiaorong Zhang, Zhuwei Qin

发表机构 * San Francisco State University（旧金山州立大学）； Department of Biology（生物系）； School of Engineering in Computer Engineering（计算机工程学院）； College of San Mateo（圣马特奥学院）； Contra Costa College（康特拉科斯塔学院）

AI总结提出NeuroEdge系统，通过HD-EMG无线传输和轻量级CNN推理引擎，在微控制器上实现实时手势识别，准确率90%，延迟83ms。

详情

AI中文摘要

高密度肌电（HD-EMG）已成为解码精细神经肌肉活动的强大方式，可实现用于假肢控制、康复和增强交互等应用的实时神经-机器接口（NMI）。尽管卷积神经网络（CNN）等深度学习方法在基于EMG的手势识别中表现出高分类精度，但由于计算和内存限制，它们在嵌入式硬件上的部署仍然是一个重大挑战。本文提出NeuroEdge，一种基于实时HD-EMG的NMI系统，完全在资源受限的微控制器上执行手势识别。该系统包含两个定制模块：HD-EMG StreamBridge，一种无线通信接口，将原始HD-EMG数据从Quattrocento放大器流式传输到ESP32微控制器；以及EdgeDL推理引擎，一种在索尼Spresense微控制器上执行的轻量级深度学习框架。一个针对嵌入式推理优化的紧凑一维CNN实时处理滑动窗口的EMG数据。数据流和推理通过利用直接内存访问（DMA）进行数据传输以及ESP32和Spresense之间的串行外设接口（SPI）突发通信的架构进行流水线和同步，确保低延迟性能。实验结果表明，NeuroEdge在七种手势中实现了90%的实时分类准确率，使用从前臂记录的192通道HD-EMG，总平均延迟为83毫秒。我们的系统证明了在基于微控制器的边缘设备上部署基于HD-EMG的复杂手势识别的可行性，弥合了高分辨率生物信号采集与基于深度学习的嵌入式推理之间的差距，为下一代NMI铺平了道路。

英文摘要

High-density electromyography (HD-EMG) has emerged as a powerful modality for decoding fine-grained neuromuscular activity, enabling real-time neural-machine interfaces (NMIs) for applications such as prosthetic control, rehabilitation, and augmented interaction. While deep learning approaches such as convolutional neural networks (CNNs)have demonstrated high classification accuracy for EMG-based gesture recognition, their deployment on embedded hardware remains a major challenge due to computational and memory constraints. This paper presents NeuroEdge, a real-time HD EMG-based NMI system that performs gesture recognition entirely on resource-constrained microcontrollers. The system features two custom-designed modules: the HD-EMG StreamBridge, a wireless communication interface that streams raw HD-EMG data from a Quattrocento amplifier to an ESP32 microcontroller; and the EdgeDL Inference Engine, a lightweight deep learning framework executing on a Sony Spresense microcontroller. A compact 1-dimensional CNN optimized for embedded inference processes, sliding windows of EMG data in real time. Data streaming and inference are pipelined and synchronized through an architecture that utilizes Direct Memory Access (DMA) for data transfer and Serial Peripheral Interface (SPI) burst communication between the ESP32 and Spresense, ensuring low-latency performance. Experimental results show that NeuroEdge achieves a real-time classification accuracy of 90% across seven hand gestures, with a total average latency of 83 ms using 192 channels of HD-EMG recorded from the forearm. Our system demonstrates the feasibility of deploying complex HD-EMG-based gesture recognition on microcontroller-based edge devices, bridging the gap between high-resolution biosignal acquisition and deep learning-based embedded inference for next-generation NMIs.

URL PDF HTML ☆

赞 0 踩 0

2605.29307 2026-05-29 cs.CL cs.AI cs.IR cs.LG 版本更新

GrepSeek: Training Search Agents for Direct Corpus Interaction

GrepSeek：训练用于直接语料库交互的搜索代理

Alireza Salemi, Chang Zeng, Atharva Nijasure, Jui-Hui Chung, Razieh Rahimi, Fernando Diaz, Hamed Zamani

发表机构 * University of Massachusetts Amherst（马萨诸塞大学阿默斯特分校）； Princeton University（普林斯顿大学）； Carnegie Mellon University（卡内基梅隆大学）

AI总结提出GrepSeek，一种通过两阶段训练（冷启动数据集+GRPO优化）和语义保持的分片并行执行引擎，训练紧凑型搜索代理直接与文本语料库交互（通过shell命令），在开放域问答中取得最优F1和精确匹配。

详情

AI中文摘要

大型语言模型（LLM）搜索代理通过多轮推理和信息检索，在知识密集型语言任务中展现出强大潜力。大多数现有系统使用检索器，该检索器接收关键词或自然语言查询，并利用预计算文档表示的索引返回排序后的文档列表。在本工作中，我们探索了一种互补视角，其中搜索代理将语料库本身视为搜索环境，并通过执行可执行的shell命令来寻找证据。我们引入了GrepSeek，一种优化的直接语料库交互（DCI）搜索代理，它训练一个紧凑的搜索代理从大型文本语料库中查找、过滤和组合证据。为了解决在大语料库上直接使用强化学习进行学习行为的不稳定性，我们提出了一种两阶段训练流程。首先，我们使用答案感知的Tutor和答案盲的Planner构建冷启动数据集，生成经过验证的、因果基础的搜索轨迹。其次，我们使用组相对策略优化（GRPO）优化初始化的策略，使代理能够通过与语料库的直接交互来改进其任务导向的搜索行为。为了使DCI在大规模下实用，我们进一步使用语义保持的分片并行执行引擎，该引擎将基于shell的检索加速高达7.6倍，同时保持与shell命令顺序执行的字节精确等价。在七个开放域问答基准上的实验表明，GrepSeek在整体词元级F1和精确匹配上取得了最强性能。我们的分析还揭示了纯粹词汇交互在具有显著表面形式变化的查询上的局限性，表明DCI作为搜索代理的一种实用且具有竞争力的方法，可以在现实世界中补充现有的检索范式。

英文摘要

Large Language Model (LLM) search agents have shown strong promise for knowledge-intensive language tasks through multiple rounds of reasoning and information retrieval. Most existing systems access information using a retriever that takes a keyword or natural language query and returns a ranked list of documents using an index of pre-computed document representations. In this work, we explore a complementary perspective in which the search agent treats the corpus itself as the search environment and finds evidence by issuing executable shell commands. We introduce GrepSeek, an optimized direct corpus interaction (DCI) search agent that trains a compact search agent to find, filter, and compose evidence from large text corpora. To address the instability of learning behavior directly with reinforcement learning on large corpora, we propose a two-stage training pipeline. First, we construct a cold-start dataset using an answer-aware Tutor and answer-blind Planner to generate verified, causally grounded search trajectories. Second, we refine the initialized policy with Group Relative Policy Optimization (GRPO), allowing the agent to improve its task-oriented search behavior through direct interaction with the corpus. To make DCI practical at scale, we further use a semantics-preserving sharded-parallel execution engine that accelerates shell-based retrieval by up to $7.6\times$ while preserving byte-exact equivalence with sequential execution of the shell command. Experiments across seven open-domain question answering benchmarks show that GrepSeek achieves the strongest overall token-level $F_1$ and Exact Match. Our analysis also highlights the limitations of purely lexical interaction on queries with substantial surface-form variation, suggesting DCI as a practical and competitive method for search agents that can complement existing retrieval paradigms in the real world.

URL PDF HTML ☆

赞 0 踩 0

2605.29283 2026-05-29 cs.LG cs.AI 版本更新

Do Physics Foundation Models Learn Generalizable Physics? A Bias-Aware Benchmark Across Physical Regimes and Distribution Shifts

物理基础模型能否学习可泛化的物理？一种跨物理机制和分布偏移的偏差感知基准

Mengdi Chu, Yang Liu, Ayan Biswas, Han-Wei Shen

发表机构 * The Ohio State University（俄亥俄州立大学）； Los Alamos National Laboratory（洛斯阿拉莫斯国家实验室）

AI总结通过构建包含8种物理动力学、3种训练数据混合和25种测试机制的基准，评估五种物理基础模型架构，发现当前模型是条件性而非通用性泛化者，其泛化能力依赖于物理机制、时间尺度、初始条件、预训练、模型大小和架构，并指出改进需超越缩放模型或扩展数据，转向学习跨机制、时间尺度和分布偏移的可迁移物理知识。

Comments 26 pages, 31 figures

详情

AI中文摘要

最近的物理基础模型声称具有通用的时空预测能力，但它们的评估通常将性能压缩为固定训练分布下的单一平均分数。这使得难以确定模型是否学习了可泛化的物理动力学，还是仅在特定设置下表现良好。我们构建了一个包含8种物理动力学、3种训练数据混合和25种测试机制的基准，这些测试机制由动态尺度和初始条件复杂性变化引起，涵盖了分布内、分布偏移和分布外设置。我们评估了五种物理基础模型架构和每种架构的四种模型变体（从头训练和三种预训练大小），共得到60,000个测量结果。我们的结果表明，当前的物理基础模型表现为条件性而非通用性泛化者：它们的泛化能力取决于物理机制、时间尺度、初始条件设置、预训练、模型大小和架构。改进训练数据分布只能部分缓解这一限制。预训练和缩放也无法可靠地消除它们的能力偏差。我们认为，改进物理基础模型需要超越缩放模型或扩展数据，转向学习能够更好地跨机制、时间尺度和分布偏移捕获可迁移物理知识的机制。

英文摘要

Recent physics foundation models claim general spatiotemporal forecasting ability, yet their evaluations often collapse performance into a single average score under a fixed training distribution. This makes it difficult to determine whether a model has learned generalizable physical dynamics or only performs well under particular settings. We construct a benchmark with 8 physical dynamics, 3 training-data mixtures, and 25 test regimes induced by dynamic-scale and initial-condition complexity shifts, covering in-distribution, distribution-shift, and out-of-distribution settings. We evaluate five physics foundation model architectures and four model variants per architecture (scratch and three pretrained sizes), resulting in 60,000 measurements. Our results show that current physics foundation models behave as conditional rather than universal generalists: their generality depends on the physical regime, temporal scale, initial-condition setting, pretraining, model size, and architecture. Improving the training data distribution only partially mitigates this limitation. Pretraining and scaling are also unable to reliably remove their ability biases. We argue that improving physics foundation models requires moving beyond scaling models or expanding data, toward learning mechanisms that better capture transferable physical knowledge across regimes, temporal scales, and distribution shifts.

URL PDF HTML ☆

赞 0 踩 0

2605.29273 2026-05-29 cs.LG math.OC 版本更新

A Theoretical and Experimental Study of a Novel Adaptive Learning Algorithm

一种新型自适应学习算法的理论与实验研究

Sakshi Kumari, Shyam Kumar M, Sushmitha P

发表机构 * Department of Mathematics Indian Institute of Technology Patna（数学系印度理工学院帕纳瓦）； Department of Mechanical Engineering Indian Institute of Technology Kharagpur（机械工程系印度理工学院Khargpur）

AI总结针对现有自适应优化器（如Adam和AMSGrad）的收敛性问题，提出基于视线方法的C-Adam优化器，给出收敛性理论证明并通过数值实验验证。

2605.29272 2026-05-29 cs.LG cs.AI stat.ML 版本更新

Causal Label Recovery in Payment Networks

支付网络中的因果标签恢复

Gaurav Dhama

发表机构 * Mastercard（麦star卡）

AI总结针对支付网络中标签存在的四种系统偏差，提出序列三重稳健（STR）估计器，同时纠正所有偏差并达到半参数效率界，实现基于数天而非数月数据的训练。

Comments 49 pages

详情

AI中文摘要

支付网络中的欺诈检测模型依赖于存在系统性偏差的退单标签进行训练。每个标签必须依次经过三个门控：授权（被拒绝的交易不产生标签）、发卡行报告（未报告的欺诈不可见）和延迟（待处理的退单在训练时缺失）。到达的标签可能因第一方滥用或发卡行错误分类而受损。配套论文[arXiv:2605.27557]证明这四种损害对检测性能施加了极小极大下界。本文问：能否达到该下界？我们将观测流程形式化为一个具有三个倾向阶段和一个损坏层的顺序缺失数据问题，并构建了序列三重稳健（STR）估计器。STR同时纠正所有四种损害，并达到半参数效率界——没有估计器能具有更低的渐近方差。它是序列三重稳健的：在每个门控处，一致性仅要求倾向模型或结果回归中有一个正确指定，而非两者。我们提供了通过噪声率调整的伪标签进行损坏校正、通过经验贝叶斯收缩稳定小发卡行的逆倾向权重、提供有效置信区间的插件方差估计量，以及用于有限样本保证的伯恩斯坦集中不等式。在操作层面，我们推导了最优训练延迟——使标签质量损失和模型过时之和最小化的成熟窗口——并证明STR允许使用数天而非数月前的数据进行训练，将模型新鲜度与退单成熟周期解耦。对于任何样本量，STR在均方误差上严格优于基于退单的朴素训练。

英文摘要

Fraud detection models in payment networks train on chargeback labels that are systematically biased. Every label must survive three sequential gates: authorization (declined transactions generate no labels), issuer reporting (unreported fraud is invisible), and delay (pending chargebacks are missing at training time). Labels that do arrive may be corrupted by first-party misuse or issuer misclassification. A companion paper [arXiv:2605.27557] proved that these four impairments impose a minimax lower bound on detection performance. This paper asks: can that bound be achieved? We formalize the observation pipeline as a sequential missing-data problem with three propensity stages and a corruption layer, and construct the Sequential Triply Robust (STR) estimator. The STR corrects for all four impairments simultaneously and achieves the semiparametric efficiency bound -- no estimator can have lower asymptotic variance. It is sequentially triply robust: at each gate, consistency requires only that either the propensity model or the outcome regression is correctly specified, not both. We provide corruption correction via noise-rate-adjusted pseudo-labels, empirical Bayes shrinkage to stabilize inverse-propensity weights for small issuers, a plug-in variance estimator yielding valid confidence intervals, and a Bernstein concentration inequality for finite-sample guarantees. On the operational side, we derive the optimal training delay -- the maturity window that minimizes the sum of label-quality loss and model staleness -- and prove that the STR permits training on data that is days old rather than months old, decoupling model freshness from the chargeback maturity cycle. The STR provably dominates naive chargeback-based training in mean squared error for any sample size.

URL PDF HTML ☆

赞 0 踩 0

2605.29271 2026-05-29 cs.AI cs.IR cs.LG 版本更新

CoHyDE: Iterative Co-Training of LLM Rewriter & Dense Encoder for Tool Retrieval

CoHyDE: 用于工具检索的LLM改写器与稠密编码器的迭代协同训练

Vaishali Senthil, Ashutosh Hathidara, Sebastian Schreiber

发表机构 * SAP Labs（SAP实验室）

AI总结提出CoHyDE方法，通过迭代协同训练稠密编码器和LLM改写器，结合对比学习和偏好对齐，在工具检索任务中同时提升标准查询和模糊查询的性能。

详情

AI中文摘要

在大规模API目录上的工具检索是LLM智能体的核心瓶颈：用户查询以口语化、通常不明确的语言出现，而目录使用技术性API词汇，没有固定的编码器能够单独弥合这一差距。两种主要的训练方法，对比编码器微调和基于冻结LLM的HyDE式查询扩展，从相反的角度解决这个问题，并在互补的方向上失败：微调编码器在查询的表面形式与目录匹配时表现出色，但在不匹配时性能崩溃；而零样本HyDE对不明确的查询更鲁棒，但生成不感知目录的假设描述，当查询形式良好时检索性能下降。我们提出CoHyDE，一种迭代过程，将稠密编码器和LLM改写器训练为单个共同演化的系统：编码器使用改写器生成的目录风格假设描述通过InfoNCE重新训练，改写器通过DPO基于编码器的检索分数进行偏好对齐，两者在循环开始前在工具目录上进行热启动。在ToolBench目录的约10k工具子集上，三轮CoHyDE在标准查询上比最强的单组件基线提高+2.5个百分点的NDCG@5，在保留的模糊查询上提高+6.3个百分点，在最难的模糊层级上增益高达+8个百分点。消融实验证实协同训练是关键因素：单独使用任一组件都无法在形式良好和模糊查询上匹配CoHyDE，在模糊查询上损失高达-8个百分点。

英文摘要

Tool retrieval over large API catalogs is a core bottleneck for LLM agents: user queries arrive in colloquial, often underspecified language, while the catalog uses technical API vocabulary that no fixed encoder can bridge on its own. The two dominant training approaches, contrastive encoder fine-tuning and HyDE-style query expansion with a frozen LLM, address this problem from opposite ends and fail in complementary directions: the fine-tuned encoder excels when the query's surface form already matches the catalog but collapses when it does not, while zero-shot HyDE is more robust to underspecified queries yet generates catalog-unaware hypothetical descriptions that degrade retrieval when queries are well-formed. We introduce CoHyDE, an iterative procedure that trains the dense encoder and the LLM rewriter as a single co-evolving system: the encoder is retrained with InfoNCE on catalog-style hypothetical descriptions produced by the rewriter, and the rewriter is preference-aligned via DPO against the encoder's retrieval scores, with both sides warm-started on the tool catalog before the loop begins. On a ~10k tool subset of the ToolBench catalog, three rounds of CoHyDE improve over the strongest single-component baseline by +2.5 pp NDCG@5 on standard queries and +6.3 pp on held-out vague queries, with gains as large as +8 pp on the hardest vague tier. Ablations confirm that co-training is the key ingredient: using either component in isolation fails to match CoHyDE on both well-formed and vague queries, with losses of up to -8 pp on vague queries.

URL PDF HTML ☆

赞 0 踩 0

2605.29267 2026-05-29 cs.AI cs.LG 版本更新

When and How Human Curation Backfires: Preference Alignment under Multi-Model Self-Consuming Loop

人类策展何时以及如何适得其反：多模型自消费循环下的偏好对齐

Yang Zhang, Xiukun Wei, Xueru Zhang

发表机构 * Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio（计算机科学与工程系，俄亥俄州立大学，哥伦布，俄亥俄）

AI总结研究多模型自消费训练中人类策展对模型对齐的影响，发现跨模型交互可能削弱甚至逆转策展效果，导致长期对齐退化。

详情

AI中文摘要

基础模型越来越多地使用先前模型迭代生成的合成数据进行训练，而非仅依赖真实数据。这种自消费训练范式可能导致模型崩溃、发散或偏差放大。近期工作（Ferbach et al., 2024）表明，将人类策展纳入循环可以引导自消费模型向人类对齐的行为，但这些分析聚焦于单一孤立模型，该模型仅消耗自身输出。然而，在实践中，模型经常交互并训练于其他模型产生的输入-输出对。本文研究多模型机制下的自消费训练。我们首先形式化了一个交互自消费模型的框架，并刻画了所得动力系统何时收敛到稳定点。然后，我们考察了一个模型的人类策展如何影响其自身对齐（自影响），以及这种效应如何传播到其他模型（交叉影响）。与孤立设置中人类策展总是增强模型对齐不同，我们表明跨模型交互可以削弱甚至逆转这种效应，最终损害长期对齐。

英文摘要

Foundation models are increasingly trained on synthetic data generated by prior model iterations rather than exclusively on real data. This self-consuming training paradigm can lead to model collapse, divergence, or bias amplification. Recent work (Ferbach et al., 2024) shows that incorporating human curation into the loop can steer a self-consuming model toward human-aligned behavior, but these analyses focus on a single, isolated model that solely consumes its own outputs. In practice, however, models often interact and train on input-output pairs produced by other models. This paper studies self-consuming training in the multi-model regime. We first formalize a framework for interacting self-consuming models and characterize when the resulting dynamical system converges to a stable point. We then examine how human curation of one model affects its own alignment (self-influence) and how such effects propagate to other models (cross-influence). Unlike isolated settings where human curation always enhances model alignment, we show that cross-model interactions can dampen or even invert this effect, ultimately degrading long-term alignment.

URL PDF HTML ☆

赞 0 踩 0

2605.29259 2026-05-29 cs.LG cs.AI 版本更新

KLAS: Using Similarity to Stitch Neural Networks for Improved Accuracy-Efficiency Tradeoffs

KLAS：利用相似性拼接神经网络以改进精度-效率权衡

Debopam Sanyal, Anantharaman Iyer, Alind Khare, Trisha Jain, Akshay Jajoo, Myungjin Lee, Clayton Kerce, Alexey Tumanov

发表机构 * Georgia Institute of Technology（佐治亚理工学院）； Microsoft M365 Research（微软M365研究）； Cisco Research（思科研究）； Georgia Tech Research Institute（佐治亚理工研究机构）

AI总结提出KLAS框架，通过KL散度度量中间表示相似性自动选择最佳拼接配置，在相同微调成本下提升拼接模型的精度-效率曲线。

详情

AI中文摘要

鉴于部署目标的广泛性，灵活模型选择对于在给定计算预算内优化性能至关重要。最近的研究表明，在模型家族内拼接预训练模型能够实现精度-效率权衡空间的成本效益插值。拼接将一个预训练模型的中间激活变换到另一个模型，生成新的插值拼接网络。这类网络沿精度-效率谱提供了部署选项池。然而，现有拼接方法往往产生次优权衡且缺乏泛化性，因为它们主要依赖启发式方法选择拼接配置。我们认为，构建改进的精度-效率权衡需要显式捕获并利用被拼接预训练模型之间的相似性。为此，我们引入KLAS，一种新颖的拼接选择框架，通过利用中间表示之间的KL散度，自动化和泛化跨模型家族的拼接选择。KLAS从$O(k^2n^2)$种可能性中为$k$个深度为$n$的预训练模型识别最有前景的二元拼接。通过全面实验，我们证明KLAS在相同微调成本下改进了拼接模型的精度-效率曲线，与基线相比，KLAS在相同计算成本下实现了高达$1.21\%$的ImageNet-1K top-1准确率提升，或在保持准确率的同时将FLOPs降低$1.33\times$。

英文摘要

Given the wide range of deployment targets, flexible model selection is essential for optimizing performance within a given compute budget. Recent work demonstrates that stitching pretrained models within a model family enables cost-effective interpolation of the accuracy-efficiency tradeoff space. Stitching transforms intermediate activations from one pretrained model into another, producing a new interpolated stitched network. Such networks provide a pool of deployment options along the accuracy-efficiency spectrum. However, existing stitching approaches often yield suboptimal tradeoffs and lack generalizability, as they primarily rely on heuristics to select stitch configurations. We argue that constructing improved accuracy-efficiency tradeoffs requires explicitly capturing and leveraging the similarity between pretrained models being stitched. To this end, we introduce KLAS, a novel stitch selection framework that automates and generalizes stitch selection across model families by leveraging KL divergence between intermediate representations. KLAS identifies the most promising binary stitches from the $O(k^2n^2)$ possibilities for $k$ pretrained models of depth $n$. Through comprehensive experiments, we demonstrate that KLAS improves the accuracy-efficiency curve of stitched models at the same finetuning cost as baselines. KLAS achieves up to $1.21\%$ higher ImageNet-1K top-1 accuracy at the same computational cost, or maintains accuracy with a $1.33\times$ reduction in FLOPs.

URL PDF HTML ☆

赞 0 踩 0

2605.29250 2026-05-29 cs.CL cs.AI cs.IR cs.LG 版本更新

OmniRetrieval: Unified Retrieval across Heterogeneous Knowledge Sources

OmniRetrieval：跨异构知识源的统一检索

Jinheon Baek, Soyeong Jeong, Sangwoo Park, Woongyeong Yeo, Minki Kang, Patara Trirat, Heejun Lee, Sung Ju Hwang

发表机构 * KAIST（韩国科学技术院）

AI总结提出OmniRetrieval框架，通过自然语言查询识别并调度到不同知识源的本地执行引擎，在13个数据集和309个知识库上超越单源基线，实现异构知识源统一检索。

详情

AI中文摘要

现实世界的信息需求需要访问结构多样的知识源，从非结构化文本和关系表到知识图谱和属性图。然而，现有的检索器一次只在一个源上操作，使用固定的查询语言，使得可用知识的更广泛图景被不兼容的接口所分割。一种自然的统一尝试是将这些源折叠到一个共享空间中，但这会抹去每个源的结构性优势（如模式、本体、组合操作符），而这些优势赋予了每个源其表达能力。因此，对多样化知识的有效检索需要的不是同质化，而是一个能够按每个源自身条件与其交互的总体层。为了实现这一点，我们提出了OmniRetrieval，一个框架，它接受任何自然语言查询，识别合适的知识源，并将源原生查询分派到其本地执行引擎。在涵盖文本、关系和图结构源的13个数据集和309个不同知识库的广泛基准测试中，OmniRetrieval超过了单源基线，证明了它可以作为异构源的通用接口，同时保留使每个源有价值的结构差异。

英文摘要

Real-world information needs require access to structurally diverse knowledge sources, from unstructured text and relational tables to knowledge graphs and property graphs. Existing retrievers, however, operate over one source at a time under a fixed query language, leaving the broader landscape of available knowledge fragmented behind incompatible interfaces. A natural attempt at unification would collapse these sources into a shared space, but this erases the structural affordances (such as schemas, ontologies, compositional operators) that give each source its expressive power. Effective retrieval over diverse knowledge, therefore, requires not homogenization but an overarching layer that meets each source on its own terms. To achieve this, we present OmniRetrieval, a framework that takes any natural-language query, identifies appropriate knowledge sources, and dispatches source-native queries to their native execution engines. Across an extensive benchmark spanning 13 datasets and 309 distinct knowledge bases over text, relational, and graph-structured sources, OmniRetrieval exceeds single-source baselines, demonstrating that it can serve as a general-purpose interface to the heterogeneous sources while preserving the structural distinctions that make each source valuable.

URL PDF HTML ☆

赞 0 踩 0

2605.29249 2026-05-29 stat.ML cs.LG 版本更新

Prediction-Powered Inference Across Many Tasks for AI Evaluation & Social Science Research

跨任务预测驱动推理在AI评估与社会科学研究中的应用

Nicolas Emmenegger, Ellery Stahler, Chara Podimata

发表机构 * MIT（麻省理工学院）

AI总结提出多任务预测驱动推理框架，通过跨任务重校准利用共享结构，在标签稀缺时提升统计推断效率，并证明非线性结构是跨任务增益的必要条件。

详情

AI中文摘要

许多应用需要在多个相关任务中进行统计上有效的推断，而每个假设只使用少量高质量标签。在AI评估中，这些任务可能对应于不同提示、子群体或假设下的模型行为；在社会科学调查中，它们可能对应于相关问题、群体或测量条件。预测驱动推理（PPI）利用丰富但廉价的代理测量来改进有限真实标签的推断，但常用方法独立处理任务，因此未能利用相关任务间的共享结构。这一限制在每任务仅有少量标签的场景中尤为重要。为解决此问题，我们引入了一个多任务预测驱动推理框架，该框架利用来自相关任务的标记数据来提高统计功效，同时保留任务特定的推断。我们的方法通过跨任务重校准来利用代理-真实关系中的共享结构，同时保留任务内修正和功效调优，以构建精确的点估计和置信区间。我们证明，只有当代理-真实关系包含非线性结构时，才能实现超越功效调优PPI的效率提升；仿射跨任务重校准在渐近意义上等同于使用原始代理。我们通过合成和半合成数据集上的实验，以及2024年美国总统大选期间审计语言模型关于选举相关信息的案例研究，补充了我们的理论发现。利用一项大型人工标注研究，我们表明当标签稀缺时，跨任务重校准可以显著减少置信区间宽度。

英文摘要

Many applications require statistically valid inference across many related tasks, while using only a handful of high-quality labels per hypothesis. In AI evaluation, these tasks may correspond to model behaviors across prompts, subgroups, or hypotheses; in social science surveys, they may correspond to related questions, populations, or measurement conditions. Prediction-powered inference (PPI) uses abundant but inexpensive proxy measurements to improve inference from limited, ground-truth labels, but commonly used methods treat tasks independently and therefore fail to exploit shared structure across related tasks. This limitation is especially important in settings where only a small number of labels are available per task. To address this issue, we introduce a multi-task prediction-powered inference framework that uses labeled data from related tasks to improve power while preserving task-specific inference. Our methods exploit the shared structure in the proxy-ground-truth relationship through cross-task recalibration, while retaining within-task rectification and power tuning to construct accurate point estimates and confidence intervals. We prove that efficiency gains beyond power-tuned PPI are only possible when the proxy-ground-truth relationship contains nonlinear structure; affine cross-task recalibrations are asymptotically equivalent to using the original proxy. We complement our theoretical findings with experiments on synthetic and semi-synthetic datasets, as well as a case study auditing language models on election-related information during the 2024 U.S. presidential election. Using a large human-annotation study, we show that cross-task recalibration can substantially reduce confidence interval widths when labels are scarce.

URL PDF HTML ☆

赞 0 踩 0

2605.29247 2026-05-29 cs.AI cs.CL cs.LG 版本更新

DenseSteer: Steering Small Language Models towards Dense Math Reasoning

DenseSteer: 引导小型语言模型进行密集数学推理

Yang Ouyang, Shuhang Lin, Jung-Eun Kim

发表机构 * North Carolina State University（北卡罗来纳州立大学）； Rutgers University（罗格斯大学）

AI总结提出DenseSteer，一种无需训练的推理时引导框架，通过调节内部表征向密集推理模式靠拢，提升小型模型在多步数学推理中的准确性。

Comments ICML 2026

2605.29245 2026-05-29 cs.CR cs.CL cs.LG 版本更新

Implicit Identity Technologies for LLMs: Fingerprinting and Watermarking across Datasets, Models, and Generated Content

LLM的隐式身份技术：跨数据集、模型和生成内容的指纹识别与水印

Bing Liu, Shunping Wang, Yufan Zhu, Xinyi Yu, Jing Huang, Linkang Du, Hongbin Pei, Wei Luo

发表机构 * School of Cyber Science and Engineering, Xi’an Jiaotong University, Xi’an, China（西安交通大学计算机科学与工程学院）； State Grid Henan Marketing Service Center, Henan, China（国网河南营销服务中心）； Institute of Information Engineering, Chinese Academy of Sciences, Beijing, China（中国科学院信息工程研究所）； School of Cyber Security, University of Chinese Academy of Sciences, Beijing, China（中国科学院大学网络安全学院）； School of Information Technology, Deakin University, Geelong, Australia（迪金大学信息技术学院）

AI总结本文综述了LLM指纹识别和水印技术，提出隐式身份统一抽象，并基于生命周期分类法组织数据集、模型和生成内容的技术，建立评估框架。

Comments Accepted by IJCAI-ECAI 2026. 11 pages, 1 figure. Survey and taxonomy of LLM fingerprinting and watermarking for identity, provenance, generated-content attribution, and asset protection

详情

AI中文摘要

本文对LLM指纹识别和水印技术进行了综述和分类，用于身份验证、所有权验证、溯源和生成内容归因。大型语言模型（LLM）需要大量数据、计算和专业知识投入，并越来越多地部署在高风险场景中，因此保护LLM相关资产并追溯其来源至关重要。现有工作已在数据集溯源、模型所有权和生成内容检测方面迅速扩展，但该领域仍然碎片化：指纹识别和水印的使用往往不一致，且方法通常仅在孤立的资产特定设置中研究。为解决这一差距，我们引入隐式身份作为LLM系统中可验证但不可直接观察的身份信号的统一抽象。我们将指纹识别区分为源自内在特征的非侵入式身份，将水印区分为有意嵌入数据、模型或生成内容中的侵入式身份。然后，我们提出一种基于生命周期的分类法，将技术组织到数据集、模型和生成内容中，并进一步通过验证语义进行区分：基于相似性的归因和密钥验证。最后，我们建立一个以可识别性、鲁棒性和可部署性为中心的评估框架，总结在现实访问和变换条件下的代表性指标。通过统一术语、生命周期阶段和评估目标，本综述为研究LLM身份技术以及开发更可靠的资产保护和溯源机制提供了结构化基础。

英文摘要

This paper presents a survey and taxonomy of LLM fingerprinting and watermarking for identity, ownership verification, provenance, and generated-content attribution. Large language models (LLMs) require substantial investments in data, computation, and expertise, and are increasingly deployed in high-stakes settings, making it critical to protect LLM-related assets and trace their origins. Existing work has rapidly expanded across dataset provenance, model ownership, and generated-content detection, but the field remains fragmented: fingerprinting and watermarking are often used inconsistently, and methods are typically studied within isolated asset-specific settings. To address this gap, we introduce implicit identity as a unifying abstraction for verifiable but not directly observable identity signals in LLM systems. We distinguish fingerprinting as non-intrusive identity derived from intrinsic characteristics, and watermarking as intrusive identity deliberately embedded into data, models, or generated content. We then propose a lifecycle-based taxonomy that organises techniques across datasets, models, and generated content, and further separates them by verification semantics: similarity-based attribution and keyed verification. Finally, we establish an evaluation framework centred on identifiability, robustness, and deployability, summarising representative metrics under realistic access and transformation regimes. By unifying terminology, lifecycle stages, and evaluation objectives, this survey provides a structured foundation for studying LLM identity technologies and for developing more reliable mechanisms for asset protection and provenance.

URL PDF HTML ☆

赞 0 踩 0

2605.29236 2026-05-29 cs.LG 版本更新

SigmaMedStat: Temporal Signal Modeling for ICU False Alarm Reduction

SigmaMedStat: 用于ICU误报减少的时间信号建模

Arunkumar Ramachandran

AI总结提出SigmaMedStat系统，通过将60秒记录分割为6个10秒块并提取连续小波变换尺度图，结合EfficientNet-B0编码器和两层LSTM网络进行时间建模，在PhysioNet/CinC Challenge 2015数据集上实现AUC 0.822，有效降低ICU误报。

Comments Code available at github.com/Arun-K-Ram/sigmamedstat

详情

AI中文摘要

重症监护病房（ICU）中的警报疲劳是一个有充分记录的患者安全危机。临床监护仪每天每位患者产生350次或更多警报，其中72-99%在临床上无关紧要。工作人员对非可操作警报的脱敏增加了错过真正紧急情况的风险。本文提出了SigmaMedStat，一个机器学习系统，在采取临床行动之前评估生理警报信号的可信度。在PhysioNet/Computing in Cardiology Challenge 2015数据集（包含498个四通道ICU警报记录）上评估了四种方法。主要贡献是一个时间建模框架，它将每个60秒记录分割成六个连续的10秒块，进而为每个块生成连续小波变换（CWT）尺度图，使用共享的EfficientNet-B0编码器对每个块进行编码，并将得到的特征序列传递给两层长短期记忆（LSTM）网络。五折分层交叉验证的平均AUC为0.822 +/- 0.016（95% CI: [0.790,0.853]），而基于完整60秒窗口的静态EfficientNet基线为0.641。消融研究证实，时间分块和多通道信号融合均独立地有助于分类性能。按警报类型分析显示，心室扑动是最准确分类的警报类型（AUC 0.820），而心脏停搏仍然是最难的（AUC 0.722）。错误分析识别出65个假阴性和85个高置信度错误分类作为主要失败模式。所有代码和结果公开在https://github.com/Arun-K-Ram/sigmamedstat。

英文摘要

Alarm fatigue in intensive care units (ICUs) is a well documented patient safety crisis. Clinical monitors generate 350 or more alarms per patient per day, out of which 72-99% are clinically irrelevant. Staff desensitization to non-actionable alarms increases the risk of missed true emergencies. This paper presents SigmaMedStat, a machine learning system that evaluates the trustworthiness of physiological alarm signals before clinical action is taken. Four approaches were evaluated on the PhysioNet/Computing in Cardiology Challenge 2015 dataset of 498 four-channel ICU alarm recordings. Primary contribution is a temporal modeling framework that splits each 60 second recording into six consecutive 10-second chunks, and this in turn generates Continuous Wavelet Transform (CWT) scalograms per chunk, encodes each chunk with a shared EfficientNet-B0 encoder, and passes the resulting feature sequence to a two-layer Long Short-Term Memory (LSTM) network. Five-fold stratified cross-validation yields a mean AUC of 0.822 +/- 0.016 (95% CI: [0.790,0.853]), compared to 0.641 for a static EfficientNet baseline trained on the full 60-second window. Ablation studies confirm that temporal chunking and multi-channel signal fusion both contribute independently to classification performance. Per-alarm type analysis reveals that Ventricular Flutter is the most accurately classified alarm type (AUC 0.820) while Asystole remains the hardest (AUC 0.722). Error analysis identifies 65 false negatives and 85 high-confidence misclassifications as the primary failure modes. All code and results are publicly available at https://github.com/Arun-K-Ram/sigmamedstat.

URL PDF HTML ☆

赞 0 踩 0

2605.29202 2026-05-29 cs.LG 版本更新

Auditing Training Data in Generative Music Models via Black-Box Membership Inference

通过黑盒成员推断审计生成音乐模型中的训练数据

Yi Chen Liu, Jiawei Yu, Kexin Cao, Syed Irfan Ali Meerza, Trishika Movva, Jian Liu

发表机构 * University of Georgia（佐治亚大学）； Independent Researcher（独立研究者）； University of Tennessee（田纳西大学）

AI总结本文提出一种黑盒成员推断方法，通过比较候选音频与模型基于其描述生成输出的语义对齐程度，并训练音乐审计器分类成员身份，实现对生成音乐模型训练数据的高精度审计。

Comments The paper has been accepted for presentation at the workshop ArtSec 2026: Workshop on Artwork Security and Provenance in the Age of AI

详情

基于生成式机器学习的季节预报概率偏差校正：以北极海冰预测为例

Parsa Gooya, Reinel Sospedra-Alfonso

发表机构 * Canadian Centre for Climate Modelling and Analysis（加拿大气候建模与分析中心）

AI总结本研究提出基于条件变分自编码器的概率后处理框架，通过生成器替代高斯参数化解码器并采用连续排序概率评分优化，有效校正季节预报的系统偏差并提升分辨率与谱能量。

详情

AI中文摘要

季节气候预测通过提供未来几个月最可能发生的气候条件及其相关不确定性的早期信息，支持规划和风险管理。集合预报通过模拟许多可能的结果来实现这一点，使得预测能够以可用的概率形式表达。大集合和高分辨率预报通过更好地采样不确定性和捕捉更精细尺度的过程来加强这种指导，但会带来显著的计算成本。此外，预报集合存在漂移，并表现出系统偏差和随提前时间增长的时空误差，需要仔细的后处理和校准。加拿大气候建模与分析中心开发了一种基于条件变分自编码器（cVAE）的概率后处理框架，用于生成北极海冰的偏差校正季节预测的大集合。生成模型旨在学习以有偏模型预测为条件的观测分布。这使得能够生成任意大的、经过良好校准的、偏差校正的预测集合，且具有更高的技能。在此，我们扩展该框架以解决标准cVAE已知的局限性——预测中细尺度能量的损失和特征性的模糊。具体而言，我们在cVAE中使用生成器替代高斯参数化解码器，并在目标函数中使用连续排序概率评分代替均方误差。我们进一步使用比原始预报更高分辨率的目标数据集。我们表明，与基准预测相比，调整后的预测校准更好，与观测分布更一致，误差更小，同时相对于标准cVAE提高了原始预报的分辨率、锐度和谱功率。

英文摘要

Seasonal climate predictions support planning and risk management by offering early information of the most likely-to-occur climate conditions in the coming months, and associated uncertainties. Ensemble forecasts enable this by simulating many plausible outcomes, allowing predictions to be expressed as usable probabilities. Large ensembles and high-resolution forecasts strengthen this guidance by better sampling uncertainty and capturing finer-scale processes but come with significant computational cost. Moreover, forecast ensembles drift and exhibit systematic biases and spatio-temporal errors that grow with lead time, requiring careful post-processing and calibration. A probabilistic post-processing framework based on conditional Variational Autoencoders (cVAEs) was developed at the Canadian Center for Climate Modeling and Analysis to generate large ensembles of bias adjusted seasonal predictions of Arctic sea ice. The generative model was designed to learn the observational distribution conditioned on the biased model prediction. This enables generation of arbitrarily large ensembles of well-calibrated, bias corrected forecasts with improved skill. Here, we extend this framework to address the loss of fine-scale energy and the characteristic blurriness in predictions, a known limitation of standard cVAEs. Specifically, we employ a generator in place of the Gaussian parametrized decoder in the cVAE and use Continuous Ranked Probability Score in the objective function instead of the Mean Square Error. We further use a higher resolution target dataset compared to the raw forecast. We show that the adjusted forecasts are better calibrated, more consistent with the observational distribution, and exhibit smaller errors than benchmark predictions, while also enhancing the resolution of the raw forecasts and improving sharpness and spectral power relative to the standard cVAE.

URL PDF HTML ☆

赞 0 踩 0

2605.29168 2026-05-29 cs.AI cs.LG 版本更新

Better Later Than Sooner: Neuro-Symbolic Knowledge Graph Construction via Ontology-grounded Post-extraction Correction

晚做总比早做好：基于本体后提取校正的神经符号知识图谱构建

Lorenzo Loconte, Timothy Hospedales, Cristina Cornelio

发表机构 * University of Edinburgh, UK（爱丁堡大学）； Samsung AI Center, Cambridge, UK（三星人工智能中心）

AI总结提出一种神经符号框架，通过后提取校正解决LLM提取知识图谱时的本体不一致问题，减少token使用并提升图谱一致性。

详情

AI中文摘要

问答是AI中的核心挑战，特别是对于需要跨文档多跳推理或聚合、穷举等符号操作的复杂查询。检索增强生成已成为问答的主要方法，最近的基于图的变体通过组织知识以更好地支持组合性问题，部分解决了这些问题。然而，大多数基于文本图的RAG方法仍缺乏可靠回答复杂问题所需的符号操作结构。这推动了基于符号图的方法，该方法提取知识图谱，其关系是逻辑谓词，支持类似SQL的查询。然而，这些流程通常使用LLM进行KG提取，这可能导致一致性问题，即提取的事实可能违反常识本体约束。我们提出了一种用于本体基础KG构建的神经符号框架，结合了开放域提取、基于嵌入的类型和谓词规范化，以及针对本体违规的LLM校正。通过将校正推迟到后提取阶段，我们的方法避免了重复的LLM调用，显著减少了token使用，同时提高了KG一致性并保持了下游问答质量。最后，通过测量SPARQL图模式的出现，我们展示了提取的KG非常适合符号查询。

英文摘要

Question answering (QA) is a core challenge in AI, particularly for complex queries requiring multi-hop reasoning across documents, or symbolic operations like aggregation or exhaustive listing. Retrieval-augmented generation has become the dominant approach to QA, with recent graph-based variants addressing part of these issues by organizing knowledge to better support compositional questions. However, most textual graph-based RAG methods still lack the structure needed for symbolic operations useful to answer complex questions reliably. This motivates symbolic graph-based approaches, which extract knowledge graphs (KGs) whose relations are logic predicates that enable SQL-like querying. Yet these pipelines typically use LLMs for KG extraction, which can introduce consistency issues, where extracted facts may violate commonsense ontology constraints. We propose a neuro-symbolic framework for ontology-grounded KG construction combining open-domain extraction, embedding-based canonicalization of types and predicates, and targeted LLM-based correction of ontology violations. By deferring corrections to a post-extraction stage, our method avoids repeated LLM calls, substantially reducing token usage while improving KG consistency and preserving downstream QA quality. Finally, we show that the extracted KGs are well suited for symbolic querying by measuring the occurrence of SPARQL graph patterns.

URL PDF HTML ☆

赞 0 踩 0

2605.29161 2026-05-29 cs.LG cs.AI 版本更新

Evolutionary Refinement of Generative Graph Topologies: A Hybrid WGAN-GA Approach

生成图拓扑的进化精炼：一种混合WGAN-GA方法

James Sargant, Seyedeh Ava Razi Razavi, Renata Dividino, Sheridan Houghten

发表机构 * Computer Science Brock University, Canada（计算机科学布鲁克大学加拿大）

AI总结提出一种混合WGAN-GA方法，通过遗传算法精炼GAN生成的图结构，减少度分布和谱分布等偏差，使合成图更接近真实图。

Comments 6 pages, 4 Figures, 4 Tables, IEEE World Congress on Computational Intelligence

详情

AI中文摘要

由于离散连通性、图大小变化和类别特定的结构模式，生成逼真的图结构数据具有挑战性。最近基于生成对抗网络（GAN）的图生成方法通过学习连通性和匹配类别特定的密度分布来改进边建模。然而，这些模型在与真实图相比时仍表现出明显的偏差，例如度和谱分布，表明重要的结构属性未完全保留。本工作旨在通过使用遗传算法（GA）精炼现有基于GAN的图生成器框架生成的图来减少这些偏差。在GAN框架中，生成器同时生成节点特征和连通性模式，而基于GNN的判别器评估图的真实性和类别一致性，以确保全局结构和类别对齐。在此基础上，我们应用GA来精炼生成图的边。精炼过程引导合成图更接近真实数据，同时保持多样性和新颖性。实验结果表明，与基础模型相比，GA精炼持续降低组合最大均值差异（MMD），从而生成更匹配真实结构模式的图。这表明进化精炼是纠正基于GAN的图生成器中残留结构偏差的有效且灵活的方法，提高了它们用于逼真图合成和数据增强的适用性。

英文摘要

Generating realistic graph-structured data is challenging due to discrete connectivity, varying graph sizes, and class-specific structural patterns. Recent Generative Adversarial Networks (GAN)-based graph generation methods improve edge modelling by learning connectivity and matching class-specific density distributions. However these models still exhibit noticeable deviations such as in degree and spectral distribution when compared to real graphs, indicating that important structural properties are not fully preserved. This work aims to reduce these deviations by refining the graphs produced by an existing GAN-based graph generator framework with a Genetic Algorithm (GA). In the GAN framework, the generator produces both node features and connectivity patterns, while a GNN-based critic evaluates graph realism and class consistency to ensure global structural and class alignment. Building on this foundation, we apply a GA to refine the edges of generated graphs. The refinement process guides synthetic graphs toward closer agreement with real data, while preserving diversity and novelty. Experimental results show that the GA refinement consistently lowers combined Maximum Mean Discrepancy (MMD) compared to the base model, leading to graphs that more closely match real structural patterns. This demonstrates that evolutionary refinement is an effective and flexible way to correct residual structural deviations in GAN-based graph generators, improving their suitability for realistic graph synthesis and data augmentation.

URL PDF HTML ☆

赞 0 踩 0

2605.29158 2026-05-29 cs.LG cs.IR q-bio.BM 版本更新

PROTOCOL: Late Interaction Retrieval for Protein Homolog Search

PROTOCOL: 用于蛋白质同源搜索的延迟交互检索

Gabrielle Cohn, Rohan Gumaste, Minh Hoang, Vihan Lakshman

发表机构 * MIT（麻省理工学院）； Princeton University（普林斯顿大学）

AI总结提出ProtoCol模型，利用ColBERT风格的延迟交互机制对残基嵌入进行最大相似度评分，以提升远程同源搜索的灵敏度，在SCOPe超家族和Pfam clan基准上优于多种基线方法。

详情

AI中文摘要

蛋白质同源搜索是功能注释、结构预测和进化分析的基础，但在全局序列相似性较弱且经典比对方法灵敏度下降的“模糊区”中仍然具有挑战性。蛋白质语言模型提供了上下文感知的表示，可以在此范围内提高比对灵敏度。然而，先前的基于蛋白质嵌入的检索流程通常将这些表示池化为单个向量，可能掩盖揭示远程同源性的局部基序、结构域或保守残基。我们引入了ProtoCol，该模型将蛋白质表示为残基嵌入的集合，并使用ColBERT风格的延迟交互来测试残基级比较是否改善同源检索。ProtoCol独立编码蛋白质，保持候选表示可预计算，并通过残基嵌入上的MaxSim对候选进行评分。在SCOPe超家族和Pfam clan基准上，ProtoCol优于基于序列组成、比对、池化PLM和训练的单向量基线，支持延迟交互作为远程同源搜索的有效检索层。

英文摘要

Protein homology search underlies function annotation, structure prediction, and evolutionary analysis, but remains challenging in the "twilight zone," where global sequence similarity is weak and classical alignment methods lose sensitivity. Protein language models provide context-aware representations that could improve alignment sensitivity in this regime. However, prior protein embedding-based retrieval pipelines often pool these representations into a single vector, potentially obscuring local motifs, domains, or conserved residues that reveal remote homology. We introduce ProtoCol, a model which represents proteins as sets of residue embeddings and uses ColBERT-style late interaction to test whether residue-level comparison improves homolog retrieval. ProtoCol encodes proteins independently, keeps candidate representations pre-computable, and scores candidates with MaxSim over residue embeddings. On SCOPe superfamily and Pfam clan benchmarks, ProtoCol outperforms sequence-composition, alignment-based, pooled PLM, and trained single-vector baselines, supporting late interaction as an effective retrieval layer for remote homology search.

URL PDF HTML ☆

赞 0 踩 0

2605.29157 2026-05-29 cs.LG cs.AI cs.CL 版本更新

Parallax: Parameterized Local Linear Attention for Language Modeling

Parallax: 参数化局部线性注意力用于语言建模

Yifei Zuo, Dhruv Pai, Zhichen Zeng, Alec Dewulf, Shuming Hu, Zhaoran Wang

发表机构 * Northwestern University（西北大学）； Tilde Research（Tilde研究）； University of Washington（华盛顿大学）

AI总结提出Parallax，一种可扩展的参数化局部线性注意力机制，通过消除数值求解器并学习查询投影器，在语言模型预训练中实现一致的困惑度改进和下游任务迁移优势。

详情

私有随机决策理论在线学习的最优间隙相关遗憾

Tommaso Cesari, Roberto Colomboni

发表机构 * School of Electrical Engineering and Computer Science University of Ottawa（电气工程与计算机科学学院，渥太华大学）； School of Mathematics University of Bristol（数学学院，布里斯托尔大学）

AI总结针对完全信息、事件级纯差分隐私的随机决策理论在线学习，提出一种无水平线的纯差分隐私算法，并证明遗憾界为O(log K / Δ_min + log K / ε)。

详情

AI中文摘要

我们研究具有完全信息和事件级纯差分隐私的随机决策理论在线学习。Hu和Mehta在COLT上提出的一个开放问题要求确定在纯事件级差分隐私下，随机决策理论在线学习的最优间隙相关遗憾率。对于$K$个动作，损失在$[0,1]$中，且唯一最优动作与次优动作的间隙为$Δ_{\min}$，已知下界为$ rac{\log K}{\min\{Δ_{\min},\varepsilon\}} $，或等价地，在通用常数范围内，为\[ rac{\log K}{Δ_{\min}}+ rac{\log K}{\varepsilon} \]。我们给出一个无水平线的纯DP算法，并证明对于任意水平线$T$，显式遗憾界\[ \operatorname{Reg}_T \le 1000 \cdot \left( rac{\log K}{Δ_{\min}}+ rac{\log K}{\varepsilon} ight) \]。数值常数未优化。该算法将时间划分为指数增长大小的块，每个块内执行单个动作，并通过指数机制（应用于前一个块的数据无关随机前缀）选择下一个动作。随机前缀将块遗憾转化为所有前缀长度上softmax选择误差的和。单个熵势参数以代价$\log K/\varepsilon$控制所有隐私主导的大间隙动作。

英文摘要

We study stochastic decision-theoretic online learning with full information and event-level pure differential privacy. A COLT open problem of Hu and Mehta asks to determine the optimal gap-dependent regret rate for stochastic decision-theoretic online learning under pure event-level differential privacy. For $K$ actions, losses in $[0,1]$, and a unique best action separated from the second-best action by gap $Δ_{\min}$, the known lower bound is of order $ \frac{\log K}{\min\{Δ_{\min},\varepsilon\}}, $ or equivalently, up to universal constants, of order \[ \frac{\log K}{Δ_{\min}}+\frac{\log K}{\varepsilon}. \] We give a horizon-free pure-DP algorithm and prove the explicit regret bound \[ \operatorname{Reg}_T \le 1000 \cdot \left(\frac{\log K}{Δ_{\min}}+\frac{\log K}{\varepsilon}\right) \] for every horizon $T$. The numerical constant is not optimized. The algorithm partitions time into blocks of exponentially increasing size, plays a single action throughout each block, and chooses the next action by an exponential mechanism applied to a data-independent random prefix of the previous block. The random prefix converts block regret into a sum, over all prefix lengths, of softmax selection errors. A single entropy-potential argument controls all privacy-dominated large-gap actions at cost $\log K/\varepsilon$.

URL PDF HTML ☆

赞 0 踩 0

2605.29139 2026-05-29 stat.ML cs.LG 版本更新

何时与多久？时间推理中的读出-中介角度

Shreyas Fadnavis, Praitayini Kanakaraj, Felix Wyss

发表机构 * Bioscope AI

AI总结通过测量线性探针与模型实际计算子空间之间的角度，发现探针可能学习与模型无关的正交方向，从而揭示基于探针的可解释性存在根本缺陷。

详情

AI中文摘要

线性探针几乎可以完美解码表示，但却可能与模型如何使用该表示完全无关。在语言模型的日历日期持续时间推理中，一个$\\\sin$/ $\\\cos$探针从层的激活中恢复一年中的第几天，但消融其方向对模型的答案没有影响——而在同一层通过分布式对齐搜索（DAS）找到的四维子空间被消融时，性能完全崩溃。我们测量这两个子空间之间的角度——\\emph{读出-中介角度}——发现它与两个随机子空间之间的角度（Haar均匀零假设）无法区分，这意味着探针学到了与模型实际计算正交的方向。逆向工程电路揭示了原因：注意力头通过学习的QK偏移（$\\\pm30$和$\\\pm61$天）路由月份粒度的上下文，然后MLP将\\emph{何时}（绝对日期）转换为\\emph{多久}（持续时间）——所有这些都在探针从未触及的因果子空间的下游。稀疏自编码器分解证实了这种分裂：探针对齐和DAS对齐的特征编码了语义上不相交的概念，因果重叠可忽略不计。这种分离在四个规模（$1.5$-$9\\\,$B）和两个模型家族中重复出现，并在另外两个领域（空间位移、符号算术）有初步证据，表明读出-中介正交性是探针可解释性的一种普遍失败模式。这直接削弱了将探针部署为运行时安全监控的提议：探针可以在模型已悄然放弃的方向上报告高置信度。

英文摘要

A linear probe can decode a representation almost perfectly and yet be completely irrelevant to how the model uses it. On calendar-date duration reasoning in language models, a $\sin$/$\cos$ probe recovers day-of-year from a layer's activations, yet ablating its direction has no effect on the model's answers -- while ablating a four-dimensional subspace found by Distributed Alignment Search (DAS) at the same layer collapses performance entirely. We measure the angle between these two subspaces -- the \emph{readout-mediator angle} -- and find it indistinguishable from the angle between two random subspaces (the Haar-uniform null), meaning the probe has learned a direction orthogonal to the model's actual computation. Reverse-engineering the circuit reveals why: attention heads route month-grained context through learned QK offsets at ${\pm}30$ and ${\pm}61$ days, and MLPs then convert \emph{when} (absolute date) into \emph{how long} (duration) -- all downstream of the causal subspace the probe never touches. Sparse-autoencoder decomposition confirms the split: probe-aligned and DAS-aligned features encode semantically disjoint concepts with negligible causal overlap. The dissociation replicates across four scales ($1.5$-$9\,$B) and two model families, with preliminary evidence on two further domains (spatial displacement, symbolic arithmetic), suggesting that readout-mediator orthogonality is a general failure mode of probe-based interpretability. This directly undermines proposals to deploy probes as runtime safety monitors: the probe can report high confidence on a direction the model has silently abandoned.

URL PDF HTML ☆

赞 0 踩 0

2605.29121 2026-05-29 math.DS cs.AI cs.LG 版本更新

A Minimal Bifurcation Model of Load Imbalance in a Softmax Mixture-of-Experts Router

Softmax混合专家路由器中负载不平衡的最小分岔模型

O. M. Kiselev

发表机构 * Innopolis University（因诺波利斯大学）

AI总结提出一个两专家混合专家层的自适应softmax路由最小动力学模型，通过平均场极限从离散强化规则导出，发现超临界叉形分岔导致负载不平衡，并推导了分岔集和尖点灾变的精确参数方程。

Comments 21 pages, 11 figures

详情

AI中文摘要

我们提出了一个两专家混合专家（MoE）层的自适应softmax路由的最小动力学模型。该模型作为离散强化规则的平均场极限得到：被选中的专家获得小的分数增量，而所有分数经历正则化衰减。在对称情况下，极限系统具有超临界叉形分岔：对于弱反馈，存在唯一的稳定平衡状态，而当反馈强度超过临界值时，出现两个稳定的不对称状态。当加入外部不对称性时，叉形分岔展开为一对折叠分岔，在控制参数平面中形成一个尖点。我们推导了分岔集和尖点灾变的局部规范型的精确参数方程。数值实验将这一图景与经验专家负载、一个小的可训练MoE模型、硬top-1 PyTorch路由以及一个关于数字的小型分类实验联系起来。结果为自适应MoE路由器中负载不平衡的突然转变提供了一个可控的低维机制。

英文摘要

We propose a minimal dynamical model of adaptive softmax routing for a two-expert Mixture-of-Experts (MoE) layer. The model is obtained as a mean-field limit of a discrete reinforcement rule: the selected expert receives a small score increment, while all scores undergo regularizing decay. In the symmetric case the limiting system has a supercritical pitchfork bifurcation: for weak feedback there is a unique stable balanced state, whereas above a critical feedback strength two stable asymmetric states appear. When an external asymmetry is added, the pitchfork unfolds into a pair of fold bifurcations forming a cusp in the control-parameter plane. We derive exact parametric equations for the bifurcation set and the local normal form of the cusp catastrophe. Numerical experiments connect this picture to empirical expert load, a small trainable MoE model, hard top-1 PyTorch routing, and a small classification experiment on digits. The results provide a controlled low-dimensional mechanism for abrupt transitions to load imbalance in adaptive MoE routers.

URL PDF HTML ☆

赞 0 踩 0

2605.29114 2026-05-29 cs.CR cs.LG cs.RO 版本更新

ReasonBreak: Probing Vulnerabilities in Reasoning-Enabled Vision-Language-Action Models for Autonomous Driving

ReasonBreak: 探测自动驾驶中具备推理能力的视觉-语言-行动模型的脆弱性

Mohammadreza Teymoorianfard, Jean-Philippe Monteuuis, Jonathan Petit, Amir Houmansadr

发表机构 * University of Massachusetts Amherst（马萨诸塞大学阿默斯特分校）； Qualcomm（高通）

AI总结本文通过黑盒攻击方法，首次系统研究了具备推理能力的视觉-语言-行动模型在自动驾驶中面对真实输入扰动时的脆弱性，发现其推理和轨迹生成均易受攻击，导致碰撞率上升。

详情

AI中文摘要

具备集成推理能力的视觉-语言-行动（VLA）模型已被提出用于端到端自动驾驶，假设推理与轨迹生成之间存在紧密耦合。然而，此类系统在真实输入扰动下的鲁棒性尚未得到充分探索。我们表明，这些模型对真实输入扰动高度脆弱，在闭环仿真中推理攻击成功率高达89%，轨迹操控攻击成功率高达72%，导致碰撞率上升和安全指标下降。以NVIDIA近期开发的Alpamayo模型为代表，我们首次对具备推理能力的VLA模型在真实文本输入损坏下进行了系统性黑盒研究，评估了其对推理和驾驶行为的影响。我们引入了一个推理感知评估框架，捕捉推理的语义和结构方面，并结合以安全为中心的度量。我们还引入了一个基准，用于评估自动驾驶中推理-轨迹交互的攻击与防御。我们的结果强调了严格评估和改进防御的必要性，以确保自动驾驶中具备推理能力的VLA系统的安全性。

英文摘要

Vision-Language-Action (VLA) models with integrated reasoning have been proposed for end-to-end autonomous driving, assuming a tight coupling between reasoning and trajectory generation. However, the robustness of such systems under realistic input perturbations remains largely unexplored. We show that these models are highly vulnerable to realistic input perturbations, achieving up to 89% attack success rate (ASR) on reasoning and up to 72% on trajectory manipulation in closed-loop simulation, leading to increased collision rates and degraded safety metrics. Using NVIDIA's recent Alpamayo models as representative industry-developed VLAs, we conduct the first systematic black-box study of reasoning-enabled VLA models under realistic textual input corruptions, evaluating their impact on reasoning and driving behavior. We introduce a reasoning-aware evaluation framework capturing both semantic and structural aspects of reasoning, along with safety-centric measures. We also introduce a benchmark for evaluating attacks and defenses on reasoning-trajectory interactions in autonomous driving. Our results highlight the need for rigorous evaluation and improved defenses to ensure the safety of reasoning-enabled VLA systems in autonomous driving.

URL PDF HTML ☆

赞 0 踩 0

2605.29108 2026-05-29 cs.LG 版本更新

Bridging Chemists and AI: An Expert-Augmented Framework for Interpretable Route Evaluation

连接化学家与人工智能：一种专家增强的可解释路线评估框架

Yujia Guo, Mikhail Kabeshov, Tat Hong Duong Le, Samuel Genheden, Marco V. Mijangos, Varvara Voinarvoska, Giulia Bergonzini, Ola Engkvist, Samuel Kaski

发表机构 * Department of Computer Science, Aalto University（艾尔沃斯大学计算机科学系）； Discovery Sciences R&D, AstraZeneca（阿斯利康发现科学研发部）； Department of Computer Science and Engineering, Chalmers University of Technology and University of Gothenburg（查尔姆斯理工大学和哥德堡大学计算机科学与工程系）； Department of Computer Science, University of Manchester（曼彻斯特大学计算机科学系）

AI总结提出一种专家增强的数据驱动评分框架，结合机器学习与化学家领域知识，实现多步合成路线的数值与可解释评估，显著提升预测准确性。

Comments 13 pages, 11 figures, ELLIS Unconference Workshop: Generative Models, LLMs, and the Future of Molecular AI (ML4Molecules 2025)

详情

AI中文摘要

选择高效的多步合成路线是有机合成中的一个核心挑战，特别是在药物化学和工艺化学中，路线选择直接影响可行性、成本和开发效率。数据驱动的评估系统常常过度简化合成设计的多目标性质，并依赖于代理数据集（如专利路线）而非普遍适用的标准。为了解决这一问题，我们引入了一种专家增强的数据驱动评分框架，该框架将机器学习与化学家的领域知识相结合，用于数值和可解释的路线评估。使用参考路线与机器生成路线之间的树编辑距离训练基于DeepSets的模型，然后通过专家评估进行微调，以产生定量分数和可解释的定性类别：好、合理和差。所得系统在类别评估预测上实现了0.78的Spearman相关系数和0.77的Pearson相关系数，在分数预测上实现了60.2%的top-1排名准确率，显著优于之前17.5%的基线水平。

英文摘要

Selecting efficient multi-step synthetic routes is a central challenge in organic synthesis, particularly in medicinal and process chemistry, where route choice directly impacts feasibility, cost, and development efficiency. Data-driven assessment systems often oversimplify the multi-objective nature of synthesis design and rely on proxy datasets, such as patent routes, rather than universally grounded criteria. To address this, we introduce an expert-augmented, data-driven scoring framework that integrates machine learning with chemists' domain knowledge for both numerical and explainable route assessment. A DeepSets-based model is trained using tree edit distance between reference and machine-generated routes, and then fine-tuned with expert evaluations to produce both quantitative scores and interpretable qualitative categories: Good, Plausible, and Bad. The resulting system achieves a Spearman correlation coefficient of 0.78 and a Pearson correlation of 0.77 for category assessment prediction, and 60.2% top-1 ranking accuracy for score prediction, substantially outperforming the previous baseline of 17.5%.

URL PDF HTML ☆

赞 0 踩 0

2605.29101 2026-05-29 cs.LG cs.IT math.IT 版本更新

知识卸载：将大语言模型分解为稀疏骨干和记忆模块

Karim Galliamov, Rochelle Choenni, Ivan Titov

发表机构 * University of Amsterdam（阿姆斯特丹大学）； University of Edinburgh（爱丁堡大学）

AI总结提出知识卸载（KOFF）框架，通过结构化剪枝和轻量级恢复模块将预训练LLM分解为稀疏共享骨干和领域特定记忆，在约12%全局稀疏度下保持模型性能，并发现语言特定神经元优先被移除。

详情

AI中文摘要

大语言模型将通用能力和领域特定知识编码在同一组参数中。我们探究这种能力是否可以重组：将广泛有用的计算保留在共享骨干中，而将专门知识移入外部记忆模块。我们提出知识卸载（KOFF），一个将预训练LLM分解为稀疏共享骨干和领域特定记忆的框架。从冻结的基础模型开始，我们联合学习结构化剪枝掩码和轻量级恢复模块，这些模块以LoRA适配器和学习型键值缓存的形式实现。在3B到8B的Llama和Qwen模型上，我们发现非平凡的能力可以从共享骨干中移出而不会导致模型能力大幅下降。在大约12%的全局稀疏度下，KOFF保留了未剪枝模型的大部分性能，而剪枝相同冻结模型但没有记忆则性能急剧下降。消融实验表明LoRA和学习型KV记忆是互补的，专门化分析表明学习到的分解是有意义的：语言特定神经元被优先移除，而语言通用神经元主要保留在骨干中。这些结果表明知识可以在共享核心和可交换的外部记忆之间重新分配。

英文摘要

LLMs encode both general capabilities and domain-specific knowledge in a single set of parameters. We ask whether this capacity can be reorganized: keeping broadly useful computation in a shared backbone, while moving specialized knowledge into external memory modules. We propose \emph{knowledge offloading} (KOFF), a framework for decomposing a pretrained LLM into a sparse shared backbone and domain-specific memories. Starting from a frozen base model, we jointly learn a structured pruning mask and lightweight recovery modules, implemented as LoRA adapters and learned key-value caches. Across Llama and Qwen models from 3B to 8B, we find that non-trivial capacity can be moved out of the shared backbone without a large loss in model ability. At around 12\% global sparsity, KOFF preserves much of the unpruned model's performance, while pruning the same frozen model without memories degrades sharply. Ablations show that LoRA and learned KV memories are complementary, and specialization analyses suggest that the learned decomposition is meaningful: language-specific neurons are preferentially removed while language-general neurons largely remain in the backbone. These results suggest that knowledge can be reallocated between a shared core and swappable external memories.

URL PDF HTML ☆

赞 0 踩 0

2605.29068 2026-05-29 cs.AI cs.CL cs.CR cs.LG 版本更新

Robust and Efficient Guardrails with Latent Reasoning

具有潜在推理的鲁棒高效防护栏

Siddharth Sai, Xiaofei Wen, Muhao Chen

发表机构 * University of California, Davis（加州大学戴维斯分校）

AI总结提出COLAGUARD模型，通过阶段式训练将多步安全推理转移到连续潜在空间，在保持高安全性能的同时实现12.9倍加速和22.4倍令牌减少。

详情

AI中文摘要

随着大型语言模型（LLMs）在现实应用中的日益部署，维护其安全性至关重要。现有的安全防护栏通常依赖单次分类或更近期的蒸馏推理。基于推理的防护栏显著优于仅分类的基线，但会带来大量的查询延迟和令牌开销，使其不适用于高吞吐量部署。为了解决这一挑战，我们提出了COLAGUARD，一种通过阶段式训练课程将多步安全推理转移到连续潜在空间的防护栏模型，从而在推理时实现直接的隐藏状态传播。在涵盖八个安全基准的十个提示和响应审核设置上评估，COLAGUARD在宏观F1上比Llama Guard 3提高了8.24分，并与我们的显式推理基线GuardReasoner在宏观F1上相当，同时实现了12.9倍的加速和22.4倍的令牌使用减少。我们的结果表明，潜在推理为可部署的防护栏提供了一种实用的替代方案，以替代显式理由生成，共同提高安全鲁棒性和推理效率，而不是将它们视为竞争目标。

英文摘要

Maintaining the safety of large language models (LLMs) is crucial as they are increasingly deployed in real-world applications. Existing safety guardrails typically rely on single-pass classification or, more recently, distilled reasoning. Reasoning-based guardrails significantly outperform classification-only baselines, but they incur substantial query latency and token overhead that make them impractical for highthroughput deployment. To address this challenge, we propose COLAGUARD, a guardrail model that transfers multi-step safety reasoning into a continuous latent space through a stage-wise training curriculum, enabling direct hidden-state propagation at inference. Evaluated on ten prompt- and response-moderation settings spanning eight safety benchmarks, COLAGUARD improves macro-F1 by 8.24 points over Llama Guard 3 and matches our explicit reasoning baseline, GuardReasoner, in macroF1 while delivering a 12.9X speedup and 22.4X reduction in token usage. Our results suggest that latent reasoning offers a practical alternative to explicit rationale generation for deployable guardrails, jointly improving safety robustness and inference efficiency rather than treating them as competing objectives.

URL PDF HTML ☆

赞 0 踩 0

2605.29042 2026-05-29 cs.AI cs.LG 版本更新

Differentiable Belief-based Opponent Shaping

基于可微信念的对手塑造

Aarav G Sane, Karthik Sivachandran, Rohan Paleja

发表机构 * Department of Computer Science（计算机科学系）

AI总结提出D-BOS方法，通过可微的信念更新和梯度传播，在隐藏角色游戏中实现对手信念的塑造，从而自然涌现最优策略。

详情

AI中文摘要

人类协调往往依赖于通过战略行动影响他人信念的能力。在多智能体强化学习中，对手塑造试图复制这种影响，尽管现有方法通常作用于对手的参数、策略或价值空间。同时，隐藏角色游戏中的信念操纵技术通常依赖于硬编码的目标，如欺骗或信念饱和。我们提出基于可微信念的对手塑造（D-BOS），一种一阶方法，将每个观察者的信念视为被塑造的对手状态，并通过$k$步softmax-贝叶斯信念动力学进行微分。我们的方法不显式奖励欺骗或合作行为，而是将信念状态作为塑造目标。这使得最优策略能够从环境奖励结构中自然涌现。这种信念空间公式通过微分对手信念更新提供对手塑造信号，并通过聚合多个观察者个体推断信念轨迹上的梯度，自然地扩展到多个观察者。实验上，D-BOS在隐藏角色游戏中优于PPO和BBM，在混合动机设置中提升最大。

英文摘要

Human coordination often relies on the ability to influence the beliefs of others through strategic action. In multi-agent reinforcement learning, opponent shaping attempts to replicate this influence, though existing methods typically operate within an opponent's parameter, policy, or value space. Meanwhile, belief-manipulation techniques in hidden-role games often rely on hard-coded objectives, such as deception or belief saturation. We propose Differentiable Belief-based Opponent Shaping (D-BOS), a first-order method that treats each observer's belief as the shaped opponent state and differentiates through $k$-step softmax-Bayes belief dynamics. Rather than explicitly rewarding deceptive or cooperative behavior, our method treats the belief state as the target for shaping. This allows the optimal strategy to emerge naturally from the environment's reward structure. This belief-space formulation provides an opponent-shaping signal by differentiating through opponent belief updates, and naturally extends to multiple observers by aggregating gradients over their individual inferred belief trajectories. Empirically, D-BOS outperforms PPO and BBM in hidden-role games, with the largest gains in mixed-motive settings.

URL PDF HTML ☆

赞 0 踩 0

2605.29033 2026-05-29 cs.LG 版本更新

Moment Matching Q-Learning

矩匹配Q学习

Yiyan, Liang, Sifei Liu, Weitong Zhang

发表机构 * School of Data and Information Science, University of North Carolina at Chapel Hill, Chapel Hill, USA（数据与信息科学学院，北卡罗来纳大学教堂山分校，教堂山，美国）

AI总结提出矩匹配Q学习（MoMa QL）框架，利用最大均值差异（MMD）匹配原始分布与目标分布的所有阶统计量，实现条件得分函数的分布级收敛，在D4RL任务中计算效率高且性能相当，并在离线到在线强化学习中通过加速流策略的动作采样展现更优的适应性和性能。

Comments 23 pages, 14 figures, 10 tables, accepted by ICML 2026

详情

AI中文摘要

基于得分和流的生成模型在捕捉复杂分布方面表现出显著的表达能力，并已广泛应用于从图像生成到强化学习的任务中。然而，这些模型存在推理延迟长的问题，这在具有迭代采样的强化学习中造成了显著的计算瓶颈。为了克服这一限制，我们提出了一个名为矩匹配Q学习（MoMa QL）的新框架，该框架利用统计假设检验中的最大均值差异（MMD）技术，旨在匹配原始分布和目标分布之间的所有阶统计量。通过对所有矩统计量施加强正则化，该算法保证了条件得分函数的分布级收敛，并在各种超参数下保持稳定。实验表明，我们的方法MoMa QL在各种D4RL任务中计算效率更高，且性能相当甚至具有竞争力。值得注意的是，通过加速基于流的策略的动作采样过程，MoMa QL在离线到在线强化学习任务中表现出更优的性能，因为其在线交互微调更快且适应性更强。

英文摘要

Score-based and flow-based generative models exhibit remarkable expressive capacity in capturing complex distributions, and have been extensively deployed in tasks ranging from image generation to reinforcement learning. Nevertheless, these models suffer from prolonged inference latency, which imposes a significant computational bottleneck in RL with iterative sampling. To overcome this limitation, we propose a new framework named Moment Matching Q-Learning (MoMa QL), which utilizes a technique from statistical hypothesis testing known as maximum mean discrepancy (MMD) that intend to match all orders of statistics between the original and target distribution. By enforcing strong regularization on all moment statistics, this algorithm guarantees distribution-level convergence for conditional score function and remains stable under various hyperparameters. Empirically, we show that our method MoMa QL is more computationally efficient with a comparable if not competitive performance in various D4RL tasks. Remarkably, by accelerating the action sampling process for flow-based policies, MoMa QL demonstrates superior performance in offline-to-online RL tasks because of faster and stronger adaptability for online interactive finetuning.

URL PDF HTML ☆

赞 0 踩 0

2605.29032 2026-05-29 cs.LG stat.ML 版本更新

用于宇宙学21厘米光锥模拟的三维条件扩散模型

Bin Xia, John H. Wise

发表机构 * Center for Relativistic Astrophysics, School of Physics, Georgia Institute of Technology, Atlanta, GA 30332, USA（相对论天体物理中心，物理学院，佐治亚理工学院，亚特兰大，GA 30332，USA）

AI总结针对三维21厘米光锥模拟的困难，通过对比预处理、动态范围压缩、架构深度和训练时长等配置，发现Yeo-Johnson预处理结合中等幅度压缩在全局信号的标准化平均绝对误差上表现最优，但视觉上合理的样本仍存在统计偏差。

详情

AI中文摘要

我们研究了用于三维21厘米光锥模拟的条件扩散模型，重点关注天空平面大小为$64\times64$、视线深度达1024个像素的立方体。与早期的二维研究相比，三维设置更加困难，因为内存限制导致微批次非常小，而底层体素分布高度偏斜且长尾。我们通过使用$25{,}600$个训练光锥和固定参数点的验证集成，对预处理选择、动态范围压缩设置、架构深度和训练时长进行了控制比较。在验证中，每个参考参数点包含800个具有独立初始条件的21cmFAST实现，并且每个模型和每个参考集使用800个样本进行报告的集成比较。我们通过图像和摘要统计空间中的互补诊断评估生成的光锥：亮温度切片、全局信号、功率谱和简化散射系数。在测试的配置中，预处理是控制稳定训练和最终物理保真度的主导因素。在此探索的配置中，Yeo-Johnson预处理结合中等幅度压缩给出了最一致的有利权衡，最强的定量支持来自基于全局信号的标准差归一化平均绝对误差（$\mathrm{MAE}_{\rm std}$）的排名，并且在互补诊断中表现出定性一致的行为。同时，视觉上合理的三维样本在两点和高阶统计中仍然保留可测量的偏差。因此，我们将当前工作视为三维21厘米模拟以及未来纳入更真实观测效应的研究的一个模拟级基线。

英文摘要

We investigate conditional diffusion modeling for three-dimensional 21 cm lightcone emulation, focusing on cubes with a sky-plane size of $64\times64$ and a line-of-sight depth up to 1024 cells. Relative to earlier 2D studies, the 3D setting is substantially harder because memory limits enforce very small micro-batches while the underlying voxel distribution is highly skewed and long tailed. We perform controlled comparisons across preprocessing choices, dynamic-range compression settings, architecture depth, and training duration using $25{,}600$ training lightcones and validation ensembles at fixed parameter points. For validation, each reference parameter point contains 800 21cmFAST realizations with independent initial conditions, and we use 800 samples per model and per reference set for the reported ensemble comparisons. We evaluate generated lightcones with complementary diagnostics in both image and summary-statistic spaces: brightness-temperature slices, the global signal, the power spectrum, and reduced scattering coefficients. Across the tested configurations, preprocessing is the dominant factor governing stable training and the resulting physical fidelity. Among the configurations explored here, Yeo-Johnson preprocessing combined with moderate amplitude compression gives the most consistently favorable trade-off, with the strongest quantitative support coming from rankings based on the standard-deviation-normalized mean absolute error ($\mathrm{MAE}_{\rm std}$) of the global signal and qualitatively compatible behavior in the complementary diagnostics. At the same time, visually plausible 3D samples still retain measurable biases in two-point and higher-order statistics. We therefore view the present work as a simulation-level baseline for three-dimensional 21 cm emulation and for future studies that incorporate more realistic observational effects.

URL PDF HTML ☆

赞 0 踩 0

2605.29009 2026-05-29 cs.LG cs.AI 版本更新

Label-Free Reinforcement Learning via Cross-Model Entropy

无标签强化学习：跨模型熵方法

Matt Gorbett, Hossein Shirazi

发表机构 * Independent Researcher（独立研究者）； San Diego State University（圣地亚哥州立大学）

AI总结提出跨模型熵（CME）作为无标签奖励信号，用于强化学习后训练大语言模型，在开放指令遵循任务上优于基线方法。

详情

AI中文摘要

使用强化学习后训练大语言模型受限于奖励信号。现有方法需要真实可验证的奖励（限制于自动正确性检查领域，如数学、代码执行）或人类偏好标签（收集成本高且易受奖励攻击）。最近的无标签方法用自参考信号（如多数投票或模型自身输出的token熵）替代真实验证器，但可能强化模型自身错误。本文提出跨模型熵（CME），即生成器响应在独立验证器模型下的平均对数似然，作为无标签奖励信号用于强化学习后训练。CME是连续的、无需训练，基于验证器认为不意外的响应可能正确或高质量的准则。由于验证器独立于生成器，该信号无法通过自一致性被操纵。我们将CME集成到GRPO中，不改变训练循环的其他部分，将无标签强化学习扩展到开放指令遵循——自参考信号不适用或不适配的场景。在开放指令遵循（UltraFeedback提示，在AlpacaEval 2.0上评估）上，CME奖励在四个模型家族（Qwen、Llama、Gemma、OLMo）和三种训练范式（预训练、SFT和指令微调）的头对头LLM-as-Judge比较中击败未训练基线，调整平局后的胜率从52.5%到71.4%。代码将在发表后发布。

英文摘要

Post-training large language models with reinforcement learning is bottlenecked by the reward signal. Existing approaches require either ground-truth verifiable rewards, restricting training to domains with automatic correctness checks (e.g., mathematics, code execution), or human preference labels, which are expensive to collect and prone to reward hacking. Recent label-free methods replace ground-truth verifiers with self-referential signals like majority voting or token entropy over a model's own outputs, but risk reinforcing a model's own errors. In this work we propose Cross-Model Entropy (CME), the mean log-likelihood of a generator's response under a separate verifier model, as a label-free reward signal for RL post-training. CME is continuous, training-free, and grounded in the principle that responses a verifier finds unsurprising are likely correct or high quality. Because the verifier is independent of the generator, the signal cannot be gamed through self-consistency. We integrate CME into GRPO with no other changes to the training loop, extending label-free RL to open-ended instruction following -- a regime where self-referential signals are inapplicable or poorly suited. On open-ended instruction following (UltraFeedback prompts, evaluated on AlpacaEval 2.0), CME rewards beat the untrained base in head-to-head LLM-as-Judge comparisons across four model families (Qwen, Llama, Gemma, OLMo) and three training regimes (pretrained, SFT, and instruction-tuned), with tie-adjusted win rates ranging from 52.5% to 71.4%. Code will be released upon publication.

URL PDF HTML ☆

赞 0 踩 0

2605.29008 2026-05-29 cs.LG 版本更新

Causal Intelligence for Constraint-Aware Intervention Design to Induce State Transitions

因果智能：面向状态转换的约束感知干预设计

Zixuan Song, Uwe Mueller, Dimitris V. Manatakis

发表机构 * MRL, Merck & Co., Inc.（MRL，默克公司）

AI总结提出COAST方法，通过因果图学习和约束感知多目标优化，从数据中设计干预策略以实现系统状态转换。

详情

AI中文摘要

通过有针对性的干预将系统从一个状态驱动到另一个状态是科学中的一个基本挑战，然而大多数预测模型提供的机制洞察有限，且缺乏原则性的决策框架。本文提出COAST（状态转换的因果最优行动），一种用于计算机设计约束干预的因果智能方法，该干预诱导用户定义的状态转换。给定表征源状态和目标状态的数据，COAST学习上下文特定的因果图和结构因果模型，将观测到的分布变化归因于机制层面的因果驱动因素，并引入一种新颖的约束感知多目标优化公式，平衡转换效果、干预复杂性和目标状态稳定性。该方法模块化且领域无关，通过可互换的组件整合特征选择、因果发现、因果建模以及干预识别和评估。在合成基准和真实生物数据集上，COAST恢复了关键的因果驱动因素，并识别出实现期望状态转换的稳健的单目标和多目标干预策略，同时提供透明的机制解释以指导实验验证。

英文摘要

Driving a system from one state to another through targeted interventions is a fundamental challenge in science, yet most predictive models offer limited mechanistic insight and no principled framework for decision-making. Here we present COAST (Causally Optimal Actions for State Transitions), a causal-intelligence approach for the in-silico design of constrained interventions that induce user-defined state transitions. Given data characterizing source and target states, COAST learns context-specific causal graphs and structural causal models, attributes observed distributional shifts to mechanism-level causal drivers, and introduces a novel constraint-aware multi-objective optimization formulation that balances transition efficacy, intervention complexity, and target-state stability. The approach is modular and domain-agnostic, integrating feature selection, causal discovery, causal modeling, and intervention identification and evaluation through interchangeable components. Across synthetic benchmarks and real biological datasets, COAST recovers key causal drivers and identifies robust single- and multi-target intervention strategies that achieve desired state transitions, accompanied by transparent mechanistic rationales to guide experimental validation.

URL PDF HTML ☆

赞 0 踩 0

2605.29005 2026-05-29 cs.LG cs.AI 版本更新

LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers

LoRe: 基于每步交互预算的自适应交互评估路由用于迭代图求解器

Jintao Li, Yong-Yi Wang, Zheng-An Wang, Heng Fan

发表机构 * Beijing Key Laboratory of Fault-Tolerant Quantum Computing, Beijing Academy of Quantum Information Sciences, Beijing, China（北京容错量子计算重点实验室，量子信息科学北京市院，北京，中国）； Beijing National Laboratory for Condensed Matter Physics, Institute of Physics, CAS, Beijing, China（北京凝聚态物理国家实验室，物理研究所，中国科学院，北京，中国）； Beijing Key Laboratory of Advanced Quantum Technology, Beijing, China（北京先进量子技术重点实验室，北京，中国）； Hefei National Laboratory, Hefei, China（合肥国家实验室，合肥，中国）

AI总结提出LoRe方法，通过动态路由计算到高冲突或高不确定性交互，实现每步固定比例交互评估，在不牺牲解质量的前提下显著提升迭代图求解器的可扩展性和速度。

Comments Accepted at ICML 2026

详情

AI中文摘要

基于扩散的组合优化神经求解器反复重新评估密集的边/因子交互，导致推理时间昂贵且在大规模下常受内存限制。受多体物理计算方法的启发，我们引入LoRe，一种无需训练、推理时即插即用的包装器，强制执行每步交互评估预算：在每次迭代中，它通过动态路由计算到高冲突或高不确定性交互，仅评估固定比例的交互，而不是使用固定的稀疏化（例如静态kNN图或静态掩码）。在完全包含的端到端挂钟时间核算下，LoRe显著提高了最大独立集（MIS）问题的可扩展性，将可行推理扩展到基线内存溢出限制的3倍以上，实现了约8倍的加速和约12倍的峰值内存减少，同时在此范围内保持解质量。在大规模旅行商问题（TSP）上展示了跨任务通用性，并对拓扑变化具有零样本鲁棒性，LoRe在n=1000时实现了约15倍的加速，内存减少44倍，且巡回质量具有竞争力。

英文摘要

Diffusion-based neural solvers for combinatorial optimization repeatedly re-evaluate dense edge/factor interactions, making inference expensive in wall-clock time and often memory-bound at scale. Inspired by the computational methodologies of many-body physics, we introduce LoRe, a training-free, inference-time drop-in wrapper that enforces per-step interaction-evaluation budgeting: at each iteration, it evaluates only a fixed fraction of interactions by dynamically routing computation to high-conflict or high-uncertainty interactions, instead of using a fixed sparsification (e.g., static kNN graphs or static masks). Under fully inclusive end-to-end wall-clock accounting, LoRe substantially improves scalability on the Maximum Independent Set (MIS) problem, extending feasible inference more than $3\times$ beyond the baseline's out-of-memory limit, delivering a $\sim 8\times$ speedup and a $\sim 12\times$ peak-memory reduction, with solution quality preserved in this regime. Demonstrating cross-task generality on the large-scale Traveling Salesperson Problem (TSP) and zero-shot robustness to topology shifts, LoRe achieves a $\sim 15\times$ speedup at $n=1000$ with a $44\times$ memory reduction and competitive tour quality.

URL PDF HTML ☆

赞 0 踩 0

2605.29002 2026-05-29 cs.LG cs.DC 版本更新

通过孪生自监督学习从fMRI中学习鲁棒且任务不变的功能表示

Jiyao Wang, Peiyu Duan, Nicha C. Dvornek, Lawrence H. Staib, Denis Sukhodolsky, Pamela Ventola, James S. Duncan

发表机构 * organization= Department of Biomedical Engineering , addressline= Yale University , city= New Haven , state= CT , country= USA ； organization= Radiology \& Biomedical Imaging , addressline= Yale School of Medicine , city= New Haven , state= CT , country= USA ； organization= Electrical Engineering , addressline= Yale University , city= New Haven , state= CT , country= USA ； organization= Child Study Center , addressline= Yale School of Medicine , city= New Haven , state= CT , country= USA

AI总结提出轻量级自监督框架BrainSimSiam，利用正样本对学习鲁棒且通用的fMRI表示，在多个下游任务中超越全监督基线，接近大规模模型性能。

详情

AI中文摘要

功能磁共振成像（fMRI）是研究人脑功能的强大工具。然而，数据采集的高成本和精神病学评定量表固有的主观性常常导致数据集样本量小且标签质量可变，特别是在针对特定神经疾病时。结合fMRI数据固有的高维性，这些限制显著增加了模型过拟合的风险。近年来，通过组合多个数据集开发fMRI基础模型的兴趣日益增长；然而，预训练和微调所需的计算资源往往令人望而却步。我们展示了一个轻量级自监督框架能够产生跨多种下游任务泛化的表示，超越全监督基线，并接近大规模模型的性能。我们引入了BrainSimSiam，一种数据高效的自监督表示学习框架，利用仅正样本对来学习鲁棒且可泛化的特征。我们证明了所学表示在多个下游分类和回归任务中取得了强劲性能，突显了BrainSimSiam在数据有限的神经影像应用中的潜力。

英文摘要

Functional magnetic resonance imaging (fMRI) is a powerful tool for investigating human brain function. However, the high cost of data acquisition and the inherent subjectivity of psychiatric rating scales often lead to datasets with small sample sizes and variable label quality, especially when targeting a specific neurological condition. Combined with the inherently high dimensionality of fMRI data, these limitations substantially increase the risk of model overfitting. Recent years have seen growing interest in developing fMRI foundation models by combining multiple datasets; however, the computational resources needed for pretraining and fine-tuning are often prohibitive. We show that a lightweight self-supervised framework yields representations that generalize across diverse downstream tasks, outperforming fully supervised baselines and approaching the performance of large-scale models. We introduce BrainSimSiam, a data-efficient self-supervised representation learning framework that leverages positive-only data pairs to learn robust and generalizable features. We demonstrate that the learned representations achieve strong performance across multiple downstream classification and regression tasks, highlighting the potential of BrainSimSiam for data-limited neuroimaging applications.

URL PDF HTML ☆

赞 0 踩 0

2605.28983 2026-05-29 cs.LG cs.AI math.DS math.RT physics.comp-ph 版本更新

The Hamilton-Jacobi Theory of Deep Learning

深度学习的哈密顿-雅可比理论

Jose Marie Antonio Miñoza, Erika Fille T. Legara, Christopher P. Monterola

发表机构 * Center for AI Research PH（人工智能研究所以PH）； Asian Institute of Management（亚洲管理学院）

AI总结本文通过将神经网络训练精确识别为哈密顿-雅可比初值问题的搜索，建立了深度学习与粘性哈密顿-雅可比方程之间的严格对应关系，并统一了残差网络、Transformer、RNN等架构，导出了最优泛化率、对抗鲁棒性等定量结果。

详情

AI中文摘要

在本文中，神经网络训练被精确地识别为通过哈密顿-雅可比初值问题的搜索：每个梯度步选择粘性哈密顿-雅可比方程的初始数据，其Hopf-Cole传播子最佳拟合观测值；在推理时，输入是评估该解的空间点，初始条件已编码在权重中。这种对应对于log-sum-exp层是精确的，对于更广泛的架构（残差网络、Transformer和循环架构（RNN、LSTM、SSM））是结构性的，它们离散化同一类哈密顿-雅可比方程，具有依赖于架构的哈密顿量和粘性。一个单一的变形参数ε在交换图中统一了所有四个视角（网络、热带代数、粘性PDE、凸优化），并在Lipschitz条件下封闭。定量结果包括：固定t时的极小极大最优泛化率O(n^{-1/(d+2)})；由ε控制的对抗鲁棒性；残差网络的反向传播作为哈密顿系统的协态方程（庞特里亚金最大值原理）；通过PDE求积与数据内在维度一致的标度指数；以及闭式O(N)影响函数（softmax归因权重π_j），其熵景观随着ε增加经历折叠分岔，每个分岔合并归因盆地。

英文摘要

In this paper, training a neural network is identified, exactly, as a search through Hamilton--Jacobi initial-value problems: each gradient step selects the initial data of a viscous Hamilton--Jacobi equation whose Hopf--Cole propagator best fits the observations; at inference, the input is the spatial point at which that solution is evaluated and the initial condition is already encoded in the weights. The correspondence is exact for log-sum-exp layers and structural for broader architectures: residual networks, transformers, and recurrent architectures (RNNs, LSTMs, SSMs) each discretize the same class of Hamilton--Jacobi equations, with architecture-dependent Hamiltonian and viscosity. A single deformation parameter $\varepsilon$ unifies all four perspectives (network, tropical algebra, viscous PDE, convex optimization) in a commutative diagram closed under Lipschitz conditions. Quantitative consequences include: the minimax optimal generalization rate $O(n^{-1/(d+2)})$ for fixed $t$; adversarial robustness controlled by $\varepsilon$; backpropagation as the co-state equation of the Hamiltonian system for residual networks (Pontryagin Maximum Principle); scaling exponents consistent with data intrinsic dimension via PDE quadrature; and a closed-form $O(N)$ influence function (softmax attribution weights $π_j$) whose entropy landscape undergoes fold bifurcations as $\varepsilon$ increases, each merging attribution basins.

URL PDF HTML ☆

赞 0 踩 0

2605.28980 2026-05-29 math.OC cs.LG cs.NA eess.SP math.NA 版本更新

Manifold-based Algorithms for the Hadamard Decomposition

基于流形的Hadamard分解算法

Nicolas Gillis, Subhayan Saha, Stefano Sicilia, Arnaud Vandaele

发表机构 * Department of Mathematics and Operational Research, University of Mons（数学与运筹学系，蒙斯大学）； Gruppo Nazionale Calcolo Scientifico-Istituto Nazionale di Alta Matematica（科学计算组-国家高级数学研究所）

AI总结针对Hadamard分解问题，提出三种基于流形的新算法（包括Manopt、块投影梯度和无投影流形梯度下降），并设计新的初始化策略，在合成和真实数据上优于现有方法。

Comments 27 pages, code available from https://github.com/StefanoSicilia/Hadamard-Decomposition

详情

AI中文摘要

给定矩阵 $X$ 和两个秩 $r_1$ 和 $r_2$，Hadamard分解（HD）寻找两个低秩矩阵 $X_1$（秩 $r_1$）和 $X_2$（秩 $r_2$），它们与 $X$ 大小相同，使得 $X\approx X_1\circ X_2$，其中 $\circ$ 是Hadamard（逐元素）乘积。大多数情况下，HD比标准低秩近似（如截断奇异值分解（TSVD））更具表现力，因为它可以用相同数量的参数表示更高秩的矩阵；这是因为 $X_1 \circ X_2$ 的秩通常等于 $r_1 r_2$。本文首先给出HD的一些理论见解，特别是一个有用的重写形式 $X\approx WH^\top$，其中 $W$ 和 $H$ 有 $r_1 r_2$ 列并属于某些流形。这使我们能够开发三种计算HD的新算法。第一种使用表示 $X\approx X_1\circ X_2$ 并依赖于Manopt工具箱。另外两种依赖于重写形式 $X\approx WH^\top$：一种是块投影梯度方法，另一种是基于流形的梯度下降算法，不需要投影到可行集。最后两种算法特别适用于处理大规模稀疏数据。我们还提出了新的初始化策略，以提高HD的精度。我们将我们的算法和初始化策略与TSVD及现有技术进行了比较。数值结果表明，新方法在合成和真实数据上高效且具有竞争力。

英文摘要

Given a matrix $X$, and two ranks $r_1$ and $r_2$, the Hadamard decomposition (HD) looks for two low-rank matrices, $X_1$ of rank $r_1$ and $X_2$ of rank $r_2$, both of the same size as $X$, such that $X\approx X_1\circ X_2$, where $\circ$ is the Hadamard (element-wise) product. In most cases, HD is more expressive than standard low-rank approximations such as the truncated singular value decomposition (TSVD), as it can represent higher-rank matrices with the same number of parameters; this is because the rank of $X_1 \circ X_2$ is generically equal to $r_1 r_2$. In this paper, we first present some theoretical insights for HD, in particular a useful reformulation $X\approx WH^\top$ where $W$ and $H$ have $r_1 r_2$ columns and belong to certain manifolds. These allow us to develop three new algorithms for computing HD. The first one uses the representation $X\approx X_1\circ X_2$ and relies on the Manopt toolbox. The other two rely on the reformulation $X\approx WH^\top$: one is a block projected gradient method, and the other is a manifold-based gradient descent algorithm that does not require projection onto the feasible set. The last two algorithms are particularly effective for handling large sparse data. We also propose new initializations that allow us to improve the accuracy of the HD. We compare our algorithms and initialization strategies with the TSVD and with the state of the art. Numerical results show that the new methods are efficient and competitive on both synthetic and real data.

URL PDF HTML ☆

赞 0 踩 0

2605.28977 2026-05-29 cs.LG cs.AI 版本更新

Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detection

比较事后可解释AI方法用于解释抑郁症检测中的黑盒脑电图模型

Antonia Šarčević, Nikolina Frid

发表机构 * University of Zagreb Faculty of Electrical Engineering and Computing（Zagreb大学电子工程与计算学院）

AI总结本研究通过多种事后可解释性方法（如DeepSHAP、集成梯度、GradCAM、遮挡和置换特征重要性）分析InceptionTime架构在脑电图抑郁症检测中的决策过程，发现不同方法在额叶、颞叶和后部脑区（尤其是右半球）的归因模式部分收敛，但方法间存在差异，强调了事后可解释性的有用性和局限性。

详情

AI中文摘要

深度学习的最新进展使得基于脑电图的重度抑郁症分类越来越准确，但高容量模型的决策过程仍然难以解释。本研究调查了应用于训练用于基于脑电图的重度抑郁症检测的InceptionTime架构的多种事后可解释性方法。分析包括基于Shapley、基于梯度和基于扰动的归因方法：DeepSHAP、集成梯度、GradCAM、遮挡和置换特征重要性。在受试者级别的分层5折交叉验证框架内，通过跨脑电图片段和受试者的全局归因聚合进行可解释性分析。评估的方法揭示了部分收敛的归因模式，其中额叶、颞叶和后部脑区（尤其是右半球）反复受到关注。定量比较表明，基于梯度和基于扰动的方法之间具有实质性一致性，而DeepSHAP产生了相对独特的归因分布。同时，可解释性方法之间的差异凸显了方法假设对所得解释的影响。总体而言，结果表明，不同的事后可解释性方法捕捉了基于脑电图的深度学习模型在抑郁症检测中的部分重叠的相关性结构。尽管观察到的归因模式与先前几项关于重度抑郁症的脑电图研究大致一致，但该分析应被视为探索性的，而非确凿的神经生理学生物标志物或临床适用性的证据。该研究强调了事后可解释性在解释精神病学应用中的黑盒脑电图分类器方面的有用性和局限性。

英文摘要

Recent advances in deep learning have enabled increasingly accurate electroencephalography (EEG)-based classification of Major Depressive Disorder (MDD), but the decision-making processes of high-capacity models remain difficult to interpret. This study investigates multiple post-hoc explainability methods applied to an InceptionTime architecture trained for EEG-based MDD detection. The analysis includes Shapley-based, gradient-based, and perturbation-based attribution approaches: DeepSHAP, Integrated Gradients, GradCAM, Occlusion, and Permutation Feature Importance. Explainability analysis was performed within a subject-level stratified 5-fold cross-validation framework using global attribution aggregation across EEG segments and subjects. The evaluated methods revealed partially convergent attribution patterns, with recurring emphasis on frontal, temporal, and posterior EEG regions, particularly in the right hemisphere. Quantitative comparison demonstrated substantial agreement between gradient- and perturbation-based approaches, while DeepSHAP produced comparatively distinct attribution distributions. At the same time, variability between explainability methods highlighted the influence of methodological assumptions on the resulting explanations. Overall, the results suggest that different post-hoc explainability approaches capture partially overlapping relevance structures in EEG-based deep learning models for depression detection. Although the observed attribution patterns are broadly consistent with several previous EEG studies of MDD, the analysis should be interpreted as exploratory rather than evidence of definitive neurophysiological biomarkers or clinical applicability. The study highlights both the usefulness and limitations of post-hoc explainability for interpreting black-box EEG classifiers in psychiatric applications.

URL PDF HTML ☆

赞 0 踩 0

2605.28975 2026-05-29 cs.LG 版本更新

A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio

基于对数对齐比率的训练时泛化诊断

Ali Shehper, Ashish Vaswani

发表机构 * Essential AI

AI总结提出对数对齐比率（LAR）作为参数-激活对齐的度量，通过捕捉训练中权重谱与激活谱的扩散来跟踪记忆与泛化的转换，并在grokking和语言模型预训练中预测泛化差距。

Comments 32 pages, 25 figures

详情

AI中文摘要

我们研究了对数对齐比率（LAR），这是参数化理论中引入的一种参数-激活对齐度量。我们将其重新表述为矩阵归一化奇异值平方的权重谱$p$与输入在其奇异方向上投影的归一化平方的激活谱$q$之间的重叠。我们表明，通过捕捉训练过程中$p$和$q$的扩散，非嵌入LAR在两种不同设置下跟踪记忆与泛化之间的转换。在grokking中，LAR预测学习函数的有效维度：$k \approx n^{2(1-\text{LAR})}$，其中$n$是矩阵的输入维度。在3B参数语言模型预训练中，其与无过拟合基线的偏差跟踪泛化差距，并且其下降速率随着过拟合的接近而增加。LAR可从前向传播过程中可用的量计算，计算开销可忽略，且无需保留验证数据。

英文摘要

We study the log-alignment ratio (LAR), a measure of parameter-activation alignment, introduced in parameterization theory. We reformulate it as the overlap between a weight spectrum $p$ of the normalized squared singular values of a matrix and an activation spectrum $q$ of the normalized squared projections of inputs onto its singular directions. We show that unembedding LAR tracks the transition between memorization and generalization in two different settings by capturing the spread of $p$ and $q$ during training. In grokking, LAR predicts the effective dimension of the learned function: $k \approx n^{2(1-\text{LAR})}$, where $n$ is the input dimension of the matrix. In 3B-parameter language model pre-training, its deviation from a non-overfitting baseline tracks the generalization gap, and its rate of decline increases as overfitting approaches. LAR is computable from quantities available during the forward pass with negligible computational overhead, and requires no held-out validation data.

URL PDF HTML ☆

赞 0 踩 0

2605.28961 2026-05-29 stat.ML cs.LG math.OC 版本更新

Dynamics of Stochastic Momentum with Sparse Updates in High Dimensions

高维稀疏更新下随机动量的动力学

Katie Everett, Elliot Paquette

发表机构 * Google DeepMind & MIT（谷歌DeepMind及麻省理工学院）； McGill University & Mila（麦吉尔大学及MILA）

AI总结本文通过最小二乘和逻辑回归模型，理论分析了稀疏更新下动量的动力学，揭示了由动量保留时间尺度与学习时间尺度之比决定的相结构，并发现不同令牌稀疏度下的振荡动力学存在谱冲突。

详情

AI中文摘要

现有的动量理论假设梯度以大致恒定的速率到达每个参数，但这一假设在重尾数据分布和现代架构中常被违反。我们理论分析了稀疏更新下两种可处理动量模型的动力学：具有稀疏输入的最小二乘模型和具有稀有类别的逻辑回归模型。两者都给出了精确的闭式二阶矩动力学，我们针对稀疏性、批量大小和动量衰减的三个标度指数刻画了其高维极限。两个问题上的相结构由两个内在时间尺度之比决定：动量保留时间尺度（缓冲区存活的活动更新次数）和学习时间尺度（减少平方误差所需的活动更新次数）。当学习远慢于保留时，极限匹配SGD；当学习更快时，系统不稳定；当时间尺度相当时，我们恢复经典的重球动力学。振荡动力学发生在不同令牌稀疏度的不同动量值处，从而在全局动量上产生跨令牌频率的谱冲突。

英文摘要

Existing theory of momentum assumes that gradients arrive at every parameter at a roughly constant rate, an assumption violated in practice by heavy-tailed data distributions and modern architectures. We theoretically analyze the dynamics of two tractable models of momentum under sparse updates: a least squares model with sparse inputs and a logistic regression model with a rare class. Both admit exact closed-form second-moment dynamics whose high-dimensional limits we characterize across three scaling exponents for sparsity, batch size, and momentum decay. The phase structure on both problems is governed by the ratio of two intrinsic timescales: a momentum retention timescale (how many active updates the buffer survives) and a learning timescale (how many active updates it takes to reduce the squared error). When learning is much slower than retention, the limit matches SGD; when learning is faster, the system is unstable; where the timescales coincide, we recover classical heavy-ball dynamics. The oscillatory dynamics occur at different momentum values for different token sparsity, creating a spectral conflict for global momentum across token frequencies.

URL PDF HTML ☆

赞 0 踩 0

2605.28940 2026-05-29 hep-ph cs.LG hep-ex physics.data-an 版本更新

LoRA适配器的特征几何：微调语言模型中表示差异的稀疏自编码器分析

Prasanth K K

发表机构 * Independent AI Safety Researcher（独立人工智能安全研究员）

AI总结本研究使用稀疏自编码器分析LoRA微调引起的表示几何变化，发现LoRA特征字典与预训练特征存在弱几何对齐，且适配器特定SAE能更有效重建delta激活。

详情

AI中文摘要

低秩适配（LoRA）已成为适应大型语言模型的广泛采用方法，但LoRA微调引起的内部表示变化仍未被充分理解。在这项工作中，我们使用稀疏自编码器（SAE）研究LoRA诱导表示的几何结构。我们引入了一个delta激活框架，该框架隔离了适配器对残差流的特定贡献。使用Gemma-2-9B和LoRA秩4、8、16和32，我们在多个Transformer层上训练适配器特定的SAE，并将它们学习的特征空间与预训练的SAE字典进行比较。我们使用解码器方向之间的余弦相似度、特征子空间的主角分析以及激活表示之间的中心核对齐（CKA）来评估表示对齐。跨层和秩，我们一致观察到LoRA诱导的特征字典与预训练SAE特征之间的几何对齐相对较弱。适配器特定的SAE也比预训练SAE更有效地重建delta激活，这表明LoRA更新在残差流内占据了部分不同的表示结构。此外，特征密度随秩和深度增加，而几何差异在各秩之间保持相对稳定。这些发现提供了经验证据，表明LoRA微调可以诱导出预训练可解释性字典未完全捕获的特征结构，对微调语言模型的机制可解释性、适配分析和安全审计具有启示意义。

英文摘要

Low-Rank Adaptation (LoRA) has emerged as a widely adopted approach for adapting large language models, yet the internal representational changes induced by LoRA fine-tuning remain insufficiently understood. In this work, we investigate the geometry of LoRA-induced representations using Sparse Autoencoders (SAEs). We introduce a delta activation framework that isolates the adapter-specific contribution to the residual stream. Using Gemma-2-9B with LoRA ranks 4, 8, 16, and 32, we train adapter-specific SAEs across multiple transformer layers and compare their learned feature spaces with pretrained SAE dictionaries. We evaluate representational alignment using cosine similarity between decoder directions, principal-angle analysis of feature subspaces, and Centered Kernel Alignment (CKA) between activation representations. Across layers and ranks, we consistently observe comparatively weak geometric alignment between LoRA-induced feature dictionaries and pretrained SAE features. Adapter-specific SAEs also reconstruct delta activations more effectively than pretrained SAEs, suggesting that LoRA updates occupy partially distinct representational structure within the residual stream. Additionally, feature density increases with rank and depth, while geometric divergence remains relatively stable across ranks. These findings provide empirical evidence that LoRA fine-tuning can induce feature structures that are not fully captured by pretrained interpretability dictionaries, with implications for mechanistic interpretability, adaptation analysis, and safety auditing of fine-tuned language models.

URL PDF HTML ☆

赞 0 踩 0

2605.28890 2026-05-29 cs.CR cs.LG 版本更新

Echoes within the Reasoning: Stealthy and Effective Watermarking via Chain of Thought

推理中的回声：通过思维链实现隐蔽且有效的数字水印

Jiacheng Lu, Yiming Li, Tao Song, Weijian Wang, Wenjie Qu, Haibing Guan, Jiaheng Zhang

发表机构 * School of Computer Science, Shanghai Jiao Tong University, Shanghai, China（上海交通大学计算机科学学院）； Nanyang Technological University, Singapore（南洋理工大学）； National University of Singapore, Singapore（新加坡国立大学）

AI总结提出BiCoT框架，通过将水印嵌入推理轨迹的内部几何结构，并利用基于Top-logprob的黑盒验证器RSR，在不影响推理保真度的前提下实现鲁棒的水印检测。

Comments This paper is accepted by ICML2026

详情

预注册可检测效应：面向4位量化基准的配对MDE预算，附带一项试点审计

Zexin Zhuang, Yanhang Li, Zhichao Fan

发表机构 * Southern Methodist University（南方 Methodist 大学）； Northeastern University（东北ern 大学）； University of Illinois Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）

AI总结本文提出一种配对最小可检测效应（MDE）边界公式，用于量化基准的可靠性评估，并通过试点审计验证其有效性。

详情

AI中文摘要

这是一篇带有非配对试点审计的规划方法说明。我们将经典的配对二项样本量计算（Miettinen, 1968）应用于量化基准，给出了在配对项目数$m$和FP16-NF4不一致率$ρ_d$下的保守最小可检测效应（MDE）边界$δ^{*} \\\le (z_{1-α/2}+z_{1-β})\\\sqrt{ρ_d/m}$。该边界将“我的量化声明有多可靠？”转化为基准设计者在运行前可以承诺的一行预算。我们在四个模型和四个基准（$n=100$的$k=5$次分割）上展示了该边界，并添加了一项并行的MMLU提示模板研究，以将边界的量化噪声尺度与提示噪声尺度进行比较。假设$ρ_d=0.10$（一个未测量的规划值），所有观察到的NF4-FP16差异均低于隐含的MDE，且大多数跨分割标准差落在二项参考$\\sqrt{p(1-p)/n}$的$\\pm 1.5$个百分点内，因此在$n=100$子样本上报告为“基准不可靠性”的大部分方差是二项抽样噪声。唯一的边界单元格（OPT-WinoGrande，$|Δ|=3.2$个百分点）在$ρ_d=0.10$时低于隐含MDE，但在$ρ_d=0.05$时高于它，说明了该边界明确的规划权衡。在MMLU上，提示模板范围2-10个百分点达到或超过了最大的观察量化差异（3.2个百分点），因此未先固定提示模板的量化审计会将模板方差吸收到其噪声基底中。我们用一个五行预注册模板补充了该边界。

英文摘要

This is a planning-method note with an unpaired pilot audit. We adapt the classical paired-binary sample-size calculation (Miettinen, 1968) to quantization benchmarks, giving a conservative minimum detectable effect (MDE) bound $δ^{*} \le (z_{1-α/2}+z_{1-β})\sqrt{ρ_d/m}$ in the paired item count $m$ and the FP16-NF4 disagreement rate $ρ_d$. The bound turns "how reliable is my quantization claim?" into a one-line budget a benchmark designer can commit to before running. We illustrate the bound on four models and four benchmarks ($k=5$ splits of $n=100$), and add a parallel MMLU prompt-template study to put the bound's quantization-noise scale alongside the prompt-noise scale. Assuming $ρ_d=0.10$ (an unmeasured planning value), all observed NF4-FP16 deltas fall below the implied MDE, and most cross-split SDs lie within $\pm 1.5$ pp of the binomial reference $\sqrt{p(1-p)/n}$, so much of the variance reported as "benchmark unreliability" on $n=100$ subsamples is binomial sampling noise. The single borderline cell (OPT-WinoGrande, $|Δ|=3.2$ pp) is below the implied MDE at $ρ_d=0.10$ but above it at $ρ_d=0.05$, illustrating the planning trade-off the bound makes explicit. On MMLU, prompt-template ranges of 2-10 pp meet or exceed the largest observed quantization delta (3.2 pp), so a quantization audit that does not first fix the prompt template absorbs template variance into its noise floor. We complement the bound with a five-line pre-registration template.

URL PDF HTML ☆

赞 0 踩 0

2605.28870 2026-05-29 cs.LG cs.AI 版本更新

Representation Alignment Rests on Linear Structure

表示对齐依赖于线性结构

Kiril Bangachev, Guy Bresler, Yury Polyanskiy

发表机构 * Massachusetts Institute of Technology（麻省理工学院）

AI总结本文通过信号、偏差和噪声的三部分统计框架研究柏拉图表示假说，提出对齐源于对象与属性的线性关系，并通过稀疏自编码器提取线性特征、中心化和归一化减少偏差、以及数据稀缺导致噪声等证据支持该框架。

详情

AI中文摘要

我们通过表示的三部分统计框架研究柏拉图表示假说（PRH）：信号、偏差和噪声。{1) 信号：} 我们提出柏拉图对齐源于对象与属性之间的普遍关系，这种关系根据线性表示假说（LRH）在线性上编码。我们通过稀疏自编码器提取线性对象-属性特征，并展示这些稀疏表示通常比其稠密对应物表现出更强的跨模态对齐，从而提供证据表明LRH有助于解释PRH。{2) 偏差：} 由于使用的不同架构和训练过程，模型具有不同的隐式偏差。我们表明这种差异可以部分缓解。中心化和归一化一致地改善跨模型对齐。{3) 噪声：} 有限样本训练导致表示中的噪声。我们通过揭示词频与对齐之间在LLM和文本嵌入模型中的强且一致的正相关，提供证据表明表示噪声由数据稀缺驱动。综合信号、偏差和噪声，我们提出一个统计模型，该模型细化线性表示假说，并解释与现代AI架构中出现的表示对齐相关的进一步现象。

英文摘要

We investigate the Platonic Representation Hypothesis (PRH) through a tripartite statistical framework of representations: signal, bias, and noise. {1) Signal:} We propose that Platonic alignment arises from the universal relationship between objects and attributes, which is encoded linearly in representations according to the Linear Representation Hypothesis (LRH). We provide evidence that LRH helps explain PRH by extracting linear object-attribute features with sparse autoencoders and showing that these sparse representations often exhibit stronger cross-modal alignment than their dense counterparts. {2) Bias:} Models have different implicit biases due to the diverse architectures and training procedures used. We show that this difference can be partially mitigated. Centering and normalization consistently improve cross-model alignment. {3) Noise:} Finite-sample training leads to noise in representations. We provide evidence that representational noise is driven by data scarcity by revealing a strong and consistent positive correlation between word frequency and alignment in LLMs and text embedding models. Synthesizing signal, bias, and noise, we propose a statistical model that refines the Linear Representation Hypothesis and explains further phenomena related to the alignment of representations emerging from diverse modern AI architectures.

URL PDF HTML ☆

赞 0 踩 0

2605.28869 2026-05-29 cs.LG cs.AI 版本更新

Balancing Multimodal Learning through Label Space Reshaping

通过标签空间重塑平衡多模态学习

Xiaoyu Ma, Weijie Zhang, Yuanhao Gao, Han Miao, Yongjian Deng, Hao Chen

AI总结针对多模态学习中模态不平衡问题，提出基于标签空间重塑的BMLR方法，通过均衡各模态映射难度来提升多模态性能。

Comments In process

详情

AI中文摘要

多模态学习常受模态不平衡问题困扰，其中收敛较快的模态主导优化，而其他模态训练不足。现有方法通常通过加强弱模态或调整优化梯度来缓解此问题。然而，这些策略主要补偿优化速率差异，往往以牺牲强模态的优化能力为代价，而未从模态层面分析这些差异如何产生。基于理论洞察和实证观察，我们认为学习速度的差异源于模态特定特征空间与共享标签空间之间映射难度的不同。为解决此问题，我们提出了平衡多模态标签重塑（BMLR），这是首个从标签侧设计促进多模态平衡的方法。BMLR重塑跨模态标签空间以均衡各模态的映射难度，从而促进模态交互并为每个模态注入更丰富的类间信息。跨多种架构的大量实验表明，BMLR持续提升多模态性能，并与多种模型设计表现出强兼容性。源代码即将发布。

英文摘要

Multimodal learning often suffers from modality imbalance, where modalities that converge faster dominate optimization while others remain undertrained. Existing approaches typically mitigate this issue by strengthening the weak modality or adjusting optimization gradients. However, such strategies mainly compensate for optimization rate discrepancies, often at the expense of the strong modality's optimization capacity, without analyzing how these discrepancies arise at the modality level. Based on theoretical insights and empirical observations, we argue that the discrepancy of learning pace arises from differences in the mapping difficulty between modality-specific feature space and the shared label space. To address this issue, we propose Balanced Multimodal Label Reshaping (BMLR), the first method that promotes multimodal balance from the label-side design. BMLR reshapes the cross-modal label space to equalize mapping difficulty across modalities, thereby facilitating modality interaction and injecting richer inter-class information into each modality. Extensive experiments across multiple architectures demonstrate that BMLR consistently improves multimodal performance and exhibits strong compatibility with diverse model designs. The source code will be released soon.

URL PDF HTML ☆

赞 0 踩 0

2605.28868 2026-05-29 cs.LG cs.AI 版本更新

TaxDistill: Improving Metagenomic Taxonomic Annotation via Distilled Genomic Foundation Models

TaxDistill：通过蒸馏基因组基础模型改进宏基因组分类注释

Rongye Ye, Lun Li, Zheng Luo, Yiran Zhan, Shuhui Song

发表机构 * National Genomics Data Center, China National Center for Bioinformation（中国生物信息中心国家基因组数据中心）； Beijing Key Laboratory of Intelligent Governance and Application of Biological Big Data, China National Center for Bioinformation（北京生物大数据智能治理与应用重点实验室，中国生物信息中心）； Beijing Institute of Genomics, Chinese Academy of Sciences（北京基因组研究所，中国科学院）； University of Chinese Academy of Sciences（中国科学院大学）

AI总结提出TaxDistill知识蒸馏框架，利用500M参数的基因组基础模型GenomeOcean作为教师网络生成软标签，以减轻初始检索工具引入的标签噪声，从而提升宏基因组序列分类性能。

Comments The manuscript contains 14 pages, 7 figures, and 3 tables

详情

AI中文摘要

宏基因组分类注释旨在识别环境样本中DNA片段的微生物起源。依赖序列相似性的传统方法通常受到高微生物多样性和参考数据库不完整性的限制，这推动了诸如Taxometer等学习方法的发展，这些方法通过事后校正来学习更具信息量的宏基因组序列表示。然而，这些方法通常依赖于训练期间从相似性搜索工具获得的标签，这不可避免地引入了噪声，从而损害表示学习并降低分类性能。为了解决这个问题，我们提出了TaxDistill，一种用于宏基因组分类的知识蒸馏框架。我们引入GenomeOcean，一个500M参数的基因组基础模型，作为教师网络来提取深层语义特征并基于置信度生成软标签。通过将这些软标签信息蒸馏到轻量级学生网络中，TaxDistill有效减少了初始检索工具引入的标签噪声。在七个不同的CAMI2数据集上的全面实验表明，TaxDistill在大多数场景下优于现有基线。例如，在胃肠道数据集上，它将MMseqs2的F1分数从0.763提高到0.941，优于Taxometer基线。总体而言，TaxDistill为复杂宏基因组分析中的标签校正提供了一种可靠的方法。

英文摘要

Metagenomic taxonomic annotation aims to identify the microbial origins of DNA fragments in environmental samples. Traditional methods that rely on sequence similarity are often constrained by the high microbial diversity and the incompleteness of reference databases, which has motivated the development of learning approaches such as Taxometer that perform post hoc correction to learn more informative metagenomic sequence representations. However, these methods typically rely on labels derived from similarity search tools during training, which inevitably introduces noise that can impair representation learning and degrade classification performance. To address this issue, we propose TaxDistill, a knowledge distillation framework for metagenomic classification. We introduce GenomeOcean, a 500M parameter genomic foundation model, as the teacher network to extract deep semantic features and generate soft labels based on confidence. By distilling this soft label information into a lightweight student network, TaxDistill effectively reduces the label noise introduced by initial retrieval tools. Comprehensive experiments on seven diverse CAMI2 datasets demonstrate that TaxDistill outperforms existing baselines in most scenarios. For instance, on the Gastrointestinal dataset, it improves the F1 score of MMseqs2 from 0.763 to 0.941, outperforming the Taxometer baseline. Overall, TaxDistill provides a reliable method for label correction in complex metagenomic analysis.

URL PDF HTML ☆

赞 0 踩 0

2605.28867 2026-05-29 cs.LG cs.AI 版本更新

PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation

PrismFlow: 时间序列生成中流匹配的残差动力学

Junru Zhang, Lang Feng, Jinbo Wang, Xu Guo, Yucheng Wang, Han Yu, Min Wu, Yabo Dong, Duanqing Xu

发表机构 * Zhejiang University, China（浙江大学，中国）； Nanyang Technological University, Singapore（南洋理工大学，新加坡）； I2R, Agency for Science, Technology and Research (A*STAR), Singapore（科技研究局（A*STAR）新加坡研究所，新加坡）

AI总结提出PrismFlow方法，通过Koopman启发的动力学专家和置信度感知的胜者全得目标，在流匹配中学习残差修正，以解决标准流匹配中全局向量场估计器导致的频谱失真和模式覆盖不足问题，在时间序列生成中取得最优性能。

详情

可微PDE求解器的端到端PyTorch接口：一项RANS模型校正研究

Luca Saverio, Michele Alessandro Bucci, Gianmarco Farro, Cédric Content, Denis Sipp

发表机构 * Digital Sciences \& Technologies Department , Safran Tech , Magny-Les-Hameaux , 78114 , France ； MONHADE, équipe INRIA-ONERA, DSG , ONERA, Institut Polytechnique de Paris , Palaiseau , 91120 , France

AI总结提出一个端到端可微机器学习框架，通过将PDE作为隐层集成到PyTorch中，优化参数化校正项，用于数据同化和闭合建模，并在可压缩流RANS方程上验证。

详情

AI中文摘要

本工作提出了一种在完全可微的机器学习框架内求解偏微分方程约束反问题的端到端策略。所提出的公式提供了一种统一且用户友好的方法，适用于从数据同化到闭合建模的广泛问题。我们的方法结合了一个基线可微PDE求解器（从非线性系统$R(w) = 0$预测状态$w$）和一个通用的加性、参数化、可微校正$f_ϕ(w)$，其可训练参数为$ϕ$。我们展示了如何通过将PDE重新表述为隐层，将其集成到任意目标函数中，同时利用PyTorch的自动微分图，在完全可微的Python工作流中优化phi。该方法在可压缩流的雷诺平均纳维-斯托克斯方程上进行了演示，其中闭合项或其一部分使用可训练参数或神经网络建模。第一个应用考虑了二维NASA壁装驼峰测试案例，其中生产项参数针对时间平均LES数据进行了优化。第二个应用在VKI LS-59涡轮叶片上进行，其中通过优化可训练空间场重建了Spalart-Allmaras涡粘性场。使用可微BROADCAST求解器和Spalart-Allmaras湍流模型，从VKI LS-59涡轮叶片几何形状生成数据集。结果突出了该框架的灵活性，展示了其超越湍流建模，适用于更广泛的物理信息PDE约束问题（具有数据驱动组件）的适用性。

英文摘要

This work presents an end-to-end strategy for solving inverse problems constrained by Partial Differential Equations within a fully differentiable Machine Learning framework. The proposed formulation provides a unified and user-friendly methodology applicable to a wide range of problems, from data assimilation to closure modeling. Our approach combines a baseline differentiable PDE solver, which predicts the state w from the nonlinear system $R(w) = 0$, with a generic additive, parametrized, and differentiable correction $f_ϕ(w)$, with trainable parameters $ϕ$. We show how to optimize phi within a fully differentiable Python workflow by reformulating the PDE as an implicit layer, enabling its integration into arbitrary objective functions, while leveraging PyTorch's automatic differentiation graph. The method is demonstrated on the Reynolds-Averaged Navier-Stokes equations for compressible flows, where the closure term, or a portion of it, is modeled using trainable parameters or a Neural Network. The first application considers the 2D NASA Wall-Mounted Hump test case, where a production-term parameter is optimized against time-averaged LES data. A second application is carried out on the VKI LS-59 turbine blade, where the Spalart-Allmaras eddy viscosity field is reconstructed through the optimization of a trainable spatial field. A dataset is generated starting from the VKI LS-59 turbine blade geometry using the differentiable BROADCAST solver with the Spalart-Allmaras turbulence model. The results highlight the flexibility of the framework, showing its applicability beyond turbulence modeling to a broader class of physics-informed PDE-constrained problems with data-driven components.

URL PDF HTML ☆

赞 0 踩 0

2605.28854 2026-05-29 cs.CL cs.LG q-bio.NC 版本更新

Large language models reorganize representational geometry during in-context learning

大型语言模型在上下文学习中重组表征几何结构

Hua-Dong Xiong, Li Ji-An, Robert C. Wilson, Kwonjoon Lee, Xue-Xin Wei

发表机构 * School of Psychological and Brain Sciences, Georgia Tech（佐治亚理工学院心理与脑科学学院）； Department of Psychology, New York University（纽约大学心理学系）； Center of Excellence for Computational Cognition, Georgia Tech（佐治亚理工学院计算认知卓越中心）； Honda Research Institute（本田研究院）； Departments of Neuroscience and Psychology, The University of Texas at Austin（德克萨斯大学奥斯汀分校神经科学与心理学系）

AI总结研究大型语言模型在上下文学习中的表征几何重组，发现其性能与任务表征结构相关，并通过原型算法动态调整表征以提高可分性。

详情

AI中文摘要

大型语言模型（LLMs）表现出显著的灵活性：它们可以从上下文示例中适应新任务，而无需任何参数更新，这种能力被称为上下文学习（ICL）。先前关于合成任务的研究表明，ICL可以实现特定算法，展示了架构能力，并且机制分析已经识别出支持这种行为的关键回路。然而，由于上下文计算——无论其算法形式如何——依赖于高维表征空间中的变换，该空间的几何结构如何塑造ICL的有效性仍不清楚。受神经科学中将分类视为神经表征解缠的观点启发，我们假设ICL依赖于任务相关表征的成功在线解缠。为了验证这一想法，我们研究了LLMs如何对上下文示例进行分类，这些示例的标签由模型自身具有已知结构的内部表征定义。我们表明，ICL性能与底层分类任务的表征结构系统性相关，并且成功的ICL伴随着几何重组，增加了在线可分性。我们进一步发现，LLM的行为可以通过一种原型类算法很好地描述，该算法在重塑表征以支持分类的同时整合证据。这些发现为预训练LLMs中的ICL提供了几何解释，将表征几何结构确立为ICL的机制约束，并量化了预训练表征所能提供的与上下文学习所能利用之间的差距。

英文摘要

Large language models (LLMs) exhibit remarkable flexibility: they can adapt to novel tasks from in-context examples without any parameter updates, a capability known as in-context learning (ICL). Prior work on synthetic tasks has shown that ICL can implement specific algorithms, demonstrating architectural competence, and mechanistic analyses have identified key circuits that support this behavior. However, because in-context computation -- regardless of its algorithmic form -- relies on transformations in high-dimensional representation space, it remains unclear how the geometry of that space shapes ICL effectiveness. Motivated by the neuroscience view of classification as the untangling of neural representations, we hypothesize that ICL depends on the successful online untangling of task-relevant representations. To test this idea, we study how LLMs classify in-context examples whose labels are defined by the model's own internal representations with known structure. We show that ICL performance correlates systematically with the representational structure of the underlying classification task and that successful ICL is accompanied by geometric reorganization that increases online separability. We further find that LLM behavior is well described by a prototype-like algorithm that integrates evidence while reshaping representations to support classification. These findings offer a geometric account of ICL in pretrained LLMs, establish representational geometry as a mechanistic constraint on ICL, and quantify the gap between what pretrained representations afford and what in-context learning can exploit.

URL PDF HTML ☆

赞 0 踩 0

2605.28853 2026-05-29 q-fin.PM cs.LG 版本更新

Financially Guided Deep Portfolio Optimization

财务引导的深度投资组合优化

Rahul Fernandes, Travis Desell

发表机构 * Department of Software Engineering（软件工程系）； Rochester Institute of Technology（罗切斯特理工学院）

AI总结提出一个端到端框架，通过直接优化夏普比率、Omega比率、条件风险价值(CVaR)和风险平价等关键财务指标的微分代理，利用神经网络学习投资组合权重，在2007-2023年50只标普500股票上，最佳模型(AttentionLSTM结合Omega-CVaR-RiskParity损失)在2022-2023年样本外测试中实现年化夏普比率0.29和总复合收益+7.86%，超越标普500指数12.38个百分点。

详情

AI中文摘要

由于非平稳性、噪声数据和高交易成本，现实金融市场中的投资组合优化极其困难。标准的预测-然后优化方法首先预测收益，然后求解权重，这加剧了预测误差，并且常常在制度转换下失败。我们提出一个端到端框架，直接优化关键财务指标——夏普比率、Omega比率、条件风险价值(CVaR)和风险平价——的可微代理，使得神经网络能够通过反向传播学习投资组合权重。我们的扩展窗口滚动前向程序，应用于2007年至2023年的50只标普500股票，包含了现实的买卖价差成本，并每季度再平衡。在具有挑战性的样本外测试期（2022-2023年），最佳模型——使用Omega-CVaR-RiskParity损失的AttentionLSTM——实现了年化夏普比率0.29和总复合收益+7.86%，而标普500指数总收益为-4.52%，年化夏普比率为-0.02。这比标普500指数高出12.38个百分点（相对改进超过270%），同时保持尾部风险（CVaR）几乎不变。该框架持续优于等权重投资组合、标普500指数以及传统方法（MVP、HRP、NCO），表明将财务目标直接嵌入模型训练能够在不利市场条件下产生稳健、经济上有意义的超额收益。

英文摘要

Portfolio optimization in real-world financial markets is notoriously difficult due to non-stationarity, noisy data, and high transaction costs. Standard predict-then-optimize methods first forecast returns and then solve for weights, compounding prediction errors and often failing under regime shifts. We propose an end-to-end framework that directly optimizes differentiable surrogates of key financial metrics - Sharpe ratio, Omega ratio, Conditional Value-at-Risk (CVaR), and Risk Parity - allowing neural networks to learn portfolio weights via backpropagation. Our expanding-window walk-forward procedure, applied to 50 S&P 500 stocks from 2007 to 2023, incorporates realistic bid-ask spread costs and rebalances quarterly. On the challenging out-of-sample test period (2022-2023), the best model - an AttentionLSTM with the Omega-CVaR-RiskParity loss - achieves an annualized Sharpe of 0.29 and a total compounded return of +7.86%, while the S&P 500 delivers -4.52% total return and an annualized Sharpe of -0.02. This outperforms the S&P 500 by 12.38 percentage points (a relative improvement of over 270%), while keeping tail risk (CVaR) nearly unchanged. The framework consistently outperforms the equal-weight portfolio, S&P 500, and traditional methods (MVP, HRP, NCO), demonstrating that embedding financial objectives directly into model training yields robust, economically meaningful outperformance even in adverse market conditions.

URL PDF HTML ☆

赞 0 踩 0

2605.28851 2026-05-29 astro-ph.EP astro-ph.IM cs.LG physics.ao-ph 版本更新

Towards a Foundation Model for the Martian Atmosphere

火星大气基础模型

Sujit Roy, Udayshankar Nair, Yuling Wu, Georgios Priftis, Liping Wang, Anastasia Georgiou, Anne Jones, Björn Lütjens, Johannes Schmude, Campbell Watson, Rachel A. Slank, Ankur Kumar, Anirbit Mukherjee, Procheta Sen, Ramin Lolachi, Haonan Chen, Manil Maskey, Juan Bernabé-Moreno, Rahul Ramachandran

发表机构 * Earth System Science Center, University of Alabama in Huntsville（阿拉巴马大学亨茨维尔分校地球系统科学中心）； NASA Marshall Space Flight Center（美国宇航局马歇尔空间飞行中心）； Department of Electrical & Computer Engineering, Colorado State University（科罗拉多州立大学电气与计算机工程系）； Science and Technology Institute/Universities Space Research Association (USRA)（科学与技术研究所/大学空间研究协会）； Department of Computer Science, The University of Manchester（曼彻斯特大学计算机科学系）； School of Computer Science, University of Liverpool（利物浦大学计算机科学学院）； Center for Space Sciences and Technology, University of Maryland, Baltimore County（马里兰大学巴尔的摩分校空间科学与技术中心）； NASA Goddard Space Flight Center（美国宇航局戈达德空间飞行中心）； Center for Research and Exploration in Space Science and Technology, NASA/GSFC（空间科学与技术研究与探索中心，NASA/GSFC）； IBM Research（IBM研究院）

AI总结针对火星大气数据稀疏、计算成本高等挑战，本文探讨了构建数据驱动基础模型的设计空间，包括可用数据、物理模型、下游应用及AI方法。

详情

AI中文摘要

火星大气中存在从行星尺度沙尘暴到中尺度地形云和夜间低空急流等动力学现象。全球环流模型能够模拟这些现象，但在解析中尺度特征所需的分辨率下计算成本高昂。虽然卫星遥感观测的同化使得利用此类模型进行预报成为可能，但观测记录通常稀疏、短暂且分散在不同仪器代际之间。这些限制促使我们开发数据驱动的火星大气基础模型。基础模型处于复杂的设计空间中。可用数据、底层过程的物理特性以及人工智能的相应发展之间存在相互作用。尽管基础模型旨在以数据和计算高效的方式处理多个用例，但明确单个模型能够合理解决哪些应用至关重要。本文旨在阐明这一设计空间。我们讨论了从大气反演到再分析数据集以及现有物理模型的可用数据。此外，我们识别了广泛的候选下游应用。最后，我们考虑了在此背景下可以利用的人工智能（AI）相关最新进展。这里，我们特别关注用于大气物理的AI模型、数据驱动的数据同化方法以及在有限数据环境下工作的技术。

英文摘要

The martian atmosphere hosts dynamical phenomena ranging from planet-encircling dust storms to mesoscale orographic clouds and nocturnal low-level jets. General circulation model show capability to simulate these phenomena, but is computationally expensive at resolution needed to resolve mesoscale features. While assimilation of satellite remote sensing observation enable forecasting capabilities using such models, observation record is often sparse, short and fragmented across instrument generators. These constraints motivate the development of a data-driven foundation model for the Martian atmosphere. Foundation models live in a complex design landscape. There is an interplay between the available data, the physics of the underlying processes and corresponding developments in AI. Even though the idea of a foundation model is to address multiple use cases in a data- and compute-efficient manner, it is important to have a clear picture what applications can sensibly addressed by a single model. The purpose of this paper is to elucidate this design landscape. We discuss available data ranging from atmospheric retrievals to reanalysis datasets as well as existing physical models. Moreover, we identify a wide range of candidate downstream applications. Finally, we consider relevant recent developments in artificial intelligence (AI) that can be leveraged in this context. Here, we put a particular emphasis on AI models for atmospheric physics, data-driven approaches to data assimilation as well as methods to work in a limited data setting.

URL PDF HTML ☆

赞 0 踩 0

2605.28844 2026-05-29 cs.NE cs.LG 版本更新

无分辨率依赖的几何参数化与映射神经替代模型：面向空间变化场

Yanwen Huang, Lok Ming Lui, Gary P. T. Choi

发表机构 * Department of Mathematics, The Chinese University of Hong Kong（香港中文大学数学系）

AI总结提出一种无分辨率依赖的神经替代模型，通过多分辨率几何编码和几何感知约束（变分能量、扩散密度均衡、拟共形理论）无监督学习，直接从空间变化参数场预测映射位置，适用于任意结构化或非结构化点集。

详情

AI中文摘要

许多成像问题需要计算由空间变化的强度、特征或密度场引起的空间变换。典型例子包括畸变校正、可变形图像配准、基于图谱的分割以及变形驱动的图像分析。这些任务可以表述为几何映射问题，其中变换被约束以保持局部结构、控制边界行为或调节角度畸变。此类公式通常导致变分模型、扩散过程或椭圆偏微分方程。然而，当底层参数场在不同实例间变化时，重复求解高分辨率系统在计算上变得昂贵。在这项工作中，我们提出了一种无分辨率依赖的神经替代模型，用于几何参数化和映射问题。给定一个空间变化的参数场 $p:\Omega\to\mathbb{R}^m$ 和查询位置 $\{x_i\}_{i=1}^N\subset\Omega$，该模型预测任意结构化或非结构化点集上的映射位置 $\{u(x_i)\}_{i=1}^N$。为了避免对固定网格的依赖，我们采用了一种多分辨率几何编码策略，该策略将网络条件建立在参数场的坐标增强样本上。该模型通过强制执行源自变分能量、基于扩散的密度均衡和拟共形理论的几何感知约束进行训练，无需标记解数据。在拟共形映射和密度均衡映射问题上的实验结果展示了我们提出方法的有效性。

英文摘要

Many imaging problems require computing spatial transformations induced by spatially varying intensity, feature, or density fields. Canonical examples include distortion correction, deformable image registration, atlas-based segmentation, and deformation-driven image analysis. These tasks can be formulated as geometric mapping problems in which the transformation is constrained to preserve local structure, control boundary behavior, or regulate angular distortion. Such formulations typically lead to variational models, diffusion processes, or elliptic partial differential equations. However, repeatedly solving high-resolution systems becomes computationally expensive when the underlying parameter fields vary across instances. In this work, we propose a resolution-free neural surrogate for geometric parameterization and mapping problems. Given a spatially varying parameter field $p:Ω\to\mathbb{R}^m$ and query locations $\{x_i\}_{i=1}^N\subsetΩ$, the model predicts mapped locations $\{u(x_i)\}_{i=1}^N$ on arbitrary structured or unstructured point sets. To avoid dependence on a fixed grid, we use a multi-resolution geometric encoding strategy that conditions the network on coordinate-augmented samples of the parameter field. The model is trained without labeled solution data by enforcing geometry-aware constraints derived from variational energies, diffusion-based density equalization, and quasi-conformal theory. Experimental results on quasi-conformal mapping and density-equalizing mapping problems are presented to demonstrate the effectiveness of our proposed method.

URL PDF HTML ☆

赞 0 踩 0

2605.28488 2026-05-29 stat.ML cs.LG math.ST stat.TH 版本更新

Bridging Maximum Likelihood and Optimal Transport for Efficient Inference and Model Selection in Stochastic Block Models

桥接最大似然与最优传输：随机块模型中的高效推理与模型选择

Simon Queric, Cédric Vincent-Cuaz, Charles Bouveyron, Marco Corneli

发表机构 * Université Côte d’Azur（法国蔚蓝海岸大学）； Inria（法国国家信息与自动化技术研究院）； CNRS LJAD（法国国家科学研究中心LJAD实验室）； Maasai Nice, France（法国尼斯马萨伊研究所）； EPFL Lausanne, Switzerland（瑞士洛桑联邦理工学院）； CNRS CEPAM（法国国家科学研究中心CEPAM实验室）

AI总结本文通过最优传输视角研究随机块模型，提出正则化与未正则化的半松弛Gromov-Wasserstein估计器，实现聚类与模型参数的联合推断及簇数自动选择。

Comments 10 pages, 8 figures

详情

AI中文摘要

我们通过最优传输（OT）的视角研究随机块模型（SBM）中的推断。首先，我们证明最大似然变分推断（MLVI）可以解释为带有熵正则化的半松弛Gromov-Wasserstein（srGW）投影。虽然这种公式能产生准确的聚类，但熵正则化阻止了传输计划的稀疏性，从而阻碍了内在的模型选择。因此，我们研究未正则化的srGW估计器，并证明它们在渐近情况下一致地恢复SBM连接矩阵和潜在簇分配。然而，这种渐近性质在有限样本中并不能转化为可靠的模型选择，需要额外的机制来促进推断的簇比例中的稀疏性。我们通过实验表明，这种正则化公式产生的估计器能够在单个优化问题中同时恢复模型参数并选择簇的数量，从而避免了昂贵的网格搜索或启发式模型选择程序。

英文摘要

We study inference in stochastic block models (SBMs) through the lens of optimal transport (OT). We first establish that maximum likelihood variational inference (MLVI) can be interpreted as a semi-relaxed Gromov-Wasserstein (srGW) projection with entropic regularization. While this formulation yields accurate clustering, the entropic regularization prevents transport plans to be sparse, hindering intrinsic model selection. Consequently, we investigate unregularized srGW estimators, and prove that they consistently recover both the SBM connectivity matrix and latent cluster assignments in the asymptotic regime. However, this asymptotic property does not translate into reliable model selection in finite samples, and calls for additional mechanisms to promote sparsity in the inferred cluster proportions. We empirically show that such a regularized formulation yields estimators that simultaneously recover model parameters and select the number of clusters in a single optimization problem, thereby avoiding costly grid search or heuristic model selection procedures.

URL PDF HTML ☆

赞 0 踩 0

2605.28293 2026-05-29 cs.LG cs.AI 版本更新

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

ProRL: 通过修正策略梯度估计实现主动推荐的有效强化学习

Hongru Hou, Tiehua Mei, Denghui Geng, Jinhui Huang, Ao Xu, Hengrui Chen, Jiaqing Liang, Deqing Yang

发表机构 * School of Data Science, Fudan University, Shanghai, China（复旦大学数据科学学院，上海，中国）

AI总结针对主动推荐系统中策略梯度估计存在的长度依赖偏差和高方差问题，提出ProRL框架，通过逐步奖励中心化和位置特定优势估计两个机制修正梯度，显著提升推荐效果。

Comments Accepted in ICML 2026

详情

AI中文摘要

主动推荐系统（PRS）旨在通过生成中间推荐路径来引导用户偏好向目标物品转移。强化学习（RL）为优化此类序列决策任务提供了原则性框架，因为路径奖励可以自然地捕捉短期接受度和长期引导有效性。然而，将策略梯度直接应用于PRS会导致梯度估计存在缺陷。我们识别出两个缺陷：（1）路径级奖励分解为具有正均值的步骤级奖励，产生长度依赖偏差，导致梯度倾向于路径扩展而非有意义的探索；（2）用整个路径级奖励加权每个步骤忽略了分解结构，导致高梯度方差。为修正这两个缺陷，我们提出了一种有效的RL框架ProRL，其中包含两种用于主动推荐的新机制。首先，逐步奖励中心化减去期望奖励以消除长度依赖偏差，确保路径扩展产生零期望梯度信号。其次，位置特定优势估计利用奖励分解结构计算步骤相关的基线，降低梯度方差。这些机制共同产生精确针对路径质量的策略梯度。我们在三个真实世界数据集上的实验表明，ProRL显著优于最先进的PRS。我们的代码可在https://github.com/hongruhou89/ProRL获取。

英文摘要

Proactive Recommender Systems (PRSs) aim to guide user preference shift toward target items by generating paths of intermediate recommendations. Reinforcement learning (RL) provides a principled framework for optimizing such sequential decision tasks, as path rewards can naturally capture both short-term acceptance and long-term guidance effectiveness. However, naively applying policy gradients to PRS results in deficient gradient estimation. We identify two deficiencies: (1) path-level rewards decompose into step-level rewards with positive mean, creating a length-dependent bias that causes gradients to favor path extension over meaningful exploration; (2) weighting each step by the entire path-level reward ignores the decomposition structure, leading to high gradient variance. To rectify these two deficiencies, we propose an effective RL framework ProRL with two novel mechanisms for proactive recommendation. First, Stepwise Reward Centering subtracts expected rewards to neutralize length-dependent bias, ensuring that path extension yields zero expected gradient signal. Second, Position-Specific Advantage Estimation leverages the reward decomposition structure to compute step-dependent baselines, reducing gradient variance. Together, these mechanisms yield policy gradients that precisely target path quality. Our experiments on three real-world datasets demonstrate that ProRL significantly outperforms state-of-the-art PRSs. Our code is available at https://github.com/hongruhou89/ProRL.

URL PDF HTML ☆

赞 0 踩 0

2605.27975 2026-05-29 cs.LG stat.ML 版本更新

多模式呼吸衰竭预测的前瞻性评估：胸部X光片能否在电子健康记录信号之外提升性能？

Xiaolei Lu, Shamim Nemati

AI总结本研究提出一种门控多模态框架，集成结构化电子健康记录时间序列数据和胸部X光片基础模型表示，用于前瞻性预测ICU患者24小时内是否需要有创机械通气，结果显示相比仅使用电子健康记录的模型和医生预测，多模态融合提高了区分度、敏感性和阳性预测值。

详情

AI中文摘要

呼吸衰竭的早期预测对于重症监护病房的及时临床干预至关重要。现有的基于电子健康记录（EHR）的模型可以持续监测生理恶化，但可能无法完全捕捉胸部X光片（CXR）中反映的肺部病理生理学。在本研究中，我们探讨CXR信息是否能在仅使用EHR信号的基础上改善有创机械通气的前瞻性预测。我们开发了一个门控多模态框架，将结构化EHR时间序列数据与CXR基础模型表示相结合。门控模块根据患者特定的临床背景自适应地控制成像特征的贡献，使模型在成像信息有用时选择性地依赖它。我们前瞻性地评估了该框架在ICU患者中预测24小时内需要有创机械通气的性能，并将其与已建立的仅使用EHR的模型（Ventio）、在匹配临床时间点获得的医生预测以及替代多模态变体进行比较。门控多模态模型比仅使用EHR的基线模型实现了更高的区分度，使用REMEDIS和MedInsight CXR表示时AUROC值分别为0.860和0.858，而Ventio为0.752。相对于医生预测，多模态框架显著提高了敏感性，同时保持了良好的特异性。与仅使用EHR的模型相比，多模态整合提高了特异性和阳性预测值，表明CXR信息可以细化选定患者的风险估计。这些发现支持自适应多模态融合作为将成像纳入前瞻性呼吸衰竭预测的实用策略。

英文摘要

Early prediction of respiratory failure is critical for timely clinical intervention in intensive care units. Existing electronic health record (EHR)-based models can continuously monitor physiologic deterioration, but they may not fully capture pulmonary pathophysiology reflected in chest radiographs (CXRs). In this study, we ask whether CXR information improves prospective prediction of invasive mechanical ventilation beyond EHR signals alone. We develop a gated multimodal framework that integrates structured EHR time-series data with CXR foundation-model representations. The gating module adaptively controls the contribution of imaging features based on patient-specific clinical context, allowing the model to selectively rely on imaging information when it is informative. We prospectively evaluate the framework for predicting invasive mechanical ventilation within 24 hours in ICU patients and compare it with an established EHR-only model (Ventio), physician predictions obtained at matched clinical time points, and alternative multimodal variants. The gated multimodal models achieved higher discrimination than the EHR-only baseline, with AUROC values of 0.860 and 0.858 using REMEDIS and MedInsight CXR representations, respectively, compared with 0.752 for Ventio. Relative to physician predictions, the multimodal framework substantially improved sensitivity while maintaining favorable specificity. Compared with the EHR-only model, multimodal integration increased specificity and positive predictive value, suggesting that CXR information can refine risk estimation in selected patients. These findings support adaptive multimodal fusion as a practical strategy for incorporating imaging into prospective respiratory failure prediction.

URL PDF HTML ☆

赞 0 踩 0

2605.26194 2026-05-29 cs.LG 版本更新

Eureka：面向企业AI云资源需求预测的智能特征工程

Hangxuan Li, Renjun Jia, Xuezhang Wu, Yunjie Qian, Zeqi Zheng, Xianling Zhang

发表机构 * Alibaba Cloud Computing Co. Ltd, Hangzhou, China（阿里云计算有限公司，杭州，中国）； School of Computer Science, Fudan University, Shanghai, China（复旦大学计算机学院，上海，中国）； School of Computer Science and Technology, Tongji University, Shanghai, China（同济大学计算机科学与技术学院，上海，中国）； Independent Researcher, United States（独立研究员，美国）

AI总结提出Eureka框架，将特征工程视为智能体代码生成问题，通过专家代理、LLM特征工厂和自演化对齐引擎三阶段，自动生成可执行特征代码，在医疗、金融、社交等7个公开基准及阿里云GPU资源需求预测中显著提升性能。

Comments accepted at NeurIPS 2025 Workshop, DASFAA 2026 (International Conference on Database Systems for Advanced Applications)

详情

DOI: 10.1007/978-981-92-0378-9_33
Journal ref: Database Systems for Advanced Applications (DASFAA 2026), Lecture Notes in Computer Science, vol. 16540, pp. 528-540, Springer

AI中文摘要

有效的特征对于预测模型性能至关重要，但创建特征通常需要领域专业知识，限制了跨应用的可扩展性。我们将特征工程定义为一个智能体代码生成问题：特征不再是静态的数据转换，而是可生成、评估和迭代改进的可执行程序。我们提出了Eureka，一个由LLM驱动的三阶段框架。（1）专家代理，通过领域知识的SFT微调，生成结构化的JSON格式特征设计方案。（2）LLM特征工厂，通过思维链推理将每个方案转化为可执行的Python代码，将特征假设转化为可运行的程序。（3）自演化对齐引擎，使用带双通道奖励（基于指标的效用+语义对齐）的强化学习（GRPO）来提升代码质量。通过将特征表达为程序，学习到的生成模式可以跨领域迁移。在医疗、金融和社交领域的7个公开基准上评估，Eureka一致优于传统的AutoFE和基于LLM的基线。我们进一步在阿里云的云GPU资源需求预测中展示了Eureka的有效性，其中Eureka将需求满足率提高了16%，并将计算资源迁移率降低了33%。

英文摘要

Effective features are crucial for predictive model performance, but creating them often requires domain expertise, limiting scalability across applications. We define feature engineering as an agentic code generation problem: features are not static data transformations, but executable programs that can be generated, evaluated, and iteratively improved. We present Eureka, an LLM-driven framework with three stages. (1) An Expert Agent, fine-tuned via SFT on domain knowledge, produces structured feature design plans in JSON format. (2) An LLM Feature Factory translates each plan into executable Python code through chain-of-thought reasoning, turning feature hypotheses into runnable programs. (3) A Self-Evolving Alignment Engine uses Reinforcement Learning (GRPO) with dual-channel reward (metric-based utility + semantic alignment) to enhance code quality. By expressing features as programs, the learned generation patterns can transfer across domains. Evaluated on 7 public benchmarks in healthcare, finance, and social domains, Eureka consistently outperforms both traditional AutoFE and LLM-based baselines. We further demonstrate Eureka's effectiveness on cloud GPU resource demand prediction at Alibaba Cloud, where Eureka improves demand fulfillment rate by 16% and lowers computing resource migration rates by 33%.

URL PDF HTML ☆

赞 0 踩 0

2605.24846 2026-05-29 cs.LG cs.AI 版本更新

扩散理论教程：从微分方程到扩散模型

Jiayi Fu, Yuxia Wang

AI总结本教程从微分方程角度统一阐述扩散模型的数学基础，推导ODE和SDE表示，解释分数匹配和去噪目标，并涵盖DDPM、DDIM、流匹配和扩散语言模型。

Comments A detailed tutorial on Diffusion models and SDE

详情

AI中文摘要

扩散模型已成为生成建模的主导框架，但其数学基础通常通过扩散概率模型、基于分数的建模、随机微分方程和数值采样方法分别呈现。我们编写本教程，从微分方程的角度提供这些观点的统一且自洽的阐述。从条件高斯噪声过程出发，我们推导常微分方程（ODE）和随机微分方程（SDE）表示，过渡到相应的边际正向动力学，然后得到使生成成为可能的逆向时间SDE和概率流ODE。我们表明逆向采样中的中心未知量是边际分数，解释在噪声预测参数化下分数匹配如何成为标准去噪目标，并讨论实际的逆向时间采样和引导。我们进一步将DDPM、DDIM、流匹配和基于分数的SDE置于一个共同框架中，并以连续嵌入空间中的扩散语言模型结束，同时简要讨论离散掩码标记扩散。本教程旨在作为扩散过程的分析基础与建立在其上的现代生成算法之间的桥梁。

英文摘要

Diffusion models have emerged as a dominant framework for generative modeling, but their mathematical foundations are often presented separately through diffusion probabilistic models, score-based modeling, stochastic differential equations, and numerical sampling methods. We write this tutorial to provide a unified and self-contained account of these viewpoints from the perspective of differential equations. Starting from a conditional Gaussian noising process, we derive ordinary differential equation (ODE) and stochastic differential equation (SDE) representations, pass to the corresponding marginal forward dynamics, and then obtain the reverse-time SDE and probability-flow ODE that make generation possible. We show that the central unknown quantity in reverse sampling is the marginal score, explain how score matching becomes the standard denoising objective under a noise-prediction parameterization, and discuss practical reverse-time sampling and guidance. We further place DDPM, DDIM, flow matching, and score-based SDEs in a common framework, and conclude with diffusion language models in continuous embedding space together with a brief discussion of discrete masked-token diffusion. The tutorial is intended as a bridge between the analytical foundations of diffusion processes and the modern generative algorithms built upon them.

URL PDF HTML ☆

赞 0 踩 0

2605.22082 2026-05-29 cs.RO cs.LG 版本更新

DualKV: 面向高效RL训练的共享提示Flash注意力机制，支持大规模展开和长上下文

Jiading Gai, Shuai Zhang, Xiang Song, Bernie Wang, George Karypis

发表机构 * Amazon Web Services（亚马逊网络服务）； Google（谷歌）； University of Minnesota（明尼苏达大学）

AI总结针对RL训练中共享提示重复计算问题，提出DualKV内核，通过融合CUDA前向/反向核和veRL数据流水线重排，消除提示复制，实现1.63-3.82倍策略更新加速。

详情

AI中文摘要

现代RL后训练方法（如GRPO和DAPO）在从共享提示（$P$个token）采样的$N$个响应序列（每个$R$个token）上进行训练，但标准FlashAttention在前向和反向传播中将所有$P$个提示token复制$N$次——在相同的隐藏状态上重复计算和内存。在大规模展开、长上下文RL训练（$N\geq16$，$P\geq8\text{K}$）中，这种冗余主导了策略更新成本。我们观察到，在仅解码器模型中，因果掩码使提示表示在每一层跨序列不变，因此所有逐token操作（归一化、投影、MLP）和注意力可以一次性处理提示——这一特性尚未在训练的内核级别被利用。我们提出\textbf{DualKV}，这是首个消除RL训练中共享提示复制的FlashAttention内核变体，通过(1)~融合的CUDA前向和反向内核，在单次内核启动中迭代两个不相交的KV区域——共享上下文和逐序列响应，以及(2)~veRL中的数据流水线重设计，将$N(P{+}R)$个token重新打包为每个微批$P{+}NR$个token，将token减少从注意力扩展到整个模型，因子$ρ= N(P{+}R)/(P{+}NR)$。DualKV在数学上等价于标准注意力，且不引入近似。在Qwen3-8B GRPO训练中，使用8$\times$H100 GPU（$N{=}32$，8K上下文），DualKV实现了$1.63$--$2.09\times$的策略更新加速，支持$2\times$更大的微批，并将MFU从$36\%$提升至$76\%$。类似增益在DAPO上成立（$2.47\times$加速，$77\%$ MFU）。在30B MoE规模下，使用16$\times$H100，DualKV相比FlashAttention（需要4路Ulysses序列并行以避免OOM）实现了$3.82\times$的策略更新加速和$3.38\times$的端到端步骤加速。

英文摘要

Modern RL post-training methods such as GRPO and DAPO train on $N$ response sequences of $R$ tokens sampled from a shared prompt of $P$ tokens, but standard FlashAttention replicates all $P$ prompt tokens $N$ times across both forward and backward passes -- duplicating compute and memory on identical hidden states. In large-rollout, long-context RL training ($N{\geq}16$, $P{\geq}8\text{K}$), this redundancy dominates the policy update cost. We observe that in decoder-only models, causal masking makes prompt representations invariant across sequences at every layer, so all per-token operations (norms, projections, MLP) and attention can process the prompt once -- a property not yet exploited at the kernel level for training. We propose \textbf{DualKV}, the first FlashAttention kernel variant that eliminates shared-prompt replication during RL training, via (1)~fused CUDA forward and backward kernels that iterate over two disjoint KV regions -- shared context and per-sequence response -- in a single kernel launch, and (2)~a data-pipeline redesign in veRL that repacks $N(P{+}R)$ tokens into $P{+}NR$ tokens per micro-batch, extending the token reduction from attention to the entire model by a factor $ρ= N(P{+}R)/(P{+}NR)$. DualKV is mathematically equivalent to standard attention and introduces no approximation. On Qwen3-8B GRPO training with 8$\times$H100 GPUs ($N{=}32$, 8K-context), DualKV achieves $1.63$--$2.09\times$ policy-update speedup, enables $2\times$ larger micro-batches, and raises MFU from $36\%$ to $76\%$. Similar gains hold for DAPO ($2.47\times$ speedup, $77\%$ MFU). At 30B MoE scale on 16$\times$H100, DualKV achieves $3.82\times$ policy-update and $3.38\times$ end-to-end step speedup over FlashAttention (which requires 4-way Ulysses sequence parallelism to avoid OOM).

URL PDF HTML ☆

赞 0 踩 0

2605.13841 2026-05-29 cs.SD cs.AI cs.CL cs.LG 版本更新

EVA-Bench: A New End-to-end Framework for Evaluating Voice Agents

EVA-Bench：一种用于评估语音代理的新型端到端框架

Tara Bogavelli, Gabrielle Gauthier Melançon, Katrina Stankiewicz, Oluwanifemi Bamgbose, Fanny Riols, Hoang H. Nguyen, Raghav Mehndiratta, Lindsay Devon Brin, Joseph Marinier, Hari Subramani, Anil Madamala, Sridhar Krishna Nemala, Srinivas Sunkara

发表机构 * ServiceNow

AI总结提出EVA-Bench框架，通过机器人间音频对话模拟和复合指标（EVA-A和EVA-X）全面评估语音代理的准确性和体验质量。

Comments Work in progress

详情

AI中文摘要

语音代理是一种通过口语对话完成任务的人工智能系统，越来越多地部署在企业应用中。然而，现有基准测试未能同时解决两个核心评估挑战：生成逼真的模拟对话，以及全面衡量语音特定故障模式的质量。我们提出了EVA-Bench，一个端到端评估框架，同时解决这两个问题。在模拟方面，EVA-Bench通过动态多轮对话协调机器人间的音频对话，并自动进行模拟验证，检测用户模拟器错误并在评分前适当重新生成对话。在测量方面，EVA-Bench引入了两个复合指标：EVA-A（准确性），捕捉任务完成度、忠实度和音频级语音保真度；以及EVA-X（体验），捕捉对话进展、口语简洁性和话轮转换时机。这两个指标适用于所有主要的代理架构，支持直接的跨架构比较。EVA-Bench包含三个企业领域的213个场景、一个用于口音和噪声鲁棒性的受控扰动套件，以及区分峰值能力和可靠能力的pass@1、pass@k、pass^k测量。在跨越所有三种架构的12个系统中，我们发现：（1）没有系统在EVA-A pass@1和EVA-X pass@1上同时超过0.5；（2）峰值性能和可靠性能差异显著（EVA-A上pass@k与pass^k的中位数差距为0.44）；（3）口音和噪声扰动暴露了显著的鲁棒性差距，其影响因架构、系统和指标而异（平均Δ高达0.314）。我们在开源许可下发布了完整的框架、评估套件和基准数据。

英文摘要

Voice agents, artificial intelligence systems that conduct spoken conversations to complete tasks, are increasingly deployed across enterprise applications. However, no existing benchmark jointly addresses two core evaluation challenges: generating realistic simulated conversations, and measuring quality across the full scope of voice-specific failure modes. We present EVA-Bench, an end-to-end evaluation framework that addresses both. On the simulation side, EVA-Bench orchestrates bot-to-bot audio conversations over dynamic multi-turn dialogues, with automatic simulation validation that detects user simulator error and appropriately regenerates conversations before scoring. On the measurement side, EVA-Bench introduces two composite metrics: EVA-A (Accuracy), capturing task completion, faithfulness, and audio-level speech fidelity; and EVA-X (Experience), capturing conversation progression, spoken conciseness, and turn-taking timing. Both metrics apply to all major agent architectures, enabling direct cross-architecture comparison. EVA-Bench includes 213 scenarios across three enterprise domains, a controlled perturbation suite for accent and noise robustness, and pass@1, pass@k, pass^k measurements that distinguish peak from reliable capability. Across 12 systems spanning all three architectures, we find: (1) no system simultaneously exceeds 0.5 on both EVA-A pass@1 and EVA-X pass@1; (2) peak and reliable performance diverge substantially (median pass@k--pass^k gap of 0.44 on EVA-A); and (3) accent and noise perturbations expose substantial robustness gaps, with effects varying across architectures, systems, and metrics (mean $Δ$ up to 0.314). We release the full framework, evaluation suite, and benchmark data under an open-source license.

URL PDF HTML ☆

赞 0 踩 0

2605.12208 2026-05-29 stat.ML cs.AI cs.LG stat.CO 版本更新

Self-Supervised Laplace Approximation for Bayesian Uncertainty Quantification

自监督拉普拉斯近似用于贝叶斯不确定性量化

Julian Rodemann, Alexander Marquard, Thomas Augustin, Michele Caprio

发表机构 * Rational Intelligence Lab, CISPA Helmholtz Center for Information Security Department of Statistics, LMU Munich（理性智能实验室，CISPA海德堡信息安全中心统计学系，慕尼黑大学）； Department of Statistics, LMU Munich（统计学系，慕尼黑大学）； Department of Computer Science, The University of Manchester（计算机科学系，曼彻斯特大学）

AI总结提出自监督拉普拉斯近似（SSLA），通过重新拟合自预测数据直接近似后验预测分布，实现确定性、无采样的贝叶斯不确定性量化，并在回归任务中优于经典拉普拉斯近似。

Comments Accepted for publication in TMLR (https://openreview.net/forum?id=T8w8L2t3JG), v2: fixed typos and added a deceased-author footnote with a dedication to Thomas Augustin

详情

Journal ref: Transactions on Machine Learning Research (TMLR). ISSN 2835-8856 (2026)

AI中文摘要

重新审视LLM剪枝对测试时缩放的有效性

Ocean Monjur, Shahriar Kabir Nahin, Anshuman Chhabra

发表机构 * Bellini College of AI, Cybersecurity, and Computing（人工智能、网络安全与计算学院）

AI总结本文研究非结构化剪枝对推理型大语言模型测试时缩放性能的影响，发现其优于结构化剪枝甚至有时超过未剪枝模型，并探讨了层间稀疏分配策略的作用。

详情

AI中文摘要

大型语言模型（LLM）现在通过测试时计算缩放（TTS）展现出卓越的推理能力，在数学和编程基准测试中表现令人印象深刻。与此同时，模型压缩研究开发了剪枝方法，旨在在不牺牲任务性能的情况下移除冗余/有害参数。这两项研究进展的交叉点构成了我们工作的基础。具体到推理型LLM，先前的工作表明结构化剪枝（移除整组层块的方法）显著降低了TTS推理性能。然而，在这项工作中，我们重新审视了这一假设，并研究了非结构化剪枝（仅小心移除某些冗余/有害权重的方法）是否表现出类似的局限性。令人惊讶的是，我们在两个推理型LLM（s1.1-7B和Qwen3-8B）的四个推理基准上的广泛实验一致表明，与结构化剪枝相比，非结构化剪枝增强了TTS性能，有时甚至能超越未剪枝的全权重LLM。此外，我们还实证研究了不同层间稀疏分配策略的影响，这些策略是实现这些非结构化方法的重要参数选择。这些发现挑战了剪枝总是降低TTS性能的传统观念，实际上表明，谨慎进行的剪枝可以保持TTS的有效性。

英文摘要

Large Language Models (LLMs) now exhibit remarkable reasoning capabilities through test-time compute scaling (TTS), with impressive performance across math and coding benchmarks. In parallel, research in model compression has developed pruning methods that seek to remove redundant/detrimental parameters without sacrificing task performance. The intersection of these two research advancements lays the foundation for our work. Specific to reasoning LLMs, prior work has shown that structured pruning (methods which remove entire set of layer blocks), significantly degrades TTS reasoning performance. However, in this work, we revisit this assumption and investigate whether unstructured pruning (methods that carefully remove only certain redundant/detrimental weights) exhibits similar limitations. Surprisingly, our extensive experiments across four reasoning benchmarks on two reasoning LLMs: s1.1-7B and Qwen3-8B, consistently show that unstructured pruning augments TTS performance compared to structured pruning, and at times can even outperform the unpruned full-weight LLMs. Furthermore, we also empirically study the impact of different layer-wise sparsity allocation strategies, which are an important parametric choice for instantiating these unstructured methods. These findings challenge the conventional notion that pruning always reduces TTS performance and in fact, suggest that carefully undertaken pruning can retain TTS effectiveness.

URL PDF HTML ☆

赞 0 踩 0

2604.24824 2026-05-29 cs.LG 版本更新

使用两阶段核岭回归估计连续治疗效果

Seok-Jin Kim, Kaizheng Wang

发表机构 * Department of IEOR, Columbia University（哥伦比亚大学工业工程与运营研究系）； Department of IEOR and Data Science Institute, Columbia University（哥伦比亚大学工业工程与数据科学研究所）

AI总结针对连续治疗的效果函数估计问题，提出两阶段核岭回归方法，通过第一阶段建模响应与治疗和协变量的关系，第二阶段构造伪结果校正分布偏移，无需估计条件治疗密度即可达到最优学习界，并实现数据驱动的模型选择。

详情

AI中文摘要

我们研究连续治疗的效果函数估计问题，该函数将每个治疗值映射到群体平均结果。该设置中的一个核心挑战是混杂：治疗分配通常依赖于协变量，产生选择偏差，使得直接对响应进行回归不可靠。为了解决这个问题，我们提出了一种两阶段核岭回归方法。在第一阶段，我们学习一个模型，将响应表示为治疗和协变量的函数；在第二阶段，我们使用该模型构造伪结果以校正分布偏移，然后拟合第二个模型来估计治疗效果。尽管响应随治疗和协变量变化，但通过对协变量平均得到的诱导效果函数通常更简单，我们的估计器适应这种结构。我们在不估计条件治疗密度的情况下实现了最优学习界，从而绕过了现有方法中的一个主要瓶颈。此外，我们引入了一种完全数据驱动的模型选择程序，该程序对未知的重叠程度和底层核的谱衰减具有可证明的自适应性。

英文摘要

We study the problem of estimating the effect function for a continuous treatment, which maps each treatment value to a population-averaged outcome. A central challenge in this setting is confounding: treatment assignment often depends on covariates, creating selection bias that makes direct regression of the response on treatment unreliable. To address this issue, we propose a two-stage kernel ridge regression method. In the first stage, we learn a model for the response as a function of both treatment and covariates; in the second stage, we use this model to construct pseudo-outcomes that correct for distribution shift, and then fit a second model to estimate the treatment effect. Although the response varies with both treatment and covariates, the induced effect function obtained by averaging over covariates is typically much simpler, and our estimator adapts to this structure. Our optimal learning bounds are achieved without estimating the conditional treatment density, thereby bypassing a major bottleneck in existing methods. Furthermore, we introduce a fully data-driven model selection procedure that achieves provable adaptivity to both the unknown degree of overlap and the spectral decay of the underlying kernel.

URL PDF HTML ☆

赞 0 踩 0

2604.13147 2026-05-29 stat.ML cs.LG math.PR 版本更新

Adaptive Learning via Off-Model Training and Importance Sampling for Fully Non-Markovian Optimal Stochastic Control. Complete version

基于离模型训练和重要性采样的自适应学习用于完全非马尔可夫最优随机控制（完整版）

Dorival Leão, Alberto Ohashi, Simone Scotti, Adolfo M. D da Silva

发表机构 * Departamento de Matemática, Universidade de Brasília（数学系，巴西利亚大学）； Università di Pisa, DEM（比萨大学，DEM）； Université Paris Cité, LPSM（巴黎Cité大学，LPSM）

AI总结针对完全非马尔可夫且依赖未知模型参数的连续时间随机控制问题，提出一种基于离散骨架和重要性采样的蒙特卡洛学习方法，实现离模型训练架构和自适应参数更新，并给出非渐近误差界。

Comments Typos are fixed. Numerical experiment is revised

详情

AI中文摘要

本文研究连续时间随机控制问题，其受控状态是完全非马尔可夫的，且依赖于未知模型参数。这类问题自然出现在路径依赖随机微分方程、粗糙波动率对冲以及分数布朗运动驱动的系统中。基于先前工作中发展的离散骨架方法，我们提出了一种用于相关嵌入后向动态规划方程的蒙特卡洛学习方法。我们的主要贡献有两方面。首先，针对几类具有代表性的非马尔可夫受控系统，我们构造了显式的支配训练律和Radon-Nikodym权重。这产生了一种离模型训练架构，其中在参考律下生成固定的合成数据集，而通过重要性采样恢复与目标模型相关的动态规划算子。其次，我们利用这种结构设计了参数模型不确定性下的自适应更新机制，使得可以通过重新加权相同的训练样本而非重新生成新轨迹来执行重复校准。对于固定参数，我们建立了通过深度神经网络逼近嵌入动态规划方程的非渐近误差界。对于自适应学习，我们推导了将蒙特卡洛逼近误差与模型风险误差分离的定量估计。数值实验在结构化线性二次型例子中展示了离模型训练机制和自适应重要性采样更新。

英文摘要

This paper studies continuous-time stochastic control problems whose controlled states are fully non-Markovian and depend on unknown model parameters. Such problems arise naturally in path-dependent stochastic differential equations, rough-volatility hedging, and systems driven by fractional Brownian motion. Building on the discrete skeleton approach developed in earlier work, we propose a Monte Carlo learning methodology for the associated embedded backward dynamic programming equation. Our main contribution is twofold. First, we construct explicit dominating training laws and Radon--Nikodym weights for several representative classes of non-Markovian controlled systems. This yields an off-model training architecture in which a fixed synthetic dataset is generated under a reference law, while the dynamic programming operators associated with a target model are recovered by importance sampling. Second, we use this structure to design an adaptive update mechanism under parametric model uncertainty, so that repeated recalibration can be performed by reweighting the same training sample rather than regenerating new trajectories. For fixed parameters, we establish non-asymptotic error bounds for the approximation of the embedded dynamic programming equation via deep neural networks. For adaptive learning, we derive quantitative estimates that separate Monte Carlo approximation error from model-risk error. Numerical experiments illustrate both the off-model training mechanism and the adaptive importance-sampling update in structured linear-quadratic examples.

URL PDF HTML ☆

赞 0 踩 0

2604.05446 2026-05-29 stat.ML cs.LG 版本更新

MEC: Machine-Learning-Assisted Generalized Entropy Calibration for Semi-Supervised Mean Estimation

MEC：基于机器学习的广义熵校准用于半监督均值估计

Se Yoon Lee, Jae Kwang Kim

发表机构 * Texas A\&M University（德克萨斯A&M大学）； Iowa State University（爱荷华州立大学）

AI总结提出MEC方法，通过交叉拟合校准加权改进预测驱动推断，在半监督均值估计中实现半参数效率界，并提升置信区间覆盖率和精度。

详情

AI中文摘要

获取高质量标签成本高昂，而无标签协变量通常丰富，这推动了具有可靠不确定性量化的半监督推断方法的发展。预测驱动推断（PPI）利用在少量标记样本上训练的机器学习预测器来提高效率，但在模型误指定下可能损失效率，并因标签重用而导致覆盖失真。我们引入了基于机器学习的广义熵校准（MEC），这是PPI的一种交叉拟合、校准加权变体。MEC通过基于Bregman投影的原则性校准框架对标记样本重新加权，以更好地与目标群体对齐，从而提高效率。这使MEC对预测器的仿射变换具有鲁棒性，并通过用更弱的投影误差条件替代原始预测误差条件，放宽了有效性的要求。因此，MEC在比现有PPI变体更弱的假设下达到了半参数效率界。在模拟和实际数据应用中，MEC实现了接近名义覆盖率的置信区间，并且比CF-PPI和普通PPI具有更紧的置信区间。

英文摘要

Obtaining high-quality labels is costly, whereas unlabeled covariates are often abundant, motivating semi-supervised inference methods with reliable uncertainty quantification. Prediction-powered inference (PPI) leverages a machine-learning predictor trained on a small labeled sample to improve efficiency, but it can lose efficiency under model misspecification and suffer from coverage distortions due to label reuse. We introduce Machine-Learning-Assisted Generalized Entropy Calibration (MEC), a cross-fitted, calibration-weighted variant of PPI. MEC improves efficiency by reweighting labeled samples to better align with the target population, using a principled calibration framework based on Bregman projections. This yields robustness to affine transformations of the predictor and relaxes requirements for validity by replacing conditions on raw prediction error with weaker projection-error conditions. As a result, MEC attains the semiparametric efficiency bound under weaker assumptions than existing PPI variants. Across simulations and a real-data application, MEC achieves near-nominal coverage and tighter confidence intervals than CF-PPI and vanilla PPI.

URL PDF HTML ☆

赞 0 踩 0

2603.23971 2026-05-29 cs.CL cs.AI cs.GT cs.LG cs.MA 版本更新

The Price Reversal Phenomenon: When Cheaper Reasoning Models Cost More

价格反转现象：当更便宜的推理模型成本更高时

Lingjiao Chen, Chi Zhang, Yeye He, Ion Stoica, Matei Zaharia, James Zou

发表机构 * Stanford University（斯坦福大学）； UC Berkeley（加州大学伯克利分校）； CMU（卡内基梅隆大学）； Microsoft Research（微软研究院）

AI总结本文首次系统研究推理模型标价与实际成本的偏差，发现32%的模型对比较中存在价格反转现象，并基于Shapley值建立成本归因框架，揭示思考令牌消耗和交互轮次的高度异质性是主要原因。

详情

AI中文摘要

开发者和消费者越来越根据列出的API价格选择推理模型（RMs）。然而，这些价格在多大程度上准确反映了实际推理成本？我们首次系统研究这一问题，评估了8个前沿RM在12个不同任务上的表现，涵盖竞赛数学、科学问答、代码生成和多领域智能体。我们发现了定价反转现象：在32%的模型对比较中，标价较低的模型实际上产生了更高的总成本，反转幅度高达28倍。例如，Gemini 3 Flash的标价比GPT-5.4便宜80%，但其在所有任务上的实际成本却高出38%。我们基于Shapley值构建了一个正式的成本归因框架，并利用它追溯了思考令牌消耗和交互轮次数量巨大异质性的主要贡献因素：对于同一查询，一个模型可能比另一个模型多使用900%的思考令牌，或多出10倍的环境交互轮次。我们进一步表明，每次查询的成本预测本质上是困难的：同一查询的重复运行产生的思考令牌变化高达9.7倍，为任何预测器建立了不可约的噪声底限。因此，我们提出成本分布预测作为一个开放挑战。我们的发现表明，列出的API定价是实际成本的不可靠代理，呼吁进行成本感知的模型选择和透明的每次请求成本监控。

英文摘要

Developers and consumers increasingly choose reasoning models (RMs) based on their listed API prices. However, how accurately do these prices reflect actual inference costs? We conduct the first systematic study of this question, evaluating 8 frontier RMs across 12 diverse tasks covering competition math, science QA, code generation, and multi-domain agents. We uncover the pricing reversal phenomenon: in 32% of model-pair comparisons, the model with a lower listed price actually incurs a higher total cost, with reversal magnitude reaching up to 28x. For example, Gemini 3 Flash's listed price is 80% cheaper than GPT-5.4's, yet its actual cost across all tasks is 38% higher. We build a formal cost attribution framework based on Shapley value, and leverage it to trace the dominating contributors to vast heterogeneity in thinking token consumption and number of interaction turns: on the same query, one model may use 900% more thinking tokens than another, or 10x more turns of environment interactions. We further show that per-query cost prediction is fundamentally difficult: repeated runs of the same query yield thinking token variation up to 9.7x, establishing an irreducible noise floor for any predictor. Thus, we propose cost distribution prediction as an open challenge. Our findings demonstrate that listed API pricing is an unreliable proxy for actual cost, calling for cost-aware model selection and transparent per-request cost monitoring.

URL PDF HTML ☆

赞 0 踩 0

2603.22348 2026-05-29 cs.LG cs.GT 版本更新

Learning Safely Without Knowing the World:COMPASS-Hedge

在不了解世界的情况下安全学习：COMPASS-Hedge

Ting Hu, Luanda Cai, Emmanouil-Vasileios Vlatakis-Gkaragkounis

发表机构 * Department of Economics University of Wisconsin–Madison（经济学系威斯康星大学麦迪逊分校）； Department of Finance University of Wisconsin–Madison（金融系威斯康星大学麦迪逊分校）； Department of Computer Sciences University of Wisconsin–Madison（计算机科学系威斯康星大学麦迪逊分校）

AI总结提出COMPASS-Hedge算法，通过自适应伪遗憾缩放和基于阶段的激进策略，首次在全信息在线学习中同时实现对抗环境下的极小化最优遗憾、随机环境下的实例最优遗憾以及相对于基准策略的常数遗憾，且无需先验知识。

详情

AI中文摘要

在线学习算法常常面临一个基本的三难困境：在对抗性和随机性设置之间平衡遗憾保证，并提供相对于固定比较器的基线安全性。虽然现有方法在其中一个或两个领域表现出色，但它们通常无法在不牺牲最优速率或需要问题相关参数的神谕访问的情况下统一所有三个目标。在这项工作中，我们通过引入COMPASS-Hedge来弥合这一差距。据我们所知，我们的算法是第一个全信息任意时间方法，同时实现（达到对数因子）：i）对抗环境中的极小化最优遗憾；ii）随机环境中实例最优、间隙相关的遗憾；以及iii）相对于指定基准策略的$\tilde{\mathcal{O}}(1)$遗憾。关键是，COMPASS-Hedge是无参数的，不需要事先了解环境的性质或随机次优间隙的大小。我们的方法依赖于自适应伪遗憾缩放和基于阶段的激进策略的新颖结合，以及比较器感知的混合策略。据我们所知，这提供了全信息设置中的第一个“三世界最优”保证，确立了基线安全性不必以最坏情况鲁棒性或随机效率为代价。

英文摘要

Online learning algorithms often face a fundamental trilemma: balancing regret guarantees between adversarial and stochastic settings and providing baseline safety against a fixed comparator. While existing methods excel in one or two of these regimes, they typically fail to unify all three without sacrificing optimal rates or requiring oracle access to problem-dependent parameters. In this work, we bridge this gap by introducing COMPASS-Hedge. To the best of our knowledge, our algorithm is the first full-information anytime method to simultaneously achieve, up to logarithmic factors: i) minimax-optimal regret in adversarial environments; ii) instance-optimal, gap-dependent regret in stochastic environments; and iii) $\tilde{\mathcal{O}}(1)$ regret relative to a designated baseline policy. Crucially, COMPASS-Hedge is parameter-free and requires no prior knowledge of the environment's nature or the magnitude of the stochastic suboptimality gaps. Our approach hinges on a novel integration of adaptive pseudo-regret scaling and phase-based aggression, coupled with a comparator-aware mixing strategy. To the best of our knowledge, this provides the first "best-of-three-world" guarantee in the full-information setting, establishing that baseline safety does not have to come at the cost of worst-case robustness or stochastic efficiency.

URL PDF HTML ☆

赞 0 踩 0

2603.18859 2026-05-29 cs.AI cs.CL cs.LG 版本更新

RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models

RewardFlow: 面向大语言模型智能体强化学习的拓扑感知状态图奖励传播

Xiao Feng, Bo Han, Zhanke Zhou, Jiaqi Fan, Jiangchao Yao, Ka Ho Li, Dahai Yu, Michael Kwok-Po Ng

发表机构 * TMLR Group（TMLR小组）； Hong Kong Baptist University（香港 Baptist大学）； TCL Corporate Research (HK) Co Ltd（TCL企业研究（香港）有限公司）； Cooperative Medianet Innovation Center Shanghai Jiao Tong University（合作中位网创新中心上海交通大学）； Department of Mathematics Hong Kong Baptist University（香港 Baptist大学数学系）

AI总结提出RewardFlow方法，通过构建状态图进行拓扑感知的奖励传播，为智能体推理提供无标注的密集奖励，显著提升强化学习性能。

详情

AI中文摘要

强化学习在增强大语言模型智能体推理方面展现出潜力，但稀疏的终端奖励阻碍了细粒度优化。过程奖励建模提供了一种替代方案，但带来了高计算成本、奖励黑客风险和标注瓶颈。我们引入RewardFlow，一种用于估计智能体推理中状态级奖励的轻量级方法。通过构建捕获轨迹内在拓扑结构的状态图，RewardFlow执行拓扑感知的传播以估计每个状态对成功的贡献，从而产生有原则的、无标注的密集奖励。用于强化学习优化时，RewardFlow在四个智能体基准测试中显著优于先前基线：在基于文本的任务上平均成功率提高6.2%，在视觉推理上跨三个模型尺度比最强基线提高29.7%，在DeepResearch上准确率提高10%，同时具有卓越的鲁棒性和训练效率。RewardFlow的实现已在https://github.com/tmlr-group/RewardFlow公开。

英文摘要

Reinforcement learning (RL) shows promise for enhancing LLM agentic reasoning, yet sparse terminal rewards hinder fine-grained optimization. Process reward modeling offers an alternative but incurs high computational costs, reward hacking risks, and annotation bottlenecks. We introduce RewardFlow, a lightweight method for estimating state-level rewards in agentic reasoning. By constructing state graphs that capture the intrinsic topological structure of trajectories, RewardFlow performs topology-aware propagation to estimate each state's contribution to success, yielding principled, annotation-free dense rewards. Used for RL optimization, RewardFlow substantially outperforms prior baselines across four agentic benchmarks: +6.2% average success rate on text-based tasks, +29.7% on visual reasoning over the strongest baseline across three model scales, and +10% accuracy on DeepResearch, with superior robustness and training efficiency. The implementation of RewardFlow is publicly available at https://github.com/tmlr-group/RewardFlow.

URL PDF HTML ☆

赞 0 踩 0

2603.16673 2026-05-29 cs.RO cs.AI cs.LG 版本更新

基于子模重放的根吸收前缀轨迹平衡用于GFlowNet训练

Xi Wang, Wenbo Lu, Shengjie Wang

发表机构 * Courant Institute School of Mathematics, Computing, and Data Science, New York University（纽约大学Courant研究所数学、计算与数据科学学院）； Courant Institute School of Mathematics, Computing（纽约大学Courant研究所数学、计算与数据科学学院）； Data Science, New York University（纽约大学数据科学学院）

AI总结针对GFlowNet的模式坍塌问题，提出RapTB目标函数（通过根锚定子轨迹监督和吸收后缀备份提供密集前缀学习信号）和SubM子模重放策略（促进高奖励和多样性），在分子生成等任务中提升优化性能和多样性。

详情

AI中文摘要

生成流网络（GFlowNets）能够微调大型语言模型以近似奖励比例的后验分布，但仍容易出现模式坍塌，表现为前缀坍塌和长度偏差。我们将此归因于两个因素：（i）对早期前缀的信用分配较弱，以及（ii）有偏的重放导致偏移的、非代表性的训练流分布。我们提出根吸收前缀轨迹平衡（RapTB），该目标函数将子轨迹监督锚定在根节点，并通过吸收后缀备份将终端奖励传播到中间前缀，从而提供密集的前缀级学习信号。为了减轻重放引起的分布偏移，我们进一步引入SubM，一种子模重放刷新策略，同时促进高奖励和多样性。实验表明，在使用SMILES字符串的分子生成等任务中，RapTB结合SubM持续提升优化性能和分子多样性，同时保持高有效性。

英文摘要

Generative Flow Networks (GFlowNets) enable fine-tuning large language models to approximate reward-proportional posteriors, but they remain prone to mode collapse, manifesting as prefix collapse and length bias. We attribute this to two factors: (i) weak credit assignment to early prefixes, and (ii) biased replay that induces a shifted, non-representative training flow distribution. We propose Rooted absorbed prefix Trajectory Balance RapTB, an objective that anchors subtrajectory supervision at the root and propagates terminal rewards to intermediate prefixes via absorbed suffix-based backups, providing dense prefix-level learning signals. To mitigate replay-induced distribution shift, we further introduce SubM, a submodular replay refresh strategy that promotes both high reward and diversity. Empirically, on tasks such as molecule generation with LLM using SMILES strings, RapTB combined with SubM consistently improves optimization performance and molecular diversity while preserving high validity.

URL PDF HTML ☆

赞 0 踩 0

2603.00377 2026-05-29 cs.LG 版本更新

聚合模型而非解释：改进特征重要性估计

Joseph Paillard, Angel Reyero Lobo, Denis A. Engemann, Bertrand Thirion

发表机构 * Roche Pharma Research \& Early Development, Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd, Basel, Switzerland ； Universite Paris-Saclay, Inria, CEA, Palaiseau, France

AI总结针对特征重要性估计不准确的问题，本文通过理论分析证明模型级集成比解释级集成能更有效地降低误差，并在基准和蛋白质组学数据上验证。

详情

AI中文摘要

特征重要性方法有望将机器学习模型从预测引擎转变为科学发现的工具。然而，由于数据采样和算法随机性，表达性模型可能不稳定，导致变量重要性估计不准确，削弱其在关键生物医学应用中的效用。尽管集成提供了一种解决方案，但由于重要性度量的非线性，决定是解释单个集成模型还是聚合单个模型解释是困难的，并且尚未得到充分研究。我们的理论分析在适应复杂最先进机器学习模型的假设下发展，揭示了这一选择主要由模型的超额风险驱动。与先前文献相反，我们表明模型级集成通过减少这一主导误差项，提供了更准确的变量重要性估计，特别是对于表达性模型。我们在经典基准和来自英国生物银行的大规模蛋白质组学研究中验证了这些发现。

英文摘要

Feature-importance methods show promise in transforming machine learning models from predictive engines into tools for scientific discovery. However, due to data sampling and algorithmic stochasticity, expressive models can be unstable, leading to inaccurate variable importance estimates and undermining their utility in critical biomedical applications. Although ensembling offers a solution, deciding whether to explain a single ensemble model or aggregate individual model explanations is difficult due to the nonlinearity of importance measures and remains largely understudied. Our theoretical analysis, developed under assumptions accommodating complex state-of-the-art ML models, reveals that this choice is primarily driven by the model's excess risk. In contrast to prior literature, we show that ensembling at the model level provides more accurate variable-importance estimates, particularly for expressive models, by reducing this leading error term. We validate these findings on classical benchmarks and a large-scale proteomic study from the UK Biobank.

URL PDF HTML ☆

赞 0 踩 0

2602.10765 2026-05-29 cs.LG 版本更新

Collaborative Threshold Watermarking

协作阈值水印

Tameem Bakr, Anish Ambreth, Nils Lukas

发表机构 * Department of Machine Learning, MBZUAI（机器学习系，MBZUAI）

AI总结针对联邦学习中多客户端模型溯源问题，提出 (t,K)-阈值水印协议，通过秘密共享水印密钥实现至少 t 个客户端协作验证，且抵抗少于 t 客户端的共谋攻击。

详情

AI中文摘要

在联邦学习（FL）中，$K$ 个客户端共同训练一个模型，而不共享原始数据。由于每个参与者投入数据和计算资源，客户端需要机制来后续证明联合训练模型的来源。模型水印在权重中嵌入隐藏信号，但朴素方法要么随着 $K$ 增长，每个客户端的水印被稀释而无法扩展，要么赋予单个客户端验证并可能移除水印的能力。我们引入 $(t,K)$-阈值水印：客户端在训练期间协作嵌入共享水印，而只有至少 $t$ 个客户端的联盟才能重建水印密钥并验证可疑模型。我们秘密共享水印密钥 $τ$，使得少于 $t$ 个客户端的联盟无法重建它，并且可以在不公开 $τ$ 的情况下进行验证。我们在白盒设置中实例化我们的协议，并在 IID 和非 IID 分区上的图像分类任务以及语言模型微调设置中评估它。我们的水印在规模（$K=128$）下仍可检测，准确率损失最小，并且在攻击（包括使用多达 20% 训练数据的自适应微调）下仍保持在检测阈值（$z\ge 4$）以上。代码可在 https://github.com/tameemalaa/collaborative-threshold-watermark 获取。

英文摘要

In federated learning (FL), $K$ clients jointly train a model without sharing raw data. Because each participant invests data and compute, clients need mechanisms to later prove the provenance of a jointly trained model. Model watermarking embeds a hidden signal in the weights, but naive approaches either do not scale with many clients as per-client watermarks dilute as $K$ grows, or give any individual client the ability to verify and potentially remove the watermark. We introduce $(t,K)$-threshold watermarking: clients collaboratively embed a shared watermark during training, while only coalitions of at least $t$ clients can reconstruct the watermark key and verify a suspect model. We secret-share the watermark key $τ$ so that coalitions of fewer than $t$ clients cannot reconstruct it, and verification can be performed without revealing $τ$ in the clear. We instantiate our protocol in the white-box setting and evaluate it on image classification tasks on both IID and non-IID partitions, as well as language models fine-tuning setting. Our watermark remains detectable at scale ($K=128$) with minimal accuracy loss and stays above the detection threshold ($z\ge 4$) under attacks including adaptive fine-tuning using up to 20% of the training data. Code is available at https://github.com/tameemalaa/collaborative-threshold-watermark.

URL PDF HTML ☆

赞 0 踩 0

2602.09499 2026-05-29 cs.LG cs.CR 版本更新

Computationally Efficient Replicable Learning of Parities and Applications

奇偶性的计算高效可复现学习及其应用

Moshe Noivirt, Jessica Sorrell, Eliad Tsfadia

发表机构 * Department of Computer Science, Johns Hopkins University（约翰霍普金斯大学计算机科学系）； Department of Computer Science, Bar-Ilan University（巴伊兰大学计算机科学系）

AI总结本文提出首个计算高效的奇偶性可复现学习算法，证明可复现学习在一般分布上严格超越SQ学习，并揭示其与差分隐私在样本复杂度上的分离。

详情

AI中文摘要

我们研究了可复现性（Impagliazzo等 [STOC `22], Ghazi等 [NeurIPS `21]）与其他稳定性概念之间的计算关系。具体而言，我们关注可复现PAC学习及其与差分隐私（Dwork等 [TCC 2006]）和统计查询（SQ）模型（Kearns [JACM `98]）的联系。从统计角度看，已知差分隐私学习和可复现学习是等价的，并且严格强于SQ学习。然而，在计算上，所有先前已知的高效（即多项式时间）可复现学习算法都局限于SQ可学习任务或受限分布，这与差分隐私学习形成对比。我们的主要贡献是第一个计算高效的可复现算法，用于在任意分布上可实现地学习奇偶性，这一任务在SQ模型中是困难的，但在差分隐私下是可能的。这一结果首次证明，在一般分布上的高效可复现学习严格扩展了高效SQ学习，并且在能力上更接近高效差分隐私学习，尽管可复现性与隐私之间存在计算分离。此外，我们利用我们的奇偶性学习器证明，假设$RP \neq NP$，将可复现性转化为纯差分隐私需要样本复杂度的严格损失。我们的主要构建模块是一个新的、高效且可复现的算法，给定一组向量，该算法输出其线性张成的一个子空间，该子空间覆盖了大部分向量。

英文摘要

We study the computational relationship between replicability (Impagliazzo et al. [STOC `22], Ghazi et al. [NeurIPS `21]) and other stability notions. Specifically, we focus on replicable PAC learning and its connections to differential privacy (Dwork et al. [TCC 2006]) and to the statistical query (SQ) model (Kearns [JACM `98]). Statistically, it was known that differentially private learning and replicable learning are equivalent and strictly more powerful than SQ-learning. Yet, computationally, all previously known efficient (i.e., polynomial-time) replicable learning algorithms were confined to SQ-learnable tasks or restricted distributions, in contrast to differentially private learning. Our main contribution is the first computationally efficient replicable algorithm for realizable learning of parities over arbitrary distributions, a task that is known to be hard in the SQ-model, but possible under differential privacy. This result provides the first evidence that efficient replicable learning over general distributions strictly extends efficient SQ-learning, and is closer in power to efficient differentially private learning, despite computational separations between replicability and privacy. Additionally, we leverage our parity learner to prove that, assuming $RP \neq NP$, converting replicability to pure differential privacy requires a strict loss in sample complexity. Our main building block is a new, efficient, and replicable algorithm that, given a set of vectors, outputs a subspace of their linear span that covers most of them.

URL PDF HTML ☆

赞 0 踩 0

2602.05786 2026-05-29 cs.LG stat.AP stat.ML 版本更新

Selecting Hyperparameters for Tree-Boosting

选择树提升的超参数

Floris Jan Koster, Fabio Sigrist

发表机构 * Seminar for Statistics, ETH Zurich（苏黎世联邦理工学院统计研究所）

AI总结本文通过59个数据集比较了多种超参数优化方法，发现SMAC方法显著优于其他方法，并揭示了超参数调优的关键因素。

详情

AI中文摘要

树提升是一种广泛用于表格数据的机器学习技术。然而，其样本外准确性严重依赖于多个超参数。在本文中，我们使用59个回归和分类数据集，实证比较了几种流行的树提升超参数优化方法，包括随机网格搜索、树结构Parzen估计器（TPE）、基于高斯过程的贝叶斯优化（GP-BO）、Hyperband、基于序列模型的算法配置（SMAC）方法以及确定性全网格搜索。我们发现SMAC方法明显优于所有其他考虑的方法。我们进一步观察到：（i）需要相对较大的试验次数（大于100）才能进行准确的调优，（ii）使用超参数的默认值会产生非常不准确的模型，（iii）所有考虑的超参数都可能对树提升的准确性产生实质性影响，即不存在一组比其他超参数更重要的超参数，以及（iv）对于回归任务，使用早停法选择提升迭代次数比将其包含在搜索空间中能产生更准确的结果。

英文摘要

Tree-boosting is a widely used machine learning technique for tabular data. However, its out-of-sample accuracy is critically dependent on multiple hyperparameters. In this article, we empirically compare several popular methods for hyperparameter optimization for tree-boosting including random grid search, the tree-structured Parzen estimator (TPE), Gaussian-process-based Bayesian optimization (GP-BO), Hyperband, the sequential model-based algorithm configuration (SMAC) method, and deterministic full grid search using $59$ regression and classification data sets. We find that the SMAC method clearly outperforms all the other considered methods. We further observe that (i) a relatively large number of trials larger than $100$ is required for accurate tuning, (ii) using default values for hyperparameters yields very inaccurate models, (iii) all considered hyperparameters can have a material effect on the accuracy of tree-boosting, i.e., there is no small set of hyperparameters that is more important than others, and (iv) choosing the number of boosting iterations using early stopping yields more accurate results compared to including it in the search space for regression tasks.

URL PDF HTML ☆

赞 0 踩 0

2602.02909 2026-05-29 cs.AI cs.FL cs.LG 版本更新

Reasoning about Reasoning: BAPO Bounds on Chain-of-Thought Token Complexity in LLMs

关于推理的推理：LLM中思维链令牌复杂度的BAPO界限

Kiran Tomlinson, Tobias Schnabel, Adith Swaminathan, Jennifer Neville

发表机构 * Microsoft Research, Redmond, WA（微软研究院，西雅图，华盛顿）

AI总结通过扩展BAPO模型，证明二元多数、三元组匹配和图可达性三个任务需要Ω(n)个思维链令牌，实验验证了线性缩放与理论下界一致。

Comments 31 pages; accepted to ICML '26

详情

AI中文摘要

通过思维链（CoT）推理进行推理时扩展是当前最先进LLM性能的主要驱动力，但会带来显著的延迟和计算成本。我们解决了一个基本的理论问题：随着输入规模增长，需要多少推理令牌才能解决问题？通过扩展有界注意力前缀预言机（BAPO）模型——一种量化任务所需信息流的LLM抽象——我们证明了三个典型的BAPO困难任务所需的CoT令牌下界：二元多数、三元组匹配和图可达性。我们证明当输入规模为$n$时，每个任务需要$Ω(n)$个推理令牌。我们通过显式构造给出了匹配或接近匹配的上界。最后，我们在前沿推理模型上的实验显示，这些任务上的推理令牌数量近似线性缩放，且在推理预算受限时出现失败，这与我们的理论下界一致。总之，我们的结果识别了通过CoT进行推理时计算的基本瓶颈，并为分析最优推理长度提供了一种原则性工具。

英文摘要

Inference-time scaling via chain-of-thought (CoT) reasoning is a major driver of state-of-the-art LLM performance, but it comes with substantial latency and compute costs. We address a fundamental theoretical question: how many reasoning tokens are required to solve a problem as input size grows? By extending the bounded attention prefix oracle (BAPO) model--an abstraction of LLMs that quantifies the information flow required to solve a task--we prove lower bounds on the CoT tokens required for three canonical BAPO-hard tasks: binary majority, triplet matching, and graph reachability. We show that each requires $Ω(n)$ reasoning tokens when the input size is $n$. We complement these results with matching or near-matching upper bounds via explicit constructions. Finally, our experiments with frontier reasoning models show approximately linear reasoning token scaling on these tasks and failures when constrained to smaller reasoning budgets, consistent with our theoretical lower bounds. Together, our results identify fundamental bottlenecks in inference-time compute through CoT and offer a principled tool for analyzing optimal reasoning length.

URL PDF HTML ☆

赞 0 踩 0

2602.01058 2026-05-29 cs.LG cs.AI cs.CL 版本更新

Good SFT Optimizes for SFT, Better SFT Prepares for Reinforcement Learning

好的SFT优化SFT，更好的SFT为强化学习做准备

Dylan Zhang, Yufeng Xu, Haojin Wang, Qingzhi Chen, Hao Peng

发表机构 * University of Illinois Urbana-Champaign（伊利诺伊大学厄巴纳-香槟分校）； New York University (Shanghai)（纽约大学（上海））

AI总结针对当前SFT-RL流程中离线SFT数据分布与在线RL策略分布不匹配的问题，提出基于策略评估的离线学习损失重加权方法PEAR，通过重要性采样重加权SFT损失，提升后续RL训练效果。

详情

AI中文摘要

推理大语言模型的后训练是一个整体过程，通常包括离线SFT阶段和后续的在线强化学习（RL）阶段。然而，SFT通常被孤立地优化，仅追求最大化SFT性能。我们表明，在相同的RL训练后，从更强的SFT检查点初始化的模型可能显著劣于从较弱检查点初始化的模型。我们将此归因于当前SFT-RL流程中典型的错配：生成离线SFT数据的分布可能与在线RL期间优化的策略（该策略从其自身的rollout中学习）存在显著差异。我们提出PEAR（基于策略评估的离线学习损失重加权算法），这是一种在SFT阶段纠正此错配并让模型更好地为RL做准备的方法。PEAR使用重要性采样来重加权SFT损失，具有三种变体，分别在token、块和序列级别操作。它可以用于增强标准SFT目标，并且一旦收集到离线数据的概率，仅需很少的额外训练开销。我们在可验证推理游戏和数学推理任务上对Qwen 2.5和3以及DeepSeek蒸馏模型进行了控制实验。PEAR在标准SFT基础上持续提升了RL后性能，在AIME2025上pass@8增益高达14.6%。我们的结果表明，通过设计和评估SFT时考虑下游RL而非孤立进行，PEAR是迈向更全面的大语言模型后训练的有效一步。

英文摘要

Post-training of reasoning LLMs is a holistic process that typically consists of an offline SFT stage followed by an online reinforcement learning (RL) stage. However, SFT is often optimized in isolation to maximize SFT performance alone. We show that, after identical RL training, models initialized from stronger SFT checkpoints can significantly underperform those initialized from weaker ones. We attribute this to a mismatch typical in current SFT-RL pipelines: the distribution that generates the offline SFT data can differ substantially from the policy optimized during online RL, which learns from its own rollouts. We propose PEAR (Policy Evaluation-inspired Algorithm for Offline Learning Loss Re-weighting), an SFT-stage method that corrects this mismatch and better prepares the model for RL. PEAR uses importance sampling to reweight the SFT loss, with three variants operating at the token, block, and sequence levels. It can be used to augment standard SFT objectives and incurs little additional training overhead once probabilities for the offline data are collected. We conduct controlled experiments on verifiable reasoning games and mathematical reasoning tasks on Qwen 2.5 and 3 and DeepSeek-distilled models. PEAR consistently improves post-RL performance over canonical SFT, with pass at 8 gains up to a 14.6 percent on AIME2025. Our results suggest that PEAR is an effective step toward more holistic LLM post-training by designing and evaluating SFT with downstream RL in mind rather than in isolation.

URL PDF HTML ☆

赞 0 踩 0

2601.22531 2026-05-29 cs.LG cs.AI 版本更新

Learn from A Rationalist: Distilling Intermediate Interpretable Rationales

向理性主义者学习：蒸馏中间可解释原理

Jiayi Dai, Randy Goebel

发表机构 * Department of Computing Science, University of Alberta, Edmonton, Canada（阿尔伯塔大学计算机科学系，加拿大埃德蒙顿）； Alberta Machine Intelligence Institute, Edmonton, Canada（阿尔伯塔机器智能研究所，加拿大埃德蒙顿）

AI总结提出REKD方法，通过知识蒸馏将教师模型的可解释原理和预测传授给学生模型，提升基于较弱神经网络的可解释原理提取模型的预测性能。

Comments Accepted to the 43rd International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

由于深度神经网络（DNN）的广泛使用，尤其是在高风险领域，DNN的可解释性受到了越来越多的关注。原理提取（RE）的总体思想是通过选择-预测架构为DNN提供一个可解释的设计框架，其中两个神经网络分别联合学习进行特征选择和预测。仅依赖于最终任务预测的远程监督，学习选择特征子集（或原理）的过程需要在所有可能的特征组合空间中进行搜索，这在计算上具有挑战性，当基础神经网络能力不足时甚至更加困难。为了提高基于能力较弱或较小神经网络（即学生）的RE模型的预测性能，我们提出了REKD（基于知识蒸馏的原理提取），其中学生RE模型除了自身的RE优化外，还从教师（即理性主义者）的原理和预测中学习。这种对RE的结构调整与人类如何从可解释和可验证的知识中有效学习的方式高度一致。由于该方法与神经模型无关，任何黑盒神经网络都可以作为骨干模型集成。为了证明REKD的可行性，我们使用BERT和视觉变换器（ViT）模型的多种变体进行了实验。我们在语言和视觉分类数据集（即IMDB电影评论、CIFAR 10和CIFAR 100）上的实验表明，REKD显著提高了学生RE模型的预测性能。

英文摘要

Because of the pervasive use of deep neural networks (DNNs), especially in high-stakes domains, the interpretability of DNNs has received increased attention. The general idea of rationale extraction (RE) is to provide an interpretable-by-design framework for DNNs via a select-predict architecture where two neural networks learn jointly to perform feature selection and prediction, respectively. Given only the remote supervision from the final task prediction, the process of learning to select subsets of features (or rationales) requires searching in the space of all possible feature combinations, which is computationally challenging and even harder when the base neural networks are not sufficiently capable. To improve the predictive performance of RE models that are based on less capable or smaller neural networks (i.e., the students), we propose REKD (Rationale Extraction with Knowledge Distillation) where a student RE model learns from the rationales and predictions of a teacher (i.e., a rationalist) in addition to the student's own RE optimization. This structural adjustment to RE aligns well with how humans could learn effectively from interpretable and verifiable knowledge. Because of the neural-model agnostic nature of the method, any black-box neural network could be integrated as a backbone model. To demonstrate the viability of REKD, we conduct experiments with multiple variants of BERT and vision transformer (ViT) models. Our experiments across language and vision classification datasets (i.e., IMDB movie reviews, CIFAR 10 and CIFAR 100) show that REKD significantly improves the predictive performance of the student RE models.

URL PDF HTML ☆

赞 0 踩 0

2601.22347 2026-05-29 cs.LG cs.AI 版本更新

Pushing the Limits of Block Rotations in Post-Training Quantization

推动后训练量化中块旋转的极限

Sai Sanjeet, Ian Colbert, Pablo Monteagudo-Lago, Giuseppe Franco, Yaman Umuroglu, Nicholas J. Fraser

发表机构 * Advanced Micro Devices, Inc. (AMD)（Advanced Micro Devices公司）； State University of New York at Buffalo（纽约州立大学布法罗分校）； Norwegian University of Science（挪威科学与技术大学）

AI总结本文提出PeRQ框架，通过置换和旋转重新分布激活值，以克服块旋转在抑制异常值时的几何限制，显著提升后训练量化精度。

详情

AI中文摘要

最近的后训练量化（PTQ）方法采用块旋转来在舍入前扩散异常值。虽然这减少了在线全向量旋转的开销，但块结构对异常值抑制的影响仍知之甚少。为填补这一空白，我们首次对块Hadamard旋转的异常值抑制进行了系统的非渐近分析。我们的分析表明，异常值抑制从根本上受限于输入向量的几何结构。特别地，在确定性最坏情况下，当旋转前的ℓ1范数质量在块间均匀分布时，旋转后的异常值最小。受这些见解的启发，我们引入了PeRQ（置换、旋转、然后量化），一个在旋转前通过置换重新分布激活质量的PTQ框架。我们提出了一种贪婪质量扩散算法，通过均衡期望的块间ℓ1范数来校准置换。为避免增加推理开销，我们识别了Transformer架构中置换等变区域，在部署前将这些置换合并到模型权重中。实验表明，PeRQ在所有块大小上一致地提高了精度，在将Llama3 1B量化为INT4且块大小为16时，恢复了全向量旋转困惑度的90%，而未经置换时仅为46%。

英文摘要

Recent post-training quantization (PTQ) methods have adopted block rotations to diffuse outliers prior to rounding. While this reduces the overhead of online full-vector rotations, the effect of block structure on outlier suppression remains poorly understood. To fill this gap, we present the first systematic, non-asymptotic analysis of outlier suppression for block Hadamard rotations. Our analysis reveals that outlier suppression is fundamentally limited by the geometry of the input vector. In particular, in the deterministic worst case, post-rotation outliers are minimized when the pre-rotation $\ell_1$ norm mass is evenly distributed across blocks. Guided by these insights, we introduce PeRQ (Permute, Rotate, then Quantize), a PTQ framework that redistributes activation mass via permutations prior to rotation. We propose a greedy mass diffusion algorithm to calibrate permutations by equalizing the expected blockwise $\ell_1$ norms. To avoid adding inference overhead, we identify permutation-equivariant regions in transformer architectures to merge these permutations into model weights before deployment. Experiments show that PeRQ consistently improves accuracy across all block sizes, recovering up to 90% of the full-vector rotation perplexity when quantizing Llama3 1B to INT4 with block size 16, compared to 46% without permutations.

URL PDF HTML ☆

赞 0 踩 0

2601.21725 2026-05-29 cs.CL cs.LG 版本更新

Procedural Pretraining: Warming Up Language Models with Abstract Data

程序化预训练：用抽象数据预热语言模型

Liangze Jiang, Zachary Shinnick, Anton van den Hengel, Hemanth Saratchandran, Damien Teney

发表机构 * EPFL（苏黎世联邦理工学院）； Idiap Research Institute（伊迪普研究机构）； AIML, Adelaide University（人工智能实验室，阿德莱德大学）

AI总结提出程序化预训练方法，通过在抽象结构化数据（如形式语言生成的程序数据）上预训练语言模型，显著提升其推理能力并加速后续语义知识学习，实验表明仅需0.1-0.3%的程序数据即可超越标准预训练。

Comments ICML 2026. Project page: https://zlshinnick.github.io/procedural-pretraining-page/

详情

AI中文摘要

直接在网络规模语料库上预训练语言模型是当前的主流范式。我们研究了一种替代方案：首先让模型接触抽象结构化数据，以简化后续丰富语义知识的获取，类似于人类在学习高级推理之前先学习简单逻辑和数学。我们关注由形式语言和其他简单算法生成的程序数据作为此类抽象数据。首先，我们诊断了不同形式的程序数据能够提升的算法技能，通常效果显著。例如，当模型在Dyck序列（平衡括号）上预训练时，上下文召回（大海捞针）的准确率从10%跃升至98%。其次，我们研究了这些增益如何反映在更大模型（高达1.3B参数）的预训练中。我们发现，仅在前端加入0.1%至0.3%的程序数据，就能显著优于在自然语言、代码和非正式数学（C4、CodeParrot和DeepMind-Math数据集）上的标准预训练。值得注意的是，这也使得模型仅需原始数据的55/67/86%即可达到相同的损失值，从而相应地减少FLOPs。第三，我们探索了这些收益背后的机制，发现程序化预训练在注意力层和MLP层中都注入了非平凡的结构。前者对于结构化领域（如代码）尤为重要，后者对于语言领域重要。最后，我们为组合多种形式的程序数据铺平了道路。我们的结果表明，程序化预训练是一种简单、轻量级的方法，能够提升性能并加速语言模型预训练，最终揭示了在LLM中将知识获取与推理分离的前景。

英文摘要

Pretraining language models directly on web-scale corpora is the de facto paradigm. We study an alternative where the model is initially exposed to abstract structured data to ease the subsequent acquisition of rich semantic knowledge, much like humans learning simple logic and mathematics before higher reasoning. We focus on procedural data, generated by formal languages and other simple algorithms, as such abstract data. We first diagnose the algorithmic skills that different forms of procedural data can improve, often significantly. For example, the accuracy of context recall (Needle-in-a-haystack) jumps from 10 to 98% when a model is pretrained on Dyck sequences (balanced brackets). Second, we study how these gains are reflected in pretraining larger models (up to 1.3B). We find that front-loading as little as 0.1 to 0.3% procedural data significantly outperforms standard pretraining on natural language, code, and informal mathematics (C4, CodeParrot, and DeepMind-Math datasets). Notably, this also enables the models to reach the same loss value with only 55/67/86% of the original data and thus a comparable reduction in FLOPs. Third, we explore the mechanisms behind the benefits and find that procedural pretraining instills non-trivial structure in both attention and MLP layers. The former is particularly important for structured domains (e.g. code), and the latter for language. Finally, we lay a path for combining multiple forms of procedural data. Our results show that procedural pretraining is a simple, lightweight means of improving performance and accelerating language model pretraining, ultimately suggesting the promise of disentangling knowledge acquisition from reasoning in LLMs.

URL PDF HTML ☆

赞 0 踩 0

2601.21243 2026-05-29 math.OC cs.LG cs.NA math.NA 版本更新

Solving the Offline and Online Min-Max Problem of Non-smooth Submodular-Concave Functions: A Zeroth-Order Approach

求解非光滑子模-凹函数的离线和在线极小极大问题：一种零阶方法

Amir Ali Farzin, Yuen-Man Pun, Philipp Braun, Tyler Summers, Iman Shames

发表机构 * School of Engineering, Australian National University, Canberra, Australia（澳大利亚国立大学工程学院，澳大利亚堪培拉）； Department of Electrical and Electronic Engineering, University of Melbourne, Melbourne, Australia（墨尔本大学电气与电子工程系，澳大利亚墨尔本）； Department of Mechanical Engineering, University of Texas at Dallas, Texas, USA（德克萨斯大学达拉斯分校机械工程系，美国德克萨斯）

AI总结针对目标函数关于最小化变量非光滑子模、关于最大化变量凹的极小极大问题，提出一种基于Lovász扩展次梯度和高斯平滑的零阶方法，证明离线情形下收敛到ε-鞍点，在线情形下达到O(√N P̄_N)对偶间隙。

2601.20255 2026-05-29 cs.LG cs.CL cs.SE 版本更新

HE-SNR: Uncovering Latent Logic via Entropy for Guiding Mid-Training on SWE-bench

HE-SNR：通过熵揭示潜在逻辑以指导SWE-bench上的中期训练

Yueyang Wang, Jiawei Fu, Baolong Bi, Xili Wang, Xiaoqing Liu

发表机构 * School of Mathematical Sciences, Peking University, Beijing, China（北京大学数学科学学院）； Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China（中国科学院计算技术研究所）

AI总结针对SWE-bench基准，提出基于熵压缩假说的HE-SNR指标，通过细粒度熵分析指导中期训练数据筛选，在高达560B参数模型上验证有效性。

Comments Accepted at ICML 2026. 21 pages, 15 figures

详情

AI中文摘要

SWE-bench已成为评估大型语言模型在复杂软件工程任务中能力的主要基准。虽然这些能力主要在中期训练阶段获得，并在监督微调（SFT）期间被激发，但目前仍然缺乏能够有效指导中期训练的指标。诸如困惑度（PPL）等标准指标受到“长上下文税”的影响，且与下游SWE性能的相关性较弱。在本文中，我们首先引入严格的数据过滤策略来弥补这一差距。关键地，我们提出了熵压缩假说，将智能重新定义为不是通过标量Top-1压缩，而是通过将不确定性结构化为低阶的熵压缩状态（“合理犹豫”）的能力。基于这种细粒度熵分析，我们制定了一个新的指标，HE-SNR（高熵信噪比）。我们在不同上下文窗口（32K/128K）下对高达560B参数的模型验证了我们的方法。这项工作为优化LLM在复杂工程领域的潜在能力提供了理论基础和实用工具。

英文摘要

SWE-bench has emerged as the premier benchmark for evaluating Large Language Models on complex software engineering tasks. While these capabilities are fundamentally acquired during the mid-training phase and subsequently elicited during Supervised Fine-Tuning (SFT), there remains a critical deficit in metrics capable of guiding mid-training effectively. Standard metrics such as Perplexity (PPL) are compromised by the "Long-Context Tax" and exhibit weak correlation with downstream SWE performance. In this paper, we bridge this gap by first introducing a rigorous data filtering strategy. Crucially, we propose the Entropy Compression Hypothesis, redefining intelligence not by scalar Top-1 compression, but by the capacity to structure uncertainty into Entropy-Compressed States of low orders ("reasonable hesitation"). Grounded in this fine-grained entropy analysis, we formulate a novel metric, HE-SNR (High-Entropy Signal-to-Noise Ratio). We validate our approach on models with up to 560B parameters across different context windows (32K/128K). This work provides both the theoretical foundation and practical tools for optimizing the latent potential of LLMs in complex engineering domains.

URL PDF HTML ☆

赞 0 踩 0

2601.18728 2026-05-29 cs.LG math.DG math.OC math.ST stat.TH 版本更新

Riemannian AmbientFlow: Towards Simultaneous Manifold Learning and Generative Modeling from Corrupted Data

黎曼环境流：面向从损坏数据同时进行流形学习和生成建模

Willem Diepeveen, Oscar Leong

发表机构 * Department of Mathematics, University of California, Los Angeles（数学系，加州大学洛杉矶分校）； Department of Statistics and Data Science, University of California, Los Angeles（统计与数据科学系，加州大学洛杉矶分校）

AI总结提出Riemannian AmbientFlow框架，通过变分推断和数据驱动黎曼几何，从损坏观测中同时学习概率生成模型和非线性数据流形，并理论保证误差可控与双Lipschitz流形参数化。

详情

AI中文摘要

现代生成建模方法在从干净样本学习复杂数据分布方面表现出强大性能。然而，在许多科学和成像应用中，干净样本不可用，只能观测到噪声或线性损坏的测量值。此外，数据中存在的潜在结构（如流形几何）对于进一步的科学分析至关重要。在这项工作中，我们引入了Riemannian AmbientFlow，一个直接从损坏观测中同时学习概率生成模型和底层非线性数据流形的框架。基于AmbientFlow的变分推断框架，我们的方法结合了由归一化流引起的数据驱动黎曼几何，通过拉回度量和黎曼自编码器提取流形结构。我们建立了理论保证，表明在适当的几何正则化和测量条件下，学习到的模型以可控误差恢复底层数据分布，并产生光滑的双Lipschitz流形参数化。我们进一步证明，所得的光滑解码器可以作为具有恢复保证的逆问题的原则性生成先验。我们在低维合成流形和MNIST上实证验证了我们的方法。

英文摘要

Modern generative modeling methods have demonstrated strong performance in learning complex data distributions from clean samples. In many scientific and imaging applications, however, clean samples are unavailable, and only noisy or linearly corrupted measurements can be observed. Moreover, latent structures, such as manifold geometries, present in the data are important to extract for further downstream scientific analysis. In this work, we introduce Riemannian AmbientFlow, a framework for simultaneously learning a probabilistic generative model and the underlying, nonlinear data manifold directly from corrupted observations. Building on the variational inference framework of AmbientFlow, our approach incorporates data-driven Riemannian geometry induced by normalizing flows, enabling the extraction of manifold structure through pullback metrics and Riemannian Autoencoders. We establish theoretical guarantees showing that, under appropriate geometric regularization and measurement conditions, the learned model recovers the underlying data distribution up to a controllable error and yields a smooth, bi-Lipschitz manifold parametrization. We further show that the resulting smooth decoder can serve as a principled generative prior for inverse problems with recovery guarantees. We empirically validate our approach on low-dimensional synthetic manifolds and on MNIST.

URL PDF HTML ☆

赞 0 踩 0

2601.12699 2026-05-29 cs.LG cs.SY eess.SY 版本更新

Bandit Algorithms for Deep Brain Stimulation

深度脑刺激的赌博机算法

Arkaprava Gupta, Nicholas Carter, William Zellers, Prateek Ganguli, Benedikt Dietrich, Vibhor Krishna, Parasara Sridhar Duggirala, Samarjit Chakraborty

发表机构 * Department of Computer Science, UNC Chapel Hill（UNC夏洛茨维尔大学计算机科学系）； Hochschule München（慕尼黑应用科学大学）； Department of Neurosurgery, UNC Chapel Hill（UNC夏洛茨维尔大学神经外科系）； TU Munich Institute for Advanced Study（慕尼黑技术大学高级研究学院）

AI总结提出基于时间与阈值触发的剪枝多臂赌博机算法，无需离线训练，在抑制病理性β波段活动和降低刺激能耗方面优于深度强化学习方法，并验证了其在资源受限植入式系统上的可行性。

Comments Accepted to the ACM/IEEE 17th International Conference on Cyber-Physical Systems (ICCPS) 2026

详情

AI中文摘要

深度脑刺激（DBS）是帕金森病的有效治疗方法，但传统的固定参数刺激会降低电池寿命并引起副作用，同时无法适应变化的神经动力学。最近的强化学习方法提高了适应性，但大多数依赖深度神经网络，需要离线训练且计算成本过高，不适合植入式硬件。本文提出了一种基于时间与阈值触发的剪枝多臂赌博机（T3P MAB）算法的资源意识自适应DBS框架。该方法联合调节刺激频率和幅度，避免预先训练，并且足够透明以支持临床医生指导的调整。使用计算基底节-丘脑模型，我们展示了T3P比竞争的MAB方法收敛更快，并且在抑制病理性β波段活动方面优于深度强化学习基线，同时降低刺激功率。我们在不同的微控制器上实现了该方法，并报告了详细的能量测量，显示在不到两分钟内收敛，适合资源受限的植入式系统。这些结果支持轻量级赌博机控制作为实现个性化、节能DBS的实用途径。

英文摘要

Deep Brain Stimulation (DBS) is an effective treatment for Parkinson's disease, but conventional fixed-parameter stimulation can reduce battery life and cause side effects while failing to adapt to changing neural dynamics. Recent reinforcement learning approaches improve adaptability, yet most rely on deep neural networks that require offline training and are computationally too expensive for implantable hardware. This paper presents a resource-conscious adaptive DBS framework based on a Time- and Threshold-Triggered Pruned Multi-Armed Bandit (T3P MAB) algorithm. The proposed method jointly tunes stimulation frequency and amplitude, avoids prior training, and remains transparent enough to support clinician-guided adjustment. Using a computational basal ganglia-thalamic model, we show that T3P converges faster than competing MAB methods and outperforms deep-RL baselines in suppressing pathological beta-band activity while reducing stimulation power. We implemented it on different microcontrollers and report detailed energy measurements, showing convergence in under two minutes and suitability for resource-constrained implantable systems. These results support lightweight bandit-based control as a practical path toward personalized, energy-efficient DBS.

URL PDF HTML ☆

赞 0 踩 0

2601.08654 2026-05-29 cs.CL cs.AI cs.LG 版本更新

BITS for GAPS：用于层次高斯过程代理的贝叶斯信息论采样

Kyla D. Jones, Alexander W. Dowling

发表机构 * Department of Chemical and Biomolecular Engineering, University of Notre Dame, Notre Dame, IN 46556, USA（化学与生物分子工程系，诺特大学）

AI总结提出BITS for GAPS框架，通过贝叶斯层次建模将超参数不确定性传播到采样准则中，实现基于高斯过程代理模型的信息论实验设计，并在汽液平衡案例中验证其提升预测精度和信息增益的效果。

详情

DOI: 10.1016/j.compchemeng.2026.109650
Journal ref: Computers & Chemical Engineering, 197, 109041 (2026)

AI中文摘要

我们引入了用于层次高斯过程代理的贝叶斯信息论采样（BITS for GAPS），这是一个框架，能够实现基于高斯过程的代理模型的信息论实验设计。与标准方法（在采集函数中使用固定或点估计的超参数）不同，我们的方法通过贝叶斯层次建模将超参数不确定性传播到采样准则中。在该框架中，潜在函数接受高斯过程先验，而超参数被赋予额外的先验以捕捉建模者对控制物理现象的知识。因此，采集函数同时包含了来自潜在函数及其超参数的不确定性，确保采样由数据稀缺性和模型不确定性共同指导。我们进一步在此背景下建立了理论结果：后验微分熵的闭式近似和下界。我们通过一个汽液平衡案例研究展示了该框架在混合建模中的实用性。具体来说，我们为二元混合物中的潜在活度系数构建了一个代理模型。通过将代理嵌入扩展形式的拉乌尔定律中，我们构建了一个混合模型。该混合模型随后用于指导蒸馏设计。该案例研究展示了如何将部分物理知识转化为层次高斯过程代理。它还表明，使用BITS for GAPS通过瞄准Wilson活度模型的高不确定性区域，增加了期望信息增益和预测准确性。总体而言，BITS for GAPS是一个用于复杂物理系统中自适应数据采集的通用不确定性感知框架。

英文摘要

We introduce Bayesian Information-Theoretic Sampling for hierarchical GAussian Process Surrogates (BITS for GAPS), a framework enabling information-theoretic experimental design of Gaussian process-based surrogate models. Unlike standard methods, which use fixed or point-estimated hyperparameters in acquisition functions, our approach propagates hyperparameter uncertainty into the sampling criterion through Bayesian hierarchical modeling. In this framework, a latent function receives a Gaussian process prior, while hyperparameters are assigned additional priors to capture the modeler's knowledge of the governing physical phenomena. Consequently, the acquisition function incorporates uncertainties from both the latent function and its hyperparameters, ensuring that sampling is guided by both data scarcity and model uncertainty. We further establish theoretical results in this context: a closed-form approximation and a lower bound of the posterior differential entropy. We demonstrate the framework's utility for hybrid modeling with a vapor-liquid equilibrium case study. Specifically, we build a surrogate model for latent activity coefficients in a binary mixture. We construct a hybrid model by embedding the surrogate into an extended form of Raoult's law. This hybrid model then informs distillation design. This case study shows how partial physical knowledge can be translated into a hierarchical Gaussian process surrogate. It also shows that using BITS for GAPS increases expected information gain and predictive accuracy by targeting high-uncertainty regions of the Wilson activity model. Overall, BITS for GAPS is a generalized uncertainty-aware framework for adaptive data acquisition in complex physical systems.

URL PDF HTML ☆

赞 0 踩 0

2511.14584 2026-05-29 cs.LG cs.AI 版本更新

ReflexGrad: Within-Episode Failure Recovery in LLM Agents via Progress-Gated Dual-Process Routing

ReflexGrad: 基于进度门控双过程路由的LLM智能体片段内故障恢复

Ankush Kadu, Aswanth Krishnan

发表机构 * GitHub

AI总结提出ReflexGrad双过程架构，通过进度门控路由在无演示条件下实现LLM智能体片段内故障恢复，显著提升任务成功率。

Comments 18 pages, 4 figures, 10 tables. Accepted at ICML 2026 FoGen Workshop

详情

AI中文摘要

一种面向CNN的基于LRP剪枝的精度感知扩展，以防止数据稀缺迁移学习中的级联精度下降

Daisuke Yasui, Toshitaka Matsuki, Hiroshi Sato

发表机构 * Mathematics and Computer Science National Defense Academy of Japan（日本防卫大学校数学与计算机科学系）

AI总结针对数据稀缺迁移学习中预训练CNN剪枝导致的级联精度下降问题，提出一种精度感知的剪枝控制机制，通过动态调整剪枝率和顺序来抑制精度下降，提升模型压缩后的分类性能。

Comments Accepted to scientific reports. The title was revised during the peer review process

详情

DOI: 10.1038/s41598-026-47992-8

AI中文摘要

在大规模数据集（如ImageNet）上预训练的卷积神经网络（CNN）被广泛用作特征提取器，从稀缺数据中构建特定任务的高精度分类模型。在此类场景中，由于数据稀缺，微调预训练CNN变得困难，因此必须使用固定权重。然而，当权重固定时，许多对目标任务无贡献的滤波器仍保留在模型中，导致不必要的冗余和效率降低。因此，需要有效的方法通过剪枝对推理不必要的滤波器来减小模型大小。为此，已有研究提出了利用逐层相关性传播（LRP）的方法。LRP量化每个滤波器对推理结果的贡献，从而可以剪枝低相关性的滤波器。然而，现有基于LRP的剪枝方法被观察到会导致级联精度下降。在本研究中，我们为现有基于LRP的滤波器剪枝方法引入了一种精度感知的剪枝控制机制，该机制通过使用类别精度的调和平均数动态调整剪枝率和剪枝顺序，抑制级联精度下降，并在小数据环境下压缩预训练模型的同时保持任务特定性能。我们证明，该控制机制有效缓解了级联精度下降，与现有基于LRP的剪枝方法相比，实现了更高的分类精度，将VGG16的精度-剪枝率曲线下的类别平均面积（AUC）比传统基于LRP的方法提高了约15%。

英文摘要

Convolutional Neural Networks (CNNs) pre-trained on large-scale datasets such as ImageNet are widely used as feature extractors to construct high-accuracy classification models from scarce data for specific tasks. In such scenarios, fine-tuning the pre-trained CNN is difficult due to data scarcity, necessitating the use of fixed weights. However, when the weights are kept fixed, many filters that do not contribute to the target task remain in the model, leading to unnecessary redundancy and reduced efficiency. Therefore, effective methods are needed to reduce model size by pruning filters that are unnecessary for inference. To address this, approaches utilizing Layer-wise Relevance Propagation (LRP) have been proposed. LRP quantifies the contribution of each filter to the inference result, enabling the pruning of filters with low relevance. However, existing LRP-based pruning methods have been observed to cause cascading accuracy degradation. In this study, we introduce an accuracy-aware pruning control mechanism for existing LRP-based filter pruning methods, which suppresses cascading accuracy degradation by dynamically adjusting the pruning rate and the pruning order using the harmonic mean of class accuracy, and compresses the pre-trained model while preserving task-specific performance in a small-data environment. We demonstrate that this control mechanism effectively mitigates cascading accuracy degradation and achieves higher classification accuracy compared to existing LRP-based pruning methods, improving the class-averaged area under the accuracy-pruning-rate curve (AUC) of VGG16 by approximately 15\% over conventional LRP-based approaches.

URL PDF HTML ☆

赞 0 踩 0

2511.04934 2026-05-29 cs.LG 版本更新

Leak@$k$: Unlearning Does Not Make LLMs Forget Under Probabilistic Decoding

Leak@$k$：在概率解码下，遗忘并未使LLM真正忘记

Hadi Reisizadeh, Jiajun Ruan, Yiwei Chen, Soumyadeep Pal, Sijia Liu, Mingyi Hong

发表机构 * University of Minnesota（明尼苏达大学）； Michigan State University（密歇根州立大学）； IBM Research（IBM研究院）

AI总结本文发现现有大语言模型遗忘方法在概率解码下无法真正删除敏感信息，并提出新指标leak@$k$和算法RULE来评估和缓解知识泄露。

详情

AI中文摘要

大型语言模型（LLM）中的遗忘对于法规遵从和构建避免产生私人、有毒、非法或受版权保护内容的伦理生成式AI系统至关重要。尽管进展迅速，但在这项工作中，我们表明 extit{几乎所有}现有的遗忘方法在实践中都未能实现真正的遗忘。具体来说，虽然在确定性（贪婪）解码下对这些“已遗忘”模型的评估通常表明使用标准基准成功移除了知识，但我们表明当使用标准概率解码对模型进行采样时，敏感信息可靠地重新出现。为了严格捕捉这一漏洞，我们引入了 exttt{leak@$k$}，一种新的元评估指标，用于量化在现实解码策略下从模型生成$k$个样本时遗忘知识重新出现的可能性。使用三个广泛采用的基准TOFU、MUSE和WMDP，我们使用 exttt{leak@$k$}指标进行了首次大规模、系统性的遗忘可靠性研究。我们的发现表明，知识泄露在方法和任务中持续存在，强调当前最先进的遗忘技术仅提供有限的遗忘。我们提出了一种算法，称为基于Leak@$k$指标的鲁棒遗忘（ exttt{RULE}）来解决这一问题。我们证明， exttt{RULE}为TOFU基准提供了一个已遗忘模型，在大量生成样本下没有信息泄露。在MUSE基准上， exttt{RULE}在大多数采样预算$k$下，在 exttt{leak@$k$}指标上优于最先进的遗忘方法。代码可在https://github.com/OptimAI-Lab/Leak-k获取。

英文摘要

Unlearning in large language models (LLMs) is critical for regulatory compliance and for building ethical generative AI systems that avoid producing private, toxic, illegal, or copyrighted content. Despite rapid progress, in this work, we show that \textit{almost all} existing unlearning methods fail to achieve true forgetting in practice. Specifically, while evaluations of these `unlearned' models under deterministic (greedy) decoding often suggest successful knowledge removal using standard benchmarks, we show that sensitive information reliably resurfaces when models are sampled with standard probabilistic decoding. To rigorously capture this vulnerability, we introduce \texttt{leak@$k$}, a new meta-evaluation metric that quantifies the likelihood of forgotten knowledge reappearing when generating $k$ samples from the model under realistic decoding strategies. Using three widely adopted benchmarks, TOFU, MUSE, and WMDP, we conduct the first large-scale, systematic study of unlearning reliability using \texttt{leak@$k$} metric. Our findings demonstrate that knowledge leakage persists across methods and tasks, underscoring that current state-of-the-art (SOTA) unlearning techniques provide only limited forgetting. We propose an algorithm, termed Robust Unlearning under LEak@$k$ metric (\texttt{RULE}) to address this concern. We demonstrate that \texttt{RULE} provides an unlearned model for TOFU benchmark with no information leakage for a large number of generation samples. On the MUSE benchmark, \texttt{RULE} outperforms SOTA unlearning methods under the \texttt{leak@$k$} metric across most sampling budgets $k$. Codes are available at https://github.com/OptimAI-Lab/Leak-k.

URL PDF HTML ☆

赞 0 踩 0

2510.16060 2026-05-29 cs.LG cs.AI stat.ME stat.ML 版本更新

Beyond Accuracy: Are Time Series Foundation Models Well-Calibrated?

超越准确性：时间序列基础模型是否良好校准？

Coen Adler, Yuxin Chang, Felix Draxler, Samar Abdi, Padhraic Smyth

发表机构 * Department of Computer Science（计算机科学系）； Department of Statistics（统计学系）； Google, Irvine（谷歌（伊文斯堡））

AI总结本文系统评估了五个时间序列基础模型和两个基线的校准特性，发现基础模型校准优于基线且无系统性过度自信或信心不足。

Comments Published as a conference paper at ICLR 2026

详情

Journal ref: Proceedings of ICLR 2026

AI中文摘要

最近时间序列数据基础模型的发展引起了在各种应用中使用此类模型的广泛兴趣。尽管基础模型实现了最先进的预测性能，但它们的校准特性仍然相对未被充分探索，尽管校准在许多实际应用中可能至关重要。在本文中，我们研究了五个近期时间序列基础模型和两个竞争基线的校准相关特性。我们进行了一系列系统评估，包括模型校准（即过度自信或信心不足）、不同预测头的影响以及长期自回归预测下的校准。我们发现时间序列基础模型始终比基线模型校准得更好，并且往往不会系统性地过度自信或信心不足，这与在其他深度学习模型中常见的过度自信形成对比。

英文摘要

The recent development of foundation models for time series data has generated considerable interest in using such models across a variety of applications. Although foundation models achieve state-of-the-art predictive performance, their calibration properties remain relatively underexplored, despite the fact that calibration can be critical for many practical applications. In this paper, we investigate the calibration-related properties of five recent time series foundation models and two competitive baselines. We perform a series of systematic evaluations assessing model calibration (i.e., over- or under-confidence), effects of varying prediction heads, and calibration under long-term autoregressive forecasting. We find that time series foundation models are consistently better calibrated than baseline models and tend not to be either systematically over- or under-confident, in contrast to the overconfidence often seen in other deep learning models.

URL PDF HTML ☆

赞 0 踩 0

2510.12310 2026-05-29 cs.CR cs.LG 版本更新

DeepTrust: Multi-Step Classification through Dissimilar Adversarial Representations for Robust Android Malware Detection

DeepTrust：通过不同对抗表示的多步分类实现鲁棒的Android恶意软件检测

Daniel Pulido-Cortázar, Daniel Gibert, Felip Manyà

发表机构 * Artificial Intelligence Research Institute (IIIA-CSIC)（人工智能研究所（IIIA-CSIC））

AI总结提出DeepTrust元启发式方法，通过级联条件激活的异构分类器序列，最大化内部模型表示差异，在特征空间逃逸攻击下实现鲁棒检测，在2025年IEEE SaTML竞赛中获第一名。

详情

DOI: 10.1016/j.eswa.2026.132961

AI中文摘要

在过去十年中，机器学习已被广泛用于识别恶意Android应用程序。然而，这些方法仍然容易受到对抗样本的攻击，即那些被巧妙操纵以欺骗机器学习模型做出错误预测的样本。本研究提出了DeepTrust，一种新颖的元启发式方法，它将灵活的分类器（如深度神经网络）排列成有序序列，最终决策由单个内部模型根据级联激活的条件做出。在2025年IEEE SaTML会议的鲁棒Android恶意软件检测竞赛中，DeepTrust获得了第一名并取得了最先进的结果，在特征空间逃逸攻击下，其性能比次优竞争对手高出266%。同时，它在非对抗性恶意软件上保持了最高的检测率，假阳性率低于1%。该方法的效果源于最大化内部模型之间学习表示的差异。通过使用诱导数据产生根本不同嵌入的分类器，决策空间对攻击者变得不可预测。这挫败了逃逸攻击固有的迭代扰动过程，从而在不牺牲干净样本准确性的情况下增强了系统的鲁棒性。

英文摘要

Over the last decade, machine learning has been extensively applied to identify malicious Android applications. However, such approaches remain vulnerable against adversarial examples, i.e., examples that are subtly manipulated to fool a machine learning model into making incorrect predictions. This research presents DeepTrust, a novel metaheuristic that arranges flexible classifiers, like deep neural networks, into an ordered sequence where the final decision is made by a single internal model based on conditions activated in cascade. In the Robust Android Malware Detection competition at the 2025 IEEE Conference SaTML, DeepTrust secured the first place and achieved state-of-the-art results, outperforming the next-best competitor by up to 266% under feature-space evasion attacks. This is accomplished while maintaining the highest detection rate on non-adversarial malware and a false positive rate below 1%. The method's efficacy stems from maximizing the divergence of the learned representations among the internal models. By using classifiers inducing fundamentally dissimilar embeddings of the data, the decision space becomes unpredictable for an attacker. This frustrates the iterative perturbation process inherent to evasion attacks, enhancing system robustness without compromising accuracy on clean examples.

URL PDF HTML ☆

赞 0 踩 0

2510.12152 2026-05-29 stat.ML cs.LG 版本更新

Follow-the-Perturbed-Leader for Decoupled Bandits: Best-of-Both-Worlds and Practicality

解耦赌博机的跟随扰动领导者：两全其美与实用性

Chaiwon Kim, Jongyeong Lee, Min-hwan Oh

发表机构 * Seoul National University, Seoul, Korea（首尔国立大学，韩国首尔）； Korea Institute of Science and Technology, Seoul, Korea（韩国科学技术院，韩国首尔）

AI总结针对解耦多臂赌博机问题，提出一种高效的跟随扰动领导者策略，在随机环境下实现常数遗憾，在对抗环境下实现最优O(√KT)遗憾，且避免了凸优化和重采样过程，显著降低计算成本。

Comments Accepted to ICML 2026, 31 pages

详情

AI中文摘要

我们研究了解耦多臂赌博机问题，其中学习者在每一轮分别选择一个臂进行探索，并选择另一个可能不同的臂进行利用。在此设置中，探索臂的损失被观察到但不承担，而利用臂的损失被承担但不被观察到。我们提出了一种高效的跟随扰动领导者（FTPL）策略，该策略在随机环境下实现常数遗憾，在对抗环境下实现最优$O(\sqrt{KT})$遗憾，从而获得两全其美（BOBW）保证。我们方法的一个关键特征是它完全避免了先前BOBW策略所需的凸优化以及FTPL赌博机策略中通常使用的重采样过程。这使得FTPL能够充分发挥其计算效率优势，大幅降低计算成本。我们通过实验证实，我们的策略不仅提高了运行时间，而且在两种环境下都表现出优越的遗憾性能。

分布逆强化学习

Feiyang Wu, Ye Zhao, Anqi Wu

发表机构 * School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, USA（计算科学与工程学院，佐治亚理工学院，美国亚特兰大）； George W. Woodruff School of Mechanical Engineering, Georgia Institute of Technology, Atlanta, USA（乔治·W·伍德鲁夫机械工程学院，佐治亚理工学院，美国亚特兰大）

AI总结提出一种离线逆强化学习的分布框架，通过最小化一阶随机占优违反并整合扭曲风险度量，联合建模奖励函数的不确定性和回报的完整分布，实现奖励分布和分布感知策略的恢复。

Comments ICML 2026 Oral

详情

AI中文摘要

我们提出了一种用于离线逆强化学习（IRL）的分布框架，该框架联合建模奖励函数的不确定性和回报的完整分布。与恢复确定性奖励估计或仅匹配期望回报的传统IRL方法不同，我们的方法通过最小化一阶随机占优（FSD）违反，从而将扭曲风险度量（DRMs）整合到策略学习中，捕捉专家行为中更丰富的结构，特别是在学习奖励分布方面，使得能够恢复奖励分布和分布感知策略。该公式非常适合行为分析和风险感知模仿学习。理论分析表明，该算法以$\mathcal{O}(\varepsilon^{-2})$的迭代复杂度收敛。在合成基准、真实神经行为数据和MuJoCo控制任务上的实验结果表明，我们的方法恢复了富有表现力的奖励表示，并实现了最先进的性能。

英文摘要

We propose a distributional framework for offline Inverse Reinforcement Learning (IRL) that jointly models uncertainty over reward functions and full distributions of returns. Unlike conventional IRL approaches that recover a deterministic reward estimate or match only expected returns, our method captures richer structure in expert behavior, particularly in learning the reward distribution, by minimizing first-order stochastic dominance (FSD) violations and thus integrating distortion risk measures (DRMs) into policy learning, enabling the recovery of both reward distributions and distribution-aware policies. This formulation is well-suited for behavior analysis and risk-aware imitation learning. Theoretical analysis shows that the algorithm converges with $\mathcal{O}(\varepsilon^{-2})$ iteration complexity. Empirical results on synthetic benchmarks, real-world neurobehavioral data, and MuJoCo control tasks demonstrate that our method recovers expressive reward representations and achieves state-of-the-art performance.

URL PDF HTML ☆

赞 0 踩 0

2510.02480 2026-05-29 cs.AI cs.LG 版本更新

Controlling the Risk of Corrupted Contexts for Language Models via Early-Exiting

通过早退机制控制语言模型中有害上下文的风险

Andrea Wynn, Metod Jazbec, Charith Peris, Rinat Khaziev, Anqi Liu, Daniel Khashabi, Eric Nalisnick

发表机构 * Johns Hopkins University（约翰霍普金斯大学）； University of Amsterdam（阿姆斯特丹大学）； Amazon AGI（亚马逊人工智能实验室）； Amazon Alexa（亚马逊Alexa）

AI总结提出一种结合动态早退预测与无分布风险控制的方法，限制有害上下文对语言模型性能的退化，并在有益上下文中实现计算效率提升。

Comments Accepted to ICML 2026

详情

AI中文摘要

大型语言模型（LLMs）可能受到有害或不相关上下文的影响，这会显著损害模型在下游任务上的性能。这促使我们设计具有内置机制的原则性方案，以防范此类“垃圾进，垃圾出”场景。我们提出一种新颖方法，限制有害上下文对模型性能的退化程度。首先，我们定义模型的基线“安全”行为——即无任何上下文（零样本）时的模型性能。接着，我们应用无分布风险控制（DFRC）来控制用户提供的上下文将性能降至该安全零样本基线以下的程度。我们通过利用动态早退预测实现这一点，忽略那些最关注不安全输入的后注意力头。最后，我们提出对DFRC的修改，使其既能控制有害输入的风险，又能利用有益输入的性能和效率提升。我们在涵盖上下文学习和开放式问答的9项任务上展示了理论和实证结果，表明我们的方法能有效控制有害上下文的风险，同时在使用有益上下文时实现显著的计算效率提升。

英文摘要

Large language models (LLMs) can be influenced by harmful or irrelevant context, which can significantly harm model performance on downstream tasks. This motivates principled designs in which LLM systems include built-in mechanisms to guard against such "garbage in, garbage out" scenarios. We propose a novel approach to limit the degree to which harmful context can degrade model performance. First, we define a baseline "safe" behavior for the model -- the model's performance given no context at all (zero-shot). Next, we apply distribution-free risk control (DFRC) to control the extent to which the user-provided context can decay performance below this safe zero-shot baseline. We achieve this by leveraging dynamic early exit prediction, ignoring later attention heads that attend the most to the unsafe inputs. Finally, we propose modifications to DFRC that allow it to both control risk for harmful inputs \textit{and} leverage performance and efficiency gains on helpful inputs. We present both theoretical and empirical results across 9 tasks spanning in-context learning and open-ended question answering, showing that our approach can effectively control risk for harmful context and simultaneously achieve substantial computational efficiency gains with helpful context.

URL PDF HTML ☆

赞 0 踩 0

2510.00777 2026-05-29 cs.LG 版本更新

In-Place Feedback: Reliable Refinement for Multi-Turn Expert-LLM Collaboration

原地反馈：多轮专家-LLM协作的可靠精炼方法

Youngbin Choi, Minjong Lee, Saemi Moon, Seunghyuk Cho, Chaehyeon Chung, MoonJeong Park, Dongwoo Kim

发表机构 * Graduate School of Artificial Intelligence, POSTECH（POSTECH人工智能研究生院）； Department of Computer Science and Engineering, POSTECH（POSTECH计算机科学与工程系）

AI总结提出原地反馈交互范式，通过用户直接编辑模型先前响应并让模型从编辑上下文继续生成，在五个推理密集型基准上优于标准多轮反馈且更省token，用户研究证实其能提高最终输出满意度并降低疲劳。

Comments 42pages

详情

AI中文摘要

LLM生成的草稿常包含细微的事实或逻辑错误，但先前研究表明模型难以可靠地整合旨在修正这些错误的多轮反馈。我们提出原地反馈，一种交互范式，其中用户直接编辑模型先前的响应，模型从编辑后的上下文继续生成。在五个推理密集型基准上，原地反馈始终优于标准多轮反馈，同时需要更少的token，我们的细粒度分析表明，它能更可靠地应用修正并将修正传播到后续推理中。一项由领域专家精炼LLM生成摘要的用户研究证实了这些发现：参与者报告了更高的最终输出满意度和显著更低的疲劳感，而结合原地反馈和多轮反馈的混合策略在每个测量维度上得分最高。这些结果表明，直接编辑错误是专家-LLM协作的更有效范式。

英文摘要

LLM-generated drafts often contain subtle factual or logical errors, yet prior work shows that models struggle to reliably integrate multi-turn feedback aimed at fixing them. We propose in-place feedback, an interaction paradigm in which the user directly edits the model's previous response and the model continues generation from the edited context. In-place feedback consistently outperforms standard multi-turn feedback across five reasoning-intensive benchmarks while requiring fewer tokens, and our fine-grained analysis shows that it applies corrections more reliably and propagates them to subsequent reasoning. A user study with domain experts refining LLM-generated summaries corroborates these findings: participants report higher final-output satisfaction and substantially lower fatigue with in-place feedback, and a mixed strategy combining in-place and multi-turn feedback scores highest on every measured dimension. These results suggest that editing errors directly is a more effective paradigm for expert-LLM collaboration.

URL PDF HTML ☆

赞 0 踩 0

2509.21707 2026-05-29 stat.ML cs.LG stat.ME 版本更新

SADA: Safe and Adaptive Aggregation of Multiple Black-Box Predictions in Semi-Supervised Learning

SADA：半监督学习中多个黑箱预测的安全自适应聚合

Jiawei Shan, Zhifeng Chen, Yiming Dong, Yazhen Wang, Jiwei Zhao

发表机构 * Department of Biostatistics & Medical Informatics, University of Wisconsin-Madison（生物统计与医学信息学系，威斯康星大学麦迪逊分校）； Department of Statistics, University of Wisconsin-Madison（统计学系，威斯康星大学麦迪逊分校）

AI总结提出一种安全自适应聚合多个不确定质量黑箱预测的方法，保证不劣于仅用标注数据，并在存在完美预测时实现更快收敛或半参数效率界。

详情

AI中文摘要

半监督学习（SSL）在实践中出现于标注数据稀缺或获取成本高昂，而大量未标注数据易于获取的情况下。随着机器学习技术的广泛采用，使用多种模型和算法（包括深度学习、大语言模型和生成式AI）生成多个预测标签已变得越来越可行。在本文中，我们提出了一种新颖方法，能够安全且自适应地聚合多个质量不确定的黑箱预测，用于推理和预测任务。我们的方法提供两个关键保证：（i）无论预测质量如何，其表现永远不会差于仅使用标注数据；（ii）如果任意一个预测（无需知道是哪一个）完美拟合真实标签，算法会自适应地利用这一点，以实现更快的收敛速度或半参数效率界。我们通过小规模模拟和两项具有不同科学目标的真实数据分析展示了所提算法的有效性。提供了用户友好的R包sada以促进实际实施。

英文摘要

Semi-supervised learning (SSL) arises in practice when labeled data are scarce or expensive to obtain, while large quantities of unlabeled data are readily available. With the growing adoption of machine learning techniques, it has become increasingly feasible to generate multiple predicted labels using a variety of models and algorithms, including deep learning, large language models, and generative AI. In this paper, we propose a novel approach that safely and adaptively aggregates multiple black-box predictions of uncertain quality for both inference and prediction tasks. Our method provides two key guarantees: (i) it never performs worse than using the labeled data alone, regardless of the quality of the predictions; and (ii) if any one of the predictions (without knowing which one) perfectly fits the ground truth, the algorithm adaptively exploits this to achieve either a faster convergence rate or the semiparametric efficiency bound. We demonstrate the effectiveness of the proposed algorithm through small-scale simulations and two real-data analyses with distinct scientific goals. A user-friendly R package, sada, is provided to facilitate practical implementation.

URL PDF HTML ☆

赞 0 踩 0

2509.17208 2026-05-29 cs.LG physics.atm-clus 版本更新

Active Learning for Machine Learning Driven Molecular Dynamics

主动学习驱动的分子动力学机器学习

Kevin Bachelor, Sanya Murdeshwar, Daniel Sabo, Razvan Marinescu

发表机构 * University of California, Santa Cruz（加州大学圣克ruz分校）； GiwoTech Inc.（GiwoTech公司）

AI总结针对机器学习粗粒化势函数在模拟中因采样不足而性能退化的问题，提出基于RMSD的主动学习框架，通过在线查询Oracle生成数据，在保持粗粒化效率的同时修正覆盖缺口，使Chignolin蛋白模型的TICA空间W1指标提升33.05%。

Comments 9 pages, 4 figures, for Neurips Workshop: Machine Learning and the Physical Sciences 2025

详情

AI中文摘要

机器学习粗粒化（CG）势函数速度快，但当模拟到达欠采样的生物分子构象时性能会随时间退化，而生成广泛的全原子（AA）数据来应对这一问题在计算上不可行。我们提出了一种用于分子动力学（MD）中CG神经网络势函数的新型主动学习（AL）框架。该方法基于CGSchNet模型，采用从MD模拟中基于均方根偏差（RMSD）的帧选择，通过在神经网络势函数训练过程中查询预言机来实时生成数据。该框架在保持CG级效率的同时，在RMSD识别的精确覆盖缺口处修正模型。通过训练粗粒化神经网络势函数CGSchNet，我们实验证明该框架探索了先前未见过的构型，并在构象空间中未探索的区域上训练模型。我们的主动学习框架使得在Chignolin蛋白上训练的CGSchNet模型在内部基准测试套件上的时间滞后独立成分分析（TICA）空间中，Wasserstein-1（W1）指标提升了33.05%。

英文摘要

Machine-learned coarse-grained (CG) potentials are fast, but degrade over time when simulations reach under-sampled bio-molecular conformations, and generating widespread all-atom (AA) data to combat this is computationally infeasible. We propose a novel active learning (AL) framework for CG neural network potentials in molecular dynamics (MD). Building on the CGSchNet model, our method employs root mean squared deviation (RMSD)-based frame selection from MD simulations in order to generate data on-the-fly by querying an oracle during the training of a neural network potential. This framework preserves CG-level efficiency while correcting the model at precise, RMSD-identified coverage gaps. By training CGSchNet, a coarse-grained neural network potential, we empirically show that our framework explores previously unseen configurations and trains the model on unexplored regions of conformational space. Our active learning framework enables a CGSchNet model trained on the Chignolin protein to achieve a 33.05\% improvement in the Wasserstein-1 (W1) metric in Time-lagged Independent Component Analysis (TICA) space on an in-house benchmark suite.

URL PDF HTML ☆

赞 0 踩 0

2509.08194 2026-05-29 cs.LG stat.ML 版本更新

Prescribe-then-Select: Adaptive Policy Selection for Contextual Stochastic Optimization

先规定后选择：面向情境随机优化的自适应策略选择

Caio de Prospero Iglesias, Kimberly Villalobos Carballo, Dimitris Bertsimas

AI总结针对情境随机优化中候选策略在协变量空间表现异质的问题，提出Prescribe-then-Select模块化框架，通过构建可行策略库并基于最优策略树集成学习元策略实现数据驱动的自适应选择，在单阶段报童和两阶段运输规划问题中优于单一最优策略。

详情

AI中文摘要

我们解决了情境随机优化中的策略选择问题，其中协变量作为情境信息可用，且决策必须满足严格的可行性约束。在许多情境随机优化场景中，来自不同建模范式的多个候选策略在协变量空间上表现出异质性能，没有单一策略能够统一占优。我们提出了Prescribe-then-Select（PS）模块化框架，该框架首先构建一个可行候选策略库，然后学习一个元策略来为观测到的协变量选择最佳策略。我们使用在训练集上通过交叉验证训练的最优策略树集成来实现元策略，使策略选择完全数据驱动。在两个基准情境随机优化问题——单阶段报童和两阶段运输规划中，PS在协变量空间的异质区域始终优于最佳单一策略，并在不存在这种异质性时收敛到占优策略。所有重现结果的代码可在https://anonymous.4open.science/r/Prescribe-then-Select-TMLR获取。

英文摘要

We address the problem of policy selection in contextual stochastic optimization (CSO), where covariates are available as contextual information and decisions must satisfy hard feasibility constraints. In many CSO settings, multiple candidate policies--arising from different modeling paradigms--exhibit heterogeneous performance across the covariate space, with no single policy uniformly dominating. We propose Prescribe-then-Select (PS), a modular framework that first constructs a library of feasible candidate policies and then learns a meta-policy to select the best policy for the observed covariates. We implement the meta-policy using ensembles of Optimal Policy Trees trained via cross-validation on the training set, making policy choice entirely data-driven. Across two benchmark CSO problems--single-stage newsvendor and two-stage shipment planning--PS consistently outperforms the best single policy in heterogeneous regimes of the covariate space and converges to the dominant policy when such heterogeneity is absent. All the code to reproduce the results can be found at https://anonymous.4open.science/r/Prescribe-then-Select-TMLR.

URL PDF HTML ☆

赞 0 踩 0

2509.05771 2026-05-29 stat.ML cs.LG math.OC 版本更新

Risk-averse Fair Multi-class Classification

风险规避的公平多类分类

Darinka Dentcheva, Xiangyu Tian

发表机构 * Department of Mathematical sciences（数学科学系）； Stevens Institute of Technology（史蒂文斯理工学院）

AI总结基于一致风险度量与系统性风险理论，提出一种适用于噪声、稀缺和标签不可靠数据的风险规避多类分类框架，并通过非线性聚合的系统方法设计两阶段随机规划及正则化分解算法，同时实现公平性增强。

详情

AI中文摘要

我们基于一致风险度量和系统性风险理论开发了一种新的分类框架。所提出的方法适用于数据存在噪声、稀缺（相对于问题维度）且标签可能不可靠的多类问题。在论文的第一部分，我们提供了使用系统性风险模型的基础，并展示了如何将其应用于线性和基于核的多类问题中。我们提出了一种通过非线性聚合的系统理论方法进行更高级的公式化，这导致了一个两阶段随机规划问题。设计了一种风险规避的正则化分解方法来求解该问题。在性能分析中，我们使用一种流行的多类方法作为所提出分类方法的基准。我们通过使用一致风险度量对该方法进行多种推广来说明我们的想法。所提出的风险规避方法的可行性在理论和数值上得到了支持。此外，我们证明了系统性风险度量的应用有助于在分类中强制执行公平性。对所提出模型的公平性进行了仔细的分析和实验。对于所有方法，我们的数值实验表明，它们在训练数据不可靠的情况下具有鲁棒性，并且在未知数据上的表现优于最小化期望分类误差的方法。此外，当类别数量增加时，性能会得到提升。

英文摘要

We develop a new classification framework based on the theory of coherent risk measures and systemic risk. The proposed approach is suitable for multi-class problems when the data is noisy, scarce (relative to the dimension of the problem), and the labeling might be unreliable. In the first part of our paper, we provide the foundation of the use of systemic risk models and show how to apply it in the context of linear and kernel-based multi-class problems. More advanced formulation via a system-theoretic approach with non-linear aggregation is proposed, which leads to a two-stage stochastic programming problem. A risk-averse regularized decomposition method is designed to solve the problem. We use a popular multi-class method as a benchmark in the performance analysis of the proposed classification methods. We illustrate our ideas by proposing several generalization of that method by the use of coherent measures of risk. The viability of the proposed risk-averse methods are supported theoretically and numerically. Additionally, we demonstrate that the application of systemic risk measures facilitates enforcing fairness in classification. Analysis and experiments regarding the fairness of the proposed models are carefully conducted. For all methods, our numerical experiments demonstrate that they are robust in the presence of unreliable training data and perform better on unknown data than the methods minimizing expected classification errors. Furthermore, the performance improves when the number of classes increases.

URL PDF HTML ☆

赞 0 踩 0

2508.15371 2026-05-29 cs.CL cs.AI cs.LG 版本更新

Confidence-Modulated Speculative Decoding for Large Language Models

置信度调节的推测解码用于大型语言模型

Jaydip Sen, Subhasis Dasgupta, Hetvi Waghela

发表机构 * Department of Data Science（数据科学系）； Praxis Business School（普拉克斯商学院）

AI总结本文提出一种基于置信度调节的推测解码框架，通过熵和边际不确定性度量动态调整草稿长度与验证过程，在机器翻译和摘要任务上实现加速并保持或提升BLEU和ROUGE分数。

Comments This is the preprint of the paper, which has been accepted for oral presentation and publication in the proceedings of IEEE INDISCON 2025. The conference will be organized at the National Institute of Technology, Rourkela, India, from August 21 to 23, 2025. The paper is 10 pages long, and it contains 2 figures and 5 tables

详情

DOI: 10.1109/INDISCON66021.2025.11254640

AI中文摘要

推测解码已成为一种通过草稿-验证范式并行化令牌生成来加速自回归推理的有效方法。然而，现有方法依赖静态草稿长度和刚性验证标准，限制了其在不同模型不确定性和输入复杂性下的适应性。本文提出一种基于置信度调节草稿的信息论推测解码框架。通过利用草稿模型输出分布上的熵和边际不确定性度量，所提方法在每次迭代中动态调整推测生成的令牌数量。这种自适应机制减少了回滚频率，提高了资源利用率，并保持了输出保真度。此外，验证过程使用相同的置信度信号进行调节，使得在不牺牲生成质量的情况下更灵活地接受草稿令牌。在机器翻译和摘要任务上的实验表明，与标准推测解码相比，该方法在保持或提升BLEU和ROUGE分数的同时实现了显著加速。所提方法提供了一种原则性的即插即用方法，用于在不确定性变化条件下实现大型语言模型的高效且鲁棒的解码。

英文摘要

Speculative decoding has emerged as an effective approach for accelerating autoregressive inference by parallelizing token generation through a draft-then-verify paradigm. However, existing methods rely on static drafting lengths and rigid verification criteria, limiting their adaptability across varying model uncertainties and input complexities. This paper proposes an information-theoretic framework for speculative decoding based on confidence-modulated drafting. By leveraging entropy and margin-based uncertainty measures over the drafter's output distribution, the proposed method dynamically adjusts the number of speculatively generated tokens at each iteration. This adaptive mechanism reduces rollback frequency, improves resource utilization, and maintains output fidelity. Additionally, the verification process is modulated using the same confidence signals, enabling more flexible acceptance of drafted tokens without sacrificing generation quality. Experiments on machine translation and summarization tasks demonstrate significant speedups over standard speculative decoding while preserving or improving BLEU and ROUGE scores. The proposed approach offers a principled, plug-in method for efficient and robust decoding in large language models under varying conditions of uncertainty.

URL PDF HTML ☆

赞 0 踩 0

2508.08677 2026-05-29 cs.LG cs.CV 版本更新

Multi-level Collaborative Distillation Meets Global Workspace Model: A Unified Framework for OCIL

多级协作蒸馏遇见全局工作空间模型：面向OCIL的统一框架

Shibin Su, Guoqiang Liang, De Cheng, Shizhou Zhang, Lingyan Ran

发表机构 * School of Computer Science, Northwestern Polytechnical University（西北工业大学计算机学院）； School of Telecommunications Engineering, Xidian University（西安电子科技大学电信工程学院）

AI总结提出一种结合全局工作空间模型和多级协作蒸馏的统一框架，通过融合多学生模型参数形成共享隐式记忆并周期性广播，以及跨学生一致性和历史知识对齐机制，有效平衡在线类增量学习中的稳定性与可塑性。

Comments 15 pages, 8 figures

详情

AI中文摘要

在线类增量学习（OCIL）使模型能够从非独立同分布的数据流中持续学习。由于数据流中的样本只能被看到一次，因此与离线学习相比，它更适用于现实场景。然而，这一约束加剧了OCIL在维持稳定性与可塑性之间适当平衡的挑战。此外，在现实世界中更严格的内存缓冲区约束下，当前基于重放的方法效果较差。虽然集成方法提高了可塑性，但它们常常在稳定性上遇到困难。受全局工作空间理论（GWT）启发，我们提出了一种新颖方法，通过全局工作空间模型（GWM）——一种共享的隐式记忆，指导多个学生模型的学习——来增强集成学习。GWM通过在每个训练批次中融合所有学生的参数形成，捕获历史学习轨迹，并作为知识巩固的动态锚点。类似于GWT的广播机制，GWM定期重新分发给学生，稳定学习并促进跨任务一致性。此外，我们引入了一种多级协作蒸馏机制。它强制学生之间保持对等一致性，并通过将每个学生与GWM对齐来保留历史知识。因此，学生模型在保持先前所学知识的同时，仍能适应新任务，在稳定性与可塑性之间实现更好的平衡。在三个标准OCIL基准上的大量实验表明，我们的方法在各种内存预算下为多个OCIL模型带来了显著的性能提升。代码可在https://github.com/susususushi/GWM获取。

英文摘要

Online Class-Incremental Learning (OCIL) enables models to learn continuously from non-i.i.d. data streams. Since samples of the data streams can be seen only once, it is more suitable for real-world scenarios compared to offline learning. However, this constraint intensifies the challenge for OCIL in maintaining an appropriate balance between stability and plasticity. Moreover, under stricter memory buffer constraints in real world, current replay-based methods are less effective. While ensemble methods improve plasticity, they often struggle with stability. Inspired by the Global Workspace Theory (GWT), we propose a novel approach that enhances ensemble learning through a Global Workspace Model (GWM)-a shared, implicit memory that guides the learning of multiple student models. The GWM is formed by fusing the parameters of all students within each training batch, capturing the historical learning trajectory and serving as a dynamic anchor for knowledge consolidation. Like the broadcasting mechanism of GWT, the GWM is redistributed periodically to students, stabilizing learning and promoting cross-task consistency. In addition, we introduce a multi-level collaborative distillation mechanism. It enforces peer-to-peer consistency among students and preserves historical knowledge by aligning each student with the GWM. As a result, student models remain adaptable to new tasks while maintaining previously learned knowledge, striking a better balance between stability and plasticity. Extensive experiments on three standard OCIL benchmarks show that our method delivers significant performance improvement for several OCIL models across various memory budgets. The code is available at https://github.com/susususushi/GWM.

URL PDF HTML ☆

赞 0 踩 0

2508.02537 2026-05-29 cs.LG 版本更新

Solved in Unit Domain: JacobiNet for Differentiable Coordinate-Transformed PINNs

在单位域中求解：用于可微坐标变换PINNs的JacobiNet

Xi Chen, Jianchuan Yang, Junjie Zhang, Runnan Yang, Xu Liu, Hong Wang, Tinghui Zheng, Ziyu Ren, Wenqi Hu

发表机构 * Department of Mechanical and Aerospace Engineering, The Hong Kong University of Science and Technology（香港科技大学机械与航空航天工程系）； Department of Mechanics & Engineering, College Architecture & Environment, Sichuan University（四川大学力学与工程学院）； School of Mechanical Engineering and Automation, Beihang University（北京航空航天大学机械工程与自动化学院）； Department of Civil and Environmental Engineering, The Hong Kong University of Science and Technology（香港科技大学土木与环境工程系）； College of Computing and Data Science, Nanyang Technological University（南洋理工大学计算机与数据科学学院）； West China Biomedical Big Data Center, West China Hospital, Sichuan University（四川大学西昌生物医学大数据中心，西昌医院）

AI总结提出JacobiNet，一种基于学习的可微坐标变换PINN框架，通过端到端可微架构统一域映射与PDE求解，解决不规则边界域中PINNs的归一化、边界强制和损失项不平衡问题，显著提升精度和效率。

Comments Accepted by Journal of Computational Physics

详情

AI中文摘要

物理信息神经网络（PINNs）通过将物理定律嵌入学习过程，为求解偏微分方程提供了强大框架。然而，当应用于具有不规则边界的域时，PINNs常遭受不稳定和收敛缓慢的问题，这源于（1）几何各向异性导致的不一致归一化，（2）不精确的边界强制，以及（3）损失项竞争的不平衡。常见的解决方法是将该域映射到规则空间。然而，传统的映射方法依赖于特定情况的网格，在预指定的固定节点上定义雅可比矩阵，并通过链式法则重新表述PDE——使其与现代自动微分和张量框架不兼容。为弥合这一差距，我们提出了JacobiNet，一种基于学习的坐标变换PINN框架，在端到端可微架构中统一了域映射和PDE求解。JacobiNet通过自动梯度实现直接的雅可比矩阵计算，与下游PINNs共享计算图，从而避免了特定情况的网格划分、显式的雅可比矩阵计算/存储以及手动的PDE重新表述，同时解锁了几何编辑操作。通过将物理建模与几何复杂性分离，JacobiNet（1）解决了原始各向异性坐标中的归一化挑战，（2）促进了边界条件的硬性强制，以及（3）缓解了长期存在的损失项间不平衡问题。在各种PDE上的评估表明，JacobiNet将相对L2误差从0.11-0.73降低到0.01-0.09，平均精度提升了15.6倍。在具有变化形状的血管状域中，JacobiNet实现了对未见几何形状的毫秒级映射推理，平均预测精度提升了3.65倍，同时提供了超过10倍的加速——展示了强大的泛化能力、精度和效率。

英文摘要

Physics-Informed Neural Networks (PINNs) offer a powerful framework for solving PDEs by embedding physical laws into the learning process. However, when applied to domains with irregular boundaries, PINNs often suffer from instability and slow convergence, which stems from (1) inconsistent normalization due to geometric anisotropy, (2) inaccurate boundary enforcement, and (3) imbalanced loss term competition. A common workaround is to map the domain to a regular space. Yet, conventional mapping methods rely on case-specific meshes, define Jacobians at pre-specified fixed nodes, reformulate PDEs via the chain rule-making them incompatible with modern automatic differentiation, tensor-based frameworks. To bridge this gap, we propose JacobiNet, a learning-based coordinate-transformed PINN framework that unifies domain mapping and PDE solving within an end-to-end differentiable architecture. JacobiNet enables direct Jacobian computation via autograd, shares computation graph with downstream PINNs, thereby avoiding case-specific meshing, explicit Jacobian computation/storage, and manual PDE reformulation while unlocking geometric-editing operations. Separating physical modeling from geometric complexity, JacobiNet (1) addresses normalization challenges in the original anisotropic coordinates, (2) facilitates the hard enforcement of boundary conditions, and (3) mitigates the long-standing imbalance among loss terms. Evaluated on various PDEs, JacobiNet reduces the relative L2 error from 0.11-0.73 to 0.01-0.09, achieving an average 15.6x improvement in accuracy. In vessel-like domains with varying shapes, JacobiNet enables millisecond-level mapping inference for unseen geometries, improves prediction accuracy by an average of 3.65x, while delivering over 10x speedup-demonstrating strong generalization, accuracy, and efficiency.

URL PDF HTML ☆

赞 0 踩 0

2507.21429 2026-05-29 stat.ML cs.LG 版本更新

From Sublinear to Linear: Local Convergence in Finite-Width Networks via Locally Polyak-Lojasiewicz Regions

从次线性到线性：通过局部Polyak-Lojasiewicz区域在有限宽度网络中的局部收敛

Agnideep Aich, Ashit Baran Aich, Bruce Wade

发表机构 * Stanford University（斯坦福大学）； University of Louisiana at Lafayette（路易斯安那州立大学拉法叶分校）； Presidency College Kolkata, India（印度科利切斯特 Presidency 学院）

AI总结本文研究有限宽度前馈网络在平方经验损失下梯度下降的局部线性收敛，通过局部Polyak-Lojasiewicz不等式和NTK正定性条件，证明了在局部拟凸区域内可实现线性收敛。

详情

AI中文摘要

我们研究了有限宽度前馈网络在平方经验损失下梯度下降的局部线性收敛。先前的工作表明，梯度下降可以保持在初始化附近的局部拟凸区域（LQCR）内，但仅给出次线性速率。我们证明，如果经验神经正切核在初始化时正定、在LQCR上Lipschitz稳定且与LQCR半径兼容，则平方损失满足局部Polyak-Łojasiewicz不等式，常数$μ= λ_0 - L_Θr(\Rcal) > 0$。结合固定步长迭代包含在LQCR内（作为线性速率定理中的假设），这在该区域上产生线性收敛。LQCR提供局部化；固定步长包含作为线性速率定理中的假设；PL不等式来自平方损失下的NTK条件。因此，结果是充分的局部条件，并非声称该机制对于快速收敛是必要或唯一的。实验上，我们通过NTK谱间隙、参数漂移、经验PL比率和次优性衰减来检验理论。在二值MNIST上，NTK保持正定，PL比率有正的下包络，损失在稳定区域呈几何衰减。在宽度消融实验中，固定步长宽度1024的运行离开局部区域；减小步长将最终漂移从1.870降至0.158，恢复观察到的局部区域诊断，并产生研究中观察到的最大经验PL比率下包络。在CIFAR-10子集上的CNN鲁棒性检查显示，PL比率包络在三个种子下保持正，且在稳定区域上三个种子均有正的下包络。

英文摘要

We study local linear convergence of gradient descent for finite-width feedforward networks under the squared empirical loss. Prior work shows that GD can remain confined to a Locally Quasi-Convex Region (LQCR) around initialization, but only gives a sublinear rate. We show that if the empirical Neural Tangent Kernel is positive at initialization, Lipschitz stable on the LQCR, and compatible with the LQCR radius, then the squared loss satisfies a local Polyak-Łojasiewicz inequality with constant $μ= λ_0 - L_Θr(\Rcal) > 0$. Combined with fixed-step iterate containment in the LQCR, imposed as a hypothesis in the linear-rate theorem, this yields linear convergence on the region. The LQCR supplies localization; fixed-step containment is imposed as a hypothesis in the linear-rate theorem; and the PL inequality comes from NTK conditioning under squared loss. The result is therefore a sufficient local condition, not a claim that this mechanism is necessary or unique for fast convergence. Empirically, we probe the theory through NTK spectral gap, parameter drift, empirical PL ratio, and suboptimality decay. On binary MNIST, the NTK remains positive, the PL ratio has a positive lower envelope, and the loss shows geometric decay on the stable regime. In a width ablation, the fixed-step width-$1024$ run leaves the local regime; reducing the step size lowers final drift from $1.870$ to $0.158$, restores the observed local-regime diagnostics, and yields the largest empirical PL-ratio lower envelope observed in the study. A CNN robustness check on a CIFAR-10 subset shows the PL-ratio envelope remains positive across three seeds, with a positive lower envelope across all three seeds on the stable regime.

URL PDF HTML ☆

赞 0 踩 0

2507.03318 2026-05-29 cs.LG cs.AI 版本更新

Structure-Aware Compound-Protein Affinity Prediction via Graph Neural Network with Group Lasso Regularization

基于图神经网络与组套索正则化的结构感知化合物-蛋白质亲和力预测

Zanyu Shi, Yang Wang, Pathum Weerawarna, Jie Zhang, Timothy Richardson, Yijie Wang, Kun Huang

发表机构 * Department of Biostatistics & Health Data Science（生物统计学与健康数据科学系）； Indiana University（印第安纳大学）； Department of Computer Science（计算机科学系）； Indiana University Bloomington（印第安纳大学布卢明顿分校）； Division of Clinical Pharmacology（临床药理学部）； Indiana University School of Medicine（印第安纳大学医学院）； IUSM-Purdue TREAT-AD Center（IUSM-普渡大学TREAT-AD中心）； Department of Medical and Molecular Genetics（医学与分子遗传学系）

AI总结提出利用图神经网络结合组套索和稀疏组套索正则化，从活性悬崖分子对中学习结构信息以预测化合物-蛋白质亲和力（IC50），并提升模型可解释性。

Comments 15 pages, 7 figures

详情

DOI: 10.34133/csbj.0012
Journal ref: Comput Struct Biotechnol J. 2026;35:0012

AI中文摘要

可解释人工智能（XAI）方法越来越多地被应用于药物发现中，以学习分子表示并识别驱动性质预测的子结构。然而，为化合物性质预测构建结构-活性关系（SAR）建模的端到端可解释模型面临诸多挑战，例如特定蛋白质靶标的化合物-蛋白质相互作用活性数据有限，以及分子构型位点的细微变化会显著影响分子性质。我们利用具有活性悬崖的分子对，这些分子共享骨架但在取代基位点不同，其特征是对特定蛋白质靶标具有较大的效力差异。我们提出一个框架，通过实现图神经网络（GNN）来利用活性悬崖对的性质和结构信息，以预测化合物-蛋白质亲和力（即半数最大抑制浓度，IC50）。为了增强模型性能和可解释性，我们使用结构感知损失函数训练GNN，采用组套索和稀疏组套索正则化，这些正则化方法能够剪枝并突出与活性差异相关的分子子图。我们将该框架应用于针对三种原癌基因酪氨酸蛋白激酶Src蛋白（PDB ID：1O42、2H8H、4MXO）的分子活性悬崖数据。我们的方法通过稀疏组套索整合公共和私有节点信息，改进了性质预测，这体现在均方根误差（RMSE）降低和皮尔逊相关系数（PCC）提高上。应用正则化还通过提升图级全局方向分数和改进原子级着色精度，增强了GNN的特征归因能力。这些进展增强了药物发现流程中模型的可解释性，特别是在先导化合物优化中识别关键分子子结构方面。

英文摘要

Explainable artificial intelligence (XAI) approaches have been increasingly applied in drug discovery to learn molecular representations and identify substructures driving property predictions. However, building end-to-end explainable models for structure-activity relationship (SAR) modeling for compound property prediction faces many challenges, such as the limited number of compound-protein interaction activity data for specific protein targets, and plenty of subtle changes in molecular configuration sites significantly affecting molecular properties. We exploit pairs of molecules with activity cliffs that share scaffolds but differ at substituent sites, characterized by large potency differences for specific protein targets. We propose a framework by implementing graph neural networks (GNNs) to leverage property and structure information from activity cliff pairs to predict compound-protein affinity (i.e., half maximal inhibitory concentration, IC50). To enhance model performance and explainability, we train GNNs with structure-aware loss functions using group lasso and sparse group lasso regularizations, which prune and highlight molecular subgraphs relevant to activity differences. We applied this framework to activity cliff data of molecules targeting three proto-oncogene tyrosine-protein kinase Src proteins (PDB IDs: 1O42, 2H8H, 4MXO). Our approach improved property prediction by integrating common and uncommon node information with sparse group lasso, as reflected in reduced root mean squared error (RMSE) and improved Pearson's correlation coefficient (PCC). Applying regularizations also enhances feature attribution for GNN by boosting graph-level global direction scores and improving atom-level coloring accuracy. These advances strengthen model interpretability in drug discovery pipelines, particularly for identifying critical molecular substructures in lead optimization.

URL PDF HTML ☆

赞 0 踩 0

2506.06254 2026-05-29 cs.AI cs.CL cs.LG 版本更新

PersonaAgent: Bridging Memory and Action for Personalized LLM Agents

PersonaAgent：弥合个性化LLM智能体的记忆与行动

Weizhi Zhang, Xinyang Zhang, Chenwei Zhang, Liangwei Yang, Jingbo Shang, Zhepei Wei, Henry Peng Zou, Zijie Huang, Zhengyang Wang, Yifan Gao, Xiaoman Pan, Lian Xiong, Jingguo Liu, Philip S. Yu, Xian Li

发表机构 * Amazon Stores Foundational AI（亚马逊基础AI）

AI总结提出PersonaAgent框架，通过整合个性化记忆模块（情景与语义记忆）和行动模块，并利用角色提示作为中介实现记忆与行动的协同，以解决LLM智能体的个性化任务。

Comments Accepted in ACL 2026

详情

AI中文摘要

由大型语言模型驱动的智能体近期作为先进范式出现，在广泛领域和任务中展现出令人印象深刻的能力。尽管潜力巨大，当前LLM智能体常采用一刀切方法，缺乏响应用户不同需求和偏好的灵活性。这一局限促使我们开发PersonaAgent——首个旨在处理多样化个性化任务的个性化LLM智能体框架。具体而言，PersonaAgent整合了两个互补组件：一个包含情景记忆和语义记忆机制的个性化记忆模块；一个使智能体能够执行针对用户定制的工具行动的个性化行动模块。核心在于，角色（定义为每位用户独特的系统提示）充当中间件：它利用来自个性化记忆的洞察来控制智能体行动，而这些行动的结果反过来又优化记忆。基于该框架，我们提出一种测试时用户偏好对齐策略，该策略模拟最近的n次交互以优化角色提示，通过模拟响应与真实响应之间的文本损失反馈确保实时用户偏好对齐。实验评估表明，PersonaAgent不仅有效个性化行动空间，还能在测试时实际应用中扩展，显著优于其他基线方法。这些结果证明了我们的方法在提供定制化、动态用户体验方面的可行性和潜力。

英文摘要

Large Language Model (LLM) empowered agents have recently emerged as advanced paradigms that exhibit impressive capabilities in a wide range of domains and tasks. Despite their potential, current LLM agents often adopt a one-size-fits-all approach, lacking the flexibility to respond to users' varying needs and preferences. This limitation motivates us to develop PersonaAgent, the first personalized LLM agent framework designed to address versatile personalization tasks. Specifically, PersonaAgent integrates two complementary components - a personalized memory module that includes episodic and semantic memory mechanisms; a personalized action module that enables the agent to perform tool actions tailored to the user. At the core, the persona (defined as unique system prompt for each user) functions as an intermediary: it leverages insights from personalized memory to control agent actions, while the outcomes of these actions in turn refine the memory. Based on the framework, we propose a test-time user-preference alignment strategy that simulate the latest n interactions to optimize the persona prompt, ensuring real-time user preference alignment through textual loss feedback between simulated and ground-truth responses. Experimental evaluations demonstrate that PersonaAgent significantly outperforms other baseline methods by not only personalizing the action space effectively but also scaling during test-time real-world applications. These results underscore the feasibility and potential of our approach in delivering tailored, dynamic user experiences.

URL PDF HTML ☆

赞 0 踩 0

2506.06095 2026-05-29 cs.LG 版本更新

Accelerating Sparse Transformer Inference on GPU

加速GPU上的稀疏Transformer推理

Wenhao Dai, Haodong Deng, Mengfei Rong, Xinyu Yang, Hongyu Liu, Fangxin Liu, Hailong Yang, Qianwen Cao, Qingxiao Sun

发表机构 * SSSLab, Dept. of CST China University of Petroleum-Beijing Beijing China（SSSLab，计算机科学与技术系中国石油大学（北京）北京中国）； Baidu Inc. Beijing China（百度公司北京中国）； School of Computer Science Shanghai Jiao Tong University Shanghai China（计算机科学学院上海交通大学上海中国）； China University of Petroleum-Beijing Beihang University Beijing China（中国石油大学（北京）北京航空航天大学北京中国）； China University of Petroleum-Beijing（中国石油大学（北京））； Beihang University（北京航空航天大学）； Baidu Inc.（百度公司）； Shanghai Jiao Tong University（上海交通大学）

AI总结针对稀疏Transformer推理加速问题，提出STOF框架，通过分析建模将多头注意力映射为行式或块式核并采用独特存储格式，结合两阶段搜索的算子融合方案，在GPU上实现高达1.6倍的多头注意力计算加速和1.4倍的端到端推理加速。

详情

AI中文摘要

大型语言模型（LLMs）因其强大的理解能力在全球广受欢迎。作为LLMs的核心组件，通过并行化加速Transformer逐渐成为研究热点。掩码层向Transformer引入稀疏性以减少计算量。然而，以往的工作很少关注稀疏Transformer的性能优化。此外，当前的静态算子融合方案无法适应多样化的应用场景。为解决上述问题，我们提出STOF，一个针对稀疏Transformer的优化框架，能够在GPU上实现灵活的掩码和算子融合。对于多头注意力（MHA）结构，STOF根据分析建模将计算映射为具有独特存储格式的行式或块式核。对于下游算子，STOF将融合方案映射到编译模板，并通过两阶段搜索确定最优运行配置。实验结果表明，与最先进的工作相比，STOF在MHA计算中实现了最高1.6倍的加速，在端到端推理中实现了最高1.4倍的加速。

英文摘要

Large language models (LLMs) are popular around the world due to their powerful understanding capabilities. As the core component of LLMs, accelerating Transformer through parallelization has gradually become a hot research topic. Mask layers introduce sparsity into Transformer to reduce calculations. However, previous works rarely focus on the performance optimization of sparse Transformer. In addition, current static operator fusion schemes fail to adapt to diverse application scenarios. To address the above problems, we propose STOF, a framework that incorporates optimizations for Sparse Transformer that enables flexible masking and Operator Fusion on GPU. For multi-head attention (MHA) structure, STOF maps the computation to row-wise or blockwise kernels with unique storage formats according to analytical modeling. For downstream operators, STOF maps the fusion scheme to compilation templates and determines the optimal running configuration through two-stage searching. The experimental results show that compared to the stateof-the-art work, STOF achieves maximum speedups of 1.6x in MHA computation and 1.4x in end-to-end inference.

URL PDF HTML ☆

赞 0 踩 0

2506.05985 2026-05-29 cs.LG cs.RO 版本更新

Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning

动态渐进式参数高效专家库混合用于终身机器人学习

Yuheng Lei, Sitong Mao, Shunbo Zhou, Hongyuan Zhang, Xuelong Li, Ping Luo

发表机构 * The University of Hong Kong（香港大学）； Institute of Artificial Intelligence (TeleAI), China Telecom（人工智能研究院（TeleAI），中国电信）； Huawei Cloud Computing Technologies（华为云计算技术）； Ola Dimensions ； HKU Shanghai Intelligent Computing Research Center（香港大学上海智能计算研究中心）

AI总结针对终身学习中任务标识不可用和知识隔离问题，提出动态渐进式参数高效专家库混合（DMPEL），通过构建低秩专家库和轻量路由器实现灵活的前向迁移，并引入专家系数回放缓解遗忘，在LIBERO基准上以最少可训练参数和存储超越现有方法。

Comments Accepted to Transactions on Machine Learning Research (TMLR) at https://openreview.net/forum?id=MHVBrjS8cG . Code is available at https://github.com/HarryLui98/DMPEL

详情

AI中文摘要

一个通用智能体必须在其生命周期中持续学习和适应，实现高效的前向迁移，同时最小化灾难性遗忘。先前在主导的预训练-微调范式中的工作探索了用于单任务适应的参数高效微调，通过少量参数有效引导冻结的预训练模型。然而，在终身学习背景下，这些方法依赖于测试时任务标识符这一不切实际的假设，并限制了孤立适配器之间的知识共享。为解决这些限制，我们提出了用于终身机器人学习的动态渐进式参数高效专家库混合（DMPEL）。DMPEL逐步构建一个低秩专家库，并采用轻量路由器将专家动态组合成端到端策略，从而实现灵活高效的终身前向迁移。此外，通过利用微调参数的模块化结构，我们引入了专家系数回放，引导路由器准确检索先前遇到任务的冻结专家。该技术缓解了遗忘，同时相比对整个策略进行经验回放，显著节省存储和计算。在终身机器人学习基准LIBERO上的大量实验表明，我们的框架在持续适应过程中的成功率上优于最先进的终身学习方法，同时使用了最少的可训练参数和存储。

英文摘要

A generalist agent must continuously learn and adapt throughout its lifetime, achieving efficient forward transfer while minimizing catastrophic forgetting. Previous work within the dominant pretrain-then-finetune paradigm has explored parameter-efficient fine-tuning for single-task adaptation, effectively steering a frozen pretrained model with a small number of parameters. However, in the context of lifelong learning, these methods rely on the impractical assumption of a test-time task identifier and restrict knowledge sharing among isolated adapters. To address these limitations, we propose Dynamic Mixture of Progressive Parameter-Efficient Expert Library (DMPEL) for lifelong robot learning. DMPEL progressively builds a low-rank expert library and employs a lightweight router to dynamically combine experts into an end-to-end policy, enabling flexible and efficient lifelong forward transfer. Furthermore, by leveraging the modular structure of the fine-tuned parameters, we introduce expert coefficient replay, which guides the router to accurately retrieve frozen experts for previously encountered tasks. This technique mitigates forgetting while being significantly more storage- and computation-efficient than experience replay over the entire policy. Extensive experiments on the lifelong robot learning benchmark LIBERO demonstrate that our framework outperforms state-of-the-art lifelong learning methods in success rates during continual adaptation, while utilizing minimal trainable parameters and storage.

URL PDF HTML ☆

赞 0 踩 0

2506.04602 2026-05-29 cs.GT cs.LG 版本更新

MVP-Shapley: Feature-based Modeling for Evaluating the Most Valuable Player in Basketball

MVP-Shapley：基于特征建模的篮球最有价值球员评估方法

Haifeng Sun, Yu Xiong, Runze Wu, Kai Wang, Lan Zhang, Changjie Fan, Shaojie Tang, Xiang-Yang Li

发表机构 * University of Science and Technology of China（科学技术大学）； Netease, Fuxi AI Lab（网易凤凰人工智能实验室）； University at Buffalo（布法罗大学）

AI总结提出一种基于Shapley值的MVP评估框架，通过特征处理、胜负模型训练和贡献分配，结合因果优化实现球员排名，并在NBA数据集上验证有效性。

详情

AI中文摘要

电子竞技和多人在线游戏社区的蓬勃发展凸显了评估最有价值球员（MVP）的关键重要性。建立可解释且实用的MVP评估方法非常具有挑战性。在我们的研究中，我们特别关注逐回合数据，该数据记录了比赛中的相关事件，如助攻和得分。我们旨在通过引入一种新的MVP评估框架（记为\oursys）来应对这些挑战，该框架利用Shapley值。该方法包括特征处理、胜负模型训练、Shapley值分配以及基于球员贡献的MVP排名确定。此外，我们从因果关系的角度优化算法，使其与专家投票结果一致。最后，我们通过使用NBA数据集和Dunk City Dynasty数据集进行验证，证实了我们方法的有效性，并在行业中实现了在线部署。

英文摘要

The burgeoning growth of the esports and multiplayer online gaming community has highlighted the critical importance of evaluating the Most Valuable Player (MVP). The establishment of an explainable and practical MVP evaluation method is very challenging. In our study, we specifically focus on play-by-play data, which records related events during the game, such as assists and points. We aim to address the challenges by introducing a new MVP evaluation framework, denoted as \oursys, which leverages Shapley values. This approach encompasses feature processing, win-loss model training, Shapley value allocation, and MVP ranking determination based on players' contributions. Additionally, we optimize our algorithm to align with expert voting results from the perspective of causality. Finally, we substantiated the efficacy of our method through validation using the NBA dataset and the Dunk City Dynasty dataset and implemented online deployment in the industry.

URL PDF HTML ☆

赞 0 踩 0

2505.20634 2026-05-29 cs.LG stat.ML 版本更新

Explaining Concept Shift with Interpretable Feature Attribution

用可解释的特征归因解释概念漂移

Ruiqi Lyu, Alistair Turcan, Bryan Wilder

发表机构 * Carnegie Mellon University（卡内基梅隆大学）

AI总结提出SGShift方法，通过将概念漂移建模为特征选择任务，利用广义加性模型、敲除和吸收等统计工具识别导致源域与目标域模型性能差异的稀疏漂移特征。

详情

AI中文摘要

当特征条件标签分布在域间发生变化时，就会发生概念漂移，这可能导致即使调优良好的机器学习模型在新域上校准失效。识别这些漂移特征可以独特地揭示域间特征-标签关系如何不同，考虑到这种差异可能跨越科学相关的维度（如时间、疾病状态、人群等）。在本文中，我们提出SGShift，一种将表格数据中概念漂移导致的性能下降归因于稀疏漂移特征集的方法。我们将概念漂移框架化为特征选择任务，以学习能够解释源域和目标域模型间性能差异的特征。该框架使SGShift能够适应强大的统计工具，如广义加性模型、敲除和吸收，以识别这些漂移特征。我们在各种机器学习模型的合成数据和真实数据上进行了广泛实验，发现SGShift比基线方法更准确地识别漂移特征，在漂移域中所需样本少，并且对复杂的概念漂移情况具有鲁棒性。

英文摘要

Concept shift occurs when the distribution of labels conditioned on the features changes between domains, which can make even a well-tuned ML model miscalibrated on a new domain. Identifying these shifted features provides unique insight into how feature-label relationships differ between domains, considering the difference may be across a scientifically relevant dimension, such as time, disease status, population, etc. In this paper, we propose SGShift, a method for attributing performance degradation under concept shift in tabular data to a sparse set of shifted features. We frame concept shift as a feature selection task to learn the features that can explain performance differences between models in the source and target domain. This framework enables SGShift to adapt powerful statistical tools such as generalized additive models, knockoffs, and absorption towards identifying these shifted features. We conduct extensive experiments in synthetic and real data across various ML models and find SGShift can identify shifted features much more accurately than baseline methods, requires few samples in the shifted domain, and is robust to complex cases of concept shift.

URL PDF HTML ☆

赞 0 踩 0

2505.05968 2026-05-29 cs.LG cs.MA 版本更新

Offline Multi-agent Reinforcement Learning via Sequential Score Decomposition

离线多智能体强化学习通过序列得分分解

Dan Qiao, Wenhao Li, Shanchao Yang, Hongyuan Zha, Baoxiang Wang

发表机构 * School of Data Science, The Chinese University of Hong Kong, Shenzhen, China（香港中文大学（深圳）数据科学学院）； School of Computer Science, Tongji University, Shanghai, China（同济大学计算机科学学院）； Vector Institute（向量研究所）

AI总结针对离线合作多智能体强化学习中联合动作空间高维和异质行为数据导致的策略分布偏移问题，提出序列得分函数分解方法，利用扩散模型从多模态离线数据中学习每个智能体的正则化信号，指导策略更新至高分、分布内区域，在多个粒子环境和多智能体MuJoCo基准上实现最先进性能。

Comments ICML 2026 Accepted

详情

Journal ref: Forty-Third International Conference on Machine Learning, 2026

AI中文摘要

离线合作多智能体强化学习（MARL）因分布偏移面临独特挑战，尤其源于联合动作空间的高维性和分布外联合动作选择的存在。在这项工作中，我们强调离线MARL的一个基本挑战来自合作任务的多均衡性质，这诱导了高度多模态的联合行为策略空间与异质质量行为数据的耦合。这使得个体策略正则化难以与一致的协调模式对齐，导致策略分布偏移问题。为应对这一挑战，我们设计了一种序列得分函数分解方法，从联合行为策略中提炼每个智能体的正则化信号，在分散执行约束下诱导协调模态选择。然后我们利用灵活的基于扩散的生成模型从多模态离线数据中学习这些得分函数，并将其集成到联合动作评论家中，以在共享团队奖励下引导策略更新朝向高分、分布内区域。我们的方法在多个粒子环境和多智能体MuJoCo基准上一致实现了最先进性能。据我们所知，这是首个明确解决离线与在线MARL之间分布差距的工作，为更可泛化的基于离线策略的MARL方法铺平了道路。

英文摘要

Offline cooperative multi-agent reinforcement learning (MARL) faces unique challenges due to distributional shifts, particularly stemming from the high dimensionality of joint action spaces and the presence of out-of-distribution joint action selections. In this work, we highlight that a fundamental challenge in offline MARL arises from the multi-equilibrium nature of cooperative tasks, which induces a highly multimodal joint behavior policy space coupled with heterogeneous-quality behavior data. This makes it difficult for individual policy regularization to align with a consistent coordination pattern, leading to the policy distribution shift problems. To tackle this challenge, we design a sequential score function decomposition method that distills per-agent regularization signals from the joint behavior policy, which induces coordinated modality selection under decentralized execution constraints. Then we leverage a flexible diffusion-based generative model to learn these score functions from multimodal offline data, and integrate them into joint-action critics to guide policy updates toward high-reward, in-distribution regions under a shared team reward. Our approach achieves state-of-the-art performance across multiple particle environments and Multi-agent MuJoCo benchmarks consistently. To the best of our knowledge, this is the first work to explicitly address the distributional gap between offline and online MARL, paving the way for more generalizable offline policy-based MARL methods.

URL PDF HTML ☆

赞 0 踩 0

2505.02743 2026-05-29 cs.LG stat.ML 版本更新

Cooperative Variance Estimation and Bayesian Neural Networks for Disentangling Aleatoric and Epistemic Uncertainties

合作方差估计与贝叶斯神经网络用于分离偶然不确定性和认知不确定性

Jiaxiang Yi, Miguel A. Bessa

发表机构 * Faculty of Mechanical Engineering, Delft University of Technology, Mekelweg 2, Delft, 2628 CD, The Netherlands（代尔夫特理工大学机械工程学院）； School of Engineering, Brown University, 184 Hope St., Providence, RI 02912, USA（布朗大学工程学院）

AI总结提出通过合作训练方差估计网络与贝叶斯神经网络，实现偶然不确定性与认知不确定性的分离，并提升均值估计性能。

Comments 38 pages, 26 figures

详情

AI中文摘要

真实世界的数据包含偶然不确定性——由不完美的测量或对数据生成过程的不完全了解引起的不可约噪声。均值-方差估计网络可以学习这种类型的不确定性，但需要即兴的正则化策略以避免过拟合，并且无法预测认知不确定性（模型不确定性）。相反，贝叶斯神经网络可以预测认知不确定性，但由于贝叶斯推断的近似性质，它们以难以训练而著称。我们提出合作训练一个方差估计网络与一个贝叶斯神经网络，并通过实验证明，所得模型在改善均值估计的同时分离了偶然不确定性和认知不确定性。我们展示了该方法在多种数据集上的有效性和可扩展性，包括我们创建的一个时间依赖异方差回归数据集，其中偶然不确定性是已知的。所提出的方法易于实现、鲁棒，并且适用于各种模型架构。

英文摘要

Real-world data contains aleatoric uncertainty - irreducible noise arising from imperfect measurements or from incomplete knowledge about the data generation process. Mean-variance estimation networks can learn this type of uncertainty but require ad-hoc regularization strategies to avoid overfitting and are unable to predict epistemic uncertainty (model uncertainty). Conversely, Bayesian neural networks predict epistemic uncertainty but are notoriously difficult to train due to the approximate nature of Bayesian inference. We propose to cooperatively train a variance estimation network with a Bayesian neural network and empirically demonstrate that the resulting model disentangles aleatoric and epistemic uncertainties while improving the mean estimation. We demonstrate the effectiveness and scalability of this method across a diverse range of datasets, including a time-dependent heteroscedastic regression dataset we created where the aleatoric uncertainty is known. The proposed method is straightforward to implement, robust, and adaptable to various model architectures.

URL PDF HTML ☆

赞 0 踩 0

2503.13844 2026-05-29 cs.CL cs.AI cs.CY cs.LG 版本更新

Towards Detecting Persuasion on Social Media: From Model Development to Insights on Persuasion Strategies

检测社交媒体上的说服：从模型开发到说服策略的洞察

Elyas Meguellati, Stefano Civelli, Pietro Bernardelle, Shazia Sadiq, Irwin King, Gianluca Demartini

发表机构 * University of Queensland（昆士兰大学）； The Chinese University of Hong Kong（香港中文大学）

AI总结本文通过开发轻量级说服文本检测模型（在SemEval 2023任务3子任务3中达到最优性能）并应用于澳大利亚联邦选举2022 Facebook广告数据集，揭示了政治竞选在不同资金策略、词汇选择、人口统计定位和选举临近时说服强度时间变化中的模式。

详情

DOI: 10.1609/icwsm.v20i1.42714
Journal ref: Proceedings of the International AAAI Conference on Web and Social Media 20(1) (2026) 1587-1608

AI中文摘要

政治广告通过嵌入更广泛宣传策略中的微妙说服技巧，在塑造公众舆论和影响选举结果方面发挥着关键作用。检测这些说服元素对于提高选民意识和确保民主进程的透明度至关重要。本文通过两项相互关联的研究，提出了一种连接模型开发与实际应用的综合方法。首先，我们引入了一个轻量级说服文本检测模型，该模型在SemEval 2023任务3子任务3中达到了最先进性能，同时所需的计算资源和训练数据远少于现有方法。其次，我们通过收集澳大利亚联邦选举2022 Facebook广告（APA22）数据集，对其中一部分进行说服标注，并对模型进行微调以使其从主流新闻适应社交媒体内容，从而展示了该模型的实际效用。然后，我们应用微调后的模型对APA22数据集的其余部分进行标注，揭示了政治竞选如何通过不同的资金策略、词汇选择、人口统计定位以及选举日临近时说服强度的时间变化来利用说服的独特模式。我们的发现不仅强调了分析社交媒体说服时领域特定建模的必要性，还展示了揭示这些策略如何能够增强透明度、告知选民并促进数字竞选中的问责制。

英文摘要

Political advertising plays a pivotal role in shaping public opinion and influencing electoral outcomes, often through subtle persuasive techniques embedded in broader propaganda strategies. Detecting these persuasive elements is crucial for enhancing voter awareness and ensuring transparency in democratic processes. This paper presents an integrated approach that bridges model development and real-world application through two interconnected studies. First, we introduce a lightweight model for persuasive text detection that achieves state-of-the-art performance in Subtask 3 of SemEval 2023 Task 3 while requiring significantly fewer computational resources and training data than existing methods. Second, we demonstrate the model's practical utility by collecting the Australian Federal Election 2022 Facebook Ads (APA22) dataset, partially annotating a subset for persuasion, and fine-tuning the model to adapt from mainstream news to social media content. We then apply the fine-tuned model to label the remainder of the APA22 dataset, revealing distinct patterns in how political campaigns leverage persuasion through different funding strategies, word choices, demographic targeting, and temporal shifts in persuasion intensity as election day approaches. Our findings not only underscore the necessity of domain-specific modeling for analyzing persuasion on social media but also show how uncovering these strategies can enhance transparency, inform voters, and promote accountability in digital campaigns.

URL PDF HTML ☆

赞 0 踩 0

2502.16548 2026-05-29 cs.LG cs.AI cs.CV 版本更新

A Composable Multimodal Framework for cine CMR-Text-Driven Prediction of Heart Failure Outcomes

用于电影心脏磁共振-文本驱动的心力衰竭结局预测的可组合多模态框架

Jianzhou Chen, Jinyang Sun, Xiumei Wang, Xi Chen, Heyu Chu, Guo Song, Yuji Luo, Xingping Zhou, Rong Gu

发表机构 * Department of Cardiology, Nanjing Drum Tower Hospital, State Key Laboratory of Pharmaceutical Biotechnology, Nanjing University（南京鼓楼医院心内科，南京大学国家药物生物技术重点实验室）； School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong University（上海交通大学电子信息与电气工程学院）； College of Electronic and Optical Engineering, Nanjing University of Posts and Telecommunications（南京邮电大学电子与光学工程学院）； College of Integrated Circuit Science and Engineering, Nanjing University of Posts and Telecommunications（南京邮电大学集成电路科学与工程学院）； Department of Cardiology, Nanjing Drum Tower Hospital Clinical College of Nanjing Medical University（南京医科大学南京鼓楼医院临床学院心内科）； Institute of Quantum Information and Technology, Nanjing University of Posts and Telecommunications（南京邮电大学量子信息与技术研究院）

AI总结提出一种可组合多模态框架，通过整合cine CMR影像、结构化临床指标和非结构化文本记录，实现比单模态AI算法更准确的心力衰竭预后预测，并支持个性化治疗优化。

详情

AI中文摘要

目的。根据世界卫生组织（WHO）及其他公共卫生机构的数据，心力衰竭是全球主要死因之一，每年导致数百万人死亡。尽管心力衰竭领域已取得显著进展，生存率和射血分数有所改善，但由于其复杂性和多因素特征，仍存在大量未满足的需求。本研究旨在提出并评估一种用于心力衰竭评估和治疗优化的可组合策略框架，旨在提供更全面的患者评估和管理。方法。该框架利用多模态算法分析全面的患者数据，明确整合了电影心脏磁共振（cine CMR）序列、结构化临床指标（如实验室结果、人口统计学数据）和非结构化文本记录（如病史、处方）。通过整合这些多种数据源，我们的框架为患者提供了更全面的评估和优化的治疗方案。主要结果。与单模态AI算法相比，该多模态框架在心力衰竭预后预测方面展现出更高的准确性。此外，它还能详细评估各种病理指标对心力衰竭结局的影响。意义。通过系统性地整合异质性临床数据，该方法支持更全面的预后评估，并有助于为心力衰竭患者制定优化的个性化治疗计划。

英文摘要

Objective. Heart failure is one of the leading causes of death worldwide, with millions of deaths each year, according to data from the World Health Organization (WHO) and other public health agencies. While significant progress has been made in the field of heart failure, leading to improved survival rates and improvement of ejection fraction, there remains substantial unmet needs, due to the complexity and multifactorial characteristics. This study aims to propose and evaluate a composable strategy framework for assessment and treatment optimization in heart failure, designed to provide more holistic patient evaluation and management. Approach. The framework leverages multi-modal algorithms to analyze a comprehensive range of patient data, explicitly integrating cine cardiac magnetic resonance (cine CMR) sequences, structured clinical metrics (e.g., lab results, demographics), and unstructured textual records (e.g., medical history, prescriptions). By integrating these various data sources, our framework offers a more holistic evaluation and optimized treatment plan for patients. Main results. The multi-modal framework demonstrates superior accuracy in HF prognosis prediction compared to single-modal AI algorithms. Additionally, it enables a detailed evaluation of the impact of various pathological indicators on HF outcomes. Significance. By integrating heterogeneous clinical data in a systematic manner, this approach supports more comprehensive prognosis assessment and facilitates optimized, personalized treatment planning for heart failure patients.

URL PDF HTML ☆

赞 0 踩 0

2411.00278 2026-05-29 cs.LG 版本更新

KAN-AD: Time Series Anomaly Detection with Kolmogorov-Arnold Networks

KAN-AD：基于Kolmogorov-Arnold网络的时间序列异常检测

Quan Zhou, Changhua Pei, Fei Sun, Jing Han, Zhengwei Gao, Dan Pei, Haiming Zhang, Gaogang Xie, Jianhui Li

发表机构 * Computer Network Information Center, Chinese Academy of Sciences（中国科学院计算机网络信息中心）； University of the Chinese Academy of Sciences（中国科学院大学）； Hangzhou Institute for Advanced Study, University of the Chinese Academy of Sciences（中国科学院大学杭州高等研究 institute）； Institute of Computing Technology, Chinese Academy of Sciences（中国科学院计算技术研究所）； School of Frontier Sciences, Nanjing University（南京大学前沿科学学院）； Department of Computer Science and Technology, Tsinghua University（清华大学计算机科学与技术系）

AI总结针对时间序列异常检测中预测模型易过拟合局部波动的问题，提出用截断傅里叶展开替代B样条的KAN-AD方法，通过强调全局模式并抵抗局部扰动，在四个基准上平均检测精度提升15%。

Comments 11 pages, ICML 2025

详情

Journal ref: Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:79136-79149, 2025

AI中文摘要

时间序列异常检测（TSAD）支撑着云服务和网络系统中的实时监控，能够快速识别异常以防止代价高昂的故障。大多数基于预测模型的TSAD方法倾向于通过强调微小波动而过度拟合。我们的分析表明，有效的TSAD应专注于通过平滑局部模式对“正常”行为进行建模。为此，我们将时间序列建模重新表述为用平滑单变量函数逼近序列。每个单变量函数的局部平滑性确保拟合的时间序列对局部扰动保持鲁棒。然而，由于B样条函数固有的局部化特性，直接实现KAN易受这些扰动影响。因此，我们提出KAN-AD，用截断傅里叶展开替代B样条，并引入一种新颖的轻量级学习机制，该机制在强调全局模式的同时对局部扰动保持鲁棒。在四个流行的TSAD基准上，KAN-AD相比最先进的基线实现了平均15%的检测精度提升（峰值超过27%）。值得注意的是，其可训练参数少于1000个，相比原始KAN推理速度提升50%，展示了该方法的效率和实际可行性。

英文摘要

Time series anomaly detection (TSAD) underpins real-time monitoring in cloud services and web systems, allowing rapid identification of anomalies to prevent costly failures. Most TSAD methods driven by forecasting models tend to overfit by emphasizing minor fluctuations. Our analysis reveals that effective TSAD should focus on modeling "normal" behavior through smooth local patterns. To achieve this, we reformulate time series modeling as approximating the series with smooth univariate functions. The local smoothness of each univariate function ensures that the fitted time series remains resilient against local disturbances. However, a direct KAN implementation proves susceptible to these disturbances due to the inherently localized characteristics of B-spline functions. We thus propose KAN-AD, replacing B-splines with truncated Fourier expansions and introducing a novel lightweight learning mechanism that emphasizes global patterns while staying robust to local disturbances. On four popular TSAD benchmarks, KAN-AD achieves an average 15% improvement in detection accuracy (with peaks exceeding 27%) over state-of-the-art baselines. Remarkably, it requires fewer than 1,000 trainable parameters, resulting in a 50% faster inference speed compared to the original KAN, demonstrating the approach's efficiency and practical viability.

URL PDF HTML ☆

赞 0 踩 0

2410.19371 2026-05-29 stat.ML cs.CR cs.LG 版本更新

Noise-Aware Differentially Private Variational Inference

噪声感知的差分隐私变分推断

Talal Alrawajfeh, Joonas Jälkö, Antti Honkela

发表机构 * University of Helsinki（赫尔辛基大学）

AI总结针对差分隐私导致下游推断不可靠的问题，提出一种基于随机梯度变分推断的噪声感知近似贝叶斯推断方法，可应用于高维和非共轭模型，并改进了后验评估精度。

Comments 26 pages, 4 figures

2410.15236 2026-05-29 cs.CR cs.AI cs.LG 版本更新

Jailbreaking and Mitigation of Vulnerabilities in Large Language Models

大语言模型的越狱与漏洞缓解

Benji Peng, Hanxuan Chen, Keyu Chen, Qian Niu, Ziqian Bi, Ming Liu, Pohsun Feng, Tianyang Wang, Lawrence K. Q. Yan, Yizhu Wen, Yichao Zhang, Caitlyn Heqi Yin, Xinyuan Song, Riyang Bao, Jiacheng Shi

发表机构 * Hunan University Changsha, PRC ； Georgia Institute of Technology Atlanta, USA ； Kyoto University Kyoto, Japan ； Purdue University West Lafayette, USA ； National Taiwan Normal University Taipei, ROC ； University of Liverpool Suzhou, PRC ； Hong Kong University of Science ； University of Hawaii Honolulu, USA ； The University of Texas at Dallas Dallas, USA ； University of Wisconsin-Madison Madison, USA ； Emory University Atlanta, USA ； College of William \& Mary Williamsburg, USA

AI总结本文综述了大语言模型在提示注入和越狱攻击下的漏洞，分类攻击方法并评估防御策略，指出研究空白与未来方向。

详情

DOI: 10.63336/Eureka.47
Journal ref: Eureka 1(1) (2026) 26-61

AI中文摘要

大语言模型通过推进自然语言理解和生成，在医疗、软件工程和对话系统等领域实现了广泛应用，从而改变了人工智能。尽管在过去几年取得了这些进展，但大语言模型已显示出相当大的漏洞，特别是对提示注入和越狱攻击。本综述分析了这些漏洞的研究现状，并介绍了可用的防御策略。我们大致将攻击方法分为基于提示的、基于模型的、多模态的和多语言的，涵盖对抗性提示、后门注入和跨模态利用等技术。我们还回顾了各种防御机制，包括提示过滤、转换、对齐技术、多智能体防御和自律，评估了它们的优缺点。我们还讨论了用于评估大语言模型安全性和鲁棒性的关键指标和基准，指出了在交互环境中量化攻击成功率的挑战以及现有数据集中的偏差。通过识别当前研究空白，我们提出了未来在韧性对齐策略、针对不断演变的攻击的高级防御、越狱检测自动化以及考虑伦理和社会影响方面的方向。本综述强调了在人工智能社区内持续研究和合作的必要性，以增强大语言模型的安全性并确保其安全部署。

英文摘要

Large Language Models (LLMs) have transformed artificial intelligence by advancing natural language understanding and generation, enabling applications across fields beyond healthcare, software engineering, and conversational systems. Despite these advancements in the past few years, LLMs have shown considerable vulnerabilities, particularly to prompt injection and jailbreaking attacks. This review analyzes the state of research on these vulnerabilities and presents available defense strategies. We roughly categorize attack approaches into prompt-based, model-based, multimodal, and multilingual, covering techniques such as adversarial prompting, backdoor injections, and cross-modality exploits. We also review various defense mechanisms, including prompt filtering, transformation, alignment techniques, multi-agent defenses, and self-regulation, evaluating their strengths and shortcomings. We also discuss key metrics and benchmarks used to assess LLM safety and robustness, noting challenges like the quantification of attack success in interactive contexts and biases in existing datasets. Identifying current research gaps, we suggest future directions for resilient alignment strategies, advanced defenses against evolving attacks, automation of jailbreak detection, and consideration of ethical and societal impacts. This review emphasizes the need for continued research and cooperation within the AI community to enhance LLM security and ensure their safe deployment.

URL PDF HTML ☆

赞 0 踩 0

2408.15451 2026-05-29 cs.LG cs.CR stat.ME 版本更新

Certified Causal Defense with Generalizable Robustness

具有泛化鲁棒性的认证因果防御

Yiran Qiao, Yu Yin, Chen Chen, Jing Ma

发表机构 * Case Wester Reserve University（凯斯西储大学）； University of Virginia（弗吉尼亚大学）

AI总结提出GLEAN框架，通过可认证因果因子学习解耦因果关系与虚假相关性，并设计因果认证防御策略，实现跨分布偏移域的鲁棒性泛化。

Comments Accepted by AAAI 2025

详情

AI中文摘要

尽管机器学习模型在各种场景中已被证明有效，但普遍认为许多模型容易受到对抗性攻击。近年来，出现了大量对抗性防御的研究。其中，认证防御因其对输入在特定范围内（例如$l_2$球）的任意对抗性扰动具有理论保证而闻名。然而，该领域现有的大多数工作难以将其认证鲁棒性泛化到具有分布偏移的其他数据域中。这一问题的根源在于难以消除不同域中虚假相关性对鲁棒性的负面影响。为解决此问题，本文提出了一种新颖的认证防御框架GLEAN，该框架将因果视角引入认证防御的泛化问题。具体而言，我们的框架集成了一个可认证的因果因子学习组件，以解耦输入与标签之间的因果关系和虚假相关性，从而排除虚假相关性对防御的负面影响。在此基础上，我们设计了一种因果认证防御策略来处理对潜在因果因子的对抗性攻击。通过这种方式，我们的框架不仅对训练分布中数据上的恶意噪声具有鲁棒性，而且能够将其鲁棒性泛化到具有分布偏移的各个域中。在基准数据集上的大量实验验证了我们的框架在不同数据域中认证鲁棒性泛化的优越性。代码见补充材料。

英文摘要

While machine learning models have proven effective across various scenarios, it is widely acknowledged that many models are vulnerable to adversarial attacks. Recently, there have emerged numerous efforts in adversarial defense. Among them, certified defense is well known for its theoretical guarantees against arbitrary adversarial perturbations on input within a certain range (e.g., $l_2$ ball). However, most existing works in this line struggle to generalize their certified robustness in other data domains with distribution shifts. This issue is rooted in the difficulty of eliminating the negative impact of spurious correlations on robustness in different domains. To address this problem, in this work, we propose a novel certified defense framework GLEAN, which incorporates a causal perspective into the generalization problem in certified defense. More specifically, our framework integrates a certifiable causal factor learning component to disentangle the causal relations and spurious correlations between input and label, and thereby exclude the negative effect of spurious correlations on defense. On top of that, we design a causally certified defense strategy to handle adversarial attacks on latent causal factors. In this way, our framework is not only robust against malicious noises on data in the training distribution but also can generalize its robustness across domains with distribution shifts. Extensive experiments on benchmark datasets validate the superiority of our framework in certified robustness generalization in different data domains. Code is available in the supplementary materials.

URL PDF HTML ☆

赞 0 踩 0

2310.14161 2026-05-29 cs.LG 版本更新

Promoting Generalization for Exact Solvers via Adversarial Instance Augmentation

通过对抗性实例增强促进精确求解器的泛化能力

Haoyang Liu, Yufei Kuang, Jie Wang, Xijun Li, Yongdong Zhang, Feng Wu

发表机构 * CAS Key Laboratory of Technology in GIPAS, University of Science and Technology of China（GIPAS技术CAS重点实验室，中国科学技术大学）； Institute of Artificial Intelligence, Hefei Comprehensive National Science Center（合肥综合性国家科学中心人工智能研究院）

AI总结针对学习型MILP求解器在未见实例上性能下降的问题，提出对抗性实例增强方法AdaSolver，通过将不可微的实例增强建模为上下文赌博机问题并联合对抗训练增强策略与求解器，显著提升基于模仿学习和强化学习的分支定界求解器的泛化能力。

详情

Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, 2026

AI中文摘要

机器学习已成功应用于提高混合整数线性规划（MILP）求解器的效率。然而，由于训练分布的多样性有限，基于学习的求解器在未见过的MILP实例上——尤其是在扰动环境中的大规模实例上——常常遭受严重的性能下降。为解决这一问题，我们提出了一种新颖方法，称为对抗性实例增强，该方法无需了解新实例生成的问题类型，以促进分支定界（B&B）求解器中基于学习的分支模块的数据多样性（AdaSolver）。我们使用MILP实例的二分图表示，并通过学习到的增强策略增强图结构，从而获得各种扰动实例以正则化求解器。AdaSolver的主要技术贡献在于，我们将不可微的实例增强建模为上下文赌博机问题，并对抗性地训练基于学习的求解器和增强策略，从而实现对增强策略的高效梯度训练。据我们所知，AdaSolver是首个通用且有效的框架，用于理解和改进基于模仿学习（IL-based）和基于强化学习（RL-based）的B&B求解器的泛化能力。大量实验表明，通过生成各种增强实例，AdaSolver在各种分布上均显著提升了求解效率。

英文摘要

Machine learning has been successfully applied to improve the efficiency of Mixed-Integer Linear Programming (MILP) solvers. However, the learning-based solvers often suffer from severe performance degradation on unseen MILP instances -- especially on large-scale instances from a perturbed environment -- due to the limited diversity of training distributions. To tackle this problem, we propose a novel approach, which is called Adversarial Instance Augmentation and does not require to know the problem type for new instance generation, to promote data diversity for learning-based branching modules in the branch-and-bound (B&B) Solvers (AdaSolver). We use the bipartite graph representations for MILP instances and obtain various perturbed instances to regularize the solver by augmenting the graph structures with a learned augmentation policy. The major technical contribution of AdaSolver is that we formulate the non-differentiable instance augmentation as a contextual bandit problem and adversarially train the learning-based solver and augmentation policy, enabling efficient gradient-based training of the augmentation policy. To the best of our knowledge, AdaSolver is the first general and effective framework for understanding and improving the generalization of both imitation-learning-based (IL-based) and reinforcement-learning-based (RL-based) B&B solvers. Extensive experiments demonstrate that by producing various augmented instances, AdaSolver leads to a remarkable efficiency improvement across various distributions.

URL PDF HTML ☆

赞 0 踩 0

2308.13222 2026-05-29 physics.comp-ph cs.LG physics.flu-dyn stat.ML 版本更新

Bayesian Reasoning for Physics Informed Neural Networks

物理信息神经网络的贝叶斯推理

Krzysztof M. Graczyk, Kornel Witkowski

发表机构 * Institute for Theoretical Physics, University of Wroc aw（沃拉夫大学理论物理研究所）； Institute of Low Temperature and Structure Research（低温与结构研究所）； Polish Academy of Sciences（波兰科学院）

AI总结提出一种基于证据驱动的贝叶斯物理信息神经网络方法，通过拉普拉斯近似高效计算模型证据，自动优化偏微分方程残差、边界条件和观测数据之间的损失权重，并在热方程、波动方程和伯格斯方程上验证了其求解精度与不确定性量化能力。

Comments 21 pages, 12 figures, re-edit the description of the Bayesian framework, some of the content moved to Appendix. Discussion of numerical performance added, as well as related approaches

详情

DOI: 10.1103/29bd-jfhz
Journal ref: Phys. Rev. E 113, 055307 (2026)

AI中文摘要

我们引入了一种基于证据驱动的贝叶斯物理信息神经网络公式，能够自动优化偏微分方程残差、边界条件和观测数据之间的损失权重。与现有基于采样或变分推理的贝叶斯PINN方法不同，所提方法使用拉普拉斯近似解析计算模型证据，从而无需后验采样即可实现高效的超参数调优和模型比较。我们在热方程、波动方程和伯格斯方程上演示了该方法，获得了与精确解或参考解一致的结果。在伯格斯方程示例中，我们进一步展示了该框架自然地整合了控制方程和含噪声测量中的信息，在统一的贝叶斯框架内提供了预测不确定性。

英文摘要

We introduce an evidence-driven Bayesian formulation of physics-informed neural networks that enables automatic optimization of loss weights between PDE residuals, boundary conditions, and observational data. Unlike existing Bayesian PINN approaches based on sampling or variational inference, the proposed method uses a Laplace approximation to compute model evidence analytically, enabling efficient hyperparameter tuning and model comparison without posterior sampling. We demonstrate the method on the heat, wave, and Burgers' equations, obtaining solutions in agreement with exact or reference results. In the Burgers' equation example, we further show that the framework naturally integrates information from governing equations and noisy measurements, providing predictive uncertainties within a unified Bayesian setting.

URL PDF HTML ☆

赞 0 踩 0

2306.10356 2026-05-29 cs.LG cs.AI eess.SP 版本更新

MATNet: Multi-Level Fusion Transformer-Based Model for Day-Ahead PV Generation Forecasting

MATNet：基于多层级融合Transformer的日前光伏发电预测模型

Matteo Tortora, Francesco Conte, Gianluca Natrella, Paolo Soda

发表机构 * Department of Naval, Electrical, Electronics ； Telecommunications Engineering, University of Genoa, Via all’Opera Pia 11a, 16145 Genoa, Italy ； Unit of Innovation, Entrepreneurship \& Sustainability, Department of Engineering, University Campus Bio-Medico of Rome Via Alvaro del Portillo 21, 00128 Rome, Italy ； Computer Systems Department of Engineering, University Campus Bio-Medico of Rome Via Alvaro del Portillo 21, 00128 Rome, Italy

AI总结提出一种基于多层级融合Transformer的多模态架构MATNet，通过多级联合融合和软注意力机制利用历史光伏数据与气象数据，在日前多步光伏发电预测中显著优于基线模型（RMSE 0.0445，相对提升约65%），并展现出对缺失数据的鲁棒性和跨域零样本泛化能力。

详情

AI中文摘要

可再生能源发电的准确预测对于促进可再生能源融入电力系统至关重要。聚焦光伏（PV）单元，预测方法主要分为基于物理和基于数据两大类，其中基于人工智能（AI）的模型提供了最先进的性能。然而，这些基于AI的模型虽然能够捕捉数据中的复杂模式和关系，却忽略了现象背后的物理先验知识。因此，本文提出MATNet，一种新颖的基于Transformer的多模态架构，用于多步日前光伏发电预测。该模型通过多层级联合融合方法输入历史光伏数据以及历史和预报气象数据，在多个融合阶段采用软注意力机制。我们在Ausgrid基准数据集上评估了MATNet的有效性，其显著优于各种基线模型，实现了0.0445的RMSE，相比表现最佳的基线方法相对提升约65%。分析进一步通过一系列消融研究、对缺失数据的敏感性分析（突显了MATNet对输入退化的鲁棒性）、在五个外部光伏数据集上的跨站点零样本泛化评估（证明了MATNet在显著域偏移下的鲁棒性）以及对模型计算复杂度的评估（确认了其在预测精度与计算效率之间的良好平衡）得到丰富。这些结果凸显了MATNet作为促进光伏能源融入电网的可靠且高效解决方案的潜力。代码可在https://github.com/arco-group/MATNet获取。

英文摘要

Accurate forecasting of renewable generation is crucial to facilitate the integration of Renewable Energy Sources into the power system. Focusing on photovoltaic (PV) units, forecasting methods can be divided into two main categories: physics-based and data-based strategies, with Artificial Intelligence (AI)-based models providing state-of-the-art performance. However, while these AI-based models can capture complex patterns and relationships in the data, they ignore the underlying physical prior knowledge of the phenomenon. Therefore, in this paper, we propose MATNet, a novel transformer-based multimodal architecture for multi-step day-ahead PV power generation forecasting. The model is fed with historical PV data and historical and forecast weather data through a multi-level joint fusion approach, employing a soft-attention mechanism at multiple fusion stages. We evaluate the effectiveness of MATNet on the Ausgrid benchmark dataset, where it significantly outperforms various baseline models, achieving an RMSE of 0.0445, corresponding to a relative improvement of approximately 65% compared to the best-performing baseline method. The analysis is further enriched by a comprehensive set of ablation studies, a sensitivity analysis on missing data, which highlights MATNet's resilience to input degradation, a cross-site zero-shot generalization evaluation on five external PV datasets, demonstrating MATNet's robustness under significant domain shifts, and an assessment of the model's computational complexity, confirming its favorable balance between predictive accuracy and computational efficiency. These results highlight MATNet's potential as a reliable and efficient solution to facilitate the integration of PV energy into the power grid. The code is available at https://github.com/arco-group/MATNet.

URL PDF HTML ☆

赞 0 踩 0