arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2154
专题追踪
2605.18899 2026-05-20 cs.LG cs.AI

Don't Let Bandit Feedback Pull Continual LLM-Recommender Updates Off Target

不要让多臂老虎机反馈将连续LLM推荐系统更新偏离目标

Taesan Kim, Hyeongjun Yun, Jaegul Choo, Chung Park

发表机构 * SK Telecom(SK电信) KAIST(韩国科学技术院)

AI总结 本文提出了一种名为Anchored Bandit Policy Optimization (ABPO)的框架,用于持续改进基于生成式大语言模型的推荐系统,通过结合组内相对策略优化(GRPO)和显式处理曝光偏差和反馈模糊性,以减少因部署日志提供的策略形状上下文老虎机反馈导致的偏差,并提高推荐准确性。

详情
AI中文摘要

基于生成式大语言模型的推荐系统(LLM-Rec)需要持续部署后的更新,但部署日志仅提供策略形状的上下文老虎机反馈:结果仅在由先前服务策略暴露的项目上被观察到,导致曝光偏差,并产生部分、不对称的信号,包括相对可靠的积极响应和模糊的无响应。我们提出了一种连续LLM-Rec更新的Anchored Bandit Policy Optimization(ABPO)框架,结合组内相对策略优化(GRPO)与显式处理曝光偏差和反馈模糊性。具体来说,我们将在每个GRPO滚动组中插入暴露的推荐作为记录的锚点,使组内相对归一化能够针对先前策略实际暴露的动作进行校准,而不是仅针对新采样的滚动。因为正响应和无响应仅通过先前策略暴露被观察到,我们对固定锚点应用自归一化逆倾向评分,以校正策略不匹配。同时,我们将两种反馈类型进行不对称处理:正响应提供相对直接的推荐信号,而无响应仍然模糊,因为它们可能反映真正的不感兴趣或未观察到的外部因素。为了避免因模糊的无响应而过于激进的更新,我们用模型输出标记的置信度来削弱其惩罚,作为无监督的可靠性信号。在Amazon Reviews和MovieLens的五个领域中,我们的方法在推荐准确性上产生了持续的更新收益,同时比先前的基线方法更有效地缓解了先前策略引起的曝光偏差。

英文摘要

Generative LLM-based recommenders (LLM-Rec) require continual post-deployment updates, yet deployment logs provide only policy-shaped contextual bandit feedback: outcomes are observed solely for items exposed by a prior serving policy, inducing exposure bias and yielding partial, asymmetric signals consisting of relatively reliable positive responses and ambiguous no-responses. We propose an Anchored Bandit Policy Optimization (ABPO) framework for continual LLM-Rec updates that combines group-relative policy optimization (GRPO) with explicit treatment of exposure bias and feedback ambiguity. Specifically, we insert the exposed recommendation as a logged anchor into each GRPO rollout group, so that group-relative normalization is calibrated against the action actually exposed by the prior policy rather than against newly sampled rollouts alone. Because both positive- and no-responses are observed only through prior-policy exposure, we apply self-normalized inverse propensity scoring to the fixed anchor for both feedback types to correct for policy mismatch. At the same time, we treat the two feedback types asymmetrically in reliability: positive responses provide relatively direct endorsement signals, whereas no-responses remain ambiguous because they may reflect either true disinterest or unobserved external factors. To avoid overly aggressive updates from ambiguous no-responses, we temper their penalties with self-certainty, using the model's output-token confidence as a verifier-free reliability signal. Across five domains from Amazon Reviews and MovieLens, our method yields consistent post-update gains in recommendation accuracy while mitigating prior-policy-induced exposure bias more effectively than prior baselines.

2605.18895 2026-05-20 cs.RO cs.AI

KG-ASG: Collision-Knowledge-Guided Closed-Loop Adversarial Scenario Generation With Primary-Support Attribution

KG-ASG: 基于碰撞知识的闭环对抗场景生成与主支持属性

Cheng Wang, Chen Xiong, Ziwen Wang, Yuchen Zhou, Qiang Liu

发表机构 * Guangdong Provincial Key Laboratory of Intelligent Transportation System, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University(广东省智能交通系统重点实验室,智能系统工程学院,中山大学深圳校区)

AI总结 本文提出KG-ASG框架,通过碰撞知识引导和主支持属性,提高自动驾驶系统安全验证的对抗有效性、可解释性和可执行性。

详情
AI中文摘要

自动驾驶系统安全验证需要高风险场景覆盖、清晰的碰撞语义、可执行轨迹和可追溯的多车辆交互。现有安全关键场景生成方法通常依赖低级轨迹扰动、碰撞代理优化或单对抗者搜索,可能产生具有模糊碰撞原因或不可控多车辆碰撞的对抗样本。本文提出KG-ASG,一种基于碰撞知识的闭环对抗场景生成框架,具有主支持属性。KG-ASG构建了结构化的碰撞知识库,并训练了一个轻量级的碰撞专家来推断目标碰撞模式、唯一的主对抗者、支持车辆及其交互角色。在该语义先验的引导下,多车辆对抗生成被公式化为主支持过程,其中主对抗者引发主要冲突,支持车辆塑造周围风险结构,而不会成为额外碰撞者。规则、物理、交互安全性和单碰撞器约束被作为硬门来过滤不可执行的样本。为处理反应性驾驶者行为,进一步使用规划器-控制器反馈进行故障诊断、候选重新排序和终端细化。在MetaDrive中重建的WOMD场景上的实验表明,KG-ASG在IDM、Cruise和Expert控制器下实现了强对抗有效性,同时提高了有效主攻击、减少了多碰撞,并获得了闭环恢复收益。这些结果表明,碰撞知识引导和主支持单碰撞器推理提高了自动驾驶安全验证的对抗有效性、可解释性和可执行性。

英文摘要

Safety validation of autonomous driving systems requires high-risk scenario coverage, clear collision semantics, executable trajectories, and attributable multi-vehicle interactions. Existing safety-critical scenario generation methods often rely on low-level trajectory perturbations, collision-proxy optimization, or single-adversary search, which may produce adversarial samples with ambiguous collision causes or uncontrolled multi-vehicle collisions. This paper proposes KG-ASG, a collision-knowledge-guided closed-loop adversarial scenario generation framework with primary-support attribution. KG-ASG constructs a structured collision knowledge base and trains a lightweight Collision Expert to infer the target collision mode, the unique primary adversary, support vehicles, and their interaction roles. Guided by this semantic prior, multi-vehicle adversarial generation is formulated as a primary-support process, where the primary adversary induces the main conflict and support vehicles shape the surrounding risk structure without becoming additional colliders. Rule, physical, interaction-safety, and single-collider constraints are imposed as hard gates to filter non-executable samples. To handle reactive ego behaviors, planner-controller feedback is further used for failure diagnosis, candidate re-ranking, and terminal refinement. Experiments on WOMD scenarios reconstructed in MetaDrive show that KG-ASG achieves strong adversarial effectiveness while improving Valid Primary Attack, reducing multi-collision, and obtaining closed-loop recovery gains under IDM, Cruise, and Expert controllers. These results demonstrate that collision-knowledge guidance and primary-support single-collider reasoning improve adversarial effectiveness, interpretability, and executability for autonomous driving safety validation.

2605.18892 2026-05-20 cs.LG cs.AI cs.DC

Data-Free Client Contribution Estimation via Logit Maximization for Federated Learning

通过Logit最大化实现无数据的客户端贡献估计用于联邦学习

Asim Ukaye, Nurbek Tastan, Mubarak Abdu-Aguye, Karthik Nandakumar

发表机构 * MBZUAI, Abu Dhabi, UAE(MBZUAI,阿布扎赫德,阿联酋) Michigan State University, Michigan, USA(密歇根州立大学,密歇根,美国)

AI总结 本文提出了一种基于Logit最大化的无数据客户端贡献估计和聚合框架CELM,该框架无需共享原始数据、客户端元数据或辅助公开数据,通过客户端更新获取类别证据分数并构建跨客户端证据矩阵,以量化每类的竞争力和类别覆盖范围,从而计算出对少数类提供强判别性证据的客户端贡献权重,提高联邦学习的鲁棒性和性能。

Comments 22 pages, 7 figures

详情
AI中文摘要

联邦学习(FL)使计算机视觉模型能够协同学习,其中隐私和监管限制防止在设备或组织之间集中数据。然而,实际的FL部署往往表现出严重的类别不平衡和标签偏斜,导致标准聚合协议过度拟合主导客户端并降级少数类性能。我们提出了一种基于Logit最大化的无数据、按类别贡献估计和聚合框架(CELM),该框架不需要共享原始数据、客户端元数据或辅助公开数据。FL服务器通过客户端更新获取类别证据分数,并构建跨客户端证据矩阵,该矩阵量化了每类的竞争力和类别覆盖范围。使用该矩阵,我们计算出贡献权重,以提升为少数类提供强判别性证据的客户端的权重。所得到的聚合是稳定的,由于简单约束和动量平滑,且与标准FL训练流水线保持兼容。我们在受控的非独立同分布和病理标签分割的代表性视觉基准上评估了该方法,证明CELM基于的聚合提高了对不平衡和统计异质性的鲁棒性,同时在不需任何额外数据交换的情况下实现了更好的性能。

英文摘要

Federated learning (FL) enables collaborative learning of computer vision models, where privacy and regulatory constraints prevent centralizing data across devices or organizations. However, practical FL deployments often exhibit severe class imbalance and label skew, causing standard aggregation protocols to overfit dominant clients and degrade minority-class performance. We propose a data-free, class-wise contribution estimation and aggregation framework based on logit maximization (CELM) that does not require sharing raw data, client metadata, or auxiliary public datasets. The FL server probes client updates to obtain class-wise evidence scores and assembles a cross-client evidence matrix, which quantifies both per-class competence and class coverage. Using this matrix, we compute contribution weights that upweight clients providing strong, discriminative evidence for underrepresented classes. The resulting aggregation is stable due to simplex constraints and momentum smoothing, and it remains compatible with standard FL training pipelines. We evaluate the approach on representative vision benchmarks under controlled non-IID and pathological label splits, demonstrating that CELM-based aggregation improves robustness to imbalance and statistical heterogeneity, while yielding better performance without requiring any additional data exchange.

2605.18891 2026-05-20 cs.LG cs.AI

Auditing Reasoning-Trace Memorization Claims after Unlearning with Head-Conditioned Canaries

在取消学习后使用头部条件化的候鸟审计推理轨迹记忆化声明

Yanhang Li, Zhichao Fan, Zexin Zhuang

发表机构 * Northeastern University, USA(东北大学) University of Illinois Urbana-Champaign, USA(伊利诺伊大学厄巴纳-香槟分校) Southern Methodist University, USA(南方 Methodist 大学)

AI总结 该研究通过在DeepSeek-R1-Distill-Qwen-7B上使用LoRA记忆化的虚构作者和NPO取消学习,结合六token候鸟头部条件,审计推理轨迹记忆化声明,发现正向解析器拆分绕过间隙本身并不能识别隐藏的权重级记忆化,也不能排除其存在。

详情
AI中文摘要

对推理模型的取消学习评估有时会显示绕过模式。答案侧看起来已取消学习,但模型自身的推理轨迹仍会发出遗忘内容,这种差距被当作证据表明权重仍记忆。我们使用LoRA记忆化的虚构作者和NPO取消学习,在六token候鸟头部条件下审计此阅读。在一种种子下,用相同的权重交换推理轨迹为短非候鸟预填,答案率下降幅度等于绕过间隙本身,无论预填是否模仿训练模板。在第二种种子下,绕过间隙缩小而非消失,预填交换方向反转并使答案率达到上限。正向解析器拆分绕过间隙本身并不能识别隐藏的权重级记忆化,也不能排除其存在。在不同的distillate中,相同指标因解析器无法找到闭合标签而改变符号。我们推荐在解码时进行模板交换作为廉价的合理性检查,与传统审计并行。

英文摘要

Evaluations of unlearning on reasoning models sometimes show a bypass pattern. The answer side looks unlearned, but the model's own thinking trace keeps emitting the forgotten content, and the gap is taken as evidence that the weights still remember. We audit this reading on DeepSeek-R1-Distill-Qwen-7B with LoRA-memorized fictional authors and NPO unlearning, conditioned on a six-token canary head. On one seed, swapping the thinking trace for a short non-canary prefill on the same weights drops the answer rate by as much as the bypass gap itself, whether the prefill mimics the training template or not. On a second seed the bypass gap shrinks rather than vanishing, and the prefill swap reverses direction and brings the answer rate to ceiling. A positive parser-split bypass gap thus does not by itself identify hidden weight-level memorization, and does not rule it out either. On a different distillate the same metric flips sign because the parser cannot find the closing tag. We recommend a decode-time template swap as a cheap sanity check alongside the canonical audit.

2605.18889 2026-05-20 cs.LG cs.AI

Soft Learning

软学习

Mohammed Aledhari, Ali Aledhari, Fatimah Aledhari, Mohamed Rahouti

发表机构 * University of North Texas(北卡罗来纳州立大学) Fordham University(福尔特姆大学)

AI总结 本文提出软学习框架,通过交叉验证非负最小二乘法发现最优组合权重,实现比深度网络快数十倍的训练速度,同时具备内在可解释性和未来扩展性,优于多种方法,在70%的任务上排名第一。

详情
AI中文摘要

现代机器学习迫使从业者在强大的但昂贵的深度网络和快速但有限的经典算法之间做出选择。本文介绍了软学习,一个维护异质专家库的框架,涵盖线性模型、树集成、核机和神经网络,并通过交叉验证非负最小二乘法发现可证明最优的组合权重。软学习保证能匹配或超过其专家的最佳加权组合,仅在CPU上训练速度比深度网络快两到三个数量级(72-435倍,取决于测试配置),通过学习的权重提供内在可解释性,揭示哪种算法范式最适合数据,并且具有未来保障性:添加专家能保证性能维持或提升。在37个数据集(25个分类,12个回归)上,针对包括CatBoost和调优深度网络在内的九种方法,软学习在70%的任务上排名第一,获得最佳平均排名(Friedman检验,p=1.12×10^-12),并且是唯一同时在分类和回归上均表现优异的方法,无需GPU硬件或超参数调优。这些结果表明从“哪种算法最好?”到“什么是有证明最优的组合?”的范式转变,软学习通过正式保证回答任何数据模态的问题。

英文摘要

Modern machine learning forces practitioners to choose between powerful but expensive deep networks and fast but limited classical algorithms. Here we introduce Soft Learning, a framework that maintains a library of heterogeneous specialists -- spanning linear models, tree ensembles, kernel machines, and neural networks -- and discovers provably optimal combination weights through cross-validated non-negative least squares. Soft Learning is guaranteed to match or exceed the best weighted combination of its specialists, trains over two orders of magnitude faster than deep networks on CPU alone (72-435x faster across tested configurations), provides inherent interpretability through learned weights that reveal which algorithmic paradigm best fits the data, and is future-proof: adding specialists is mathematically guaranteed to maintain or improve performance. Across 37 datasets (25 classification, 12 regression) against nine methods including CatBoost and tuned deep networks, Soft Learning ranks first on 70% of tasks, achieves the best mean rank (Friedman test, p = 1.12 x 10^-12), and is the only method to simultaneously excel at both classification and regression -- all without GPU hardware or hyperparameter tuning. These results suggest a paradigm shift from "which algorithm is best?" to "what is the provably optimal combination?" -- a question Soft Learning answers with formal guarantees for any data modality.

2605.18884 2026-05-20 cs.LG cs.CV

Navigating the Emotion Tree: Hierarchical Hyperbolic RAG for Multimodal Emotion Recognition

在情绪树中导航:用于多模态情绪识别的分层双曲RAG

Zeheng Wang, Bo Zhao, Yijie Zhu, Zhishu Liu, Hui Ma, Ruixin Zhang, Shouhong Ding, Qianyu Xie, Zitong Yu

发表机构 * Great Bay University(广东东莞大亚湾大学) Tencent Youtu Lab(腾讯优图实验室)

AI总结 本文提出HyperEmo-RAG,一种利用结构化情绪知识库的检索增强生成框架,通过双曲空间嵌入和证据图构建来提升多模态情绪识别的性能。

详情
AI中文摘要

多模态情绪识别旨在整合文本、音频和视频源以理解人类情感状态。尽管多模态大语言模型在多模态推理方面表现优异,但通常将情绪类别视为独立标签,忽略了人类心理的丰富层次分类。此外,缺乏外部上下文知识使它们容易过度解释噪声线索,进一步复杂化细粒度情绪分类。为了解决这些问题,我们提出了HyperEmo-RAG,一种检索增强生成框架,利用结构化情绪知识库。我们的框架引入了两个关键创新。1)层次双曲 grounding。认识到情绪分类的内在层次树结构,我们将层次情绪标签和多模态样本嵌入到连续双曲空间(Poincaré球)中,并设计了层次束搜索 deliberation 过程,逐步从粗粒度到细粒度级别检索样本。2)结构化证据注入。基于检索到的证据,我们构建证据图,并通过Tree-Aware Attention机制和EmotionGraphFormer将结构化知识作为显式认知上下文注入LLM中,保持图结构信息的完整性。在多个数据集上的实验表明,HyperEmo-RAG显著优于现有方法。

英文摘要

Multimodal emotion recognition aims to integrate text, audio, and video sources to understand human affective states. Although multimodal large language models excel at multimodal reasoning, they typically treat emotion categories as independent labels, ignoring the rich hierarchical taxonomy of human psychology. Moreover, lacking external contextual knowledge makes them highly susceptible to over-interpreting noisy cues, further complicating fine-grained emotion classification. To address these issues, we propose \textbf{HyperEmo-RAG}, a retrieval-augmented generation framework that leverages a structured emotional knowledge base. Our framework introduces two key innovations. 1) Hierarchical hyperbolic grounding. Recognizing the inherent hierarchical tree structure of emotion taxonomies, we jointly embed hierarchical emotion labels and multimodal samples into a continuous hyperbolic space (Poincaré ball) and design a hierarchical beam-search deliberation process that progressively retrieves samples from coarse to fine-grained levels. 2) Structured evidence injection. Based on the retrieved evidence, we construct an evidence graph and inject the structured knowledge as explicit cognitive context into the LLM through a Tree-Aware Attention mechanism and an EmotionGraphFormer, preserving the integrity of graph-structured information. Experiments on multiple datasets demonstrate that HyperEmo-RAG significantly outperforms existing methods.

2605.18883 2026-05-20 cs.LG cs.AI

Prediction Is Not Physics: Learning and Evaluating Conserved Quantities in Neural Simulators

预测并非物理:在神经模拟器中学习和评估守恒量

Andrew Bukowski, Aditya Kothari, Simba Shi, Ishir Rao

发表机构 * Yale University(耶鲁大学)

AI总结 本文研究了神经网络能否从物理轨迹中学习或选择全局守恒量,通过三个哈密顿系统(抛体运动、单摆和弹簧-质量系统)验证了不同模型在守恒律保持方面的性能,发现黑盒CDN在加入时间一致性损失时表现更优,而多项式CDN对训练配置敏感。

Comments 10 pages

详情
AI中文摘要

训练在哈密顿轨迹上的扩散模型可以达到接近10^-3的滚动MSE,但其能量的标准差比地面真实能量的标准差大7500到36000倍,表明未能保持守恒定律。这一差距促使我们提出核心问题:神经网络能否从物理轨迹中学习或选择全局守恒量?我们研究了三个哈密顿系统:抛体运动、单摆和弹簧-质量系统。我们使用了结构化的T(v)+V(q)能量模型、黑盒守恒发现网络(CDN)、多项式CDN以及条件扩散基线。结构化网络在干净数据上对分析能量的R²≥0.9999,而黑盒CDN在训练时加入时间一致性损失和小的对齐损失(λ_align=0.2)时,R²≥0.996。当λ_align=0时,CDN在单摆和弹簧-质量系统上Pearson R²崩溃(<10^-3),表明仅靠时间一致性无法可靠地识别真实能量。在1%的加性高斯噪声下,CDN在抛体和弹簧-质量系统上优于结构化模型,表明CDN可能在该设置下对噪声输入更鲁棒。然而,多项式CDN对训练配置敏感:在单摆系统上短训练计划下R²=0.78,但通过更多训练时间和数据可以达到R²=0.9998,无论是否加入噪声。

英文摘要

A diffusion model trained on Hamiltonian trajectories can achieve rollout MSE near $10^{-3}$, but the standard deviation of its energy over time is between 7500 and 36000 times larger than the ground-truth energy standard deviation, indicating a failure to preserve conservation laws. This gap motivates our central question of whether neural networks can learn or select globally conserved quantities from physical trajectories. We investigate this across three Hamiltonian systems: projectile motion, pendulum, and spring-mass. We use a structured $T(v)+V(q)$ energy model, a black-box Conservation Discovery Network (CDN), a polynomial CDN, and a conditional diffusion baseline. The structured network reaches $R^2 \geq 0.9999$ against analytical energy on clean data, while the black-box CDN reaches $R^2 \geq 0.996$ when trained with temporal consistency plus a small alignment loss to analytical energy at $t=0$ ($λ_{\mathrm{align}}=0.2$). With $λ_{\mathrm{align}}=0$, CDN Pearson $R^2$ collapses on pendulum and spring-mass ($< 10^{-3}$), showing that temporal consistency alone is not enough to reliably identify the true energy. Under $1\%$ additive Gaussian noise, the CDN outperforms the structured model on the projectile and spring-mass systems, suggesting that the CDN may be more robust to noisy inputs in this setting. However, the polynomial CDN is sensitive to training configuration: it achieves $R^2=0.78$ under a short training schedule on the pendulum system, but reaches $R^2=0.9998$ with more training time and data, regardless of whether noise is added.

2605.18882 2026-05-20 cs.LG cs.AI

To Call or Not to Call: Diagnosing Intrinsic Over-Calling Bias in LLM Agents

叫还是不叫:诊断LLM代理中的内在过度调用偏差

Wei Shi, Ziheng Peng, Sihang Li, Xiting Wang, Xiang Wang, Mengnan Du, Na Zou

发表机构 * Shanghai Jiao Tong University(上海交通大学) Shanghai Artificial Intelligence Laboratory(上海人工智能实验室) Renmin University of China(中国人民大学) The Chinese University of Hong Kong Shenzhen(香港中文大学(深圳)) University of Science and Technology of China(中国科学技术大学)

AI总结 本文研究了LLM代理中过度调用现象,提出内在偏差假说,通过稀疏自编码器恢复行为对齐的特征基,减少到带符号激活边距,并估计偏移量,从而修正过度调用问题。

详情
AI中文摘要

LLM代理表现出一种一致的倾向,即在不需要工具的情况下也频繁调用工具。在When2Call基准测试中,三个家族的六个模型显示出较高的调用准确性,但调用准确性远低于不调用准确性,导致总体准确性在55%-70%之间。我们将其归因于内在偏差假说(IBH):调用/不调用决策映射具有激活无关的调用偏移,因此模型在激活平衡时仍倾向于调用。使用稀疏自编码器(SAEs),我们恢复了与调用/不调用决策对齐的特征基,将其减少到带符号激活边距,并直接估计偏移量。在所有六个模型中,只有当不调用激活超过调用激活时,模型才是决策中性的,这与IBH一致。然后,我们通过自适应边距校准引导(AMCS)进行因果测试,这是一种沿SAE解码器方向的闭合形式反偏移。消除诊断出的偏移量可以减轻过度调用并提高总体准确性,同时调用准确性下降很小。我们的工作将过度调用从经验现象转变为可以进行因果修正的机制性对象。代码可在https://github.com/SKURA502/agent-sae/上获取。

英文摘要

LLM agents exhibit a consistent tendency to over-call, invoking tools even in situations where none is needed. On the When2Call benchmark, six models from three families show high call accuracy but much lower no-call accuracy, leaving overall accuracy in the 55%-70% range. We trace this to an Intrinsic Bias Hypothesis (IBH): the call/no-call decision mapping carries an activation-independent call offset, so the model favors call even at activation parity. Using Sparse Autoencoders (SAEs), we recover behavior-aligned feature bases for the call/no_call decision, reduce them to a signed activation margin, and estimate the offset directly. Across all six models, the model is decision-neutral only when no_call activation outweighs call activation, consistent with IBH. We then causally test IBH with Adaptive Margin-Calibrated Steering (AMCS), a closed-form counter-bias shift along SAE decoder directions. Cancelling the diagnosed offset mitigates over-calling and improves overall accuracy with a negligible drop in call accuracy. Our work recasts over-calling from an empirical phenomenon into a mechanistic object amenable to causal correction. Code is available at https://github.com/SKURA502/agent-sae/.

2605.18881 2026-05-20 cs.LG physics.flu-dyn

Emergence of a Flow-Assisted Casting Strategy for Olfactory Navigation via Memory-Augmented Reinforcement Learning

气味导航中通过记忆增强强化学习的流辅助铸造策略的出现

Changxu Zhao, Dongxiao Zhao, Xin Bian, Gaojin Li

发表机构 * State Key Laboratory of Ocean Engineering, School of Ocean and Civil Engineering, Shanghai Jiao Tong University, Shanghai 200240, People’s Republic of China(海洋工程国家重点实验室,海洋与土木工程学院,上海交通大学,上海200240,中华人民共和国) State Key Laboratory of Fluid Power and Mechatronic Systems, Department of Engineering Mechanics, Zhejiang University, Hangzhou 310027, People’s Republic of China(流体动力与机械系统国家重点实验室,工程力学系,浙江大学,杭州310027,中华人民共和国)

AI总结 研究通过记忆增强强化学习探讨了在动态流场中动物如何利用记忆长度和流条件优化气味搜索效率,发现智能体通过自适应调整搜索轨迹几何形状和启动铸造的浓度阈值来最大化成功概率。

详情
AI中文摘要

在动态流场中,尽管依赖随机检测,各种动物表现出显著的气味搜索能力。有趣的是,存在一个最佳时间窗口,可以整合这些检测以最大化搜索效率。为了理解其内在机制,我们研究了在不稳定的流中,不同记忆长度和流条件下的强化学习(RL)智能体的导航性能。在没有任何预定义模型的情况下,智能体发展出一种流辅助的铸造策略,并自适应地调整其搜索轨迹的几何形状和启动铸造的浓度阈值以最大化成功率。智能体朝气味源的平均速度对记忆长度表现出非单调依赖性,这可以由“扇区搜索”模型解释。

英文摘要

In dynamic flow fields, various animals exhibit remarkable odor search capabilities despite relying on stochastic detections. Interestingly, there exists an optimal time window for integrating these detections that maximizes search efficiency. To understand the underlying mechanism, we investigate the navigation performance of Reinforcement Learning (RL) agents in unsteady flows under varying memory lengths and flow conditions. Without any predefined models, the agents develop a flow-assisted casting strategy and adaptively adjust both the geometry of their search trajectories and the concentration threshold for initiating casting to maximize the success rate. The agent's average speed toward the odor source exhibits a non-monotonic dependence on memory length, which can be explained by the "sector-search" model.

2605.18880 2026-05-20 cs.LG cs.CV q-bio.QM

A Multi-Dimensional Clustering Approach for Identifying Inborn Errors of Immunity

一种多维聚类方法用于识别先天性免疫缺陷

Nishad Kulkarni, Alexandra K. Martinson, Nicholas L. Rider, Michael Keller, Syed Muhammad Anwar

发表机构 * Sheikh Zayed Institute for Pediatric Surgical Innovation, Children’s National Hospital, Washington, DC(Sheikh Zayed儿童外科创新研究所,儿童医院,华盛顿特区) Childrens National Hospital, Washington, DC(儿童医院,华盛顿特区) Department of Health Systems & Implementation Science, Division of Allergy & Immunology Virginia Tech Carilion School of Medicine, Roanoke, VA(健康系统与实施科学部门,过敏与免疫学分会弗吉尼亚理工大学Carilion医学院,罗阿诺克,VA) Division of Allergy & Immunology Childrens National Hospital, Washington, DC(过敏与免疫学分会儿童医院,华盛顿特区) School of Medicine and Health Sciences, George Washington University, Washington, DC(医学与健康科学学院,乔治华盛顿大学,华盛顿特区)

AI总结 本文提出一种多维聚类方法,用于从全国数据注册中识别新的罕见疾病模式并提取与先天性免疫缺陷相关的特征,通过改进IEI特征意识和开发罕见疾病人群分析的数据工具包,扩展了复杂医疗记录到可被无监督ML解释的数据结构。

Comments Accepted at EMBC 2026

详情
AI中文摘要

先天性免疫缺陷(IEI)等罕见疾病需要早期诊断以防止终器官损伤并提高生活质量。获取和整理大规模电子健康记录(EHR)数据的障碍限制了常规数据驱动分析保持在IEI和其他罕见疾病趋势的前沿。在IEI中开发机器学习(ML)算法进行模式识别以及已发表的方法研究如何系统地处理和整合复杂医疗数据有限。我们提出的流程,包括数据整理和ML聚类算法,旨在识别新的罕见疾病模式并从全国数据注册中提取IEI相关的特征。我们的EHR数据格式化和处理方法提出了一个流程,将原始免疫学实验室数据转换为向量。这进一步结合了通过聚类进行疾病模式识别的超参数调优。本研究改进了IEI特征意识,开发了罕见疾病人群分析的数据工具包,并扩展了将复杂医疗记录转换为可被无监督ML解释的数据结构。

英文摘要

Rare diseases such as inborn errors of immunity (IEI) require early diagnosis to prevent end organ damage and improve quality of life. Hurdles in accessing and curating large scale electronic health record (EHR) data limit routine data driven analyses to remain on the forefront of IEI and other rare disease trends. Development of machine learning (ML) algorithms in IEI for pattern recognition as well as published methodology examining how to systematically process and integrate complex medical data is limited. Our proposed pipeline, including data curation and ML clustering algorithms, is designed to recognize novel rare disease patterns and extract IEI- associated features from a national data registry. Our methodology for EHR data formatting and processing presents the pipeline that transforms raw immunologic lab data into vectors. This is further combined with hyperparameter tuning for diseases pattern recognition via clustering. This study refines IEI feature awareness, develops data tool kits for rare disease populations analysis, and expands on transforming complex medical records in data structures interpretable by unsupervised ML.

2605.18872 2026-05-20 cs.LG cs.AI cs.RO

EUPHORIA: Efficient Universal Planning via Hybrid Optimization for Robust Industrial Robotic Assembly

EUPHORIA: 通过混合优化实现高效通用规划以实现稳健的工业机器人装配

Shih-Yu Lai, Chia-Ching Yen, Yang-Ting Shen, Peter Yichen Chen, Yu-Lun Liu, Bing-Yu Chen

发表机构 * National Taiwan University(国立台湾大学) MoonShine Animation Studio(MoonShine动画工作室) National Cheng Kung University(国立成功大学) The University of British Columbia(不列颠哥伦比亚大学) National Yang Ming Chiao Tung University(阳明交通大学)

AI总结 本文提出EUPHORIA框架,通过混合优化策略实现通用少样本适应和动态效率,解决建筑机器人装配中规划器高度专业化和操作低效的问题,结合元几何编码器、物理引导图变压器和残差稳定性校正等方法,实现高效且鲁棒的装配规划。

详情
AI中文摘要

建筑机器人装配面临持续瓶颈:现有规划器要么高度专业化,需要每次新几何设计都进行昂贵的再训练,要么操作低效,将结构序列和运动学运动视为独立过程。我们提出了EUPHORIA,一个统一框架,通过混合优化策略实现通用少样本适应和动态效率。为克服再训练瓶颈,我们提出了基于图超网络的元几何编码器:不同于标准对比学习仅在特征级识别,我们的超网络动态从最小支持集中生成策略参数,使参数级适应复杂拓扑(如穹顶、拱门)而无需基于梯度的再训练。对于结构推理,我们引入了通过软演员-评论家(SAC)训练的物理引导图变压器,其物理偏置注意力机制通过离散元模型(DEM)模拟的接触力调节注意力分数,引导规划器朝向结构关键连接。我们进一步通过运动学感知序列确保操作效率,其中SAC目标惩罚高能转换。最后,我们通过残差稳定性校正弥合仿真到现实的差距,这是一种可微优化层,通过最小化联合能量-稳定性成本优先级来微调粗略装配动作。实验表明,EUPHORIA显著减少了与解耦基线相比的能量消耗,并在未见的非标准几何上实现了最先进的成功率,通过融合元学习、物理引导注意力和残差优化,实现一个连贯的通用规划器。

英文摘要

Robotic assembly in architectural construction faces a persistent bottleneck: existing planners are either highly specialized, requiring prohibitive retraining for every new geometric design, or operationally inefficient, treating structural sequencing and kinematic motion as disjoint processes. We present EUPHORIA, a unified framework that achieves universal few-shot adaptability and dynamic efficiency through a hybrid optimization strategy. To overcome the retraining bottleneck, we propose a Meta-Geometric Encoder based on Graph Hypernetworks: unlike standard contrastive learning, which performs only feature-level recognition, our hypernetwork dynamically generates policy parameters from a minimal support set, enabling parameter-level adaptation to complex topologies (e.g., domes, arches) without gradient-based retraining. For structural reasoning, we introduce a Physics-Informed Graph Transformer trained via Soft Actor-Critic (SAC), with a Physics-Bias Attention mechanism that modulates attention scores using contact forces from Discrete Element Model (DEM) simulations, guiding the planner toward structurally critical connections. We further ensure operational efficiency through Kinematics-Aware Sequencing, where the SAC objective penalizes high-energy transitions. Finally, we bridge the Sim2Real gap via Residual Stability Correction, a differentiable optimization layer that fine-tunes coarse assembly actions by minimizing a joint energy-stability cost prior to execution. Experiments show that EUPHORIA significantly reduces energy consumption over decoupled baselines and achieves state-of-the-art success rates on unseen, non-standard geometries with minimal few-shot examples, fusing meta-learning, physics-informed attention, and residual optimization into a cohesive, generalized planner.

2605.18871 2026-05-20 cs.LG cs.AI

Distributional Energy-Based Models for Uncertainty-Aware Structured LLM Reasoning

基于不确定性感知的结构LLM推理的分布能量模型

Shireen Kudukkil Manchingal, Abhey Kalia, Fernanda Gonçalves, Shebin Rawther

发表机构 * Oxford Dynamics Harwell Science and Innovation Campus(牛津动力学哈威尔科学与创新校园)

AI总结 本文提出了一种分解的能量函数,结合了学习的质量评分器和确定性分析约束惩罚,用于验证结构LLM输出。该方法通过两步推理循环触发目标再生或 abstention,能够在多个基准测试中超越单次Qwen-72B,并减少约束违反。

详情
AI中文摘要

当大型语言模型生成结构化输出如旅行计划、代码解决方案或多步证明时,个别推理步骤可能正确,但整体输出可能违反预算、失败测试用例或与先前推论矛盾。我们提出了一种分解的能量函数,结合了学习的质量评分器和确定性分析约束惩罚,用于验证结构LLM输出。质量评分器是单个冻结编码器上的异构集合,包含低秩适配器(3%可训练参数);集合均值对候选者进行排名,标准差量化epistemic不确定性,驱动一个两步推理循环,触发目标再生或 abstention。在五个基准测试(GSM8K、MuSR、TravelPlanner、TACO、Knights & Knaves)中,我们的149M参数验证器协调一个7-26B开放生成器池,在每个基准测试中均优于单次Qwen-72B,与Claude Sonnet 4.6在MuSR上匹配(67.7% vs. 68.0%),并且在TravelPlanner上将约束违反减少53%(相对于Opus 4.6,oracle 0.028,随机 0.231)。两种方法是互补的:结构验证在约束可检查时获胜(验证器捕捉信号前沿模型无法自我检测),而预训练规模先验在不可检查时获胜(叙述推理、代码语义)。跨数据集的混淆分析确认在四个推理任务上确实存在质量区分,并识别出代码中的模型身份捷径,通过最后一层重新训练得以缓解。评分器在困难数据上训练后可实现零样本转移:一个MuSR训练的评分器在没有看到数学问题的情况下在GSM8K上达到93.9%。

英文摘要

When Large Language Models produce structured outputs such as travel plans, code solutions, or multi-step proofs, individual reasoning steps may appear correct while the output as a whole violates budgets, fails test cases, or contradicts earlier deductions. We propose a decomposed energy function that combines a learned quality scorer with deterministic analytical constraint penalties for verifying structured LLM outputs. The quality scorer is a heterogeneous ensemble of low-rank adapters on a single frozen encoder (3% trainable parameters); the ensemble mean ranks candidates while the standard deviation quantifies epistemic uncertainty, driving a two-pass inference loop that triggers targeted regeneration or abstention. Across five benchmarks (GSM8K, MuSR, TravelPlanner, TACO, Knights & Knaves), our 149M-parameter verifier orchestrating a pool of 7-26B open generators outperforms single-shot Qwen-72B on every benchmark, matches Claude Sonnet 4.6 on MuSR (67.7% vs. 68.0%), and reduces constraint violations by 53% relative to Opus 4.6 on TravelPlanner (oracle 0.028, random 0.231). The two routes are complementary: structural verification wins when constraints are checkable (the verifier captures signal frontier models cannot self-detect), while pretraining-scale priors win where they are not (narrative inference, code semantics). A cross-dataset confounding analysis confirms genuine quality discrimination on four reasoning tasks and identifies a model-identity shortcut on code, mitigated via last-layer retraining. Scorers trained on difficult data transfer zero-shot: a MuSR-trained scorer achieves 93.9% on GSM8K without seeing a math problem.

2605.18869 2026-05-20 cs.LG cs.AI cs.NE

MO-CAPO: Multi-Objective Cost-Aware Prompt Optimization

MO-CAPO:多目标成本感知提示优化

Jan Büssing, Moritz Schlager, Timo Heiß, Tom Zehle, Matthias Feurer

发表机构 * Technical University of Munich (TUM), Munich Center for Machine Learning (MCML)(慕尼黑工业大学(TUM)、慕尼黑机器学习中心(MCML)) LMU Munich, Munich Center for Machine Learning (MCML)(慕尼黑大学(LMU)、慕尼黑机器学习中心(MCML)) University of Freiburg, ELLIS Institute Tübingen(弗赖堡大学、图宾根ELLIS研究所) TU Dortmund University, Lamarr Institute for Machine Learning(多特蒙德工业大学、拉马尔机器学习与人工智能研究所)

AI总结 本文提出MO-CAPO,一种多目标提示优化算法,同时优化性能和推理成本,并通过预算分配实现高效优化,通过评估四个任务和三个LLM,证明其在噪声R2指标上优于NSGA-II基线,并在较低预算下达到竞争性性能。

详情
AI中文摘要

大型语言模型(LLMs)在广泛的任务上表现出色,但对提示设计高度敏感,促使需要自动提示优化。现有方法主要关注性能,忽略竞争目标如推理成本或延迟。同时,现有多目标提示优化工作依赖于现成的NSGA-II,忽略优化效率。为此,我们引入MO-CAPO,一种新的多目标提示优化算法,同时优化性能和推理成本,利用预算分配实现成本高效的优化。我们进一步提出一个面向部署的成本目标,捕捉LLM推理的完整计算概况。我们评估了我们的方法在四个任务和三个LLM上的表现,并将其与基于NSGA-II的多目标方法和最先进的单目标提示优化器进行比较。结果表明,MO-CAPO一致地识别出强、稳健和多样的Pareto前沿近似,同时保持成本效率。它在12种情况中的8种情况下在噪声R2指标上优于NSGA-II基线,并且在显著较低的预算下常能达到竞争性性能。发现的解决方案集涵盖了被单目标优化器遗漏的多样化性能-成本权衡,但顶级性能候选者仍与单目标解决方案竞争。此外,我们进行了首次多目标机器学习实验的评估,考虑了泛化和鲁棒性通过噪声R2和近似间隙,使解决方案质量的评估更加现实。MO-CAPO使从业者能够从高效发现的多个提示中选择,这些提示提供不同的性能和成本权衡。

英文摘要

Large language models (LLMs) achieve strong performance across a wide range of tasks but are highly sensitive to prompt design, motivating the need for automatic prompt optimization. Existing methods predominantly focus on performance alone, ignoring competing objectives such as inference cost or latency. At the same time, existing work on multi-objective prompt optimization relies on off-the-shelf NSGA-II, ignoring optimization efficiency. As a remedy, we introduce MO-CAPO, a novel multi-objective prompt optimization algorithm that jointly optimizes performance and inference cost while leveraging budget allocation for cost-efficient optimization. We further propose a deployment-oriented cost objective that captures the full computational profile of LLM inference. We evaluate our approach across four tasks and three LLMs and compare it to an NSGA-II-based multi-objective method and state-of-the-art single-objective prompt optimizers. Results show that MO-CAPO consistently identifies strong, robust, and diverse Pareto front approximations while maintaining cost-efficiency. It outperforms the NSGA-II baseline on 8 out of 12 cases in terms of the noisy R2 metric and achieves competitive performances often already at a considerably lower budget. The discovered solution sets span diverse performance-cost trade-offs that are omitted by single-objective optimizers, yet the top-performance candidates remain competitive with single-objective solutions. Additionally, we conduct the first evaluation of multi-objective machine learning experiments that considers generalization and robustness through noisy R2 and approximation gap, enabling a more realistic assessment of solution quality. MO-CAPO enables practitioners to select from an efficiently discovered set of multiple prompts offering different trade-offs between performance and cost.

2605.18867 2026-05-20 cs.LG cs.AI

EVA-0: Test-Time Model Evolution with Only Two Forward Passes per Sample

EVA-0: 仅两次前向传递的测试时间模型演化

Guohao Chen, Shuaicheng Niu, Geng Li, Yunbei Zhang, Shilin Shan, Chunyan Miao, Jianfei Yang

发表机构 * Nanyang Technological University(南洋理工大学) Tulane University(路易斯安那州立大学)

AI总结 本文研究了在仅两次前向传递预算下测试时间模型演化的问题,提出EVA-0框架以解决零阶优化中的三个关键障碍,实现高效部署。

详情
AI中文摘要

测试时间模型演化为部署模型提供了一种改进 unlabeled 测试时间经验的有前景方法,但大多数现有方法依赖反向传播(BP),这导致了显著的内存开销,使它们难以在边缘设备、量化模型、专用加速器或黑盒模型上部署。在本文中,我们研究了在严格两次前向预算下测试时间模型演化,这一设置推动了适应向高度高效的现实部署发展。我们揭示了零阶测试时间优化中的三个关键障碍:对捷径解的易感性、不受控的权重漂移和无效的更新方向估计。为克服这些问题,我们提出了EVA-0,一个最小的零阶适应框架,其特点包括:1)保持损失尺度不变以防止捷径解;2)设计了锚点引导的优化策略以缓解权重漂移;3)使用样本级对称双侧扰动进行更新方向估计和推理。EVA-0不需要BP,并且在每个样本上仅需两次前向传递即可完成推理和适应。在ImageNet-C和ViT-Base上的结果表明,EVA-0优于基于BP的DeYO和无BP的FOA,并在FOA上实现了14倍的速度提升。代码将被发布。

英文摘要

Test-time model evolution offers a promising way for deployed models to improve from unlabeled test-time experience, yet most existing methods depend on backpropagation (BP), which incurs substantial memory overhead and makes them difficult to deploy on edge devices, quantized models, specialized accelerators, or black-box models. In this work, we study test-time model evolution under a strict two-forward budget, a setting that pushes adaptation toward highly efficient real-world deployment. We reveal three key obstacles in zeroth-order test-time optimization: susceptibility to shortcut solutions, uncontrolled weight drift, and ineffective update direction estimation. To overcome them, we propose EVA-0, a minimal zeroth-order adaptation framework that: 1) keeps the loss scale-invariant to prevent shortcut solutions; 2) devises an anchor-guided optimization strategy to alleviate weight drift; 3) uses sample-wise symmetric two-sided perturbation for update direction estimation and inference. EVA-0 requires no BP and performs both inference and adaptation within only two forward passes per sample. Results on ImageNet-C & ViT-Base show that EVA-0 outperforms both BP-based DeYO and BP-free FOA, while achieving a 14x speed-up over FOA. Code will be released.

2605.18865 2026-05-20 cs.LG cs.AI

From Sparsity to Simplicity: Enabling Simpler Sequential Replacements via Sparse Attention Distillation

从稀疏到简单:通过稀疏注意力蒸馏实现更简单的顺序替换

Yuxin Ren, Maxwell D Collins, Miao Hu, Huanrui Yang

发表机构 * University of Arizona(亚利桑那大学) TetraMem, Inc.(TetraMem公司)

AI总结 本文提出通过稀疏注意力蒸馏实现更简单的顺序替换,通过分析transformer层中的稀疏模式,发现可以将复杂的token依赖分解为不同复杂度的序列到序列映射,并用更简单的顺序模块替代部分层功能,从而减少参数量和延迟。

详情
AI中文摘要

自注意力机制是大规模transformer预训练的核心基础,但其二次token交互成本使得推理过程昂贵。用更简单的顺序模块替代注意力具有吸引力,但直接替换往往导致信息丢失,尤其是在大规模情况下。本文通过稀疏性的视角重新审视注意力替换。基于对transformer各层中稀疏模式的观察,我们提出预训练transformer将复杂的token依赖分解为多种复杂度的序列到序列映射,其中某些层的功能可以被近似并用更简单的顺序模块替代而不丢失信息。我们通过插拔式层间蒸馏框架验证这一前提,以近似和替代预训练视觉transformer模型中的注意力功能。在固定训练预算下,受控组的替换结果显示:替换稀疏注意力的层比替换密集注意力的层导致的准确率下降更小。我们进一步通过AViT风格的token保留对预训练的ViT施加显式的注意力稀疏性,并进行稀疏性引导的顺序替换模型蒸馏,其中我们发现增加教师模型的稀疏性会一致减少学生模型与教师模型之间的差距。所提出的方法通过注意力稀疏性的指导实现了更小的参数量和延迟的高效注意力替换。

英文摘要

Self-attention serves as the core foundation of large-scale transformer pretraining, but its quadratic token interaction cost makes inference expensive. Replacing attention with simpler sequential modules is appealing, yet naive substitution is often lossy, especially at larger scales. This paper revisits attention replacement through the lens of sparsity. Based on the observation of diverse sparsity patterns across transformer layers, we posit that pretrained transformers decompose the complex token dependency across tokens into various sequence-to-sequence mappings of diverse complexities, where some layer functionalities can be approximated and replaced with much simpler sequential modules without loss. We evaluate this premise using a plug-and-play layer-wise distillation framework to approximate and replace attention functionalities in pretrained vision transformer models. Controlled group-wise replacements under a fixed training budget reveal a clear pattern: substituting layers with sparser attention incurs substantially smaller accuracy drops than replacing denser ones. We further impose explicit attention sparsity on the pretrained ViT via AViT-style token retention and perform sparsity-guided distillation for sequential replacing models, where we see increasing teacher sparsity consistently reduces the student-teacher gap. The proposed method achieves efficient attention replacement for reduced parameter size and latency through the guidance of attention sparsity.

2605.18864 2026-05-20 cs.LG cs.AI cs.CL

SAGE: Shaping Anchors for Guided Exploration in RLVR of LLMs

SAGE: 通过塑造锚点引导LLMs的RLVR探索

Chanuk Lee, Minki Kang, Sung Ju Hwang

发表机构 * KAIST(韩国科学技术院)

AI总结 本文提出SAGE框架,通过重塑反KL锚分布来实现可控的经验支持扩展,从而在数学推理基准中提升pass@1和pass@k的表现。

Comments Preprint

详情
AI中文摘要

近期研究发现,可验证奖励的强化学习(RLVR)能够可靠地提高推理任务的pass@1指标,但往往在pass@k上未能取得类似提升,引发了关于RLVR是否真正使大语言模型获得新推理能力还是仅提高基础模型中现有推理模式采样效率的问题。先前分析大多支持后者观点,认为这种限制源于标准RLVR目标的结构特性,导致探索压力不足。在本文中,我们提出一个核心结构约束源于反KL正则化,该正则化稳定了训练但本质上将策略锚定于参考分布,从而抑制了替代推理模式的出现。然而,我们显示,去除KL项或用前向KL替代并不能提供满意的解决方案,因为两者都会通过诱导奖励黑客或将概率质量分配给非目标区域而破坏效率-覆盖权衡。为了解决这一矛盾,我们提出了SAGE,一个原理性的框架,通过引导函数q(x,y)重塑反KL锚分布本身,实现可控的经验支持扩展,从而在挑战性的数学推理基准中获得一致的pass@1和pass@k提升。我们的代码可在https://github.com/tally0818/SAGE上获得。

英文摘要

Recent studies observe that reinforcement learning with verifiable rewards (RLVR) reliably improves pass@1 on reasoning tasks, yet often fails to yield comparable gains in pass@k, raising the question of whether RLVR genuinely enables large language models to acquire novel reasoning abilities or merely enhances the efficiency of sampling reasoning modes already present in the base model. Prior analyses largely support the latter view, attributing this limitation to structural properties of standard RLVR objectives that result in insufficient exploration pressure. In this work, we argue that a central structural constraint arises from reverse-KL regularization, which stabilizes training but inherently anchors the policy to the reference distribution, thereby suppressing the emergence of alternative reasoning modes. However, we show that neither removing the KL term nor replacing it with forward-KL provides a satisfactory solution, as both disrupt the efficiency-coverage trade-off by either inducing reward hacking or allocating probability mass to off-target regions. To resolve this tension, we propose SAGE, a principled framework that enables controllable empirical support expansion by reshaping the reverse-KL anchor distribution itself through a guide function q(x,y), achieving consistent improvements in both pass@1 and pass@k across challenging mathematical reasoning benchmarks. Our code is available at https://github.com/tally0818/SAGE.

2605.18862 2026-05-20 cs.LG cs.AI cs.CR

Towards Family-Grouped Hierarchical Federated Learning on Sub-5KB Models: A Feasibility Study of Privacy-Preserving ECG Monitoring for Ultra-Resource-Constrained Wearables

面向子5KB模型的家庭分组分层联邦学习:隐私保护ECG监测在超低资源约束可穿戴设备上的可行性研究

Hangyu Wu

发表机构 * Shenzhen Coddie Technology co.,ltd(深圳科迪科技有限公司)

AI总结 本文提出家庭分组分层联邦学习(Family-FL)和轻量级Tiny CNN-LSTM架构,通过模拟评估在超低资源约束微控制器上实现隐私保护的联邦学习的可行性,展示了在MIT-BIH数据库上达到91.9%的准确率和76.7%的通信量减少。

Comments Supported by Shenzhen Coddie Technology Co., Ltd. This is a preprint and has not been peer-reviewed

详情
AI中文摘要

心血管疾病仍是全球导致死亡的主要原因,通过可穿戴设备持续ECG监测早期检测心律失常可以预防危及生命事件。联邦学习(FL)通过在设备上保留原始ECG数据实现隐私保护的协同训练,但标准FL导致通信开销过大,标准深度学习模型无法在超低功耗微控制器上运行。我们提出家庭分组分层联邦学习(Family-FL),一种三级架构,利用家庭作为隐私边界在家庭内聚合后再进行全局同步。我们进一步设计了一种硬件受限的Tiny CNN-LSTM架构,仅包含669个参数,INT8量化后仅占用4.65KB Flash和2.95KB RAM,满足STC32G12K128类微控制器的约束。在MIT-BIH心律失常数据库上的实验(5次独立运行的平均值)表明,Family-FL相比FedAvg减少了76.7%的通信量,同时保持了可比的准确性。Family-FL-Tiny在91.9±1.2%的准确率和宏F1为0.483±0.031的情况下,将总通信量减少到FedAvg的0.31%。该模型实现了可靠的室性心律失常检测(每类F1=0.80),这是家庭初步筛查中最临床关键的异常情况。这些结果通过基于模拟的评估证明了通过隐私保护联邦学习在超低资源约束微控制器上的技术可行性。我们诚实地讨论了局限性:无硬件部署、单数据集验证(MIT-BIH,47名受试者)、罕见类敏感性降低以及无正式差分隐私保证。

英文摘要

Cardiovascular disease remains the leading cause of death worldwide, and early detection of arrhythmias through continuous ECG monitoring on wearable devices can prevent life-threatening events. Federated Learning (FL) enables privacy-preserving collaborative training by keeping raw ECG data on device, yet standard FL incurs prohibitive communication overhead and standard deep learning models cannot fit on ultra-low-power microcontrollers. We propose Family-Grouped Hierarchical Federated Learning (Family-FL), a three-tier architecture that uses the family as a natural privacy boundary for intra-family aggregation before global synchronization. We further design a hardware-constrained Tiny CNN-LSTM architecture with only 669 parameters, INT8-quantized to occupy merely 4.65KB Flash and 2.95KB RAM, meeting the constraints of STC32G12K128-class microcontrollers. Experiments on the MIT-BIH Arrhythmia Database (mean of 5 independent runs with different seeds) demonstrate that Family-FL reduces communication volume by 76.7% compared to FedAvg while maintaining comparable accuracy. Family-FL-Tiny achieves 91.9 +/- 1.2% accuracy with macro-F1 of 0.483 +/- 0.031, reducing total communication to 0.31% of FedAvg. The model achieves reliable ventricular arrhythmia detection (per-class F1 = 0.80), the most clinically critical abnormality for home-based preliminary screening. These results demonstrate the technical feasibility of privacy-preserving federated learning on ultra-resource-constrained microcontrollers through simulation-based evaluation. We honestly discuss limitations: no hardware deployment, single-dataset validation (MIT-BIH, 47 subjects), reduced rare-class sensitivity, and absence of formal differential privacy guarantees.

2605.18858 2026-05-20 cs.LG cs.AI cs.GT stat.ML

When Individually Calibrated Models Become Collectively Miscalibrated

当个体校准的模型成为集体不校准的

Zhaohui Wang

发表机构 * USC Viterbi School of Engineering(南加州大学维特比工程学院)

AI总结 研究探讨了在多智能体环境中,即使每个模型都经过个体校准,聚合预测仍可能不校准的现象,提出通过VCG聚合方法解决这一问题,实现激励相容和近最优性能。

Comments 42 pages, 1 main figure, multiple tables. Accepted at ProbML 2026

详情
AI中文摘要

概率预测系统常常将多个模型的概率估计聚合为单一决策。一个常见假设是,如果每个模型都经过个体校准,聚合预测也将是良好的校准。我们展示了在多智能体设置中,这一假设不成立:当预测者战略性地相互作用时,即使没有刻意协调,个体校准的预测者也可能集体上不校准。这种现象自然出现在智能体在重叠数据上独立训练时。我们证明,在基于Brier分数的聚合中,当信念正相关时,每个智能体的个体最优报告系统地低估了正类概率,导致价格of anarchy大于一,只要协方差(b_i, b_j) > 0。在典型设置(n=5个智能体,成对相关性=0.5,基础率=0.3)中,经实测的PoA在假阴性率上达到7.25倍。相比之下,基于VCG的聚合通过奖励边际贡献对齐激励,实现主导策略激励相容性和近最优性能。在三个现实世界数据集(NSL-KDD、UNSW-NB15、信用卡欺诈)上的实验显示,VCG在保持可比准确性的同时表现出强鲁棒性。它在数据稀疏和对抗性设置中表现尤其出色,自适应加权进一步在分布偏移下提升了性能。

英文摘要

Probabilistic prediction systems often aggregate probability estimates from multiple models into a single decision. A common assumption is that if each model is individually calibrated, the aggregate prediction will also be well calibrated. We show that this assumption fails in multi-agent settings: individually calibrated predictors can become collectively miscalibrated when their predictions interact strategically, in the game-theoretic sense of Brier-optimal local response, even without deliberate coordination. This phenomenon arises naturally when agents are independently trained on overlapping data. We prove that under Brier-score-based aggregation with positively correlated beliefs, each agent's individually optimal report systematically underestimates the positive-class probability, yielding a Price of Anarchy greater than one whenever Cov(b_i, b_j) > 0. In a canonical setting (n = 5 agents, pairwise correlation = 0.5, base rate = 0.3), the empirically measured PoA in false-negative rate reaches 7.25x. In contrast, VCG-based aggregation aligns incentives by rewarding marginal contribution, achieving dominant-strategy incentive compatibility and near-optimal performance. Experiments on three real-world datasets (NSL-KDD, UNSW-NB15, Credit Card Fraud) show that VCG provides strong robustness while maintaining comparable accuracy. It performs particularly well in data-sparse and adversarial settings, and adaptive weighting further improves performance under distribution shift.

2605.18855 2026-05-20 cs.LG cs.CV

Delta Attention Residuals

Delta Attention Residuals

Cheng Luo, Zefan Cai, Junjie Hu

发表机构 * Independent Researcher(独立研究者) University of Wisconsin–Madison(威斯康星大学麦迪逊分校)

AI总结 本文提出Delta Attention Residuals,通过在残差连接中引入对每个子层引入的变化(delta)进行注意力机制,解决了传统注意力残差中因累积隐藏状态冗余导致的路由崩溃问题,从而提升模型跨层选择信息的能力。

详情
AI中文摘要

Attention Residuals将标准加性残差连接替换为在前一层输出上学习的softmax注意力,实现了选择性的跨层路由。然而,标准Attention Residuals仍然在累积的隐藏状态上进行注意力计算,这些状态高度冗余。我们发现这种冗余导致在更深的层中出现路由崩溃:注意力权重变得低对比度且接近均匀(最大权重≈0.2),限制了模型在前一层中选择信息性状态的能力。这提出了一个关键但尚未深入研究的设计问题:在Attention Residuals中应路由何种层间表示?为回答这个问题,我们提出了Delta Attention Residuals,其在delta(每个子层引入的变化(v_i = h_{i+1} - h_i))上进行注意力计算,而非累积状态。Delta表示在结构上具有多样性,产生更高对比度的注意力分布(最大权重≈0.6),从而在层间实现更选择性和有效的路由。这一原则适用于单个子层和块粒度。在所有测试的规模(220M-7.6B)中,Delta Attention Residuals始终优于标准残差和Attention Residuals,验证困惑度提升1.7-8.2%。Delta Attention Residuals还允许通过标准微调将预训练检查点转换为Delta Attention Residuals。代码可在https://github.com/wdlctc/delta-attention-residuals-code获得。

英文摘要

Attention Residuals replace standard additive residual connections with learned softmax attention over previous layer outputs, enabling selective cross-layer routing. However, standard Attention Residuals still attend over cumulative hidden states in previous layers, which are highly redundant. We show that this redundancy leads to routing collapse in deeper layers: attention weights become low-contrast and closer to uniform (max weight ${\approx}$0.2), limiting the model's ability to select informative states in previous layers. This raises a key but underexplored design question: what layer-wise representations should be routed in Attention Residuals? To answer this question, we propose Delta Attention Residuals, which attend over deltas -- the change introduced by each sublayer ($\mathbf{v}_i = \mathbf{h}_{i+1} - \mathbf{h}_i$) -- instead of cumulative states. Delta representations are structurally diverse and yield higher-contrast attention distributions (max weight ${\approx}$0.6), enabling more selective and effective routing across layers. This principle applies at both per-sublayer and block granularity. Across all tested scales (220M--7.6B), Delta Attention Residuals consistently outperform both standard residuals and Attention Residuals, with 1.7--8.2\% validation perplexity gains. Delta Attention Residuals also enables converting pretrained checkpoints into Delta Attention Residuals via standard fine-tuning. Code is available at https://github.com/wdlctc/delta-attention-residuals-code.

2605.18854 2026-05-20 cs.LG

Evaluating Memory Condensation Strategies for Coding Agents in Data-Driven Scientific Discovery

评估用于数据驱动科学发现的编码代理的记忆压缩策略

Renuka Chintalapati, Sid Raskar, Anurag Acharya, Jared Willard, Patrick Emami, Sameera Horawalavithana

发表机构 * Pacific Northwest National Laboratory(太平洋西北国家实验室) National Laboratory of the Rockies(落基山国家实验室)

AI总结 本文评估了八种记忆压缩策略在数据驱动科学发现任务中的表现,发现没有压缩器显著提升假设质量,但基于LLM的压缩器会增加24-94%的token成本,而屏蔽工具调用输出可实现8.6%的净节省,且最佳压缩器因科学领域和任务长度而异。

详情
AI中文摘要

编码代理在长时间任务中积累大量上下文,但固定的上下文窗口迫使从业者在截断和任务失败之间做出选择。尽管已提出许多记忆压缩策略,从简单的滑动窗口到LLM生成的摘要,但缺乏系统性的比较来指导策略选择,尤其是在科学发现任务中。我们使用GPT-4o对六十个DiscoveryBench任务(涵盖六个科学领域,总计480次评估)评估了八种记忆压缩策略。我们发现,没有压缩器显著改变假设质量,而基于LLM的压缩器会增加24-94%的token成本,屏蔽工具调用输出可实现8.6%的净节省。我们还观察到,数据驱动科学发现的最佳压缩器因科学领域和任务长度而异。

英文摘要

Coding agents accumulate extensive context during long-running tasks, yet fixed context windows force practitioners to choose between truncation and task failure. While numerous memory condensation strategies have been proposed, from simple sliding windows to LLM-generated summaries, no systematic comparison exists to guide strategy selection, especially in scientific discovery tasks. We evaluate eight memory condensation strategies using GPT-4o on sixty DiscoveryBench tasks spanning six scientific domains (480 total evaluations). We find that no condenser significantly alters hypothesis quality, while LLM-based condensers increase token costs by 24-94 percent, and masking tool-call outputs achieves an 8.6 percent net savings. We also observe that the optimal condenser for data-driven scientific discovery varies by scientific domain and task length.

2605.18853 2026-05-20 cs.LG cs.CV cs.DC

INAR-VL: Input-Aware Routing for Edge-Cloud Vision-Language Inference

INAR-VL:面向边缘-云视觉-语言推断的输入感知路由

Ahmed Šabanović, Paul Joe Maliakel, Ivona Brandić

发表机构 * TU Wien(维也纳技术大学)

AI总结 本文提出INAR-VL,一种轻量级的边缘-云路由系统,用于多模态推断的两级部署。该系统通过轻量级的图像和文本复杂度信号指导路由和模型选择,在本地执行简单查询,将复杂查询卸载到云端,从而在延迟、能耗和准确性之间取得平衡。

Comments 8 pages, 3 figures

详情
AI中文摘要

边缘部署的视觉-语言模型(VLMs)面临延迟与准确性的权衡:云端执行提供高质量预测但会带来通信延迟和能耗,而仅边缘执行则速度更快但准确性较低,因为模型容量有限。这种权衡进一步受到图像质量和推理复杂度异质性的影响,使静态部署效果不佳。我们提出了INAR-VL,一种轻量级的边缘-云路由系统,用于两级部署中的多模态推断。INAR-VL在边缘和云端维护互补的VLMs,并利用轻量级的图像和文本复杂度信号指导路由和模型选择,执行简单查询本地化,当有利时将复杂查询卸载到云端。在视觉问答任务上的评估表明,INAR-VL将36%的请求执行在边缘,延迟降低24%,能耗降低26%,并保持97%的云端准确性。

英文摘要

Edge deployment of Vision-Language Models (VLMs) faces a tradeoff between latency and accuracy: cloud execution provides high-quality predictions but incurs communication delay and energy cost, while edge-only execution is faster but less accurate due to limited model capacity. This trade-off is further complicated by heterogeneity in image quality and reasoning complexity, making static placement suboptimal. We present INAR-VL, a lightweight edge-cloud routing system for multimodal inference in a two-tier deployment. INAR-VL maintains complementary VLMs across edge and cloud and uses lightweight image and text complexity signals to guide routing and model selection, executing simple queries locally while offloading complex ones when beneficial. Evaluation on visual question answering shows that INAR-VL executes 36% of requests on the edge, reduces latency by 24%, lowers energy by 26%, and preserves 97% of cloud-level accuracy.

2605.18852 2026-05-20 cs.LG cs.AI cs.CL

Robust Checkpoint Selection for Multimodal LLMs via Agentic Evaluation and Stability-Aware Ranking

通过代理评估和稳定性感知排名实现多模态大语言模型的鲁棒检查点选择

Qinwu Xu, Zhuoheng Li, Jessie Salas

发表机构 * Meta AI

AI总结 本文提出了一种多阶段框架,结合了精心挑选的现实世界数据、结构化的LLM判断和多阶段排名协议,以解决多模态大语言模型检查点选择中的鲁棒决策问题,强调数据质量(特别是OCR可读性)对评估有效性的重要性。

详情
AI中文摘要

多模态大语言模型(MLLMs)的检查点选择在性能差异微小且评估信号易受噪声影响时面临重大挑战。现有方法依赖静态基准或逐点评分,经常与实际应用场景不一致,并缺乏对不确定性的鲁棒估计,特别是在OCR密集场景中。在本文中,我们将检查点选择建模为在评估不确定性下的稳健决策问题。我们提出了一种多阶段框架,整合了精心挑选的现实世界数据、结构化的LLM判断和多阶段排名协议。评估系统通过逐点过滤、列表排名和成对比较进行逐步细化。为了提高可靠性,我们引入基于子采样的置信度估计和基于百分位数的评分公式,以捕捉分布特征并惩罚尾部失败。此外,我们证明数据质量,特别是OCR可读性,是评估有效性的重要决定因素。

英文摘要

Checkpoint selection for multimodal large language models (MLLMs) presents significant challenges when performance differentials are marginal and evaluation signals are prone to noise. Existing methodologies rely heavily on static benchmarks or pointwise scoring, which frequently misalign with in-the-wild usage and lack robust uncertainty estimation, particularly in OCR-heavy scenarios. In this work, we formulate checkpoint selection as a robust decision problem under evaluation uncertainty. We propose a multi-stage framework that integrates curated real-world data, structured LLM-based judgment, and multi-stage ranking protocols. The evaluation system orchestrates progressive refinement via pointwise filtering, listwise ranking, and pairwise comparison. To enhance reliability, we introduce subsampling-based confidence estimation and a percentile-based scoring formulation that captures distributional characteristics while penalizing tail failures. Furthermore, we demonstrate that data quality, specifically OCR readability, is a critical determinant of evaluation validity.

2605.18851 2026-05-20 cs.LG

STRIDE: Learnable Stepwise Language Feedback for LLM Reasoning

STRIDE: 用于LLM推理的可学习分步语言反馈

Junjie Zhang, Guozheng Ma, Shunyu Liu, Zetian Hu, Yongcheng Jing, Ting-En Lin, Yongbin Li, Dacheng Tao

发表机构 * Generative AI Lab, College of Computing and Data Science, Nanyang Technological University, Singapore(生成式人工智能实验室,计算与数据科学学院,南洋理工大学,新加坡) Tongyi Lab, Alibaba Group(通义实验室,阿里巴巴集团)

AI总结 本文提出STRIDE框架,通过可学习的分步语言反馈提升LLM推理能力,解决了传统方法在标注成本高、信息瓶颈等问题,实验显示其在多种推理基准上表现优异。

详情
AI中文摘要

最近强化学习(RL)的进步突显了其在激励大型语言模型(LLM)推理能力的潜力。然而,现有分步级方法面临标注成本高、领域覆盖有限的问题,而标量评分进一步引入信息瓶颈,无法提供足够的语义带宽来改进中间决策。替代的语言批评方法依赖于冻结或外部批评者,虽然提供更丰富的文本反馈,但缺乏持续政策改进所需的可扩展性。在本工作中,我们提出语言驱动的分步轨迹重定向(STRIDE),一种新颖的训练框架,将过程监督从标量奖励转移到可学习的分步语言反馈。具体来说,我们仅使用基于结果的奖励共同训练生成器和生成验证器,消除外部标注,通过联合对齐的验证器训练实现持续的政策改进。验证器的分步语言批评明确本地化并解释失败,使生成器能够在中间步骤将推理轨迹转向替代决策。轨迹重定向设计保证了即使在噪声或次优验证器反馈下也能实现无害的政策改进。在多样化的推理基准实验中,STRIDE显著优于最先进的基线,同时在零次通过率问题上取得突破,其中标量方法在消融研究中无法产生学习信号,证明了可学习分步语言反馈在增强LLM推理能力方面的有效性。

英文摘要

Recent advances in Reinforcement Learning (RL) have underscored its potential for incentivizing reasoning capabilities of Large Language Models (LLMs). However, existing step-level efforts suffer from costly annotations that limit domain coverage, while scalar scores further impose an information bottleneck, offering insufficient semantic bandwidth to improve intermediate decisions. Alternative language-critique approaches, which rely on frozen or external critics, provide richer textual feedback but lack the scalability needed for sustained policy improvement. In this work, we propose language-driven stepwise trajectory redirection, termed as STRIDE, a novel training framework that shifts process supervision from scalar rewards to learnable stepwise language feedback. Specifically, we co-train a generator and a generative verifier using only outcome-based rewards, eliminating external annotations, while delivering sustained policy improvement through jointly aligned verifier training. The verifier's stepwise language critiques explicitly localize and explain failures, enabling the generator to redirect reasoning trajectories at intermediate steps toward alternative decisions. The trajectory redirection design guarantees harmless policy improvement, even under noisy or suboptimal verifier feedback. Experiments on diverse reasoning benchmarks show that STRIDE significantly outperforms state-of-the-art baselines, as well as achieving breakthroughs on zero-pass-rate problems where scalar methods yield no learning signal in our ablation studies, demonstrating the effectiveness of learnable stepwise language feedback for enhancing LLM reasoning.

2605.18849 2026-05-20 cs.LG cs.AI

INSIGHTS: Demonstration-Based Summaries of Time Series Predictors

INSIGHTS: 时间序列预测器的基于演示的摘要

Bar Eini Porat, Rom Gutman, Uri Shalit, Ofra Amir

发表机构 * Technion Israel Institute of Technology(技术学院以色列理工学院) Tel-Aviv University(特拉维夫大学)

AI总结 本文提出INSIGHTS方法,一种模型无关、以用户为中心的方法,用于提供时间序列模型的全局解释。该方法通过生成样本摘要,平衡时间序列样本的重要性与多样性,为用户提供全面的模型行为概述。

详情
AI中文摘要

可解释性方法发展迅速,但时间序列模型的全局解释仍不完善,大多数方法集中在局部实例层面的解释上。我们介绍了INSIGHTS,一种模型无关、以用户为中心的方法,用于提供时间序列模型的全局解释。我们的方法在设计上优先考虑简单性、效率和透明性,确保利益相关者能够轻松采用其输出。尽管当前方法专注于局部解释,INSIGHTS生成样本摘要,提供模型行为的全面概述。它通过利用效用函数平衡时间序列样本的重要性与多样性,捕捉领域特定的时间序列行为特征,如超过领域规范。我们通过实验、访谈和用户研究评估INSIGHTS。我们的结果表明,INSIGHTS能够构建全面、多样的时间序列子集,生成易于个体评估的摘要。它受到领域专家的青睐,因其能够提供模型行为的稳定理解以及识别的样本质量。此外,接受INSIGHTS摘要的用户研究参与者表现出对模型整体行为的更深入理解。

英文摘要

Explainability methods have progressed rapidly, but global explanations for time-series models remain underdeveloped, with most approaches focusing on local, instance-level attributions. We introduce INSIGHTS, a model-agnostic, user-centric approach for providing global explanations of time series models. Our approach prioritizes simplicity, efficiency, and transparency in its design, ensuring that stakeholders can readily adopt its outputs. While current methods focus on local explanations, INSIGHTS generates sample summaries that offer a comprehensive overview of model behavior. It balances the importance and diversity of time series samples to create informative subsets using utility functions that capture domain-specific aspects of time series behavior, such as exceeding domain norms. We evaluate INSIGHTS through experiments, interviews, and a user study. Our results indicate INSIGHTS effectively constructs comprehensive, diverse time series subsets, producing summaries manageable for individual evaluation. It is preferred by domain experts for its ability to provide a stable understanding of model behavior and the quality of the samples identified. Moreover, user study participants presented with INSIGHTS-based summaries exhibit an enhanced understanding of the model's overall behavior.

2605.18847 2026-05-20 cs.LG cs.AI

Transformers Linearly Represent Highly Structured World Models

Transformer 通过线性方式表示高度结构化的世界模型

Roman Kniazev, Nathanaël Fijalkow

发表机构 * LaBRI, CNRS University of Bordeaux(LaBRI、CNRS 波尔多大学)

AI总结 研究探讨了Transformer在训练过程中是否能构建任务的内部模型,并发现其内部表示结构与领域结构相匹配,通过Sudoku求解轨迹训练的Transformer展示了其内部计算机制和稀疏可解释的决策电路。

详情
AI中文摘要

当Transformer被训练于顺序推理轨迹时,它们是否会构建底层任务的内部模型?如果是的话,这些内部表示的结构是否与领域结构相匹配?我们训练了一个8层的Transformer模型来解决数独问题,并对其内部计算进行了机理分析。我们得出两个结论。第一,该模型构建了一个子结构世界模型:它不按人分析员所期望的那样逐个单元格表示棋盘状态,而是围绕数独约束所作用的行、列和盒子来组织信息。第二,我们识别出一个裸单电路:在最终的MLP层中,一组专用神经元,每个神经元单独检测特定单元格中恰好只剩一个可能的数字,并可靠地促进该数字。这些发现表明,涌现世界模型的几何结构由领域约束代数决定,而非其表面表现,且所得到的决策电路是稀疏的、单义的且完全可解释的。更广泛地说,这些发现展示了机理可解释性工具能够恢复Transformer如何解决组合推理任务的端到端算法账户。

英文摘要

Do transformers, when trained on sequential reasoning traces, build internal models of the underlying task? And if so, does the structure of those internal representations mirror the structure of the domain? We train an 8-layer transformer on Sudoku solving traces and perform a mechanistic analysis of its internal computation. We establish two results. First, the model builds a substructure world model: it does not represent the board state cell by cell, as a human analyst would expect, but organizes information around the rows, columns, and boxes that Sudoku's constraints act on. Second, we identify a naked-single circuit: a small set of dedicated neurons in the final MLP layer, each individually detecting when exactly one digit remains possible for a specific cell, and reliably promoting that digit. These findings show that the geometry of an emergent world model is shaped by the constraint algebra of the domain, not its surface presentation, and that the resulting decision circuit is sparse, monosemantic, and fully interpretable. More broadly, they demonstrate that mechanistic interpretability tools can recover an end-to-end algorithmic account of how a transformer solves a combinatorial reasoning task.

2605.18846 2026-05-20 cs.LG cs.AI cs.IT math.IT

Lost and Found in Translation: Variational Diagnostics for Neural Codebook Channels

译失与找回:变分诊断用于神经码本信道

Yusuke Hayashi

发表机构 * Artificial Life Institute(人工生命研究所) AI Alignment Network(人工智能对齐网络) Humanity Brain(人类大脑)

AI总结 该研究提出了一种变分诊断方法,用于评估神经码本信道中解码器对编码器码本的读取情况,解决了传统VAE诊断无法判断解码器是否正确读取编码器码本的问题。

Comments 9 pages, 2 figures

详情
AI中文摘要

经典通信系统不仅因随机噪声失效,还当发射端和接收端使用不兼容的操作码本时也会失效。变分自编码器(VAEs)联合训练编码器$ q_ϕ $和解码器$ p_θ $,并将其潜在空间视为离散码用于聚类、条件生成和机制可解释性。然而,标准VAE诊断——ELBO、主动单元、互信息和码本直方图——只能验证该码是否被使用,而不能验证解码器是否在编码器的码下读取每个潜在变量。我们通过神经码本信道$ K_{e o d}(j\mid i) $,一种耦合的编码器-解码器诊断方法,填补了这一差距。该信道的非对角线质量由架构无关的伯努利-KL证书$ d_{\mathrm{bin}}(1-\mathcal{A} \,\|\, arη_p) \le arΔ $控制,该证书是经典KL链式法则在离散化到编码器-解码器不一致事件下的操作专门化,补充了构造性的边缘不可能性结果:没有任何组合的边缘直方图、熵、主动码计数或互信息决定$ K_{e o d} $。我们对四个sklearn数据集(有限网格精确、5/5种子、20/20对满足边界)、二维模型(在$ 2.71 imes $观测到的不一致处非空虚)、MNIST在重要性采样控制下以及一个VQ-VAE达到预测极限$ \hat{\mathcal{A}}=1.000 $进行了证书审计。该包$ (K_{e o d}, \mathcal{A}, R_{\mathrm{eff}}, R, \mathrm{AU}) $是一个审计准备的报告单位。更广泛地说,该框架使不匹配解码——经典通信理论数十年前所命名的失败模式——在单个深度生成模型中可见。

英文摘要

Classical communication systems fail not only through random noise but also when transmitter and receiver use incompatible operational codebooks. Variational autoencoders (VAEs) train an encoder $q_ϕ$ and decoder $p_θ$ jointly, and practitioners treat the resulting latent space as a discrete code -- for clustering, conditional generation, and mechanistic interpretability. Yet standard VAE diagnostics -- ELBO, active units, mutual information, and code histograms -- certify only whether this code is used, never whether the decoder reads each latent under the encoder's code. We close this gap with the neural codebook channel $K_{e\to d}(j\mid i)$, a coupled encoder-decoder diagnostic whose off-diagonal mass is bounded by an architecture-free Bernoulli-KL certificate $d_{\mathrm{bin}}(1-\mathcal{A} \,\|\, \barη_p) \le \barΔ$ controlled by the variational gap. The certificate is the operational specialization of the classical KL chain rule under disintegration to the encoder-decoder disagreement event, complemented by a constructive marginal-impossibility result: no combination of marginal histograms, entropies, active-code counts, or mutual information determines $K_{e\to d}$. We audit the certificate on four sklearn datasets (finite-grid exact, 5/5 seeds, 20/20 pairs satisfy the bound), a 2D model where the bound is non-vacuous at $2.71\times$ the observed disagreement and the four-term identity closes within $10^{-4}$, MNIST under importance-sampling control, and a VQ-VAE attaining the predicted limit $\hat{\mathcal{A}}=1.000$. The package $(K_{e\to d}, \mathcal{A}, R_{\mathrm{eff}}, R, \mathrm{AU})$ is an audit-ready reporting unit. More broadly, the framework makes mismatched decoding -- a failure mode classical communication theory named decades ago -- visible inside a single deep generative model.

2605.18845 2026-05-20 cs.LG cs.AI

First-Passage Prediction of Grokking Delay: ACalibrated Law under AdamW with Causal Validation

Grokking延迟的首次通过预测:AdamW下的校准定律与因果验证

Truong Xuan Khanh, Truong Quynh Hoa, Luu Duc Trung, Phan Thanh Duc

发表机构 * H&K Research Studio(H&K研究室) Clevix LLC(Clevix公司) Banking Academy of Vietnam(越南银行学院)

AI总结 本文提出了一种在AdamW优化器下预测grokking延迟的定量方法,通过推导闭合形式定律并结合因果验证,实现了对模型记忆延迟的准确预测。

Comments 51 pages, 7 figures, 6 tables. Preprint

详情
AI中文摘要

我们首次对AdamW下的grokking延迟进行了定量预测。将延迟视为首次通过时间,推导出闭合形式定律T_grok - T_mem = (1 / 2 kappa_LL eta lambda) log(V_mem / V_star),其中V_t = ||theta_t||^2是参数范数的平方,V_star是架构相关的阈值,kappa_LL吸收了AdamW对clean-SGD收缩率2 eta lambda的修正。在单个超参数单元上校准(kappa_LL, V_star)可对26个保留运行的grokking延迟进行预测,MAPE为17.7%(在41倍延迟范围内);该定律适用于MLP(MAPE 18.0%,N=34)但在跨任务扩展时退化为23.3%(N=46,43.5倍范围),其中存在结构残差,V_star / V_mem在架构内相对稳定(CV约为14%在1L变压器上)。首次通过V_t是必要但不充分的。定量分位数定理表明,正延迟需要同时满足范数分离V_mem > V_post和阈值alpha_star = arcsin(C / V_T_mem^(1/2))的角达性,其中C可从经验NTK特征图和验证-边距分位数中计算。在模数p=89上校准C可预测alpha_star = 47.2度(p=97时观测到47.8度,误差1.3%)作为先验跨单元预测。因果干预冻结范数或移除权重衰减在记忆化时消除grokking(0/6 vs. 3/3基线),使角位移保持在12度附近。kappa_LL是按架构经验测量而非从(beta_1, beta_2, epsilon)推导;同一架构内CV最大为15%(四个架构内),但不同架构变体之间的值差异约为2倍。经验范围是AdamW下的算法任务(模运算,稀疏奇偶性);该定律是否适用于自然语言模型尚不明确。

英文摘要

We give the first quantitative prediction of grokking delay under AdamW. Treating the delay as a first-passage time, we derive a closed-form law T_grok - T_mem = (1 / 2 kappa_LL eta lambda) log(V_mem / V_star), where V_t = ||theta_t||^2 is the squared parameter norm, V_star is an architecture-dependent threshold, and kappa_LL absorbs the AdamW correction to the clean-SGD contraction rate 2 eta lambda. Calibrating (kappa_LL, V_star) on a single hyperparameter cell predicts grokking delays on 26 held-out runs with MAPE 17.7% over a 41x delay range; the law generalises to MLPs (MAPE 18.0%, N=34) and degrades to 23.3% on cross-task extension (N=46, 43.5x range), with a structured residual in which V_star / V_mem stays comparatively stable within architecture (CV about 14% on the 1L transformer). First-passage of V_t is necessary but not sufficient. A quantile-margin theorem establishes that positive delay requires both norm separation V_mem > V_post and angular reachability of a threshold alpha_star = arcsin(C / V_T_mem^(1/2)), where C is computable from the empirical NTK feature map and the validation-margin quantile. Calibrating C on modulus p=89 predicts alpha_star = 47.2 degrees at p=97 (observed 47.8 degrees, error 1.3%) as a prior cross-cell prediction. Causal interventions that freeze the norm or remove weight decay at memorisation eliminate grokking (0/6 vs. 3/3 baseline), trapping the angular displacement near 12 degrees. kappa_LL is empirically measured per architecture rather than derived from (beta_1, beta_2, epsilon); within-architecture CV stays at most 15% across four architectures, but values differ by about 2x between architectural variants beyond depth alone. Empirical scope is algorithmic tasks (modular arithmetic, sparse parity) under AdamW; whether the law transfers to natural-language scale models is open.

2605.18844 2026-05-20 cs.LG cs.AI

Graph-Driven Cross-Industry Real-Time Monitoring Framework for Anti-Money Laundering Detection in Converged Mobility-Energy Supply Chain Networks

基于图的跨行业实时监控框架用于反洗钱检测在融合的移动-能源供应链网络

Rong Liu, Xiaojun Xiao, Zhanqing Su

发表机构 * School of Public Policy, University of Southern California(南加州大学公共政策学院) Boston University(波士顿大学) Johns Hopkins University(约翰霍普金斯大学)

AI总结 本文提出了一种基于图的跨行业实时反洗钱监控框架(GCRMF),用于整合的旅行-能源供应链网络,通过构建跨行业异构图并结合双图注意力网络,动态编码资本流动路径和时间演变特征,以提高跨行业洗钱行为的识别能力,并通过自监督在线学习机制实现实时适应和持续优化。

详情
AI中文摘要

随着旅行和能源行业的深度整合,跨行业供应链金融逐渐成为隐藏洗钱事件的高风险领域。为此,本文提出了一种基于图的跨行业实时反洗钱监控框架(GCRMF)用于整合的旅行-能源供应链网络。首先,构建了一个涵盖新能源汽车租赁平台、能源供应商、金融科技机构等的跨行业异构图(CIHG),并通过临时双图注意力网络(Temporal Dual-Graph Attention Network)整合行业语义,动态编码资本流动路径和时间演变特征。随后,为识别由合谋主体共同产生的结构性欺诈行为,提出了一种基于对比学习和分层图采样的元路径子图推理模块,以增强跨行业反复洗钱行为的识别能力。同时,采用自监督在线学习机制实现实时适应和持续优化以应对新的洗钱策略。实验结果表明,与现有跨行业场景下的图神经网络方法相比,GCRMF在F1分数上提高了超过17.8%,并显著降低了误报率。

英文摘要

With the deep integration of the travel and energy industries, cross-industry supply chain finance has gradually become a high-risk field of hidden money laundering incidents. For this reason, this work proposes a graph-driven cross-industry real-time anti-money laundering monitoring framework (GCRMF) for integrated travel - energy supply chain networks. First, a cross-industry heterogeneous graph (CIHG) covering new energy vehicle rental platforms, energy suppliers, fintech institutions, etc., is constructed, and industry semantics are integrated through temporarily Dual-GAT (Temporal Dual-Graph Attention Network), dynamically encoding capital flow paths and evolution features over time. Subsequently, in order to identify the structural fraud behavior together produced by colluding subjects, a meta-path subgraph reasoning module based on contrastive learning and hierarchical graph sampling is proposed to enhance the discrimination capability of cross-industry recurring money laundering behavior. Meanwhile, a self-supervised online learning mechanism is adopted for real-time adaptation and continuous optimization to new money laundering strategies. The experimental results show that compared with existing graph neural network methods in cross-industry scenarios, GCRMF improves the performance by more than 17.8% of F1 score and greatly reduces the false positive rate.

2605.18843 2026-05-20 cs.LG

TEMPO: Temporal Enforcement via Mode-Separated Policy Optimization for Trustworthy LLM Backtesting

TEMPO: 通过模式分离策略优化实现可信大语言模型回测的时序执行

Zeyu Zhang, Bradly C. Stadie

发表机构 * Department of Statistic and Data Science(统计与数据科学系)

AI总结 本文提出TEMPO方法,通过模式分离策略优化,解决大语言模型回测中因泄露后截止日期知识导致的评估不准确问题,核心贡献是引入双模式奖励和基于GRPO的训练流程,有效减少知识泄露并提升任务性能。

Comments 9 pages in main context

详情
AI中文摘要

对大型语言模型进行历史事件回测需要仅基于截止日期之前可用的信息进行推理。然而,模型经常从预训练中泄露后截止日期的知识到推理过程中,导致看似准确度提高但破坏评估的有效性。基于提示的约束在被抑制内容与预测有因果关系时失效,而知识卸载无法解决此问题,因为时间合规性是实例特定的:同一事实可能对一个截止日期是合法证据,对另一个截止日期则为违规。而不是删除知识,模型必须学习时间纪律:选择受每个实例截止日期条件的证据。我们提出TEMPO(通过模式分离策略优化实现时序执行),通过两个贡献训练这种纪律:(1)一个双模式奖励,其中泄漏模式将后截止日期的主张驱动至零作为硬性前提,然后性能模式优化任务性能;(2)基于GRPO的训练流程,使模型能够发现时间有效的推理策略。我们证明训练单调减少泄露,收敛到无泄露最优解,并在合规后提升任务性能。在三个预测任务和两个模型上,TEMPO将泄露率从2~13%降至0.6~3.7%,在强预截止信号存在时任务性能提升6~13%,在预测任务本身困难时维持稳定。

英文摘要

Backtesting large language models on historical events requires reasoning exclusively from information available before a specified cutoff date. Yet models routinely leak post-cutoff knowledge from pre-training into their reasoning, inflating apparent accuracy and undermining evaluation validity. Prompt-based constraints fail when suppressed content is causally related to the prediction, and knowledge unlearning cannot address this problem because temporal compliance is instance-specific: the same fact may be legitimate evidence for one cutoff date and a violation for another. Rather than erasing knowledge, the model must learn temporal discipline: selecting evidence conditioned on each instance's cutoff date. We propose TEMPO (Temporal Enforcement via Mode-separated Policy Optimization), which trains this discipline via two contributions: (1) a two-mode reward where a leakage mode drives post-cutoff claims to zero as a hard prerequisite before a performance mode optimizes task performance; and (2) a GRPO-based training pipeline that enables the model to discover temporally valid reasoning strategies. We prove that training monotonically decreases leakage, converges to the leak-free optimum, and improves task performance once compliance is achieved. On three prediction tasks and two models, TEMPO reduces leakage from 2~13% to 0.6~3.7% across all conditions, with task performance improving 6~13% where strong pre-cutoff signals exist and maintained where the prediction task is inherently difficult from valid information alone.

2605.18842 2026-05-20 cs.LG

Safe Continual Reinforcement Learning under Nonstationarity via Adaptive Safety Constraints

在非平稳环境下通过自适应安全约束实现安全的持续强化学习

Timofey Tomashevskiy

发表机构 * McMaster Centre for Software Certification(麦斯特软件认证中心) Department of Computing and Software(计算与软件系) McMaster University(麦斯特大学)

AI总结 本文提出了一种结合三种自适应安全机制的框架,用于在非平稳环境下实现安全的持续强化学习,通过自适应约束机制减少分布偏移下的安全违规,同时保持任务性能。

Comments Preprint version

详情
AI中文摘要

在非平稳环境中进行安全强化学习需要能够适应环境变化的安全部件。标准的安全强化学习方法通常假设固定约束或稳定的环境条件,这在分布偏移下可能不足。我们提出了LILAC+,一个用于非平稳环境下安全持续强化学习的框架,结合了三种自适应安全机制:基于上下文的安全约束、适应速度约束和预算到状态的安全执行。基于上下文的约束通过推断和预测的环境上下文调整安全要求。适应速度约束在环境变化速率超过智能体安全适应能力时收紧安全要求。预算到状态执行将累积安全要求转换为本地状态级控制约束,可在决策时执行。这些机制共同提供了一种统一的方法,用于持续强化学习中的主动和反应性安全适应。我们在模拟驾驶环境中评估了该框架,在平稳、已见非平稳和未见非平稳条件下进行测试。结果表明,自适应安全约束在分布偏移下显著减少了安全违规,同时在与无约束和固定约束基线相比时保持了具有竞争力的任务性能。这些发现表明,安全的持续强化学习需要能够响应当前状态信息、预测的环境上下文、适应需求和剩余安全预算的自适应约束机制。

英文摘要

Safe reinforcement learning in nonstationary environments requires safety mechanisms that adapt as environmental conditions change. Standard safe reinforcement learning methods often assume fixed constraints or stable environmental conditions, which can become inadequate under distribution shift. We propose LILAC+, a framework for safe continual reinforcement learning under nonstationarity that combines three adaptive safety mechanisms: context-based safety constraints, adaptation-speed constraints, and budget-to-state safety enforcement. Context-based constraints adjust safety requirements using inferred and predicted environmental context. Adaptation-speed constraints tighten safety requirements when the rate of environmental change exceeds the agent's ability to adapt safely. Budget-to-state enforcement converts cumulative safety requirements into local state-level control constraints that can be enforced at decision time. Together, these mechanisms provide a unified approach for proactive and reactive safety adaptation in continual reinforcement learning. We evaluate the framework in simulated driving environments under stationary, seen nonstationary, and unseen nonstationary conditions. The results show that adaptive safety constraints substantially reduce safety violations under distribution shift while maintaining competitive task performance compared with unconstrained and fixed-constraint baselines. These findings suggest that safe continual reinforcement learning requires adaptive constraint mechanisms that respond not only to current state information but also to predicted environmental context, adaptation demand, and remaining safety budget.