arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.10089 2026-06-10 cs.LG cs.AI 新提交

A Theory on Flow Matching with Neural Networks

基于神经网络的流匹配理论

Yihan He, Qishuo Yin, Yuan Cao, Jianqing Fan, Han Liu

AI总结本文为神经网络参数化的条件速度场流匹配建立了理论基础，证明了过参数化两层ReLU网络中梯度下降的收敛性，推导了条件速度场匹配目标的泛化界，并提供了生成样本的Wasserstein距离保证。

2606.10088 2026-06-10 cs.CV 新提交

Interpretable Temporal Facial-Region Motion Analysis for In-the-Wild Parkinson's Disease Video Classification

可解释的时序面部区域运动分析用于野外帕金森病视频分类

Riyadh Almushrafy

AI总结提出基于面部区域关键点的时序运动描述符，在YouTubePD基准上实现轻量级且可解释的PD视频分类，平衡准确率达0.826。

Comments 22 pages, 6 figures. Submitted to Biomedical Signal Processing and Control

详情

AI中文摘要

面部表情减少是帕金森病（PD）常见的运动表现，通常描述为面部运动减退或面部运动迟缓。本文研究从面部区域关键点提取的时序运动描述符是否能够支持野外PD相关视频分类，并在YouTubePD基准上进行评估。每个视频使用来自14个预定义面部区域的几何描述符表示。在相同的二分类协议下，比较了静态几何、归一化几何、基于速度的描述符、相对速度描述符以及GRU序列基线。为了评估稳定性和可解释性，研究包括种子鲁棒性分析、区域级消融和排列重要性。最佳结果使用归一化速度描述符和随机森林分类器获得，在保留测试集上达到平衡准确率0.826和AUROC 0.855。在10个随机种子下，该表示保持稳定，平衡准确率为0.810 ± 0.018，AUROC为0.855 ± 0.005。总体而言，结果表明归一化的面部区域运动是YouTubePD视频分类的一种轻量级且可解释的表示。该研究作为基准级分析，不声称临床严重程度评估或MDS-UPDRS面部表情评分。

英文摘要

Reduced facial expressivity is a common motor manifestation of Parkinson's disease (PD), often described as hypomimia or facial bradykinesia. This paper examines whether temporal motion descriptors extracted from facial-region keypoints can support in-the-wild PD-related video classification on the YouTubePD benchmark. Each video is represented using geometric descriptors from 14 predefined facial regions. Static geometry, normalized geometry, velocity-based descriptors, relative-velocity descriptors, and a GRU sequence baseline are compared under the same binary classification protocol. To assess stability and interpretability, the study includes seed-robustness analysis, region-level ablation, and permutation importance. The best result is obtained with normalized velocity descriptors and a Random Forest classifier, reaching a balanced accuracy of 0.826 and an AUROC of 0.855 on the held-out test split. Across 10 random seeds, this representation remains stable, with balanced accuracy of 0.810 +/- 0.018 and AUROC of 0.855 +/- 0.005. Overall, the results suggest that normalized facial-region motion is a lightweight and interpretable representation for YouTubePD video classification. The study is framed as a benchmark-level analysis and does not claim clinical severity assessment or MDS-UPDRS facial-expression scoring.

URL PDF HTML ☆

赞 0 踩 0

2606.10087 2026-06-10 cs.CL cs.LG 新提交

CodeAlchemy: Synthetic Code Rewriting at Scale

CodeAlchemy：大规模合成代码重写

Ankit Gupta, Aditya Prasad, Rameswar Panda

AI总结提出CodeAlchemy框架，通过5种策略生成超过500B token的合成代码数据，引入DevEval和TraceEval基准，3B模型在多项任务上超越10倍大小的前沿模型。

详情

AI中文摘要

在原始代码上预训练可以学习语法，但为多样化的真实世界任务格式提供的信号稀疏。虽然合成数据已被证明对语言模型具有变革性，但代码领域除有限的质量改进外仍基本未被探索。我们提出CodeAlchemy，一个合成数据生成框架，通过5种策略将公开来源的代码转换为语义丰富的训练数据：CodeEnhance（质量感知重写）、CodeQA（基于模板的问题）、CodeDev（开发者任务）、CodeDialogue（多轮对话）和CodeTrace（执行轨迹）。我们处理了15种语言的3个语料库，生成了超过500B token的合成数据以及350B推理token，数量级远超先前工作。CodeTrace对14种语言和5K个库的1.3M+文件进行插桩和执行，捕获控制流、状态跟踪和库知识。我们引入了DevEval（开发者任务）和TraceEval（执行预测）基准；前沿模型如Claude Sonnet 4.5在TraceEval上仅达到5.6%的精确匹配，揭示了语义理解的关键差距。我们的3B模型在HumanEval上达到83.5%，在MBPP上达到63.2%，在DevEval上达到8.09%的胜率，在TraceEval上达到15.36 ROUGE-2，超越了包括27B Gemma-3和32B Granite-4.0在内的10倍大小的前沿模型。

英文摘要

Pre-training on raw code teaches syntax but provides sparse signal for diverse real-world task formats. While synthetic data has proven transformative for language models, code remains largely unexplored beyond limited quality improvements. We present CodeAlchemy, a synthetic data generation framework that transforms publicly sourced code into semantically-rich training data through 5 strategies: CodeEnhance (quality-aware rewriting), CodeQA (template-based problems), CodeDev (developer tasks), CodeDialogue (multi-turn conversations), and CodeTrace (execution traces). We process 3 corpora across 15 languages to generate 500B+ tokens of synthetic data plus 350B reasoning tokens, orders of magnitude more than prior efforts. CodeTrace instruments and executes 1.3M+ files across 14 languages and 5K libraries, capturing control flow, state tracking, and library knowledge. We introduce DevEval (developer tasks) and TraceEval (execution prediction) benchmarks; frontier models like Claude Sonnet 4.5 achieve only 5.6% exact match on TraceEval, revealing critical gaps in semantic understanding. Our 3B models achieve 83.5% on HumanEval, 63.2% on MBPP, 8.09% win rate on DevEval, and 15.36 ROUGE-2 on TraceEval, outperforming frontier models 10x the size including 27B Gemma-3 and 32B Granite-4.0.

URL PDF HTML ☆

赞 0 踩 0

2606.10086 2026-06-10 cs.AI 新提交

Exploratory Responsiveness and Adaptive Rigidity under AI-Assisted Optimization

AI辅助优化下的探索响应性与适应性刚性

Balaraju Battu

AI总结本文提出AI辅助优化下的探索适应理论，通过动态框架分析预测辅助如何影响系统探索响应性，揭示收敛预测机制导致适应性降低、刚性增强，而探索增强机制则促进适应性。

详情

AI中文摘要

本文发展了AI辅助优化下的探索适应理论。核心论点是，AI系统的长期适应效应关键取决于预测辅助如何与探索响应性本身相互作用。我们使用一个动态框架形式化这一机制，其中认知、制度和技术系统在由多个局部强化配置构成的崎岖认知景观上演化。模型中的一个核心状态变量是适应响应性，它衡量系统在不断变化的条件下穿越不熟悉的概念和制度轨迹的能力。在收敛预测机制下，AI系统替代探索参与，降低适应响应性，并产生亚稳态陷阱、滞后、过早收敛和探索崩溃动力学，使系统局部高效但全局刚性。该框架还识别出对比的探索增强机制，其中AI系统放大探索搜索、概念穿越和适应流动性。因此，有效替代参数是响应性依赖的：拥有弱探索例程的系统更容易受到探索替代，而已经拥有高适应响应性的系统可能利用AI辅助在崎岖景观上扩展探索流动性。因此，AI的长期适应效应不仅取决于AI能力本身，还取决于制度结构、发展背景和人机交互架构。

英文摘要

This paper develops a theory of exploratory adaptation under AI-assisted optimization. The central argument is that the long-run adaptive effects of AI systems depend critically on how predictive assistance interacts with exploratory responsiveness itself. We formalize this mechanism using a dynamical framework in which cognitive, institutional, and technological systems evolve over rugged epistemic landscapes characterized by multiple locally reinforced configurations. A central state variable in the model is adaptive responsiveness, which measures the capacity of a system to traverse unfamiliar conceptual and institutional trajectories under changing conditions. Under convergent predictive regimes, AI systems substitute for exploratory engagement, reducing adaptive responsiveness and generating metastable trapping, hysteresis, premature convergence, and exploration-collapse dynamics in which systems become locally efficient but globally rigid. The framework also identifies contrasting exploration-enhancing regimes in which AI systems amplify exploratory search, conceptual traversal, and adaptive mobility. The effective substitution parameter is therefore responsiveness-dependent: systems possessing weak exploratory routines are more vulnerable to exploratory substitution, whereas systems already possessing high adaptive responsiveness may use AI assistance to expand exploratory mobility across rugged landscapes. The long-run adaptive effects of AI consequently depend not only on AI capability itself, but also on institutional structure, developmental context, and the architecture of human-machine interaction.

URL PDF HTML ☆

赞 0 踩 0

2606.10084 2026-06-10 cs.LG cs.AI 新提交

Divide-and-Conquer Modeling for the CTF-4-Science Lorenz Benchmark

CTF-4-Science Lorenz基准的分治建模策略

Shundong Li

AI总结提出分治建模策略，针对CTF-4-Science Lorenz基准的五个场景族分别设计模型，通过平滑去噪、NG-RC/NVAR预测、Lorenz过渡校正和参数前缀混合，以79.63分证明场景特定更新优于通用模型。

2606.10080 2026-06-10 cs.LG cs.AI q-bio.QM 新提交

VFUSE: Virulent Feature Understanding with Sparse autoEncoders

VFUSE: 基于稀疏自编码器的毒力特征理解

Michael Yu, Matthew L. Olson

AI总结提出VFUSE方法，通过训练稀疏自编码器（SAE）分析扩散-Transformer模型激活，识别蛋白质设计中的危险特征，实现可解释性提升而不牺牲性能。

详情

AI中文摘要

生成模型在蛋白质设计等领域取得了显著进展，但这种能力也使得危险蛋白质的生成变得不透明。在这项工作中，我们引入了VFUSE（基于稀疏自编码器的毒力特征理解），这是一种机制可解释性方法，通过在扩散-Transformer激活上训练SAE来审计蛋白质模型中的危险感知特征。我们将VFUSE应用于RoseTTAFold3和RFDiffusion3，这些是流行的开源蛋白质折叠和合成模型。我们发现，对于某些模块，线性探针在SAE潜在空间中的拟合效果显著优于原始模型表示，从而在不牺牲模型性能的情况下提高了可解释性。此外，我们识别出SAE中的单语义特征，这些特征仅在危险设计上激活，AUROC高达0.84（q < 10^{-13}）。据我们所知，这是首次在全原子扩散模型上训练SAE，也是首次对蛋白质设计模型进行特征级毒力审计，为安全且可解释的蛋白质设计铺平了道路。

英文摘要

Generative models have shown remarkable progress in a variety of domains such as protein design, but such power enables the opaque generation of hazardous proteins. In this work, we introduce VFUSE (Virulent Feature Understanding with Sparse autoEncoders), a mechanistic interpretability approach that trains SAEs on diffusion-transformer activations to audit protein models for hazard-aware features. We apply VFUSE to RoseTTAFold3 and RFDiffusion3, popular open-weight models for protein folding and synthesis. We find that for certain blocks, linear probes detect hazardous designs significantly better when fit in the SAE latent space over the original model's representations: improving interpretability without sacrificing model performance. Furthermore, we identify monosemantic features from the SAE that fire only on hazardous designs at up to AUROC $0.84$ ($q < 10^{-13}$). To our knowledge this is the first SAE trained on an all-atom diffusion model and the first feature-level virulence audit of a protein design model, paving the way towards safe and interpretable protein design.

URL PDF HTML ☆

赞 0 踩 0

2606.10071 2026-06-10 cs.LG cs.AI 新提交

Temporal Sheaf Neural Networks with Dynamic Orthogonal Transport

时序层神经网络与动态正交传输

Md Sadek Hossain Asif, Tanzila Khan, Md. Mosaddek Khan

AI总结提出时序层神经网络（TSNN），通过动态正交帧和局部坐标系间显式传输实现时序链接预测，在多种基准上超越现有方法，尤其适用于节点角色异质性强的图。

详情

AI中文摘要

我们引入了时序层神经网络（TSNN），这是一个时序链接预测框架，它为每个节点配备一个时变正交帧，并仅在局部坐标系之间进行显式传输后比较节点状态。与在共享全局嵌入空间中运行的现有连续时间图模型不同，TSNN通过动态局部帧建模节点特定且不断演化的交互语义。该模型通过高效的低秩Householder乘积参数化每个节点的帧，在帧更新下精确保留存储的隐藏状态，并使用几何残差解码器，该解码器基于传输距离锚定预测，同时学习残差校正。所有计算严格因果，仅使用事件前历史。我们证明了对称度归一化层拉普拉斯算子与对称归一化图拉普拉斯算子正交相似，而随机游走归一化形式在相应度度量下相似；TSNN使用的全激活、特征缩放扩散正是组合层Dirichlet能量上的度量梯度步，具有无度单调下降和非扩张保证。帧漂移仅线性扰动更新。在TGB v2链接预测和时序异质排行榜以及DGB基准套件上，TSNN在大多数基准上匹配或超越最强先前方法，在表现出强节点角色异质性的图上改进最大。消融实验证实了动态帧、正交传输和几何残差解码的独特优势。

英文摘要

We introduce Temporal Sheaf Neural Networks (TSNN), a temporal link prediction framework that equips each node with a time-varying orthogonal frame and compares node states only after explicit transport between local coordinate systems. In contrast to existing continuous-time graph models that operate in a shared global embedding space, TSNN models node-specific and evolving interaction semantics through dynamic local frames. The model parameterizes per-node frames via efficient low-rank Householder products, preserves stored hidden states exactly under frame updates, and uses a geometric-residual decoder that anchors predictions on transported distances while learning residual corrections. All computations are strictly causal and use only the pre-event history. We show that the symmetric degree-normalized sheaf Laplacian is orthogonally similar to the symmetric normalized graph Laplacian, with the random-walk normalized form similar in the corresponding degree metric; the full-active, feature-scaled diffusion used by TSNN is exactly a metric-gradient step on the combinatorial sheaf Dirichlet energy, with a degree-free monotone-descent and non-expansiveness guarantee. Frame drift perturbs updates only linearly. Across TGB v2 link-prediction and temporal-heterogeneous leaderboards, together with the DGB benchmark suite, TSNN matches or surpasses the strongest prior methods on most benchmarks, with the largest improvements on graphs exhibiting strong node-role heterogeneity. Ablations confirm the distinct benefit of dynamic frames, orthogonal transport, and geometric-residual decoding.

URL PDF HTML ☆

赞 0 踩 0

2606.10068 2026-06-10 cs.LG cs.AI 新提交

Importance-Aware Scheduling for High-Dimensional Hyperparameter Optimization

高维超参数优化的重要性感知调度

Ruinan Wang, Ian Nabney, Mohammad Golbabaee

AI总结提出GIF方法，通过小样本预热估计超参数重要性，按重要性分组并比例分配试验，保留全空间回退，在高维基准上优于TPE等方法，提升采样效率。

Comments 8 pages, 5 figures. Accepted to IJCNN 2026

详情

AI中文摘要

超参数优化（HPO）对于构建高性能的ML/DL模型至关重要，但传统优化器在高维空间中常常难以应对，其中评估成本高昂且进展被许多低影响变量稀释。我们提出贪婪重要性优先（GIF），一种重要性感知的调度策略，使用小样本预热来估计超参数重要性，形成基于重要性的分组，按比例分配试验，并保留全空间回退。我们在五个各向异性解析函数、Bayesmark和NAS-Bench-301上，在固定评估预算下评估GIF。在高维基准上，GIF比TPE、BOHB、随机搜索和顺序分组更快地达到更好的当前最优解。在有效维度较小的Bayesmark上，GIF仍具有竞争力，但优势较小。消融研究表明，重要性估计、比例分配和回退步骤都有助于性能提升。我们还验证了HIA组件在解析基准上恢复了预期的各向异性。这些结果表明，GIF是一种简单且即插即用的方法，可提高高维HPO中的样本效率。

英文摘要

Hyperparameter Optimization (HPO) is essential for building high-performing ML/DL models, yet conventional optimizers often struggle in high-dimensional spaces where evaluations are costly and progress is diluted across many low-impact variables. We propose Greedy Importance First (GIF), an importance-aware scheduling strategy that uses a small-sample warm start to estimate hyperparameter importance, forms importance-based groups, allocates trials proportionally, and retains a full-space fallback. We evaluate GIF under fixed evaluation budgets on five anisotropic analytic functions, Bayesmark, and NAS-Bench-301. On the higher-dimensional benchmarks, GIF reaches better incumbents with faster convergence than TPE, BOHB, Random Search, and Sequential Grouping. On Bayesmark, where the effective dimensionality is smaller, GIF remains competitive but the margins are smaller. Ablation studies show that importance estimation, proportional allocation, and the fallback step all contribute to the gains. We also verify that the HIA component recovers the intended anisotropy on the analytic benchmarks. These results suggest that GIF is a simple and plug-compatible way to improve sample efficiency in high-dimensional HPO.

URL PDF HTML ☆

赞 0 踩 0

2606.10066 2026-06-10 cs.CV cs.AI cs.LG 新提交

A Controlled Audit of Pretraining Contamination in Public Medical Vision-Language Benchmarks

公共医学视觉语言基准中预训练污染的受控审计

Bruce Changlong Xu, Lan Wu, Alexander Ryu

AI总结审计发现公共医学VLM基准存在图像源重叠和文本规范顺序交换性信号，但确认的像素级重复罕见，且现有成员推理检测器在小规模医学VLM队列中不可靠。

Comments 30 pages, 7 figures, 9 tables. Preprint

详情

AI中文摘要

医学视觉语言模型（VLM）在公共基准上进行评估，这些基准的图像和问答对多年来一直可自由下载，但报告准确度假设这些示例在预训练中不存在。我们对SLAKE-En、PathVQA、VQA-RAD以及一个辅助的公共OmniMedVQA镜像上的开放VLM进行了审计，使用了四种检测器系列：图像侧近邻重叠（针对PMC-OA-beta）、规范顺序可交换性、队列相对Min-K%++尾部富集以及跨模型Top-K重叠。我们发现SLAKE-En上存在可测量的图像侧源重叠：SigLIP-B-16标记了19.8%的图像，SigLIP-SO400M标记了4.2%，而域外对照产生0/2000个标记。人工裁定显示，相同模态、相同投影的匹配对应不同患者，而非经过验证的像素级重复，因此我们将其解释为源或分布重叠，而非确认的每图像记忆。在文本侧，Qwen2.5-VL在SLAKE-En上显示出规范顺序可交换性信号，该信号在顺序消融和外部非医学基线中仍然存在。在OmniMedVQA镜像上，五个医学和通用VLM触发了可交换性，而BLIP-2保持干净。相比之下，队列相对Min-K%++尾部富集和跨模型Top-K重叠在外部预域基线中崩溃：BLIP-2重现了明显的正信号，尽管缺乏合理的医学VQA暴露。我们得出结论，这些队列相对检测器作为小规模医学VLM队列上的独立成员推理信号是不可靠的。

英文摘要

Medical vision-language models (VLMs) are evaluated on public benchmarks whose images and question-answer pairs have been freely downloadable for years, yet reported accuracy assumes these examples were absent from pretraining. We audit open VLMs on SLAKE-En, PathVQA, VQA-RAD, and an auxiliary public OmniMedVQA mirror using four detector families: image-side near-neighbour overlap against PMC-OA-beta, canonical-order exchangeability, cohort-relative Min-K%++ tail enrichment, and cross-model top-K overlap. We find measurable image-side source overlap on SLAKE-En: 19.8% of images are flagged under SigLIP-B-16 and 4.2% under SigLIP-SO400M, while out-of-domain controls produce 0/2000 flags. Manual adjudication shows same-modality, same-projection matches to different patients rather than verified pixel-level duplicates, so we interpret this as source or distributional overlap rather than confirmed per-image memorization. On the text side, Qwen2.5-VL on SLAKE-En shows a canonical-order exchangeability signal that survives ordering ablation and external non-medical baselines. On the OmniMedVQA mirror, exchangeability fires for five medical and general VLMs while BLIP-2 remains clean. In contrast, cohort-relative Min-K%++ tail enrichment and cross-model top-K overlap collapse under an external pre-domain baseline: BLIP-2 reproduces the apparent positive signals despite lacking plausible medical-VQA exposure. We conclude that these cohort-relative detectors are unreliable as standalone membership-inference signals on small medical-VLM cohorts.

URL PDF HTML ☆

赞 0 踩 0

2606.10064 2026-06-10 cs.LG cs.AI 新提交

Bittensor Agent Arenas as a Trajectory Primitive: Distilling a Shopping Agent from ShoppingBench Subnet Traces

Bittensor 智能体竞技场作为轨迹基元：从 ShoppingBench 子网轨迹中蒸馏购物智能体

Shardul Bansal, Seth Schilbe, Jarrod Barnes

AI总结针对小模型后训练缺乏多轮轨迹数据的问题，利用 Bittensor 子网 SN15 的竞技机制生成激励对齐的轨迹，通过结构质量过滤提取智能体轨迹，后训练 Qwen3-4B 模型在 ShoppingBench 上达到 42.7% ASR，接近合成数据基线。

Comments 10 pages, 4 figures, Data and Models available at: https://huggingface.co/collections/oro-ai/shoppingbench-sn15-trajectory-primitive

详情

AI中文摘要

小模型智能体后训练的瓶颈更多在于其消耗的轨迹基质而非算法。领先的方案（RLVR、群体相对 RL、拒绝采样重 SFT）都需要携带每轨迹监督的多轮轨迹，而现有的两个来源存在不足：前沿合成数据继承了合成器的偏见并坍缩了长尾，而未经过滤的生产日志未经评判且被捷径行为污染。我们认为可以设计一个激励对齐的智能体竞技场来制造此类轨迹，并在 ORO Subnet 15（SN15）上进行了演示，这是 ShoppingBench 智能体电商基准的 Bittensor 部署。SN15 的竞赛机制、LLM 推理评判器和旋转泄漏簇防护问题集产生了一个具有三个特性的语料库：激励对齐的多样性、每轨迹评判和反记忆的留出评估。我们引入了一个结构质量过滤器，通过保留智能体轨迹（模型自身发出工具调用）并拒绝子任务轨迹（模型仅在确定性搜索循环上进行分类或叙述），将原始数据流转换为可训练的语料库，然后使用与已发布的 ShoppingBench SFT-然后-GRPO 流程匹配的方案对 Qwen3-4B 进行后训练。在泄漏簇防护的留出分区上，以生产严格方式评分，模型从已发布的 Qwen3-4B 基线的 18.0% ASR 提升至 42.7%，与合成数据 SFT 仅基线（43.6%）在单问题噪声范围内，同时仅训练了子网单日输出的一小部分。监督堆栈留下了较大的 pass@8 到 pass@1 差距（53.3% 对比 34.8%）；每步教师基础的 Dr. GRPO 奖励将该空间转化为过程改进，我们确定子任务数据流是缩小与 48.7% SFT+GRPO 基线差距的主要杠杆。我们发布了过滤器、语料库分割和竞技场机制。

英文摘要

Small-model agentic post-training is bottlenecked less by the algorithm than by the trajectory substrate it consumes. Leading recipes (RLVR, group-relative RL, rejection-sampled re-SFT) all need multi-turn traces carrying per-trajectory supervision, and the two existing sources fall short: frontier-synthesised data inherits the synthesizer's biases and collapses the long tail, while unfiltered production logs are unjudged and contaminated by shortcut behaviour. We argue that an incentive-aligned agent arena can be engineered to manufacture such trajectories, and demonstrate this on ORO Subnet 15 (SN15), a Bittensor deployment of the ShoppingBench agentic-commerce benchmark. SN15's race mechanism, LLM reasoning judge, and rotating leak-cluster-guarded problem suite yield a corpus with three properties: incentive-aligned diversity, per-trajectory judging, and anti-memorised held-out evaluation. We introduce a structural-quality filter that converts the raw firehose into a trainable corpus by keeping agentic trajectories (the model itself emits the tool calls) and rejecting sub-task trajectories (the model only classifies or narrates over a deterministic search loop), then post-train Qwen3-4B with a recipe matched to the published ShoppingBench SFT-then-GRPO pipeline. On a leak-cluster-guarded held-out partition scored production-strict, the model lifts from the published Qwen3-4B base of 18.0% ASR to 42.7%, within single-problem noise of the synthetic-data SFT-only baseline (43.6%), while training on a fraction of a single day of subnet output. The supervised stack leaves a large pass@8 to pass@1 gap (53.3% vs 34.8%); a per-step teacher-grounded Dr. GRPO reward converts that headroom into process improvement, and we identify the sub-task firehose as the primary lever for closing the gap to the 48.7% SFT+GRPO bar. We release the filter, the corpus splits, and the arena mechanics.

URL PDF HTML ☆

赞 0 踩 0

2606.10062 2026-06-10 cs.AI cs.MA 新提交

Deployment-Time Memorization in Foundation-Model Agents

基础模型智能体中的部署时记忆

Lei, Chen, Guilin Zhang, Kai Zhao, Dalmo Cirne, Andy Olsen, Xu Chu, Zeke Miller, Alet Blanken, Amine Anoun, Jerry Ting

AI总结研究基础模型智能体在部署时记忆的设计选择如何影响个性化效用、提取风险和删除保真度，提出遗忘残差分数并揭示压缩与删除的权衡。

Comments 4 pages, ICML MemFM 2026 Workshop

详情

AI中文摘要

基础模型智能体正成为越来越长寿命的系统，它们跨交互记忆用户，使记忆成为明确的部署时功能，而不仅仅是模型权重的属性。现有工作处理参数化记忆或审计固定记忆配置，但没有描述记忆设计选择如何共同塑造个性化效用、提取风险和删除保真度。我们将这一表面研究为部署时记忆，将智能体记忆表述为通过个性化召回（PR）和对抗提取率（AER）测量的隐私-效用前沿，并扫描三个记忆设计旋钮：摘要攻击性、检索广度（k）和删除模式。我们进一步引入遗忘残差分数（FRS）来量化删除的信息是否仍可从派生记忆层中恢复。在LongMemEval上，关键事实摘要将Gemma 3 12B上的金丝雀提取减少76%，GPT-4o-mini上减少64%，同时几乎保留所有个性化召回；关键是，一旦内容被压缩掉，增加k不再恢复泄漏。然而，相同的压缩会导致删除保真度失败：仅原始删除使派生摘要副本在大约20%的实例中可恢复，只有全管道清除或墓碑修订才能使最差层残差为零。总之，这些结果确立了持久智能体记忆必须作为一级记忆机制进行评估——通过它帮助智能体回忆的内容、它使什么可提取以及它真正能擦除什么来评估。

英文摘要

Foundation-model agents are increasingly long-lived systems that remember users across interactions, making memorization an explicit deployment-time function rather than solely a property of model weights. Existing work addresses parametric memorization or audits fixed memory configurations, but does not characterize how memory-design choices jointly shape personalization utility, extraction risk, and deletion fidelity. We study this surface as deployment-time memorization, formulating agent memory as a privacy-utility frontier measured by Personalization Recall (PR) and Adversarial Extraction Rate (AER), and sweeping three memory-design knobs: summarization aggressiveness, retrieval breadth (k), and deletion mode. We further introduce the Forgetting Residue Score (FRS) to quantify whether deleted information remains recoverable from derived memory tiers. On LongMemEval, key-fact summarization reduces canary extraction by 76% on Gemma 3 12B and 64% on GPT-4o-mini while preserving nearly all personalization recall; critically, once content is compressed away, increasing k no longer restores leakage. The same compression, however, induces a deletion-fidelity failure: raw-only deletion leaves derived summary copies recoverable in approximately 20% of instances, and only full-pipeline purge or tombstone redaction drives worst-tier residue to zero. Together, these results establish that persistent agent memory must be evaluated as a first-class memorization mechanism -- assessed by what it helps agents recall, what it makes extractable, and what it can truly erase.

URL PDF HTML ☆

赞 0 踩 0

2606.10061 2026-06-10 cs.CL 新提交

BenSyc: Benchmarking Conversational Sycophancy and Human Alignment in LLMs for Bengali Contexts

BenSyc: 孟加拉语上下文中大语言模型对话谄媚与人类对齐的基准测试

Kazi Noshin, Sajib Acharjee Dip, Ranat Das Prangon, Fardin Hassan Tamim, Syed Ishtiaque Ahmed, Liqing Zhang, Sharifa Sultana

AI总结提出BenSyc基准，基于孟加拉语社交数据构建五级标注集，评估15+模型在对话对齐分类与生成任务上的表现，发现前沿模型在区分共情与强化性认可上仍存在困难。

详情

AI中文摘要

大型语言模型（LLMs）越来越多地参与情感敏感的社交对话，其回应可能从平衡支持转向过度认可或升级性对齐。现有的谄媚研究主要关注事实一致性和指令遵循设置，而文化背景下的对话谄媚尚未得到充分探索。我们引入了BenSyc，这是首个用于研究孟加拉语社交语境中对话谄媚的基准。从孟加拉国和西孟加拉邦社区收集的11,840条Reddit帖子和170k条评论出发，我们构建了一个人工验证的基准，包含二元标签和一个细粒度的五级分类体系，涵盖无效化、中立、支持、认可和升级。我们在对话对齐分类和响应生成任务上评估了超过15个开源和专有LLM。结果表明，即使对于前沿的指令调优模型，区分共情性支持与强化导向的认可仍然具有挑战性：最佳系统在二元检测上仅达到61.8 Macro-F1，在五类分类上达到61.7 Macro-F1。在生成设置中，多个模型在情感激烈的情境下频繁产生强烈认可或升级性回应。我们的发现凸显了不同模型家族和对话行为之间的显著差异，强调了文化背景下的多语言基准对于评估社交对齐的对话AI系统的重要性。

英文摘要

Large language models (LLMs) increasingly participate in emotionally sensitive social conversations, where responses may shift from balanced support toward excessive validation or escalatory alignment. Existing sycophancy research primarily focuses on factual agreement and instruction-following settings, leaving culturally grounded conversational sycophancy underexplored. We introduce BenSyc, the first benchmark for studying conversational sycophancy in Bengali social contexts. Starting from 11,840 Reddit posts and 170k comments collected from communities across Bangladesh and West Bengal, we construct a human-validated benchmark with binary labels and a fine-grained five-level taxonomy spanning Invalidation, Neutral, Support, Validation, and Escalation. We evaluate more than 15 open and proprietary LLMs on conversational alignment classification and response generation tasks. Results show that distinguishing empathetic support from reinforcement-oriented validation remains challenging even for frontier instruction-tuned models: the best system achieves only 61.8 Macro-F1 on binary detection and 61.7 Macro-F1 on five-class classification. In generation settings, several models frequently produce strongly validating or escalatory responses in emotionally charged situations. Our findings highlight substantial variation across model families and conversational behaviors, underscoring the importance of culturally grounded multilingual benchmarks for evaluating socially aligned conversational AI systems.

URL PDF HTML ☆

赞 0 踩 0

2606.10044 2026-06-10 cs.AI 新提交

Business World Model

商业世界模型

Cecil Pang, Hiroki Sayama

AI总结提出商业世界模型（BWM）架构，将世界模型思想应用于商业环境，通过编码状态、动态、约束和目标，支持自主决策与规划。

详情

AI中文摘要

企业越来越多地采用AI驱动的工具来提高生产力、降低成本并增强产品和服务。然而，AI的变革潜力不仅限于自动化预定义任务：它在于使智能系统能够从高层战略目标出发，规划、优化和执行商业计划。本文介绍了商业世界模型（BWM）的概念和架构，这是一种专门针对商业和组织环境的世界模型。受人工智能、认知科学和控制理论中的世界模型启发，BWM编码了商业状态、动态、约束、目标和可行的动作空间，以支持自主决策。我们提出了一种以商业语义为中心的公式，其中商业状态、动态和动作与关键商业实体相关联。在此框架内，智能体可以模拟替代动作序列，估计其对未来商业结果的影响，并在不确定性下评估权衡。所提出的架构将语义数据表示、概率机器学习模型、确定性业务规则和显式动作空间整合为一个用于规划和反事实推理的连贯结构。尽管其各个组成部分并非全新，但BWM的贡献在于将它们组织为用于商业计划的可执行内部模拟器。这项工作为能够从基于指令的执行转向目标驱动的规划和执行的自主商业系统奠定了概念基础。

英文摘要

Businesses are increasingly adopting AI-enabled tools to improve productivity, reduce costs, and enhance products and services. However, the transformative potential of AI extends beyond automating predefined tasks: it lies in enabling intelligent systems to plan, optimize, and execute business initiatives from high-level strategic objectives. This paper introduces the concept and architecture of a business world model (BWM), a world model specialized for business and organizational environments. Inspired by world models in artificial intelligence, cognitive science, and control theory, a BWM encodes business states, dynamics, constraints, objectives, and feasible action space to support autonomous decision-making. We propose a business-semantics-centric formulation in which business states, dynamics and actions are linked to key business entities. Within this framework, agents can simulate alternative action sequences, estimate their effects on future business outcomes, and evaluate trade-offs under uncertainty. The proposed architecture integrates semantic data representations, probabilistic machine learning models, deterministic business rules, and explicit action space into a coherent structure for planning and counterfactual reasoning. Although its individual components are not new, the contribution of BWM lies in organizing them as an executable internal simulator for business initiatives. This work establishes a conceptual foundation for autonomous business systems capable of moving from instruction-based execution toward goal-driven planning and execution.

URL PDF HTML ☆

赞 0 踩 0

2606.10039 2026-06-10 cs.RO 新提交

Robotic Nonprehensile Object Transportation with a Hanging Tray

使用悬挂托盘的机器人非抓取式物体运输

Adam Heins, Angela P. Schoellig

AI总结针对机器人服务员问题，提出使用绳索悬挂托盘实现三维摆运动，仅需3自由度移动基座即可减少滑动和泼洒，实验验证了有效性并集成到交互演示中。

Comments 8 pages, 11 figures. IEEE/ASME International Conference on Advanced Intelligent Mechatronics, 2026

详情

AI中文摘要

我们考虑称为服务员问题的非抓取式物体运输任务，其中机器人必须将平衡在托盘上的物体从一个位置移动到另一个位置。与先前关于机器人服务员问题的工作（使机器人倾斜由末端执行器刚性握持的托盘）不同，我们使用由绳索从末端执行器悬挂的托盘，使其行为类似于三维摆。一些先前的工作驱动机器人使末端执行器模拟摆的行为，因为摆运动减少了作用在运输物体上的剪切力，从而最小化刚性物体的滑动和液体容器中的泼洒。相比之下，我们使用真实的悬挂托盘，使得我们能够获得摆运动的益处，同时仅驱动3自由度移动基座，而不需要完整的6自由度机械臂。我们在仿真和真实硬件上的实验表明，与静态、刚性握持的托盘相比，悬挂托盘显著减少了滑动和泼洒。此外，我们将悬挂托盘集成到交互式机器人服务员演示中，该演示使用计算机视觉识别举手的人，并通过视觉伺服引导机器人朝向它们，使它们能够接触托盘。

英文摘要

We consider the nonprehensile object transportation task known as the waiter's problem, in which a robot must move an object balanced on a tray from one location to another. In contrast to prior works on the robotic waiter's problem, which make the robot tilt a tray rigidly held by its end effector (EE), we use a tray suspended from the EE by ropes, such that it behaves like a three-dimensional pendulum. Some prior works have actuated the robot so that the EE simulates the behavior of a pendulum, because pendular motion reduces the shear forces acting on the transported objects, minimizing the sliding of rigid objects and sloshing in containers of liquid. In contrast, our use of a real hanging tray allows us to obtain the benefits of pendular motion while only actuating a 3 degree-of-freedom (DOF) mobile base, rather than requiring a full 6-DOF manipulator arm. Our experiments in simulation and on real hardware show that the hanging tray substantially reduces both sliding and sloshing compared to a static, rigidly-grasped tray. Furthermore, we integrate the hanging tray into an interactive robot waiter demonstration, which uses computer vision to identify people with a raised hand and visual servoing to steer toward them and allow them to access the tray.

URL PDF HTML ☆

赞 0 踩 0

2606.10029 2026-06-10 cs.LG cs.AI cs.CL 新提交

Interpreting and Steering a Text-to-Speech Language Model with Sparse Autoencoders

用稀疏自编码器解释和引导文本转语音语言模型

Nikita Koriagin, Georgii Aparin, Nikita Balagansky, Daniil Gavrilov

AI总结本文在CosyVoice3语言模型骨干上训练BatchTopK稀疏自编码器，发现特征可解释且因果可控，能操纵笑声、性别和语速。

详情

AI中文摘要

语言模型日益成为文本转语音（TTS）系统的骨干，但我们对其在文本和生成语音令牌共享单一残差流时构建的表示知之甚少。我们在CosyVoice3的语言模型骨干上训练BatchTopK稀疏自编码器，并引入一种模态感知的自动解释流水线，根据特征触发的位置——文本前缀上下文、1秒语音片段或两者——为每个特征打标签。恢复的特征是可解释的，涵盖音素、笑声、口音提示和说话者性别。通过SAE潜在空间进行引导表明，这些特征是因果性的而非仅仅是描述性的：有针对性的干预将笑声概率从0.02提高到0.79，翻转感知到的说话者性别，并在保持口语内容的同时控制语速。因此，SAE特征既可作为解释性对象，也可作为TTS合成的控制方向。

英文摘要

Language models increasingly serve as the backbone of text-to-speech (TTS) systems, yet we understand little about the representations they build when text and generated speech tokens share a single residual stream. We train BatchTopK sparse autoencoders on the LM backbone of CosyVoice3 and introduce a modality-aware auto-interp pipeline that labels each feature from where it fires-text-prefix context, 1-second speech clips, or both. The recovered features are interpretable, spanning phonemes, laughter, accent prompts and speaker gender. Steering through the SAE latent space shows these features are causal rather than merely descriptive: targeted interventions raise laughter probability from 0.02 to 0.79, flip perceived speaker gender, and control speech rate while preserving spoken content. SAE features thus serve both as interpretability objects and as control directions for TTS synthesis.

URL PDF HTML ☆

赞 0 踩 0

2606.10025 2026-06-10 cs.RO cs.CV cs.LG 新提交

GHOST: Hierarchical Sub-Goal Policies for Generalizing Robot Manipulation

GHOST: 用于泛化机器人操作的层次化子目标策略

Sriram Krishna, Ben Eisner, Haotian Zhan, Ying Yuan, Haoyu Zhen, Chuang Gan, Shubham Tulsiani, David Held

AI总结提出GHOST框架，通过将控制分解为高层子目标预测和低层目标条件控制器，实现视觉运动操作策略的泛化，并利用人类演示适应新物体和任务变化。

Comments Accepted at RSS 2026

详情

AI中文摘要

我们提出了GHOST，一个学习视觉运动操作策略的框架，该策略能够泛化到训练分布之外。GHOST将控制分解为：(i) 高层策略，从多视角RGB-D观测中预测下一个子目标作为3D末端执行器位姿的分布，以及(ii) 低层目标条件控制器，执行特定于具体体的动作。为了将基于图像的策略条件化于3D目标，我们引入了一个简单的空间接口，将预测的目标投影到图像平面，并将其表示为末端执行器热图。在一系列操作任务中，与平坦的扩散策略相比，这种层次化分解持续提高了性能和鲁棒性。此外，我们展示了这种层次化接口也使得整合人类演示变得容易，而无需依赖（嘈杂的）动作重定向。由于子目标在很大程度上与具体体无关，我们在人类视频上训练高层策略，以指定如何应用和组合学到的技能，同时保持低层策略仅在机器人数据上训练。这种层次结构使得能够使用少量人类演示适应新物体和任务变化。

英文摘要

We present GHOST, a framework for learning visuomotor manipulation policies that generalize beyond the training distribution. GHOST factorizes control into (i) a high-level policy that predicts the next sub-goal as a distribution over 3D end-effector poses from multi-view RGB-D observations, and (ii) a low-level goal-conditioned controller that executes embodiment-specific actions. To condition image-based policies on 3D goals, we introduce a simple spatial interface that projects predicted goals into the image plane and represents them as end-effector heatmaps. Across a suite of manipulation tasks, this hierarchical factorization consistently improves performance and robustness compared to a flat Diffusion Policy. Further, we show that this hierarchical interface also makes it easy to incorporate human demonstrations without relying on (noisy) action retargeting. As sub-goals are largely embodiment-agnostic, we train the high-level policy on human video to specify how learned skills should be applied and composed, while keeping the low-level policy trained purely on robot data. This hierarchy enables adaptation to novel objects and task variations using a small number of human demonstrations.

URL PDF HTML ☆

赞 0 踩 0

2606.10021 2026-06-10 cs.CV 新提交

SpineReport: Automated 3D Quantification and Reporting of Lumbar Spine Degeneration on MRI

SpineReport: MRI上腰椎退变的自动化3D量化与报告

Nathan Molinier, Adrian A. Marth, Reto Sutter, Christoph Germann, Jacob A. Connolly, Mathieu Guay-Paquet, Nathan D. Schilaty, Kenneth A. Weber, Julien Cohen-Adad

AI总结提出SpineReport开源框架，利用鲁棒解剖分割从腰椎MRI中提取3D形态和信号特征，生成个体化报告，在中央管狭窄评估中AUC达0.95。

Comments Submitted to Medical Image Analysis

详情

AI中文摘要

腰椎疾病是全球致残的主要原因，但MRI上退变的可靠量化仍具挑战。临床实践中，分析主要在二维（2D）中进行，因为手动三维（3D）评估耗时。然而，2D测量重复性有限，尤其当解剖结构不与成像平面对齐时。现有自动化方法通常局限于2D、依赖离散分级或缺乏鲁棒性和可解释性。我们介绍SpineReport，一个用于腰椎MRI全面3D形态测量的开源全自动框架。利用鲁棒解剖分割，该方法从关键结构中提取定量指标，包括椎管、脊髓、椎骨、椎间盘和椎间孔。这些指标包括形态和信号特征，支持跨受试者和纵向评估。SpineReport进一步生成个体化报告，允许与队列分布比较，提高脊柱形态的可解释性和客观表征。临床相关性根据放射科医生报告的中央管、侧隐窝和椎间孔狭窄严重程度分级进行评估。指标与中央管狭窄严重程度强相关，T2加权脑脊液信号表现最佳（AUC = 0.95）。椎管前后径和面积比也显示出强相关性和高区分能力（AUC > 0.80）。对于侧隐窝狭窄，相关性中等，侧方脑脊液信号最具信息量（AUC = 0.73）。尽管感兴趣区域提取鲁棒，但未观察到与椎间孔狭窄的显著关联。SpineReport作为开放获取工具发布：此https URL

英文摘要

Lumbar spine conditions are a leading cause of disability worldwide, yet reliable quantification of degeneration from MRI remains challenging. In clinical practice, analysis is predominantly performed in two dimensions (2D), as manual three-dimensional (3D) assessment is time-consuming. However, 2D measurements suffer from limited reproducibility, particularly when anatomical structures are not aligned with the imaging plane. Existing automated approaches are often restricted to 2D, rely on discrete grading, or lack robustness and interpretability. We introduce SpineReport, an open-source, fully automated framework for comprehensive 3D morphometric analysis of lumbar spine MRI. Leveraging robust anatomical segmentations, the method extracts quantitative metrics from key structures, including the spinal canal, spinal cord, vertebrae, intervertebral discs, and foramina. These include both morphological and signal-based features, enabling cross-subject and longitudinal assessment. SpineReport further generates subject-specific reports that allow comparison with cohort distributions, improving interpretability and objective characterization of spinal morphology. Clinical relevance was evaluated against radiologist-reported severity grades for central canal, lateral recess, and foraminal stenosis. Metrics showed strong associations with central canal stenosis severity, with T2-weighted CSF signal providing the highest performance (AUC = 0.95). Canal AP diameter and area ratios also demonstrated strong correlations and high discriminative ability (AUC > 0.80). For lateral recess stenosis, associations were moderate, with lateral CSF signal being the most informative (AUC = 0.73). No significant associations were observed for foraminal stenosis despite robust region-of-interest extraction. SpineReport is released as an open-access tool: https://ivadomed.github.io/SpineReport/

URL PDF HTML ☆

赞 0 踩 0

2606.10019 2026-06-10 cs.CV cs.AI cs.RO 新提交

Generalized-CVO: Fast and Correspondence-Free Local Point Cloud Registration with Second Order Riemannian Optimization

广义CVO：基于二阶黎曼优化的快速无对应局部点云配准

Ray Zhang, Marcus Greiff, Thomas Lew, John Subosits

AI总结提出一种基于几何表面结构和再生核希尔伯特空间嵌入的无对应局部点云配准方法，采用二阶流形优化实现高达10倍加速，在LiDAR和RGB-D跟踪及物体配准中显著降低漂移并提升鲁棒性。

Comments 16 pages, 12 figures

详情

AI中文摘要

我们提出了一种快速且无需对应关系的局部点云配准方法，该方法利用了几何表面结构和再生核希尔伯特空间（RKHS）嵌入。该方法将点云表示为具有逐点各向异性核的连续函数，这些核编码了局部几何信息。这种公式化在沿表面法线方向改善对齐的同时，放松了沿切线方向的对齐。为了解决由此产生的配准问题，我们提出了一种具有近似黎曼海森矩阵的二阶流形优化方案，与先前基于无对应RKHS方法中使用的一阶求解器相比，实现了高达10倍的加速。我们展示了在多种室内外数据集上改进的帧到帧LiDAR和RGB-D跟踪精度。在驾驶领域的LiDAR跟踪配准任务中，我们在具有挑战性的特征稀疏环境下实现了平移和旋转漂移均减少超过55%。在物体配准基准测试中，我们展示了相比基于ICP的方法更强的鲁棒性，并且在优化全局初始化时（尤其是在中等错位情况下）获得了进一步的提升。

英文摘要

We propose a fast and correspondence-free local point cloud registration method that leverages geometric surface structure and reproducing kernel Hilbert space (RKHS) embeddings. The method represents point clouds as continuous functions with point-wise anisotropic kernels that encode local geometry. This formulation improves alignment along surface normals while relaxing alignment along tangential directions. To solve the resulting registration problem, we propose a second-order on-manifold optimization scheme with approximate Riemannian Hessians, achieving a speedup of up to 10x over the first-order solvers used in prior correspondence-free RKHS-based methods. We demonstrate improved frame-to-frame LiDAR and RGB-D tracking accuracy across diverse indoor and outdoor datasets. On a LiDAR tracking registration task in the driving domain, we achieve a reduction of $>55\%$ in both translational and rotational drift in challenging feature-sparse environments. On object registration benchmarks, we show improved robustness over ICP-based methods and further gains when refining global initialization, particularly under moderate misalignment.

URL PDF HTML ☆

赞 0 踩 0

2606.09967 2026-06-10 cs.CV 新提交

ABot-Earth 0.5: Generative 3D Earth Model

ABot-Earth 0.5：生成式3D地球模型

Ming Qian, Tianjian Ouyang, Mingchao Sun, Zijian Wang, Jincheng Xiong, Jiarong Han, Yongchang Zhang, Jiawei Zhang, Xu Wang, Yu Liu, Luyang Tang, Fei Yu, Zengye Ge, Mengmeng Du, Yuan Liu, Nianfei Fan, Song Wang, Yingliang Peng, Chunxue Jia, Yang Liu, Shiying Zeng, Haozhe Shi, Junnan Lai, Hongyu Pan, Zheng Wu, Ning Guo, Mu Xu, Hang Zhang

AI总结提出ABot-Earth 0.5框架，利用3D高斯泼溅从卫星图像生成大规模无缝3D环境，每平方公里合成时间低于10分钟，支持实时交互可视化，降低3D重建成本。

Comments From Amap-cvlab, Alibaba. Official page: https://abot-earth.amap.com/

详情

AI中文摘要

我们提出ABot-Earth 0.5，一个生成式3D框架，旨在从普遍存在的、地理参考的卫星图像中合成大规模无缝3D环境。为此，我们提出了一种新颖的生成模型，直接使用3D高斯泼溅（3DGS）表示。该模型在多样化的真实世界城市重建语料库上进行训练，学习生成逼真的几何和纹理。在推理时，它仅以卫星图像为条件合成新颖的3D场景，可扩展速率低于每平方公里10分钟，同时表现出卓越的真实感。该框架设计为易于访问，集成了分层细节级别（LOD）结构，允许在基于Web的地图引擎上进行实时交互式可视化。这种高保真模拟沙箱有效缓解了模拟到现实的领域差距，支持关键的具身人工智能下游应用，如闭环无人机导航。通过提供超低成本和高效的解决方案，ABot-Earth 0.5显著降低了大规模3D重建的技术和财务障碍，并推动了全球数字地球可视化的未来。

英文摘要

We present ABot-Earth 0.5, a generative 3D framework designed to synthesize vast, seamless 3D environments from ubiquitous, geospatially referenced satellite imagery. To achieve this, we propose a novel generative model formulated directly with the 3D Gaussian Splatting (3DGS) representation. The model is trained on a diverse corpus of existing real-world urban reconstructions, learning to generate realistic geometry and textures. At inference, it synthesizes novel 3D scenes conditioned solely on satellite imagery at a scalable rate of under 10 minutes per square kilometer, while demonstrating exceptional realism. The framework is designed for accessibility, with integrated hierarchical level-of-detail (LOD) structures that permit real-time, interactive visualization on web-based map engines. This high-fidelity simulation sandbox effectively mitigates the sim-to-real domain gap, enabling critical downstream Embodied AI applications like closed-loop UAV navigation. By providing an ultra-low-cost and high-efficiency solution, ABot-Earth 0.5 significantly lowers the technical and financial barriers to large-scale 3D reconstruction and empowers the future of global digital earth visualization.

URL PDF HTML ☆

赞 0 踩 0

2606.09966 2026-06-10 cs.SD 新提交

RespiraMFM: A Multimodal Foundation Model with Contrastive Audio-Language Alignment for Respiratory Disease Identification

RespiraMFM：一种用于呼吸道疾病识别的对比音频-语言对齐多模态基础模型

Shakhrul Iman Siam, Tiantian Feng, Jiankun Zhang, Shrikanth Narayanan, Mi Zhang

AI总结提出RespiraMFM多模态基础模型，通过对比音频-文本对齐策略整合呼吸音与临床信息，在监督和零样本任务中分别提升AUROC 9.15%和20.98%。

Comments ACL 2026 Main Conference

详情

AI中文摘要

呼吸道疾病仍然是全球死亡率的主要原因，及时准确的诊断对于改善患者预后和减轻医疗负担至关重要。虽然先前的工作已经探索了基于音频的呼吸道疾病检测模型，但这种单模态方法通常泛化能力和诊断精度有限。在本文中，我们提出了RespiraMFM，一种多模态基础模型，它将呼吸音与患者病史和症状相结合，以提高诊断准确性和疾病检测能力。我们引入了一种有效的音频-文本多模态整合对比对齐策略，使模型能够学习呼吸音与相应文本临床信息之间更好的跨模态表示。我们使用七个真实世界数据集，在监督微调和零样本设置下，对五种主要呼吸道疾病评估了RespiraMFM，在监督任务中AUROC提高了9.15%，在零样本任务中比现有基线提高了20.98%。这些发现强调了我们的框架在推进呼吸道疾病管理中早期诊断和改善临床决策方面的潜力。

英文摘要

Respiratory diseases remain a leading cause of global mortality, where timely and accurate diagnosis is critical to improving patient outcomes and reducing healthcare burdens. While prior work has explored audio-based models for respiratory disease detection, such unimodal approaches often suffer from limited generalizability and diagnostic precision. In this paper, we propose RespiraMFM, a Multimodal Foundation Model that integrates respiratory sounds with patient medical history and symptoms to enhance diagnostic accuracy and disease detection capabilities. We introduce an effective contrastive alignment strategy for audio-text multimodal integration, allowing the model to learn better cross-modal representations between respiratory sounds and corresponding textual clinical information. We evaluate RespiraMFM across five major respiratory diseases using seven real-world datasets in both supervised fine-tuning and zero-shot settings, achieving a 9.15% improvement in AUROC on supervised tasks and a 20.98% gain on zero-shot tasks over existing baselines. These findings underscore the potential of our framework to advance early diagnosis and improve clinical decision-making in respiratory disease management.

URL PDF HTML ☆

赞 0 踩 0

2606.09962 2026-06-10 cs.LG cs.AI cs.SD 新提交

Optimality of FSQ Tokens for Continuous Diffusion for Categorical Data with Application to Text-to-Speech

FSQ 令牌在分类数据连续扩散中的最优性及其在文本到语音中的应用

Vadim Popov, Wenju Gu, Tasnima Sadekova, Georgii Aparin, Assel Yermekova

AI总结本文研究连续扩散模型中离散令牌的潜在空间结构，通过理论分析和实验证明 FSQ 令牌化方案在分类数据连续扩散中最优，并在文本到语音任务中验证其优于基于 LLM 的方法。

详情

AI中文摘要

分类数据的连续扩散是一种属于扩散家族的框架，旨在生成离散数据。近年来，由于研究人员试图实现寻找自回归大型语言模型的合理替代方案这一具有挑战性的目标，对此类模型的科学兴趣不断增长。在本文中，我们研究了与离散令牌相对应的潜在空间结构的性质，这些性质通过扩散路径测度上的 Kullback-Leibler 散度和最优训练扩散模型正确预测令牌的准确性来表达。我们发现，FSQ 令牌化方案具有的潜在空间结构使其最适合分类数据的连续扩散，这一点通过严格的理论分析和数值实验得到了验证。为了在现实场景中验证我们的发现，我们训练了几个以语音令牌作为中间声学特征的文本到语音扩散模型，并表明基于 FSQ 令牌的模型确实表现最佳，而且它优于其强大的基于 LLM 的对应模型，同时体积更小、速度更快。

英文摘要

Continuous diffusion for categorical data is a framework belonging to the diffusion family and aiming at generating discrete data. The scientific interest to such models has been constantly increasing these days because researchers try to achieve a challenging goal of finding reasonable alternatives to autoregressive large language models. In this paper, we study the properties of the structure of the latent space corresponding to discrete tokens expressed in terms of Kullback-Leibler divergence on diffusion path measures and accuracy of the correct token prediction by the optimally trained diffusion model. We find that FSQ tokenization scheme has the latent space structure with the properties that make it best suited for continuous diffusion for categorical data as verified through rigorous theoretical analysis and numerical experiments. To validate our findings in real-life scenario, we train several text-to-speech diffusion models having speech tokens as intermediate acoustic features, and show that the one based on FSQ tokens indeed performs the best, and, moreover, it outperforms its strong LLM-based counterpart, at the same time being significantly smaller and faster.

URL PDF HTML ☆

赞 0 踩 0

2606.09961 2026-06-10 cs.LG cs.AI 新提交

3SPO: State-Score-Supervised Policy Optimization for LLM Agents

3SPO: 面向LLM智能体的状态分数监督策略优化

Yu Han, Kailing Li, Yang Jiao, Yulin Dai, Yuqian Fu, Linhai Zhuo, Tianwen Qian

AI总结提出3SPO算法，通过动态状态分数监督实现逐步骤策略优化，解决多轮智能体任务中奖励稀疏和信用分配问题，在ALFWorld和WebShop上分别比GRPO提升22.6%和15.6个百分点。

详情

AI中文摘要

通过强化学习（RL）将大型语言模型（LLM）训练为自主智能体，已使前沿模型在长周期任务中实现超人类性能。然而，现有RL算法在轨迹级别操作，仅在收集完整回合后执行策略优化。这种粗粒度方法在多轮智能体设置中面临根本性挑战，其中奖励稀疏、延迟，且跨单个步骤的信用分配至关重要。在这项工作中，我们提出\textbf{状态分数监督策略优化（3SPO）}，一种新颖的RL算法，通过动态状态分数监督执行逐步骤策略优化。在每个步骤，3SPO基于历史成功率计算状态分数，监督逐步骤信用分配、自适应回合和逐步骤策略优化，无需价值函数估计或额外辅助模型。理论上，在每状态臂架抽象下，我们证明所提出的分数监督分配机制实现了对数分配遗憾，并为动作识别、分数可区分性和过滤稳定性提供了样本复杂度保证。在ALFWorld和WebShop上使用Qwen2.5-1.5B/7B-Instruct的实验表明，3SPO在ALFWorld上持续优于GRPO $+22.6\%$，在WebShop上优于$+15.6$个百分点，同时使用相当资源实现了$2.4\times$更多的状态探索和$1.8\times$更快的收敛。代码可从此https URL获取。

英文摘要

Training large language models (LLMs) as autonomous agents via reinforcement learning (RL) has enabled frontier models to achieve superhuman performance in long-horizon tasks. However, existing RL algorithms operate at the trajectory level, performing policy optimization only after collecting complete episode rollouts. This coarse-grained approach faces fundamental challenges in multi-turn agent settings where rewards are sparse, delayed, and credit assignment across individual steps is critical. In this work, we propose \textbf{State-Score-Supervised Policy Optimization (3SPO)}, a novel RL algorithm that performs post-step policy optimization with dynamic state score supervision. At each step, 3SPO computes the state score based on historical success rates, supervising step-wise credit assignment, adaptive rollout and post-step policy optimization without requiring value function estimation or additional auxiliary models. Theoretically, under a per-state bandit abstraction, we show that the proposed score-supervised allocation mechanism achieves logarithmic allocation regret and provide sample-complexity guarantees for action identification, score distinguishability, and filtering stability. Experiments on ALFWorld and WebShop with Qwen2.5-1.5B/7B-Instruct show that 3SPO consistently outperforms GRPO by $+22.6\%$ on ALFWorld and $+15.6$ points on WebShop, while using comparable resources to achieve $2.4\times$ more state exploration and $1.8\times$ faster convergence. Code is available at https://github.com/genalyu/3SPO.

URL PDF HTML ☆

赞 0 踩 0

2606.09960 2026-06-10 cs.LG cs.AI 新提交

HydraCIL: Decoupled Class-Incremental Learning through Prototype-Guided Multi-Head Classifiers

HydraCIL: 通过原型引导的多头分类器实现解耦的类增量学习

Daniel Vila-Cruz, Laura Morán-Fernández, Verónica Bolón-Canedo

AI总结提出HydraCIL模型，通过冻结主干网络、解耦特征提取与学习，并利用原型相似性选择任务特定分类头，在资源受限环境中实现高效类增量学习，匹配或超越现有方法同时大幅降低训练时间和碳排放。

Comments Accepted for publication at the International Joint Conference on Neural Networks (IJCNN 2026)

详情

AI中文摘要

我们提出HydraCIL，一种基于原型引导的多头分类器的解耦持续学习模型，旨在嵌入式及资源受限环境中的可持续部署。虽然大多数类增量学习（CIL）方法依赖强大硬件和长时间再训练周期，但实际系统（如机器人或边缘AI设备）必须在有限资源下快速适应。HydraCIL通过冻结主干网络并将特征提取与学习解耦来解决这一问题。对于每个任务，特征被提取一次，并创建一个轻量级的、任务特定的分类器头，避免了昂贵的主干再训练。在推理时，HydraCIL通过与原型的相似性选择适当的头。在CIFAR-100、ImageNet-100、CoRe50和Flowers102数据集上的实验表明，HydraCIL匹配或超越了最先进的CIL方法，同时显著减少了训练时间和碳足迹，使其成为在能源效率和快速适应至关重要的实际及嵌入式环境中进行持续学习的实用解决方案。

英文摘要

We present HydraCIL, a decoupled continual learning model based on prototype-guided multi-head classifiers, targeting sustainable deployment in embedded and resource-constrained environments. While most Class-Incremental Learning (CIL) methods rely on powerful hardware and long retraining cycles, real-world systems, such as robots or edge AI devices, must adapt quickly with limited resources. HydraCIL addresses this gap by freezing the backbone and decoupling feature extraction from learning. For each task, features are extracted once and a lightweight, task-specific classifier head is created, avoiding costly backbone retraining. At inference, HydraCIL selects the appropriate head via similarity with prototypes. Experiments on CIFAR-100, ImageNet-100, CoRe50, and Flowers102 datasets show that HydraCIL matches or outperforms state-of-the-art CIL methods while significantly reducing training time and carbon footprint, making it a practical solution for continual learning in real-world and embedded settings, where energy efficiency and rapid adaptation are critical.

URL PDF HTML ☆

赞 0 踩 0

2606.09959 2026-06-10 cs.LG cs.AI 新提交

Temporal Context Conditioning for Seasonality-Aware Precipitation Nowcasting of High-Intensity Rainfall

面向高强度降雨的季节感知降水临近预报的时间上下文条件化

Gijs van Nieuwkoop, Siamak Mehrkanoon

AI总结提出TA-SmaAt-UNet模型，通过时间条件层（昼夜和季节循环编码）增强雷达降水临近预报，显著提升高强度降雨事件的预测性能。

Comments 9 pages, 6 figures

详情

AI中文摘要

降水临近预报越来越多地采用直接从近期雷达观测中学习的深度学习模型。尽管这类模型能有效捕捉短期降水运动，但它们往往缺乏降雨发展所依据的气象条件的更广泛上下文信息。本文研究轻量级时间上下文是否能改善基于雷达的临近预报，特别是针对高强度降雨。我们提出了时间感知小注意力U-Net（TA-SmaAt-UNet），它在核心SmaAt-UNet模型基础上扩展了时间条件层，利用昼夜时间和一年中时间的循环编码来调节中间特征表示。在KNMI雷达降水数据上的实验表明，时间条件化对罕见的高强度降水事件最为有益，同时也能改善季节变异性和预测降水强度分布的表征。层传导分析进一步表明，尽管参数成本很小，模型仍积极使用添加的时间条件层。这些发现表明，简单的、基于物理动机的时间上下文可以提高基于深度学习的降水临近预报的真实性和可靠性。我们的模型实现和训练设置可在GitHub上获取。

英文摘要

Precipitation nowcasting is increasingly being approached with deep learning models that learn directly from recent radar observations. Although such models can efficiently capture short-term precipitation motion, they often lack broader contextual information about the meteorological conditions under which rainfall develops. This paper investigates whether lightweight temporal context can improve radar-based nowcasting, particularly for high-intensity rainfall. We propose the Time-Aware Small-Attention U-Net (TA-SmaAt-UNet), which extends the core SmaAt-UNet model with temporal conditioning layers that use cyclical encodings of time-of-day and time-of-year to modulate intermediate feature representations. Experiments on KNMI radar precipitation data show that temporal conditioning is most beneficial for rare, high-intensity precipitation events, while also improving the representation of seasonal variability and predicted rainfall-intensity distributions. A layer conductance analysis further indicates that the added temporal conditioning layers are actively used by the model despite their small parameter cost. These findings suggest that simple, physically motivated temporal context can improve the realism and reliability of deep learning-based precipitation nowcasts. The implementation of our models and training setup is available on \href{https://github.com/gijsvn/TA-SmaAt-UNet}{GitHub}.

URL PDF HTML ☆

赞 0 踩 0

2606.09958 2026-06-10 cs.RO cs.AI 新提交

Uncertainty-Aware Motion Planning for Autonomous Driving in Mixed Traffic Environment

混合交通环境下自动驾驶的不确定性感知运动规划

Ming Cheng, Hao Chen, Ziyi Yang, Ziluowen Luo, Senzhang Wang

AI总结提出不确定性感知运动规划（UAMP），通过量化人类意图不确定性并引入不确定性校准值学习，提升自动驾驶在混合交通中的安全性和舒适性。

详情

AI中文摘要

在自动驾驶和人类驾驶车辆可能共存的混合交通环境中，自动驾驶车辆的运动规划需要预测周围人类驾驶员的未来行为。现有的基于强化学习的方法通常直接将预测的人类意图纳入观测以实现主动规划。然而，由于行为多样性、感知噪声和部分可观测性，人类意图本质上是不确定的。将预测意图视为确定性状态可能导致自动驾驶车辆做出不安全决策。为解决此问题，我们提出不确定性感知运动规划（UAMP），该规划将人类意图预测的不确定性纳入自动驾驶决策。具体来说，UAMP首先引入一个邻近感知不确定性估计器，以量化交互条件下的意图不确定性，并构建一个不确定性引导的联合意图分布，覆盖周围的人类驾驶车辆。在此不确定性集合内，UAMP进一步引入不确定性校准值学习（UCVL），以纠正因直接将不确定的人类意图预测纳入观测而产生的值函数学习偏差。在各种混合交通场景中的大量实验表明，与现有方法相比，UAMP显著提高了安全性和驾驶舒适性，同时保持了交通效率。代码发布在此https URL。

英文摘要

In mixed-traffic environments where autonomous and human-driven vehicles may co-exist, motion planning for autonomous vehicles requires anticipating the future behaviors of surrounding human drivers. Existing reinforcement learning-based methods generally directly incorporate the predicted human intents into the observation to enable a proactive planning. However, human intent is inherently uncertain due to the behavioral diversity, perception noise, and partial observability. Treating predicted intends as deterministic states can result in unsafe decisions for autonomous vehicles. To address this problem, we propose Uncertainty-Aware Motion Planning (UAMP), which incorporates uncertainty in human intent prediction for AV decision-making. Specifically, UAMP first introduces a proximity-aware uncertainty estimator to quantify the interaction-conditioned intent uncertainty and constructs an uncertainty-guided joint intent distribution over surrounding human-driven vehicles. Within this uncertainty set, UAMP further introduces Uncertainty-Calibrated Value Learning (UCVL) to correct value function learning biases arising from directly incorporating uncertain human intent predictions into the observation. Extensive experiments in various mixed-traffic scenarios show that UAMP significantly improves safety and driving comfort, while maintaining traffic efficiency compared with existing approaches. The code is released at https://anonymous.4open.science/r/UAMP-5638.

URL PDF HTML ☆

赞 0 踩 0

2606.09954 2026-06-10 cs.LG cs.AI 新提交

Does Normalization Choice Matter for Causal Large Time-Series Models?

归一化选择对因果大规模时间序列模型重要吗？

Samy-Melwan Vilhes, Gilles Gasso, Mokhtar Z Alaya

AI总结研究因果大规模时间序列模型中不同归一化策略对训练收敛和预测性能的影响，发现归一化选择显著影响模型效果。

详情

Journal ref: ICLR 2026 Workshop: Time Series in the Age of Large Models, Apr 2026, Rio De Janeiro, Brazil

AI中文摘要

用于时间序列预测的大规模模型已成为在异构信号集合上训练模型的有前景的范式。这些模型通常依赖于因果自回归架构，其中每个观测值根据过去依次预测。在实践中，真实世界的时间序列表现出非平稳性，这显著影响预测性能。为了缓解这一问题，通常采用归一化。然而，在高效的因果设置中，归一化可能在训练期间导致来自未来观测的信息泄漏。最近提出的替代方案，包括因果归一化和从初始观测计算的统计量，旨在解决这一问题，但其实际影响仍未被充分理解。在这项工作中，我们评估了基于Transformer的大规模时间序列模型（采用分块和高效因果策略训练）的归一化策略。我们展示了归一化选择显著影响训练收敛和预测性能。

英文摘要

Large models for time-series forecasting have been emerged as a promising paradigm for training models on heterogeneous collections of signals. These models typically rely on causal autoregressive architectures, where each observation is sequentially predicted from past. In practice, real-world time-series exhibit non-stationarities, which significantly influence predictive performance. To mitigate this, normalization is commonly employed. However, in efficient causal settings it might induce information leakage from future observations during training. Recent alternatives, including causal normalization and statistics computed from initial observations, have been proposed to address this issue, but their practical implications remain insufficiently understood. In this work, we evaluate normalization strategies for transformer-based large time-series models trained with patching and efficient causal strategy. We showcase that normalization choice significantly influences both training convergence and forecasting performance.

URL PDF HTML ☆

赞 0 踩 0

2606.09951 2026-06-10 cs.LG 新提交

Hasse Diagrams for Attention: A Partial Order Framework for Designing Transformer Masks

注意力的哈斯图：设计Transformer掩码的偏序框架

Chentao Li, Han Guo

AI总结本文提出一个理论框架，证明多层Transformer的信息流收敛到哈斯图，并将并行训练任务设计转化为求哈斯图最小公共超图问题，由此导出两种新注意力掩码。

Comments 21 pages, 9 figures. Theoretical framework for attention mask design; no experiments included

详情

AI中文摘要

在大型Transformer模型的训练过程中，注意力掩码控制序列中信息流的范围和方向。存在多种掩码变体，诸如FlexAttention之类的算子已经支持任意注意力掩码。然而，对于任意掩码所引起的信息流结构，一直缺乏系统的形式化分析。本文开发了一个完整的理论框架。我们证明，在足够深度下，多层Transformer的信息流收敛到一个哈斯图——表示偏序的有向无环图。在此基础上，我们将并行训练任务的设计重新表述为寻找哈斯图的最小公共超图的问题，并建立了最小公共超图的判定准则。这产生了一种直接从任务族推导注意力掩码的构造性方法。应用该框架，我们设计了两种新颖的掩码：一种确保训练-推理一致性的块生成注意力掩码（块双流注意力），以及一种全监督双向注意力掩码（蝴蝶注意力）。这些结果证明了该框架发现新结构的能力。

英文摘要

During the training of large Transformer models, attention masks regulate the scope and direction of information flow across a sequence. Numerous mask variants exist, and operators such as FlexAttention already support arbitrary attention masks. Nevertheless, a systematic formal analysis of the information-flow structure induced by arbitrary masks has been missing. This paper develops a complete theoretical framework. We prove that, with sufficient depth, the information flow of a multi-layer Transformer converges to a Hasse diagram -- a directed acyclic graph representing a partial order. Building on this, we recast the design of parallel training tasks as the problem of finding a minimal common supergraph of Hasse diagrams, and we establish a criterion for the minimal common supergraph. This yields a constructive method to derive attention masks directly from a family of tasks. Applying the framework, we design two novel masks: a block-generation attention mask that ensures training-inference consistency (Block Two-Stream Attention), and a fully supervised bidirectional attention mask (Butterfly Attention). These results demonstrate the framework's capacity to discover new structures.

URL PDF HTML ☆

赞 0 踩 0

2606.09949 2026-06-10 cs.LG cs.AI 新提交

Learning Where to Simulate: Generative Active Sampling for Online PDE Surrogate Training

学习何处模拟：在线PDE代理训练的生成式主动采样

Pierre Cesar, Sofya Dymchenko, Abhishek Purandare, Bruno Raffin

AI总结提出在线生成式主动采样（OGAS），通过扩散模型学习配置参数与代理性能的关系，主动采样高难度区域，显著降低尾部分布误差，提升代理最坏情况可靠性。

详情

AI中文摘要

数据驱动的PDE代理使用数值PDE求解器产生的数据进行训练。然而，当代理的目标是在广泛的PDE配置（例如初始条件和物理系数）上泛化时，生成具有代表性的训练集并非易事。配置参数的均匀采样通常低估了表现出挑战性动力学的轨迹，导致训练后的代理出现高预测误差和大误差方差。在线训练将数据生成和代理训练耦合，通过允许实时调整求解器参数提供了自然优势。为了有效利用这一能力，我们引入了在线生成式主动采样（OGAS），一种主动学习方法，它反应性地学习配置参数与代理性能之间的关系，以控制采样分布。OGAS与代理并行训练一个快速扩散模型，作为条件采样器，将代理派生的难度信号（例如损失或不确定性）映射到配置参数。通过主动从偏向高难度的先验中抽取目标信号，OGAS持续将数据生成导向挑战性区域，而不会延迟训练流程。我们在具有不同挑战性动力学的2D PDE（Kuramoto-Sivashinsky、Navier-Stokes、Gray-Scott）上评估OGAS，参数多达308个，并使用多种代理架构。在所有设置中，与均匀采样相比，OGAS一致地改善了尾部分布统计，显著降低了第99百分位以上的误差和整体误差离散度。虽然优先考虑挑战性轨迹引入了与平均误差的权衡，但OGAS有效确保了训练后代理的最坏情况可靠性，且壁钟时间开销可忽略不计。

英文摘要

Data-driven PDE surrogates are trained with data produced by numerical PDE solvers. However, when the surrogate's goal is to generalize across a wide range of PDE configurations (e.g., initial conditions and physical coefficients), generating a representative training set is non-trivial. Uniform sampling of configuration parameters often under-represents trajectories exhibiting challenging dynamics, leading to high prediction errors and large error variance in the trained surrogate. Online training, where data generation and surrogate training are coupled, offers a natural advantage by allowing solver parameters to be steered on-the-fly. To efficiently exploit this capability, we introduce Online Generative Active Sampling (OGAS), an active learning method that reactively learns the relationship between configuration parameters and surrogate performance to control the sampling distribution. OGAS trains a fast diffusion model in parallel to the surrogate to act as a conditional sampler, mapping a surrogate-derived difficulty signal (e.g., loss or uncertainty) to configuration parameters. By actively drawing target signals from a prior biased toward high difficulty, OGAS continuously steers data generation toward challenging regimes without delaying the training workflow. We evaluate OGAS across 2D PDEs with distinct challenging dynamics (Kuramoto-Sivashinsky, Navier-Stokes, Gray-Scott) and up to 308 parameters, using multiple surrogate architectures. Across all settings, OGAS consistently improves tail statistics, yielding substantial reductions in errors above the 99th percentile and overall error dispersion compared to uniform sampling. While prioritizing challenging trajectories introduces a trade-off with average error, OGAS effectively ensures worst-case reliability of trained surrogates with negligible wall-time overhead.

URL PDF HTML ☆

赞 0 踩 0

2606.09940 2026-06-10 cs.LG cs.AI 新提交

Interactions Between Crosscoder Features: A Compact Proofs Perspective

交叉编码器特征间的交互：一个紧凑证明的视角

Dmitry Manning-Coe, Thomas Read, Anna Soligo, Oliver Clive-Griffin, Chun-Hei Yip, Rajashree Agrawal, Jason Gross

AI总结本文从紧凑证明角度形式化交叉编码器特征交互，提出交互度量并应用于计算稀疏性、语义聚类和检测休眠代理。

Comments Accepted at the NeurIPS 2025 Workshop on Mechanistic Interpretability

详情

AI中文摘要

像稀疏自编码器（SAEs）和交叉编码器这样的字典学习方法试图通过将模型的激活分解为独立特征来解释模型。因此，特征之间的交互会在重构中引入误差。我们通过紧凑证明形式化了这一直觉，并做出了五项贡献。首先，我们展示了原则上如何使用交叉编码器构建模型性能的紧凑证明。其次，我们证明了该证明中出现的误差项可以自然地解释为交叉编码器特征之间交互的度量，并提供了多层感知器（MLP）层中交互项的显式表达式。然后，我们提供了这种新交互度量的三个应用。在第三项贡献中，我们展示了交互项本身可以用作可微分的损失惩罚。应用这种惩罚，我们可以实现“计算稀疏”的交叉编码器，当在每个数据点和神经元仅保留单个特征时，保留MLP性能的60%，而标准交叉编码器仅保留10%。接着，我们展示了根据我们的交互度量进行聚类可以提供语义上有意义的特征聚类，最后，我们展示了休眠代理具有显著的交互。代码可在以下网址获取：https://this URL。

英文摘要

Dictionary learning methods like Sparse Autoencoders (SAEs) and crosscoders attempt to explain a model by decomposing its activations into independent features. Interactions between features hence induce errors in the reconstruction. We formalize this intuition via compact proofs and make five contributions. First, we show how, \textit{in principle}, a compact proof of model performance can be constructed using a crosscoder. Second, we show that an error term arising in this proof can naturally be interpreted as a measure of interaction between crosscoder features and provide an explicit expression for the interaction term in the Multi-Layer Perceptron (MLP) layers. We then provide three applications of this new interaction measure. In our third contribution we show that the interaction term itself can be used as a differentiable loss penalty. Applying this penalty, we can achieve ``computationally sparse'' crosscoders that retain $60\%$ of MLP performance when only keeping a single feature at each datapoint and neuron, compared to $10\%$ in standard crosscoders. We then show that clustering according to our interaction measure provides semantically meaningful feature clusters, and finally that sleeper agents have significant interactions. Code is available at https://github.com/chainik1125/crosscoders-feature-interactions/tree/arxiv.

URL PDF HTML ☆

赞 0 踩 0

2606.09937 2026-06-10 cs.LG cs.AI cs.CL 新提交

RKSC: Reasoning-Aware KV Cache Sharing and Confident Early Exit for Multi-Step LLM Inference

RKSC：面向多步LLM推理的感知推理的KV缓存共享与自信提前退出

Anirudh Sekar

AI总结提出RKSC框架，通过注意力相似性KV共享、置信门控提前退出和推理选择性块缓存管理，消除多分支LLM推理中的结构冗余，实现平均3.008倍加速，错误率仅0.37%。

Comments Accepted to the ICML 2026 Workshop on Statistical Frameworks for Uncertainty in Agentic Systems

详情

AI中文摘要

我们提出RKSC（感知推理的KV缓存共享），一种无需训练的推理框架，消除了多分支LLM推理流程中的两种结构冗余。ASKS（注意力相似性KV共享）计算前缀KV缓存一次，并通过隐藏状态余弦相似度广播给所有语义相似的分支，严格推广了vLLM和SGLang使用的精确令牌前缀缓存。CGEE（置信门控提前退出）应用两种互补的退出机制：（1）当生成置信度在分支间具有决定性时，完全跳过验证前向传播；（2）当逐层熵稳定时，在中间层终止验证传播，使用Transformer骨干上的轻量级钩子。RSBCM（推理选择性块缓存管理器）通过注意力加权深度优先驱逐防止无界缓存增长。在五个模型家族（7B-10B）、四个基准测试和1000个评估问题上，RKSC相对于无KV基线实现了平均3.008倍加速（峰值3.990倍），相对于vLLM等效前缀缓存平均提升1.66倍，CGEE导致的错误率仅为0.37%（1616次验证调用中6次错误）。无需微调或架构更改。代码可在该URL获取。

英文摘要

We introduce RKSC (Reasoning-Aware KV Cache Sharing), a training-free inference framework that eliminates two structural redundancies in multi-branch LLM reasoning pipelines. ASKS (Attention-Similarity KV Sharing) computes the prefix KV cache once and broadcasts it to all semantically similar branches via hidden-state cosine similarity, strictly generalising the token-exact prefix caching used by vLLM and SGLang. CGEE (Confidence-Gated Early Exit) applies two complementary exit mechanisms: (1) it skips the verification forward pass entirely when generation confidence is decisive across branches, and (2) it terminates the verification pass at an intermediate layer when per-layer entropy stabilises, using lightweight hooks on the transformer backbone. RSBCM (Reasoning-Selective Block Cache Manager) prevents unbounded cache growth via attention-weighted depth-priority eviction. Across five model families (7B-10B), four benchmarks, and 1,000 evaluated problems, RKSC achieves a mean speedup of 3.008x over the No-KV baseline (peak 3.990x), a 1.66x mean improvement over vLLM-equivalent prefix caching, with a CGEE-induced error rate of only 0.37% (6 errors out of 1,616 verify calls). No fine-tuning or architecture changes are required. Code is available at https://github.com/AnirudhSekar/RKSC.

URL PDF HTML ☆

赞 0 踩 0