arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.29158 2026-05-29 cs.LG cs.IR q-bio.BM

PROTOCOL: Late Interaction Retrieval for Protein Homolog Search

PROTOCOL: 用于蛋白质同源搜索的延迟交互检索

Gabrielle Cohn, Rohan Gumaste, Minh Hoang, Vihan Lakshman

AI总结提出ProtoCol模型，利用ColBERT风格的延迟交互机制对残基嵌入进行最大相似度评分，以提升远程同源搜索的灵敏度，在SCOPe超家族和Pfam clan基准上优于多种基线方法。

详情

AI中文摘要

蛋白质同源搜索是功能注释、结构预测和进化分析的基础，但在全局序列相似性较弱且经典比对方法灵敏度下降的“模糊区”中仍然具有挑战性。蛋白质语言模型提供了上下文感知的表示，可以在此范围内提高比对灵敏度。然而，先前的基于蛋白质嵌入的检索流程通常将这些表示池化为单个向量，可能掩盖揭示远程同源性的局部基序、结构域或保守残基。我们引入了ProtoCol，该模型将蛋白质表示为残基嵌入的集合，并使用ColBERT风格的延迟交互来测试残基级比较是否改善同源检索。ProtoCol独立编码蛋白质，保持候选表示可预计算，并通过残基嵌入上的MaxSim对候选进行评分。在SCOPe超家族和Pfam clan基准上，ProtoCol优于基于序列组成、比对、池化PLM和训练的单向量基线，支持延迟交互作为远程同源搜索的有效检索层。

英文摘要

Protein homology search underlies function annotation, structure prediction, and evolutionary analysis, but remains challenging in the "twilight zone," where global sequence similarity is weak and classical alignment methods lose sensitivity. Protein language models provide context-aware representations that could improve alignment sensitivity in this regime. However, prior protein embedding-based retrieval pipelines often pool these representations into a single vector, potentially obscuring local motifs, domains, or conserved residues that reveal remote homology. We introduce ProtoCol, a model which represents proteins as sets of residue embeddings and uses ColBERT-style late interaction to test whether residue-level comparison improves homolog retrieval. ProtoCol encodes proteins independently, keeps candidate representations pre-computable, and scores candidates with MaxSim over residue embeddings. On SCOPe superfamily and Pfam clan benchmarks, ProtoCol outperforms sequence-composition, alignment-based, pooled PLM, and trained single-vector baselines, supporting late interaction as an effective retrieval layer for remote homology search.

URL PDF HTML ☆

赞 0 踩 0

2605.29157 2026-05-29 cs.LG cs.AI cs.CL

Parallax: Parameterized Local Linear Attention for Language Modeling

Parallax: 参数化局部线性注意力用于语言建模

Yifei Zuo, Dhruv Pai, Zhichen Zeng, Alec Dewulf, Shuming Hu, Zhaoran Wang

AI总结提出Parallax，一种可扩展的参数化局部线性注意力机制，通过消除数值求解器并学习查询投影器，在语言模型预训练中实现一致的困惑度改进和下游任务迁移优势。

详情

AI中文摘要

大型语言模型（LLM）已成为人工智能的核心范式，但注意力的核心计算原语在结构上仍未改变。局部线性注意力（LLA）是一种从测试时回归框架的非参数统计中推导出的注意力机制。与先前关于高效注意力变体的研究相比，LLA将softmax注意力中的局部常数估计升级为局部线性估计，在关联记忆上提供了可证明更优的偏差-方差权衡。然而，由于计算和数值稳定性问题，LLA尚未在LLM预训练中扩展。我们引入Parallax，一种可扩展用于LLM的参数化局部线性注意力。Parallax消除了LLA中的数值求解器，并学习一个额外的类似查询的投影器来探测KV协方差。我们将Parallax置于一个由带宽、投影器构造和仿射结构连接的注意力机制家族中。我们提出一种硬件感知算法，提高了相对于FlashAttention的算术强度，将注意力转移到更受计算限制的区域。我们的原型解码核在各种批大小和上下文长度下匹配或超越FlashAttention 2/3。我们在0.6B和1.7B规模上预训练Parallax，发现整个预训练过程中困惑度持续改善，且收益迁移到下游基准测试。在参数匹配和计算匹配的控制下，优势持续存在，展示了帕累托改进。我们进行了仔细的预训练消融实验，并发现了一个新现象：Muon优化器解锁了Parallax的能力。据我们所知，这是架构研究文献中首次对注意力机制进行强架构-优化器协同设计的实证演示。

英文摘要

Large Language Models (LLMs) have become the central paradigm in artificial intelligence, yet the core computational primitive of attention has remained structurally unchanged. Local Linear Attention (LLA) is an attention mechanism derived from nonparametric statistics in the test-time regression framework. In contrast to prior research on efficient attention variants, LLA upgrades the local constant estimate in softmax attention to a local linear estimate, yielding provably superior bias-variance tradeoffs for associative memory. However, LLA has not been scaled in LLM pretraining due to computational and numerical stability concerns. We introduce Parallax, a parameterized Local Linear Attention that is scalable for LLMs. Parallax eliminates the numerical solver in LLA and learns an extra query-like projector that probes the KV covariance. We place Parallax within a family of attention mechanisms connected by the bandwidth, the probe construction and the affine structure. We propose a hardware-aware algorithm that increases the arithmetic intensity over FlashAttention, shifting attention into a more compute bound regime. Our prototype decode kernel matches or outperforms FlashAttention 2/3 across diverse batch sizes and context lengths. We pretrain Parallax at 0.6B and 1.7B scales and find consistent perplexity improvements throughout pretraining with gains that transfer to downstream benchmarks. The advantage persists under both parameter-matched and compute-matched controls, demonstrating a Pareto improvement. We perform careful pretraining ablations and identify a novel phenomenon whereby Muon unlocks the capacity of Parallax. To our knowledge, this is the first empirical demonstration of strong architecture-optimizer codesign for attention mechanisms in the architecture research literature.

URL PDF HTML ☆

赞 0 踩 0

2605.29156 2026-05-29 cs.LG cs.CL

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

RUBRIC-ARROW：面向不可验证领域的大语言模型后训练的交替逐点评分规则奖励建模

Haoxiang Jiang, Zihan Dong, Tianci Liu, Wanying Wang, Ran Xu, Tony Yu, Linjun Zhang, Haoyu Wang

AI总结针对非可验证领域绝对评分困难的问题，提出交替框架RUBRIC-ARROW，联合训练规则生成器和条件裁判，通过概率评分规则和交替GRPO减少平局，提升奖励建模准确率并改善下游策略后训练。

2605.29155 2026-05-29 cs.RO cs.AI cs.DC

CA-AC-MPC: CUDA-Accelerated Actor-Critic Model Predictive Control

CA-AC-MPC: CUDA加速的Actor-Critic模型预测控制

Antoonio Buo, Vittorio Cammarota, Michele Avagnale, Pierluigi Arpenti, Vincenzo Lippiello, Fabio Ruggiero

AI总结提出CUDA加速的AC-MPC变体，通过GPU并行优化降低训练和推理延迟，在敏捷无人机竞速任务中实现最先进圈速和近极限动态性能。

Comments Accepted for presentation at the 2026 International Conference on Unmanned Aircraft Systems, ICUAS 2026

2605.29153 2026-05-29 cs.LG cs.AI physics.comp-ph

Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization

揭示科学机器学习中的多机制模式：不同的失败模式与机制特定优化

Yuxin Wang, Yuanzhe Hu, Xiaokun Zhong, Xiaopeng Wang, Haiquan Lu, Tianyu Pang, Michael W. Mahoney, Yujun Yan, Pu Ren, Yaoqing Yang

AI总结通过机制感知诊断框架，研究科学机器学习模型在不同超参数设置下的多机制行为，发现三种一致机制结构、优化效果的机制特异性以及精细失败模式，为提升鲁棒性提供指导。

Comments Accepted by ICML 2026

详情

AI中文摘要

在不同超参数设置下训练的神经网络可能落入不同的训练“机制”，这些机制内部行为一致，而机制间存在定性差异。本文通过一个机制感知的诊断框架，联合分析性能、训练动态和损失景观几何，研究科学机器学习（SciML）模型中的这种多机制行为。我们识别出三个关键发现：（i）在许多标准SciML模型、不同的约束施加方式以及各种优化器设计中，一致地出现一个三机制结构；（ii）优化效果是机制特定的，没有单一方法在所有机制中表现良好；（iii）SciML模型可能表现出精细的失败模式，这些模式可能挑战对标准损失景观度量的传统解释。我们的结果为建立SciML中失败模式的统一、任务无关视角提供了一种方法，并为提高鲁棒性提供机制感知的指导。我们在广泛使用的SciML模型上验证了这些发现，包括物理信息神经网络、神经算子和神经常微分方程，涵盖了代表性的常微分方程和偏微分方程基准。

英文摘要

Neural networks trained under different hyperparameter settings can fall into distinct training "regimes," with consistent behavior within regimes and qualitative differences across regimes. In this paper, we study such multi-regime behavior in scientific machine learning (SciML) models through a regime-aware diagnostic framework that jointly analyzes performance, training dynamics, and loss-landscape geometry. We identify three key findings: (i) a consistent three-regime structure emerges across many standard SciML models, different constraint enforcements, and various optimizer designs; (ii) optimization effectiveness is regime-specific, with no single method performing well across all regimes; and (iii) SciML models can exhibit fine-grained failure modes that can challenge conventional interpretations of standard loss-landscape metrics. Our results provide an approach to establish a unified, task-oblivious perspective on failure modes in SciML and to inform regime-aware guidance for improving robustness. We validate these findings across widely-used SciML models, including physics-informed neural networks, neural operators, and neural ordinary differential equations, on benchmarks spanning representative ordinary and partial differential equations.

URL PDF HTML ☆

赞 0 踩 0

2605.29152 2026-05-29 cs.LG math.OC stat.ML

Do Deep Networks Forget Initialization? A Forgetting-Time View of Practical Inductive Bias

深度网络会忘记初始化吗？实用归纳偏见的遗忘时间视角

Mohua Das, Pierfrancesco Beneventano, Shibshankar Dey, Gareth H. McKinkey, Tomaso Poggio

AI总结通过引入初始化记忆度量，研究随机初始化对训练后预测器的影响，发现低学习率SGD保留初始化记忆而Adam族方法遗忘，且遗忘动力学与泛化正则化相关。

Comments 39 pages, 9 figures

详情

AI中文摘要

随机初始化的神经网络在函数上诱导先验，但实践中使用的预测器仅在训练后产生。我们询问这种初始偏差有多少在训练流程中幸存。为了使问题可测量，我们引入初始化记忆：验证选择的预测器对随机初始化尺度的依赖性。我们在ResNet上进行了受控的CIFAR-10实验，其中初始化记忆已经尖锐地分离了训练机制。低学习率SGD可以在记住初始化的同时进行插值：在批大小$b=128$的ResNet-9上，尽管训练准确率$\ge99.5\%$，测试准确率在不同初始化尺度上变化$26.5$个百分点。这不是欠训练：将相同的低学习率机制扩展到$5{,}000$个epoch，差异基本不变。相比之下，Adam族方法在很大程度上消除了这种依赖性。当较大的学习率与显式$L_2$范数控制配对时，SGD也可以被遗忘。我们根据遗忘的时间尺度解释这些发现：梯度流式动力学可以保留初始化记忆，而随机有限步效应、显式范数衰减和自适应预处理在由显式或隐式正则化大小控制的尺度上消除它。因此，训练网络的实用归纳偏见不仅仅是架构先验，而是经过训练流程遗忘动力学过滤后的架构先验；并且改善泛化的相同正则化器正是那些消除初始化记忆的。

英文摘要

Randomly initialized neural networks induce a prior over functions, but the predictor used in practice is produced only after training. We ask how much of this initial bias survives the training pipeline. To make the question measurable, we introduce initialization memory: the dependence of the validation-selected predictor on the scale of the random initialization. We perform controlled CIFAR-10 experiments on ResNets where initialization memory already sharply separates training regimes. Low-learning-rate SGD can interpolate while still remembering its initialization: on ResNet-9 with batch size $b=128$, test accuracy varies by $26.5$ percentage points across initialization scales despite $\ge99.5\%$ training accuracy. This is not undertraining: extending the same low-learning-rate regime to $5{,}000$ epochs leaves the spread essentially unchanged. In contrast, Adam-family methods largely erase the dependence. SGD can also be made to forget when larger learning rates are paired with explicit $L_2$ norm control. We interpret these findings in terms of the time scale of forgetting: gradient-flow-like dynamics can preserve initialization memory, whereas stochastic finite-step effects, explicit norm decay, and adaptive preconditioning erase it on scales governed by the size of explicit or implicit regularization. The practical inductive bias of a trained network is therefore not the architectural prior alone, but the architectural prior after being filtered by the forgetting dynamics of the training pipeline; and the same regularizers that improve generalization are precisely those that erase memory of initialization.

URL PDF HTML ☆

赞 0 踩 0

2605.29148 2026-05-29 cs.LG stat.ML

Optimal Gap-Dependent Regret for Private Stochastic Decision-Theoretic Online Learning

私有随机决策理论在线学习的最优间隙相关遗憾

Tommaso Cesari, Roberto Colomboni

AI总结针对完全信息、事件级纯差分隐私的随机决策理论在线学习，提出一种无水平线的纯差分隐私算法，并证明遗憾界为O(log K / Δ_min + log K / ε)。

详情

AI中文摘要

我们研究具有完全信息和事件级纯差分隐私的随机决策理论在线学习。Hu和Mehta在COLT上提出的一个开放问题要求确定在纯事件级差分隐私下，随机决策理论在线学习的最优间隙相关遗憾率。对于$K$个动作，损失在$[0,1]$中，且唯一最优动作与次优动作的间隙为$Δ_{\min}$，已知下界为$ rac{\log K}{\min\{Δ_{\min},\varepsilon\}} $，或等价地，在通用常数范围内，为\[ rac{\log K}{Δ_{\min}}+ rac{\log K}{\varepsilon} \]。我们给出一个无水平线的纯DP算法，并证明对于任意水平线$T$，显式遗憾界\[ \operatorname{Reg}_T \le 1000 \cdot \left( rac{\log K}{Δ_{\min}}+ rac{\log K}{\varepsilon} ight) \]。数值常数未优化。该算法将时间划分为指数增长大小的块，每个块内执行单个动作，并通过指数机制（应用于前一个块的数据无关随机前缀）选择下一个动作。随机前缀将块遗憾转化为所有前缀长度上softmax选择误差的和。单个熵势参数以代价$\log K/\varepsilon$控制所有隐私主导的大间隙动作。

英文摘要

We study stochastic decision-theoretic online learning with full information and event-level pure differential privacy. A COLT open problem of Hu and Mehta asks to determine the optimal gap-dependent regret rate for stochastic decision-theoretic online learning under pure event-level differential privacy. For $K$ actions, losses in $[0,1]$, and a unique best action separated from the second-best action by gap $Δ_{\min}$, the known lower bound is of order $ \frac{\log K}{\min\{Δ_{\min},\varepsilon\}}, $ or equivalently, up to universal constants, of order \[ \frac{\log K}{Δ_{\min}}+\frac{\log K}{\varepsilon}. \] We give a horizon-free pure-DP algorithm and prove the explicit regret bound \[ \operatorname{Reg}_T \le 1000 \cdot \left(\frac{\log K}{Δ_{\min}}+\frac{\log K}{\varepsilon}\right) \] for every horizon $T$. The numerical constant is not optimized. The algorithm partitions time into blocks of exponentially increasing size, plays a single action throughout each block, and chooses the next action by an exponential mechanism applied to a data-independent random prefix of the previous block. The random prefix converts block regret into a sum, over all prefix lengths, of softmax selection errors. A single entropy-potential argument controls all privacy-dominated large-gap actions at cost $\log K/\varepsilon$.

URL PDF HTML ☆

赞 0 踩 0

2605.29144 2026-05-29 cs.RO cs.SY eess.SY

Learning and Adaptation in Wire Arc Additive Manufacturing Bead Geometry Control

线弧增材制造焊道几何控制中的学习与自适应

Chen-Lung Lu, John Wen

AI总结针对线弧增材制造中热场与几何耦合的非线性动态过程，提出基于循环神经网络和一步预测控制的数据驱动方法，并通过逐层预测误差更新模型实现自适应，实验验证了在焊道高度和宽度一致性上的显著提升。

详情

AI中文摘要

机器人线弧增材制造（WAAM）受复杂非线性过程动力学控制，将热场与构建几何耦合。该过程可视为多输入/多输出动态系统，以焊枪速度和送丝速率作为输入，焊道沉积高度和宽度作为输出。本文利用输入/输出数据学习数据驱动模型，并将其用于焊道规划和控制。我们证明，简单的循环神经网络架构和一步预测控制可以在高度和宽度一致性方面改善过程性能。为了考虑打印过程中热条件的变化，我们使用前一层的预测误差更新学习模型。该自适应步骤进一步提高了预测精度和控制器性能。在集成线扫描反馈的机器人WAAM实验平台上进行的实验表明，与恒定输入和静态模型基线相比，高度和宽度一致性有显著改善。所提出的学习和自适应框架为实现增材制造过程的鲁棒、数据驱动调控提供了实用途径。

英文摘要

Robotics Wire Arc Additive Manufacturing (WAAM) is governed by complex and nonlinear process dynamics coupling thermal field to the build geometry. The process may be regarded as a multi-input/multi-output dynamical system with welding torch speed and wire feed rate as inputs and weld bead deposition height and width as outputs. In this paper, we use the input/output data to learn a data-driven model and use it for weld planning and control. We show that a simple recurrent neural network architecture and one-step-ahead predictive control can improve the process performance in terms of height and width consistency. To account for the changing thermal conditions during the printing process, we update the learning model using prediction error from the previous layer. This adaptation step further improves the prediction accuracy and controller performance. Experiments on a robotic WAAM testbed with integrated line-scanner feedback significant improvements in height and width consistency compared to constant input and static model baselines. The proposed learning and adaptation framework provides a practical pathway toward robust, data-driven regulation of additive manufacturing processes.

URL PDF HTML ☆

赞 0 踩 0

2605.29138 2026-05-29 cs.RO cs.AI cs.LG cs.SY eess.SY

Multi-Resolution End-to-End Deep Neural Network for Optimizing Latency-Accuracy Tradeoff in Autonomous Driving

用于优化自动驾驶延迟-准确性权衡的多分辨率端到端深度神经网络

Qitao Weng, Heechul Yun

AI总结提出一种多分辨率端到端CNN，通过运行时选择输入分辨率和分辨率重定向，在延迟预算下优化自动驾驶的延迟-安全性权衡。

Comments ICCPS 2026

详情

AI中文摘要

延迟-准确性权衡是深度神经网络在信息物理系统实时应用中的基础。在自动驾驶中，安全性尤其依赖于预测质量和从感知到执行的端到端延迟。我们观察到：(1) 当考虑延迟时，延迟最优的网络配置随场景上下文和计算可用性而变化；(2) 单一固定分辨率模型在条件变化时变得次优。我们提出了一种用于CARLA城市驾驶挑战的多分辨率端到端深度神经网络，使用单目摄像头输入。我们的方法采用支持多种输入分辨率的卷积神经网络，通过每分辨率批归一化，使得在延迟预算下运行时选择理想输入尺度成为可能，以及分辨率重定向，允许在没有原始训练数据集的情况下进行多分辨率训练。我们在CARLA中实现并评估了我们的多分辨率端到端CNN，以探索延迟-安全性边界。结果显示，相对于固定分辨率基线，每条路线的安全性指标——车道入侵、红灯违规和碰撞——一致改善。

英文摘要

Latency-accuracy tradeoffs are fundamental in real-time applications of deep neural networks (DNNs) for cyber-physical systems. In autonomous driving, in particular, safety depends on both prediction quality and the end-to-end delay from sensing to actuation. We observe that (1) when latency is accounted for, the latency-optimal network configuration varies with scene context and compute availability; and (2) a single fixed-resolution model becomes suboptimal as conditions change. We present a multi-resolution, end-to-end deep neural network for the CARLA urban driving challenge using monocular camera input. Our approach employs a convolutional neural network (CNN) that supports multiple input resolutions through per-resolution batch normalization, enabling runtime selection of an ideal input scale under a latency budget, as well as resolution retargeting, which allows multi-resolution training without access to the original training dataset. We implement and evaluate our multi-resolution end-to-end CNN in CARLA to explore the latency-safety frontier. Results show consistent improvements in per-route safety metrics - lane invasions, red-light infractions, and collisions - relative to fixed-resolution baselines.

URL PDF HTML ☆

赞 0 踩 0

2605.29136 2026-05-29 cs.CV cs.LG

Eulerian Gaussian Splatting using Hashed Probability Pyramids

使用哈希概率金字塔的欧拉高斯溅射

Mia Gaia Polansky, George Kopanas, Stephan Garbin, Todd Zickler, Dor Verbin

AI总结提出一种基于概率溅射的辐射场框架，用梯度优化的体积概率密度替代启发式操作，通过多尺度哈希网格实现端到端优化，在mip-NeRF 360上达到SOTA重建质量并保持3DGS渲染速度。

Comments CVPR 2026. Project Page: https://euleriansplatting.github.io

详情

AI中文摘要

我们引入了一种基于概率溅射的辐射场框架，该框架保留了3D高斯溅射（3DGS）的快速光栅化和测试效率，同时用基于梯度优化的体积概率密度替代了启发式原始操作。我们不通过手动调整的密集化（例如ADC）来重新定位、分割或剔除高斯体，而是将原始位置视为从持久、可学习的密度中抽取的样本。我们使用一种新颖的、内存高效的多尺度层次网格来实例化该密度，从而实现端到端的梯度优化。为了稳定优化，我们推导了一个具有控制变量的无偏梯度估计器，显著降低了方差。通过允许概率质量流向损失要求的地方，我们的框架消除了脆弱的先验，并自然地探索体积，在mip-NeRF 360上实现了最先进的重建质量，同时保持了3DGS级别的渲染速度。

英文摘要

We introduce a probabilistic splat-based radiance field framework that retains the fast rasterization and test-time efficiency of 3D Gaussian Splatting (3DGS) while replacing heuristic primitive manipulation with gradient-based optimization of a volumetric probability density. Rather than relocating, splitting, or culling Gaussians via hand-tuned densification (e.g., ADC), we treat primitive locations as samples drawn from a persistent, learnable density. We instantiate this density using a novel, memory-efficient multi-scale hierarchical grid that enables end-to-end gradient-based optimization. To stabilize the optimization, we derive an unbiased gradient estimator with control variates that markedly reduces variance. By allowing probability mass to flow to where the loss demands, our framework eliminates brittle priors and naturally explores the volume, achieving state-of-the-art reconstruction quality on mip-NeRF 360 while preserving 3DGS-level rendering speed.

URL PDF HTML ☆

赞 0 踩 0

2605.29129 2026-05-29 cs.AI cs.CY econ.GN q-fin.EC

Governing Technical Debt in Agentic AI Systems

代理型AI系统中的技术债务治理

Muhammad Zia Hydari, Raja Iqbal, Narayan Ramasubbu

AI总结本文定义了代理型AI系统中的技术债务和随机税概念，并提出通过轻量级仪表盘和治理控制来管理这些负债和运营成本。

详情

AI中文摘要

代理型AI系统正越来越多地被探索作为生产基础设施：它们进行多步推理、调用工具、通过工作流行动，并通过记忆和反馈进行适应。这些系统带来了传统软件或预测性机器学习技术债务未能完全涵盖的治理挑战。我们将代理型技术债务定义为当提示、记忆、工具模式、编排图、控制策略和可观测性例程被拼凑在一起，速度快于它们能够被验证、标准化和治理时所产生的累积负债。我们将随机税定义为将概率性代理行为保持在可接受范围内所产生的重复性运营负担。区别很重要：债务是设计和治理负债的存量，而税是运营成本的流量，源于随机代理通过工具和工作流行动。我们概述了管理者如何通过轻量级仪表盘和治理控制使两者可见。

英文摘要

Agentic AI systems are increasingly being explored as production infrastructure: they reason over multiple steps, call tools, act through workflows, and adapt through memory and feedback. These systems create governance challenges that are not fully captured by traditional software or predictive ML technical debt. We define Agentic Technical Debt as the accumulated liability created when prompts, memory, tool schemas, orchestration graphs, control policies, and observability routines are patched together faster than they can be validated, standardized, and governed. We define Stochastic Tax as the recurring operating burden of keeping probabilistic agent behavior within acceptable bounds. The distinction matters: debt is a stock of design and governance liability, while the tax is a flow of operating cost that arises because stochastic agents act through tools and workflows. We outline how managers can make both visible through lightweight dashboards and governance controls.

URL PDF HTML ☆

赞 0 踩 0

2605.29126 2026-05-29 cs.LG cs.AI

When and How Long? The Readout-Mediator Angle in Temporal Reasoning

何时与多久？时间推理中的读出-中介角度

Shreyas Fadnavis, Praitayini Kanakaraj, Felix Wyss

AI总结通过测量线性探针与模型实际计算子空间之间的角度，发现探针可能学习与模型无关的正交方向，从而揭示基于探针的可解释性存在根本缺陷。

详情

AI中文摘要

线性探针几乎可以完美解码表示，但却可能与模型如何使用该表示完全无关。在语言模型的日历日期持续时间推理中，一个$\\\sin$/ $\\\cos$探针从层的激活中恢复一年中的第几天，但消融其方向对模型的答案没有影响——而在同一层通过分布式对齐搜索（DAS）找到的四维子空间被消融时，性能完全崩溃。我们测量这两个子空间之间的角度——\\emph{读出-中介角度}——发现它与两个随机子空间之间的角度（Haar均匀零假设）无法区分，这意味着探针学到了与模型实际计算正交的方向。逆向工程电路揭示了原因：注意力头通过学习的QK偏移（$\\\pm30$和$\\\pm61$天）路由月份粒度的上下文，然后MLP将\\emph{何时}（绝对日期）转换为\\emph{多久}（持续时间）——所有这些都在探针从未触及的因果子空间的下游。稀疏自编码器分解证实了这种分裂：探针对齐和DAS对齐的特征编码了语义上不相交的概念，因果重叠可忽略不计。这种分离在四个规模（$1.5$-$9\\\,$B）和两个模型家族中重复出现，并在另外两个领域（空间位移、符号算术）有初步证据，表明读出-中介正交性是探针可解释性的一种普遍失败模式。这直接削弱了将探针部署为运行时安全监控的提议：探针可以在模型已悄然放弃的方向上报告高置信度。

英文摘要

A linear probe can decode a representation almost perfectly and yet be completely irrelevant to how the model uses it. On calendar-date duration reasoning in language models, a $\sin$/$\cos$ probe recovers day-of-year from a layer's activations, yet ablating its direction has no effect on the model's answers -- while ablating a four-dimensional subspace found by Distributed Alignment Search (DAS) at the same layer collapses performance entirely. We measure the angle between these two subspaces -- the \emph{readout-mediator angle} -- and find it indistinguishable from the angle between two random subspaces (the Haar-uniform null), meaning the probe has learned a direction orthogonal to the model's actual computation. Reverse-engineering the circuit reveals why: attention heads route month-grained context through learned QK offsets at ${\pm}30$ and ${\pm}61$ days, and MLPs then convert \emph{when} (absolute date) into \emph{how long} (duration) -- all downstream of the causal subspace the probe never touches. Sparse-autoencoder decomposition confirms the split: probe-aligned and DAS-aligned features encode semantically disjoint concepts with negligible causal overlap. The dissociation replicates across four scales ($1.5$-$9\,$B) and two model families, with preliminary evidence on two further domains (spatial displacement, symbolic arithmetic), suggesting that readout-mediator orthogonality is a general failure mode of probe-based interpretability. This directly undermines proposals to deploy probes as runtime safety monitors: the probe can report high confidence on a direction the model has silently abandoned.

URL PDF HTML ☆

赞 0 踩 0

2605.29123 2026-05-29 cs.AI cs.CL

The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models

置信捷径：掩码扩散模型的一种推理失败模式

Dueun Kim, Albert No

AI总结本文发现掩码扩散模型在置信度解码时存在推理失败模式，表现为过早预测局部易解部分而忽略长程依赖，导致复杂输入错误率升高，而随机掩码训练能保持推理轨迹条件。

详情

AI中文摘要

掩码扩散语言模型（MDMs）独特地支持任意顺序生成，其中基于置信度的解码目前作为事实上的标准推理策略。为了优化这一点，最近的训练方案试图直接将训练掩码模式与生成过程中观察到的模式对齐。然而，我们认为基于置信度的解码本质上与复杂推理所需的逻辑流轨迹不一致，并且置信度对齐训练会主动强化这种不一致。我们使用多位加法具体说明这一点，其中解码策略在解决长程依赖之前过早预测局部易解的数字，从而在具有挑战性的输入上产生高置信度错误。虽然传统的随机掩码在此困难尾部上保持低失败率，但置信度对齐训练将错误率放大了一个数量级。在五个不同的推理任务中，同样的模式以任务依赖的严重程度出现：基于置信度的解码在高度复杂的输入上引发失败，而置信度对齐训练则加剧了这些失败。相比之下，随机掩码——尽管被认为效率低下——稳健地保留了解决困难尾部所必需的推理轨迹条件。

英文摘要

Masked diffusion language models (MDMs) uniquely support any-order generation, with confidence-based decoding currently serving as the de facto standard inference policy. To optimize for this, recent training schemes attempt to align training mask patterns directly with those observed during generation. However, we argue that confidence-based decoding is inherently misaligned with the logical-flow trajectories required for complex reasoning, and that confidence-aligned training actively entrenches this misalignment. We make this concrete using multi-digit addition, where the decoding strategy prematurely predicts locally easy digits before resolving their long-range dependencies, producing high-confidence errors on challenging inputs. While traditional random masking keeps the failure rate low on this challenging tail, confidence-aligned training amplifies the error rate by an order of magnitude. Across five distinct reasoning tasks, this same pattern emerges with task-dependent severity: confidence-based decoding induces failures on highly complex inputs, and confidence-aligned training exacerbates them. In contrast, random masking -- despite its perceived inefficiency -- robustly preserves the reasoning-trajectory conditionals essential for solving the challenging tail.

URL PDF HTML ☆

赞 0 踩 0

2605.29122 2026-05-29 cs.CV

Robust Cross-Domain Generalization Using Unlabeled Target Data with Source-Domain Supervision

利用源域监督和无标签目标数据的鲁棒跨域泛化

Yuyue Zhou, Shrimanti Ghosh, Michael, Xie, Justin JY Kim, Jessica Knight, Steel McDonald, Vincent Man, Jacob L. Jaremko, Abhilash Hareendranathan

AI总结针对医学影像AI模型跨设备泛化问题，提出结合目标域无监督预训练（掩码图像建模与对比学习）和源域监督训练的策略，在儿科腕部超声骨折检测中实现超过6%的Dice提升。

详情

AI中文摘要

通常，我们希望将使用密集标注训练的医学影像AI模型泛化到来自不同超声扫描仪或临床站点的数据；然而，使用新标注重新训练这些模型往往困难且成本高昂。我们在儿科腕部骨折评估中研究了这一挑战，使用床旁超声（POCUS），其中骨折常见且可通过超声有效分诊。AI在骨折检测中已展现出放射科医生级别的性能，通常借助高质量骨结构分割。然而，由于显著的域偏移，模型在其他中心或探头的数据上表现不佳，并且由于手动标注工作和数据隐私问题，跨设备获取分割标签不切实际。为了解决这个问题，我们提出了一种目标信息引导的自监督预训练和模型集成策略。具体来说，我们的方法结合了掩码图像建模（MIM）和对比学习，无需标签即可学习目标域结构表示，并引入了一个置信度感知融合头来自适应地集成预测。使用Philips Lumify探头收集的源数据集包含密集标签，而使用TeleMED便携式探头收集的目标数据集未标注。整个过程中数据集严格分离。我们的方法使用带标签的源数据进行监督训练，并利用目标域预训练来提高泛化能力。在来自62个儿科POCUS视频的318张图像上，该方法显著提高了跨设备性能，与基线相比，目标域的Dice提升了超过6%。这些结果展示了一种标签高效且保护隐私的跨设备鲁棒超声AI方法，提供了一个可扩展到多中心研究或联邦学习设置的框架。

英文摘要

It is often desirable to generalize medical imaging AI models trained with dense annotations to data acquired from different ultrasound scanners or clinical sites; however, retraining these models with new annotations is often difficult and costly. We examine this challenge in pediatric wrist fracture assessment using point-of-care ultrasound (POCUS), where fractures are common and can be effectively triaged via ultrasound. AI has shown radiologist-level performance for fracture detection, often aided by high-quality bony structure segmentation. However, due to significant domain shifts, models perform poorly on data from other centers or probes, and obtaining segmentation labels across devices is impractical due to manual annotation effort and data privacy concerns. To address this, we propose a target-informed self-supervised pretraining and model-ensemble strategy. Specifically, our approach combines masked image modeling (MIM) and contrastive learning to learn target-domain structural representations without labels, and introduces a confidence-aware infusion head to adaptively integrate predictions. The source dataset, collected with a Philips Lumify probe, contained dense labels, while the target dataset, acquired with a TeleMED portable probe, was unlabeled. The datasets were kept strictly separate throughout the entire process. Our method used labeled source data for supervised training and leveraged target-domain pretraining to improve generalization. On 318 images from 62 pediatric POCUS videos, this approach significantly improved cross-device performance, achieving over 6% Dice improvement on the target domain versus the baseline. These results demonstrate a label-efficient and privacy-preserving approach for cross-device-robust ultrasound AI, offering a framework that can be extended to multi-center studies or federated learning setups.

URL PDF HTML ☆

赞 0 踩 0

2605.29119 2026-05-29 cs.AI

PRO-CUA: Process-Reward Optimization for Computer Use Agents

PRO-CUA: 面向计算机使用代理的过程奖励优化

Yifei He, Rui Yang, Hao Bai, Tong Zhang, Han Zhao

AI总结提出PRO-CUA框架，通过过程奖励模型和逐步骤强化学习，解决计算机使用代理训练中的模仿瓶颈和稀疏奖励问题。

详情

AI中文摘要

计算机使用代理（CUA）在自动化复杂数字工作流方面展现出强大潜力，但其训练仍受限于成本高昂的实时环境交互和有限的高质量监督。现有的过滤行为克隆管道面临模仿瓶颈，包括专家演示的分布偏移和缺乏负学习信号。同时，标准轨迹级强化学习在长程GUI交互中面临稀疏奖励、模糊信用分配和高基础设施成本等问题。在这项工作中，我们提出PRO-CUA，一个用于训练CUA的迭代步骤级强化学习的过程奖励优化框架。PRO-CUA将策略优化与在线环境交互解耦：当前策略通过实时运行收集状态，为每个状态生成多样化的候选动作，从过程奖励模型（PRM）接收步骤级反馈，并通过组相对优势进行优化。这种设计无需依赖黄金答案或离线专家轨迹即可实现密集且灵活的信用分配，同时通过在代理自身的执行状态上训练减少分布偏移。在实时网络基准上的实验证明了PRO-CUA的有效性以及PRM引导的步骤级训练的可靠性。

英文摘要

Computer use agents (CUAs) have shown strong potential for automating complex digital workflows, yet their training remains constrained by costly live environment interaction and limited high-quality supervision. Existing filtered behavior cloning pipelines suffer from imitation bottlenecks, including distribution shift from the expert demonstration and the absence of negative learning signals. Meanwhile, standard trajectory-level reinforcement learning struggles with sparse rewards, ambiguous credit assignment, and high infrastructure costs for long-horizon GUI interaction. In this work, we propose PRO-CUA, a process-reward optimization framework for training CUAs with iterative step-level reinforcement learning. PRO-CUA decouples on-policy environment interaction from policy optimization: the current policy collects states through live rollouts, generates diverse candidate actions for each state, receives step-level feedback from a process reward model (PRM), and is optimized with group-relative advantages. This design enables dense and flexible credit assignment without relying on golden answers or offline expert trajectories, while reducing distribution shift by training on the agent's own execution states. Experiments on live web benchmarks demonstrate the effectiveness of PRO-CUA and the reliability of PRM-guided step-level training.

URL PDF HTML ☆

赞 0 踩 0

2605.29116 2026-05-29 cs.AI

Beyond Consensus: Trace-Level Synthesis in Mixture of Agents

超越共识：混合智能体中的轨迹级合成

Shreyas Fadnavis, Praitayini Kanakaraj, Felix Wyss

AI总结本文提出轨迹级合成方法，通过语义保持输入扰动生成多样化推理轨迹，并利用锚定精炼保证非退化，从而在多数投票失败时仍能恢复正确解，超越基于答案的聚合。

详情

AI中文摘要

当多个LLM智能体解决同一问题时，标准做法是将每个智能体的推理压缩为多数投票或分层合成，将一致性视为终点。我们证明这是不必要的损失：一个读取完整推理轨迹的LLM聚合器即使在智能体一致同意时也能恢复正确解，且有益修正始终超过有害修正——即“聚合悖论”。多数投票存在上限，而扰动多样性无法提高（错误相关性相同）；聚合器的收益来自轨迹级互补性，即从投票丢弃的少数链中组装正确的中间步骤。这些发现促使我们提出自洽混合智能体，通过语义保持输入扰动生成轨迹多样性，通过锚定精炼保护多数并具有可证明的非退化保证，并且始终进行合成——绝不基于共识进行门控。单个模型通过扰动诱导的轨迹变异性在结构化推理、博士级科学、竞赛数学和竞争性编程中优于异构模型池。聚合的单位应是推理轨迹，而非答案。

英文摘要

When multiple LLM agents solve the same problem, standard practice compresses each agent's reasoning into a majority vote or layered synthesis, treating agreement as the finish line. We show this is unnecessarily lossy: an LLM aggregator that reads complete reasoning traces recovers correct solutions even when agents unanimously agree, with beneficial corrections consistently outweighing harmful ones -- the \emph{aggregation paradox}. Majority voting has a ceiling that perturbation diversity does not raise (error correlations are identical); the aggregator's gain comes from trace-level complementarity, assembling correct intermediate steps from minority chains that voting discards. These findings motivate Self-Consistent Mixture of Agents which generates trace diversity through semantic-preserving input perturbations, safeguards the majority via anchored refinement with provable non-degradation guarantees, and always synthesizes -- never gates on consensus. A single model with perturbation-induced trace variation outperforms heterogeneous model pools across structured reasoning, PhD-level science, competition mathematics, and competitive programming. The unit of aggregation should be the reasoning trace, not the answer.

URL PDF HTML ☆

赞 0 踩 0

2605.29108 2026-05-29 cs.LG

Bridging Chemists and AI: An Expert-Augmented Framework for Interpretable Route Evaluation

连接化学家与人工智能：一种专家增强的可解释路线评估框架

Yujia Guo, Mikhail Kabeshov, Tat Hong Duong Le, Samuel Genheden, Marco V. Mijangos, Varvara Voinarvoska, Giulia Bergonzini, Ola Engkvist, Samuel Kaski

AI总结提出一种专家增强的数据驱动评分框架，结合机器学习与化学家领域知识，实现多步合成路线的数值与可解释评估，显著提升预测准确性。

Comments 13 pages, 11 figures, ELLIS Unconference Workshop: Generative Models, LLMs, and the Future of Molecular AI (ML4Molecules 2025)

详情

AI中文摘要

选择高效的多步合成路线是有机合成中的一个核心挑战，特别是在药物化学和工艺化学中，路线选择直接影响可行性、成本和开发效率。数据驱动的评估系统常常过度简化合成设计的多目标性质，并依赖于代理数据集（如专利路线）而非普遍适用的标准。为了解决这一问题，我们引入了一种专家增强的数据驱动评分框架，该框架将机器学习与化学家的领域知识相结合，用于数值和可解释的路线评估。使用参考路线与机器生成路线之间的树编辑距离训练基于DeepSets的模型，然后通过专家评估进行微调，以产生定量分数和可解释的定性类别：好、合理和差。所得系统在类别评估预测上实现了0.78的Spearman相关系数和0.77的Pearson相关系数，在分数预测上实现了60.2%的top-1排名准确率，显著优于之前17.5%的基线水平。

英文摘要

Selecting efficient multi-step synthetic routes is a central challenge in organic synthesis, particularly in medicinal and process chemistry, where route choice directly impacts feasibility, cost, and development efficiency. Data-driven assessment systems often oversimplify the multi-objective nature of synthesis design and rely on proxy datasets, such as patent routes, rather than universally grounded criteria. To address this, we introduce an expert-augmented, data-driven scoring framework that integrates machine learning with chemists' domain knowledge for both numerical and explainable route assessment. A DeepSets-based model is trained using tree edit distance between reference and machine-generated routes, and then fine-tuned with expert evaluations to produce both quantitative scores and interpretable qualitative categories: Good, Plausible, and Bad. The resulting system achieves a Spearman correlation coefficient of 0.78 and a Pearson correlation of 0.77 for category assessment prediction, and 60.2% top-1 ranking accuracy for score prediction, substantially outperforming the previous baseline of 17.5%.

URL PDF HTML ☆

赞 0 踩 0

2605.29101 2026-05-29 cs.LG cs.IT math.IT

Model Merging by Output-Space Projection

通过输出空间投影进行模型合并

Bethan Evans, Benjamin Etheridge, Stephen Roberts, Jared Tanner

AI总结将模型合并形式化为凸二次规划问题，通过校准输入和微调模型输出最小化平方输出校准目标，并推导出预测合并质量的闭式诊断指标。

详情

AI中文摘要

模型合并将多个微调检查点合并为单个多任务模型，无需重新训练。现有方法——如任务算术、模型汤、TIES和DARE——计算高效且经验成功，但依赖于启发式设计选择，缺乏形式化的最优性保证。我们表明，合并可以形式化为关于残差更新的凸二次规划，产生的权重通过校准输入和微调模型输出最小化平方输出校准目标，并将现有方法作为特例包含在内。我们的框架产生一个闭式诊断——选定基捕获的残差能量比例——仅使用校准集即可预测下游合并质量。实验上，QP在单层设置中匹配或优于现有方法，并且我们刻画了最优基相对于更便宜的对角QP提供显著增益的条件。我们通过顺序逐层算法扩展到多层合并，并在语言和视觉基准上展示了一致的增益。

英文摘要

Model merging combines fine-tuned checkpoints into a single multi-task model without retraining. Existing methods - such as task arithmetic, model soups, TIES, and DARE - are computationally efficient and empirically successful, but rely on heuristic design choices and lack formal optimality guarantees. We show that merging can be formulated as a convex quadratic programme over residual updates, yielding weights that minimise a squared-output calibration objective using calibration inputs and fine-tuned model outputs, and subsuming existing methods as special cases. Our framework yields a closed-form diagnostic - the fraction of residual energy captured by a chosen basis - that predicts downstream merge quality using only the calibration set. Empirically, the QP matches or outperforms existing methods in the single-layer setting, and we characterise when the optimal basis provides significant gains over the cheaper diagonal QP. We extend to multi-layer merging via a sequential layer-wise algorithm and demonstrate consistent gains across language and vision benchmarks.

URL PDF HTML ☆

赞 0 踩 0

2605.29098 2026-05-29 cs.CV

Seeing through boxes: Non-Line-of-Sight 3D Reconstruction from Radar Signals

透视箱子：基于雷达信号的非视距三维重建

Jiachen Lu, Hailan Shanbhag, Haitham Al Hassanieh

AI总结提出统一视距与非视距神经几何重建框架GeRaF 2.0，利用外部视距几何约束引导射频信号传播，实现稳定训练和物理一致的重建，在射频几何重建中达到新最优。

Comments Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026

详情

AI中文摘要

从射频信号重建物体几何形状具有根本性挑战，因为射频传感的无透镜成像特性导致低空间分辨率和强噪声。与光信号不同，射频信号可以穿透遮挡物，从而捕获隐藏场景的信息。现有的非视距三维神经重建方法可以恢复封闭环境内的粗糙表面，但常常面临优化不稳定、表面几何噪声大和表面模糊等问题，无法从符号距离场生成精确的零水平集。这些局限性很大程度上源于忽略了封闭区域外视距几何的作用，而视距几何为建模信号传播提供了有价值的物理约束。本文提出统一视距与非视距神经几何重建框架GeRaF 2.0，利用外部视距几何来建模并引导射频信号从视距区域传播到非视距区域。通过将视觉视距先验融入神经场公式，GeRaF 2.0实现了可见和隐藏几何的稳定训练和物理一致重建，在基于射频的几何重建中达到了新的最优水平。

英文摘要

Reconstructing object geometry from radio frequency (RF) signals is fundamentally challenging due to the lensless imaging nature of RF sensing, which leads to low spatial resolution and high noise. Unlike light signals, RF signals can penetrate occlusions and thus capture information about hidden scenes. Existing Non-Line-of-Sight (NLoS) 3D neural reconstruction methods can recover coarse surfaces inside enclosed environments but often suffer from unstable optimization, noisy surface geometry, and surface ambiguity, failing to produce accurate zero-level sets from the signed distance field (SDF). These limitations largely stem from neglecting the role of Line-of-Sight (LoS) geometry outside the enclosed region, which provides valuable physical constraints for modeling signal propagation. In this paper, we introduce a Unified LoS and NLoS neural geometry reconstruction framework GeRaF 2.0 that leverages the outside LoS geometry to model and guide RF propagation from the LoS region into the NLoS region. By integrating visual LoS priors into the neural field formulation, GeRaF 2.0 achieves stable training and physically consistent reconstruction of both visible and hidden geometry, setting a new state-of-the-art in RF-based geometry reconstruction.

URL PDF HTML ☆

赞 0 踩 0

2605.29097 2026-05-29 cs.CV

GeRaF: Neural Geometry Reconstruction from Radio Frequency Signals

GeRaF: 从射频信号进行神经几何重建

Jiachen Lu, Hailan Shanbhag, Haitham Al Hassanieh

AI总结提出GeRaF方法，利用神经隐式学习从射频信号重建近距3D几何，通过滤波渲染、物理射频体渲染和无透镜采样策略解决低分辨率、噪声和镜面反射问题。

Comments Accepted at NeurIPS 2025 (Spotlight)

详情

Journal ref: Advances in Neural Information Processing Systems 38 (2026): 94200-94230

AI中文摘要

GeRaF是首个利用神经隐式学习从射频信号进行近距3D几何重建的方法。与基于RGB或LiDAR的方法不同，射频传感可以穿透遮挡，但由于其无透镜成像特性，存在分辨率低和噪声大的问题。虽然RGB成像中的透镜将采样限制在1D射线上，但射频信号在整个空间中传播，引入显著噪声并导致体渲染的立方复杂度。此外，射频信号通过镜面反射与表面相互作用，需要根本不同的建模。为解决这些挑战，GeRaF (1) 引入基于滤波的渲染以抑制无关信号，(2) 实现基于物理的射频体渲染管线，(3) 提出一种新颖的无透镜采样和无透镜alpha混合策略，使训练期间的全空间采样可行。通过MLP和可训练参数学习符号距离函数、反射率和信号功率，GeRaF迈出了从射频信号在真实环境中重建毫米级几何的第一步。

英文摘要

GeRaF is the first method to use neural implicit learning for near-range 3D geometry reconstruction from radio frequency (RF) signals. Unlike RGB or LiDAR-based methods, RF sensing can see through occlusion but suffers from low resolution and noise due to its lensless imaging nature. While lenses in RGB imaging constrain sampling to 1D rays, RF signals propagate through the entire space, introducing significant noise and leading to cubic complexity in volumetric rendering. Moreover, RF signals interact with surfaces via specular reflections, requiring fundamentally different modeling. To address these challenges, GeRaF (1) introduces filter-based rendering to suppress irrelevant signals, (2) implements a physics-based RF volumetric rendering pipeline, and (3) proposes a novel lensless sampling and lensless alpha blending strategy that makes full-space sampling feasible during training. By learning signed distance functions, reflectiveness, and signal power through MLPs and trainable parameters, GeRaF takes the first step towards reconstructing millimeter-level geometry from RF signals in real-world settings.

URL PDF HTML ☆

赞 0 踩 0

2605.29096 2026-05-29 cs.AI

Trends in AI and Human-AI Interaction in Clinical Trials -- A Hybrid Human-AI Exploration

临床试验中AI与人机交互的趋势——一种混合人机探索

Sandra Woolley, Tim Collins, Khalid Khattak, Illia Chernomorets, Ariane Arevalo, Chris Richardson

AI总结本研究通过分析ClinicalTrials.gov注册库中的记录，揭示了AI相关临床试验的时间趋势和地理分布，并探索了一种混合人机方法用于分析注册试验中的人机交互趋势。

Comments 8 pages plus 2 pages references and appendix

详情

AI中文摘要

本文检查了从ClinicalTrials.gov注册库中检索的记录，以表征AI术语的时间趋势和AI试验的地理分布。该工作还报告了一种探索性的混合人机方法，用于分析注册临床试验中的人机交互趋势。混合工作流程包括前沿生成式AI模型（GPT-5.5）和人工审查，以筛选和分类AI重点搜索返回的记录。研究结果表明，随着时间的推移，AI相关试验显著增加，近期对机器学习、深度学习、聊天机器人、GPTs和大语言模型的引用有所增长。地理上，中国和美国占据了AI相关试验的最大数量，而意大利、法国、西班牙、英国和土耳其（Türkiye）等几个其他国家近期也有显著增长。在100条记录的随机样本中，人类和AI分类器在识别未实质性使用AI的研究方面表现出良好的一致性，但在分类人机交互方面一致性较低，特别是在健康专业人员交互模糊或描述不足的情况下。总体而言，结果表明混合人机筛选临床试验记录是潜在可行的，但更清晰的试验报告和更精确的交互定义将有利于这一过程。

英文摘要

This paper examines records retrieved from the ClinicalTrials.gov registry to characterize temporal trends in AI terminology and the geographical distribution of AI trials. The work also reports on an exploratory hybrid human-AI approach to analyzing human-AI interaction trends in registered clinical trials. The hybrid workflow comprised a frontier generative AI model (GPT-5.5) and human review to screen and categorize records returned by an AI-focused search. The findings indicate a marked increase in AI-related trials over time, with recent growth in references to machine learning, deep learning, chatbots, GPTs, and large language models. Geographically, China and the United States accounted for the largest numbers of AI-related trials, with notable recent increases in several other countries including Italy, France, Spain, the UK and Turkey (Türkiye). In a random sample of 100 records, human and AI classifiers showed good agreement in identifying studies not substantively using AI, but lower agreement in classifying human-AI interaction, particularly where health professional interaction was ambiguous or insufficiently described. Overall, the results suggest that hybrid human-AI screening of clinical trial records is potentially viable, but clearer trial reporting and more precise interaction definitions will benefit the process.

URL PDF HTML ☆

赞 0 踩 0

2605.29092 2026-05-29 cs.CV cs.LG cs.MM

Lightweight Complementary-Cue Fusion for Robust Video Face Forgery Detection

轻量级互补线索融合用于鲁棒视频人脸伪造检测

Sunghwan Baek, Tariq Anwaar, Karanveer Singh, Rita Singh

AI总结提出轻量级融合模块，结合手工特征（小波去噪特征与相位谱或局部二值模式），在极小参数增加下显著提升视频人脸伪造检测的鲁棒性。

Comments 13 pages, 6 figures, 3 tables

详情

AI中文摘要

当前的人脸视频伪造检测器使用宽或双流骨干网络。我们证明，通过单个轻量级融合两个手工线索，可以在更小的模型下实现更高的准确率。基于Xception基线模型（2190万参数），我们构建了两个检测器：LFWS，它添加一个1x1卷积来结合低频小波去噪特征（WDF）和来自空间相位浅层学习（SPSL）的相位谱通道；以及LFWL，它以相同方式融合WDF和局部二值模式（LBP）。这个额外模块仅增加292个参数，使总参数保持在2190万，小于F3Net（2250万）且不到SRM（5530万）的一半。即使如此小的开销，融合模型在FaceForensics++上将平均曲线下面积（AUC）从74.8%提升至78.6%，在DFDC-Preview上从70.5%提升至74.9%，分别比Xception基线提高3.8%和4.4%。在八个公开基准上，它们也始终优于F3Net、SRM和SPSL，无需额外数据或测试时增强。这些结果表明，通过轻量级融合块精心配对的手工特征，可以以远低于可比频率检测器的成本提供有竞争力的鲁棒性。我们的发现提示需要重新评估人脸视频伪造检测中规模驱动的设计选择。

英文摘要

Current face video forgery detectors use wide or dual-stream backbones. We show that a single, lightweight fusion of two handcrafted cues can achieve higher accuracy with a much smaller model. Based on the Xception baseline model (21.9 million parameters), we build two detectors: LFWS, which adds a 1x1 convolution to combine a low-frequency Wavelet-Denoised Feature (WDF) with a phase-spectrum channel derived from Spatial-Phase Shallow Learning (SPSL), and LFWL, which merges WDF with Local Binary Patterns (LBP) in the same way. This extra module adds only 292 parameters, keeping the total at 21.9 million, smaller than F3Net (22.5 million) and less than half the size of SRM (55.3 million). Even with this minimal overhead, the fused models increase the average area under the curve (AUC) from 74.8% to 78.6% on FaceForensics++ and from 70.5% to 74.9% on DFDC-Preview, gains of 3.8% and 4.4% over the Xception baseline. They also consistently outperform F3Net, SRM, and SPSL in eight public benchmarks, without extra data or test-time augmentation. These results show that carefully paired, handcrafted features, combined through the lightweight fusion block, can provide competitive robustness at a significantly lower cost than comparable frequency-based detectors. Our findings suggest a need to reevaluate scale-driven design choices in face video forgery detection.

URL PDF HTML ☆

赞 0 踩 0

2605.29091 2026-05-29 cs.RO cs.MA

Human-in-the-Loop Swarms: A Bionic Swarm Approach to Real-World Soil Mapping

人在环路的群体：一种用于真实土壤测绘的仿生群体方法

Petras Swissler, Mohammadali Rashidioun, Nicholas Sahu, Raaid Kabir, Ayodeji Aderibigbe, Oladoyin Kolawole

AI总结提出Bionic Swarm系统，通过人类用户替代机器人难以实现的任务，结合蓝牙传感器和集中式服务器运行群体算法，并在真实户外环境中验证了Score-Biased-Search算法，降低了实地群体机器人研究的门槛。

Comments 27 pages, 15 figures. Submitted to Advanced Intelligent Systems

详情

AI中文摘要

由于部署硬件的成本高和开发时间长，群体和现场机器人技术在真实世界验证中面临重大障碍。本文介绍了“Bionic Swarm”，一种新颖的系统，通过抽象出许多难以在机器人上实现但对整体算法评估无贡献的任务，并将这些任务交给人类用户，从而降低了这些障碍。这些人类用户通过智能手机网页应用接收指令，该应用从蓝牙连接的传感器获取测量数据并将其转发到集中式服务器。该服务器运行群体算法并向人类用户指示行动。我们通过实验验证了一种名为Score-Biased-Search的岩土聚焦搜索算法来评估该系统，该算法通过为重建地图上的每个位置分配“分数”，然后通过预期分数较高的区域偏置搜索模式，并表现出相对于搜索代理数量的超线性地图重建。在展示该算法的模拟结果后，我们在Bionic Swarm平台上应用该算法，以验证其在真实户外环境中的功能。这项工作表明，这种人在环路的方法显著降低了现场和群体机器人研究的入门门槛。

英文摘要

Swarm and field robotics face significant barriers to real-world validation due to the high cost and development time to deploy hardware. This paper introduces the ``Bionic Swarm,'' a novel system that lowers these barriers by abstracting away many of the tasks that are difficult to implement on robots but which do not contribute to the overall algorithm evaluation, giving these tasks to human users. These human users take directions from a smartphone web-app that takes measurements from Bluetooth-connected sensors and relays them to a centralized server. This server runs the swarm algorithm and directs actions to the human users. We evaluate this system through the experimental validation of a geotechnically-focused search algorithm named Score-Biased-Search, which functions by assigning a ``score'' to each location on a reconstructed map, then biases search patterns through areas of higher expected scores, and which exhibits superlinear map reconstruction relative to the number of search agents. After presenting simulation results for the algorithm, we then apply the algorithm on the Bionic Swarm platform to validate its function in a real-world, outdoor setting. This work demonstrates that this human-in-the-loop approach significantly lowers the barrier to entry for field and swarm robotics research.

URL PDF HTML ☆

赞 0 踩 0

2605.29089 2026-05-29 cs.LG cs.AI cs.CV

OISD: On-Policy Internal Self-Distillation of Language Models

OISD: 语言模型在策略内部自蒸馏

Xinyu Liu, Darryl Cherian Jacob, Yang Zhou, Jindong Wang, Pan He

AI总结提出OISD框架，通过将最终层的预测信号蒸馏到中间层，结合logit对齐和注意力对齐，提升推理能力，在数学推理任务上显著优于基线。

Comments Under Review for Publication

详情

AI中文摘要

最近的强化学习后训练方法主要使用稀疏的结果级奖励来优化最终输出策略，而很大程度上忽略了中间表示中编码的预测信号。在本文中，我们引入了一种称为在策略内部自蒸馏的新范式，并提出了OISD框架，该框架通过将最终层的在策略预测信号转移到中间表示来改进推理。在展开和组相对策略优化（GRPO）优化过程中，最终层既充当策略，又充当所选中间层的分离内部教师，通过两种互补机制引导中间层与其对齐：logit对齐，传递高级推理行为（如何思考）；注意力对齐，强制从最终层到所选中间层的一致注意力模式（看哪里），两者都不需要外部特权信息。我们的OISD与GRPO一起，采用带符号优势加权的Jensen-Shannon对齐来蒸馏信息丰富的中间表示，同时在统一行动策略下保持策略一致性。实验结果表明了OISD的有效性，在四个数学推理任务上，相对于强推理强化学习基线，取得了显著且一致的改进。代码将在https://github.com/THE-MALT-LAB/OISD发布。

英文摘要

Recent reinforcement learning (RL) post-training approaches primarily optimize the final output policy using sparse outcome-level rewards, while largely overlooking predictive signals encoded in intermediate representations. In this paper, we introduce a new paradigm called on-policy internal self-distillation and propose the OISD framework, which improves reasoning by transferring on-policy predictive signals from the final layer to intermediate representations. During rollout and Group Relative Policy Optimization (GRPO) optimization, the final layer acts as both the policy and a detached internal teacher for selected intermediate layers, which are guided to align with it through two complementary mechanisms: logit alignment, which transfers high-level reasoning behaviors (how to think), and attention alignment, which enforces consistent attention patterns (where to look) from the final layer to the selected intermediate layer, both without requiring external privileged information. Our OISD, together with GRPO, employs signed advantage-weighted Jensen--Shannon alignment to distill informative intermediate representations while preserving policy consistency under a unified acting policy. Experimental results demonstrate the effectiveness of OISD, with substantial and consistent improvements over strong reasoning RL baselines across four mathematical reasoning tasks. The code will be released at https://github.com/THE-MALT-LAB/OISD

URL PDF HTML ☆

赞 0 踩 0

2605.29088 2026-05-29 cs.CV

A Deep Learning Iterative Framework for Sentinel-1 Stripmap Enhancement Based on Azimuth Doppler Decomposition

基于方位向多普勒分解的哨兵一号条带图增强深度学习迭代框架

Juan Francisco Amieva, Christian Ayala, Roberto Del Prete, Mikel Galar

AI总结提出一种基于方位子孔径分解的自监督增强框架，利用子孔径与全孔径图像之间的物理一致性生成训练数据，通过单/多帧学习和迭代推理逐步提升图像质量，在哨兵一号条带模式数据上优于MERLIN方法。

Comments Accepted at the AI4Space Workshop, CVPR 2026

详情

AI中文摘要

合成孔径雷达（SAR）图像能够实现全天候、昼夜地球观测；然而，由于散斑噪声和其他固有成像伪影，其仍难以解释。哨兵一号（S1）是最广泛使用的星载SAR任务之一，提供系统性的全球覆盖、高时间分辨率、双极化成像和免费数据获取。在S1模式中，条带图（SM）提供最高分辨率，但散斑噪声和空间约束常常阻碍需要更精细空间细节的应用。这激发了对有效图像增强策略的需求。在这项工作中，我们提出了一种基于方位子孔径分解的S1 SM图像自监督增强框架。该方法利用子孔径重建与对应全孔径图像之间的物理一致性，生成配对训练数据，无需外部传感器、模拟真值或多时相堆叠。所提框架集成了单帧和多帧学习，并融入迭代推理方案，逐步提升图像质量。在真实S1 SM数据上的实验表明，所提方法在PSNR和SSIM上持续优于广泛采用的自监督深度学习基线MERLIN，而MERLIN获得更高的ENL，凸显了结构保真度与散斑平滑之间的权衡。总体而言，结果表明基于子孔径的监督为使用S1数据的SAR图像增强提供了一种物理基础、可复现且操作可行的方法。值得注意的是，所提方法可扩展到其他SAR平台、极化和采集模式。

英文摘要

Synthetic Aperture Radar (SAR) imagery enables all-weather, day-and-night Earth observation; however, it remains difficult to interpret due to speckle noise and other intrinsic imaging artifacts. Sentinel-1 (S1) constitutes one of the most widely used spaceborne SAR missions, offering systematic global coverage, high temporal resolution, dual-polarization imaging, and free data availability. Among S1 modes, Stripmap (SM) provides the highest resolution, yet speckle noise and spatial constraints often hinder applications requiring finer spatial detail. This motivates the need for effective image enhancement strategies. In this work, we propose a self-supervised enhancement framework for S1 SM imagery based on azimuth subaperture decomposition. The method exploits the physical consistency between subaperture reconstructions and the corresponding full-aperture image to generate paired training data without external sensors, simulated ground truth, or multi-temporal stacks. The proposed framework integrates single- and multi-frame learning and incorporates an iterative inference scheme that progressively refines image quality. Experiments on real S1 SM data show that the proposed approach consistently outperforms the widely adopted self-supervised deep learning baseline MERLIN, in terms of PSNR and SSIM, while MERLIN attains higher ENL, highlighting a trade-off between structural fidelity and speckle smoothing. Overall, the results demonstrate that subaperture-based supervision provides a physically grounded, reproducible, and operationally viable approach for SAR image enhancement using S1 data. It is worth noting that the proposed approach can be extended to other SAR platforms, polarizations, and acquisition modes.

URL PDF HTML ☆

赞 0 踩 0

2605.29087 2026-05-29 cs.AI

The Chain Holds, the Answer Folds: Trace-Answer Dissociation in Reasoning Models Under Adversarial Pressure

链条保持，答案翻转：对抗压力下推理模型中的痕迹-答案分离

Yubo Li, Ramayya Krishnan, Rema Padman

AI总结本研究通过2×2潜在-行为框架发现推理模型在持续对抗压力下出现“不忠实屈服”故障模式，即思维链保持正确但答案错误，并验证了推理通道对此的影响。

详情

AI中文摘要

推理模型在单轮基准测试中评估，但部署在多轮对话中，用户会对正确答案进行反驳。在持续对抗压力下，我们发现了一种先前未记录的故障模式：思维链从第一轮到最后一轮保持事实正确，而输出的答案却翻转错误。我们称此为不忠实屈服（UC），并通过一个$2\times 2$潜在-行为框架将其分离出来，该框架揭示了翻转率指标和单轮忠实度探测均遗漏的问题。在三个数据集（MT-Consistency、MMLU-Pro、GSM8K）上，行为翻转时的潜在正确率在思考模式下聚集在50%附近，在无思考模式下降至11-15%——这是模型内成对因果证据，表明推理造成了这一差距。跨模型而言，该效应与推理通道相关（在Qwen3-32B和GPT-OSS-20B中较高，在内联思维链的Gemma-4-31B-it中较低）。独立的GPT-4o评判者验证了86%的UC标签；令牌级探测显示答案槽的argmax在84%的UC单元中是正确的；而一种朴素的痕迹锚定防御适得其反。我们发布了所有轨迹、痕迹和评判者标签。

英文摘要

Reasoning models are evaluated on single-turn benchmarks but deployed in multi-turn dialogue, where users push back on correct answers. Under sustained adversarial pressure we find a previously undocumented failure mode: the chain-of-thought stays factually correct from first turn to last while the emitted answer flips wrong. We call this unfaithful capitulation (UC) and isolate it with a $2\times 2$ latent-versus-behavioral framework that flip-rate metrics and single-turn faithfulness probes both miss. Across three datasets (MT-Consistency, MMLU-Pro, GSM8K), the latent-correct rate at the behavioral flip clusters near 50% in think mode and collapses to 11-15% under no_think -- paired, within-model causal evidence that reasoning creates the gap. Across models the effect tracks the reasoning channel (high in Qwen3-32B and GPT-OSS-20B, low in inline-CoT Gemma-4-31B-it). An independent GPT-4o judge corroborates $86\%$ of UC labels; a token-level probe shows the answer-slot argmax is correct in $84\%$ of UC cells; and a naive trace-anchored defense backfires. We release all trajectories, traces, and judge labels.

URL PDF HTML ☆

赞 0 踩 0

2605.29084 2026-05-29 cs.CL cs.AI cs.IR

Same Question, Different Source, Different Answer: Auditing Source-Dependence in Medical Multi-Source RAG

同一问题，不同来源，不同答案：审计医学多源RAG中的来源依赖性

Yubo Li, Rema Padman, Ramayya Krishnan

AI总结本文提出来源依赖性作为NLP评估缺失的维度，通过构建移植患者教育基准TransplantQA、分层检索策略HERO-QA和结构化输出评判器，审计多源RAG系统中同一问题因检索来源不同而给出不同答案的失败模式。

详情

AI中文摘要

部署在多作者机构语料库上的检索增强生成（RAG）系统可能会根据检索到的来源对同一问题给出不同的答案——这是主流单一黄金答案范式无法诊断的失败模式。我们认为来源依赖性（source-dependence）是NLP评估缺失的一个维度，审计它意味着将评估单位从答案正确性转移到来源间关系。我们在移植患者教育中具体化了这一点，其中机构来源明显存在分歧，发布了三个工件：TransplantQA，一个真实患者问题的基准，每个问题通过将生成基于多个机构手册作为候选来源来回答；HERO-QA，一种分层检索策略，用于基于和审计每个答案；以及一个结构化输出评判器，根据经过验证的5标签分类法对来源间关系进行评分。在大规模上，更好的检索揭示了比先前估计多得多的分歧——低估了其普遍性，而非强度。该框架是领域无关的，可迁移到法律和教育RAG：测量来源依赖性通常是部署的多源NLP的责任。

英文摘要

A retrieval-augmented generation (RAG) system deployed over a multi-author institutional corpus can give a different answer to the same question depending on which source it retrieves -- a failure mode the dominant single-gold-answer paradigm cannot diagnose. We argue that source-dependence is a missing axis of NLP evaluation, and that auditing it means shifting the unit of evaluation from answer correctness to the inter-source relationship. We make this concrete in transplant patient education, where institutional sources demonstrably disagree, releasing three artefacts: TransplantQA, a benchmark of real patient questions, each answered by grounding generation in multiple institutional handbooks as candidate sources; HERO-QA, a hierarchical retrieval strategy that grounds and audits each answer; and a structured-output judge that scores inter-source relationships on a validated 5-label taxonomy. At scale, better retrieval reveals far more disagreement than prior estimates suggested -- understating its prevalence, not its intensity. The framework is domain-agnostic and transfers to legal and educational RAG: measuring source-dependence is a responsibility for deployed multi-source NLP generally.

URL PDF HTML ☆

赞 0 踩 0

2605.29082 2026-05-29 cs.AI

The Importance of Out-of-Band Metadata for Safe Autonomous Agents: The Redpanda Agentic Data Plane

带外元数据对安全自主智能体的重要性：Redpanda 智能体数据平面

Tyler Akidau, Tyler Rockwood, Johannes Brüderl, Marc Millstone

AI总结针对自主智能体在安全关键元数据处理上的不可靠性，提出基于带外元数据通道的 Redpanda 智能体数据平面架构，实现访问策略、数据分类和行为约束的确定性执行，并通过多智能体投资组合再平衡系统验证其有效性。

Comments 6 pages, 1 figure. Published at SAO '26 (co-located with ACM CAIS '26)

详情

AI中文摘要

AI 智能体越来越被期望作为数字员工运作：访问企业数据、做出决策并自主采取行动。但智能体同时比人类更不可预测——容易产生幻觉、误解和对抗性操纵——并且技术能力更强：具有深度系统知识和高吞吐量接口，能够以机器速度级联损害。这种组合使得依赖智能体忠实地解释或传播安全关键元数据（如访问策略、数据分类和行为约束）变得不安全。我们提出 Redpanda 智能体数据平面（ADP），一种围绕带外元数据通道构建的架构：基础设施路径，确定性携带安全上下文、策略信号和审计轨迹，完全在智能体的读写路径之外，并跨越异构基础设施。这些通道在智能体生命周期的每个阶段强制执行治理——在输入时限定数据访问范围，在执行期间约束操作，并在输出时捕获防篡改记录。我们通过一个多智能体投资组合再平衡系统演示了 ADP，其中自主智能体监控市场、做出交易决策并在隔离的客户账户之间执行订单——每个客户的数据范围、交易批准阈值和防篡改审计轨迹均由智能体既看不见也无法绕过的带外通道强制执行。

英文摘要

AI agents are increasingly expected to operate as digital employees: accessing enterprise data, making decisions, and taking actions autonomously. But agents are simultaneously less predictable than humans -- prone to hallucination, misinterpretation, and adversarial manipulation -- and more technically capable: with deep system knowledge and high-throughput interfaces cascading damage at machine speed. This combination makes it unsafe to rely on agents to faithfully interpret or propagate security-critical metadata such as access policies, data classifications, and behavioral constraints. We present the Redpanda Agentic Data Plane (ADP), an architecture built around out-of-band metadata channels: infrastructure pathways that carry security context, policy signals, and audit trails deterministically, entirely outside the agent's read and write path and across heterogeneous infrastructure. These channels enforce governance at every stage of the agent lifecycle -- scoping data access on the way in, constraining actions during execution, and capturing tamper-proof transcripts on the way out. We demonstrate ADP with a multi-agent portfolio rebalancing system in which autonomous agents monitor markets, make trade decisions, and execute orders across isolated client accounts -- with per-client data scoping, trade approval thresholds, and tamper-proof audit trails all enforced by out-of-band channels the agents can neither see nor bypass.

URL PDF HTML ☆

赞 0 踩 0

2605.29078 2026-05-29 cs.AI cs.LG

Bridging the Sim-to-Real Gap in Reinforcement Learning-Based Industrial Dispatching through Execution Semantics

弥合基于强化学习的工业调度中仿真到现实的鸿沟：通过执行语义

Jonathan Hoss, Noah Klarmann

AI总结提出一个策略无关的执行与测量层，通过构建决策有效快照、定义标准化执行合约并记录结果分歧，将执行不确定性转化为可观测的结构化数据，从而弥合仿真与现实的差距。

Comments Accepted for publication at the 24th IEEE International Conference on Industrial Informatics (INDIN 2026), held from 26 to 29 July 2026 in Melbourne, Australia

详情

AI中文摘要

事件驱动的调度策略越来越多地部署在工业环境中，其中决策是在异步和部分可观测的系统状态下做出的。因此，决策状态在时间上不一致，动作可行性未明确定义，执行错误的根源仍然模糊。这些问题限制了可靠性和可解释性。为弥补这一差距，提出一个策略无关的执行与测量层，用于调解调度策略与工业执行环境。该层从异步事件流构建决策有效快照，定义具有明确动作可行性的标准化执行合约，并将结果记录为策略意图、事务结果、物理执行和人工干预之间的分歧。这使得决策语义与执行行为分离，并使部署不匹配可观测且结构上可归因。使用离散事件仿真评估所提框架。结果表明，在所有观测滞后情况下均具有分析优势，因为未区分的执行失败被转化为具有完全归因覆盖的结构化类型化结果。在低观测滞后下操作优势最强，此时可避免的执行错误可在提交前预防。总体而言，该层将执行不确定性转化为用于评估和策略改进的监督数据。

英文摘要

Event-driven scheduling policies are increasingly deployed in industrial environments, where decisions are made under asynchronous and partially observed system states. As a result, decision states are not temporally consistent, action admissibility is not explicitly defined, and the origin of execution errors remains ambiguous. These issues limit both reliability and interpretability. To address this gap, a policy-neutral execution and measurement layer is proposed to mediate between scheduling policies and the industrial execution environment. The layer constructs decision-valid snapshots from asynchronous event streams, defines a standardized execution contract with explicit action admissibility, and records outcomes as divergences between policy intent, transactional outcomes, physical execution, and human intervention. This enables a separation between decision semantics and execution behavior and makes deployment mismatch observable and structurally attributable. The proposed framework is evaluated using a discrete-event simulation. The results show analytical benefits across all observation lag regimes, as undifferentiated execution failures are transformed into structured, typed outcomes with full attribution coverage. Operational benefits are strongest under low observation lag, where avoidable execution errors can be prevented before commitment. Overall, the layer turns execution uncertainty into supervisory data for evaluation and policy refinement.

URL PDF HTML ☆

赞 0 踩 0

2605.29075 2026-05-29 cs.LG

Knowledge Offloading: Decomposing LLMs into Sparse Backbones and Memory Modules

知识卸载：将大语言模型分解为稀疏骨干和记忆模块

Karim Galliamov, Rochelle Choenni, Ivan Titov

AI总结提出知识卸载（KOFF）框架，通过结构化剪枝和轻量级恢复模块将预训练LLM分解为稀疏共享骨干和领域特定记忆，在约12%全局稀疏度下保持模型性能，并发现语言特定神经元优先被移除。

详情

AI中文摘要

大语言模型将通用能力和领域特定知识编码在同一组参数中。我们探究这种能力是否可以重组：将广泛有用的计算保留在共享骨干中，而将专门知识移入外部记忆模块。我们提出知识卸载（KOFF），一个将预训练LLM分解为稀疏共享骨干和领域特定记忆的框架。从冻结的基础模型开始，我们联合学习结构化剪枝掩码和轻量级恢复模块，这些模块以LoRA适配器和学习型键值缓存的形式实现。在3B到8B的Llama和Qwen模型上，我们发现非平凡的能力可以从共享骨干中移出而不会导致模型能力大幅下降。在大约12%的全局稀疏度下，KOFF保留了未剪枝模型的大部分性能，而剪枝相同冻结模型但没有记忆则性能急剧下降。消融实验表明LoRA和学习型KV记忆是互补的，专门化分析表明学习到的分解是有意义的：语言特定神经元被优先移除，而语言通用神经元主要保留在骨干中。这些结果表明知识可以在共享核心和可交换的外部记忆之间重新分配。

英文摘要

LLMs encode both general capabilities and domain-specific knowledge in a single set of parameters. We ask whether this capacity can be reorganized: keeping broadly useful computation in a shared backbone, while moving specialized knowledge into external memory modules. We propose \emph{knowledge offloading} (KOFF), a framework for decomposing a pretrained LLM into a sparse shared backbone and domain-specific memories. Starting from a frozen base model, we jointly learn a structured pruning mask and lightweight recovery modules, implemented as LoRA adapters and learned key-value caches. Across Llama and Qwen models from 3B to 8B, we find that non-trivial capacity can be moved out of the shared backbone without a large loss in model ability. At around 12\% global sparsity, KOFF preserves much of the unpruned model's performance, while pruning the same frozen model without memories degrades sharply. Ablations show that LoRA and learned KV memories are complementary, and specialization analyses suggest that the learned decomposition is meaningful: language-specific neurons are preferentially removed while language-general neurons largely remain in the backbone. These results suggest that knowledge can be reallocated between a shared core and swappable external memories.

URL PDF HTML ☆

赞 0 踩 0