arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1695
专题追踪
2605.22885 2026-05-25 cs.AI cs.CL cs.LG cs.LO

ImProver 2: Iteratively Self-Improving LMs for Neurosymbolic Proof Optimization

ImProver 2:用于神经符号证明优化的迭代自改进语言模型

Riyaz Ahuja, Tate Rowney, Jeremy Avigad, Sean Welleck

发表机构 * Carnegie Mellon University(卡内基梅隆大学)

AI总结 随着形式化数学库的快速增长,对验证证明的重构和神经证明器训练数据质量的提升需求日益迫切。为解决可扩展性证明优化中面临的异构目标、数据稀缺和高训练推理成本等问题,本文提出ImProver 2,一个用于Lean 4的神经符号框架,结合高效的数据专家迭代流程和形式化结构暴露的轻量非正式抽象框架,并引入一系列衡量证明结构特性的指标。实验表明,该框架能够使小型模型在多个指标上达到与更大模型相当甚至更优的性能,展示了证明优化作为可扩展学习任务的可行性。

详情
AI中文摘要

形式化数学库正在迅速扩展,这产生了对已验证证明进行重构以保持可维护性以及提高神经证明器训练数据质量的日益增长的需求。然而,可扩展的证明优化受到异构且启发式指定的目标、稀缺的数据以及高训练和推理成本的阻碍。为了克服这些挑战,我们引入了ImProver 2,这是一个用于在Lean 4中自动进行证明优化的神经符号框架。ImProver 2将数据高效的专家迭代流程与一个暴露形式结构并附带轻量级非正式抽象的脚手架相结合。我们进一步引入了一套捕捉证明结构属性的指标。使用ImProver 2,我们训练了一个7B参数的模型,该模型在相同模型系列中优于数量级更大的模型,并且在各项指标上与中端前沿模型具有竞争力。我们还证明,我们的神经符号脚手架显著提高了小型和前沿模型的性能。我们表明,通过适当的脚手架和训练,小型模型可以有效地在复杂且多样的指标上重构研究级证明,与更大的系统相匹配,并将证明优化确立为一项可扩展、可学习的任务。

英文摘要

Formal mathematics libraries are rapidly expanding, creating a growing need to refactor verified proofs for maintainability and to improve training data quality for neural provers. However, scalable proof optimization is hindered by heterogeneous and heuristically specified objectives, scarce data, and high training and inference costs. To overcome these challenges, we introduce ImProver 2, a neurosymbolic framework for automated proof optimization in Lean 4. ImProver 2 combines a data-efficient expert-iteration pipeline with a scaffold that exposes formal structure alongside lightweight informal abstractions. We further introduce a suite of metrics capturing structural proof properties. Using ImProver 2, we train a 7B-parameter model that outperforms orders-of-magnitude larger models within the same model family, and is competitive with mid-tier frontier models across metrics. We additionally demonstrate that our neurosymbolic scaffold significantly improves performance across both small and frontier models. We show that with proper scaffolding and training, small models can effectively restructure research-level proofs over complex and varied metrics, matching substantially larger systems and establishing proof optimization as a scalable, learnable task.

2605.22884 2026-05-25 cs.LG cs.AI

Tensor Cache: Eviction-conditioned Associative Memory for Transformers

Tensor Cache: 基于驱逐条件的Transformer联想记忆

Kabir Swain, Sijie Han, Daniel Karl I. Weidele, Mauro Martino, Antonio Torralba

发表机构 * Massachusetts Institute of Technology, Cambridge, MA, USA(麻省理工学院) IBM Research, Cambridge, MA, USA(IBM研究院) University of Toronto, Toronto, Canada(多伦多大学)

AI总结 本文提出了一种名为 Tensor Cache 的两层缓存机制,用于改进 Transformer 模型在长上下文处理中的内存效率与质量。该方法结合了滑动窗口注意力作为第一层缓存(L1),并将被窗口淘汰的键值对压缩存储到第二层缓存(L2)中,通过外积形式的快速权重记忆实现高效召回。研究还揭示了现有训练方法中隐含的虚假外积问题,并提出改进方案,实验表明 Tensor Cache 在多个任务中显著提升了内存与性能的平衡。

详情
AI中文摘要

自回归Transformer的KV缓存随上下文长度线性增长;滑动窗口缓存限制了内存但完全丢弃被驱逐的token,使得窗口外的相关证据变得不可访问。我们引入了\emph{Tensor Cache},一种双层缓存,将滑动窗口softmax注意力作为第一级缓存(L1),与一个固定大小的外积快速权重记忆作为第二级缓存(L2)配对,L2由从窗口中驱逐的KV对提供。最近的token保留在精确的局部注意力中;被驱逐的对被压缩成一个每层矩阵$A$,并通过单个矩阵乘法被未来的查询读取,利用了线性注意力恒等式$q_t(k_i \otimes v_i)=\langle q_t,k_i angle v_i$。一个可学习的标量门融合L1和L2的输出,并且每头的衰减和写入率参数是端到端训练的。外积记忆和读取恒等式是众所周知的;我们的贡献是将其用作仅由滑动窗口驱逐提供的L2缓存,加上识别出常见的分块均值训练捷径$A\!\leftarrow\!λA\!+\!η(ar k\!\otimes\!ar v)$在每个块中静默地引入了$C^2{-}C$个虚假的跨token外积,并通过一个并行的加权和扫描(等价于在float32 epsilon内的每token写入)来弥补这一差距。跨系统规模、受控联想回忆、长上下文语言建模和记忆容量诊断,Tensor Cache在有限状态基线上改善了记忆-质量边界。

英文摘要

Autoregressive Transformer KV caches grow linearly with context length; sliding-window caching bounds memory but discards evicted tokens entirely, so relevant evidence outside the window becomes inaccessible. We introduce \emph{Tensor Cache}, a two-level cache that pairs sliding-window softmax attention as a first-level cache (L1) with a fixed-size outer-product fast-weight memory as a second-level cache (L2) fed by KV pairs evicted from the window. Recent tokens remain in exact local attention; evicted pairs are compressed into a per-layer matrix $A$ and read by future queries through a single matrix multiplication, exploiting the linear-attention identity $q_t(k_i \otimes v_i)=\langle q_t,k_i\rangle v_i$. A learned scalar gate fuses the L1 and L2 outputs, and per-head decay and write-rate parameters are trained end-to-end. The outer-product memory and the read identity are well-known; our contribution is their use as an L2 cache fed exclusively by sliding-window evictions, plus identifying that the common chunked-mean training shortcut $A\!\leftarrow\!λA\!+\!η(\bar k\!\otimes\!\bar v)$ silently introduces $C^2{-}C$ spurious cross-token outer products per chunk, and closing the gap with a parallel weighted-sum scan equivalent to per-token writes within float32 epsilon. Across systems scaling, controlled associative recall, long-context language modeling, and memory-capacity diagnostics, Tensor Cache improves the memory--quality frontier over bounded-state baselines.

2605.22883 2026-05-25 cs.AI cs.LG cs.PF

Energy per Successful Goal: Goal-Level Energy Accounting for Agentic AI Systems

每个成功目标的能量:面向智能体AI系统的目标级能量核算

Deepak Panigrahy, Aakash Tyagi

发表机构 * Independent Researcher(独立研究者) Texas A\&M University(德克萨斯A&M大学) Texas A\&M University Department of Computer Science(德克萨斯A&M大学计算机科学系)

AI总结 当前AI能耗基准通常以单次模型调用或训练运行作为能耗计量单位,但这种方法难以准确反映智能体系统中多步骤任务的能耗情况。本文提出A-LEMS框架,将能耗计量单位从每次推理改为每成功目标的能耗(EpG),并引入调度开销指数(OOI)以量化调度结构对能耗的影响。研究发现,智能体系统完成每项任务的平均能耗是线性基线的4.33倍,且这一差异主要由调度结构而非推理计算引起,表明EpG和OOI为评估智能体AI系统能耗提供了更准确的基准方法。

Comments 34 pages, 16 figures, 10 tables

详情
AI中文摘要

当前的AI能量基准在单次模型调用或训练运行的粒度上测量能耗。对于经典的单轮工作负载,这种单位仍然一致。但对于智能体系统——其中单个用户目标可能触发多步编排、工具调用、重试和故障恢复循环——调用次数是实现产物而非任务属性,推理级归一化错误地表示了目标完成的能量成本。我们提出A-LEMS(智能体LLM能量测量系统),一个跨层测量框架,将AI能量核算单位从每次推理能量重新定义为每个成功目标能量(EpG)。EpG聚合所有执行尝试(包括失败和重试)的总工作流能量,归一化到成功完成的目标。A-LEMS通过时间边界模型、将RAPL信号映射到工作流级能量的五层观测管道,以及将每次测量绑定到硬件和运行时配置的可复现协议,形式化了能量归因。基于EpG,我们定义了编排开销指数(OOI),在相同任务标准下隔离编排相对于线性执行的能耗成本。在五个推理和三个工具增强任务族中,智能体工作流每个成功目标的平均能耗是线性基线的4.33倍(888.1 J vs 205.3 J)。这种开销由编排结构驱动,而非推理计算。对于工具增强任务,OOI反转至低于1.0倍:智能体执行比线性更便宜,确认该指标捕捉了编排结构而非固定的向上偏差。这些发现表明,每次推理能量对于智能体AI是不充分的。EpG和OOI为准确基准测试提供了测量基础,其中编排结构是能耗的主要决定因素。

英文摘要

Current AI energy benchmarks measure consumption at the granularity of a single model invocation or training run. For classical single-turn workloads this unit remains coherent. For agentic systems - where a single user goal may trigger multi-step orchestration, tool calls, retries, and failure-recovery cycles - the invocation count is an implementation artifact rather than a task property, and inference-level normalization misrepresents the energy cost of goal completion. We present A-LEMS (Agentic LLM Energy Measurement System), a cross-layer measurement framework that redefines the unit of AI energy accounting from energy per inference to Energy per Successful Goal (EpG). EpG aggregates total workflow energy across all execution attempts, including failures and retries, normalized by successfully completed goals. A-LEMS formalizes energy attribution through a temporal boundary model, a five-layer observation pipeline mapping RAPL signals to workflow-level energy, and a reproducibility protocol binding every measurement to hardware and runtime configuration. Building on EpG, we define the Orchestration Overhead Index (OOI), isolating the energy cost of orchestration relative to linear execution under identical task criteria. Across five reasoning and three tool-augmented task families, agentic workflows consume 4.33x higher mean energy per successful goal than linear baselines (888.1 J vs 205.3 J). This overhead is driven by orchestration structure, not inference compute. For tool-augmented tasks, OOI inverts below 1.0x: agentic execution is cheaper than linear, confirming the metric captures orchestration structure rather than a fixed upward bias. These findings establish that energy-per-inference is insufficient for agentic AI. EpG and OOI provide the measurement foundation for accurate benchmarking, where orchestration structure is the primary determinant of energy cost.

2605.22880 2026-05-25 cs.CL cs.AI cs.CY

How Far Will They Go? Red-Teaming Online Influence with Large Language Models

它们会走多远?使用大型语言模型对在线影响力进行红队测试

Daniel C. Ruiz, Anna Serbina, Ashwin Rao, Emilio Ferrara, Luca Luceri

发表机构 * Information Sciences Institute University of Southern California(信息科学研究所 乌德穆尔特国立大学)

AI总结 随着基于大语言模型(LLM)的代理越来越多地参与在线讨论,评估其支持政治影响力活动的能力对维护信息完整性至关重要。本文提出了一种实证的“红队”框架,用于测量LLM的奥托窗(Overton Window,OW),即模型在争议性话题上可靠表达的政治观点范围,并量化自然语言越狱技术如何扩展这一范围。研究评估了来自10个模型家族、5个国家的30多个开源LLM,发现其在政治表达上存在系统性偏差,如更倾向于生成左翼内容,且模型规模越大,OW范围越小,不同地区模型表现差异显著。研究还揭示了越狱效果在不同模型家族间差异明显,为识别有效的越狱技术组合提供了参考。

Comments 30 pages, 8 figures, submitted to COLM 2026

详情
AI中文摘要

随着基于大型语言模型(LLM)的代理越来越多地参与在线讨论,对其支持政治影响力活动的能力进行红队测试对于信息完整性至关重要。为实现这一目标,我们专注于本地部署的开源LLM,而非前沿的仅API模型,因为前者更符合在社交媒体环境中部署的注重隐私的恶意行为者的操作约束。我们引入了一个经验性的红队测试框架,用于测量LLM的Overton窗口(OW),即模型在争议话题上能够可靠表达的政治观点范围,并量化简单的自然语言越狱如何扩展该范围。我们评估了来自10个模型家族和5个原产国的30多个LLM。我们发现政治表达存在系统性不对称:开源LLM通常更愿意生成左倾的社交媒体内容,OW往往与模型大小成反比,尽管开源生态系统中的代表性不均,但区域差异显著。越狱效果在不同模型家族之间也差异很大,这促使我们开发一个工作流程来识别有效的越狱技术组合。综合来看,我们的结果建立了一个实用的框架,用于审计开源LLM的政治可操控性,并帮助未来的研究人员设计更强的对策来对抗基于LLM的影响力活动。

英文摘要

As large language model (LLM)-based agents increasingly participate in online discourse, red-teaming their capacity to support political influence campaigns is critical for information integrity. In pursuit of this goal, we focus on locally deployed open-source LLMs, as opposed to frontier API-only models, given their superior alignment with the operational constraints of privacy-conscious malicious actors deployed in social media environments. We introduce an empirical red-teaming framework for measuring LLM Overton Windows (OWs), defined as the range of political opinions a model can reliably express on controversial topics, and for quantifying how simple natural-language jailbreaks expand that range. We evaluate more than 30 LLMs spanning 10 model families and five countries of origin. We find systematic asymmetries in political expressivity: open-source LLMs are typically more willing to generate left-leaning social media content, OWs tend to contract inversely to model size, and regional differences are substantial despite uneven representation in the open-source ecosystem. Jailbreak potency also varies sharply across model families, motivating a workflow for identifying effective combinations of jailbreak techniques. Taken together, our results establish a practical framework for auditing the political steerability of open-source LLMs and for helping future researchers design stronger countermeasures against LLM-enabled influence campaigns.

2605.22878 2026-05-25 cs.AI cs.CL cs.IR cs.LG

SciAtlas: A Large-Scale Knowledge Graph for Automated Scientific Research

SciAtlas:面向自动化科学研究的大规模知识图谱

Shuofei Qiao, Yunxiang Wei, Jiazheng Fan, Bin Wu, Busheng Zhang, Mengru Wang, Yuqi Zhu, Ningyu Zhang, Keyan Ding, Qiang Zhang, Huajun Chen

发表机构 * Zhejiang University(浙江大学) University College London(伦敦大学学院)

AI总结 随着全球学术产出的指数级增长,研究者和人工智能代理面临前所未有的“信息爆炸”挑战,碎片化和非结构化的知识组织阻碍了跨学科的深度融合。为解决这一问题,本文提出 SciAtlas,一个涵盖26个学科、包含4300万篇论文、1.57亿实体和30亿三元组的多学科异构学术知识图谱,旨在构建全景式的科学演进网络。SciAtlas 提供了结构化的拓扑认知基础,打破了学科壁垒,并通过神经符号检索算法实现了从语义匹配到确定性关联发现的转变,为自动化科研全流程提供了高效、低成本的“认知地图”。

Comments Ongoing Work

详情
AI中文摘要

全球学术产出的指数级增长使研究人员和AI代理面临前所未有的“信息爆炸”,其中碎片化和非结构化的知识组织阻碍了深层次的跨学科整合。当前的学术检索工具主要依赖浅层关键词匹配或向量空间语义检索,缺乏导航复杂逻辑连接所需的拓扑推理能力。基于代理的深度研究框架往往容易出现逻辑幻觉并消耗高推理成本。为弥补这一差距,本报告介绍了SciAtlas,一个大规模、多学科、异构的学术资源知识图谱,设计为全景科学演化网络。通过整合来自26个学科的超过4300万篇论文,总计1.57亿个实体和30亿个三元组,SciAtlas提供了一个结构化的拓扑认知基础,打破了学科壁垒,并为AI代理提供了全局视角。此外,我们开发了一种神经符号检索算法,具有三路径协同召回和图重排序,实现了从简单语义匹配到确定性关联发现的无缝过渡。我们还展示了SciAtlas的关键应用方向,包括文献综述、自动化研究趋势综合、想法定位和学术轨迹探索,以证明SciAtlas可以作为有效的“认知地图”,赋能自动化科学研究的全流程,同时显著降低推理成本。我们已在GitHub仓库中发布了知识图谱检索和各种下游任务的接口。

英文摘要

The exponential growth of global academic output has confronted researchers and AI agents with an unprecedented ``information explosion,'' where fragmented and unstructured knowledge organization impedes deep interdisciplinary integration. Current academic retrieval tools predominantly rely on superficial keyword matching or vector-space semantic retrieval, which lack the topological reasoning capabilities required to navigate complex logical connections. Agentic deep-research-based frameworks are often prone to logical hallucinations and consuming high inference costs. To bridge this gap, in this report, we introduce SciAtlas, a large-scale, multi-disciplinary, heterogeneous academic resource knowledge graph designed as a panoramic scientific evolution network. By integrating over 43M papers from 26 disciplines, and a total of 157M entities and 3B triplets, SciAtlas provides a structured topological cognitive substrate that dismantles disciplinary barriers and furnishes AI agents with a global perspective. Furthermore, we develop a neuro-symbolic retrieval algorithm featuring tri-path collaborative recall and graph reranking, achieving a seamless transition from simple semantic matching to deterministic association discovery. We also present key application directions of SciAtlas, including literature review, automated research trend synthesis, idea positioning, and academic trajectory exploration, to demonstrate that SciAtlas can serve as an effective ``cognitive map'' to empower the full loop of automated scientific research while significantly reducing reasoning costs. We have released the interfaces for KG retrieval and various downstream tasks in our GitHub repo.

2605.22876 2026-05-25 cs.LG

WeCon: An Efficient Weight-Conditioned Neural Solver for Multi-Objective Combinatorial Optimization Problems

WeCon: 一种高效的权重条件神经求解器用于多目标组合优化问题

Xuan Wu, Jinbiao Chen, Yang Li, Lijie Wen, Chunguo Wu, Yuanshu Li, Yubin Xiao, Chunyan Miao, You Zhou, Di Wang

发表机构 * Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education(教育部符号计算与知识工程重点实验室) College of Computer Science and Technology, Jilin University(吉林大学计算机科学与技术学院) Department of Industrial Systems Engineering and Management, National University of Singapore(新加坡国立大学工业系统工程与管理系) College of Software, Jilin University(吉林大学软件学院) School of Software, Tsinghua University(清华大学软件学院) School of Computing and Information Systems, Singapore Management University(新加坡管理学院 computing and information systems 系)

AI总结 本文提出了一种高效的权重条件神经求解器WeCon,用于解决多目标组合优化问题。该方法通过设计包含三个注意力模块和门控残差融合块的编码器,增强了实例特征与权重之间的交互,生成更具信息量的权重条件上下文,并在解码器中引入残差融合块以缓解权重信号衰减问题。此外,还提出了高效的偏好优化方法EPO,生成更高质量的解对以提升训练效果。实验表明,WeCon在多个问题规模和分布模式下取得了与当前最优求解器相当的性能,同时推理时间减少了40%。

详情
AI中文摘要

现有的多目标组合优化问题(MOCOP)神经求解器通常采用基于分解的策略,将MOCOP标量化为多个与不同权重向量相关的子问题。然而,它们要么仅在解码过程中注入一次权重,限制了权重条件上下文建模,要么主要在编码过程中注入,导致解码过程中权重信号稀释。此外,偏好优化方法依赖纯随机采样来构建解对以训练求解器,这通常产生信息量较少的解对,从而导致训练效率低下。为了更好地解决这些局限性,我们提出了一种高效的权重条件神经求解器(WeCon)。具体来说,我们设计了一个具有三个注意力块和我们提出的门控残差融合(GRF)块的编码器层,以促进实例特征和权重之间的和谐交互,从而生成信息丰富的权重条件上下文。我们进一步在解码器中引入了一个即插即用的残差融合(RF)块,以减轻权重信号稀释。最后,我们提出了高效偏好优化(EPO),它构建高质量的解,从而生成更多信息量的解对以提高训练效率。在不同问题规模和分布模式下的四个MOCOP变体上的实验表明,WeCon实现了与最先进求解器POCCO-W相当的HyperVolume(HV)值,同时将推理时间减少了40%。消融研究验证了所有设计的贡献。

英文摘要

Existing neural solvers for Multi-Objective Combinatorial Optimization Problems (MOCOPs) commonly adopt decomposition-based strategies that scalarize an MOCOP into multiple subproblems associated with distinct weight vectors. However, they either inject weights only once during decoding, limiting weight-conditioned context modeling, or primarily during encoding, causing weight-signal dilution during decoding. Moreover, preference optimization methods rely on purely random sampling to construct solution pairs for training solvers, which often produces less informative pairs and thus leads to low training effectiveness. To better address these limitations, we propose an efficient Weight-Conditioned neural solver (WeCon). Specifically, we design an encoder layer with three attention blocks and our proposed Gated Residual Fusion (GRF) block to facilitate harmonious interaction between instance features and weights, thereby generating informative weight-conditioned context. We further introduce a plug-and-play Residual Fusion (RF) block in the decoder to alleviate weight-signal dilution. Finally, we propose Efficient Preference Optimization (EPO), which constructs high-quality solutions, thereby generating more informative pairs to improve training effectiveness. Experiments on four MOCOP variants across different problem scales and distribution patterns demonstrate that WeCon achieves HyperVolume (HV) values comparable to SOTA solver POCCO-W, while reducing inference time by 40%. Ablation studies validate the contributions of all designs.

2605.22875 2026-05-25 cs.AI cs.LG

RMA: an Agentic System for Research-Level Mathematical Problems

RMA:一个面向研究级数学问题的智能体系统

Zelin Zhao, Bo Yuan, Jaemoo Choi, Yongxin Chen

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 本文提出了一种名为 **RMA** 的智能代理系统,专门用于解决研究级数学问题。RMA 通过分解问题分析、文献检索、公平比较、知识库构建和证明验证等模块,并由初始化器、提议者和验证者代理协同工作,实现了对复杂数学问题的长期推理和迭代证明优化。实验表明,RMA 在 First Proof 基准测试中表现出色,解决了其中八道难题,其生成的证明在逻辑性和可读性上优于现有强基线模型。

详情
AI中文摘要

我们提出了$ extbf{Research Math Agents (RMA)}$,一个用于研究级数学问题自动推理的智能体框架。与以往专注于竞赛数学或形式化定理证明的研究不同,RMA针对需要长程推理、文献依据和迭代证明改进的研究级数学问题。RMA将研究级证明求解分解为专门模块,包括问题分析、文献搜索与理解、公平比较、知识库构建和证明验证,所有这些都由初始化器、提议器和验证器智能体通过共享的结构化内存协调。在这个统一框架内,这些智能体以多角色、多轮工作流的方式运行,通过迭代反馈协作生成、改进和验证候选证明。我们在First Proof基准上评估了RMA,该基准由来自不同领域的专家数学家贡献的十个研究级问题组成。通过全面的专家评估,RMA在First Proof基准上优于强基线(包括GPT-5.2R和Aletheia),解决了十个研究问题中的八个,并生成了逻辑更合理、可读性更强的证明。我们的全面消融研究进一步表明,性能提升来自于结构化推理模块、迭代改进和基于验证器的反馈之间的交互,而非任何单一组件。我们的解决方案和实现将在论文被接收后公开。

英文摘要

We present $\textbf{Research Math Agents (RMA)}$, an agentic framework for automated reasoning on research-level mathematical problems. Unlike prior studies centered on competition mathematics or formal theorem proving, RMA targets research-level mathematical problems that require long-horizon reasoning, literature grounding, and iterative proof refinement. RMA decomposes research-level proof solving into specialized modules for problem analysis, literature search and understanding, fair comparison, knowledge-bank construction, and proof verification, all coordinated by initializer, proposer, and verifier agents through a shared structured memory. Within this unified framework, these agents operate in a multi-role, multi-round workflow, collaboratively generating, refining, and verifying candidate proofs through iterative feedback. We evaluate RMA on the First Proof benchmark, which consists of ten research-level problems contributed by expert mathematicians across diverse domains. Through comprehensive expert evaluation, RMA outperforms strong baselines on the First Proof benchmark, including GPT-5.2R and Aletheia, solving eight out of ten research problems and producing more logically sound and readable proofs. Our comprehensive ablation studies further show that performance gains arise from the interaction of structured reasoning modules, iterative refinement, and verifier-based feedback, rather than any single component. Our solutions and implementations will be made publicly available upon acceptance.

2605.22874 2026-05-25 cs.AI cs.LO

NeuroNL2LTL: A Neurosymbolic Framework for Natural Language Translation of Linear Temporal Logic

NeuroNL2LTL:用于线性时序逻辑自然语言翻译的神经符号框架

Paapa Kwesi Quansah, Ernest Bonnah

发表机构 * Baylor University(贝勒大学)

AI总结 本文提出了一种神经符号框架 NeuroNL2LTL,用于将自然语言翻译为线性时序逻辑(LTL),旨在解决自然语言与形式逻辑之间转换的可靠性与表达力之间的矛盾。该框架通过中间表示结构化地映射到LTL,并结合形式验证进行语义校验与修复,同时利用验证结果作为强化学习的奖励信号,提升模型的正确性。实验表明,该方法在多个领域的大规模需求数据上实现了较高的语义等价性与验证满足率,并能生成易于专家验证的解释性说明。

详情
AI中文摘要

有效地在自然语言(NL)和形式逻辑(如线性时序逻辑LTL)之间进行转换需要专业知识,这限制了形式验证在安全关键开发中的覆盖范围。基于模板的方法牺牲表达能力换取可靠性;神经方法实现了流畅性但无法提供正确性保证。我们提出了NeuroNL2LTL,一种将学习翻译与形式验证统一起来的神经符号架构。NeuroNL2LTL通过一种中间表示进行翻译,该表示到LTL的映射在结构上是保持的。生成的规约经过可满足性和非平凡性检查;一个最小编辑修复机制在近似的错误输出到达下游工具之前对其进行纠正。核心创新是验证器在环训练:验证结果作为强化学习的奖励信号,使神经组件直接针对形式正确性进行优化。在涵盖航空航天、机器人、自动驾驶汽车及其他十个领域的20万+需求上,NeuroNL2LTL实现了与参考规约28%的语义等价性,同时确保86%的输出被验证为可满足。该系统还能从LTL生成上下文相关的解释,使领域专家无需专门培训即可验证规约。这项工作表明,形式验证可以作为神经规约系统的训练目标和运行时过滤器,使我们能够构建可靠性源于逻辑保证而非统计置信度的基于神经的工具。

英文摘要

Effectively translating between natural language (NL) and formal logics like Linear Temporal Logic (LTL) requires expertise that limits formal verification's reach in safety-critical development. Template-based approaches sacrifice expressiveness for reliability; neural methods achieve fluency but provide no correctness guarantees. We present NeuroNL2LTL, a neurosymbolic architecture unifying learned translation with formal verification. NeuroNL2LTL routes translation through an intermediate representation whose mapping to LTL is structure-preserving by construction. Generated specifications undergo satisfiability and non-triviality checking; a minimal-edit repair mechanism corrects near-miss outputs before they reach downstream tools. The central innovation is verifier-in-the-loop training: verification outcomes serve as reward signals for reinforcement learning, producing neural components that optimize directly for formal correctness. On 200,000+ requirements spanning aerospace, robotics, autonomous vehicles, and ten additional domains, NeuroNL2LTL achieves 28\% semantic equivalence with reference specifications while ensuring 86\% of outputs are verified satisfiable. The system also generates contextually grounded explanations from LTL, enabling domain experts to validate specifications without specialized training. This work demonstrates that formal verification can function as both training objective and runtime filter for neural specification systems, allowing us to build neural-based tools whose reliability derives from logical guarantees rather than statistical confidence.

2605.22872 2026-05-25 cs.LG cs.AI cs.CV

MedExpMem: Adapting Experience Memory for Differential Diagnosis

MedExpMem:适应经验记忆用于鉴别诊断

Qianhan Feng, Zhongzhen Huang, Yakun Zhu, Yannian Gu, Winnie Chiu Wing Chu, Xiaofan Zhang, Qi Dou

发表机构 * The Chinese University of Hong Kong(香港中文大学) Shanghai Jiao Tong University(上海交通大学)

AI总结 本文提出了一种名为 MedExpMem 的经验记忆框架,旨在提升基于视觉-语言模型的医疗诊断代理在鉴别诊断方面的能力。该方法通过记录模型自身在诊断过程中的失败经验,生成包含关键鉴别点、决策规则和推理错误模式的成对鉴别笔记,并采用两阶段构建过程模拟医生的学习过程。实验表明,MedExpMem 在多个放射学子专科基准上有效提升了诊断准确性,验证了其在医疗适应性方面的优越性。

Comments MICCAI 2026 Early Accept. Submission Version

详情
AI中文摘要

经验丰富的医生通过临床实践发展诊断专业知识,不仅获得疾病知识,还能区分易混淆的病症。当前的医学视觉语言模型(VLM)缺乏这种能力——它们的参数编码了静态知识,不会随着诊断经历而演变。我们提出了MedExpMem,一个经验记忆框架,使基于VLM的诊断代理能够积累鉴别诊断专业知识。与检索增强生成(检索百科式疾病描述)不同,MedExpMem记忆从代理自身的诊断失败中获得的判别经验,并将其组织为成对的鉴别笔记,编码关键判别因素、可操作的决策规则和推理错误模式。该框架采用两阶段构建过程,模仿医生的学习:初始实践暴露知识差距,反思性重新诊断完善理解。当遇到新病例时,代理检索经验记忆以指导鉴别推理。我们在涵盖11个亚专业的放射学基准上评估了MedExpMem。结果表明,在不同模型和规模上,准确率持续提升,最高达7.0%。分析实验验证了经验质量和鲁棒性,表明MedExpMem是一种有竞争力的方法,解决了参数学习无法触及的医学适应需求。

英文摘要

Experienced physicians develop diagnostic expertise through clinical practice, acquiring not only disease knowledge but also the ability to differentiate confusable conditions. Current medical vision-language models (VLMs) lack this capability -- their parameters encode static knowledge that does not evolve across diagnostic encounters. We propose MedExpMem, an experience memory framework enabling VLM-based diagnostic agents to accumulate differential diagnosis expertise. Unlike retrieval-augmented generation, which retrieves encyclopedic disease descriptions, MedExpMem memorizes discriminative experience derived from the agent's own diagnostic failures and organizes them as pairwise differential notes encoding key discriminators, actionable decision rules and reasoning error patterns. The framework adopts a two-phase construction process mirroring physician learning: initial practice exposes knowledge gaps, and reflective re-diagnosis refines understanding. When encountering new cases, the agent retrieves experience memory to guide differential reasoning. We evaluate MedExpMem on a radiology benchmark spanning 11 subspecialties. Results demonstrate consistent accuracy improvements, maximum 7.0%, across diverse models and scales. Analytical experiments validate experience quality and robustness, demonstrating MedExpMem as a competitive method addresses medical adaptation needs beyond the reach of parameteric learning.

2605.22871 2026-05-25 cs.LG cs.AI stat.ML

Approximate Machine Unlearning through Manifold Representation Forgetting Guided by Self Mode Connectivity

通过自模式连通性引导的流形表示遗忘实现近似机器遗忘

Weiqi Wang, Zhiyi Tian, Chenhan Zhang, Luoyu Chen, Shui Yu

发表机构 * Xi'an Jiaotong University(西安交通大学) Southeast University(东南大学) University of Technology Sydney(悉尼大学)

AI总结 本文提出了一种名为ManiF-SMC的近似机器遗忘方法,旨在解决现有方法在遗忘效果和学习目标保持之间的平衡问题。该方法基于模型在剩余数据上重训练时的语义相似性分类行为,通过将被遗忘样本从原始流形表示中心推向保留数据的语义邻居,实现近似遗忘。为提升遗忘效果并减少对标签和任务梯度的依赖,ManiF-SMC引入了基于边距的三元组损失和自模式连通模块,以自适应生成遗忘边距,实验表明其在多个数据集上达到了与先进方法相当的遗忘效果。

详情
AI中文摘要

机器遗忘是强制执行被遗忘权的基本机制。现有的依赖标签操作或任务梯度反转的遗忘研究通常遗忘效果有限,且可能破坏原始学习目标,通常不能保证与重新训练的标准遗忘等价。本文提出ManiF-SMC(自模式连通性引导的流形遗忘),其动机是观察到在剩余数据上重新训练的模型倾向于根据保留数据中的语义相似性对擦除样本进行分类。我们首先系统地将近似遗忘重新表述为:将每个擦除样本从其原始学习的流形表示质心推向保留数据中最近的语义邻居。这种重新表述使遗忘与重新训练行为对齐,并且仅在表示空间中操作,减少了对标签和任务特定梯度的依赖。为了解决基于流形表示的遗忘问题,ManiF-SMC将遗忘和表示保留目标封装在基于边界的三元组损失中。由于为遗忘找到合适的边界具有挑战性,我们提出一个自模式连通性模块,快速重建局部流形以指导每个遗忘案例的自适应边界生成。在四个代表性数据集上的大量实验表明,ManiF-SMC在仅操作模型表示空间的情况下,实现了与最先进近似方法相当的遗忘效果。

英文摘要

Machine unlearning is a fundamental mechanism that enforces the right to be forgotten. Existing unlearning studies that rely on label manipulation or task-gradient reversal often deliver limited unlearning effectiveness. Moreover, they can undermine the original learning objective and typically do not guarantee equivalence to standard unlearning by retraining. In this paper, we propose \textbf{ManiF-SMC} (\textbf{Mani}fold \textbf{F}orgetting with \textbf{S}elf \textbf{M}ode \textbf{C}onnectivity), motivated by the observation that a model retrained on the remaining data tends to classify erased samples by their semantic similarity to the retained data. We begin with systematically recasting the approximate unlearning as pushing each erased sample away from its original learned manifold representation centroid toward its nearest semantic neighbors in the retained data. This reformulation aligns unlearning with retraining behavior and operates purely in representation space, reducing reliance on labels and task-specific gradients. To tackle the manifold representation-based unlearning problem, ManiF-SMC encapsulates the unlearning and representation preservation goals in a margin-based triplet loss. Because finding a suitable margin for unlearning is challenging, we propose a self-mode-connectivity module that rapidly reconstructs the local manifold to guide the adaptive margins generation for each unlearning case. Extensive experiments on four representative datasets show that ManiF-SMC achieves unlearning effectiveness comparable to state-of-the-art approximate methods while operating solely within the model's representation space.

2605.22870 2026-05-25 cs.LG cs.AI cs.CL

The Readout Shortcut: Positional Number Copying Dominates Arithmetic CoT Readout in Small Language Models

读出捷径:位置数字复制主导小语言模型中的算术思维链读出

Ming Liu

发表机构 * Amazon(亚马逊)

AI总结 该研究探讨了小型语言模型在进行算术推理时,思维链(CoT)提示的实际作用。研究发现,模型在输出答案时更倾向于复制位于答案分隔符前的最后一个数字,而非依赖中间推理过程。这一“位置捷径”现象显著影响了模型性能,表明当前的CoT方法可能更多依赖位置信息而非逻辑推理。实验还揭示了不同模型在复制行为上的差异,并指出这一机制可能与模型架构及任务类型相关。

Comments 18 pages (8 main + 10 appendix), 3 figures, 5 tables

详情
AI中文摘要

思维链提示对于小语言模型进行算术运算是必要的,然而打乱其步骤仍能保留大部分性能。如果思维链贡献的不是逻辑顺序,那是什么?在三个1-3B指令微调的语言模型上,针对GSM8K数据集,我们通过前缀补全隔离了答案读出阶段,并识别出一个位置捷径:模型复制占据答案分隔符前最后一个位置的数字,无论中间推理如何。正确答案的存在贡献了54-92个百分点的准确率(每个模型教师强制上限的89-92%);即使在错误项上,最终答案与思维链最后一个数字匹配的概率为95-96%。复制通道优先于保留上下文补全:用错误值替换最后一个数字会使准确率降至接近零,尽管中间步骤正确;但移除它后,准确率在该基线之上恢复5-32个百分点——当存在可复制的数字时,即使模型本可以执行的单步算术也被抑制。Qwen和Llama在87-95%的情况下复制新干扰项;Gemma则选择性门控。头部级消融实验揭示了特定于架构的头部集;该效应在GSM-Symbolic上复现。在非算术的BBH任务上,打乱保留率急剧下降;在7-8B规模时,出现了内容选择性门控。步骤级忠实度评估有风险将位置答案传输与真实计算混为一谈——这是基于思维链的监督的一个失败模式。

英文摘要

Chain-of-thought (CoT) prompting is necessary for arithmetic in small language models, yet shuffling its steps preserves most performance. What does CoT contribute if not logical sequencing? In three 1-3B instruction-tuned LMs on GSM8K, we isolate the answer-readout stage via prefix completion and identify a positional shortcut: the model copies whichever number occupies the trailing position before the answer delimiter, regardless of intermediate reasoning. Gold-answer presence accounts for 54-92 pp of accuracy (89-92% of each model's teacher-forcing ceiling); even on incorrect items, the final answer matches the last CoT number 95-96% of the time. The copy channel takes precedence over retained-context completion: replacing the trailing number with a wrong value collapses accuracy to near-zero despite correct intermediates, yet removing it recovers 5-32 pp above that floor--even single-step arithmetic the model can otherwise perform is suppressed when a copyable number is present. Qwen and Llama copy novel distractors 87-95% of the time; Gemma gates selectively. Head-level ablation implicates architecture-specific head sets; the effect replicates on GSM-Symbolic. On non-arithmetic BBH tasks, shuffle retention drops sharply; at 7-8B, content-selective gating emerges. Step-level faithfulness evaluations risk conflating positional answer transport with genuine computation--a failure mode for CoT-based oversight.

2605.22869 2026-05-25 cs.LG

FuRA: Full-Rank Parameter-Efficient Fine-Tuning with Spectral Preconditioning

FuRA: 基于谱预条件的全秩参数高效微调

Yequan Zhao, Ruijie Zhang, Liyan Tan, Niall Moran, Tong Qin, Zheng Zhang

发表机构 * University of California at Santa Barbara(加州大学圣芭芭拉分校) Amazon Lab126(亚马逊实验室126)

AI总结 该论文提出了一种名为FuRA的全秩参数高效微调方法,通过谱预处理技术改进传统微调和参数高效微调方法。FuRA利用全秩奇异值分解对权重矩阵进行重参数化,并固定预训练的奇异基,从而约束更新方向,提升优化稳定性。该方法基于块张量训练分解框架,仅优化紧凑核心和奇异值,实现了与LoRA相当的参数、内存和训练效率,同时在多个任务中表现出更优的性能。

详情
AI中文摘要

全微调(Full FT)和参数高效微调方法(如LoRA)在更新权重时未考虑预训练期间建立的谱结构。因此,来自有限微调数据的噪声梯度可能扰动鲁棒的预训练特征。我们识别出谱预条件是缺失的关键:通过全秩奇异值分解(SVD)重新参数化每个权重矩阵,并冻结一个奇异基,将更新约束到预训练的列空间,从而产生一个预条件优化方案,在相同可训练参数数量下优于无约束的全微调。基于这一见解,我们提出了FuRA(全秩适应),一种基于块张量列车分解 W = LSR 的高效全秩适应框架,其中大核心 L 固定为预训练的块状SVD基,而仅优化紧凑核心 R 和块状奇异值 S。该设计同时提供全秩谱预条件、保持全秩更新表达能力,并实现与LoRA相当的参数、内存和步时间效率。FuRA在多种设置下持续优于全微调,包括LLM微调(LLaMA-3-8B常识推理+1.37)、LLM数学推理强化学习以及VLM的视觉指令微调。此外,4位量化变体QFuRA也优于QLoRA。代码可在 https://github.com/olokevin/FuRA-NIPS 获取。

英文摘要

Both full fine-tuning (Full FT) and parameter-efficient fine-tuning methods such as LoRA introduce weight updates without accounting for the spectral structure established during pretraining. As a result, noisy gradients from limited fine-tuning data can perturb robust pretrained features. We identify spectral preconditioning as the missing ingredient: reparameterizing each weight matrix through its full-rank singular value decomposition (SVD) and freezing one singular basis constrains updates to the pretrained column space, yielding a preconditioned optimization scheme that outperforms unconstrained Full FT at the same trainable parameter count. Building on this insight, we propose FuRA (Full-Rank Adaptation), an efficient full-rank adaptation framework based on a block tensor-train factorization W = LSR, where the large core L is fixed to the pretrained block-wise SVD basis, while only the compact core R and the block-wise singular values S are optimized. This design simultaneously provides full-rank spectral preconditioning, preserves full-rank update expressivity, and achieves parameter, memory, and step-time efficiency comparable to LoRA. FuRA consistently outperforms Full FT across multiple settings, including LLM fine-tuning (+1.37 on LLaMA-3-8B commonsense reasoning), LLM reinforcement learning for mathematical reasoning, and visual instruction tuning for VLMs. Furthermore, the 4-bit quantized variant, QFuRA, also surpasses QLoRA. Code is available at https://github.com/olokevin/FuRA-NIPS

2605.22868 2026-05-25 cs.LG

FusionSense: Tri-Stage Near-Sensor Learning for Runtime-Adaptive Multimodal Edge Intelligence

FusionSense: 用于运行时自适应多模态边缘智能的三阶段近传感器学习

Sanggeon Yun, Ryozo Masukawa, Minhyoung Na, Hyunwoo Oh, Yoshiki Yamaguchi, Wenjun Huang, SungHeon Jeong, Mohsen Imani

发表机构 * University of California, Irvine(加州大学尔湾分校) Kookmin University(韩国国民大学) Shibaura Institute of Technology(武藏技术大学)

AI总结 随着自主系统和智能制造对边缘计算的需求增加,如何在有限的能耗、延迟和可靠性条件下实现运行时自适应的多模态感知成为一个关键问题。FusionSense 提出了一种三阶段的近传感器学习框架,通过融合感知决策来优化计算与通信资源,有效减少冗余传输并提升系统效率。该方法在双模态传感器设置下表现出显著的能效提升和数据压缩性能,相比传统方法在多个指标上均有明显优势。

Comments Accepted to ISLPED 2026

详情
AI中文摘要

自主系统和智能工业部署越来越多地将计算分散在近传感器、边缘和云资源之间,其中严格的能量、延迟和可靠性预算要求运行时自适应性。在实践中,决定在每个点计算和传输什么至关重要;然而,随着多模态传感器套件(相机、LiDAR/深度等)在边缘激增,大多数先前的方法要么(i)在强大的服务器上融合模态,要么(ii)应用忽略跨模态依赖的单模态近传感器滤波器,导致冗余传输或遗漏事件。我们提出FusionSense,一种面向能量受限自主边缘系统的融合感知智能传感框架。轻量级近传感器分类器通过三步过程训练:(i)服务器端融合模型学习下游任务,(ii)过滤安全(FoS)标签量化每个模态相对于融合决策的必要性,(iii)通过注入近传感器预测作为辅助信号来压缩边缘端融合模型。结果是一个运行时决策层,联合减少计算和通信,同时随传感器数量线性扩展。在双模态(RGB+深度/LiDAR)设置下使用SynDrone,FusionSense以比单模态滤波器高得多的数据缩减率维持任务质量,并带来显著的端到端增益:在1% FoI出现率下能耗降低高达33倍,10%下降低11倍,在固定30%数据缩减率下质量损失减少92.3%,并且比最佳先前滤波基线节能约1.5倍。

英文摘要

Autonomous systems and smart-industry deployments increasingly split computation across near-sensor, edge, and cloud resources, where tight energy, latency, and reliability budgets demand run-time adaptivity. In practice, deciding what to compute and transmit at each point is pivotal; yet as multimodal sensor suites (cameras, LiDAR/depth, etc.) proliferate at the edge, most prior approaches either (i) fuse modalities on powerful servers or (ii) apply uni-modal near-sensor filters that ignore cross-modal dependencies, leading to redundant transmissions or missed events. We present FusionSense, a fusion-aware intelligent sensing framework for energy-constrained autonomous edge systems. Lightweight near-sensor classifiers are trained via a three-step procedure: (i) a server-side fusion model learns the downstream task, (ii) filter-out-safe (FoS) labels quantify each modality's necessity relative to the fused decision, and (iii) an edge-side fusion model is compacted by injecting near-sensor predictions as auxiliary signals. The result is a run-time decision layer that jointly reduces compute and communication while scaling linearly with sensor count. On a dual-modality (RGB+Depth/LiDAR) setup with SynDrone, FusionSense sustains task quality at substantially higher data-reduction rates than uni-modal filters and delivers large end-to-end gains: up to 33x lower energy at 1% FoI prevalence, 11x at 10%, a 92.3% reduction in quality loss at a fixed 30% data reduction, and roughly 1.5x higher energy savings than the best prior filtering baseline.

2605.22866 2026-05-25 cs.AI cs.LG

BOHM: Zero-Cost Hierarchical Attribution for Compound AI Systems

BOHM:复合AI系统的零成本层次归因

Joss Armstrong

发表机构 * Ericsson Research(爱立信研究)

AI总结 本文提出了一种名为BOHM的零成本分层归因方法,用于复合AI系统中组件的贡献度分析。该方法直接从系统已有的路由权重中提取分层归因树,无需访问组件内部信息,能够在不同粒度上同时提供多分辨率归因,克服了传统基于Shapley值的方法在第三方API和不透明系统中的评估限制。实验表明,BOHM在多个实际场景中表现出优异的归因性能,且与Shapley方法在路由策略接近最优时结果趋于一致。

Comments 35 pages, 10 figures, 20 tables

详情
AI中文摘要

复合AI系统通过专门组件的层次结构路由任务。归因主要由基于Shapley的方法(SHAP)主导,该方法将联盟价值函数分解为每个组件的边际贡献,并需要在任意组件子集上评估系统。这一要求对于第三方API、不透明端点以及将路由集中在少数工具上的代理编排器而言无法满足,因为从部署的编排器中大多数联盟无法评估。我们引入BOHM,它直接从系统已维护的路由权重中提取层次归因树:叶子归因是根到叶子路由权重的路径乘积;第k层归因是深度k节点上的诱导分布。该方法具有零边际成本,无需访问组件内部,并同时提供每个级别的多分辨率归因,而扁平方法在任何评估预算下都无法提供。BOHM和SHAP回答不同的问题,当部署的路由器接近最优路由时两者收敛。在包含880个LiveCodeBench问题的3级层次结构中的18个LLM上,BOHM的Kendall tau=0.928;SHAP在每次种子进行9000倍更多联盟评估时达到tau=0.980。在一项包含5个驱动器和7个基准的代理研究(35个单元格,完全覆盖)中,驱动器将路由集中在单个工具上(顶部份额中位数0.65),单元格级别的tau(BOHM, SHAP)由驱动器的首选是否为经验上最佳工具预测(平均+0.22 vs ~+0.01)。在美国人口普查层次结构(475个叶子,4级)上,BOHM在每个级别恢复真实排名(tau高达0.722)。BOHM满足效率、单调性、对称性和弱抑制,但不满足Shapley的可加性。它最好被理解为一种互补原语:一种在存在路由状态的任何地方可计算的多分辨率分解,其与Shapley的分歧本身具有诊断意义。

英文摘要

Compound AI systems route tasks through hierarchies of specialised components. Attribution is dominated by Shapley-based methods (SHAP), which decompose a coalition value function into per-component marginal contributions and require evaluation of the system on arbitrary component subsets. That requirement fails for third-party APIs, opaque endpoints, and agentic orchestrators that concentrate routing on a few tools, leaving most coalitions un-evaluable from the deployed orchestrator. We introduce BOHM, which extracts a hierarchical attribution tree directly from the routing weights such systems already maintain: leaf attribution is the path product of root-to-leaf routing weights; level-k attribution is the induced distribution over depth-k nodes. The method has zero marginal cost, requires no access to component internals, and provides multi-resolution attribution at every level simultaneously, which flat methods cannot offer at any evaluation budget. BOHM and SHAP answer different questions and converge when the deployed router routes near-optimally. On 18 LLMs in a 3-level hierarchy over 880 LiveCodeBench problems, BOHM yields Kendall tau=0.928; SHAP reaches tau=0.980 at 9,000x more coalition evaluations per seed. On a 5-driver, 7-benchmark agentic study (35 cells, complete coverage), drivers concentrate routing on a single tool (top-share median 0.65), and cell-level tau(BOHM,SHAP) is predicted by whether the driver's top pick is the empirically best tool (mean +0.22 vs ~+0.01). On a US Census hierarchy (475 leaves, 4 levels), BOHM recovers ground-truth rankings at every level (tau up to 0.722). BOHM satisfies efficiency, monotonicity, symmetry, and weak suppression but not Shapley's additivity. It is best understood as a complementary primitive: a multi-resolution decomposition computable wherever routing state exists, whose disagreement with Shapley is itself diagnostic.

2605.22864 2026-05-25 cs.LG

Reading Calibrated Uncertainty from Language Model Trajectories

从语言模型轨迹中读取校准的不确定性

Aliai Eusebi, Alexander Herzog, Xiaoyu Liang, Marie Vasek, Enrico Mariconti, Lorenzo Cavallaro

发表机构 * University College London, London, United Kingdom(伦敦大学学院)

AI总结 该研究探讨了如何从语言模型生成过程中的内部轨迹中更准确地量化不确定性。不同于传统的最大softmax概率方法,作者提出了一种基于模型各层激活路径的几何特征提取方法,通过稀疏线性探针来捕捉不确定性信息。该方法在选择性拒绝任务中表现优于传统方法,且能揭示不同层在生成过程中如何逐步形成误差,为理解模型不确定性提供了更细粒度的分析视角。

详情
AI中文摘要

最大softmax概率(MSP)是评估结构化输出语言模型生成不确定性量化的默认方法。虽然计算成本低,但通常校准不佳。探针模型内部激活的方法将原始隐藏状态输入不透明分类器,将激活视为静态快照,并隐含了表示形成的逐层轨迹。然而,相似的端点可能源于非常不同的路径,证据如何在深度上累积、增强或反转可能揭示最终概率掩盖的不确定性。我们提取了十一个尺度不变的几何特征,追踪逐层MLP更新的累积路径,并将其输入稀疏线性探针。该探针在选择性弃权下优于MSP,增益随基线校准误差增加而增加,最高达21 AURC点。由于每个特征都有封闭形式的几何意义,探针的系数追踪了错误如何以及沿着深度何处形成——哪些层过早承诺,哪些层与运行状态矛盾,轨迹在何处偏离其端点。

英文摘要

The maximum softmax probability (MSP) represents a default approach when evaluating uncertainty quantification for language model generation with structured output. Although cheap, it is often miscalibrated. Methods that probe the model's internal activations feed raw hidden states into opaque classifiers, reading activations as static snapshots and leaving implicit the layer-wise trajectory by which a representation is formed. Yet, similar endpoints can arise from very different paths, and how evidence accumulates, reinforces, or reverses across depth might reveal uncertainty that final probabilities obscure. We extract eleven scale-invariant geometric features, tracing the cumulative path of per-layer MLP updates, and feed them to a sparse linear probe. The probe outperforms MSP under selective abstention, with gains scaling with baseline miscalibration up to 21 AURC points. Because every feature has a closed-form geometric meaning, the probe's coefficients trace how and where along depth errors take shape -- which layers commit prematurely, which contradict the running state, where trajectories drift away from their endpoint.

2605.22843 2026-05-25 cs.CL cs.IR

Knowledge Distillation for Low-Resource Open-source Text-to-SQL Model

面向低资源开源文本到SQL模型的知识蒸馏

Tianhao Qiu, Xiaojun Chen

发表机构 * Shenzhen University(深圳大学)

AI总结 本文研究了在低资源环境下提升开源文本到SQL模型性能的问题,针对领域特定数据库中高质量标注数据稀缺、模式定义不透明等挑战,提出了一种基于知识蒸馏的文本到SQL框架。该方法构建了包含模式语义、缩写、业务逻辑和查询模式的任务特定知识库,并将其注入训练和推理过程,生成符合上下文的合成数据并增强推理能力。实验表明,该方法在多个基准测试中显著提升了开源和闭源大模型在文本到SQL任务中的表现,特别是在低资源的领域特定场景中。

Comments 17ages, 5 figures

详情
AI中文摘要

文本到SQL将自然语言问题转换为可执行的SQL查询,使非技术用户能够访问关系数据库进行分析和智能数据服务。在现实场景中,性能常受低资源设置的限制,其中高质量的标注<问题, SQL>对稀缺,尤其是针对特定领域的数据库。其他挑战包括不明确的模式定义、缩写以及未在模式中显式编码的隐含业务逻辑。现有的数据合成和提示技术提高了覆盖率,但往往无法生成与数据库约束一致的、任务特定且语义扎实的示例。为应对这些挑战,我们提出一个知识感知的文本到SQL框架,该框架构建任务特定的知识库,包括模式语义、缩写、业务逻辑和查询模式,并将其注入训练和推理中。该框架生成多样化、上下文扎实的合成训练数据,并通过目标知识检索增强推理。在涵盖通用和领域特定数据集的七个基准上的实验表明,我们的方法显著提升了开源和闭源大语言模型在文本到SQL任务中的性能,尤其是在低资源领域特定设置中,增强了泛化性、鲁棒性和适应性。

英文摘要

Text-to-SQL converts natural language questions into executable SQL queries, enabling non-technical users to access relational databases for analytics and intelligent data services. In real-world scenarios, performance is often constrained by low-resource settings, where high-quality annotated \texttt{<question, SQL>} pairs are scarce, particularly for domain-specific databases. Additional challenges include opaque schema definitions, abbreviations, and implicit business logic that are not explicitly encoded in the schema. Existing data synthesis and prompting techniques improve coverage but often fail to produce task-specific, semantically grounded examples aligned with database constraints. To address these challenges, we propose a knowledge-aware Text-to-SQL framework that constructs task-specific knowledge base including schema semantics, abbreviations, business logic, and query patterns, and injects them into both training and inference. This framework generates diverse, contextually grounded synthetic training data and enhances inference through targeted knowledge retrieval. Experiments on seven benchmarks, covering both general and domain-specific datasets, demonstrate that our approach substantially improves the performance of open-source and closed-source large language models in Text-to-SQL tasks, especially in low-resource domain-specific settings, enhancing generalization, robustness, and adaptability.

2605.22828 2026-05-25 cs.CL

A Survey of Text and Speech Resources for Hausa and Fongbe: Availability, Quality, and Gaps for NLP Development

豪萨语和丰贝语文本与语音资源综述:可用性、质量及NLP发展差距

Mahounan Pericles Adjovi, Victor Olufemi, Roald Eiselen, Prasenjit Mitra

发表机构 * Carnegie Mellon University Africa(卡内基梅隆大学非洲分校) Centre for Text Technology, North-West University(文本技术中心,北开普大学)

AI总结 本文综述了用于豪萨语和丰贝语的公开文本和语音资源,分析了它们的可用性、质量及在自然语言处理发展中的不足。豪萨语作为拥有数千万使用者的阿非罗-亚细亚语系语言,拥有较为丰富的文本资源,而丰贝语虽资源较少,但近期在语音数据收集方面有所进展。研究指出了两种语言在资源多样性上的差异,并提出了针对命名实体识别和词性标注任务的改进方向,强调了建立更多领域多样化的丰贝语文本和豪萨语语音语料库的必要性。

Comments 8 pages, 7 tables; survey paper; to appear in IEEE SDS 2026

详情
AI中文摘要

本综述全面编目了两种西非语言的公开文本和语音资源:豪萨语(一种亚非语系语言,约有8000万至1亿使用者)和丰贝语(一种尼日尔-刚果语系语言,在贝宁约有200万人使用)。这两种语言代表了资源可用性谱系上的对比案例。我们探讨的问题是: extit{豪萨语和丰贝语的公开NLP资源现状如何,还存在哪些差距?}通过系统搜索学术库、数据平台和网络资源,我们编目了平行语料库、单语文本集、语音数据集、预训练模型和评估基准。对于每种资源,我们记录了规模、领域覆盖、格式、许可和可访问性。我们的发现表明,豪萨语受益于新闻、百科和教育领域更广泛的文本资源多样性。丰贝语虽然文本资源较为有限,但已成为近期学术语音数据收集项目的重点。两种语言在Masakhane的命名实体识别和词性标注基准中均有代表。我们提供了任务特定建议,并指出了优先差距,包括领域多样的丰贝语文本和专门的豪萨语语音语料库。

英文摘要

This survey provides a comprehensive catalog of publicly available text and speech resources for two West African languages: Hausa, an Afroasiatic language with approximately 80-100 million speakers, and Fongbe, a Niger-Congo language spoken by approximately 2 million people in Benin. These languages represent contrasting cases on the resource availability spectrum. We address the question: \textit{What is the current state of publicly available NLP resources for Hausa and Fongbe, and what gaps remain?} Through systematic search of academic repositories, data platforms, and web sources, we catalog parallel corpora, monolingual text collections, speech datasets, pre-trained models, and evaluation benchmarks. For each resource, we document size, domain coverage, format, licensing, and accessibility. Our findings reveal that Hausa benefits from broader text resource diversity across news, encyclopedic, and educational domains. Fongbe, while having more limited text resources, has been the focus of recent academic speech data collection initiatives. Both languages are represented in Masakhane benchmarks for NER and POS tagging. We provide task-specific recommendations and identify priority gaps including domain-diverse Fongbe text and dedicated Hausa speech corpora.

2605.22826 2026-05-25 cs.CL cs.AI cs.GT cs.MA

Evaluating Large Language Models in a Complex Hidden Role Game

评估大型语言模型在复杂隐藏角色游戏中的表现

Niklas Bauer

发表机构 * University of Göttingen(哥廷根大学)

AI总结 本文研究了大型语言模型(LLMs)在复杂隐藏角色游戏《Secret Hitler》中的推理、说服与欺骗能力,引入了角色识别准确率、欺骗保持率和游戏状态影响率等新型评估指标。通过与基于规则的算法和人类游戏进行对比,发现当前模型在策略深度上仍存在明显不足,且增强推理的技术如思维链提示和内部记忆并未提升模型表现,反而导致部分角色的胜率下降。研究结果表明,现有模型在复杂的多轮操控任务中仍表现欠佳,亟需进一步改进以实现更高级的对齐与安全控制。

Comments Master's thesis, University of Göttingen

详情
AI中文摘要

量化大型语言模型(LLMs)的欺骗潜力对于人工智能安全至关重要,但在非受控环境中难以实现。本文研究了LLMs在社交推理游戏《秘密希特勒》中的推理、说服和欺骗能力。我引入了一个开源框架和新的度量指标来衡量性能:角色识别准确率、欺骗保持率和游戏状态影响率。通过将模型与基于规则的算法和人类游戏进行基准测试,我识别出对话能力与战略深度之间的差距。研究还分析了推理增强技术对胜率和战略推理的影响。无论是思维链提示还是内部记忆,都没有带来性能提升,法西斯角色的胜率甚至下降了23.2%。虽然基于规则的智能体在86.7%的情况下与专家人类投票决策一致,但Llama 3.1 70B等模型仅达到59.7%的准确率。扮演法西斯角色的模型始终产生负面的影响分数,并且无法维持欺骗,导致游戏时间比人类短约40%。这些发现表明,当前的架构在复杂的多轮操纵中仍然无效。随着能力的提升,检测模型何时开始掌握这些欺骗行为至关重要。所开发的框架可作为未来对齐研究的可重复测试平台。

英文摘要

Quantifying the deceptive potential of Large Language Models (LLMs) is critical for AI safety, yet difficult to achieve in uncontrolled environments. This work investigates the reasoning, persuasion, and deceptive capabilities of LLMs within the social deduction game Secret Hitler. I introduce an open-source framework and novel metrics to measure performance: Role Identification Accuracy, Deception Retention Rate, and Game State Impact Rate. By benchmarking models against rule-based algorithms and human games, I identify a gap between conversational ability and strategic depth. The study also analyzes the impact of reasoning-enhancement techniques on win rates and strategic reasoning. Neither Chain-of-Thought prompting nor internal memory bring improvements in performance, with up to 23.2% worse win rates for fascist roles. While rule-based agents align with expert human voting decisions 86.7% of the time, models like Llama 3.1 70B achieve only a 59.7% accuracy. Models playing as Fascists consistently yield negative impact scores and fail to sustain deception, resulting in roughly 40% shorter games compared to humans. These findings suggest that current architectures remain ineffective at complex, multi-turn manipulation. As capabilities advance, detecting when models begin to master these deceptive behaviors is crucial. The developed framework serves as a reproducible testbed for future alignment research.

2605.20519 2026-05-25 cs.SD cs.AI

Codec-Robust Attacks on Audio LLMs

针对音频大语言模型的编解码鲁棒攻击

Jaechul Roh, Jean-Philippe Monteuuis, Jonathan Petit, Amir Houmansadr

发表机构 * University of Massachusetts Amherst(马萨诸塞大学阿默斯特分校) Qualcomm(高通)

AI总结 本文研究了针对音频大语言模型(Audio LLMs)的编码器鲁棒攻击方法,提出了一种名为CodecAttack的新攻击技术。该方法在神经音频编码器的连续潜在空间中优化扰动,而非直接对音频波形进行修改,从而绕过压缩过程对波形扰动的过滤。实验表明,CodecAttack在多种真实压缩场景下表现出显著的攻击成功率,远高于传统波形域攻击方法,揭示了有损压缩并不能有效防御对抗性音频攻击。

详情
AI中文摘要

先前对音频大语言模型(Audio LLMs)的攻击表明,精心设计的波形域扰动可以迫使目标对抗性输出。作为针对这些攻击的防御机制,现实中的编解码压缩预处理已被研究用于检测和移除扰动。然而,现有攻击尚未证明对这些压缩的鲁棒性。我们提出CodecAttack,它在神经音频编解码器的连续潜在空间中优化扰动,而不是直接扰动音频波形。我们表明,编解码器的压缩通道会丢弃波形扰动,但会传输在其自身潜在空间中设计的扰动。为了进一步增强攻击在现实压缩通道中的鲁棒性,我们应用了多比特率直通期望变换(EoT),而无需修改目标模型。在三种现实的音频LLM部署场景和三个目标模型上,CodecAttack在中等比特率下对Opus实现了平均85.5%的目标子串攻击成功率(ASR),而使用相同EoT加固训练的波形基线在任何比特率下均未超过26%。该攻击可迁移到未训练的编解码器,在MP3上达到100% ASR,在AAC-LC上达到84% ASR,无需重新训练。逐频带能量分析表明,潜在扰动集中在4kHz以下,这正是编解码器分配最多比特的区域,而波形基线则扩散到编解码器丢弃的高频区域。这些结果表明,有损压缩不是对抗音频的可靠防御,编解码感知攻击对已部署的音频LLM系统构成了实际威胁。

英文摘要

Prior attacks on Audio Large Language Models (Audio LLMs) demonstrated that carefully crafted waveform-domain perturbations can force targeted adversarial outputs. As a defense mechanism against these attacks, real-world codec compression preprocessing has been studied to both detect and remove the perturbations. Yet no existing attack has demonstrated robustness against these compressions. We introduce CodecAttack, which optimizes a perturbation in a neural audio codec's continuous latent space rather than directly perturbing the audio waveform. We show that the codec's compression channel, which discards waveform perturbations, transmits perturbations crafted in its own latent space. To further harden the attack across real-world compression channels, we apply multi-bitrate straight-through Expectation-over-Transformation (EoT), all without modifying the target model. Across three realistic Audio LLM deployment scenarios and three target models, CodecAttack achieves an average 85.5% target-substring attack success rate (ASR) on Opus at moderate bitrates, while the waveform baseline trained with identical EoT hardening does not exceed 26% at any bitrate. The attack transfers to held-out codecs, reaching up to 100% ASR on MP3 and 84% on AAC-LC without retraining. A per-band energy analysis shows that the latent perturbation concentrates below 4kHz, exactly where codecs allocate the most bits, while the waveform baseline spreads into higher frequencies that codecs discard. These results demonstrate that lossy compression is not a reliable defense against adversarial audio and that codec-aware attacks pose a practical threat to deployed Audio LLM systems.

2605.19859 2026-05-25 cs.CV

Eyes on VLM: Benchmarking Gaze Following and Social Gaze Prediction in Vision Language Models

Eyes on VLM: 视觉语言模型中注视跟随与社会性注视预测的基准测试

Hengfei Wang, Anshul Gupta, Pierre Vuillecard, Jean-Marc Odobez

发表机构 * Idiap Research Institute(Idiap研究机构)

AI总结 本文提出EyeVLM,一个用于评估视觉语言模型(VLMs)在注视理解能力上的系统性框架,重点研究注视追踪和社交注视预测两项任务。通过零样本设置和微调方法,对比多种先进VLM在不同提示策略下的表现,并与纯视觉模型进行系统比较。研究发现,当前VLM在精确理解人类注视行为方面仍存在明显不足,需进一步改进模型和训练策略。

Comments Under review

详情
AI中文摘要

视觉语言模型(VLM)已迅速演变为具有强大零样本泛化能力的通用多模态推理器。在此背景下,VLM 可极大促进人类注视与注意力的分析——这是人类行为理解的核心任务,需要推理物理场景以及活动、交互和社会背景。然而,VLM 能在多大程度上可靠地理解人类注视及相关注意行为仍基本未被探索。本文提出 EyeVLM,一个跨两个互补维度(任务和模型)对 VLM 注视理解能力进行系统评估的框架。为评估注视理解能力,我们聚焦两个核心任务。第一个是注视跟随,即预测一个人注视的二维位置,侧重于几何和视觉处理,需要精确理解人脸、注意力方向、3D 场景结构以及被注视目标的空间定位。第二个是社会性注视预测,需要对多人交互(如相互注视和共享注意)进行社会和关系推理,可能更受益于 VLM 中的 LLM 语义推理能力。在模型方面,EyeVLM 通过两种方式评估这些任务:零样本设置,使用多种最先进的开源和闭源 VLM,探索不同提示策略;以及基于任务特定问答对的微调方法,研究模型规模和数据规模的影响。作为基准,我们依赖现有注视理解数据集,并与最先进的纯视觉模型进行系统比较。总体而言,我们的结果表明,当前 VLM 缺乏精确的注视理解能力。虽然标准训练有助于缩小与视觉模型的差距,但仍需显著改进。

英文摘要

Vision-language models (VLMs) have rapidly evolved into general-purpose multimodal reasoners with strong zero-shot generalization. In this context, VLMs could greatly benefit the analysis of human gaze and attention, a central task in human behavior understanding that requires reasoning about the physical scene as well as the activity, interactions, and social context. However, the extent to which VLMs can reliably understand human gaze and related attentional behaviors remains largely unexplored. In this work, we present EyeVLM, a systematic evaluation framework for gaze understanding in VLMs across two complementary dimensions: tasks and models. To assess gaze understanding capabilities, we focus on two core tasks. The first, gaze following, i.e., predicting the 2D location where a person is looking, has a geometric and visual processing focus, requiring a precise understanding of the human face, attention direction, 3D scene structure, and spatial grounding of attended targets. The second, social gaze prediction, requires social and relational reasoning over multi-person interactions (e.g., mutual gaze and shared attention), and may benefit more from the LLM semantic reasoning capabilities within VLMs. Regarding models, EyeVLM evaluates these tasks in two ways: a zero-shot setting with a diverse set of state-of-the-art open- and closed-source VLMs, exploring different prompting strategies; and a fine-tuning approach based on task-specific QA pairs, studying the impact of model scale and data scale. As benchmarks, we rely on existing gaze understanding datasets and perform a systematic comparison with state-of-the-art purely visual models. Overall, our results show that current VLMs lack precise gaze understanding capabilities. While standard training helps reduce the gap with visual models, significant improvements are still needed.

2605.17212 2026-05-25 cs.LG

Anytime PAC-Bayes for Constrained Density-Ratio Networks under Covariate Shift

协变量偏移下约束密度比网络的任意时间PAC-Bayes

Paulo Akira F. Enabe

发表机构 * Escola Politénica University of São Paulo Department of Structural and Geotechnical Engineering(圣保罗大学理工学院土木与地质工程系)

AI总结 本文提出了一种统一的协变量偏移学习框架,通过约束密度比网络估计Radon-Nikodym导数,并结合PAC-Bayes方法提供任意时间的泛化保证。研究通过改变测度恒等式分解目标风险与重要性加权源风险之间的差距,并利用增强拉格朗日方法强制归一化和矩匹配约束,从而控制有效样本量。实验表明,该框架在真实数据上实现了校准的密度比估计,并优于传统方法,验证了其在协变量偏移场景下的有效性与稳定性。

详情
AI中文摘要

提出一个在协变量偏移下学习的统一框架,其中约束密度比网络逼近Radon-Nikodym导数 $r^\star = dP/dQ$ 并馈入任意时间PAC-Bayes泛化证书。一个测度变换恒等式将目标风险与重要性加权源风险之间的差距分解为由 $\|r_\theta - r^\star\|_{L^2(Q)}$ 控制的比率偏差项和由加权损失变异性控制的泛化差距项。通过增广拉格朗日方案将归一化和矩匹配恒等式作为硬积分约束强制执行,其中二阶矩惩罚控制有效样本量。PAC-Bayes在固定时间机制下实例化于加权风险,得到Bernoulli-KL界,将网络加权Gibbs后验识别为唯一的KL正则化最小化器,并量化学习比率在 $L^2(Q)$ 扰动下的稳定性,然后通过几何剥离增强为在 $t \geq t_{\min}$ 上一致的任意时间证书。一个预先注册的两阶段协议结合了对解析真实性的补丁测试和真实数据部署,验证了该框架:网络产生校准比率,相对于未加权ERM和经典直接比率估计基线降低了目标0/1损失,并达到了任意时间证书。记录了一次固定时间覆盖失败,每次分割的覆盖与标签偏移幅度一一对应,确认了仅协变量假设在操作上是紧的,而非证书的缺陷。

英文摘要

A unified framework for learning under covariate shift is presented, in which a constrained density-ratio network approximates the Radon-Nikodym derivative $r^\star = dP/dQ$ and feeds an anytime PAC-Bayes generalization certificate. A change-of-measure identity decomposes the gap between target risk and importance-weighted source risk into a ratio-bias term governed by $\|r_θ- r^\star\|_{L^2(Q)}$ and a generalization-gap term governed by the variability of the weighted loss. Normalization and moment-matching identities are enforced as hard integral constraints through an augmented-Lagrangian scheme, with a second-moment penalty controlling the effective sample size. PAC-Bayes is instantiated on the weighted risk in a fixed-time regime that yields Bernoulli-KL bounds, identifies the network-weighted Gibbs posterior as the unique KL-regularized minimizer, and quantifies stability under $L^2(Q)$ perturbations of the learned ratio, and is then strengthened by geometric peeling to an anytime certificate uniform in $t \geq t_{\min}$. A pre-registered two-campaign protocol combining a patch test against analytic ground truth with a real-data deployment validates the framework: the network produces calibrated ratios, reduces target $0/1$ loss against unweighted ERM and classical direct ratio-estimation baselines, and attains the anytime certificate. A single fixed-time coverage failure is recorded, with per-split coverage aligning one-to-one with the magnitude of the label shift, confirming that the covariate-only assumption is operationally tight rather than a defect of the certificate.

2605.11416 2026-05-25 cs.CL

Freeze Deep, Train Shallow: Interpretable Layer Allocation for Continued Pre-Training

冻结深层,训练浅层:面向持续预训练的可解释层分配

Yu-Hang Wu, Qin-Yuan Liu, Qiu-Yang Zhao, Bo Jiang, Jiang-Feng Yang, Qing-Wei Cong

发表机构 * Nanhu Research Institute of China Electronic Science and Technology(中国电子科技科技研究院纳米研究所) School of Electronic and Electrical Engineering, Shanghai University of Engineering Science(上海工程技术大学电子电气工程学院)

AI总结 本文研究了如何在低成本下进行大语言模型的持续预训练,提出了一种可解释的层分配策略。通过引入架构无关的诊断框架LayerTracer,揭示了各层表示的演变模式与稳定性,发现深层在任务执行中起关键作用且更具鲁棒性。实验表明,在持续预训练中保持深层冻结而浅层训练的策略优于全参数微调及其他分配方式,并在多个基准测试中表现更优,为资源受限团队提供了低成本、可解释的参数分配指导。

详情
AI中文摘要

选择性逐层更新对于大型语言模型(LLMs)的低成本持续预训练至关重要,但由于缺乏可解释的指导,确定哪些层应冻结或训练仍然是一个经验性的黑盒问题。为了解决这个问题,我们提出了LayerTracer,一个与架构无关的诊断框架,通过定位任务执行位置和量化层敏感性来揭示逐层表示和稳定性的演化模式。分析结果表明,深层作为任务执行的关键区域,并且对干扰更新保持高稳定性。基于这一发现,我们进行了三项受控的持续预训练试验,比较了不同的冻结-训练策略,证明在C-Eval和CMMLU基准测试中,训练浅层同时冻结深层始终优于全参数微调和相反的分配。我们进一步展示了一个混合模型案例研究,验证了将高质量预训练模块放置在深层可以有效保留模型的内在知识。这项工作为资源受限的团队提供了一种低成本且可解释的解决方案,为持续预训练和混合模型构建中的逐层参数分配提供了可操作的指导。

英文摘要

Selective layer-wise updates are essential for low-cost continued pre-training of Large Language Models (LLMs), yet determining which layers to freeze or train remains an empirical black-box problem due to the lack of interpretable guidance. To address this issue, we propose LayerTracer, an architecture-agnostic diagnostic framework that reveals the evolution patterns of layer-wise representations and stability by locating task execution positions and quantifying layer sensitivity. Analysis results reveal that deep layers act as critical regions for task execution and maintain high stability against disruptive updates. Guided by this finding, we conduct three controlled continued pre-training trials to compare diverse freeze-train strategies, demonstrating that training shallow layers while freezing deep layers consistently outperforms full-parameter fine-tuning and the opposite allocation on both C-Eval and CMMLU benchmarks. We further present a hybrid model case study, which validates that placing high-quality pre-trained modules in deep layers effectively preserves inherent knowledge of the model. This work delivers a low-cost and interpretable solution for resource-constrained teams, offering actionable guidance for layer-wise parameter allocation in continued pre-training and hybrid model construction.

2604.19995 2026-05-25 cs.CV

A Computational Model of Message Sensation Value in Short Video Multimodal Features that Predicts Sensory and Behavioral Engagement

短视频多模态特征中信息感知价值的计算模型预测感官与行为参与

Haoning Xue, Jingwen Zhang, Xiaohui Wang, Diane Dagyong Kim, Yunya Song

发表机构 * Department of Communication, University of Utah(犹他大学通讯系) Department of Communication, University of California, Davis(加州大学戴维斯分校通讯系) Department of Media and Communication, City University of Hong Kong(香港城市大学媒体与传播系) Division of Emerging Interdisciplinary Areas, Hong Kong University of Science and Technology(香港科学与技术大学新兴跨学科领域 division)

AI总结 本文提出了一种基于多模态特征计算短视频信息感知价值(MSV)的模型,用于预测用户对短视频的感官和行为参与度。该模型结合多模态特征分析与1200个短视频的人类评估,并在三个短视频平台的14492个未见数据上验证,发现MSV与感官参与呈正相关,但与行为参与呈倒U型关系。研究不仅深化了对短视频用户参与机制的理论理解,也为短视频研究提供了可靠的计算工具。

详情
AI中文摘要

当代媒体环境以耸人听闻的短视频为特征。虽然先前研究考察了单个多模态特征的影响,但多模态特征对短视频观众参与度的集体影响仍然未知。基于信息感知价值(MSV)的理论框架,本研究通过多模态特征分析和对1200个短视频的人工评估,开发并测试了一个MSV计算模型。该模型预测感官和行为参与,并在来自三个短视频平台的两个未见数据集(总计N=14,492)上进一步验证。虽然MSV与感官参与正相关,但与行为参与呈倒U型关系:较高的MSV引起更强的感官刺激,但适度的MSV优化行为参与。这项研究推进了对短视频参与的理论理解,并为短视频研究引入了一个强大的计算工具。

英文摘要

The contemporary media landscape is characterized by sensational short videos. While prior research examines the effects of individual multimodal features, the collective impact of multimodal features on viewer engagement with short videos remains unknown. Grounded in the theoretical framework of Message Sensation Value (MSV), this study develops and tests a computational model of MSV with multimodal feature analysis and human evaluation of 1,200 short videos. This model that predicts sensory and behavioral engagement was further validated across two unseen datasets from three short video platforms (combined N = 14,492). While MSV is positively associated with sensory engagement, it shows an inverted U-shaped relationship with behavioral engagement: Higher MSV elicits stronger sensory stimulation, but moderate MSV optimizes behavioral engagement. This research advances the theoretical understanding of short video engagement and introduces a robust computational tool for short video research.

2604.07813 2026-05-25 cs.AI cs.HC

Agentivism: a learning theory for the age of artificial intelligence

Agentivism:人工智能时代的学习理论

Lixiang Yan, Dragan Gašević

发表机构 * School of Education, Tsinghua University(清华大学教育学院) Faculty of Education and School of Computing & Data Science, The University of Hong Kong(香港大学教育学院及计算与数据科学学院) Faculty of Information Technology, Monash University(墨尔本大学信息技术学院)

AI总结 随着生成式和智能代理AI的兴起,学习条件发生了根本变化,传统学习理论难以解释AI辅助下学习成效与真实理解之间的脱节问题。本文提出“代理主义”(Agentivism)学习理论,强调在AI辅助下,学习是通过选择性委托、对AI输出的监控与验证、重建性内化以及减少支持下的迁移能力实现的持久能力增长。该理论为理解人类与AI协同学习的过程提供了新的理论框架。

详情
AI中文摘要

历史上,当学习条件演变时,学习理论也随之改变。生成式和代理式AI创造了一种新条件,允许学习者将解释、写作、问题解决及其他认知工作委托给能够生成、推荐并有时代表学习者行动的系统。这给学习理论带来了根本性挑战:成功的表现不能再被视为学习的标志。学习者在AI支持下可能有效完成任务,同时发展出更少的理解、更弱的判断力和有限的可迁移能力。我们认为,现有学习理论并未完全捕捉到这一问题。行为主义、认知主义、建构主义和联通主义仍然重要,但它们并未直接解释AI辅助的表现何时转化为持久的人类能力。我们提出Agentivism,一种人机交互的学习理论。Agentivism将学习定义为通过选择性委托给AI、对AI贡献的认知监控与验证、对AI辅助输出的重构内化以及在减少支持下的迁移,实现人类能力的持久增长。Agentivism的重要性在于解释当智能委托变得容易且人机交互成为人类学习持续且不断扩大的部分时,学习如何仍然可能。

英文摘要

Learning theories have historically changed when the conditions of learning evolved. Generative and agentic AI create a new condition by allowing learners to delegate explanation, writing, problem solving, and other cognitive work to systems that can generate, recommend, and sometimes act on the learner's behalf. This creates a fundamental challenge for learning theory: successful performance can no longer be assumed to indicate learning. Learners may complete tasks effectively with AI support while developing less understanding, weaker judgment, and limited transferable capability. We argue that this problem is not fully captured by existing learning theories. Behaviourism, cognitivism, constructivism, and connectivism remain important, but they do not directly explain when AI-assisted performance becomes durable human capability. We propose Agentivism, a learning theory for human-AI interaction. Agentivism defines learning as durable growth in human capability through selective delegation to AI, epistemic monitoring and verification of AI contributions, reconstructive internalization of AI-assisted outputs, and transfer under reduced support. The importance of Agentivism lies in explaining how learning remains possible when intelligent delegation is easy and human-AI interaction is becoming a persistent and expanding part of human learning.

2602.20102 2026-05-25 cs.LG cs.AI

BarrierSteer: LLM Safety via Learning Barrier Steering

BarrierSteer: 通过学习障碍引导实现大语言模型安全

Thanh Q. Tran, Arun Verma, Kiwan Wong, Bryan Kian Hsiang Low, Daniela Rus, Wei Xiao

发表机构 * Department of Computer Science, National University of Singapore(新加坡国立大学计算机科学系) Singapore-MIT Alliance for Research and Technology Centre(新加坡-麻省理工联合研究中心) CSAIL, Massachusetts Institute of Technology(麻省理工学院计算机科学与人工智能实验室) Worcester Polytechnic Institute(沃斯堡理工学院)

AI总结 尽管大语言模型(LLMs)在各种任务中表现出色,但其对对抗性攻击和不安全内容生成的易感性仍然是部署中的重大障碍,尤其是在高风险场景中。为此,本文提出了一种名为 BarrierSteer 的新型推理时框架,通过在模型的潜在表示空间中嵌入学习到的非线性安全约束,提升响应的安全性。该方法将隐藏状态的安全分类器视为控制屏障函数(CBFs),在生成过程中引导不安全的潜在轨迹满足安全约束,从而在不修改模型参数的前提下有效提升安全性,并在多个模型和数据集上验证了其优越性。

Comments This paper introduces SafeBarrier, a framework that enforces safety in large language models by steering their latent representations with control barrier functions during inference, reducing adversarial and unsafe outputs

详情
AI中文摘要

尽管大型语言模型(LLMs)在各种任务中表现出色,但它们对对抗性攻击和不安全内容生成的敏感性仍然是部署的重大障碍,尤其是在高风险场景中。解决这一挑战需要既实际有效又有理论依据的安全机制。在本文中,我们介绍了 BarrierSteer,一种新颖的推理时框架,通过将学习到的非线性安全约束直接嵌入模型的潜在表示空间来提高响应安全性。BarrierSteer 将隐藏状态安全分类器视为控制障碍函数(CBFs),从而在生成过程中引导不安全的潜在轨迹。通过有效的约束合并组合多个安全约束,而不修改底层 LLM 参数,BarrierSteer 保持了模型效用。我们提供的理论结果表明,在潜在空间中应用 CBFs 提供了一种有原则、模块化且计算高效的方法,用于根据学习到的安全约束进行引导,并保证学习到的障碍能够捕捉预期的安全属性。我们在多个模型系列和数据集上的广泛实验结果表明,BarrierSteer 显著降低了对抗性攻击成功率和有害生成,优于现有方法。代码可在我们的 GitHub 仓库中获取。

英文摘要

Despite the strong performance of large language models (LLMs) across diverse tasks, their susceptibility to adversarial attacks and unsafe content generation remains a significant obstacle to deployment, particularly in high-stakes settings. Addressing this challenge requires safety mechanisms that are both practically effective and theoretically grounded. In this paper, we introduce BarrierSteer, a novel inference-time framework that improves response safety by embedding learned nonlinear safety constraints directly into the model's latent representation space. BarrierSteer treats hidden-state safety classifiers as Control Barrier Functions (CBFs), enabling constraint-guided steering of unsafe latent trajectories during generation. By composing multiple safety constraints through efficient constraint merging without modifying the underlying LLM parameters, BarrierSteer preserves model utility. We provide theoretical results showing that applying CBFs in the latent space yields a principled, modular, and computationally efficient approach for steering with respect to learned safety constraints, with guarantees conditional on the learned barriers capturing the intended safety property. Our extensive experimental results across multiple model families and datasets demonstrate that BarrierSteer substantially reduces adversarial attack success rates and unsafe generations, outperforming the existing method. The code is available in our \href{https://github.com/thanhquangtran/BarrierSteer}{GitHub repository}.

2601.21306 2026-05-25 cs.LG cs.AI

The Surprising Difficulty of Search in Model-Based Reinforcement Learning

基于模型的强化学习中搜索的惊人困难

Wei-Di Chang, Mikael Henaff, Brandon Amos, Gregory Dudek, Scott Fujimoto

发表机构 * Meta FAIR McGill University(麦吉尔大学)

AI总结 本文研究了基于模型的强化学习中的搜索问题。传统观点认为长期预测和误差累积是主要障碍,但作者发现搜索并不能简单替代学习到的策略,甚至在模型高度准确时也可能损害性能。研究指出,缓解高估偏差比提升模型或价值函数的准确性更为关键,而通过对一组价值函数取最小值的方法能有效解决这一偏差,从而实现高效的搜索,并在多个基准任务中取得领先性能。

Comments ICML 2026

详情
AI中文摘要

本文研究基于模型的强化学习中的搜索问题。传统观点认为,长期预测和复合误差是基于模型强化学习的主要障碍。我们挑战这一观点,表明搜索并不能简单地替代学习策略。令人惊讶的是,我们发现即使模型高度准确,搜索也可能损害性能。相反,我们表明缓解过估计偏差比提高模型或价值函数精度更重要。基于这一见解,我们确定取价值函数集成的最小值可以有效解决这一偏差并实现有效搜索,在多个流行基准领域取得了最先进的性能。

英文摘要

This paper investigates search in model-based reinforcement learning (RL). Conventional wisdom holds that long-term predictions and compounding errors are the primary obstacles for model-based RL. We challenge this view, showing that search is not a drop-in replacement for a learned policy. Surprisingly, we find that search can harm performance even when the model is highly accurate. Instead, we show that mitigating overestimation bias matters more than improving model or value function accuracy. Building on this insight, we identify that taking the minimum over an ensemble of value functions effectively addresses this bias and enables effective search, achieving state-of-the-art performance across multiple popular benchmark domains.

2601.14652 2026-05-25 cs.AI cs.CL cs.MA

MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks

MAS-Orchestra:通过整体编排和受控基准理解与改进多智能体推理

Zixuan Ke, Yifei Ming, Austin Xu, Ryan Chin, Xuan-Phi Nguyen, Prathyusha Jwalapuram, Jiayu Wang, Semih Yavuz, Caiming Xiong, Shafiq Joty

发表机构 * Salesforce Research(Salesforce研究院) University of Wisconsin-Madison(威斯康星大学麦迪逊分校) Massachusetts Institute of Technology(麻省理工学院)

AI总结 本文提出MAS-Orchestra,一种通过整体编排和受控基准测试来理解和提升多智能体系统(MAS)推理能力的训练框架。该方法将MAS的编排建模为函数调用的强化学习问题,能够一次性生成完整的MAS系统,并通过抽象子智能体为可调用函数,实现对系统结构的全局推理。同时,研究引入MASBENCH基准,从五个维度刻画任务特性,揭示MAS优势依赖于任务结构、验证机制及智能体能力,而非普遍适用。实验表明,MAS-Orchestra在多个基准测试中取得显著提升,效率较现有方法提高十倍以上。

Comments ICML 2026

详情
AI中文摘要

虽然多智能体系统(MAS)通过智能体协调有望提升智能水平,但当前自动MAS设计的方法表现不佳。这些不足源于两个关键因素:(1)方法论复杂性——智能体编排通过顺序的代码级执行进行,限制了全局系统级整体推理,且随智能体复杂性扩展性差;(2)效能不确定性——MAS在未理解相比单智能体系统(SAS)是否有切实益处的情况下被部署。我们提出MAS-Orchestra,一个训练时框架,将MAS编排形式化为具有整体编排的函数调用强化学习问题,一次性生成整个MAS。在MAS-Orchestra中,复杂的、面向目标的子智能体被抽象为可调用函数,从而在隐藏内部执行细节的同时实现系统结构上的全局推理。为了严格研究MAS何时以及为何有益,我们引入了MASBENCH,一个受控基准,沿五个轴表征任务:深度、范围、广度、并行性和鲁棒性。我们的分析揭示,MAS的收益关键取决于任务结构、验证协议以及编排器和子智能体的能力,而非普遍成立。在这些洞察的指导下,MAS-Orchestra在数学推理、多跳问答和基于搜索的问答等公共基准上实现了一致的改进,同时相比强基线实现了超过10倍的效率提升。MAS-Orchestra和MASBENCH共同使得在追求多智能体智能的过程中能够更好地训练和理解MAS。

英文摘要

While multi-agent systems (MAS) promise elevated intelligence through coordination of agents, current approaches to automatic MAS design under-deliver. Such shortcomings stem from two key factors: (1) methodological complexity - agent orchestration is performed using sequential, code-level execution that limits global system-level holistic reasoning and scales poorly with agent complexity - and (2) efficacy uncertainty - MAS are deployed without understanding if there are tangible benefits compared to single-agent systems (SAS). We propose MASOrchestra, a training-time framework that formulates MAS orchestration as a function-calling reinforcement learning problem with holistic orchestration, generating an entire MAS at once. In MAS-Orchestra, complex, goal-oriented subagents are abstracted as callable functions, enabling global reasoning over system structure while hiding internal execution details. To rigorously study when and why MAS are beneficial, we introduce MASBENCH, a controlled benchmark that characterizes tasks along five axes: Depth, Horizon, Breadth, Parallel, and Robustness. Our analysis reveals that MAS gains depend critically on task structure, verification protocols, and the capabilities of both orchestrator and subagents, rather than holding universally. Guided by these insights, MAS-Orchestra achieves consistent improvements on public benchmarks including mathematical reasoning, multi-hop QA, and search-based QA, while achieving more than 10x efficiency over strong baselines. Together, MAS-Orchestra and MASBENCH enable better training and understanding of MAS in the pursuit of multi-agent intelligence.

2508.12043 2026-05-25 cs.RO

Talk Less, Fly Lighter: Autonomous Semantic Compression for UAV Swarm Communication via LLMs

少说,轻飞:基于大语言模型的无人机集群自主语义压缩通信

Fei Lin, Tengchao Zhang, Qinghua Ni, Jun Huang, Siji Ma, Yonglin Tian, Yisheng Lv, Naiqi Wu

发表机构 * Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology(工程科学系,创新工程学院,澳门科学理工学院) State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences(复杂系统管理与控制国家重点实验室,自动化研究所,中国科学院)

AI总结 本文研究了如何利用大语言模型(LLM)实现无人机群在通信带宽受限条件下的自主语义压缩通信。通过构建不同复杂度的二维仿真场景,并设计融合系统提示与任务指令的通信-执行流程,系统评估了九种主流LLM在语义压缩方面的性能,分析其在环境复杂度和群规模变化下的适应性与稳定性。实验表明,基于LLM的无人机群能够在多跳链路条件下实现高效的协作通信。

Journal ref Proc. 2025 21st IEEE International Conference on Mechatronic and Embedded Systems and Applications (MESA), pp. 29-34, 2025

详情
AI中文摘要

大语言模型(LLMs)在无人系统中的快速应用显著增强了无人机(UAV)集群的语义理解和自主任务执行能力。然而,有限的通信带宽和高频交互需求对集群内的语义信息传输提出了严峻挑战。本文探讨了LLM驱动的无人机集群进行自主语义压缩通信的可行性,旨在减少通信负载的同时保留关键任务语义。为此,我们构建了四种具有不同环境复杂度的二维仿真场景,并设计了一个集成系统提示与任务指令提示的通信-执行管道。在此基础上,我们系统评估了九种主流LLM在不同场景下的语义压缩性能,并通过环境复杂度和集群规模的消融实验分析了它们的适应性和稳定性。实验结果表明,基于LLM的无人机集群在带宽受限和多跳链路条件下具有实现高效协同通信的潜力。

英文摘要

The rapid adoption of Large Language Models (LLMs) in unmanned systems has significantly enhanced the semantic understanding and autonomous task execution capabilities of Unmanned Aerial Vehicle (UAV) swarms. However, limited communication bandwidth and the need for high-frequency interactions pose severe challenges to semantic information transmission within the swarm. This paper explores the feasibility of LLM-driven UAV swarms for autonomous semantic compression communication, aiming to reduce communication load while preserving critical task semantics. To this end, we construct four types of 2D simulation scenarios with different levels of environmental complexity and design a communication-execution pipeline that integrates system prompts with task instruction prompts. On this basis, we systematically evaluate the semantic compression performance of nine mainstream LLMs in different scenarios and analyze their adaptability and stability through ablation studies on environmental complexity and swarm size. Experimental results demonstrate that LLM-based UAV swarms have the potential to achieve efficient collaborative communication under bandwidth-constrained and multi-hop link conditions.

2504.09583 2026-05-25 cs.RO cs.AI

AirVista-II: An Agentic System for Embodied UAVs Toward Dynamic Scene Semantic Understanding

AirVista-II:面向动态场景语义理解的具身无人机智能体系统

Fei Lin, Yonglin Tian, Tengchao Zhang, Jun Huang, Sangtian Guan, Fei-Yue Wang

发表机构 * Department of Engineering Science, Faculty of Innovation Engineering, Macau University of Science and Technology(创新工程学院工程科学系,澳门科学技术大学) State Key Laboratory for Management and Control of Complex Systems, Institute of Automation, Chinese Academy of Sciences(复杂系统管理与控制国家重点实验室,中国科学院自动化研究所) State Key Laboratory for Management and Control of Complex Systems, Chinese Academy of Sciences(复杂系统管理与控制国家重点实验室,中国科学院)

AI总结 本文提出了一种名为 AirVista-II 的智能代理系统,旨在提升无人机在动态场景中的语义理解能力。该系统融合了基于代理的任务识别与调度、多模态感知机制以及针对不同时间场景的差异化关键帧提取策略,实现了对动态环境中的关键信息高效捕捉。实验表明,该系统在多种无人机应用场景下能够实现高质量的零样本语义理解,显著提升了无人机自主决策的效率与适应性。

Journal ref Proc. 2025 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 6319-6324, 2025

详情
AI中文摘要

无人机在物流运输和灾难响应等动态环境中日益重要。然而,当前任务通常依赖人类操作员监控航拍视频并做出操作决策。这种人机协作模式在效率和适应性方面存在显著局限性。本文提出AirVista-II——一种面向具身无人机的端到端智能体系统,旨在实现动态场景中的通用语义理解和推理。该系统集成了基于智能体的任务识别与调度、多模态感知机制,以及针对不同时间场景定制的差异化关键帧提取策略,从而高效捕获关键场景信息。实验结果表明,所提系统在零样本设置下,能够在多种基于无人机的动态场景中实现高质量的语义理解。

英文摘要

Unmanned Aerial Vehicles (UAVs) are increasingly important in dynamic environments such as logistics transportation and disaster response. However, current tasks often rely on human operators to monitor aerial videos and make operational decisions. This mode of human-machine collaboration suffers from significant limitations in efficiency and adaptability. In this paper, we present AirVista-II -- an end-to-end agentic system for embodied UAVs, designed to enable general-purpose semantic understanding and reasoning in dynamic scenes. The system integrates agent-based task identification and scheduling, multimodal perception mechanisms, and differentiated keyframe extraction strategies tailored for various temporal scenarios, enabling the efficient capture of critical scene information. Experimental results demonstrate that the proposed system achieves high-quality semantic understanding across diverse UAV-based dynamic scenarios under a zero-shot setting.

2503.20066 2026-05-25 cs.RO cs.CV

Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals

学习场景级有符号方向距离函数:结合椭球先验与神经残差

Zhirui Dai, Hojoon Shin, Yulun Tian, Ki Myung Brian Lee, Nikolay Atanasov

发表机构 * Department of Electrical and Computer Engineering, University of California San Diego(加州大学圣地亚哥分校电气与计算机工程系) Brain Corporation(Brain公司) Robotics Department, University of Michigan(密歇根大学机器人系)

AI总结 本文提出了一种新的神经隐式表示方法——有符号方向距离函数(SDDF),用于解决三维重建和可微渲染中的效率与精度问题。SDDF 以位置和视角方向为输入,直接输出到表面的距离,从而实现高效且精确的几何重建。为提升学习效率,作者结合显式的椭球先验和隐式的神经残差,构建了可微混合表示,有效处理障碍物边界处的距离不连续问题,并在多个指标上优于现有方法。

Journal ref 2026 IEEE Transactions on Pattern Analysis and Machine Intelligence

详情
AI中文摘要

密集重建和可微渲染是3D视觉和计算机图形学中紧密相连的基本操作。最近的神经隐式表示在重建保真度和可微性方面相比传统的离散表示(如网格、点云和体素)展现出显著优势。然而,许多神经隐式模型,如神经辐射场(NeRF)和有符号距离函数(SDF)网络,由于需要沿每条相机射线进行多次查询,渲染效率低下。此外,NeRF和高斯泼溅方法在光度重建方面表现令人印象深刻,但通常需要仔细的监督才能实现精确的几何重建。为了解决这些挑战,我们提出了一种称为有符号方向距离函数(SDDF)的新型表示。与SDF不同,与NeRF类似,SDDF以位置和观察方向作为输入。与SDF类似,与NeRF不同,SDDF直接提供到观察表面的距离,而不是沿视线方向积分。因此,SDDF实现了精确的几何重建和高效的可微方向距离预测。为了高效地学习和预测场景级SDDF,我们开发了一种可微混合表示,结合了显式椭球先验和隐式神经残差。这使得模型能够有效处理障碍物边界周围的距离不连续性,同时保持密集高保真距离预测的能力。通过与最先进表示的广泛评估,我们展示了SDDF实现了(i)有竞争力的SDDF预测精度,(ii)比SDF和NeRF更快的预测速度,以及(iii)与NeRF和高斯泼溅相比更优越的几何一致性。

英文摘要

Dense reconstruction and differentiable rendering are fundamental tightly connected operations in 3D vision and computer graphics. Recent neural implicit representations demonstrate compelling advantages in reconstruction fidelity and differentiability over conventional discrete representations such as meshes, point clouds, and voxels. However, many neural implicit models, such as neural radiance fields (NeRF) and signed distance function (SDF) networks, are inefficient in rendering due to the need to perform multiple queries along each camera ray. Moreover, NeRF and Gaussian Splatting methods offer impressive photometric reconstruction but often require careful supervision to achieve accurate geometric reconstruction. To address these challenges, we propose a novel representation called signed directional distance function (SDDF). Unlike SDF and similar to NeRF, SDDF has a position and viewing direction as input. Like SDF and unlike NeRF, SDDF directly provides distance to the observed surface rather than integrating along the view ray. As a result, SDDF achieves accurate geometric reconstruction and efficient differentiable directional distance prediction. To learn and predict scene-level SDDF efficiently, we develop a differentiable hybrid representation that combines explicit ellipsoid priors and implicit neural residuals. This allows the model to handle distance discontinuities around obstacle boundaries effectively while preserving the ability for dense high-fidelity distance prediction. Through extensive evaluation against state-of-the-art representations, we show that SDDF achieves (i) competitive SDDF prediction accuracy, (ii) faster prediction speed than SDF and NeRF, and (iii) superior geometric consistency compared to NeRF and Gaussian Splatting.