arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2086
专题追踪
2605.05440 2026-05-08 cs.AI

Authorization Propagation in Multi-Agent AI Systems: Identity Governance as Infrastructure

多智能体AI系统中的授权传播:身份治理作为基础设施

Krti Tallam

发表机构 * Kamiwaza AI

AI总结 本文探讨多智能体系统中授权传播问题,提出三个子问题和七项结构要求,强调身份治理应作为基础设施持续评估和设计。

Comments Security and systems paper, 20 pages

详情
AI中文摘要

围绕代理AI的安全讨论主要集中在提示注入,本文认为多智能体系统也存在独特的授权问题:在非人类主体跨变化边界检索数据、委托任务和合成结果时维持授权不变性。我们称此问题为授权传播。它无法简化为提示注入,也不被传统访问控制模型如RBAC、ABAC或ReBAC完全解决。本文将授权传播形式化为工作流层面的属性,识别了三个子问题(传递委托、聚合推断和时间有效性),并推导出多智能体AI系统中授权架构的七项结构要求。最近关于调用绑定能力令牌、任务范围授权信封、依赖图策略执行和执行次数撤销的研究表明,该领域正逐渐聚焦于问题,但尚未达成完整架构。核心观点是身份治理必须作为基础设施:持续评估、在每次交互边界强制执行,并在 orchestration 逻辑扩展前设计进系统。初步实施证据来自一个生产企业AI平台,显示普通系统行为,而非对抗性行动,已产生该模型预测的失败。

英文摘要

The security discussion around agentic AI focuses heavily on prompt injection. This paper argues that multi-agent systems also create a distinct authorization problem: maintaining authorization invariants as non-human principals retrieve data, delegate tasks, and synthesize results across changing boundaries. We call this problem authorization propagation. It is not reducible to prompt injection and is not fully addressed by classical access-control models such as RBAC, ABAC, or ReBAC. The paper formalizes authorization propagation as a workflow-level property, identifies three sub-problems (transitive delegation, aggregation inference, and temporal validity), and derives seven structural requirements for authorization architectures in multi-agent AI systems. Recent work on invocation-bound capability tokens, task-scoped authorization envelopes, dependency-graph policy enforcement, and execution-count revocation demonstrates that the field is converging on the problem, but not yet on a complete architecture. The central claim is that identity governance must be treated as infrastructure: evaluated continuously, enforced at every interaction boundary, and designed into the system before orchestration logic is allowed to scale. Preliminary implementation evidence from a production enterprise AI platform shows that ordinary system behavior, not only adversarial action, already produces the failures this model predicts.

2605.05439 2026-05-08 cs.CV

Safety-Critical Camera Reliability Monitoring for ADAS via Degradation-Aware Uncertainty Pattern Analysis

面向ADAS的安全关键摄像头可靠性监控:通过退化感知不确定性模式分析

Shiva Aher

发表机构 * Georgia Institute of Technology(佐治亚理工学院)

AI总结 本文提出一种主动摄像头可靠性监控框架,通过退化诱导的不确定性模式分析预估感知风险,引入全局传感器健康指数GSHI,利用风险感知的乘法公式聚合退化严重性,实验显示GSHI在严重性增加时单调下降,且在YOLOv8检测故障前提供积极预警。

详情
AI中文摘要

可靠的摄像头输入对于安全关键的ADAS感知至关重要,但大多数监控方法只能在下游性能退化后才检测传感器故障。我们提出了一种主动摄像头可靠性监控框架,该框架通过退化诱导的不确定性模式分析在下游故障变得可观察之前估计感知风险。该方法引入了一个全局传感器健康指数(GSHI),这是一个连续的可靠性评分,利用风险感知的乘法公式聚合每种退化严重性,使得严重的单模式故障如镜头遮挡或运动模糊能够主导健康评分。一个轻量级的多任务网络从单个RGB图像预测退化类型、严重性、GSHI和空间不确定性图,无需下游任务反馈。训练使用物理和几何意识的合成监督,覆盖十二种摄像头退化模式。在KITTI衍生的退化数据上进行的实验表明,GSHI随着严重性单调下降,达到健康估计MAE为0.064,并在YOLOv8检测故障前提供积极的预警时间0.47±0.25严重性单位。GSHI还优于IQA、检测器置信度和干净特征OOD基线,并能零样本迁移至真实的恶劣天气驾驶数据。这些结果支持退化感知不确定性分析作为智能车辆中主动摄像头可靠性监控的实用方向。

英文摘要

Reliable camera input is essential for safety-critical ADAS perception, but most monitoring approaches detect sensor failures only after downstream performance has degraded. We propose a proactive camera reliability monitoring framework that estimates perception risk from degradation-induced uncertainty patterns before downstream failure becomes observable. The method introduces a Global Sensor Health Index (GSHI), a continuous reliability score that aggregates per-degradation severities using a risk-aware multiplicative formulation, allowing severe single-mode failures such as lens occlusion or motion blur to dominate the health estimate. A lightweight multi-task network predicts degradation type, severity, GSHI, and spatial uncertainty maps from a single RGB image without downstream task feedback. Training uses physics- and geometry-aware synthetic supervision over twelve camera degradation modes. Experiments on KITTI-derived degradations show that GSHI decreases monotonically with severity, achieves a health-estimation MAE of 0.064, and provides positive early-warning lead time of 0.47 $\pm$ 0.25 severity units before YOLOv8 detection failure. GSHI also outperforms IQA, detector-confidence, and clean-feature OOD baselines, and transfers zero-shot to real adverse-weather driving data. These results support degradation-aware uncertainty analysis as a practical direction for proactive camera reliability monitoring in intelligent vehicles.

2605.05438 2026-05-08 cs.LG cs.AI

On Semantic Loss Fine-Tuning Approach for Preventing Model Collapse in Causal Reasoning

在因果推理中防止模型崩溃的语义损失微调方法

Pratik Deshmukh, Atirek Gupta

发表机构 * Technical University of Vienna(维也纳技术大学) HCLTech

AI总结 本文提出一种语义损失函数,通过图逻辑约束和动态lambda调度防止因果推理中的模型崩溃,提升模型在transitivity和d-separation任务上的准确率。

Comments 14 pages, 6 figures

详情
AI中文摘要

标准的transformer模型在因果推理任务上的微调会导致灾难性模型崩溃,模型学习到始终预测'是'或'否'的简单解。我们证明在transitivity和d-separation任务上没有语义损失的Gemma 270M微调导致100%崩溃率,模型在学习无因果推理的情况下获得误导性高准确率(73.9%)。我们提出一个具有图逻辑约束和动态lambda调度的语义损失函数,该方法在transitivity任务上达到70.4%的准确率,在d-separation任务上达到68.6%的准确率,比崩溃基线提高了42.7%。对1000个结构推理样本的对抗评估显示,语义模型在67-70%准确率下表现稳定,而崩溃模型在43-71%准确率下彻底失败。通过在200,000多个评估样本上对五个模型变体的全面基准测试,我们验证了语义损失对于transformer稳定因果推理的重要性。

英文摘要

Standard fine-tuning of transformer models on causal reasoning tasks leads to catastrophic model collapse, where models learn trivial solutions such as always predicting "Yes" or "No" regardless of input structure. We demonstrate that fine-tuning Gemma 270M on transitivity and d-separation tasks without semantic loss results in 100% collapse rate, with models achieving misleadingly high accuracy (73.9%) while learning no causal reasoning. We propose a semantic loss function with graph-based logical constraints and dynamic lambda scheduling that prevents this collapse. Our approach achieves 70.4% accuracy on transitivity tasks and 68.6% on d-separation tasks with stable, context-dependent predictions, representing a 42.7% improvement over collapsed baselines. Adversarial evaluation on 1,000 structural reasoning samples shows semantic models achieve 67-70% accuracy while collapsed models fail catastrophically at 43-71%. We validate our findings through comprehensive benchmarking on 200,000+ evaluation samples across five model variants, demonstrating that semantic loss is essential and not optional, for stable causal reasoning in transformers.

2605.05435 2026-05-08 cs.LG cs.NA math.NA

Active Learning for Conditional Generative Compressed Sensing

基于条件生成压缩感知的主动学习

Alexander DeLise, Nick Dexter

发表机构 * Department of Scientific Computing(科学计算系) Department of Mathematics(数学系) Florida State University(佛罗里达州立大学)

AI总结 本文研究了利用提示条件生成模型从子采样傅里叶测量中恢复图像的问题,提出分离提示在采样分布设计和恢复模型定义中的作用,并证明了提示匹配的Christoffel采样保持相同复杂度常数,而提示不匹配会引入显式兼容性惩罚。

Comments 33 pages, 11 figures

详情
AI中文摘要

生成压缩感知利用预训练生成器的范围作为非线性模型,用于从有限测量中恢复结构化信号。我们研究了使用提示条件生成模型从子采样傅里叶测量中恢复图像的条件版本问题。我们的框架将提示的两个作用分开:用于设计采样分布的提示和用于定义恢复模型的提示。对于ReLU和Lipschitz条件生成器,我们证明了稳定的恢复界限,表明提示匹配的Christoffel采样保持与现有近最优生成压缩感知理论相同的Christoffel复杂度常数,而提示不匹配会引入显式兼容性惩罚。使用Stable Diffusion的实验表明,提示可以有意义地重塑Christoffel采样分布并影响图像恢复。总体而言,我们的结果表明,提示应被视为具有不同影响的设计变量,对传感、近似和恢复都有影响。

英文摘要

Generative compressed sensing uses the range of a pretrained generator as a nonlinear model for recovering structured signals from limited measurements. We study a conditional version of this problem for image recovery from subsampled Fourier measurements using prompt-conditioned generative models. Our framework separates two roles of conditioning: the prompt used to design the sampling distribution and the prompt used to define the recovery model. For ReLU and Lipschitz conditional generators, we prove stable recovery bounds showing that prompt-matched Christoffel sampling retains the same Christoffel complexity constant as existing near-optimal generative compressed sensing theory, while prompt mismatch incurs an explicit compatibility penalty. Experiments with Stable Diffusion show that prompts meaningfully reshape Christoffel sampling distributions and influence image recovery. Overall, our results suggest that prompts should be treated as design variables with distinct effects on sensing, approximation, and recovery.

2605.05415 2026-05-08 cs.LG cs.AI cs.CR

Information Theoretic Adversarial Training of Large Language Models

信息论视角下的大语言模型对抗训练

Yiwei Zhang, Jeremiah Birrell, Reza Ebrahimi, Rouzbeh Behnia, Jason Pacheco, Elisa Bertino

发表机构 * Purdue University(普渡大学) Texas State University(德克萨斯州立大学) University of South Florida(佛罗里达州立大学) University of Arizona(亚利桑那大学)

AI总结 本文提出WARDEN框架,通过信息论方法动态调整对抗示例权重,提升大语言模型的鲁棒性,减少攻击成功率且保持模型效用。

详情
AI中文摘要

尽管对齐和安全性有所进展,大语言模型(LLMs)仍易受对抗性提示攻击。现有对抗训练方法计算成本高且难以扩展。本文提出WARDEN,基于分布鲁棒性,通过f-散度模糊集动态重新加权对抗示例,优化最坏情况下的对抗损失。利用对偶形式,目标转化为KL散度下的log-sum-exp形式,通过动态参数控制重加权强度。实验表明,WARDEN在多个LLM和攻击设置中显著降低攻击成功率,计算和效用成本与CAT、CAPO等基线方法相当,为可扩展的鲁棒对齐提供了实用方法。

英文摘要

Large language models (LLMs) remain vulnerable to adversarial prompting despite advances in alignment and safety, often exhibiting harmful behaviors under novel attack strategies. While adversarial training can improve robustness, existing approaches are computationally expensive and difficult to scale. Recent continuous adversarial training methods, such as Continuous adversarial training (CAT) and Continuous Adversarial Preference Optimization (CAPO), address this challenge by leveraging gradient-based perturbations in the embedding space, enabling more efficient and expressive attacks. Building on this paradigm, we propose WARDEN, a distributionally robust adversarial training framework for LLMs that dynamically reweights adversarial examples through an f -divergence ambiguity set around the empirical training distribution. Our method optimizes the worst-case adversarial loss within a divergence ball around the empirical data distribution, automatically emphasizing harder adversarial examples. Using the convex dual formulation, the objective reduces to a log-sum-exp form under the KL divergence, with a dynamical parameter controlling the strength of reweighting. This study leads to a new class of information-theoretic objectives that significantly reduce attack success rates while maintaining model utility. Across multiple LLMs and attack settings, WARDEN substantially reduces attack success rates with computational and utility costs comparable to CAT-, CAPO-, and MixAT-based baselines, making it a practical approach for scalable robust alignment.

2605.05413 2026-05-08 cs.AI

From History to State: Constant-Context Skill Learning for LLM Agents

从历史到状态:为LLM代理的常域技能学习

Haoyang Xie, Xinyuan Wang, Yancheng Wang, Puda Zhao, Feng Ju

发表机构 * School of Computing and Augmented Intelligence, Arizona State University(计算与增强智能学院,亚利桑那州立大学)

AI总结 本文提出常域技能学习框架,通过轻量任务模块学习可重用流程,利用确定性跟踪器生成紧凑状态块,结合SFT和在线RL提升性能,减少提示词用量。

详情
AI中文摘要

大型语言模型(LLM)代理越来越多地用于操作浏览器、文件、代码和工具,使个人助手成为自然的部署目标。然而,个人代理面临隐私成本与能力之间的紧张关系:云模型能执行多步骤工作流但会暴露敏感中间上下文给外部API,而本地模型保持隐私但可靠性较低。两种设置都需反复支付长技能提示和增长历史的代价。我们提出常域技能学习,一种用于重复代理工作流的上下文到权重框架:可重用的流程在轻量任务家族模块中学习,而推理仅依赖当前观察和紧凑状态块。确定性跟踪器从任务进度渲染此状态块,并提供对齐的子目标奖励,因此每个模块可通过步骤级SFT训练并通过在线RL优化。在ALFWorld、WebShop和SciWorld上,我们的代理在Qwen3-4B、Qwen3-8B和Llama-3.1-8B上均表现出色。使用Qwen3-8B时,SFT+RL在ALFWorld上达到89.6%的未见成功,在WebShop上达到76.8%的成功率,在SciWorld上达到66.4%的未见成功。它们在匹配或超过强大发布的代理训练结果的同时,将每轮提示词数量减少2-7倍,相对于受控ReAct提示基线,表明过程上下文可以被移动到权重中。

英文摘要

Large language model (LLM) agents are increasingly used to operate browsers, files, code and tools, making personal assistants a natural deployment target. Yet personal agents face a privacy-cost-capability tension: cloud models execute multi-step workflows well but expose sensitive intermediate context to external APIs, while local models preserve privacy but remain less reliable. Both settings also pay repeatedly for long skill prompts and growing histories. We propose constant-context skill learning, a context-to-weights framework for recurring agent workflows: reusable procedures are learned in lightweight task-family modules, while inference conditions only on the current observation and a compact state block. A deterministic tracker renders this state block from task progress and supplies aligned subgoal rewards, so each module can be trained with step-level SFT and refined through online RL. Across ALFWorld, WebShop, and SciWorld, our agents achieve strong performance across Qwen3-4B, Qwen3-8B and Llama-3.1-8B. With Qwen3-8B, SFT+RL reaches 89.6\% unseen success on ALFWorld, 76.8\% success on WebShop, and 66.4\% unseen success on SciWorld. They match or exceed strong published agent-training results while reducing prompt tokens per turn by 2--7$\times$ relative to controlled ReAct prompting baselines, showing that procedural context can be moved from prompts into weights.

2605.05411 2026-05-08 cs.RO cs.AI

Creative Robot Tool Use by Counterfactual Reasoning

通过反事实推理实现创意机器人工具使用

M. Tuluhan Akbulut, Varun Satheesh, Ahmed Jaafar, Alper Ahmetoglu, Shane Parr, Aditya Ganeshan, Shivam Vats, George Konidaris

发表机构 * Brown University(布朗大学)

AI总结 本文提出一种因果推理框架,用于识别超出其主要目标的合适工具。通过动态模型模拟实验发现工具与任务的因果关系,并通过几何和物理特征扰动生成反事实工具,实现更可靠的工具选择和更强的技能关键点迁移。

Comments Under review

详情
AI中文摘要

我们提出了一种因果推理框架,用于创意机器人工具使用,其中通过在动态模型中进行模拟实验发现工具与任务之间的因果关系,从而正确识别出适合任务的工具。该框架将因果发现问题分解为两个互补部分:基于VLM的特征建议和通过针对性几何和物理特征扰动生成反事实工具。然后,基于识别出的因果特征对新对象进行分类,并通过关键点匹配在识别出的因果特征条件下转移工具使用技能。通过在动态模型中重构任务,我们的方法将工具使用建立在问题的物理性质上。我们在使用不同棍子到达远处物体、用多样物品从碗中舀取糖果以及用不同箱子或货箱作为阶梯平台从高架上取物等任务中展示了我们的方法。基线比较显示,识别因果特征并将其与物理工具属性相结合,导致更可靠的工具选择和更强的技能关键点迁移。

英文摘要

We propose a causal reasoning framework for creative robot tool use where a suitable tool for a task is correctly identified for use beyond its primary objectives. The proposed framework first discovers the causal relationships between the tool and the task by conducting simulated experiments in a dynamics model. We decouple the causal discovery problem into two complementary components: VLM-based feature suggestion and counterfactual tool generation via targeted geometric and physical feature perturbations. Then, novel objects are classified based on identified causal features, and the tool use skill is transferred via keypoint matching conditioned on the identified causal features. By reconstructing the task in a dynamics model, our approach grounds tool use in the physics of the problem. We illustrate our approach in reaching a distant object with different sticks, scooping candies from a bowl using diverse items, and using different boxes or crates as stepping platforms to retrieve an object from a high shelf. Our baseline comparisons show that identifying causal features and grounding them in physical tool properties leads to more reliable tool selection and stronger skill keypoint transfer.

2605.05410 2026-05-08 cs.AI cs.HC physics.ed-ph

LaTA: A Drop-in, FERPA-Compliant Local-LLM Autograder for Upper-Division STEM Coursework

LaTA:一个符合FERPA规定的本地LLM自动评分器,用于大二STEM课程作业

Jesse A. Rodríguez

发表机构 * School of Mechanical, Industrial, and Manufacturing Engineering(机械、工业与制造工程学院)

AI总结 LaTA是一种本地LLM自动评分器,可有效减轻大二STEM课程的评分负担,通过本地运行和LaTeX流程,实现零成本评分,提升学生自信心并提高考试成绩。

Comments Submitted to Computers & Education

详情
AI中文摘要

LaTA是一种本地LLM自动评分器,能够有效减轻上层STEM课程的评分负担,通过本地运行和LaTeX流程,实现零成本评分,提升学生自信心并提高考试成绩。

英文摘要

Large-language-model (LLM) graders promise to relieve the grading burden of upper-division STEM courses, but most deployments to date send student work to third-party APIs, violating FERPA and exposing institutions to data risk while requiring substantial assignment modification. We present $\textbf{LaTA}\ (\textit{LaTeX Teaching Assistant})$, a drop-in, open-source autograder that runs entirely on commodity on-premises hardware and assumes a LaTeX-native workflow already adopted by many engineering and physics courses. LaTA implements a four-stage pipeline (ingest, segment, grade, report) using a locally hosted open-weight chain-of-thought LLM grader (gpt-oss:120b) that compares student work to an instructor-authored reference solution and applies a YAML rubric with binary per-item scoring. We deployed LaTA in Winter~2026 in ME 373 (Mechanical Engineering Methods) at Oregon State University, grading every weekly assignment for approximately 200 students on a single Mac Studio at \$0 marginal cost per assignment and 1--3 minutes of wall-clock time per submission, enabling regrading of corrected assignments and greatly expanded TA office hour offerings. The instructor-confirmed grading-error rate held at roughly $0.02$--$0.04\%$ per rubric line item across the term. Relative to the same instructor's previous traditionally-graded cohort, the LaTA-graded cohort outperformed by approximately $11\%$ on the midterm exam and $8\%$ on the final exam, and reported large gains in self-assessed confidence on every stated learning objective ($N = 159$ survey responses, $Δ\geq +1.49$ Likert points, $p < 10^{-27}$ on every comparison). We release the code under AGPLv3.

2605.05409 2026-05-08 cs.AI cs.CL

Agentic Retrieval-Augmented Generation for Financial Document Question Answering

代理检索增强生成用于财务文档问答

Yang Shu, Yingmin Liu, Zequn Xie

发表机构 * College of Computer Science and Technology, Zhejiang University(浙江大学计算机科学与技术学院)

AI总结 本文提出FinAgent-RAG框架,通过迭代检索-推理循环和自我验证,提升金融文档问答的精度。引入对比金融检索器、程序化思维模块和自适应策略路由,实验表明在三个基准数据集上均取得显著效果,准确率提升5.62-9.32个百分点。

Comments 22 pages, 11 figures, 13 tables, submitted to Expert Systems with Applications

详情
AI中文摘要

金融文档问答需要复杂的多步数值推理,涉及异构证据——结构化表格、文本叙述和脚注——分散在企业文件中。现有检索增强生成方法采用单次检索-生成范式,难以处理财务分析中普遍存在的组合推理链。我们提出FinAgent-RAG,一种代理RAG框架,通过迭代检索-推理循环和自我验证,专门针对金融数值推理的精度需求。该框架整合了三个领域特定创新:(1) 通过挖掘困难负样本训练的对比金融检索器,以区分语义相似但数值不同的金融段落;(2) 程序化思维推理模块生成可执行的Python代码进行精确算术运算,而非依赖易出错的LLM基础心理计算;(3) 自适应策略路由器根据问题复杂度动态分配计算资源,减少FinQA上的API成本41.3%,同时保持准确性。在三个基准数据集——FinQA、ConvFinQA和TAT-QA上的广泛实验表明,FinAgent-RAG分别达到76.81%、78.46%和74.96%的执行准确率,优于最强基线5.62-9.32个百分点。消融研究、四种LLM的跨模型评估和部署成本分析证实了该框架的鲁棒性和对金融机构的实际可行性。

英文摘要

Financial document question answering (QA) demands complex multi-step numerical reasoning over heterogeneous evidence--structured tables, textual narratives, and footnotes--scattered across corporate filings. Existing retrieval-augmented generation (RAG) approaches adopt a single-pass retrieve-then-generate paradigm that struggles with the compositional reasoning chains prevalent in financial analysis. We propose FinAgent-RAG, an agentic RAG framework that orchestrates iterative retrieval-reasoning loops with self-verification, specifically engineered for the precision requirements of financial numerical reasoning. The framework integrates three domain-specific innovations: (1) a Contrastive Financial Retriever trained with hard negative mining to distinguish semantically similar but numerically distinct financial passages, (2) a Program-of-Thought reasoning module that generates executable Python code for precise arithmetic rather than relying on error-prone LLM-based mental computation, and (3) an Adaptive Strategy Router that dynamically allocates computational resources based on question complexity, reducing API costs by 41.3% on FinQA while preserving accuracy. Extensive experiments on three benchmark datasets--FinQA, ConvFinQA, and TAT-QA--demonstrate that FinAgent-RAG achieves 76.81%, 78.46%, and 74.96% execution accuracy respectively, outperforming the strongest baseline by 5.62--9.32 percentage points. Ablation studies, cross-backbone evaluation with four LLMs, and deployment cost analysis confirm the framework's robustness and practical viability for financial institutions.

2605.05403 2026-05-08 cs.AI

When Helpfulness Becomes Sycophancy: Sycophancy is a Boundary Failure Between Social Alignment and Epistemic Integrity in Large Language Models

当有益变成谄媚:谄媚是大型语言模型中社会契合与认知完整性之间的边界失败

Jiechen Li, Catherine A. Barry, Rishika Randev, Janet Chen, Ella Jorgensen, Brinnae Bent

发表机构 * Duke University(杜克大学)

AI总结 本文探讨大型语言模型中谄媚现象作为社会契合与认知完整性之间边界失败的问题,提出三条件框架以界定谄媚,并讨论其分类、评估及缓解策略。

Comments Currently under review

详情
AI中文摘要

本文主张大型语言模型中的谄媚是社会契合与认知完整性之间的边界失败。现有研究通常通过外部行为如同意错误用户信念、立场反转或偏离客观正确性标准来定义谄媚。这些定义仅捕捉到谄媚的显性形式,而未明确涉及认知完整性和社会契合的更微妙边界失败。我们主张谄媚不应仅理解为同意,而应视为一种使独立认知判断失衡的契合行为。为明确此边界,我们提出三条件框架。首先,用户以信念、偏好或自我概念的形式表达提示。其次,模型通过契合行为向该提示偏移。第三,这种偏移损害认知准确性、独立推理或适当纠正。我们还引入一种分类体系,用于对谄媚进行分类,包括契合目标、机制和严重性。本文最后讨论了对契合评估的影响,并主张边界意识评估、结构化评分表和缓解策略,同时将这些提议置于其他谄媚观点的背景下。

英文摘要

This position paper argues that sycophancy in LLMs is a boundary failure between social alignment and epistemic integrity. Existing work often operationalizes sycophancy through external behavior such as agreement with incorrect user beliefs, position reversals, or deviation from an objective standard of correctness. These formulations capture only overt forms of the phenomenon and leave subtler boundary failures involving epistemic integrity and social alignment underspecified. We argue that sycophancy should not be understood as agreement alone, but as alignment behavior that displaces independent epistemic judgment. To clarify this boundary, we propose a three-condition framework for sycophancy. First, the user expresses a cue in the form of a belief, preference, or self-concept. Second, the model shifts toward that cue through alignment behavior. Third, this shift compromises epistemic accuracy, independent reasoning, or appropriate correction. We also introduce a taxonomy for classifying sycophancy, consisting of alignment targets, mechanisms, and severity. The paper concludes by discussing implications for alignment evaluation and argues for boundary-aware assessment, structured rubrics, and mitigation strategies, while situating these proposals alongside alternative views of sycophancy.

2605.05402 2026-05-08 cs.AI cs.CV eess.IV

Intelligent CCTV for Urban Design: AI-Based Analysis of Soft Infrastructure at Intersections

智能监控摄像头用于城市设计:基于AI的交叉口软基础设施分析

Vinit Katariya, Seungjin Kim, Curtis Craig, Nichole Morris, Hamed Tabkhi

发表机构 * University of Wyoming(怀俄明大学) University of North Carolina at Charlotte(北卡罗来纳大学夏洛特分校) University of Minnesota(明尼苏达大学)

AI总结 本文提出基于AI的分析框架,利用现有监控摄像头评估软干预措施对车速和安全的影响,通过深度学习和视角基速度估计,发现软基础设施能有效降低车速,为交通政策评估提供低成本证据。

Comments 16 pages, 6 figures, 7 tables, Submitted/Under Review at the International Journal of Transportation Research (Submitted on 12 Jan 2026)

详情
AI中文摘要

人工智能和计算机视觉正在改变交通数据收集方式。本文介绍了一种基于AI的分析框架,利用现有CCTV基础设施评估软干预措施(如临时人行道避让区和车道延伸)对车辆速度和安全的影响。通过深度学习和基于视角的速度估计,我们评估了干预前后驾驶员行为,包括明尼阿波利斯市第一周和第二周的重复监测。研究发现,在无信号灯交叉口,平均和85百分位数速度分别下降了18.75%和16.56%,通过交通量下降了12.2%。信号灯交叉口显示了相似的减少,除了一个地点,平均和85百分位数速度分别下降了20.0%和17.19%。这些结果证明了软基础设施的交通缓和效果,并突显了AI驱动方法在快速、低成本和基于证据的交通政策评估中的实用性。

英文摘要

Artificial intelligence (AI) and computer vision are transforming transportation data collection. This study introduces an AI-enabled analytics framework leveraging existing CCTV infrastructure to evaluate the impact of soft interventions, such as temporary pedestrian refuges and curb extensions, on vehicle speed and safety. Using deep learning and perspective-based speed estimation, we evaluated driver behavior before and after interventions, with repeated post-installation monitoring in Week 1 and Week 2, in Minneapolis. Findings reveal that at unsignalized intersections, mean and 85th-percentile speeds fell by up to 18.75% and 16.56%, respectively, while pass-through traffic decreased by as much as 12.2%. Signalized intersections showed comparable reductions except one location, with mean and 85th-percentile speeds dropping by up to 20.0% and 17.19%. These results demonstrate the traffic-calming effectiveness of soft infrastructure and underscore the utility of AI-powered methods for rapid, low-cost, and evidence-based transport policy evaluation.

2605.05395 2026-05-08 cs.LG cs.MS

Differentiable Parameter Optimization for DAEs with State-Dependent Events

基于状态依赖事件的DAEs可微参数优化

Ion Matei, Maksym Zhenirovskyy, Anthony Wong

发表机构 * Fujitsu Research of America(美国富士通研究)

AI总结 本文研究了带有状态依赖事件的半显式DAEs的可微参数优化问题,提出两种互补的梯度计算策略,解决参数学习中代数变量隐式定义、事件时间依赖参数及重置映射引入不连续性等挑战。

详情
AI中文摘要

微分代数方程(DAEs)带有状态依赖事件出现在连续动力学受代数方程约束并被模式变化、开关逻辑、碰撞或状态重新初始化中断的系统中。基于梯度的参数学习对于此类系统具有挑战性,因为代数变量隐式定义,事件时间依赖参数,且重置映射引入不连续性。本文研究了半显式DAEs中带有事件的可微参数优化。我们将学习问题表述为具有DAE动态、代数约束、守卫方程和重置映射的约束最小二乘问题。然后开发了两种互补的梯度计算策略。第一种是通过模拟自动微分的方法,解决向量场内的代数变量,利用隐函数定理对代数求解进行微分,并通过分段可微积分处理事件。第二种是显式的离散伴随方法,将正向模拟表示为事件分割残差系统,并通过求解平滑段和事件残差的拉格朗日乘数来计算梯度。该公式澄清了伴随方法中的残差项是等式约束,而不是启发式惩罚。我们从梯度解释、事件时间处理、实现复杂性和局部有效性等方面比较了两种方法。两种方法为正向模拟所选事件路径提供梯度,并在固定事件顺序和横贯守卫交叉下有效。

英文摘要

Differential-algebraic equations (DAEs) with state-dependent events arise in systems whose continuous dynamics are constrained by algebraic equations and interrupted by mode changes, switching logic, impacts, or state reinitializations. Gradient-based parameter learning for such systems is challenging because algebraic variables are implicitly defined, event times depend on the parameters, and reset maps introduce discontinuities. This paper studies differentiable parameter optimization for semi-explicit DAEs with events. We formulate the learning problem as a constrained least-squares problem with DAE dynamics, algebraic constraints, guard equations, and reset maps. We then develop two complementary gradient-computation strategies. The first is an automatic-differentiation-through-simulation method that solves algebraic variables inside the vector field, differentiates the algebraic solve using the implicit function theorem, and handles events through segmented differentiable integration. The second is an explicit discrete-adjoint method that represents the forward simulation as an event-split residual system and computes gradients by solving for the Lagrange multipliers of smooth-segment and event residuals. The formulation clarifies that residual terms in the adjoint method are equality constraints, not heuristic penalties. We compare the two approaches in terms of gradient interpretation, event-time handling, implementation complexity, and local validity. Both methods provide gradients for the event path selected by the forward simulation and are valid under fixed event ordering and transversal guard crossings.

2605.05392 2026-05-08 cs.CL cs.AI

Generating Query-Focused Summarization Datasets from Query-Free Summarization Datasets

从无查询摘要数据集中生成查询聚焦摘要数据集

Yllias Chali, Deen Abdullah

发表机构 * University of Lethbridge(利思布莱德大学) Alberta, Canada(阿尔伯塔省,加拿大)

AI总结 本文提出基于证据的模型,从无查询摘要数据集中生成查询,通过内在和外在评估验证了其在查询聚焦摘要任务中的有效性。

Comments 7 pages, 1 figure

详情
AI中文摘要

大规模数据集被广泛用于执行摘要任务,但它们可能不包含查询与文档和摘要。在寻找适合查询聚焦摘要(QFS)的数据集时,我们提出两个研究问题:是否有可能从无查询数据集中自动生成基于证据的查询关键词?基于证据的查询生成是否支持QFS任务?本文提出一个基于证据的模型来从无查询数据集中生成查询。为了内在评估我们的模型,我们比较了原始查询与系统生成查询在两个QFS数据集中的相似性。我们还使用不同的预训练模型以及最先进的(SOTA)QFS模型进行摘要任务,以测量我们查询生成方法的外在性能。实验结果表明,使用基于证据的查询生成的摘要在ROUGE得分上与使用原始查询生成的摘要具有竞争力。

英文摘要

Large-scale datasets are widely used to perform summarization tasks, but they may not include queries alongside documents and summaries. In the search for suitable datasets for Query-Focused Summarization (QFS), we identify two research questions: Is it possible to automatically generate evidence-based query keywords from query-free datasets? Does evidence-based query generation support the QFS task? This paper proposes an evidence-based model to generate queries from query-free datasets. To evaluate our model intrinsically, we compare the similarity between the original queries and the system-generated queries of two QFS datasets. We also perform summarization tasks using different pre-trained models, as well as a state-of-the-art (SOTA) QFS model, to measure the extrinsic performance of our query generation approach. Experimental results indicate that summaries generated using evidence-based queries achieve competitive ROUGE scores compared to those generated from the original queries.

2605.05390 2026-05-08 cs.CV

LAMP: Localization Aware Multi-camera People Tracking in Metric 3D World

LAMP:面向度量3D世界的多摄像头人体跟踪

Nan Yang, Julian Straub, Fan Zhang, Richard Newcombe, Jakob Engel, Lingni Ma

发表机构 * Meta Reality Labs Research(Meta现实实验室研究)

AI总结 LAMP提出一种新颖简单框架,通过早期解耦观察者与目标运动,解决多摄像头视角下3D人体跟踪问题,实现动态视角下的高效跟踪。

Comments CVPR 2026. Project page: https://facebookresearch.github.io/LAMP

详情
AI中文摘要

从第一人称多摄像头头显跟踪3D人体运动面临严重自我运动、部分可见性或遮挡以及缺乏训练数据的挑战。现有方法针对单目视频设计,通常需要静态或缓慢移动的摄像头,无法有效利用多视角、校准和定位的输入。这使它们脆弱且在动态第一人称捕获中容易失败。我们提出LAMP(定位感知多摄像头人体跟踪):一种新的简单框架,通过早期解耦观察者和目标运动来解决这个问题。LAMP引入了两步过程。首先,利用已知设备6自由度运动和校准,将所有摄像头在时间窗口内检测到的2D身体关键点转换为统一的3D世界参考框架。第二,一个端到端训练的时空Transformer直接将3D人体运动拟合到此3D射线云。这种“提升-然后拟合”方法使LAMP能够学习并利用世界空间中的自然人体运动先验,同时提供一个优雅的框架,灵活地结合来自多个时间异步、部分观察和移动摄像头的信息。LAMP在单目基准上实现了最先进的结果,同时在我们针对的第一人称设置中显著优于基线。

英文摘要

Tracking 3D human motion from egocentric multi-camera headset is challenged by severe egomotion, partial visibility or occlusions and lack of training data. Existing methods designed for monocular video often require static or slowly-moving cameras and cannot efficiently leverage multi-view, calibrated and localized input. This makes them brittle and prone to fail on dynamic egocentric captures. We propose LAMP (Localization Aware Multi-camera People Tracking): a novel, simple framework to solve this via early disentanglement of observer and target motion. LAMP introduces a two-step process. First, we leverage the known device 6 DoF motion and calibration to convert detected 2D body keypoints from all cameras over a temporal window into a unified 3D world reference frame. Second, an end-to-end-trained spatio-temporal transformer fits 3D human motion directly to this 3D ray cloud. This "lift-then-fit" approach allows LAMP to learn and leverage a natural human motion prior in the world-space, as well as providing an elegant framework to flexibly incorporate information from multiple temporally asynchronous, partially observing and moving cameras. LAMP achieves state-of-the-art results on monocular benchmarks, while significantly outperforming baselines for our targeted egocentric setting.

2605.05389 2026-05-08 cs.LG cs.AI

Two-Stage Learned Decomposition for Scalable Routing on Multigraphs

多阶段学习分解用于多图的可扩展路由

Filip Rydin, Morteza Haghir Chehreghani, Balázs Kulcsár

发表机构 * Chalmers University of Technology(查尔姆斯理工大学) University of Gothenburg(哥德堡大学)

AI总结 本文提出多阶段学习分解方法,通过节点排列和边选择阶段解决多图路由的可扩展性问题,实验表明其在解的质量和速度上优于现有方法。

Comments 20 pages, 3 figures

详情
AI中文摘要

大多数针对车辆路径问题(VRPs)的神经方法局限于欧几里得环境或简单图。在本文中,我们考虑多图,其中平行边代表不同的旅行选项,具有不同的权衡(例如距离与时间)。很少有方法针对这种形式进行设计,而现有的方法面临重大可扩展性问题。我们通过节点-边策略分解(NEPF)方法缓解这些问题,该方法将路由策略分为节点排列阶段和边选择阶段。为实现分解,我们引入了一种预编码边聚合方案和一种非自回归架构用于边阶段,以及一种分层强化学习方法来联合训练各阶段。在六个VRP变体上的实验表明,NEPF在解的质量上与现有最佳方法相当或更优,同时在训练和推理速度上显著更快。

英文摘要

Most neural methods for Vehicle Routing Problems (VRPs) are limited to Euclidean settings or simple graphs. In this work, we instead consider multigraphs, where parallel edges represent distinct travel options with varying trade-offs (e.g., distance vs time). Few methods are designed for such formulations and those that do exist face major scalability issues. We mitigate these scalability issues via a Node-Edge Policy Factorization (NEPF) approach, which splits the routing policy into a node permutation stage and an edge selection stage. To enable the decomposition, we introduce a pre-encoding edge aggregation scheme and a non-autoregressive architecture for the edge stage, as well as a hierarchical reinforcement learning method to train the stages jointly. Our experiments across six VRP variants demonstrate that NEPF matches or outperforms the state-of-the-art in terms of solution quality, while being significantly faster in training and inference.

2605.05386 2026-05-08 cs.AI cs.CL cs.LG

BALAR : A Bayesian Agentic Loop for Active Reasoning

BALAR:主动推理的贝叶斯代理循环

Aymen Echarghaoui, Dongxia Wu, Emily B. Fox

发表机构 * Department of Statistics, Stanford University(统计学系,斯坦福大学) Department of Computer Science, Stanford University(计算机科学系,斯坦福大学)

AI总结 BALAR是一种无需微调的任务无关外环算法,通过维护潜在状态的结构信念,选择澄清问题以最大化预期互信息,并动态扩展状态表示,从而在三个基准测试中显著提升性能。

详情
AI中文摘要

大型语言模型越来越多地在交互环境中运行,解决任务需要与用户多次信息交换。然而,大多数现有系统反应式地处理对话,缺乏明确的机制来推断缺失信息和下一步应问的问题。我们提出了BALAR(主动推理的贝叶斯代理循环),一种任务无关的外环算法,无需微调,使LLM代理与用户进行结构化的多轮交互。BALAR维护对潜在状态的结构信念,通过最大化预期互信息选择澄清问题,并在当前状态表示不足时动态扩展其表示。我们在三个多样化的基准测试上评估了BALAR:AR-Bench-DC(侦探案例)、AR-Bench-SP(思维谜题)和iCraft-MD(临床诊断)。在所有三个基准测试中,BALAR均显著优于所有基线,其在AR-Bench-DC上的准确率比基线高14.6%,在AR-Bench-SP上高38.5%,在iCraft-MD上高30.5%。

英文摘要

Large language models increasingly operate in interactive settings where solving a task requires multiple rounds of information exchange with a user. However, most current systems treat dialogue reactively and lack a principled mechanism to reason about what information is missing and which question should be asked next. We propose BALAR (Bayesian Agentic Loop for Active Reasoning), a task-agnostic outer-loop algorithm that requires no fine-tuning and enables structured multi-turn interaction between an LLM agent and a user. BALAR maintains a structured belief over latent states, selects clarifying questions by maximizing expected mutual information, and dynamically expands its state representation when the current one proves insufficient. We evaluate BALAR on three diverse benchmarks: AR-Bench-DC (detective cases), AR-Bench-SP (thinking puzzles), and iCraft-MD (clinical diagnosis). BALAR significantly outperforms all baselines across all three benchmarks, with $14.6\%$ higher accuracy on AR-Bench-DC, $38.5\%$ on AR-Bench-SP, and $30.5\%$ on iCraft-MD.

2605.05379 2026-05-08 cs.AI cs.CC cs.ET

Partial Evidence Bench: Benchmarking Authorization-Limited Evidence in Agentic Systems

部分证据基准:在代理系统中受授权限制的证据基准测试

Krti Tallam

发表机构 * KamiwazaAI

AI总结 本文提出Partial Evidence Bench,用于评估代理系统在授权限制下的证据处理能力,通过四个维度评估系统表现,展示不同行为模式对完成度的影响。

Comments Benchmark paper with deterministic synthetic corpora, 14 pages, 6 tables

详情
AI中文摘要

企业代理越来越多地在受限制的检索系统、委托工作流和政策约束的证据环境中运行。在这些设置中,可以通过正确执行访问控制来确保系统仍然产生一个看似完整的答案,即使材料证据位于调用者的授权边界之外。本文引入了Partial Evidence Bench,一个确定性的基准,用于衡量这种失败模式。该基准包含三个场景家族——尽职调查、合规审计和安全事件响应,共72个任务,ACL划分的语料库、 oracle完整答案、 oracle授权视图答案、 oracle完整性判断以及结构化缺口报告oracle。它评估系统在四个维度:答案正确性、完整性意识、缺口报告质量以及不安全的完整性行为。检查进的基线显示,在所有发布的家族中,静默过滤在所有情况下都是灾难性不安全的,而显式失败并报告行为可以消除不安全的完整性而不导致任务陷入琐碎的回避。初步的现实模型运行显示,系统在是否过度声称完整性、保守地低估或以企业可使用的形式报告不完整性方面存在模型依赖性和场景敏感性差异。该基准的更广泛贡献是使治理关键的代理失败可测量,而无需人类法官或易受污染的静态语料库。

英文摘要

Enterprise agents increasingly operate inside scoped retrieval systems, delegated workflows, and policy-constrained evidence environments. In these settings, access control can be enforced correctly while the system still produces an answer that appears complete even though material evidence lies outside the caller's authorization boundary. This paper introduces Partial Evidence Bench, a deterministic benchmark for measuring that failure mode. The benchmark ships three scenario families -- due diligence, compliance audit, and security incident response -- with 72 tasks total, ACL-partitioned corpora, oracle complete answers, oracle authorized-view answers, oracle completeness judgments, and structured gap-report oracles. It evaluates systems along four surfaces: answer correctness, completeness awareness, gap-report quality, and unsafe completeness behavior. Checked-in baselines show that silent filtering is catastrophically unsafe across all shipped families, while explicit fail-and-report behavior eliminates unsafe completeness without collapsing the task into trivial abstention. Preliminary real-model runs show model-dependent and scenario-sensitive differences in whether systems overclaim completeness, conservatively underclaim, or report incompleteness in an enterprise-usable form. The benchmark's broader contribution is to make a governance-critical agent failure measurable without human judges or contamination-prone static corpora.

2605.05370 2026-05-08 cs.LG cs.AI

SPADE: Faster Drug Discovery by Learning from Sparse Data

SPADE:通过从稀疏数据中学习实现更快的药物发现

Rahul Nandakumar, Ben Fauber, Deepayan Chakrabarti

发表机构 * NVIDIA(英伟达)

AI总结 SPADE通过从稀疏数据中学习,以更少的测试次数找到高质量的药物分子,相比深度学习和贝叶斯优化方法在样本效率上提升了7%-32%,且在评分速度上快10倍。

详情
AI中文摘要

药物发现旨在寻找能强烈且选择性地与目标蛋白结合的分子(配体)。然而,少于5%的候选配体能通过甚至药物发现的早期阶段。此外,我们希望方法能适用于没有先验数据的新型蛋白。从头开始,必须迭代选择和测试候选配体,以在尽可能少的测试中找到足够多的高质量配体。我们提出的方法SPADE引入了一种新的配体选择方法,仅需平均40次测试即可找到10个高质量配体。在一对一比较中,SPADE在更多蛋白质上优于深度学习和贝叶斯优化方法,实现了样本效率的中位数改进7%-32%。SPADE的评分速度也比其最接近的竞争对手快10倍。数据集和代码可在https://anonymous.4open.science/r/SPADE_Fast_Drug_Discovery_by_Learning_from_Sparse_Data-F028/README.md获取。

英文摘要

Drug discovery seeks molecules (ligands) that bind strongly and selectively to a target protein. However, fewer than 5% of candidate ligands pass the bar for even the early stages of drug discovery. Furthermore, we want methods that work for novel proteins for which we have no prior data. Starting from scratch, we have to iteratively select and test candidate ligands such that we find enough ligands of the desired quality in as few tests as possible. Our proposed algorithm, named SPADE, introduces a novel approach to ligand selection that requires only 40 tests on average to find 10 high-quality ligands. In one-vs-one comparisons, SPADE outperforms deep learning and Bayesian optimization methods on more proteins, achieving median improvements of 7%-32% in sample efficiency. SPADE is also 10x faster than its closest competitor at scoring candidate drugs. Dataset and code is available at https://anonymous.4open.science/r/SPADE_Fast_Drug_Discovery_by_Learning_from_Sparse_Data-F028/README.md

2605.05365 2026-05-08 cs.AI cs.CL

ZAYA1-8B Technical Report

ZAYA1-8B 技术报告

Robert Washbourne, Rishi Iyer, Tomas Figliolia, Henry Zheng, Ryan Lorig-Roach, Sungyeon Yang, Pritish Yuvraj, Quentin Anthony, Yury Tokpanov, Xiao Yang, Ganesh Nanduru, Stephen Ebert, Praneeth Medepalli, Skyler Szot, Srivatsan Rajagopal, Alex Ong, Bhavana Mehta, Beren Millidge

发表机构 * Zyphra

AI总结 ZAYA1-8B 是基于 MoE++ 架构的推理聚焦混合专家模型,通过全栈 AMD 平台训练,在数学和编程基准中表现优异,采用 RL 策略和 Markovian RSA 方法提升推理性能。

详情
AI中文摘要

我们介绍了 ZAYA1-8B,一个专注于推理的混合专家(MoE)模型,具有7亿个活跃参数和80亿总参数,基于 Zyphra 的 MoE++ 架构构建。ZAYA1-8B 的核心预训练、中训练和监督微调均在全栈 AMD 计算、网络和软件平台上进行。在不到100亿个活跃参数的情况下,ZAYA1-8B 在多个具有挑战性的数学和编码基准上与 DeepSeek-R1-0528 相比不逊色,并在大幅更大的开放权重推理模型中保持竞争力。ZAYA1-8B 从头开始训练推理,从预训练开始使用保留答案的修剪方案包含推理数据。训练后使用四阶段的 RL 级联:在数学和谜题上进行推理预热;一个 400 任务的 RLVE-Gym 课程;数学和代码 RL 使用测试时间计算轨迹和从竞争性编程参考中构建的合成代码环境;以及行为 RL 用于聊天和指令遵循。我们还引入了 Markovian RSA,一种测试时间计算方法,通过递归聚合并行推理轨迹,在每轮之间只向前传递有限长度的推理尾部。在 TTC 评估中,Markovian RSA 将 ZAYA1-8B 提升到 AIME'25 的 91.9% 和 HMMT'25 的 89.6%,仅向前传递 4K 个令牌的尾部,缩小了与包括 Gemini-2.5 Pro、DeepSeek-V3.2 和 GPT-5-High 等更大推理模型之间的差距。

英文摘要

We present ZAYA1-8B, a reasoning-focused mixture-of-experts (MoE) model with 700M active and 8B total parameters, built on Zyphra's MoE++ architecture. ZAYA1-8B's core pretraining, midtraining, and supervised fine-tuning (SFT) were performed on a full-stack AMD compute, networking, and software platform. With under 1B active parameters, ZAYA1-8B matches or exceeds DeepSeek-R1-0528 on several challenging mathematics and coding benchmarks, and remains competitive with substantially larger open-weight reasoning models. ZAYA1-8B was trained from scratch for reasoning, with reasoning data included from pretraining onward using an answer-preserving trimming scheme. Post-training uses a four-stage RL cascade: reasoning warmup on math and puzzles; a 400-task RLVE-Gym curriculum; math and code RL with test-time compute traces and synthetic code environments built from competitive-programming references; and behavioral RL for chat and instruction following. We also introduce Markovian RSA, a test-time compute method that recursively aggregates parallel reasoning traces while carrying forward only bounded-length reasoning tails between rounds. In TTC evaluation, Markovian RSA raises ZAYA1-8B to 91.9\% on AIME'25 and 89.6\% on HMMT'25 while carrying forward only a 4K-token tail, narrowing the gap to much larger reasoning models including Gemini-2.5 Pro, DeepSeek-V3.2, and GPT-5-High.

2605.05360 2026-05-08 cs.LG cs.AI

COPYCOP: Ownership Verification for Graph Neural Networks

COPYCOP:图神经网络的版权验证

Rahul Nandakumar, Deepayan Chakrabarti

发表机构 * McCombs School of Business(麦科姆斯商学院) University of Texas at Austin(德克萨斯大学奥斯汀分校)

AI总结 研究提出COPYCOP算法,用于检测图神经网络间是否存在模仿关系,通过理论保证和实验验证其在多种数据集和架构上的准确性和鲁棒性。

详情
AI中文摘要

给定两个输出节点嵌入的图神经网络(GNN),如何确定它们是否独立训练?攻击者可能专门训练一个GNN以模仿另一个GNN的嵌入。为掩盖这种关系,对抗性GNN可能会变换其输出嵌入。尽管存在不同的架构、权重和嵌入维度,我们的算法(命名为CopyCop)能够识别此类模仿的GNN,不同于现有的水印和指纹方法。我们还为CopyCop提供了理论保证。最后,对14个数据集和5种GNN架构的实验表明,CopyCop在广泛对抗性攻击和变换中具有准确性和鲁棒性。代码可在:https://anonymous.4open.science/r/CopyCop-Graph-Ownership-Verification-8143/README.md

英文摘要

Given two GNNs that output node embeddings, how can we determine if they were trained independently? An adversary could have trained one GNN specifically to mimic the other GNN's embeddings. To obscure this relationship between the GNNs, the adversarial GNN might then transform its output embeddings. The two GNNs could have different architectures, weights, and embedding dimensions, and the adversary can transform the embeddings. Despite these stringent conditions, our algorithm (named CopyCop) can identify such copycat GNNs, unlike existing watermarking and fingerprinting methods. We also provide theoretical guarantees for CopyCop. Finally, experiments on 14 datasets and 5 GNN architectures demonstrate that CopyCop is accurate and robust against a broad class of adversarial attacks and transformations. Code is available at: https://anonymous.4open.science/r/CopyCop-Graph-Ownership-Verification-8143/README.md

2605.05358 2026-05-08 cs.LG cs.CV

Balancing Stability and Plasticity in Sequentially Trained Early-Exiting Neural Networks

在顺序训练的早期退出神经网络中平衡稳定性与可塑性

Alaa Zniber, Ouassim Karrakchou, Mounir Ghogho

发表机构 * TICLab, International University of Rabat, Morocco(拉巴特国际大学TIC实验室,摩洛哥) College of Computing, University Mohammed VI Polytechnic, Morocco(摩洛哥穆莱·伊斯梅尔Polytechnic大学计算机学院) School of Electronic and Electrical Engineering, University of Leeds, UK(利兹大学电子与电气工程学院)

AI总结 本文提出两种方法平衡早期退出网络的稳定性与可塑性,通过保护关键参数和保留输出分布提升性能,实验显示在低计算预算下有显著加速。

Comments Accepted for publication at IEEE ICIP 2026

详情
AI中文摘要

早期退出神经网络通过允许输入在中间分类器中退出,实现自适应推理,减少简单样本的计算量同时保持高精度。在实践中,退出可以顺序训练,逐步添加到共享骨干网络中;然而,这种顺序训练会导致新引入的退出干扰已学习的退出,降低早期分类器性能。我们通过保留现有退出中的知识并允许新退出专业化来解决此问题。我们提出两种替代方法,分别在模型的不同层次上操作。第一种通过保护对已训练退出重要的参数来约束学习,第二种在网络适应时保留早期退出的输出分布。这些方法直接反映了持续学习中研究的稳定性-可塑性权衡。因此,我们利用弹性权重巩固(Elastic Weight Consolidation)来约束关键权重,并利用无遗忘学习(Learning without Forgetting)来保留输出分布。在标准基准上的实验表明,我们的方法一致提升了早期退出性能,实现了比现有顺序训练方法更高的准确性和在低计算预算下的显著加速。

英文摘要

Early-exiting neural networks enable adaptive inference by allowing inputs to exit at intermediate classifiers, reducing computation for easy samples while maintaining high accuracy. In practice, exits can be trained sequentially by incrementally adding them to a shared backbone; however, this sequential training can cause newly introduced exits to interfere with previously learned ones, degrading the performance of earlier classifiers. We address this problem by retaining the knowledge embedded in existing exits while allowing new ones to specialize. We propose two alternative approaches that operate at different levels of the model. The first constrains learning by protecting parameters that are important for previously trained exits, while the second preserves the output distributions of earlier exits as the network adapts. These alternatives directly reflect the stability-plasticity trade-off studied in continual learning. Accordingly, we leverage \textit{Elastic Weight Consolidation} to constrain critical weights and \textit{Learning without Forgetting} to preserve output distributions. Experiments on standard benchmarks show that our approaches consistently improve early-exit performance, achieving higher accuracy over existing sequential training methods and significant performance speedups at low computational budgets.

2605.05354 2026-05-08 cs.LG

A Multi-Head Attention Approach for SLA Compliance Monitoring in Data Centers

面向数据中心SLA合规监控的多头注意力方法

Omanshu Thapliyal

发表机构 * Researcher, Strategic Data Solutions Lab(研究人员,战略数据解决方案实验室) Hitachi America Ltd.(日立美国有限公司) Santa Clara, USA(美国圣克拉拉)

AI总结 本文提出一种多头注意力模型,用于提前预测数据中心SLA违规,通过结构化JSON编码SLA规则生成训练数据,提升合规监控效率和财务风险控制能力。

Comments 6 pages, 9 figures, 46th IEEE International Conference on Distributed Computing Systems

详情
AI中文摘要

数据中心合规模板中的服务级别协议(SLA)定义了电力、温度和湿度的精确阈值,违规惩罚以每月固定费用中的信用额度形式表示。传统监控方法只能在违规发生后检测,限制了补救机会。本文提出一种框架,将SLA规则编码为结构化JSON对象以生成训练数据,无需人工标注。我们训练了一个按客户定制的多头Transformer模型,每个注意力头专注于一个SLA规则,学习导致违规前30分钟的时序依赖性。训练后,推理服务生成结构化预测事件,并将其转换为三种角色特定的视图:财务视图暴露信用责任,运营视图揭示风险评分和推荐干预措施,合规视图将预测与不可变的 telemetry 签名捆绑以供审计。通过将模型架构直接对齐合同义务,该框架使运营商能够预测SLA违规,优先处理纠正措施并减少财务处罚。

英文摘要

Service level agreements (SLAs) in data center colocation contracts define precise thresholds for power, temperature, and humidity, with tiered violation penalties expressed as credits against monthly recurring charges. Traditional reactive monitoring detects breaches only after they occur, limiting remediation opportunities. We present a framework that encodes SLA rules as structured JSON objects to generate training data without manual annotation. We train a per-customer multi-head transformer model in which each attention head specializes in one SLA rule, learning temporal dependencies that precede violations by 30 minutes. Post-training, the inference service emits structured prediction events transformed into three role-specific views: finance schemas exposing credit liability, operations schemas surfacing risk scores and recommended interventions, and compliance schemas bundling predictions with immutable telemetry signatures for audit. By aligning model architecture directly with contractual obligations, this framework enables operators to anticipate SLA breaches, prioritize corrective actions, and minimize financial penalties.

2605.05353 2026-05-08 cs.CL cs.AI

Counterargument for Critical Thinking as Judged by AI and Humans

对AI和人类判断下批判性思维的反驳论研究

Tosin Adewumi, Marcus Liwicki, Foteini Simistira Liwicki, Lama Alkhaled, Hamam Mokayed, Esra Sümer-Arpak

发表机构 * Machine Learning Group, EISLAB, Luleå University of Technology(机器学习组、EISLAB、吕勒奥技术大学)

AI总结 本研究探讨生成式AI环境下学生写作中反驳论的使用,发现学生自写反驳论包含逻辑等批判性思维要素,且生成式AI可基于明确标准评估学生写作,与人类评估结果一致。

Comments 9 pages

详情
AI中文摘要

本研究通过干预方法探讨了在生成式AI环境下学生写作中反驳论的使用,尤其在存在作弊风险和认知卸载的情况下。我们向某大学课程中的36名学生提供了4个精心挑选的论点陈述(来自热门辩论集),让他们进行写作。我们使用六个已建立的评分标准(焦点、逻辑、内容、风格、正确性和参考)对35份合格的写作进行三次人工评估(两次学生互评和一次经验教师评估),在5点李克特量表上对所有合格样本(n)进行评分。使用相同的评分标准和指南,我们还用六个前沿大语言模型作为评委评估了这些提交。我们的混合方法设计包括每个评估的定性开放式反馈和定量方法。结果显示,(1)学生自行撰写的针对AI生成内容的反驳论包含逻辑,这是批判性思维的关键组成部分,(2)生成式AI可以基于明确的评分标准大规模评估学生的书面作品,这些评估结果通常与人类评估结果一致,如Gwets AC2一致性值在所有模型中除了一个外均为0.33。

英文摘要

This intervention study investigates the use of counterarguments in writing for critical thinking by students in the context of Generative AI (GenAI). This is especially as risks of cheating and cognitive offloading exist with the use of GenAI. We presented 36 students in a particular university course with 4 carefully selected thesis statements (from a set of popular debates) to write about anyone of them. We used six established rubrics (focus, logic, content, style, correctness and reference) to conduct three human assessments (two student peer-reviews and one experienced teacher) per writeup on a 5-point Likert scale for all the qualified samples (n) of 35 submissions (after disqualifying one for irregularity). Using the same rubrics and guidelines, we also assessed the submissions using six frontier LLMs as judges. Our mixed-method design included qualitative open-ended feedback per assessment and quantitative methods. The results reveal that (1) the students' self-written counterarguments to AI-generated content contains logic, among other things, which is a key component of critical thinking, and (2) GenAI can be successfully used at scale to assess students' written work, based on clear rubrics, and these assessments generally align with human assessments as shown with Gwets AC2 inter-rater reliability values of 0.33 for all the models except one.

2605.05351 2026-05-08 cs.CV

egenioussBench: A New Dataset for Geospatial Visual Localisation

egenioussBench:一个用于地理视觉定位的新数据集

Phillipp Fanta-Jende, Francesco Vultaggio, Alexander Kern, Yasmin Loeper, Markus Gerke

发表机构 * Autonomous Systems, Center for Vision, Automation(自主系统,视觉、自动化中心)

AI总结 本文提出一个基于地理参考数据的视觉定位基准,包含城市级空中3D网格和CityGML LoD2模型,通过智能手机图像和高精度地面真值实现大规模可扩展定位挑战。

详情
AI中文摘要

我们介绍了egenioussBench,一个基于地理参考数据的视觉定位基准:一个城市级空中3D网格和一个CityGML LoD2模型。这种配对反映了可部署的测绘资产,并支持超越传统SfM方法的真实可扩展性。查询数据包含通过PPK和GCP/CP辅助调整获得的智能手机图像,具有厘米级精度的、与地图无关的地面真值。从2,709张图像中,我们通过渲染深度估计完整的共视矩阵,并选择最大独立集,推导出非共视子集;发布的数据包括一个测试集42张非共视图像(地面真值被保留)和一个验证集412张连续图像(包含姿态,例如用于训练姿态回归器和自我验证)。该基准具有一个公开的排行榜,通过多个姿态误差阈值的分箱指标进行评估,同时包含全局统计(中位数、RMSE、异常比率),确保公平、同类比较。这些设计选择暴露了现实的跨视图和跨领域挑战,同时为推进大规模视觉定位提供了严格、可扩展的路径。我们将在https://github.com/fratopa/egenioussBench和https://www.egeniouss.eu上提供评估代码和数据。

英文摘要

We present egenioussBench, a visual localisation benchmark built on geospatial reference data: a city-scale airborne 3D mesh and a CityGML LoD2 model. This pairing reflects deployable mapping assets and supports true scalability beyond traditional SfM-based approaches. The query data comprise smartphone images with centimetre-accurate, map-independent ground truth obtained via PPK and GCP/CP-aided adjustment. From 2,709 images, we derive a non-co-visible subset by estimating the full co-visibility matrix from rendered depth and selecting a maximum independent set; the released data include a test split of 42 non-co-visible images with withheld ground truth and a validation split of 412 sequential images with poses, e.g. for training of pose regressors and self-validation. The benchmark features a public leaderboard evaluated with binning metrics at multiple pose-error thresholds alongside global statistics (median, RMSE, outlier ratio), ensuring fair, like-for-like comparison across mesh- and LoD2-based methods. Together, these design choices expose realistic cross-view and cross-domain challenges while providing a rigorous, scalable path for advancing large-scale visual localisation. We make the evaluation code and data availeable at https://github.com/fratopa/egenioussBench and https://www.egeniouss.eu/

2605.05344 2026-05-08 cs.CV cs.AI cs.IR

Open-SAT: LLM-Guided Query Embedding Refinement for Open-Vocabulary Object Retrieval in Satellite Imagery

Open-SAT: 一种基于LLM的查询嵌入细化方法用于卫星影像开放词汇物体检索

Md Adnan Arefeen, Biplob Debnath, Ravi K. Rajendran, Murugan Sankaradas, Srimat T. Chakradhar

发表机构 * North South University(北南大学) NEC Laboratories America(NEC实验室)

AI总结 Open-SAT通过LLM引导在推理阶段优化查询与卫星影像内容的对齐,提升开放词汇检索性能,实验表明其F1分数提升达16.04%。

详情
AI中文摘要

在卫星应用中,用户查询通常为开放式的自然语言,超出预定义类别。这给图像检索带来挑战,因为系统需泛化到未见过的对象和概念。尽管视觉-语言模型(VLMs)如CLIP被广泛用于文本-图像检索,但即使微调变体也难以准确对齐此类查询与卫星影像。为此,我们提出Open-SAT,一种无需训练的查询嵌入细化算法,通过推理阶段优化查询与卫星图像内容的对齐。Open-SAT利用VLMs计算图像块的嵌入,并存储在向量数据库中以实现高效检索。在查询阶段,它利用大型语言模型(LLMs)通过整合感兴趣对象及其周围环境的上下文信息来细化文本嵌入。阈值自由的检索机制进一步提升了准确性和效率。在三个公开基准测试中,实验结果表明Open-SAT将F1分数提升高达16.04%,同时检索的图像块数量相当。这些结果证明了Open-SAT在开放词汇卫星图像检索中的有效性,无需额外训练或监督。

英文摘要

In satellite applications, user queries often take the form of open-ended natural language, extending beyond a fixed set of predefined categories. This open-vocabulary nature poses significant challenges for retrieving relevant image tiles, as the retrieval system must generalize to a wide range of unseen objects and concepts. While vision-language models (VLMs) such as CLIP are widely used for text-image retrieval, even fine-tuned variants often struggle to accurately align such queries with satellite imagery. To address this, we propose Open-SAT, a training-free query embedding refinement algorithm that operates at inference time to improve alignment between user queries and satellite image content. Open-SAT uses VLMs to compute embeddings for image tiles, which are stored in a vector database for efficient retrieval. At query time, it leverages Large Language Models (LLMs) to refine the text embeddings by incorporating contextual information about objects of interest and their surroundings. A threshold-free retrieval mechanism further enhances accuracy and efficiency. Experimental results in three public benchmarks demonstrate that Open-SAT improves the F1 score by up to 16.04%, while retrieving a comparable number of image tiles. These results demonstrate the effectiveness of Open-SAT in open-vocabulary satellite image retrieval, leveraging LLM guidance without the need for additional training or supervision.

2605.05341 2026-05-08 cs.LG cs.AI math.OC stat.ML

Feature Starvation as Geometric Instability in Sparse Autoencoders

稀疏自编码器中的特征枯竭作为几何不稳定性

Faris Chaudhry, Keisuke Yano, Anthea Monod

发表机构 * Imperial College London South Kensington Campus(伦敦帝国理工学院南肯辛顿校区) Institute for Statistical Mathematics(统计数学研究所)

AI总结 本文提出了一种自适应弹性净稀疏自编码器(AEN-SAEs),通过引入ℓ2结构项和自适应ℓ1重新加权,解决稀疏自编码器中的特征枯竭问题,同时保持重建能力。

Comments 26 pages, 3 figures, 5 tables

详情
AI中文摘要

稀疏自编码器(SAEs)用于将大型语言模型(LLMs)的密集、多义内部表示解耦为可解释的单义概念。然而,标准ℓ1正则化SAEs面临特征枯竭(死神经元)和收缩偏差问题,通常需要计算昂贵的启发式重采样和非可微的硬掩码方法来克服这些挑战。我们主张特征枯竭不仅仅是数据多样性差的偶然现象,而是过完备字典的优化-几何病理:ℓ1诱导的稀疏编码映射不稳定且根本上与浅层、近似编码器不一致。为了解决这种结构不稳定性,我们引入了自适应弹性净SAEs(AEN-SAEs),一种完全可微的架构,基于经典的稀疏回归。AEN-SAEs结合了一个ℓ2结构项,强制强凸性和Lipschitz稳定性,以及自适应ℓ1重新加权,消除收缩偏差并抑制虚假特征,从而共同控制诱导多面体几何的曲率和交互结构。理论上,我们证明AEN-SAEs产生Lipschitz连续的稀疏编码映射,并在温和假设下恢复全局特征支持。经验上,在合成设置和LLMs(Pythia 70M,Llama 3.1 8B)中,AEN-SAEs在不使用辅助启发式方法的情况下缓解了特征枯竭,同时保持了竞争性的重建能力。

英文摘要

Sparse autoencoders (SAEs) are used to disentangle the dense, polysemantic internal representations of large language models (LLMs) into interpretable, monosemantic concepts. However, standard $\ell_1$-regularized SAEs suffer from feature starvation (dead neurons) and shrinkage bias, often requiring computationally expensive heuristic resampling and nondifferentiable hard-masking methods to bypass these challenges. We argue that feature starvation is not merely an empirical artifact of poor data diversity, but a fundamental optimization-geometric pathology of overcomplete dictionaries: the $\ell_1$-induced sparse coding map is unstable and fundamentally misaligned with shallow, amortized encoders. To address this structural instability, we introduce adaptive elastic net SAEs (AEN-SAEs), a fully differentiable architecture grounded in classical sparse regression. AEN-SAEs combine an $\ell_2$ structural term that enforces strong convexity and Lipschitz stability with adaptive $\ell_1$ reweighting that eliminates shrinkage bias and suppresses spurious features, thereby jointly controlling the curvature and interaction structure of the induced polyhedral geometry. Theoretically, we show that AEN-SAEs yield a Lipschitz-continuous sparse coding map and recover the global feature support under mild assumptions. Empirically, across synthetic settings and LLMs (Pythia 70M, Llama 3.1 8B), AEN-SAEs mitigate feature starvation without auxiliary heuristics while maintaining competitive reconstruction abilities.

2605.05339 2026-05-08 cs.RO math.OC

Passive Fault Tolerance through Tension-to-Thrust Feed-Forward: Hybrid Input-to-State Stability for Decentralized Multi-UAV Slung-Load Transport under Abrupt Cable Severance

通过张力到推力前馈实现被动容错:用于突发缆线断裂下分布式多无人机吊挂运输的混合输入到状态稳定性

Hadi Hajieghrary, Paul Schmitt

发表机构 * Independent Researcher(独立研究者) Reynolds & Moore

AI总结 本文提出一种被动架构,通过将每个无人机测量的缆线张力直接输入到其高度推力指令中,以实现突发缆线断裂下的分布式多无人机吊挂运输的混合输入到状态稳定性。

Comments Submitted for review at IEEE Transactions on Control Systems Technology For the paper and simulation code see: https://github.com/Hadi-Hajieghrary/Tether_Grace.git

详情
AI中文摘要

在多无人机吊挂运输中,突发缆线断裂会重新分配负载并改变活跃约束集,留下有限时间进行故障诊断和重新配置。现有控制器依赖于协调力分配、同伴状态交换或固定缆线拓扑,因此缺乏对未通知断裂的认证分布式恢复机制。我们提出了一种被动架构,将每个车辆测量的缆线张力直接路由到其高度推力指令中,$T_i^{\mathrm{ff}}=T_i$,同时周围比例-导数、抗摆和投影级联保持局部跟踪可行性。主要贡献是一个条件混合实用输入到状态稳定性证书,将松弛- excursion- 有界紧缆减少、断裂后有界Lyapunov跳跃、故障间衰减和每故障周期收缩$ρ\in (0,1)$组成一个显式的恢复包络,在声明的执行器、松弛和驻留假设下。我们通过Drake多体仿真验证控制器,使用五架无人机、10 kg负载、Kelvin-Voigt缆线、Dryden风和单次和双次断裂计划:闭环达到0.312-0.328 m RMSE,76.1-95.2 mm峰值下垂,且在负载摆动周期内恢复。禁用身份使巡航误差增加34-39%,峰值下垂增加3.6x-4.0x,识别局部张力前馈作为测试分布式级联中的主导被动恢复机制。

英文摘要

Abrupt cable severance in multi-UAV slung-load transport redistributes load and changes the active constraint set, leaving limited time for fault diagnosis and reconfiguration. Existing controllers rely on coordinated force allocation, peer-state exchange, or fixed cable topology, and therefore lack a certified decentralized recovery mechanism for unannounced severance. We present a passive architecture that routes each vehicle's measured cable tension directly into its altitude thrust command, $T_i^{\mathrm{ff}}=T_i$, while a surrounding proportional-derivative, anti-swing, and projection cascade preserves local tracking feasibility. The main contribution is a conditional hybrid practical input-to-state-stability certificate that composes a slack-excursion-bounded taut-cable reduction, bounded post-severance Lyapunov jumps, inter-fault decay, and per-fault-cycle contraction $ρ\in (0,1)$ into an explicit recovery envelope under stated actuator, slack, and dwell assumptions. We validate the controller in Drake multibody simulation with five vehicles, a 10 kg payload, Kelvin-Voigt cables, Dryden wind, and single- and dual-severance schedules: the closed loop attains 0.312-0.328 m RMSE, 76.1-95.2 mm peak sag, and recovery within one payload-pendulum period. Disabling the identity inflates cruise error by 34-39% and peak sag by 3.6x-4.0x, identifying local tension feed-forward as the dominant passive recovery mechanism in the tested decentralized cascade.

2605.05338 2026-05-08 cs.RO

Track A*: Fast Visibility-Aware Trajectory Planning for Active Target Tracking

Track A*: 为主动目标跟踪的快速可见性感知轨迹规划

Hanxuan Chen, Kangli Wang, Ji Pei

发表机构 * Autel Robotics(奥特 Robotics)

AI总结 本文提出Track A*,一种基于离线搜索的轨迹规划器,用于在离散的四维时空网格上实现可见性感知的目标跟踪。通过分层有向无环图搜索和三种工程优化,Track A*在保证实用性的同时提升了可扩展性,实验证明其在计算成本低的情况下具有稳健的可见性性能。

详情
AI中文摘要

为构建多模跟踪数据集和在可重复条件下评估在线跟踪规划器,主动目标跟踪需要离线参考轨迹。我们提出了Track A star (TA star),一种针对离散四维时空网格(x, y, z, t)上的可见性感知目标跟踪目标的离线搜索基于轨迹规划器。TA star结合了分层有向无环图(DAG)搜索与三种工程优化:对抗包围体积层次结构(BVH)的跨时间障碍距离缓存、每层束修剪和可配置的多射线可见性评估器。TA star在该离散图上使用束修剪启发式搜索,以高效地找到高质量的跟踪轨迹。尽管它以牺牲严格的理论最优性为代价,但我们的实验证明在计算成本低的情况下具有稳健的接近基线的可见性性能。在1000个场景的压力测试中,TA star在八个CARLA优化地图上收敛所有场景并在32个工作线程中完成,耗时45秒;在248个场景的受控比较中,与未优化的优先队列A star基线(二叉堆实现)相比,TA star在相同场景输入和5 x 10^6扩展上限下,将平均规划时间减少23.0倍,最坏情况规划时间减少11.8倍,同时将收敛率从56.9%提高到100%。在n=141基线收敛子集中,TA star仅将平均可见性降低0.15个百分点(pp),没有场景超过5 pp的下降。我们在此特定条件下将TA star定位为一个实用的离线参考规划器,讨论了在如Town07密集植被等环境中的限制和失败案例。

英文摘要

Offline reference trajectories for active target tracking are needed both for building multi-modal tracking datasets and for benchmarking online tracking planners under repeatable conditions. We present Track A star (TA star), an offline search-based trajectory planner that targets the visibility-aware target tracking objective on a discretized four-dimensional spatio-temporal grid (x, y, z, t). TA star combines a layered Directed Acyclic Graph (DAG) search with three engineering optimizations: cross-time obstacle distance caching against a Bounding Volume Hierarchy (BVH), per-layer beam pruning, and a configurable multi-ray visibility evaluator. TA star employs a beam-pruned heuristic search on this discrete graph to efficiently find high-quality tracking trajectories. While it trades strict theoretical optimality for practical scalability, our empirical results demonstrate robust, near-baseline visibility performance at a fraction of the computational cost. On a 1000-scenario stress test across eight CARLA Optimized maps, TA star converges on all scenarios and completes in 45 s using 32 workers; on a 248-scenario controlled comparison against an unoptimized priority-queue A star baseline (BinaryHeap implementation) under identical scenario inputs and a 5 x 10^6 expansion cap, TA star reduces mean planning time by 23.0x and worst-case planning time by 11.8x, while raising convergence from 56.9% to 100%. On the n=141 baseline-converged subset, TA star changes average visibility by only -0.15 percentage points (pp), with no scenario exceeding a 5 pp drop. We position TA star as a practical offline reference planner under these specific conditions, with limitations and failure cases discussed for environments such as Town07 dense vegetation.

2605.05331 2026-05-08 cs.CV cs.AI cs.LG

ViTok-v2: Scaling Native Resolution Auto-Encoders to 5 Billion Parameters

ViTok-v2:将原生分辨率自编码器扩展到50亿参数

Philippe Hansen-Estruch, Jiahui Chen, Vivek Ramanujan, Orr Zohar, Yan Ping, Animesh Sinha, Markos Georgopoulos, Edgar Schoenfeld, Ji Hou, Felix Juefei-Xu, Sriram Vishwanath, Ali Thabet

发表机构 * University of Texas, Austin(德克萨斯大学奥斯汀分校) University of Washington(华盛顿大学) Stanford University(斯坦福大学) Spellbrush Meta Superintelligence Labs(Meta超智能实验室) Georgia Institute of Technology(佐治亚理工学院)

AI总结 ViTok-v2通过NaFlex实现跨分辨率和宽高比泛化,引入新的DINOv3感知损失,提升重建性能并稳定训练,达到当前最佳水平。

详情
AI中文摘要

视觉变换器(ViT)自编码器已崭露头角,成为图像的有力分词器,提供了优于卷积分词器的重建性能。然而,现有ViT分词器无法探索这一领域,因为性能在训练分辨率之外会下降,且依赖对抗损失阻碍了稳定扩展。ViTok(Hansen-Estruch等,2025)发现压缩比r调解了重建-生成的权衡,较低的r意味着更好的重建但更难生成,因此提高分词器的重建性能是更帕累托最优分词器的关键。我们引入了ViTok-v2,通过NaFlex实现跨分辨率和宽高比的泛化,以及新的DINOv3感知损失,取代LPIPS和GAN目标,以在任何规模下实现稳定训练。ViTok-v2在约20亿张图像上训练并扩展到50亿参数,成为迄今为止最大的图像自编码器。ViTok-v2在256p分辨率上达到或超越了当前最佳重建水平,在512p及更高分辨率上超越了所有基线。在与流匹配生成器的联合扩展实验中,我们展示了同时扩展自编码器和生成器可以推进这一权衡的帕累托前沿。

英文摘要

Vision Transformer (ViT) autoencoders have emerged as compelling tokenizers for images, offering improved reconstruction over convolutional tokenizers. However, existing ViT tokenizers cannot explore this landscape as performance degrades outside training resolutions, and reliance on adversarial losses prevents stable scaling. ViTok (Hansen-Estruch et al., 2025) found that the compression ratio r mediates a reconstruction-generation trade-off where lower r means better reconstructions but harder generations, so improving tokenizer reconstruction is key to more Pareto-optimal tokenizers. We introduce ViTok-v2, which addresses these limitations with native resolution support via NaFlex for generalization across resolutions and aspect ratios, and a novel DINOv3 perceptual loss that replaces both LPIPS and GAN objectives for stable training at any scale. ViTok-v2 is trained on about 2B images and scaled to 5B parameters, the largest image autoencoder to date. ViTok-v2 matches or exceeds state-of-the-art reconstruction at 256p and outperforms all baselines at 512p and above. In joint scaling experiments with flow matching generators, we show that scaling both the autoencoder and the generator advances the Pareto frontier of this trade-off.

2605.05329 2026-05-08 cs.AI cs.LG

Understanding Annotator Safety Policy with Interpretability

通过可解释性理解标注者安全政策

Alex Oesterling, Donghao Ren, Yannick Assogba, Dominik Moritz, Sunnie S. Y. Kim, Leon Gatys, Fred Hohman

发表机构 * Harvard University(哈佛大学) Apple(苹果公司)

AI总结 本文提出Annotator Policy Models,通过学习标注行为揭示安全政策差异,解决标注分歧根源问题,提升安全政策设计的透明性和包容性。

Comments 38 pages, 13 figures, ACM FAccT 2026

详情
AI中文摘要

安全政策定义了安全和不安全的人工智能输出,指导数据标注和模型开发。然而,标注分歧普遍存在,可能源于操作失误(标注者误解或误执行任务)、政策模糊(政策措辞留有解释空间)或价值多元主义(不同标注者对安全有不同的观点)。区分这些来源至关重要。例如,操作失误需要质量控制,模糊需要政策澄清,多元主义需要讨论纳入不同观点。然而,理解为何标注者存在分歧是困难的。直接询问标注者理由成本高,显著增加标注负担,并且对人类和LLM标注者来说,自报理由往往无法反映实际决策过程。我们引入Annotator Policy Models(APMs),可解释模型,通过学习标注行为单独学习标注者的内部安全政策,使标注者推理可见且可比较,而无需额外标注努力。我们验证APMs准确建模标注者安全政策(>80%准确率),忠实预测对反事实编辑的响应,并在受控环境中恢复已知的政策差异。将APMs应用于LLM和人类标注,我们展示了两个核心应用:(1)通过揭示标注者如何不同地解读安全指令来揭示政策模糊性;(2)通过发现不同人口群体在安全优先级上的系统性差异来揭示价值多元主义。这些能力共同支持更精准、透明和包容的安全政策设计。

英文摘要

Safety policies define what constitutes safe and unsafe AI outputs, guiding data annotation and model development. However, annotation disagreement is pervasive and can stem from multiple sources such as operational failures (annotators misunderstand or misexecute the task), policy ambiguity (policy wording leaves room for interpretation), or value pluralism (different annotators hold different perspectives on safety). Distinguishing these sources matters. For example, operational failures call for quality control, ambiguity calls for policy clarification, and pluralism calls for deliberation about incorporating diverse perspectives. Yet understanding why annotators disagree is difficult. Directly asking annotators for their reasoning is costly, substantially increasing annotation burden, and can be unreliable for both human and LLM annotators as self-reported reasoning often fails to reflect actual decision processes. We introduce Annotator Policy Models (APMs), interpretable models that learn annotators' internal safety policies from labeling behavior alone, making annotator reasoning visible and comparable without additional annotation effort. We validate that APMs accurately model annotator safety policy (>80% accuracy), faithfully predict responses to counterfactual edits, and recover known policy differences in controlled settings. Applying APMs to LLM and human annotations, we demonstrate two core applications: (1) surfacing policy ambiguity by revealing how annotators interpret safety instructions differently, and (2) surfacing value pluralism by uncovering systematic differences in safety priorities across demographic groups. Together, these capabilities support more targeted, transparent, and inclusive safety policy design.