arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2086
2501.00106 2026-05-08 cs.SE cs.AI

LicenseGPT: A Fine-tuned Foundation Model for Publicly Available Dataset License Compliance

LicenseGPT:一种针对公开数据集许可合规的微调基础模型

Jingwen Tan, Gopi Krishnan Rajbahadur, Zi Li, Xiangfu Song, Jianshan Lin, Dan Li, Zibin Zheng, Ahmed E. Hassan

发表机构 * Sun Yat-Sen University(中山大学) Huawei(华为) Queen's University(女王大学)

AI总结 本文提出LicenseGPT,一种专门用于数据集许可合规分析的微调基础模型,通过提升预测准确率和减少分析时间,为法律从业者提供高效工具。

详情
AI中文摘要

数据集许可合规是开发商业AI产品中的关键且复杂问题,尤其在越来越多地使用公开数据集时。数据集许可的模糊性带来了重大法律风险,即使对软件知识产权律师来说,准确解读权利和义务也极具挑战性。本文介绍了LicenseGPT,一种专门针对数据集许可合规分析的微调基础模型(FM)。我们首先评估了现有法律FM(即专门用于理解和处理法律文本的FM),发现表现最好的模型仅能达到43.75%的预测一致率(PA)。LicenseGPT在由法律专家标注的500个许可数据集上进行微调,显著将PA提升至64.30%,优于法律和通用FM。通过A/B测试和与软件知识产权律师的用户研究,我们证明LicenseGPT将分析时间减少了94.44%,从108秒降至每份许可6秒,而不影响准确性。软件知识产权律师认为LicenseGPT是一种有价值的补充工具,提高了效率,同时承认在复杂情况下仍需人类监督。我们的工作强调了专门AI工具在法律实践中的潜力,并为从业者和研究人员提供了一个公开可用的资源。

英文摘要

Dataset license compliance is a critical yet complex aspect of developing commercial AI products, particularly with the increasing use of publicly available datasets. Ambiguities in dataset licenses pose significant legal risks, making it challenging even for software IP lawyers to accurately interpret rights and obligations. In this paper, we introduce LicenseGPT, a fine-tuned foundation model (FM) specifically designed for dataset license compliance analysis. We first evaluate existing legal FMs (i.e., FMs specialized in understanding and processing legal texts) and find that the best-performing model achieves a Prediction Agreement (PA) of only 43.75%. LicenseGPT, fine-tuned on a curated dataset of 500 licenses annotated by legal experts, significantly improves PA to 64.30%, outperforming both legal and general-purpose FMs. Through an A/B test and user study with software IP lawyers, we demonstrate that LicenseGPT reduces analysis time by 94.44%, from 108 seconds to 6 seconds per license, without compromising accuracy. Software IP lawyers perceive LicenseGPT as a valuable supplementary tool that enhances efficiency while acknowledging the need for human oversight in complex cases. Our work underscores the potential of specialized AI tools in legal practice and offers a publicly available resource for practitioners and researchers.

2410.01871 2026-05-08 cs.GT cs.AI cs.CY econ.GN q-fin.EC

Auction-Based Regulation for Artificial Intelligence

基于拍卖的人工智能监管

Marco Bornstein, Zora Che, Suhas Julapalli, Abdirisak Mohamed, Amrit Singh Bedi, Furong Huang

发表机构 * Department of Computer Science, University of Maryland, College Park, MD, USA(马里兰大学计算机科学系,College Park, MD, USA) SAP Labs, LLC(SAP实验室) Department of Computer Science, University of Central Florida, FL, USA(佛罗里达大学计算机科学系,FL, USA)

AI总结 本文提出基于拍卖的AI监管机制,通过数学框架激励企业部署合规模型并参与监管,实验证明其在提升合规率和参与率方面优于传统方法。

Comments 26 pages, 7 figures, 3 tables. Accepted at ACM FAccT 2026

详情
AI中文摘要

英文摘要

In an era of "moving fast and breaking things", regulators have moved slowly to pick up the safety, bias, and legal debris left in the wake of broken Artificial Intelligence (AI) deployment. While there is much-warranted discussion about how to address the safety, bias, and legal woes of state-of-the-art AI models, rigorous and realistic mathematical frameworks to regulate AI are lacking. Our paper addresses this challenge, proposing an auction-based regulatory mechanism that provably incentivizes agents (i) to deploy compliant models and (ii) to participate in the regulation process. We formulate AI regulation as an all-pay auction where enterprises submit models for approval. The regulator enforces compliance thresholds and further rewards models exhibiting higher compliance than their peers. We derive Nash Equilibria demonstrating that rational agents will submit models exceeding the prescribed compliance threshold. Empirical results show that our regulatory auction boosts compliance rates by 20% and participation rates by 15% compared to baseline regulatory mechanisms, outperforming simpler frameworks that merely impose minimum compliance standards.

2402.08106 2026-05-08 math.OC cs.LG math.PR

Mirror Descent-Ascent for mean-field min-max problems

镜像下降-上升法用于均场极小-极大问题

Razvan-Andrei Lascu, Mateusz B. Majka, Łukasz Szpruch

发表机构 * Center for Advanced Intelligence Project, RIKEN(日本东京RIKEN先进智能项目中心) School of Mathematical and Computer Sciences, Heriot-Watt University(赫里奥特-瓦特大学数学与计算机科学学院) Maxwell Institute for Mathematical Sciences, Edinburgh, UK(爱丁堡数学科学研究所) School of Mathematics, University of Edinburgh(爱丁堡大学数学学院) The Alan Turing Institute(艾伦·图灵研究所)

AI总结 本文研究了用于解决测度空间上极小-极大问题的镜像下降-上升算法的两种变体,分析了其收敛性,并提出了统一的理论基础。

Comments 57 pages; substantially revised version with improved presentation, re-worked main theorems, and added numerical experiments

详情
AI中文摘要

本文研究了用于解决测度空间上极小-极大问题的镜像下降-上升算法的两种变体,分析了其收敛性,并提出了统一的理论基础。

英文摘要

We study two variants of the mirror descent-ascent (MDA) algorithm for solving min-max problems on the space of measures: simultaneous and alternating. We work under assumptions of convexity-concavity and relative smoothness of the payoff function with respect to a suitable Bregman divergence, defined on the space of measures via flat derivatives. We establish non-asymptotic convergence rates to mixed Nash equilibria, measured in the Nikaidô-Isoda error, proving an $\mathcal{O}(N^{-1/2})$ rate for simultaneous MDA and an improved $\mathcal{O}(N^{-2/3})$ rate for alternating MDA. The main technical contribution is an infinite-dimensional dual space analysis that relates Bregman divergences on measures to dual Bregman divergences on spaces of bounded continuous functions, allowing us to control asymmetric commutator terms created by alternating updates. The results substantially generalize prior analyses restricted to bilinear objectives and also apply to nonlinear convex-concave problems on measure spaces, thereby providing a unified theoretical foundation for MDA in mean-field min-max optimization.

2605.05942 2026-05-08 quant-ph cs.LG

Architecture Shape Governs QNN Trainability: Jacobian Null Space Growth and Parameter Efficiency

架构形状决定QNN可训练性:雅可比零空间增长与参数效率

Michael Poppel, David Bucher, Maximilian Zorn, Markus Baumann, Sebastian Wölckert, Claudia Linnhoff-Popien, Philipp Altmann, Jonas Stein

发表机构 * Department of Computer Science, LMU Munich(慕尼黑大学计算机科学系) Aqarios GmbH(Aqarios公司)

AI总结 研究揭示架构形状(N,L)在固定编码预算E下影响QNN可训练性,通过分析雅可比矩阵的秩缺陷机制,发现串行单量子比特架构存在结构梯度饥饿现象,而并行架构通过独立相位轨迹避免此问题,且参数效率更高。

详情
AI中文摘要

变分量子电路通过角度编码实现截断傅里叶级数,架构安排N个量子比特与L个编码层,共享编码预算E=NL,生成相同的频率谱和频率冗余,并需要相同的最小参数数量以控制系数。尽管这种等价性,可训练性在固定E下随着架构形状(N,L)显著变化。我们识别出系数匹配雅可比矩阵J的结构秩缺陷是导致这一差异的机制。对于串行单量子比特架构,我们证明无论参数数量P如何,rank(J) ≤ 2L+1,且ker(J)的维数≥P-(2L+1)随P增加而无界,这一现象我们称为结构梯度饥饿:随着P增加,越来越多的参数在损失函数中被结构上解耦。并行架构通过独立的相位轨迹避免了这一问题,确保J^(par)的最小奇异值σ_min(J^(par)) > 0在P ≤ 2E+1时普遍成立,因此没有参数位于ker(J)中。对于实践者,我们进一步表明,增加特征映射(FM)层和增加可训练块这两种增加参数数量的自然途径有根本不同的影响:增加FM层单调增强雅可比矩阵QFIM特征值谱,并在1.6-2.2倍更少的参数下实现R²≥0.95,而增加可训练块仅通过经典插值机制提高训练,无量子特定优势。

英文摘要

Variational quantum circuits with angle encoding implement truncated Fourier series, and architectures arranging $N$ qubits with $L$ encoding layers each -- sharing encoding budget $E = NL$ -- generate identical frequency spectra, identical frequency redundancy, and require the same minimum parameter count for coefficient control. Despite this equivalence, trainability varies substantially with architecture shape $(N,L)$ at fixed $E$. We identify structural rank deficiency of the coefficient matching Jacobian $J$ as the mechanism responsible. For serial single-qubit architectures, we prove $\mathrm{rank}(J) \leq 2L+1$ regardless of parameter count $P$, with $\dim(\ker J) \geq P-(2L+1)$ growing without bound -- a phenomenon we term \emph{structural gradient starvation}: a growing fraction of parameters become structurally decoupled from the loss as $P$ increases at fixed $L$. Parallel architectures avoid this via independent phase trajectories, ensuring $σ_{\min}(J^{(\mathrm{par})}) > 0$ generically for $P \leq 2E+1$, so no parameter lies in $\ker J$. For practitioners, we further show that the two natural routes to increasing parameter count have fundamentally different effects: adding feature map (FM) layers monotonically strengthens the Jacobian QFIM eigenvalue spectrum and achieves $R^2 \geq 0.95$ with $1.6$--$2.2\times$ fewer parameters than adding trainable blocks across all tested architectures, while trainable blocks improve training only through the classical interpolation mechanism with no quantum-specific benefit.

2605.05920 2026-05-08 cs.AR cs.AI cs.PF

LLM-Driven Design Space Exploration of FPGA-based Accelerators

基于FPGA的加速器的LLM驱动的设计空间探索

Vinamra Sharma, Xingjian Fu, Jude Haris, José Cano

发表机构 * School of Computing Science, University of Glasgow(格拉斯哥大学计算机科学学院)

AI总结 本文提出SECDA-DSE框架,利用大语言模型自动化FPGA加速器的设计空间探索,通过结构化探索器和LLM栈实现高效配置生成与优化。

Comments Accepted to the Workshop on Intelligent System Design (InSyDe) co-located with EuroSys '26

详情
AI中文摘要

设计基于FPGA的现代人工智能加速器需要在包含架构参数、数据流策略和内存层次的复杂硬件设计空间中导航,这一过程耗时且资源消耗大。尽管SECDA方法通过SystemC仿真和FPGA执行实现快速硬件-软件协同设计,但确定最优加速器配置仍需大量手动工作和领域专业知识。本文提出SECDA-DSE框架,将大语言模型整合到SECDA生态系统中,包括围绕SECDA构建的工具,用于自动化FPGA加速器的设计空间探索。SECDA-DSE结合结构化的DSE探索器生成加速器配置,以及通过检索增强生成和思维链提示进行推理引导的LLM栈,同时包含反馈循环以实现持续改进。通过基于生成加速器设计的初步高层次综合评估,展示了SECDA-DSE在Zynq-7000 FPGA上满足综合时间和资源约束的可行性。

英文摘要

Designing field-programmable gate array (FPGA)-based accelerators for modern artificial intelligence workloads requires navigating a large and complex hardware design space encompassing architectural parameters, dataflow strategies, and memory hierarchies, making the process time-consuming and resource-intensive. While the SECDA methodology enables rapid hardware-software co-design of accelerators through SystemC simulation and FPGA execution, identifying optimal accelerator configurations still requires substantial manual effort and domain expertise. This work presents SECDA-DSE, a framework that integrates Large Language Models (LLMs) into the SECDA ecosystem, comprising tools built around SECDA to automate the design space exploration (DSE) of FPGA-based accelerators. SECDA-DSE combines a structured DSE Explorer for generating accelerator configurations with an LLM Stack that performs reasoning-guided exploration using retrieval-augmented generation and chain-of-thought prompting, alongside a feedback loop that enables reinforced fine-tuning for continuous improvement. We demonstrate the feasibility of SECDA-DSE through an initial high-level synthesis based evaluation of a generated accelerator design that meets synthesis timing and resource constraints on an Zynq-7000 FPGA.

2605.05914 2026-05-08 quant-ph cs.AI cs.LG

Quantum-enhanced Large Language Models on Quantum Hardware via Cayley Unitary Adapters

通过凯莱单位ary适配器在量子硬件上增强大型语言模型

Borja Aizpurua, Sukhbinder Singh, Augustine Kshetrimayum, Saeed S. Jahromi, Roman Orus

发表机构 * Multiverse Computing Parque Científico y Tecnológico de Gipuzkoa(吉普乌科阿科技公园) Department of Basic Sciences(基础科学系) Tecnun – University of Navarra(塔努大学) Centre for Social Innovation(社会创新中心) Donostia International Physics Center(多斯蒂亚国际物理中心) Ikerbasque Foundation for Science(Ikerbasque科学基金会)

AI总结 本研究通过在预训练语言模型的冻结投影层中插入量子电路块,利用IBM Quantum System Two处理器提升Llama 3.1 8B模型的困惑度,仅需6000个额外参数,并在真实量子处理器上验证端到端推理。

Comments 31 pages, 6 figures

详情
AI中文摘要

大型语言模型(LLMs)已彻底改变人工智能,但经典架构存在根本限制:每个可训练参数都需要经典内存,且随着模型规模增大而变得不可行。量子计算提供了一种质的不同路径,但实际硬件上的演示仍难以实现实用模型。本文展示,通过在预训练LLM的冻结投影层中插入凯莱参数化的量子电路块,并在156量子位的IBM Quantum System Two超导处理器上执行,可将Llama 3.1 8B模型的困惑度降低1.4%,仅需6000个额外参数,并在真实量子处理器上验证端到端推理。对SmolLM2(13500万参数)的系统研究显示,随着量子电路块维度的增加,困惑度单调降低,83%恢复压缩引起的退化,并能回答经典基线失败的问题,通过明显的噪声表达性相变识别出在更大量子位规模下实现量子实用性的具体路径。

英文摘要

Large language models (LLMs) have transformed artificial intelligence, yet classical architectures impose a fundamental constraint: every trainable parameter demands classical memory that scales unfavourably with model size. Quantum computing offers a qualitatively different pathway, but practical demonstrations on real hardware have remained elusive for models of practical relevance. Here we show that Cayley-parameterised unitary adapters -- quantum circuit blocks inserted into the frozen projection layers of pre-trained LLMs and executed on a 156-qubit IBM Quantum System Two superconducting processor -- improve the perplexity of Llama 3.1 8B, an 8-billion-parameter model in widespread use, by 1.4% with only 6,000 additional parameters and end-to-end inference validated on real Quantum Processing Unit (QPU). A systematic study on SmolLM2 (135M parameters), chosen for its tractability, reveals monotonically improving perplexity with unitary block dimension, 83% recovery of compression-induced degradation, and correct answers to questions that both classical baselines fail -- with a sharp noise-expressivity phase transition identifying the concrete path to quantum utility at larger qubit scales.

2605.05882 2026-05-08 stat.ML cs.AI cs.CY cs.LG

Tuning Derivatives for Causal Fairness in Machine Learning

为机器学习中的因果公平性调整导数

Filip Edström, Guilherme W. F. Barros, Tetiana Gorbach, Xavier de Luna

发表机构 * Department of Statistics, Umeå School of Business, Economics and Statistics(统计系,乌梅亚商学院、经济学和统计学学院) Integrated Science Lab, Department of Physics(整合科学实验室,物理系)

AI总结 本文提出一种针对连续保护属性的因果公平性框架,通过路径特定偏导数形式化统计平衡与预测平衡,并设计公平调优算法在无法实现完全公平时进行SP与PP的权衡。

详情
AI中文摘要

人工智能系统在社会中日益普及,但其预测通常继承种族、性别或年龄等保护属性的偏见。经典公平性观念,尤其是统计平衡(SP),要求预测独立于保护属性,但当这些属性影响中介变量且被视为商业必需时,SP过于严格。最近的因果方法通过区分允许和不允许的因果路径,并补充预测平衡(PP),要求预测器复制商业必需的影响。现有基于路径的定义主要适用于分类属性。本文提出一种新的公平性框架,适用于连续保护属性的结构因果模型。我们通过路径特定偏导数形式化SP和PP,建立这些标准与先前因果定义一致的条件,并刻画当预测器在不允许路径上满足SP而在允许路径上实现PP时存在的公平性。基于此理论,我们提出一个公平调优算法,要么构建此类预测器,或在无法实现时进行SP和PP的权衡。我们通过模拟和真实数据实验评估我们的方法,与先前方法进行比较,并展示在考虑PP时表现更优。

英文摘要

Artificial-intelligence systems are becoming ubiquitous in society, yet their predictions typically inherit biases with respect to protected attributes such as race, gender, or age. Classical fairness notions, most notably Statistical Parity (SP), demand that predictions be independent of the protected attributes, but are overly restrictive when these attributes influence mediating variables that are considered business necessities. Recent causal formulations relax SP by distinguishing allowed from not-allowed causal paths and by complementing SP with Predictive Parity (PP), requiring the predictor to replicate the legitimate influence of business-necessities. Existing path-based definitions are mainly practical when applied to categorical attributes. This paper introduces a new framework for fairness in structural causal models that is tailored to continuous protected attributes. We formalize SP and PP through path-specific partial derivatives, establish conditions under which these criteria coincide with prior causal definitions, and characterize when a fair predictor, one that satisfies SP along not-allowed paths while achieving PP along allowed paths, exists. Building on this theory, we propose a fair tuning algorithm that either constructs such a predictor or, when not possible, allows for a trade-off between SP and PP. We present experiments on simulated and real data to evaluate our proposal, compare it with previously proposed methods, and show that it performs better when PP is considered.

2605.05873 2026-05-08 stat.ML cs.AI cs.LG math.ST stat.ME stat.TH

CITE: Anytime-Valid Statistical Inference in LLM Self-Consistency

CITE: 在大语言模型自我一致性中实现任意时间有效的统计推断

Hirofumi Ota, Naoto Iwase, Yuki Ichihara, Junpei Komiyama, Masaaki Imaizumi

发表机构 * Komaba Institute for Science, Graduate School of Arts and Sciences, The University of Tokyo(东京大学艺术科学研究生院Komaba研究所) Nagoya University(名古屋大学) NARA Institute of Science and Technology (NAIST) / Mohamed bin Zayed University of Artificial Intelligence (MBZUAI)(NAIST科学与技术研究所 / 摩洛哥本·泽德人工智能大学) Mohamed bin Zayed University of Artificial Intelligence (MBZUAI) / RIKEN AIP(摩洛哥本·泽德人工智能大学 / RIKEN AIP) Graduate School of Arts and Sciences, The University of Tokyo / RIKEN AIP / Kyoto University(东京大学艺术科学研究生院 / RIKEN AIP / 京都大学)

AI总结 本文提出CITE算法,通过交并检验与E过程实现任意时间有效的模型响应分布唯一模式认证,无需预先知道可能答案集,并在扩散尾设置中提升认证效果。

详情
AI中文摘要

大语言模型常通过采样多个输出并聚合最终答案来提升推理能力,但精确控制误差水平仍具挑战性。特别是当停止规则数据依赖且可能答案集未知时,决定何时停止采样尤为困难。本文研究了预设目标答案作为模型响应分布唯一模式的任意时间有效认证,提出CITE算法,该算法在任意数据驱动停止下可证明控制假认证率,无需先验知识。同时证明了无类别集大小限制的停止时间速率,建立了主范围内与常数匹配的最小最大下界,并扩展到置信度加权投票。模拟和大语言模型自我一致性实验显示在扩散尾设置中实现了经验误差控制和认证改进。

英文摘要

Large language models often improve reasoning by sampling multiple outputs and aggregating their final answers, but precise and efficient control of error levels remains a challenging task. In particular, deciding when to stop sampling remains difficult when the stopping rule is data-dependent and the set of possible answers is not known in advance. We study anytime-valid certification of a prespecified target answer as the unique mode of the model's response distribution, a guarantee distinct from answer correctness. We propose the Certification by Intersection-union Testing with E-processes (CITE) algorithm, which provably controls false certification at any prescribed level under arbitrary data-driven stopping, without requiring prior knowledge of the answer category set. We also prove an category-set-size-free stopping-time rate, establish matching minimax lower bounds up to constants in the main regime, and extend the construction to confidence-weighted voting. Simulations and LLM self-consistency experiments show empirical error control and improved certification in diffuse-tail settings.

2605.05846 2026-05-08 cs.CR cs.AI

LoopTrap: Termination Poisoning Attacks on LLM Agents

LoopTrap: 对 LLM agent 的终止污染攻击

Huiyu Xu, Zhibo Wang, Wenhui Zhang, Ziqi Zhu, Yaopeng Wang, Kui Ren, Chun Chen

发表机构 * The State Key Laboratory of Blockchain and Data Security(区块链与数据安全国家重点实验室) Zhejiang University(浙江大学) School of Cyber Science and Engineering(网络安全与工程学院) Southeast University(东南大学)

AI总结 本文研究了LLM agent迭代执行循环中的终止污染风险,提出10种攻击策略并通过实验证明不同agent的行为特征影响攻击效果,设计了自动化框架LoopTrap以提升红队攻击效率。

详情
AI中文摘要

现代LLM agent通过迭代执行循环解决复杂任务,通过反复推理、行动和自我评估来判断任务完成。本文发现,尽管这种自主循环促进自主性,但也引入了关键风险:通过向agent的上下文注入恶意提示,攻击者可以扭曲agent的终止判断,使其认为任务未完成,导致计算无界。为理解这一威胁,我们将其定义并系统化地刻画为终止污染,并设计了10种代表性攻击策略。通过覆盖8个LLM agent和60个任务的实证研究,我们证明不同LLM agent表现出不同的行为特征,这些特征决定了哪些策略成功。这些可转移的模式可以作为原则性的指导,帮助设计针对以前未见过的agent和任务的有效攻击,实现大规模红队行动,超越手动设计的模板。基于这些见解,我们引入LoopTrap,一个自动化红队框架,通过利用agent的行为倾向合成目标特定的恶意提示。LoopTrap首先通过轻量级探测构建目标agent的四个漏洞维度行为档案。然后进行自适应陷阱合成,通过自我评分机制路由到最有效的策略并选择最佳注入。最后,成功的陷阱被抽象为可重用的技能库,而失败的尝试则通过自我反思进行优化,确保持续改进。广泛的评估显示,LoopTrap在8个主流agent上实现了平均3.57倍的步骤放大,峰值为25倍。

英文摘要

Modern LLM agents solve complex tasks by operating in iterative execution loops, where they repeatedly reason, act, and self-evaluate progress to determine when a task is complete. In this work, we show that while this self-directed loop facilitates autonomy, it also introduces a critical risk: by injecting malicious prompts into the agent's context, an adversary can distort the agent's termination judgment, making it believe the task remains incomplete and leading to unbounded computation.To understand this threat, we define and systematically characterize it as Termination Poisoning and design 10 representative attack strategies. Through a empirical study spanning 8 LLM agents and 60 tasks, we demonstrate that different LLM agents exhibit distinct behavioral signatures that determine which strategies succeed. These transferable patterns can serve as principled guidance for crafting effective attacks against previously unseen agents and tasks, enabling scalable red-teaming beyond manually designed templates. Building on these insights, we introduce LoopTrap, an automated red-teaming framework that synthesizes target-specific malicious prompts by exploiting agent behavioral tendencies. LoopTrap first constructs a behavioral profile of the target agent along four vulnerability dimensions via lightweight probing. It then performs adaptive trap synthesis, routing to the most effective strategy and selecting optimal injections via a self-scoring mechanism. Finally, successful traps are abstracted into a reusable skill library, while failed attempts are refined through self-reflection, ensuring continuous improvement. Extensive evaluation shows that LoopTrap achieves an average of 3.57$\times$ step amplification across 8 mainstream agents, with a peak of 25$\times$.

2605.05818 2026-05-08 cs.CR cs.AI cs.CL

LeakDojo: Decoding the Leakage Threats of RAG Systems

LeakDojo:解码RAG系统的泄漏威胁

Maosen Zhang, Jianshuo Dong, Boting Lu, Wenyue Li, Xiaoping Zhang, Tianwei Zhang, Han Qiu

发表机构 * Tsinghua University, China(清华大学, 中国) Ant International, China(蚂蚁集团, 中国) Nanyang Technological University, Singapore(南洋理工大学, 新加坡)

AI总结 LeakDojo通过可控评估揭示RAG系统泄漏风险,发现查询生成和对抗指令独立影响泄漏,且指令能力越强泄漏风险越高,RAG可信度提升可能增加泄漏风险。

Comments Findings of ACL 2026

详情
AI中文摘要

检索增强生成(RAG)使大语言模型(LLM)能利用外部知识,但也暴露了有价值的RAG数据库给泄漏攻击。随着RAG系统复杂性和LLM指令遵循能力增强,现有研究未能系统评估RAG泄漏风险。我们提出LeakDojo,一个可配置的评估框架,通过在14个LLM、4个数据集和多样RAG系统上基准测试六种现有攻击,发现查询生成和对抗指令独立影响泄漏,且指令能力越强泄漏风险越高,RAG可信度提升可能增加泄漏风险。这些发现为理解并缓解RAG泄漏提供了可行见解。代码库可在https://github.com/yeasen-z/LeakDojo获取。

英文摘要

Retrieval-Augmented Generation (RAG) enables large language models (LLMs) to leverage external knowledge, but also exposes valuable RAG databases to leakage attacks. As RAG systems grow more complex and LLMs exhibit stronger instruction-following capabilities, existing studies fall short of systematically assessing RAG leakage risks. We present LeakDojo, a configurable framework for controlled evaluation of RAG leakage. Using LeakDojo, we benchmark six existing attacks across fourteen LLMs, four datasets, and diverse RAG systems. Our study reveals that (1) query generation and adversarial instructions contribute independently to leakage, with overall leakage well approximated by their product; (2) stronger instruction-following capability correlates with higher leakage risk; and (3) improvements in RAG faithfulness can introduce increased leakage risk. These findings provide actionable insights for understanding and mitigating RAG leakage in practice. Our codebase is available at https://github.com/yeasen-z/LeakDojo.

2605.05808 2026-05-08 stat.ML cs.LG math.ST stat.TH

Ratio-based Loss Functions

基于比率的损失函数

Lena Helgerth, Andreas Christmann

发表机构 * Department of Mathematics, University of Bayreuth, Chair of Stochastics and Machine Learning(数学系,拜罗伊特大学,随机过程与机器学习研究所)

AI总结 本文综述了一类称为比率型的损失函数,探讨其连续性、Lipschitz连续性、凸性和可微性等性质,为未来研究提供基础。

详情
AI中文摘要

机器学习和人工智能算法的关键依赖于三个要素:风险函数(损失函数的期望)、函数空间(假设空间)以及允许的概率测度集。本文综述了一类称为比率型的损失函数。在监督学习中,分类任务中的边际损失函数依赖于输出值$y_i$与预测值$f(x_i)$的乘积,而回归任务中的距离型损失函数则依赖于$y_i$与$f(x_i)$的差值。距离型损失函数在加法模型假设合理时特别有用,即常见信号加噪声假设。然而,在文献中,为回归目的提出的几种损失函数考虑的是乘法误差结构,并关注相对误差,即$y_i$与$f(x_i)$的比率。本文系统研究了此类比率型损失函数,并提出几种新的损失函数,可能对未来研究有帮助。我们专注于研究比率型损失函数的一般性质,如连续性、Lipschitz连续性、凸性和可微性,因为这些性质在大多数机器学习算法中起核心作用。因此,本文不专注于特定机器学习算法以推导通用一致性、学习速率或稳定性结果。相反,我们希望为未来研究提供方向。

英文摘要

Algorithms in machine learning and AI do critically depend on at least three key components: (i) the risk function, which is the expectation of the loss function, (ii) the function space, which is often called the hypothesis space, and (iii) the set of probability measures, which are allowed for the specified algorithm. This paper gives a survey of a certain class of loss functions, which we call ratio-based. In supervised learning, margin-based loss functions for classification tasks depending on the product of the output values $y_i$ and the predictions $f(x_i)$ as well as distance-based loss functions depending on the difference of $y_i$ and $f(x_i)$ for regression are common. Distance-based loss functions are in particular useful, if an additive model assumption seems plausible, i.e. the common signal plus noise assumption. However, in the literature, several loss functions proposed for regression purposes have a multiplicative error structure in mind and pay attention to relative errors, i.e. to the ratio of $y_i$ and $f(x_i)$. In this survey article, we systematically investigate such ratio-based loss functions and propose a few new losses, which may be interesting for future research. We concentrate on investigating general properties of ratio-based loss functions like continuity, Lipschitz-continuity, convexity, and differentiability, because these properties play a central role in most machine learning algorithms. Therefore, we do not focus on some specific machine learning algorithm to derive universal consistency, learning rates, or stability results. Instead, we want to enable future research in this direction.

2605.05807 2026-05-08 cs.CR cs.AI

LCC-LLM: Leveraging Code-Centric Large Language Models for Malware Attribution

LCC-LLM:利用代码导向的大语言模型进行恶意软件归因

Christopher G. Pedraza Pohlenz, Hassan Jalil Hadi, Ali Hassan, Ali Shoker

发表机构 * CyberSaR, King Abdullah University of Science and Technology(CyberSaR,国王阿卜杜勒·阿齐兹大学科学与技术学院)

AI总结 本文提出LCC-LLM,一种代码导向的恶意软件归因和多任务静态恶意软件分析框架,通过构建包含34000个PE样本的LCCD数据集,结合静态分析与多源安全知识,提升恶意软件归因的可靠性与实用性。

详情
AI中文摘要

LLMs正被越来越多地用于恶意软件分析;然而,当前基于LLM的恶意软件归因仍受限于不支持的指标和不足的代码层面基础,难以识别恶意和易受攻击的代码段。为解决这些限制,本研究引入LCC-LLM,一种以代码为中心的基准数据集和证据导向的框架,用于恶意软件归因和多任务静态恶意软件分析。所提出的LCCD数据集包含约34000个PE样本,通过大规模反汇编流水线处理,并使用反汇编C代码、汇编代码、CFG/FCG制品、十六进制数据、PE元数据、可疑API证据和结构特征进行表示。除了数据集构建外,LCC-LLM整合LangGraph主导的静态分析与多源网络安全知识,以支持证据导向的恶意软件推理。该框架采用七层检索增强生成管道,CoVe用于IoC验证,并采用多维质量门以提高事实可靠性及分析师导向的决策支持。课程有序的指令数据用于使用QLoRA微调DeepSeek-R1-Distill-Qwen-14B和Qwen3-Coder-30B-A3B。在43种恶意软件分析任务类型上的评估实现了平均语义相似度为0.634,其中在结构化报告生成、IoC提取、漏洞评估、恶意软件配置提取和恶意软件类别检测等任务层面表现最佳。在使用MalwareBazaar样本的现实案例研究中,该基础管道实现了10/10的结构化分析通过率,生成CFG/FCG证据、MITRE ATT&CK映射、检测指导和分析师准备的报告。这些结果表明,以代码为中心的表示、检索基础和验证引导的推理提高了LLM辅助恶意软件归因的可靠性和操作实用性。

英文摘要

LLMs are increasingly explored for malware analysis; however, current LLM-based malware attribution remains limited by unsupported indicators and insufficient code-level grounding for identifying malicious and vulnerable code segments. To address these limitations, this research introduces LCC-LLM, a code-centric benchmark dataset and evidence-grounded framework for malware attribution and multi-task static malware analysis. The proposed LCCD dataset contains approximately 34K PE samples processed through a large-scale reverse-engineering pipeline and represented using decompiled C code, assembly code, CFG/FCG artifacts, hexadecimal data, PE metadata, suspicious API evidence, and structural features. Beyond dataset construction, LCC-LLM integrates LangGraph-orchestrated static analysis with multi-source cybersecurity knowledge to support evidence-grounded malware reasoning. The framework employs a seven-layer retrieval-augmented generation pipeline, CoVe for IoC validation, and a multi-dimensional quality gate to improve factual reliability and analyst-oriented decision support. Curriculum-ordered instruction data is used to fine-tune DeepSeek-R1-Distill-Qwen-14B and Qwen3-Coder-30B-A3B using QLoRA. Evaluation across 43 malware-analysis task types achieves an average semantic similarity of 0.634, with the highest task-level performance in structured report generation, IoC extraction, vulnerability assessment, malware configuration extraction, and malware class detection. In a real-world case study using MalwareBazaar samples, the grounded pipeline achieves a 10/10 structured analysis pass rate, producing CFG/FCG evidence, MITRE ATT&CK mappings, detection guidance, and analyst-ready reports. These results show that code-centric representations, retrieval grounding, and verification-guided reasoning improve the reliability and operational usefulness of LLM-assisted malware attribution.

2605.05789 2026-05-08 cs.CR cs.CV

Stego Battlefield: Evaluating Image Steganography Attacks and Steganalysis Defenses

隐写战场:评估图像隐写术攻击与隐写分析防御

Zhen Sun, Zongmin Zhang, Leyi Sheng, Yule Liu, Yifan Liao, Ke Li, Xinhu Zheng, Jiaheng Wei, Wenyuan Yang, Xinlei He

发表机构 * Wuhan University(武汉大学) The Hong Kong University of Science and Technology (Guangzhou)(香港科技大学(广州)) Sun Yat-sen University(中山大学)

AI总结 本文提出SADBench基准,系统评估隐写攻击能力和隐写分析防御能力,揭示攻击稳定性、检测成本及转移性差异,揭示现实威胁在社交媒体中持续存在。

Comments 23 pages

详情
AI中文摘要

图像隐写术广泛用于保护用户隐私和实现隐蔽通信。然而,它也可以被对手滥用作为隐蔽通道,绕过内容审查,传播有害语义,甚至在图像中隐藏恶意指令以引发大模型危险输出,构成持续演化的实际安全风险。为解决缺乏统一系统评估框架的问题,我们提出SADBench,一个系统基准,评估对手通过隐写术注入有害信息的能力以及防御方通过隐写分析检测此类威胁的能力。关键在于SADBench包含4个核心任务:隐写攻击能力评估、隐写分析防御能力评估、效率评估和转移性评估。它在多样化的覆盖分布上评估图像-负载和文本-负载隐写术,利用有害视觉语义和有毒指令模拟恶意攻击。在广泛攻击和检测器集合上,SADBench揭示:(i) INN和自动编码器方法相比其他架构表现更稳定;(ii) 领域内检测几乎完美且成本更低;(iii) 存在关键的转移性不对称,攻击能稳健泛化到新分布,而检测器无法适应;(iv) 现实威胁在社交媒体中持续存在,其中负载要么能承受最小压缩,或通过模拟训练有效适应攻击压缩。总体而言,SADBench建立了一个系统、可重复和可扩展的框架来量化风险,为隐写防御的可衡量和安全驱动进步铺平道路。

英文摘要

Image steganography is widely used to protect user privacy and enable covert communication. However, it can also be abused by the adversary as a covert channel to bypass content moderation, disseminate harmful semantics, and even hide malicious instructions in images to elicit dangerous outputs from large models, posing a practical security risk that continues to evolve. To address the lack of a unified and systematic evaluation framework, we propose SADBench, a systematic benchmark that assesses the adversary's ability to inject harmful secrets via steganography and the defender's ability to detect such threats through steganalysis. Crucially, SADBench comprises $4$ core tasks, namely steganography attack capability evaluation, steganalysis defense capability evaluation, efficiency evaluation, and transferability evaluation. It evaluates both image-payload and text-payload steganography across diverse cover distributions, utilizing harmful visual semantics and toxic instructions to simulate malicious attacks. Across a broad set of attacks and detectors, SADBench reveals that (i) INN and autoencoder-based methods demonstrate superior stability compared to other architectures, (ii) in-domain detection is near-perfect and cheaper than generation, (iii) a critical asymmetry exists in transferability where attacks robustly generalize to new distributions while detectors fail to adapt, and (iv) real-world threats persist on social media, where payloads either survive minimal compression or effectively adapt to aggressive compression via simulated training. Overall, SADBench establishes a systematic, reproducible, and extensible framework to quantify risks, paving the way for measurable and security-driven advancements in steganography defense.

2605.05768 2026-05-08 math.ST cs.LG stat.ML stat.TH

Optimal Confidence Band for Kernel Gradient Flow Estimator

核梯度流估计器的最佳置信带

Yuqian Cheng, Zhuo Chen, Qian Lin

发表机构 * Department of Mathematical Sciences(数学科学系) Department of Statistics and Data Science(统计与数据科学系)

AI总结 本文研究了核回归方法中核梯度流的 supremum-范数泛化误差和统一推断,基于容量-源条件框架,推导了连续和离散核梯度流的收敛速率,并构建了同时置信带,其宽度最优。

详情
AI中文摘要

本文研究了核回归方法中核梯度流的 supremum-范数泛化误差和统一推断。在广泛采用的容量-源条件框架下,我们首先建立了连续和离散核梯度流的 supremum-范数泛化误差的收敛速率,其中源条件 $s>α_0$,其中 $α_0\in(0,1)$ 表示核函数的嵌入指数。此外,我们证明这些速率与最小最大最优速率匹配。基于这一结果,我们进一步构建了连续和离散核梯度流的同时置信带。值得注意的是,所提出置信带的宽度也是最优的,即其收缩速率大于,但可以任意接近最小最大最优速率。

英文摘要

In this paper, we investigate the supremum-norm generalization error and the uniform inference for a specific class of kernel regression methods, namely the kernel gradient flows. Under the widely adopted capacity-source condition framework in the kernel regression literature, we first establish convergence rates for the supremum norm generalization error of both continuous and discrete kernel gradient flows under the source condition $s>α_0$, where $α_0\in(0,1)$ denotes the embedding index of the kernel function. Moreover, we show that these rates match the minimax optimal rates. Building on this result, we then construct simultaneous confidence bands for both continuous and discrete kernel gradient flows. Notably, the widths of the proposed confidence bands are also optimal, in the sense that their shrinkage rates are greater than, while can be arbitrarily close to, the minimax optimal rates.

2605.05767 2026-05-08 cs.HC cs.CL

Priming, Path-dependence, and Plasticity: Understanding the molding of user-LLM interaction and its implications from (many) chat logs in the wild

预设、路径依赖与可塑性:从(众多)真实聊天日志理解用户与大语言模型交互的塑造及其影响

Shengqi Zhu, Jeffrey M. Rzeszotarski, David Mimno

发表机构 * Cornell University(康奈尔大学) Loyola University Maryland(洛约拉大学马里兰分校)

AI总结 通过分析14万次聊天记录,研究发现用户与LLM的交互模式在早期迅速形成并稳定,长期结果与早期探索密切相关,揭示了用户行为的路径依赖和可塑性特征。

详情
AI中文摘要

用户与LLM的交互受到先前经验与个体探索的影响,但实验室研究无法提供真实环境中的因素。本文通过大规模真实聊天日志分析,发现用户交互模式在早期迅速形成并稳定,长期结果与早期探索密切相关,同时存在并行动态,如按任务类型组织表达或对模型版本更新的响应。这些结果揭示了“代理悖论”:尽管LLM输入空间无约束且由用户驱动,但实际用户探索较少。本文呼吁在设计过程中考虑交互塑造过程及其在后续研究中的整合。

英文摘要

User interactions with LLMs are shaped by prior experiences and individual exploration, but in-lab studies do not provide system designers with visibility into these in-the-wild factors. This work explores a new approach to studying real-world user-LLM interactions through large-scale chat logs from the wild. Through analysis of 140K chatbot sessions from 7,955 anonymized global users over time, we demonstrate key patterns in user expressions despite varied tasks: (1) LLM users are not tabula rasa, nor are they constantly adapting; rather, interaction patterns form and stabilize rapidly through individual early trajectories; (2) Longitudinal outcomes, such as recurring text patterns and retention rates, are strongly correlated with early exploration; (3) Parallel dynamics are present, including organizing expressions by task types such as emotional support, or in response to model-version updates. These results present an ``agency paradox'': despite LLM input spaces being unconstrained and user-driven, we in fact see less user exploration. We call for design consideration surrounding the molding procedure and its incorporation in future research.

2605.05755 2026-05-08 stat.ML cs.AI cs.LG

Transformers Provably Implement In-Context Reinforcement Learning with Policy Improvement

转换器可证明地实现上下文强化学习中的策略改进

Haodong Liang, Lifeng Lai

发表机构 * Department of Electrical and Computer Engineering(电气与计算机工程系) University of California, Davis(加州大学戴维斯分校)

AI总结 研究转换器在上下文强化学习中实现策略改进的能力,通过显式参数构造证明线性自注意力块可实现半梯度SARSA和actor-critic方法,并提供首次收敛保证。

Comments 25 pages, 4 figures

详情
AI中文摘要

我们研究了转换器执行上下文强化学习(ICRL)的能力,其中模型必须从轨迹数据中推断并执行学习算法而无需参数更新。我们证明线性自注意力转换块可通过显式参数构造可证明地实现策略改进方法,包括半梯度SARSA和actor-critic。除了存在性,我们设计了教师模仿训练过程,分析了其梯度流动态,并在ICRL文献中建立了首个收敛保证:在适当的训练MDP分布丰富条件下,梯度流局部且指数收敛到对应于所需RL更新的最优参数流形。经验上,训练转换器在随机生成的表格MDP上确认了这些预测:学习模型恢复了我们的显式构造参数结构,并在未见过的MDP上交付强的上下文控制性能。这些结果阐明了转换器架构如何内化并执行经典强化学习算法,弥合了ICRL中的机理理解与训练动态。

英文摘要

We investigate the ability of transformers to perform in-context reinforcement learning (ICRL), where a model must infer and execute learning algorithms from trajectory data without parameter updates. We show that a linear self-attention transformer block can provably implement policy-improvement methods, including semi-gradient SARSA and actor-critic, via explicit parameter constructions. Beyond existence, we design a teacher-mimicking training procedure, analyze its gradient-flow dynamics, and establish the first convergence guarantee in the ICRL literature: under suitable richness conditions on the training MDP distribution, gradient flow converges locally and exponentially to an optimal parameter manifold corresponding to the desired RL update. Empirically, training transformers on randomly generated tabular MDPs confirms these predictions: the learned models recover the parameter structure of our explicit constructions and, when deployed on unseen MDPs, deliver strong in-context control performance. Together, these results illuminate how transformer architectures internalize and execute classical reinforcement learning algorithms in context, bridging mechanistic understanding and training dynamics in ICRL.

2605.05746 2026-05-08 cond-mat.mtrl-sci cs.LG physics.chem-ph physics.comp-ph

Polarizable atomic multipoles for learning long-range electrostatics

可极化的原子多极子用于学习长程静电学

Dongjin Kim, Daniel S. King, Yoonjae Park, Roya Savoj, Sebastien Hamel, Xiaoyu Wang, Bingqing Cheng

发表机构 * Department of Chemistry, UC Berkeley, California 94720, United States(加州大学伯克利分校化学系) Bakar Institute of Digital Materials for the Planet, UC Berkeley, California 94720, United States(为地球的数字材料巴卡研究所,加州大学伯克利分校) Lawrence Livermore National Laboratory, Livermore, CA, USA(劳伦斯利弗莫尔国家实验室) Chemical Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California, 94720, United States(劳伦斯伯克利国家实验室化学科学部)

AI总结 本文提出一种半局部框架,利用可极化的原子多极子学习静电学,通过多极子层次和响应项提升势能面精度,尤其在长程效应关键系统中表现突出,恢复了物理意义的电响应。

详情
AI中文摘要

长程静电学和极化仍是扩展机器学习相互作用势能(MLIPs)到离子、极性和界面系统的主要障碍。本文介绍了一种半局部框架,利用可极化的原子多极子从能量和力中学习静电学。局部等变描述符预测环境依赖的潜变量单极子、偶极子和四极子,而残差非局部电荷转移和极化通过非自洽线性响应在诱导电荷和偶极子中被捕获。在四个多样化的基准和四个短程MLIP架构中,多极子层次和响应项系统性地提高了势能面的准确性,最大的收益出现在长程效应至关重要的系统中。更重要的是,学习到的潜变量恢复了物理意义的电响应:准确的Born有效电荷张量、涌现的极化率、与实验一致的红外光谱,以及半定量的水和混合MAPbI3钙钛矿的拉曼光谱。这种系统性可改进、物理透明的框架使训练于标准能量和力标签的MLIPs能够预测极化敏感的观测量。

英文摘要

Long-range electrostatics and polarization remain central obstacles to extending machine learning interatomic potentials (MLIPs) to ionic, polar, and interfacial systems. Here, we introduce a semi-local framework for learning electrostatics from energies and forces using polarizable atomic multipoles. Local equivariant descriptors predict environment-dependent latent monopoles, dipoles, and quadrupoles, while residual non-local charge transfer and polarization are captured by non-self-consistent linear response in induced charges and dipoles. Across four diverse benchmarks and four short-range MLIP architectures, the multipole hierarchy and response terms systematically improve potential energy surface accuracy, with the largest gains in systems where long-range effects are essential. More importantly, the learned latent variables recover physically meaningful electrical responses: accurate Born effective charge tensors, emergent polarizabilities, infrared spectra in close agreement with experiments, and semi-quantitative Raman spectra for bulk water and hybrid MAPbI$_3$ perovskite. This systematically improvable, physically transparent framework enables MLIPs trained on standard energy and force labels to predict polarization-sensitive observables.

2605.05724 2026-05-08 cs.MA cs.AI

Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes

通过专家代理的自动化研究开发出有效的非琐碎的训练配方

Jingjie Ning, Xiaochuan Li, Ji Zeng, Hao Kang, Chenyan Xiong

发表机构 * School of Computer Science, Carnegie Mellon University(卡内基梅隆大学计算机科学学院)

AI总结 本文研究了由外部测量驱动的自动化研究闭环,通过专家代理划分配方表面并共享测量 lineage,实现了自主的代码编写、实验提交和反馈吸收,提升了多个任务的性能。

详情
AI中文摘要

我们研究了由外部测量驱动的自动化研究闭环。每个提交的试验包含假设、可执行代码修改、评估者拥有的结果以及反馈,这些反馈塑造下一个提案。输出不是生成的论文或单个模型检查点,而是一条可审计的提案、代码差异、实验、分数和失败标签的轨迹。我们通过专家代理实例化这个闭环,这些代理划分配方表面并在试验之间共享测量 lineage。核心经验发现是,lineage 反馈使代理能够将评估者结果(包括崩溃、预算超支、大小失败和准确性门缺失)转化为后续的程序级配方修改,而不是一次性建议。在一次设置和启动后,经过1,197个头条运行试验和600个参数高尔夫控制试验,人类没有选择提案、编辑配方、覆盖分数或修复失败试验。在三个头条运行中,相同的提交试验循环将参数高尔夫验证bpb降低了0.81%,将NanoChat-D12 CORE提高了38.7%,将CIFAR-10 Airbench96的运行时间减少了4.59%。轨迹包括对157个头条运行提交的严格架构-领域审计和程序重写,如NanoChat注意力内核路径变化。在此范围内,循环自主编写代码、提交实验、吸收反馈、在每个环境中应用和结合已知技术,并改进公共起始配方。

英文摘要

We study auto research as a closed empirical loop driven by external measurement. Each submitted trial carries a hypothesis, an executable code edit, an evaluator-owned outcome, and feedback that shapes the next proposal. The output is not a generated paper or a single model checkpoint, but an auditable trajectory of proposals, code diffs, experiments, scores, and failure labels. We instantiate this loop with specialist agents that partition recipe surfaces and share measured lineage across trials. The central empirical finding is that lineage feedback lets agents turn evaluator outcomes, including crashes, budget overruns, size failures, and accuracy-gate misses, into later program-level recipe edits rather than one-shot suggestions. Across 1,197 headline-run trials plus 600 Parameter Golf control trials after one-time setup and launch, humans did not choose proposals, edit recipes, override scores, or repair failed trials during the search. In the three headline runs, the same submitted-trial loop reduces Parameter Golf validation bpb by $0.81\%$, raises NanoChat-D12 CORE by $38.7\%$, and reduces CIFAR-10 Airbench96 wallclock by $4.59\%$, with each task measured by its own external evaluator and legality checks. The trace includes a strict architecture-domain audit of 157 headline-run submissions and program rewrites such as a NanoChat attention-kernel path change. Within this scope the loop autonomously writes code, submits experiments, absorbs feedback, applies and combines known techniques inside each environment, and improves public starting recipes.

2605.05705 2026-05-08 math.NA cs.LG cs.NA math.PR stat.ML

Convex-Geometric Error Bounds for Positive-Weight Kernel Quadrature

具有正权重核二次求积的凸几何误差界

Satoshi Hayakawa

发表机构 * The University of Tokyo(东京大学)

AI总结 本文研究了在正权重约束下核二次求积的误差界,通过随机凸包的几何结果,证明了在固定维度下,通过凸组合可实现高概率的均值近似,从而在有利的谱条件下超越蒙特卡洛方法。

Comments 22 pages

详情
AI中文摘要

核二次求积可以利用RKHS谱结构并在光滑积分算子上优于蒙特卡洛方法,但优化的二次求积权重通常为符号权重且可能数值不稳定。我们研究在权重受限为正(即简单权重)时,谱加速是否仍有可能。在精确目标固定池设置中,已有一个大小为N的i.i.d.候选池,并需重新加权以近似核均值嵌入。我们证明正重新加权问题由池生成的随机凸包而非等权重经验平均所支配。主要几何结果表明,一个有界的d维随机向量的均值可被N个i.i.d.样本的凸组合以O(d/N)的精度高概率近似,优于固定维度下的等权重平均。通过增强的Mercer截断论证,将此d维凸包近似转化为全RKHS最坏误差。所得正权重KQ界包含谱尾项和有限样本凸包项,从而在有利的谱条件下实现超越蒙特卡洛的速率,包括在指数谱衰减下近O(1/N)的速率。我们还提供了一个构造性的Frank-Wolfe算法,直接在池原子上操作,保持简单权重,并具有显式的优化误差界。

英文摘要

Kernel quadrature can exploit RKHS spectral structure and outperform Monte Carlo on smooth integrands, but optimized quadrature weights are generally signed and may be numerically unstable. We study whether spectral acceleration remains possible when the weights are constrained to be positive, i.e., simplex weights. In the exact-target fixed-pool setting, an evaluated i.i.d. candidate pool of size $N$ is already available and the task is to reweight it so as to approximate the kernel mean embedding. We show that this positive reweighting problem is governed not by the equal-weight empirical average, but by the random convex hull generated by the pool. Our main geometric result shows that the mean of a bounded $d$-dimensional random vector can be approximated by a convex combination of $N$ i.i.d. samples at accuracy $O(d/N)$ with high probability, sharper than equal-weight averaging in the fixed-dimensional regime. We transfer this $d$-dimensional convex-hull approximation to full RKHS worst-case error through an augmented Mercer-truncation argument. The resulting positive-weight KQ bounds consist of a spectral tail term and a finite-sample convex-hull term, yielding Monte-Carlo-beating rates in favorable spectral regimes, including near-$O(1/N)$ rates up to logarithmic factors under exponential spectral decay. We also provide a constructive Frank--Wolfe algorithm that operates directly on the pool atoms, maintains simplex weights, and admits an explicit optimization-error bound.

2605.05700 2026-05-08 cs.SE cs.AI

An Empirical Study of Proactive Coding Assistants in Real-World Software Development

对现实软件开发中主动编码助手的实证研究

Lehui Li, Ruixuan Jia, Guo-Ye Yang, Jia Li

发表机构 * College of AI, Tsinghua University(清华大学人工智能学院) Fitten Tech Co., Ltd.(Fitten科技有限公司)

AI总结 本文通过大规模实证研究,分析了模拟与现实数据在主动意图预测中的差异,提出ProCodeBench基准,并指出模拟数据无法替代真实数据。

详情
AI中文摘要

大型语言模型(LLM)基于的编码助手已取得显著进展,但大多数系统仍为反应式,要求开发者明确表达需求。主动编码助手旨在从集成开发环境(IDE)交互和仓库上下文推断隐含的开发者意图,从而减少交互开销并提供更无缝的帮助。然而,这一方向的研究受限于大规模真实世界开发者行为数据的稀缺性。现有研究往往依赖LLM模拟的IDE轨迹,其对真实开发行为的忠实度仍不明确。本文通过大规模实证研究探讨了这种模拟到现实的差距。我们通过定制的Visual Studio Code扩展,从1,246名经验丰富的行业开发者身上收集了三天的实时IDE交互轨迹,并构建了配对的LLM模拟轨迹进行对比分析。我们的分析显示,模拟轨迹在行为多样性、时间结构和探索模式上与真实轨迹存在显著差异。基于收集的数据,我们引入了ProCodeBench,一个用于主动意图预测的真实世界基准。实验表明,当前方法在真实IDE轨迹下仍远未可靠,表明基于模拟的评估可能高估真实世界性能。最后,我们的训练研究表明,模拟数据无法替代真实数据,但可在真实世界微调前作为补充。这些发现强调了真实开发者行为数据在评估和训练主动编码助手中的重要性。

英文摘要

Large language model (LLM)-based coding assistants have made substantial progress, yet most systems remain reactive, requiring developers to explicitly formulate their needs. Proactive coding assistants aim to infer latent developer intent from integrated development environment (IDE) interactions and repository context, thereby reducing interaction overhead and supporting more seamless assistance. However, research in this direction is limited by the scarcity of large-scale real-world developer behavior data. Existing studies therefore often rely on LLM-simulated IDE traces, whose fidelity to real development behavior remains unclear. In this paper, we investigate this simulation-to-reality gap through a large-scale empirical study. We collect real IDE interaction traces from 1{,}246 experienced industry developers over three consecutive days using a custom Visual Studio Code extension, and construct paired LLM-simulated traces for controlled comparison. Our analysis shows that simulated traces differ substantially from real traces in behavioral diversity, temporal structure, and exploratory patterns. Based on the collected data, we introduce \textbf{ProCodeBench}, a real-world benchmark for proactive intent prediction. Experiments with representative LLMs, retrieval-augmented methods, and agentic baselines show that current approaches remain far from reliable under real IDE traces, suggesting that simulation-based evaluation can overestimate real-world performance. Finally, our training study shows that simulated data cannot replace real data, but can complement it when used before real-world fine-tuning. These findings highlight the importance of real developer behavior data for evaluating and training proactive coding assistants.

2605.05699 2026-05-08 cs.PF cs.AI

When Quantization Is Free: An int4 KV Cache That Outruns fp16 on Apple Silicon

量化为何免费:一种int4 KV缓存性能超越fp16的苹果硅芯片

Mohamed Amine Bergach

发表机构 * Illumina

AI总结 本文提出一种int4 KV缓存方法,在苹果硅芯片上实现比fp16更快的性能,通过融合金属内核和内存压缩,保持质量并提升效率。

详情
AI中文摘要

KV缓存量化被视为质量与延迟的权衡。我们展示在苹果硅芯片的统一内存中,一种融合的金属内核(符号随机化FFT + 每通道λ + 每组abs-max + int4 nibble pack)作为HuggingFace的Cache子类,能够在Gemma-3 1B和Qwen2.5-1.5B上以256-4096个标记前缀运行得比fp16更快,内存压缩率提升3倍且质量保持(PPL=0.000 Qwen短提示;Gemma增加3.6 hook PPL)。内核的约25ns/vec开销低于3倍压缩带来的带宽节省。融合内核还解决了Qwen的4位每标记灾难(PPL从+7975降至+638.6,减少12.5倍)在182GFLOPS / D=128的情况下。支持发现:SRFT和SRHT在KV质量上统计上无法区分(我们选择SRFT用于混合基数和矩阵乘法对齐);学习旋转的消融显示固定随机SRFT基底的正则化作用(学习R+λ不使用SRFT降低校准MSE 84.9% vs 50.3%但产生更差PPL);Householder旋转在k=d/2反射器时在d=256时几乎无损。

英文摘要

KV-cache quantization is framed as a quality--latency trade-off. We show it is \emph{inverted} on Apple Silicon's unified memory: a single fused Metal kernel (sign-randomized FFT $+$ per-channel $λ$ $+$ per-group abs-max $+$ int4 nibble pack), exposed as a HuggingFace \texttt{Cache} subclass, runs \emph{faster than fp16} across $256$--$4096$-token prefixes on Gemma-3 1B ($-3$ to $-8\%$ ms/tok) and at short context on Qwen2.5-1.5B ($-0.7$ to $-2.6\%$ through $1$K), with $3\times$ persistent memory compression and quality preserved ($\dPPL = 0.000$ Qwen short-prompt; $+3.6$ hook $\dPPL$ Gemma). The kernel's $\sim\!25$\,ns/vec overhead is below the bandwidth savings from $3\times$ compression. The fused kernel also closes Qwen's 4-bit per-token catastrophe ($\dPPL = +7975 \to +638.6$, $12.5\times$ reduction) at $182$\,GFLOPS / $D{=}128$. Supporting findings: $\SRFT$ and $\SRHT$ are statistically indistinguishable for KV quality (we pick $\SRFT$ for mixed-radix and matrix-multiply alignment); a learned-rotation ablation surfaces a regularization role for the fixed random SRFT base (learning $R+λ$ without SRFT lowers calibration MSE $84.9\%$ vs $50.3\%$ but yields worse PPL); Householder rotations at $k{=}d/2$ reflectors are effectively lossless at $d{=}256$.

2605.05696 2026-05-08 cs.DC cs.AI cs.LG

Irminsul: MLA-Native Position-Independent Caching for Agentic LLM Serving

Irminsul:基于MLA的原生位置无关缓存用于代理LLM服务

Bole Ma, Jan Eitzinger, Harald Köstler

发表机构 * Erlangen National High Performance Computing Center(埃朗根国家高性能计算中心)

AI总结 本文提出Irminsul,一种基于MLA的原生位置无关缓存,通过内容哈希键和δ旋转规则提升代理LLM服务性能,实现高达83%的提示词恢复率和63%的预填能量节省。

详情
AI中文摘要

代理LLM工作负载在每个回合中将位相同的token置于移位位置,导致前缀缓存在首次字节差异处失效。操作员报告缓存命中回归范围从适度的延迟到严重的TTFT峰值,最高达10-16秒。先前的位置无关缓存系统纠正RoPE于完整的d_K维键上,这是由GQA架构造成的成本,而非缓存本身。多头潜在注意力(MLA)在DeepSeek-V2/V3/R1、Kimi-K2/Moonlight、GLM-5和Mistral Large 3中大规模部署,将每个KV行分解为位置无关的c_KV和64维k_r,可封闭形式纠正。这种结构使内容寻址缓存成为自然选择,而非GQA的退步。我们提出Irminsul,扩展SGLang的radix缓存,通过CDC分段的内容哈希键和δ旋转规则处理k_r。我们评估了三个原生MLA-MoE部署——DeepSeek-V2-Lite(16B/2.4B)、Kimi Moonlight-16B-A3B和JoyAI-Flash(48B/3B)——在所有三个上的输出一致性以及在两个端点上的恢复测量;Irminsul在代理流量中恢复多达约83%的提示词token,同时在每次缓存命中中提供63%的预填能量节省。我们主张内容寻址缓存应作为服务堆栈中的第一类原语,而非前缀匹配的退步。

英文摘要

Agentic LLM workloads put bit-identical tokens at shifted positions every turn, voiding prefix caches at the first byte of divergence. Operators report cache-hit regressions ranging from moderate slowdowns to severe TTFT spikes of 10-16s on unchanged content. Prior position-independent caching systems correct RoPE on the full $d_K$-dimensional key, an architectural cost imposed by GQA, not by caching itself. Multi-Head Latent Attention, deployed at scale in DeepSeek-V2/V3/R1, Kimi-K2/Moonlight, GLM-5, and Mistral Large 3, factors each KV row into a position-free $c_{KV}$ and a 64-dim $k_r$ correctable in closed form; this structure motivates content-addressed caching as a natural fit rather than a GQA workaround. We present Irminsul, which extends SGLang's radix cache with content-hash keying over CDC-chunked segments and a $δ$-rotation rule for $k_r$. We evaluate three native MLA-MoE deployments - DeepSeek-V2-Lite (16B/2.4B), Kimi Moonlight-16B-A3B, and JoyAI-Flash (48B/3B) - with output-consistency on all three and recovery measured on the two endpoints; Irminsul recovers up to ~83% of prompt tokens above exact-prefix on agentic traffic while delivering 63% prefill energy savings per cache hit. We argue that content-addressed caching belongs in the serving stack as a first-class primitive, not a retrofit over prefix matching.

2605.05683 2026-05-08 stat.ML cs.LG

Spectral Lens: Activation and Gradient Spectra as Diagnostics of LLM Optimization

谱镜:激活与梯度谱作为大语言模型优化的诊断工具

Andy Zeyi Liu, Elliot Paquette, John Sous

发表机构 * Department of Applied Physics, Yale University, New Haven, CT 06511, United States(耶鲁大学应用物理系) Department of Mathematics and Statistics, McGill University(麦吉尔大学数学与统计学系) Department of Applied Physics, Yale University, New Haven, Connecticut 06511, USA(耶鲁大学应用物理系) Energy Sciences Institute, Yale University, West Haven, Connecticut 06516, USA(耶鲁大学能源科学研究所)

AI总结 研究通过激活协方差和梯度SVD谱分析,揭示了批量大小对表示几何的影响、激活协方差尾部对后续token效率的预测作用,以及激活谱头部与梯度谱对学习动态变化的诊断作用。

详情
AI中文摘要

训练损失和吞吐量可能隐藏语言模型训练中的不同内部表示。为了探讨这些隐藏机制,我们使用谱测量作为实用且操作性的诊断工具。通过受修改NanoGPT代码库影响的解码器-only模型控制家族,我们引入了一种基于激活协方差和每样本梯度SVD谱的实证协议。这种双视角揭示了三个实证发现和一个机理解释。首先,批量大小作为潜在的表示几何决定因素:达到相同损失的运行系统地呈现出不同的激活谱。其次,训练早期测量的激活协方差尾部可靠地预测了后续token效率。第三,激活谱头部(主导模式)的移动,结合梯度谱,表征了底层学习动态变化,区分了学习侧架构改进与主要执行侧收益。这些预测和诊断信号在12层、36层和48层模型层级中持续存在。最后,一个机理模型证明了主要观察结果,并解释了激活协方差谱如何与任务对齐的特征学习相关联。

英文摘要

Training loss and throughput can hide distinct internal representation in language-model training. To examine these hidden mechanics, we use spectral measurements as practical and operational diagnostics. Using a controlled family of decoder-only models adapted from the modded NanoGPT codebase, we introduce an empirical protocol based on activation covariance and per-sample gradient SVD spectra. This dual-view reveals three empirical findings and one mechanistic explanation. First, batch size acts as a latent determinant of representation geometry: runs that reach equal loss settle into systematically distinct activation spectra. Second, the activation covariance tail measured early in training reliably forecasts downstream token efficiency. Third, movement of the activation spectrum head (leading modes), together with gradient spectra, characterizes underlying learning-dynamics changes, separating learning-side architectural improvements from primarily execution-side gains. These predictive and diagnostic signals persist across the 12-, 36-, and 48-layer model tiers. Finally, a mechanistic model proves the main observations and explains how activation covariance spectra correlate with task-aligned feature learning.

2605.05648 2026-05-08 cs.CY cs.AI cs.HC

The Missing Evaluation Axis: What 10,000 Student Submissions Reveal About AI Tutor Effectiveness

缺失的评估维度:10,000名学生提交揭示AI导师有效性

Rose Niousha, Samantha Boatright Smith, Bita Akram, Peter Brusilovsky, Arto Hellas, Juho Leinonen, John DeNero, Narges Norouzi

发表机构 * University of California, Berkeley, USA(加州大学伯克利分校) North Carolina State University, USA(北卡罗来纳州立大学) University of Pittsburgh, USA(匹兹堡大学) Aalto University, Finland(阿尔托大学)

AI总结 研究通过分析10,235份学生代码提交,探讨AI导师的反馈行为影响,发现基于学生互动的数据能更全面评估AI导师效果。

Comments Accepted to the 27th International Conference on Artificial Intelligence in Education (AIED 2026), Main Conference Track

详情
AI中文摘要

当前基于AI的辅导系统主要根据反馈信息的教育质量进行评估。然而,仅关注教育质量不足以全面评估AI导师效果,因为学生如何利用反馈是关键问题。本文提出一个包含学生互动数据的评估框架,应用于编程课程中的10,235份学生提交,以测量学生是否根据反馈行动并正确应用。通过比较不同学期的两个AI导师,发现基于行为的评估能揭示教育质量无法捕捉到的学生参与模式差异,并且这些行为信号与学生对反馈的感知帮助程度关联更紧密,从而提供更完整和可行的AI导师性能评估。

英文摘要

Current Artificial Intelligence (AI)-based tutoring systems (AI tutors) are primarily evaluated based on the pedagogical quality of their feedback messages. While important, pedagogy alone is insufficient because it ignores a critical question: what do students actually do with the feedback they receive? We argue that AI tutor evaluation should be extended with a behavioral dimension grounded in student interaction data, which complements pedagogical assessment. We propose an evaluation framework and apply it to 10,235 code submissions with corresponding AI tutor feedback from an introductory undergraduate programming course to measure whether students act on tutor feedback and whether those actions are applied correctly. Using this framework to compare two deployed AI tutors across different semesters in a large-scale introductory computer science course reveals substantial differences in student engagement patterns that are not captured by pedagogy-only evaluation. Moreover, these engagement-based behavioral signals are more strongly associated with student perception of helpful feedback than pedagogical quality alone, providing a more complete and actionable picture of AI tutor performance.

2605.05625 2026-05-08 quant-ph cs.LG

Quantum Kernels for Parity-Structured Classification: A Hybrid Pipeline

量子核用于奇偶结构分类:一种混合流程

Tushar Pandey

发表机构 * Texas A\&M University, Department of Mathematics College Station, TX, USA ORCID: 0000-0001-7448-5723

AI总结 本文研究量子核在奇偶复杂度下的优势,通过混合流程揭示奇偶结构,发现高复杂度下量子核性能显著优于经典方法。

详情
AI中文摘要

奇偶(XOR)分类需要检测离散的高阶特征交互,经典核无法高效捕捉。本文研究量子核优势依赖于奇偶复杂度,即进入XOR规则的特征数,并发现明确的阈值行为。通过将ZZ量子特征映射与二进制{0, π}编码(特征中位数阈值后输入电路)结合,以暴露奇偶结构。二进制编码的消融实验表明,在低复杂度(n=5特征)时,二进制RBF达到83.4%±1.7%,量子核为81.2%±1.9%,显示编码驱动性能。在高复杂度(n=11特征,11个量子比特,r=3 ZZ重复)时,所有经典方法崩溃至近随机(约50%),二进制RBF仅达到54.3%±1.1%,而量子ZZ核达到66.3%±3.2%(均值±标准差,10种子),比二进制消融高12.0个百分点,且核目标对齐度高7倍(0.094±0.020 vs. 0.013±0.001)。这些结果表明,奇偶复杂度是量子核优势出现的明确轴,而非仅由编码驱动。

英文摘要

Parity (XOR) classification requires detecting discrete, high-order feature interactions that smooth classical kernels cannot efficiently capture. We study how quantum kernel advantage depends on parity complexity, the number of features entering the XOR rule, and find a clear threshold behavior. We pair a ZZ quantum feature map with binary {0, pi} encoding (features median thresholded before circuit input) to expose parity structure. A binary encoding ablation, RBF SVM trained on the identical {0, pi} features, separates encoding from circuit effects: at low complexity (n = 5 features), binary RBF achieves 83.4% +/- 1.7% and the quantum kernel 81.2% +/- 1.9%, showing encoding drives performance there. At high complexity (n = 11 features, 11 qubits, r = 3 ZZ repetitions), all classical methods collapse to near-random (approx. 50%), binary RBF reaches only 54.3% +/- 1.1%, and the quantum ZZ kernel achieves 66.3% +/- 3.2% (mean +/- std, 10 seeds), a +12.0 percentage-point margin over the binary ablation and approx. 7x higher kernel-target alignment (0.094 +/- 0.020 vs. 0.013 +/- 0.001). These results identify parity complexity as a concrete axis along which genuine quantum kernel advantage, not attributable to encoding alone, emerges.

2605.05606 2026-05-08 stat.ML cs.LG math.PR

Variational Smoothing and Inference for SDEs from Sparse Data with Dynamic Neural Flows

变分平滑与SDEs从稀疏数据中推断的动态神经流

Yu Wang, Arnab Ganguly

发表机构 * Department of Mathematics(数学系) Louisiana State University(路易斯安那州立大学)

AI总结 本文提出基于动态神经流的变分平滑方法,通过条件后向时间分数建模SDEs,实现高效轨迹采样和参数估计,实验表明在非线性系统中具有更高的可扩展性。

Comments Yu Wang and Arnab Ganguly contributed equally to this work. Corresponding to Arnab Ganguly

详情
AI中文摘要

随机微分方程(SDEs)为部分观测系统的时序动态建模提供了灵活的框架。核心任务是从数据中校准此类模型,需要从稀疏、噪声观测中推断潜在轨迹和参数。经典平滑方法常受限于路径退化和较差的可扩展性。本文开发了一种基于后验SDE特征化的方法,定义为条件后向时间分数,即解Kolmogorov反向方程的函数梯度,并在观测时间进行乘法更新。通过神经网络学习该条件分数,使其同时满足 governing PDE 和观测诱导的跳跃条件,从而将连续时间动态与离散贝叶斯更新整合。所得到的分数诱导了一个具有相同扩散系数但修改后的漂移的后验SDE,实现了高效的后验轨迹采样。进一步推导了一个基于似然的目标函数用于学习SDE参数,得到联合状态平滑和参数估计的证据下界(ELBO)。这导致了一种变分EM式过程,其中神经条件分数被优化以近似平滑分布,随后通过诱导后验样本最大化SDE参数。在非线性系统中的实验表明,使用极少量观测可实现准确且稳定的推断,相较于经典MCMC方法具有显著的可扩展性改进。

英文摘要

Stochastic differential equations (SDEs) provide a flexible framework for modeling temporal dynamics in partially observed systems. A central task is to calibrate such models from data, which requires inferring latent trajectories and parameters from sparse, noisy observations. Classical smoothing methods for this problem are often limited by path degeneracy and poor scalability. In this work, we developed a novel method based on characterization of the posterior SDE in terms of conditional backward-in-time score defined as the gradient of a function solving a Kolmogorov backward equation with multiplicative updates at observation times. We learn this conditional score using neural networks trained to satisfy both the governing PDE and the observation-induced jump conditions, thereby integrating continuous-time dynamics with discrete Bayesian updates. The resulting score induces a posterior SDE with the same diffusion coefficient but a modified drift, enabling efficient posterior trajectory sampling. We further derive a likelihood-based objective for learning the SDE parameters, yielding an evidence lower bound (ELBO) for joint state smoothing and parameter estimation. This leads to a variational EM-style procedure, where the neural conditional score is optimized to approximate the smoothing distribution, followed by a maximization step over the SDE parameters using samples from the induced posterior. Experiments on nonlinear systems demonstrate accurate and stable inference with a very few observations demonstrating significant improved scalability compared to classical MCMC methods.

2605.05581 2026-05-08 cs.DC cs.LG

A Scalable Digital Twin Framework for Energy Optimization in Data Centers

可扩展的数字孪生框架用于数据中心的能量优化

Raphael Hendrigo de Souza Gonçalves, Wendel Marcos dos Santos

发表机构 * Federal University of São João del-Rei(圣约翰德尔雷伊联邦大学) Federal Institute of Education Science and Technology of São Paulo(圣保罗教育科技联邦 institute)

AI总结 本文提出一种可扩展的数字孪生框架,通过物联网数据采集、云计算和机器学习技术实现实时监控与能耗管理,实验结果表明在提升能效和降低能耗方面有显著改进。

Comments 11 pages, 2 figures

详情
AI中文摘要

本文提出了一种可扩展的数字孪生框架,用于数据中心的能量优化。该框架集成了基于物联网的数据采集、云计算和机器学习技术,以实现实时监控、预测和智能能耗管理。开发了一个受控的小规模数据中心环境来监控诸如电力消耗、温度和计算负载等变量。采用了长短期记忆(LSTM)模型来预测能源需求并支持运营决策。实验结果表明,在能效方面有所改进,包括电力消耗的减少和功率利用率(PUE)的提升。尽管是在受限环境中评估的,所提出的框架展示了作为可持续数据中心管理的可扩展且经济有效的解决方案的强大潜力。

英文摘要

This study proposes a scalable Digital Twin framework for energy optimization in data centers.The framework integrates IoT-based data acquisition, cloud computing, and machine learning techniques to enable real-time monitoring, forecasting, and intelligent energy management. A controlled small-scale data center environment was developed to monitor variables such as power consumption, temperature, and computational workload. Long Short-Term Memory (LSTM) models were employed to predict energy demand and support operational decision-making. Experimental results demonstrated improvements in energy efficiency, including reductions in power consumption and enhancements in Power Usage Effectiveness (PUE). Despite being evaluated in a constrained environment, the proposed framework demonstrates strong potential as a scalable and cost-effective solution for sustainable data center management.

2605.05575 2026-05-08 eess.SY cs.RO cs.SY math.OC

Maximal Controlled Invariant-MPC: Enhancing Feasibility and Reducing Conservatism through Terminal CBF Constraint in Safety-Critical Control

最大可控不变MPC:通过终端CBF约束提升可行性并减少保守性

Tanmay Dokania, Yashwanth Kumar Nakka

发表机构 * Graduate Student, Daniel Guggenheim School of Aerospace Engineering(丹尼尔·古吉恩航空航天工程学院研究生) Assistant Professor, Director, Aerospace Robotics Lab, Daniel Guggenheim School of Aerospace Engineering(丹尼尔·古吉恩航空航天工程学院助理教授、主任、航空航天机器人实验室)

AI总结 本文提出一种利用CBF作为终端约束的MPC方法,通过增加预测时间 horizon 提高可行性与可达集,减少保守性,并通过非holonomic系统仿真验证了其有效性。

Comments Under review

详情
AI中文摘要

安全关键系统的最优控制往往依赖于约束的保守性。控制屏障函数(CBFs)作为表示此类约束的介质,但构造最小保守性CBF是一个计算上不可行的问题。因此,能够保证安全同时减少保守性的方法将有助于提高所考虑系统的最优性。本文提出了一种使用CBF作为终端约束的模型预测控制(MPC)公式,证明随着预测时间 horizon 的增加,该方法能提高可行性与可达集。证明的构造性允许对非线性优化问题进行预热启动,从而显著减少计算时间。为简单的非holonomic系统设置仿真以验证结果,并观察到不可行点的数量减少了1.7到2.7倍。通过系统能够跟踪完全位于控制屏障函数不安全区域内的轨迹,展示了可达状态空间的增加。

英文摘要

Optimal control for safety-critical systems is often dependent on the conservativeness of constraints. Control Barrier Functions (CBFs) serve as a medium to represent such constraints, but constructing a minimally conservative CBF is a computationally intractable problem. Therefore, approaches that can guarantee safety while reducing conservatism will help improve the optimality of the system under consideration. Here, we present a Model Predictive Control (MPC) formulation using CBF as a terminal constraint, which is proven to improve feasibility and reachable sets with increasing prediction horizon. The constructive nature of the proofs allows for warm-starting the nonlinear optimization problem, thereby reducing the computational time substantially. Simulations are set up for a simple nonholonomic system to numerically validate the results, and it is observed that the number of infeasible points decreased by a factor of 1.7 to 2.7. The increase in reachable state space was demonstrated by the ability of the system to track trajectories that are entirely inside the unsafe region of the control barrier function.

2605.05573 2026-05-08 astro-ph.IM cs.AI

AstroAlertBench: Evaluating the Accuracy, Reasoning, and Honesty of Multimodal LLMs in Astronomical Classification

AstroAlertBench: 评估多模态大语言模型在天文学分类中的准确性、推理和诚实性

Claire Chen, Jiabao Sean Xiao, Shuze Daniel Liu, Facundo Perez Paolino, Luke Handley, Theophile Jegou du Laz, Ricky Nilsson, Alice Zou, Matthew Graham, Ashish Mahabal

发表机构 * California Institute of Technology(加州理工学院) Massachusetts Institute of Technology(麻省理工学院) Purdue University(普渡大学)

AI总结 本文提出AstroAlertBench,通过多阶段逻辑链评估多模态大语言模型在天文学事件分类中的性能,揭示高精度与模型诚实性之间的矛盾,并引入人机协同评估协议。

详情
AI中文摘要

现代天文观测站产生大量多模态数据,成为专家人工审核的关键瓶颈。尽管多模态大语言模型在解读复杂视觉和文本输入方面表现出色,但其在进行专业科学分类并提供可解释推理的能力仍缺乏研究。我们引入AstroAlertBench,一个综合性的多模态基准,旨在评估LLM在天文学事件审核中的性能,沿三个阶段的逻辑链:元数据基础、科学推理和五类层次分类。我们使用1,500个真实世界的警报样本,来自Zwicky瞬态设施(ZTF)的宽视场调查,该调查扫描北半球天空以检测瞬变天文事件。在该数据集上,我们评估了13个前沿闭源和开源LLM,支持视觉输入。我们的结果表明,高精度并不总能与模型的“诚实性”一致,即模型自我评估其推理能力的能力,这影响其作为现实助手的可靠性。我们进一步初始化了人机协同评估协议,作为未来社区规模参与的先驱。共同,AstroAlertBench提供了一个开发校准和可解释天文学助手的框架。

英文摘要

Modern astronomical observatories generate a massive volume of multimodal data, creating a critical bottleneck for expert human review. While multimodal large language models (LLMs) have shown promise in interpreting complex visual and textual inputs, their ability to perform specialized scientific classification while providing interpretable reasoning remains understudied. We introduce AstroAlertBench, a comprehensive multimodal benchmark designed to evaluate LLM performance in astronomical event review along a three-stage logical chain: metadata grounding, scientific reasoning, and hierarchical classification over five categories. We use a pilot sample of 1,500 real-world alerts from the Zwicky Transient Facility (ZTF), a wide-field survey that scans the northern sky to detect transient astronomical events. On this dataset, we benchmark 13 frontier closed-source and open-weight LLMs that support visual input. Our results reveal that high accuracy does not always align with model ``honesty,'' defined as the ability to self-evaluate its reasoning, which affects its reliability as a real-world assistant. We further initialize a human-in-the-loop evaluation protocol as a precursor to future community-scale participation. Together, AstroAlertBench provides a framework for developing calibrated and interpretable astronomical assistants.

2605.05568 2026-05-08 stat.ML cs.LG

Relaxed Sparsest-Permutation Formulation for Causal Discovery at Scale

放宽的稀疏排列公式用于大规模因果发现

Sunmin Oh, Sang-Yun Oh, Gunwoong Park

发表机构 * Department of Statistics(统计学系) Seoul National University(首尔国立大学) Department of Statistics and Applied Probability(统计学与应用概率系) University of California Santa Barbara(加州大学圣芭芭拉分校) Scientific Data Division(科学数据部) Lawrence Berkeley National Laboratory(伯克利国家实验室) Interdisciplinary Program in Artificial Intelligence(人工智能跨学科项目) Institute for Data Innovation in Science(科学数据创新研究所)

AI总结 本文提出一种放宽的稀疏排列方法,用于大规模因果发现,通过稀疏三角因子在精度支持筛查图上进行搜索,提高了计算效率和可扩展性。

详情
AI中文摘要

尽管大规模数据集的可用性增加,但因果结构学习在大规模上仍计算上不可行。我们重新审视了线性结构方程模型中的稀疏排列学习,并表明精确的Cholesky因子分解对于结构恢复是不必要的。这一观察促使了一种支持级别放松,搜索稀疏三角因子在精度支持筛查图上。放松的公式可以通过掩码零填充不完全Cholesky因子分解高效评估,使候选顺序的可扩展比较成为可能。在总体层面,我们建立了在无取消和稀疏最马克等价类(MEC)恢复假设下的正确性,以及对顺序误指定的鲁棒性。受这些保证的启发,我们引入了SCOPE,一种稀疏Cholesky流水线,提供了放松公式的可扩展实现。在合成和真实数据集上的实验表明,SCOPE在MEC恢复准确性方面与明显较慢的基线相匹配,同时实现了显著减少的运行时间和扩展到10k变量。

英文摘要

Despite the growing availability of large datasets, causal structure learning remains computationally prohibitive at scale. We revisit sparsest-permutation learning for linear structural equation models and show that exact Cholesky factorization is unnecessary for structure recovery. This observation motivates a support-level relaxation that searches for sparse triangular factors over a precision-support screening graph. The relaxed formulation can be efficiently evaluated via masked zero-fill incomplete Cholesky factorization, enabling scalable comparison of candidate orderings. At the population level, we establish soundness for Markov equivalence class (MEC) recovery under no-cancellation and sparsest Markov representation assumptions, as well as robustness to ordering misspecification. Motivated by these guarantees, we introduce SCOPE, a sparse-Cholesky pipeline that provides a scalable implementation of the relaxed formulation. Experiments on synthetic and real datasets demonstrate that SCOPE matches the MEC recovery accuracy of substantially slower baselines, while achieving significantly reduced runtime and scaling to 10k variables.