arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 8098
2606.03428 2026-06-03 cs.NE cs.AI cs.LG

PrimeSVT: An Automated Memory-aware Pruning Framework with Prioritized Compression Policy for Spiking Vision Transformers

PrimeSVT: 一种具有优先压缩策略的自动化内存感知剪枝框架用于脉冲视觉Transformer

Rachmad Vidya Wicaksana Putra, Achyuta Muthuvelan, Alberto Marchisio, Muhammad Shafique

发表机构 * eBRAIN Lab, Division of Engineering, New York University (NYU) Abu Dhabi(eBRAIN实验室,工程系,纽约大学(NYU)阿布扎克分校) New York University (NYU) Abu Dhabi, United Arab Emirates (UAE)(纽约大学(NYU)阿布扎克分校,阿拉伯联合酋长国(UAE))

AI总结 提出PrimeSVT框架,通过自动化结构化剪枝和优先压缩策略,在满足精度和内存约束下压缩脉冲视觉Transformer,实现内存节省26.68%且精度损失小于3%。

Comments 8 pages, 8 figures, 3 tables

详情
AI中文摘要

脉冲视觉Transformer(SViT)的大尺寸仍然阻碍其嵌入式实现,因此需要模型压缩。现有工作通过非结构化剪枝压缩SViT模型,这需要专门的硬件加速器来利用其特定的稀疏模式以最大化效率提升。此外,它们的手动方法需要大量设计时间来为每个网络找到合适的剪枝设置,因此这种方法不可扩展。为了解决这一限制,我们提出了PrimeSVT,一种新颖的框架,对预训练的SViT模型执行自动化的内存感知结构化剪枝,从而在推理期间最大化其效率提升,适用于广泛使用的计算架构。为此,PrimeSVT首先根据层的大小(即参数数量)对SViT层进行排序,根据它们在不同剪枝率下的鲁棒性识别目标剪枝层,然后利用这个顺序从最大层到最小层逐层顺序压缩模型(即所谓的优先压缩策略),同时考虑用户定义的约束(即可接受的精度和内存节省)。在每一层中,PrimeSVT基于L2范数值采用通道级滤波器剪枝,以结构性地移除不重要的权重。实验结果表明,PrimeSVT通过自动化单次剪枝节省了26.68%的内存,同时将精度保持在原始未剪枝SViT模型(73.3%)的3%以内(未微调时为70.3%,微调后为72.9%),从而满足了精度和内存约束。这些表明我们的PrimeSVT框架实现了SViT及其嵌入式实现的设计自动化。

英文摘要

The large sizes of Spiking Vision Transformers (SViTs) still hinder their embedded implementation, highlighting the need for model compression. State-of-the-art works compress SViT models through unstructured pruning, which needs specialized hardware accelerators for their specific sparsity patterns to maximize efficiency gains. Moreover, their manual approach requires a huge design time to find an appropriate pruning setting for each network, thus making this approach not scalable. To address this limitation, we propose PrimeSVT, a novel framework that performs automated memory-aware structured pruning on pre-trained SViT models, thereby maximizing their efficiency gains during inference amenable to widely-used computing architectures. To achieve this, PrimeSVT first sorts the SViT layers based on their sizes (i.e., number of parameters), identifies the targeted pruning layers based on their robustness under different pruning rates, then leverages this order for compressing the model layer-by-layer sequentially from the largest one to the smallest one (i.e., so-called prioritized compression policy), while considering the user-defined constraints (i.e., acceptable accuracy and memory saving). In each layer, PrimeSVT employs channel-wise filter pruning based on their L2-norm values to structurally remove the non-significant weights. Experimental results show that PrimeSVT saves 26.68% memory through automated single-shot pruning, while preserving accuracy within 3% (70.3% without fine-tuning and 72.9% with fine-tuning) from the original unpruned SViT model (73.3%), thus meeting the accuracy and memory constraints. These show that our PrimeSVT framework enables design automation for SViTs and their embedded implementation.

2606.03381 2026-06-03 cs.CR cs.AI

AI Model Extraction Attacks: Bypassing Single-Client Assumptions in Defenses

AI模型提取攻击:绕过防御中的单客户端假设

Maxime Schwarzer, Johannes F. Loevenich, Gustavo Sánchez, Laurin Holz, Thies Möhlenhof, Tobias Hürten, Roberto Rigolin F. Lopes, Veit Hagenmeyer

发表机构 * ETH Zurich(苏黎世联邦理工学院) University of Zurich(苏黎世大学) University of Tübingen(图宾根大学)

AI总结 本文通过提出CerberusAI框架,系统性地证明模型提取攻击中的单客户端假设(SCA)在高级持续性威胁(APT)等协同攻击者面前无效,并展示基本轮询查询分布策略即可绕过PRADA等防御机制,呼吁转向无状态、独立于身份的防御架构。

详情
AI中文摘要

确保部署在军事指挥控制(C2)系统和关键基础设施中的人工智能(AI)模型的保护对于维持信息优势至关重要。模型提取攻击(MEA)构成了重大威胁,因为它们使对手能够复制专有模型、泄露受保护信息并准备离线对抗性攻击。然而,当前的防御策略主要依赖于单客户端假设(SCA),即隐含地假设攻击源自孤立身份。本工作系统地证明了在协同威胁行为者(如高级持续性威胁APT)存在的情况下,SCA从根本上无效。我们引入了一个模块化、开源框架CerberusAI,用于可复现的模型窃取研究,并利用它模拟分布式攻击场景。我们的实证评估表明,成熟的防御机制(如防止深度神经网络模型窃取攻击PRADA)可以通过基本的轮询查询分布策略被绕过,导致检测性能显著下降。此外,我们证明即使是全局聚合方法也可以通过自适应流量混合使其在操作上变得无用。这些结果强调了在模型提取攻击领域需要向有状态、独立于身份的防御架构进行范式转变。本文最初发表于由信息系统技术(IST)科学与技术委员会IST-224-RSY组织的国际军事通信与信息系统会议(ICMCIS),该会议于2026年5月12-13日在英国巴斯举行,并获得了最佳论文奖。

英文摘要

Ensuring the protection of Artificial Intelligence (AI) models deployed in military Command and Control (C2) systems and critical infrastructure is essential for maintaining information superiority. Model Extraction Attacks (MEAs) pose a significant threat, as they enable adversaries to replicate proprietary models, compromise protected information, and prepare offline adversarial attacks. However, current defense strategies predominantly rely on the Single Client Assumption (SCA), which is the implicit assumption that attacks originate from isolated identities. This work systematically demonstrates that the SCA is fundamentally invalid in the presence of coordinated threat actors, such as Advanced Persistent Threats (APTs). We introduce a modular, open-source framework called CerberusAI for reproducible model-stealing research, and use it to simulate distributed attack scenarios. Our empirical evaluation shows that well-established defense mechanisms, such as Protecting Against Deep Neural Network Model Stealing Attacks (PRADA), can be bypassed by basic round-robin query distribution strategies, resulting in a significant reduction in detection performance. Furthermore, we demonstrate that even global aggregation approaches can be rendered operationally useless through adaptive traffic mixing. These results highlight the need for a paradigm shift towards stateful, identity-independent defense architectures in the field of model extraction attacks. This paper was originally presented at the International Conference on Military Communication and Information Systems (ICMCIS), organized by the Information Systems Technology (IST) Scientific and Technical Committee, IST-224-RSY - the ICMCIS, held in Bath, United Kingdom, 12-13 May 2026 and won the best paper award.

2606.03344 2026-06-03 cs.CR cs.LG

RogueMerge: Robust and Unified Attacks against LLM Model Merging

RogueMerge: 针对大语言模型合并的鲁棒统一攻击

Jinghuai Zhang, Yetian He, Kunlin Cai, Han Zhao, Fnu Suya, Yuan Tian

发表机构 * University of California, Los Angeles(加州大学洛杉矶分校)

AI总结 提出RogueMerge框架,通过联合优化、元学习模拟和分布鲁棒优化,解决模型合并中针对自回归生成、未知合并配置和攻击提示泛化的三大挑战,实现鲁棒且统一的攻击。

详情
AI中文摘要

模型合并通过聚合来自未验证公共平台的任务向量,将专门能力组合到单个大语言模型中,暴露了关键的供应链攻击面:由于任何恶意行为都可以编码到任务向量中,并且合并允许第三方向量直接写入模型权重,攻击者提供的任务向量可以启用或放大多种下游威胁。先前的工作仅研究针对分类器的模型合并的后门攻击,使用静态算术启发式方法,由于三个原因无法有效处理针对生成式大语言模型的各种攻击。(i) 大语言模型依赖于自回归解码,合并引入的微小参数漂移会在令牌间累积并迅速降低攻击效果。(ii) 攻击者不知道受害者的合并配置,导致独立优化的静态攻击向量容易被稀释或破坏。(iii) 实际威胁诱导必须泛化到优化期间未见过的攻击提示,静态向量无法充分编码。我们提出RogueMerge,这是第一个原则性的统一框架,解决了所有三个挑战。为了处理自回归生成,我们用联合优化替代静态算术,明确强制合并后的攻击成功。为了处理未知的合并设置,我们将攻击注入表述为随机最小-最大问题,并通过元学习风格的模拟来解决。为了在异构攻击提示间泛化,我们采用分布鲁棒优化,并在大语言模型规模下推导出可处理的一阶泰勒近似,具有可证明的误差界。在四种威胁、六种合并算法和超过170个合并的大语言模型上,RogueMerge始终优于现有攻击。它还在多种合并设置下保持稳定,并能抵抗标准防御。

英文摘要

Model merging composes specialized capabilities into a single LLM by aggregating task vectors sourced from unverified public platforms, exposing a critical supply-chain attack surface: Because any malicious behavior can be encoded into a task vector, and merging grants third-party vectors direct write access to model weights, an attacker-provided task vector can enable or amplify diverse downstream threats. Prior work studies only backdoor attacks against model merging for classifiers using static arithmetic heuristics, which fail to effectively handle diverse attacks on generative LLMs for three reasons. (i) LLMs rely on autoregressive decoding, where the minor parameter drift introduced by merging compounds across tokens and rapidly degrades the attack. (ii) Attackers have no knowledge of the victim's merging configurations, causing a static attack vector optimized in isolation to be easily diluted or destroyed. (iii) Practical threat induction must generalize to attack prompts unseen during optimization, which static vectors cannot adequately encode. We present RogueMerge, the first principled, unified framework that addresses all three challenges. To handle autoregressive generation, we replace static arithmetic with a joint optimization that explicitly enforces attack success after merging. To handle unknown merging settings, we formulate attack injection as a stochastic min-max problem and solve it via meta-learning-style simulation. To generalize across heterogeneous attack prompts, we employ distributionally robust optimization and derive a tractable first-order Taylor approximation at LLM scale, with a provable error bound. Across four threats, six merging algorithms, and over 170 merged LLMs, RogueMerge consistently outperforms existing attacks. It also remains stable across diverse merging settings and resists standard defenses.

2606.03327 2026-06-03 cs.DB cs.CL

CAPER: Clause-Aligned Process Supervision for Text-to-SQL

CAPER: 面向Text-to-SQL的子句对齐过程监督

Lujie Ban, Jiasheng Shi, Jinyang Li, Xiaolin Han, Tsz Nam Chan, Chenhao Ma

发表机构 * The Chinese University of Hong Kong, Shenzhen(香港中文大学(深圳)) The University of Hong Kong(香港大学) Northwestern Polytechnical University(西北工业大学) Shenzhen University(深圳大学)

AI总结 提出CAPER方法,通过反事实干预SQL抽象语法树自动推导子句级监督,训练轻量级Clause-PRM模型CAPER-9B,用于策略优化和候选验证,在BIRD和Spider数据集上提升了执行准确率和故障定位能力。

详情
AI中文摘要

Text-to-SQL系统通常通过查询级执行正确性进行评估,但这种终端信号对于哪个中间SQL决策导致成功或失败几乎没有指导作用。Token级密集监督也不适合:SQL token与完整的语义决策不对齐,可能惩罚执行等效的查询,并且难以大规模可靠标记。因此,我们提出CAPER,通过对SQL抽象语法树进行反事实干预自动推导子句级监督,实现用于奖励建模的根因错误定位;所得数据用于训练CAPER-9B,一个轻量级的Clause-PRM,为策略优化和候选验证提供子句边界反馈。在BIRD和Spider上的实验表明,子句对齐监督不仅提高了执行准确率(相对于GPT-5.4实现了高达15.3%的相对EX提升),还增强了故障定位能力,在保留的故障上达到了84.53%的准确率和90.60%的MRR。我们的项目页面位于此https URL。

英文摘要

Text-to-SQL systems are typically evaluated by query-level execution correctness, but this terminal signal provides little guidance about which intermediate SQL decision caused success or failure. Token-level dense supervision is also ill-suited: SQL tokens do not align with complete semantic decisions, can penalize execution-equivalent queries, and are difficult to label reliably at scale. We therefore propose CAPER, which automatically derives clause-level supervision via counterfactual intervention on the SQL abstract syntax tree, enabling root-cause error localization for reward modeling; the resulting data is used to train CAPER-9B, a lightweight Clause-PRM that provides clause-boundary feedback for policy optimization and candidate verification. Experiments on BIRD and Spider show that clause-aligned supervision not only improves execution accuracy, achieving up to a 15.3% relative EX improvement over GPT-5.4, but also strengthens failure-localization capability, reaching 84.53% accuracy and 90.60% MRR on held-out failures. Our project page is at https://github.com/banrichard/RL-NL2SQL.

2606.03288 2026-06-03 cs.CY cs.AI

AI-Generated Traces for Novice Programmers: Learning Effects and Learner Differences in a Multi-Institutional Study

AI生成的新手程序员追踪:多机构研究中的学习效果与学习者差异

Yuri Noviello, Naaz Sibia, Anastasiia Birillo, Thomas Overklift Vaupel Klein, Michael Liut, Gosia Migut

发表机构 * Delft University of Technology(代尔夫特理工大学) University of Toronto(多伦多大学) JetBrains Research(JetBrains研究)

AI总结 本研究提出AI生成的类比动画追踪(GATs),通过多机构实验比较其与文本解释对新手程序员学习程序执行的影响,发现GATs在即时学习上有选择性优势,但效果依赖情境且短暂,且受学习者参与度调节。

详情
AI中文摘要

入门编程(CS1)课程常常难以支持学生对程序执行的理解。虽然可视化可以使执行过程明确,但其有效性取决于设计和情境,而AI生成可视化的实证证据仍然有限。我们提出了生成动画追踪(GATs),即基于AI生成的、类比驱动的、配有旁白的动画,协调源代码、执行状态和概念类比。我们在两个机构的CS1课程中(Python,N=961;Java,N=151)进行了一项研究,比较GATs与文本解释。我们测量了即时学习表现和体验、课程结束时的参与度和考试成绩。结果表明,GATs可以在即时学习方面产生选择性优势,但优势取决于情境且是短期的。我们观察到GATs对表现的影响受到学习者参与度概况的调节。这一发现强调了个性化方法的重要性。

英文摘要

Introductory programming (CS1) courses often struggle to support students' understanding of program execution. While visualizations can make execution processes explicit, their effectiveness depends on design and context, and empirical evidence for AI-generated visualizations remains limited. We propose Generated Animated Traces (GATs), AI-generated, analogy-based, narrated animations that coordinate source code, execution state, and conceptual analogies. We conduct a study at two institutions in CS1 courses (Python, N=961; Java N=151) comparing GATs to textual explanations. We measure immediate learning performance and experience, end-of-course engagement and exam performance. Results show that GATs can yield selective benefits for immediate learning, but benefits are context-dependent and short-term. We observe that GATs' influence on performance is moderated by learner engagement profiles. This finding underscores the importance of personalized approaches.

2606.03257 2026-06-03 cs.NE cs.AI cs.LG

PSViT: A Methodology for Structurally Pruning Spiking Vision Transformers

PSViT:一种结构剪枝脉冲视觉Transformer的方法

Rachmad Vidya Wicaksana Putra, Achyuta Muthuvelan, Alberto Marchisio, Muhammad Shafique

发表机构 * eBRAIN Lab, Division of Engineering, New York University (NYU) Abu Dhabi(eBRAIN实验室,工程系,纽约大学(NYU)阿布扎赫德分校) New York University (NYU) Abu Dhabi, United Arab Emirates (UAE)(纽约大学(NYU)阿布扎赫德分校,阿拉伯联合酋长国(UAE))

AI总结 提出PSViT方法,通过结构化剪枝(均匀通道滤波器和基于敏感性的细粒度剪枝)压缩脉冲视觉Transformer,在ImageNet-1K上实现22.4%内存节省且精度损失小于3%。

Comments 8 pages, 7 figures, 3 tables

详情
AI中文摘要

脉冲视觉Transformer(SViT)模型是很有前景的低功耗ViT模型,用于解决基于视觉的任务,具有最先进的性能。然而,它们的大尺寸限制了在资源受限的嵌入式平台上的部署,凸显了模型压缩的需求。一种突出的压缩技术是剪枝,最先进的工作采用非结构化剪枝技术来压缩SViT模型。这种技术需要专门针对稀疏模式定制的硬件架构才能最大化其效率优势,使得这种方法不可扩展。为了解决这个问题,我们提出了PSViT,一种对SViT模型进行结构化剪枝的新方法,从而使得利用现有且广泛使用的计算架构高效加速其推理成为可能。为此,PSViT采用了几个关键步骤:均匀通道滤波器剪枝以结构化消除非显著权重,敏感性分析以评估单层通道剪枝对精度和网络大小的影响,以及基于敏感性分析和给定网络架构的细粒度通道剪枝。实验结果表明,PSViT通过单次剪枝有效获得了22.4%的内存节省,同时在ImageNet-1K上保持高精度(未经微调为70.3%,经微调为72.8%),与原始未剪枝SViT模型(73.3%)相比精度损失在3%以内。这些结果还表明,PSViT方法推进了在资源受限应用中实现高效SViT部署的努力。

英文摘要

Spiking Vision Transformer (SViT) models are promising low-power ViT models for solving vision-based tasks with state-of-the-art performance. However, their large sizes limit their deployments for resource-constrained embedded platforms, underscoring the needs of model compression. One of prominent compression techniques is pruning, and the state-of-the-art works employ unstructured pruning techniques to compress SViT models. Such techniques require specialized hardware architectures tailored for the sparsity patterns to maximize their efficiency benefits, making this approach not scalable. To address this, we propose PSViT, a novel methodology to perform structured pruning on SViT models, hence making it possible to efficiently accelerate their inference using the existing and widely-used computing architectures. To do this, PSViT employs several key steps: uniform channel-wise filter pruning to structurally eliminate the non-significant weights, sensitivity analysis to evaluate the impact of channel-wise pruning of individual layer on accuracy and network size, as well as fine-grained channel-wise pruning based on the sensitivity analysis and the given network architecture. Experimental results show that PSViT effectively obtains 22.4% memory saving through single-shot pruning, while maintaining high accuracy within 3% (70.3% without fine-tuning and 72.8% with fine-tuning) from the original non-pruned SViT model (73.3%) on the ImageNet-1K. These results also show that the PSViT methodology advances the effort in enabling efficient SViT deployments on resource-constrained applications.

2606.03210 2026-06-03 cs.CE cs.LG cs.NA math.NA

Critical evaluation of PINN for FWD inverse analysis and differentiable FEM as an alternative

PINN 在 FWD 反分析中的批判性评估及可微有限元方法作为替代方案

Yongjin Choi, Hyeonbin Moon, Seunghwa Ryu

发表机构 * KAIST(韩国科学技术院)

AI总结 本文批判性评估了物理信息神经网络(PINN)在多层路面系统落锤式弯沉仪(FWD)反分析中的表现,并提出可微有限元方法(DiffFEM)作为更准确、稳定和高效的替代方案。

详情
AI中文摘要

基于自动微分的反分析方法,包括物理信息神经网络(PINN)和可微编程,最近因其计算精确梯度和收敛效率的能力而显示出巨大潜力。然而,它们对落锤式弯沉仪(FWD)反计算的适用性尚未被探索。本研究基于合成基准,批判性评估了基于PINN的多层路面系统反分析,并研究了可微有限元方法(DiffFEM)作为替代方案。标准PINN由于层状路面系统固有的尖锐域不连续性而无法恢复层模量。尽管我们使用了具有域分解的扩展PINN(XPINN),它在不连续域上表现更好,但其性能仍然对损失权重和网络架构高度敏感,并且在测量噪声下会退化。相比之下,DiffFEM始终获得更准确、稳定且计算高效的反演结果。这些结果表明,将控制物理作为硬约束强加的DiffFEM比基于PINN的方法(其中控制物理通过损失函数作为软约束施加)具有更好的准确性、鲁棒性和计算效率。更广泛地说,研究结果表明,在基于PINN和DiffFEM的反分析之间进行选择需要仔细考虑,当存在高效且稳健的可微正演求解器时,DiffFEM提供了实际优势。

英文摘要

Automatic-differentiation-based inverse analysis methods, including physics-informed neural networks (PINNs) and differentiable programming, have recently shown great promise due to their ability to compute accurate gradients and convergence efficiency. However, their applicability to falling weight deflectometer (FWD) backcalculation remains unexplored. This study critically evaluates PINN-based inverse analysis for a multilayer pavement system and investigates differentiable finite element method (DiffFEM) as an alternative based on a synthetic benchmark. The standard PINN does not recover layer moduli because of the sharp domain discontinuities inherent to layered pavement systems. Although we use an extended PINN with domain decomposition (XPINN), which shows better performance on discontinuous domains, its performance remains highly sensitive to loss weighting and network architecture, and degrades under measurement noise. By contrast, DiffFEM consistently achieves more accurate, stable, and computationally efficient inversion results. These results indicate that DiffFEM, which enforces the governing physics as a hard constraint, yields better accuracy, robustness, and computational efficiency than PINN-based approaches, in which the governing physics is imposed as a soft constraint through the loss function. More broadly, the findings suggest that the choice between PINN- and DiffFEM-based inverse analysis needs careful consideration, with DiffFEM offering practical advantages when an efficient and robust differentiable forward solver is available.

2606.03183 2026-06-03 cs.MM cs.CV cs.SD eess.AS

Inference-Time Scaling for Joint Audio-Video Generation

联合音视频生成的推理时缩放

Jaemin Jung, Kyeongha Rho, Inkyu Shin, Joon Son Chung

发表机构 * Korea Advanced Institute of Science and Technology(韩国科学技术院) Luma AI

AI总结 针对联合音视频生成中多目标优化的挑战,提出多验证器框架与自适应奖励加权算法,在无需额外训练的情况下显著提升语义对齐、感知质量和音视频同步。

Comments Accepted by Transactions on Machine Learning Research (TMLR). Project page: https://jung-jaemin.github.io/ITS-AVGen-Proj/

详情
AI中文摘要

联合音视频生成旨在合成与文本提示语义对齐且精确同步的逼真音视频对。现有联合音视频生成模型通常需要大量训练资源来提高保真度,而推理时缩放(ITS)最近在单模态领域成为一种有前景的无训练替代方案。然而,将ITS从单模态扩展到多模态领域并非易事,因为它需要平衡多个异构目标。在本文中,我们首次对联合音视频生成的ITS进行了全面研究。我们首先证明多验证器框架对于解决单目标指导的局限性(包括非对称性能权衡和验证器欺骗)至关重要。通过系统分析,我们随后确定了一个最优的多验证器组合,该组合在所有质量维度上产生均衡的改进。最后,为了有效聚合多样化的奖励信号,我们提出了自适应奖励加权(ARW),一种新颖的测试时优化算法。ARW将奖励聚合视为在线优化问题,利用可学习参数校准奖励方差,无需奖励分布的先验知识,从而确保鲁棒的多目标选择。在VGGSound和JavisBench-mini基准上的实验结果表明,我们的框架显著增强了生成输出的语义对齐、感知质量和音视频同步。合成样本和代码可在项目页面获取:this https URL。

英文摘要

Joint audio-video generation aims to synthesize realistic audio-video pairs that are both semantically aligned with text prompts and precisely synchronized. While existing joint audio-video generation models often require substantial training resources to improve fidelity, Inference-Time Scaling (ITS) has recently emerged as a promising training-free alternative in single-modality domains. However, extending ITS from a single modality to multimodal domains is non-trivial, as it requires balancing multiple heterogeneous objectives. In this paper, we present the first comprehensive study of ITS for joint audio-video generation. We first demonstrate that a multi-verifier framework is essential to address the limitations of single-objective guidance, including asymmetric performance trade-offs and verifier hacking. Through systematic analysis, we then identify an optimal multi-verifier combination that yields balanced improvements across all quality dimensions. Finally, to effectively aggregate diverse reward signals, we propose Adaptive Reward Weighting (ARW), a novel test-time optimization algorithm. ARW treats reward aggregation as an online optimization problem, utilizing learnable parameters to calibrate reward variances without requiring prior knowledge of reward distributions, thereby ensuring robust multi-objective selection. Experimental results on VGGSound and JavisBench-mini benchmarks demonstrate that our framework significantly enhances semantic alignment, perceptual quality, and audio-visual synchronization of generated outputs. Synthesized samples and code are available on the project page: https://jung-jaemin.github.io/ITS-AVGen-Proj.

2606.03173 2026-06-03 cs.CY cs.LG cs.SI

Auditing Engagement Incentives in the Kidfluencer Ecosystem: A Multimodal Weak Supervision Approach

审计儿童网红生态系统中的参与激励:一种多模态弱监督方法

Zijing Wei, Chao Peter Yang, Xuanjie Chen

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 本研究采用多模态弱监督方法审计YouTube儿童网红频道,发现剥削信号与观看量显著正相关,且表演性劳动、情感诱饵和隐私侵犯能带来参与度溢价。

详情
AI中文摘要

YouTube上“儿童网红”的兴起引发了对儿童数字劳动和剥削的伦理担忧。尽管新兴立法试图规范这一生态系统,但由于大规模操作化剥削的困难,将剥削与参与度联系起来的实证证据仍然稀缺。本研究对79个儿童网红频道的5,051个视频进行了多模态AI审计,使用弱监督方法检测剥削信号,无需大规模人工标注。我们聚合了噪声标注函数——包括基于LLM的标题分类和基于GPT-4 Vision的缩略图与描述分析,涵盖六个基于文献的维度——为每个视频分配一个概率剥削分数。一项多标注者验证研究(N=107)显示与人类判断高度一致(宏平均F1=0.911),并对整体剥削风险具有高敏感性(召回率=0.960,F1=0.793)。我们的发现揭示了表演性劳动、情感诱饵和隐私侵犯的显著参与度溢价。剥削分数与观看次数相关(Spearman ρ=0.229,p<10^{-50}),控制频道层面变化的混合效应回归显示,剥削分数每增加一个单位,观看次数增加4.4倍(p<0.001)。频道内分析表明,情感诱饵的中位观看次数提升+65.6%,表演性内容提升+56.0%(FDR校正p<0.001),且在同年稳健性检验中效果持续(p=0.030)。相比之下,明确的商业内容(产品植入)没有溢价(-3.8%,不显著),表明平台奖励的是儿童身份和劳动的商品化,而非传统广告。这些发现挑战了仅关注财务信托的政策框架,表明参与度与儿童的密集表演性劳动系统性地相关。

英文摘要

The rise of `kidfluencers' on YouTube has raised ethical concerns about child digital labor and exploitation. While emerging legislation attempts to regulate this ecosystem, empirical evidence linking exploitation to engagement remains scarce, given the difficulty of operationalizing exploitation at scale. This study presents a multimodal AI audit of 5,051 videos across 79 kidfluencer channels, using weak supervision to detect exploitation signals without large-scale manual labels. We aggregate noisy labeling functions -- including LLM-based classification of titles and GPT-4 Vision analysis of thumbnails and descriptions across six literature-grounded dimensions -- to assign a probabilistic exploitation score to each video. A multi-annotator validation study (N=107) shows strong agreement with human judgment (macro-average F1 $= 0.911$) and high sensitivity for overall exploitation risk (recall $= 0.960$, F1 $= 0.793$). Our findings reveal a significant engagement premium for performative labor, emotional bait, and privacy violations. Exploitation scores correlate with view counts (Spearman $ρ= 0.229$, $p < 10^{-50}$), and mixed-effects regression controlling for channel-level variation shows that a one-unit increase in exploitation score yields a $4.4\times$ increase in views ($p < 0.001$). Within-channel analyses indicate median view boosts of $+65.6\%$ for emotional bait and $+56.0\%$ for performative content (FDR-corrected $p<0.001$), with effects holding in same-year robustness checks ($p=0.030$). Explicit commercial content (product placement), by contrast, shows no premium ($-3.8\%$, n.s.), suggesting the platform rewards commodification of the child's identity and labor over traditional advertising. These findings challenge policy frameworks focused solely on financial trusts, showing that engagement is systematically tied to the intensive, performative labor of children.

2606.03136 2026-06-03 cs.CR cs.CL

PsychoPass: Geometric Profiling of Multi-Turn Adversarial LLM Conversations

PsychoPass: 多轮对抗性LLM对话的几何轮廓分析

Muberra Ozmen, Subhabrata Majumdar

发表机构 * Coveo Montreal, QC, Canada(加拿大蒙特利尔 Coveo) Indian Institute of Management Bangalore(班加罗尔印度管理学院)

AI总结 提出PsychoPass框架,通过提取对话轨迹在嵌入空间中的几何特征,在有害内容生成前预测多轮越狱攻击,并发现早期几何信号具有鲁棒性。

详情
AI中文摘要

对大型语言模型(LLM)的多轮越狱攻击揭示了当前防护措施的不匹配:它们作用于单个轮次,而攻击则作为跨对话的轨迹展开。我们提出从内容转向动态,将对话建模为表示空间中的路径,并询问对抗意图是否在其几何形状中早期编码。我们引入了PsychoPass,一个从嵌入空间中的对话轨迹提取几何特征以在有害内容生成前预测潜在攻击的框架。这些特征在朴素分类器中实现了近乎完美的性能,这很大程度上可以通过包含轮次数作为特征来解释。去除这一混淆因素后,仍保留了一个较小但一致的几何信号,其分类性能不显著依赖于编码器选择。关键的是,该信号在对话早期出现:仅从短前缀开始,攻击结果就高于随机水平,比基线防护更可靠。一项支持性理论分析通过长度与形状的分解、基于前缀长度的检测界限以及编码器不变性解释了这些发现。综合来看,这些结果表明对抗性对话留下了早期、表示鲁棒的几何指纹,适用于在线监控。

英文摘要

Multi-turn jailbreak attacks on large language models (LLMs) reveal a mismatch in current guardrails: they operate on individual turns, while attacks unfold as trajectories across conversations. We propose a shift from content to dynamics, modeling conversations as paths in representation space and asking whether adversarial intent is encoded early in their geometry. We introduce PsychoPass, a framework that extracts geometric features from conversation trajectories in embedding space to predict a potential attack before harmful content is produced. These features achieve near-perfect performance in naïve classifiers, which is largely explained by the inclusion of number of turns as a feature. After removing this confound, a smaller but consistent geometric signal remains, with classification performance that does not depend meaningfully on encoder choice. Crucially, this signal appears early in the conversation: attack outcomes remain above chance from short prefixes alone, more reliably than baseline guardrails. A supporting theoretical analysis explains these findings via a decomposition of length and shape, a detection bound based on prefix length, and encoder invariance. Together, these results show that adversarial conversations leave an early, representation-robust geometric fingerprint suitable for online monitoring.

2606.03128 2026-06-03 cs.CR cs.AI cs.CL cs.LG

Decoupled Smart Contract Audits: Lightweight LLM Framework via Distillation and Aggregation

解耦式智能合约审计:通过蒸馏与聚合的轻量级LLM框架

Bagus Rakadyanto Oktavianto Putra, Muhamad Risqi Utama Saputra, Widyawan, Guntur Dharma Putra

发表机构 * University of Indonesia(印度尼西亚大学)

AI总结 提出一种基于轻量级开源LLM(0.6B-4B参数)的解耦式智能合约审计框架,通过rsLoRA、知识蒸馏和链式验证聚合策略,在漏洞检测中达到98.25%准确率,优于7B-34B参数模型。

Comments 12 pages, 4 figures, 5 tables. Accepted to IEEE ICWS 2026

详情
AI中文摘要

智能合约面临关键安全挑战,需要在去中心化网络服务中进行彻底审计。虽然大型语言模型(LLMs)在自动漏洞检测中展现出潜力,但现有方法缺乏严重性评估和可操作的修复建议,且计算开销过大。在本研究中,我们引入了一个高效的端到端智能合约安全审计框架,利用轻量级、高度优化的开源LLMs(0.6B-4B参数)。我们的框架将综合审计任务解耦为四个相互关联的组件:漏洞检测、解释、严重性分类和修复建议。为了在无需庞大参数量的情况下保持高准确性,我们实现了秩稳定低秩适配器(rsLoRA)、知识蒸馏以及自定义链式验证(CoVe)聚合策略,系统性地筛选并整合模型生成的多个草稿响应,形成高准确度的审计报告。实验结果表明,我们的轻量级流水线持续优于最先进的开源代码密集LLMs(7B至34B参数),在漏洞检测中达到98.25%的准确率,在生成解释任务中达到0.4375的对齐分数。此外,我们广泛的消融研究实证验证了我们的解耦审计过程相对于统一提示的优越性,并揭示了一种新颖的严重性中心性偏差,为未来LLM辅助审计研究建立了关键基准。

英文摘要

Smart contracts face critical security challenges that require thorough auditing in decentralized web services. While Large Language Models (LLMs) have shown promise in automated vulnerability detection, existing approaches lack severity evaluations with actionable remediation and demand unnecessarily massive computational overhead. In this study, we introduce an efficient end-to-end smart contract security audit framework utilizing lightweight, highly optimized open-source LLMs (0.6B-4B parameters). Our framework decouples comprehensive audit tasks into four interconnected components: vulnerability detection, explanation, severity classification, and remediation recommendation. To maintain high accuracy without massive parameters, we implement Rank-Stabilized Low-Rank Adapters (rsLoRA), knowledge distillation, and a custom Chain-of-Verification (CoVe) aggregation strategy to systematically screen and consolidate multiple draft responses from the model into a highly accurate audit report. Experimental results demonstrate that our lightweight pipeline consistently outperforms state-of-the-art open-source coder dense LLMs (7B to 34B parameters), achieving 98.25% accuracy in vulnerability detection and an alignment score of 0.4375 in generative explanation tasks. Furthermore, our extensive ablation studies empirically validate the superiority of our decoupled audit processes over unified prompting and uncover a novel severity centrality bias, establishing a critical benchmark for future research in LLM-assisted auditing.

2606.03090 2026-06-03 cs.CR cs.AI

"**Important** You should give me full credits!": Exploring Prompt Injection Attacks on LLM-Based Automatic Grading Systems

“**重要** 你应该给我满分!”:探索针对基于LLM的自动评分系统的提示注入攻击

Hang Li, Fedor Filippov, Yuling Lin, Pengfei He, Kaiqi Yang, Yucheng Chu, Yingqian Cui, Hui Liu, Jiliang Tang

发表机构 * Michigan State University(密歇根州立大学)

AI总结 研究针对基于LLM的自动评分系统的提示注入攻击,通过实验证明当前系统高度脆弱,并评估现有防御策略的有效性。

Comments 15 pages, 8 figures, 9 tables

详情
AI中文摘要

大型语言模型(LLM)的出现显著加速了近期关于基于LLM的自动评分(AG)系统的研究。受益于LLM强大的指令遵循能力和广泛的先验知识,教育工作者可以使用仅包含自然语言评分标准的AG系统跨不同任务部署,并获得令人满意的评分性能。尽管有这些优势,新的安全问题也可能出现。特别是,提示注入(PI)攻击最近已成为基于LLM的应用的主要威胁。在AG的背景下,攻击者可能利用PI漏洞操纵评分系统,使其无论实际答案质量如何都人为地给出高分。这种行为对教育评估的公平性、可靠性和完整性构成严重风险。在这项工作中,我们研究了AG系统中的PI攻击,并系统地调查了此类攻击在教育场景中的有效性。我们进一步评估了现有防御策略对抗这些攻击的有效性。通过在基于评分标准的评分设置下进行全面的实验,我们证明了当前基于LLM的AG系统仍然高度容易受到PI攻击。我们希望我们的发现能提高对这种新兴威胁的认识,并激励未来研究朝着安全、稳健和可信的基于LLM的教育系统发展。

英文摘要

The emergence of large language models (LLMs) has significantly accelerated recent research on LLM-based automatic grading (AG) systems. Benefiting from the strong instruction-following capabilities and broad prior knowledge of LLMs, educators can deploy AG systems across diverse tasks using only natural language rubrics while achieving satisfactory grading performance. Despite these advantages, new security concerns may also arise. In particular, prompt injection (PI) attacks have recently become a major threat to LLM-based applications. In the context of AG, attackers can potentially exploit PI vulnerabilities to manipulate grading systems into assigning artificially high scores regardless of the actual answer quality. Such behavior poses serious risks to the fairness, reliability, and integrity of educational assessment. In this work, we study PI attacks in AG systems, and systematically investigate the effectiveness of such attacks in educational scenarios. We further evaluate the effectiveness of existing defensive strategies against these attacks. Through comprehensive experiments under rubric-based grading settings, we demonstrate that current LLM-based AG systems remain highly vulnerable to PI attacks. We hope that our findings raise awareness of this emerging threat and motivate future research toward secure, robust, and trustworthy LLM-based educational systems.

2606.03063 2026-06-03 cs.LO cs.CL

ZX-Calculus:Trace-Indexed Dependent Types and Epistemic Semantics

ZX-演算:迹索引依赖类型与认知语义

Peng Chen

发表机构 * School of Information Science, Beijing Language and Culture University(北京语言文化大学信息科学学院)

AI总结 提出ZX-演算,通过迹索引类型、预层非单调语义和构造性AGM信念修正,保守扩展Martin-Löf依赖类型论,并给出Coq机械化证明。

详情
AI中文摘要

我们提出ZX-演算(知识演化演算),它是Martin-Löf依赖类型论(MLTT)的保守扩展,集成了迹索引类型、预层非单调语义和构造性AGM信念修正。本文附带Coq机械化证明(34个完整证明;两个核心结果零未完成)。(I)迹类型。FinTrace(s0,sn)是一个带类型的执行迹的归纳族。FinTrace和Star(Step)作为路径类型同构,但判断上不相等;TraceElim显式暴露事件标签e:Event,为事件驱动归纳提供了更符合人体工程学的接口。我们证明了迹可达性对应、确定性重放以及通过可归约候选(带传输引理,RC-elim推迟;所有其他核心结果经Coq验证)的规范性框架。(II)层语义。迹索引命题是自由迹偏序范畴Tf上的逆变层。分离定理(显式反模型)区分了证明论单调性和语义非单调性。项模型是初始CwF(句法泛性质,非经典完备性)。(III)AGM信念修正。我们给出了一个显式的构造性部分交收缩算法,经(C1)-(C4)验证。所有八条AGM公设(R1)-(R8)都是定理。R7和R8的证明使用了析取加固引理,并给出了自包含的构造性推导。(IV)集成。B^AGM在顺序修正中不满足层复合律BP-comp(显式反模型,Coq验证)。我们引入单步修正系统(SSRS),证明B^AGM是有效的SSRS(Coq验证),并表明这足以处理迹态射、收缩刻画和修正见证。BP-comp失败揭示了路径依赖信念修正与函子一致性之间的基本张力,此前未被识别。

英文摘要

We propose ZX-Calculus (Knowledge Evolution Calculus), a conservative extension of Martin-Lof Dependent Type Theory (MLTT) integrating trace-indexed types, presheaf non-monotone semantics, and constructive AGM belief revision. A Coq mechanisation accompanies the paper (34 complete proofs; zero admits for the two central results). (I) Trace types. FinTrace(s0,sn) is an inductive family of typed execution traces. FinTrace and Star(Step) are isomorphic as path types but not judgementally equal; TraceElim exposes the event label e:Event explicitly, giving a more ergonomic interface for event-driven induction. We prove the Trace-Reachability Correspondence, Deterministic Replay, and a canonicity framework via reducibility candidates with a Transport Lemma (RC-elim deferred; all other Core results are Coq-verified). (II) Sheaf semantics. Trace-indexed propositions are contravariant sheaves over the free trace partial-order category Tf. A Separation Theorem (explicit countermodel) distinguishes proof-theoretic monotonicity from semantic non-monotonicity. The term model is an initial CwF (syntactic universal property, not classical completeness). (III) AGM belief revision. We give an explicit constructive partial meet contraction algorithm verified against (C1)-(C4). All eight AGM postulates (R1)-(R8) are theorems. Proofs of R7 and R8 use the Disjunctive Entrenchment Lemma, given a self-contained constructive derivation. (IV) Integration. B^AGM fails the sheaf composition law BP-comp for sequential revision (explicit countermodel, Coq-verified). We introduce Single-Step Revision Systems (SSRS), prove B^AGM is a valid SSRS (Coq-verified), and show this suffices for trace morphisms, retraction characterisation, and revision witnesses. The BP-comp failure reveals a fundamental tension between path-dependent belief revision and functor consistency, not previously identified.

2606.03061 2026-06-03 cs.DC cs.AI cs.LG cs.NI cs.SY eess.SY

Brief Announcement: Generative Markov Model for Distributed Computing Systems

简要公告:分布式计算系统的生成马尔可夫模型

Alfreds Lapkovskis, Ali Beikmohammadi, Sindri Magnússon, Praveen Kumar Donta

发表机构 * Department of Computer and Systems Sciences, Stockholm University, Sweden(斯德哥尔摩大学计算机与系统科学系)

AI总结 针对分布式计算系统的异构性和复杂性,提出一种基于结构化状态分解的生成马尔可夫模型,实现可处理的模拟、推理和策略学习,并通过协作AI推理案例验证其有效性。

Comments Submitted to 40th International Symposium on Distributed Computing (DISC 2026)

详情
AI中文摘要

新兴的分布式计算范式,如计算连续体,本质上是异构、随机和复杂的。高效且有效地利用连续体中所有可用资源需要一个统一的系统形式化模型。为了解决这一差距,我们提出了一个通用框架,将分布式计算系统建模为生成马尔可夫模型,该模型在结构化系统状态上进行分解。在我们的模型中,状态分解为高维变量,每个变量进一步在其元素上分解,反映了分布式系统固有的稀疏依赖结构。这产生了一个可处理的模型,能够对原本难以处理的系统状态进行模拟、推理和策略学习,从而将分布式计算与马尔可夫链理论和强化学习(RL)联系起来。我们通过一个协作AI推理的案例研究来展示我们的框架,其中专用服务器将资源与服务用户自愿提供的资源相结合。我们的结果表明,集中式调度在规模上成为瓶颈,而将计算分布到用户设备上可减少延迟和服务器资源消耗。这些发现突显了自适应决策在分布式计算系统中的价值,并展示了该框架在建模、模拟和优化方面的实用性。

英文摘要

Emerging distributed computing paradigms, such as the computing continuum, are inherently heterogeneous, stochastic, and complex. Efficiently and effectively utilizing all available resources across the continuum demands a unified formal model of the system. To address this gap, we propose a general framework for modeling distributed computing systems as a generative Markov model, factorized over a structured system state. In our model, the state decomposes into high-dimensional variables, each further factorized over its elements, reflecting the sparse dependency structure inherent to distributed systems. This yields a tractable model enabling simulation, inference, and policy learning over otherwise intractable system states, bridging distributed computing with Markov chain theory and reinforcement learning (RL). We demonstrate our framework through a case study of collaborative AI inference, in which a dedicated server combines resources with those volunteered by service users. Our results show that centralized scheduling becomes a bottleneck at scale, while distributing computation across user devices reduces both latency and server resource consumption. These findings highlight the value of adaptive decision-making in distributed computing systems and demonstrate the framework's utility for modeling, simulation, and optimization.

2606.03034 2026-06-03 cs.MA cs.AI

Capability Advertisement as a Market for Lemons: A Trust Layer for Heterogeneous Agent Networks

能力广告作为柠檬市场:异构智能体网络的信任层

Gaurav Naresh Mittal

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 针对LLM智能体网络中的能力虚假声称问题,提出基于柠檬市场理论的信任层,通过概率描述、筛选和声誉机制实现可信委托。

详情
AI中文摘要

大型语言模型(LLM)智能体已开始相互委托工作。诸如模型上下文协议(MCP)和智能体间协议(A2A)等协议允许智能体发布其能力并允许其他智能体调用,且此类智能体的公共注册表已经出现。这些协议假设所广告的能力是静态的、真实的事实。然而,真实的智能体并非如此:其能力是概率性的,随输入变化,在底层模型更新时漂移,并且由于智能体本身是语言模型,它可以完全自信地描述自己却可能是错误的。因此,调用者看到的是智能体声称能做什么,而非实际能做什么,且没有原则性的方法区分可靠提供者和流利的冒名顶替者。我们认为这些困难有一个共同原因:柠檬市场。当质量隐藏且声称成本低廉时,好与坏的提供者变得难以区分,诚实的可靠性得不到回报,市场向最差参与者退化。经济学提供了三种补救措施:信号传递、筛选和声誉,而这些在当今的智能体协议中均不存在。我们做出四项贡献:(1)一个故障分类,将自信-错误命名为非对抗性的、相关的拜占庭故障子类,而经典容错模型对此建模不当;(2)一个柠檬市场模型,表明基于信仰的协议仅允许低信任均衡;(3)信任层,一个轻量级、协议无关的窄腰,位于MCP和A2A之上,添加概率能力描述、筛选和声誉,并在维持过度声称的成本超过其收益时允许分离均衡;(4)一个针对委托链的可靠性组合界限,具有端到端放置论证。该设计无需模型重新训练,并在其信任锚缺失或损坏时优雅降级。

英文摘要

Large language model (LLM) agents have begun to delegate work to one another. Protocols such as the Model Context Protocol (MCP) and the Agent2Agent protocol (A2A) let an agent publish what it can do and let others call it, and public registries of such agents are already appearing. These protocols assume an advertised capability is a static, truthful fact. A real agent is none of these things: its competence is probabilistic, varies with input, drifts when the underlying model is updated, and, because the agent is itself a language model, it can describe itself with complete confidence and be wrong. A caller therefore sees what an agent claims to do, not what it can do, with no principled way to tell a reliable provider from a fluent impostor. We argue these difficulties share one cause: the market for lemons. When quality is hidden and claims are cheap, good and bad providers become indistinguishable, honest reliability goes unrewarded, and the market decays toward its worst participants. Economics offers three remedies, signaling, screening, and reputation, and none are present in today's agent protocols. We make four contributions: (1) a failure taxonomy that names confident-wrong as a non-adversarial, correlated subclass of Byzantine faults that classical fault-tolerance mismodels; (2) a market-for-lemons model showing that faith-based protocols admit only a low-trust equilibrium; (3) the Trust Layer, a thin, protocol-agnostic narrow waist above MCP and A2A that adds probabilistic capability descriptors, screening, and reputation, and admits a separating equilibrium when the cost of sustaining an overclaim exceeds the gain from it; and (4) a reliability-composition bound for delegation chains with an end-to-end placement argument. The design needs no model retraining and degrades gracefully when its trust anchors are absent or corrupt.

2606.03026 2026-06-03 cs.NE cs.AI cs.LG

Spike-Aware C++ INT8 Inference for Sparse Spiking Language Models on Commodity CPUs

面向稀疏脉冲语言模型在商用CPU上的脉冲感知C++ INT8推理

Ting Liu

发表机构 * SymbolicLight Research(SymbolicLight研究院)

AI总结 本文提出一种脉冲感知的C++推理运行时,利用稀疏二进制脉冲状态作为执行原语,结合混合布局、AVX2/FMA内核和INT8量化,在商用CPU上实现脉冲语言模型的高效解码,吞吐量优于同等规模稠密模型但质量略逊。

Comments 11 pages, 7 tables

详情
AI中文摘要

脉冲语言模型展现出激活稀疏性,而稠密Transformer运行时无法直接利用。本文从系统角度研究这一特性。基于SymbolicLight V1脉冲门控语言模型家族,我们实现了一个C++ CPU推理运行时,将稀疏二进制脉冲状态视为执行原语,而非仅应用事后权重压缩。该运行时结合了清单驱动的权重加载器、混合行/列内存布局、AVX2/FMA内核、每通道对称INT8量化以及脉冲条件稀疏路径的整数域累加。在AMD Ryzen 7 5800X上,早期标量FP32基线解码速度为9.5 tokens/s。混合布局AVX2 FP32将其提升至14.7 tokens/s,而AVX2 INT8在相同step-30k导出模型上达到19.9 tokens/s,同时将权重占用从3.49 GB降至1.06 GB。对于可用的186k步874M参数INT8导出模型,C++运行时在单线程CPU基准测试中解码速度为22.63 tokens/s,相比之下,TinyLlama-1.1B Q8_0为16.31 tokens/s,Falcon3-1B Q8_0为11.26 tokens/s,Qwen2.5-1.5B Q8_0为9.70 tokens/s。线程扩展在四个CPU线程时达到47.90 tokens/s,512 token预填充从单线程的29.86 tokens/s提升至八线程的94.68 tokens/s。吞吐量提升伴随着质量代价:SNN报告WikiText-2困惑度为24.80,差于同一基准中的稠密基线。我们将结果定位为稀疏语言运行时的推理系统研究,长期动机在于可能受益于传感器和执行器附近本地低核推理的具身和边缘智能体。脉冲感知执行可以改善稀疏脉冲语言模型的CPU吞吐量和内存行为,而模型质量、受控稠密训练基线、具身任务评估和测量CPU能耗仍是开放问题。

英文摘要

Spiking language models expose activation sparsity that dense Transformer runtimes do not directly exploit. This paper studies that property from a systems perspective. Building on the SymbolicLight V1 spike-gated language model family, we implement a C++ CPU inference runtime that treats sparse binary spike states as an execution primitive rather than only applying post-hoc weight compression. The runtime combines a manifest-driven weight loader, mixed row/column memory layout, AVX2/FMA kernels, per-channel symmetric INT8 quantization, and integer-domain accumulation for spike-conditioned sparse paths. On an AMD Ryzen 7 5800X, an early scalar FP32 baseline decodes at 9.5 tokens/s. Mixed-layout AVX2 FP32 raises this to 14.7 tokens/s, and AVX2 INT8 reaches 19.9 tokens/s on the same step-30k export while reducing the weight footprint from 3.49 GB to 1.06 GB. For the available 186k-step 874M-parameter INT8 export, the C++ runtime decodes at 22.63 tokens/s in a single-thread CPU benchmark, compared with 16.31 tokens/s for TinyLlama-1.1B Q8_0, 11.26 tokens/s for Falcon3-1B Q8_0, and 9.70 tokens/s for Qwen2.5-1.5B Q8_0 under llama.cpp. Thread scaling reaches 47.90 tokens/s at four CPU threads, and 512-token prefill improves from 29.86 to 94.68 tokens/s from one to eight threads. The throughput result comes with a quality cost: the SNN reports WikiText-2 perplexity 24.80, worse than the dense baselines in the same benchmark. We frame the result as an inference-systems study for sparse language runtimes, with longer-term motivation in embodied and edge agents that may benefit from local, low-core inference near sensors and actuators. Spike-aware execution can improve CPU throughput and memory behavior for sparse spiking language models, while model quality, controlled dense training baselines, embodied-task evaluation, and measured CPU energy remain open problems.

2606.03019 2026-06-03 cs.CY cs.AI

Reproducibility is the New Copyleft: Defining AGI-oriented Reproducible Builds

可重现性是新的Copyleft:定义面向AGI的可重现构建

Masayuki Hatta

发表机构 * Surugadai University(上贺茂大学)

AI总结 本文提出面向通用人工智能(AGI)的可重现构建作为Copyleft的功能等价物,通过定义七项要求来确保模型从声明输入到输出的比特精确可重现性,并论证协议而非平台是更优的治理框架。

Comments Accepted at AGI-26. To appear in the proceedings (Springer LNCS)

详情
AI中文摘要

Copyleft,如GNU通用公共许可证中所实施的,是一种利用版权保证用户自由的法律技巧,通过将源代码的可用性与每次分发行为绑定。其规范力量依赖于一个隐含的技术前提:源代码和目标代码之间存在定义明确、可人工审计且可重现的关系。大型语言模型以及未来的通用人工智能(AGI)系统系统地违反了这一前提。重建模型所需的工件——代码、数据、权重、超参数、工具链和硬件配置——各自受到独立的法律、技术和经济约束,当前没有任何开源框架能完全解决这些问题。足够强大的AI系统还可以将许可下的源代码重写为功能等效的衍生作品,从而剥离原始义务,这是一种Copyleft无法有效防御的洗白形式。本文认为,对于AGI,Copyleft的功能等价物必须基于可重现构建,而非代码的共享相同条款:可重现构建是一种保证从声明输入到输出比特精确可重构性的实践。我们回顾了Copyleft的逻辑,批判性地审视了Maffulli的“第二次解放”论点(即AI实现了Stallman的梦想),并表明除非AGI系统本身是可重现的,否则该论点不成立。借鉴开源AI定义(OSAID)、模型开放框架(MOF)、OpenMDW和确定性推理研究,我们定义了面向AGI的可重现构建的七项要求。我们进一步论证,模型上下文协议(MCP)和类似的AI到AI耦合机制构成了一个新的动态链接层,Copyleft式许可对此并不适用,而Masnick的“协议而非平台”框架提供了更有前景的治理模板。

英文摘要

Copyleft, as implemented in licenses such as the GNU General Public License, was a legal hack that used copyright to guarantee user freedom by tying the availability of source code to every act of distribution. Its normative force rested on an implicit technical premise: that source code and object code stand in a well-defined, humanly auditable, and reproducible relationship. Large language models and, prospectively, Artificial General Intelligence (AGI) systems systematically violate this premise. The artifacts jointly required to reconstruct a model -- code, data, weights, hyperparameters, toolchain, and hardware configuration -- are each subject to independent legal, technical, and economic constraints that no current open-source framework fully resolves. Sufficiently capable AI systems can also rewrite licensed source into functionally equivalent derivatives stripped of their original obligations, a form of laundering against which copyleft has no effective defense. This paper argues that a functional analogue of copyleft for AGI must be grounded not in share-alike clauses over code, but in reproducible builds: a practice guaranteeing bit-exact reconstructability from declared inputs. We review the logic of copyleft, critically examine Maffulli's Second Liberation thesis according to which AI fulfills Stallman's dream, and show that the argument collapses unless AGI systems are themselves reproducible. Drawing on the Open Source AI Definition (OSAID), the Model Openness Framework (MOF), OpenMDW, and deterministic-inference research, we define seven requirements for AGI-oriented reproducible builds. We further argue that the Model Context Protocol (MCP) and analogous AI-to-AI coupling mechanisms constitute a new dynamic linking layer for which copyleft-style licensing is ill-suited, and that Masnick's "protocols, not platforms" framework offers a more promising governance template.

2606.02982 2026-06-03 cs.PF cs.DC cs.LG

DriftSched: Adaptive QoS-Aware Scheduling under Runtime Token Drift for Multi-Tenant GPU Inference

DriftSched: 多租户GPU推理中运行时令牌漂移下的自适应QoS感知调度

Kathiravan Palaniappan

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 提出DriftSched框架,通过运行时反馈驱动的漂移补偿和自适应偏差校正,解决多租户LLM推理中令牌漂移导致的调度问题,在NVIDIA L4 GPU上实现平均38.8%的估计误差降低和42%的中位延迟改善。

Comments 17 pages, 22 figures, 7 tables

详情
AI中文摘要

大型语言模型(LLM)推理服务的快速增长增加了对高效多租户GPU调度的需求。尽管现代推理运行时(如vLLM)通过连续批处理和优化内存管理提高了吞吐量,但准确估计异构推理请求的运行时成本仍然是一个重大挑战。在实践中,观察到的输出长度通常偏离准入时的估计值,产生运行时令牌漂移,可能导致工作负载错误分类、队列不平衡、尾延迟增加和服务质量(QoS)下降。本文提出了DriftSched,一个用于NVIDIA L4 GPU上多租户LLM推理服务的自适应QoS感知调度框架。DriftSched结合了工作负载分类、令牌预算估计、租户感知队列管理和运行时反馈驱动的漂移补偿,以改进准入时的调度决策。该框架在异构多租户工作负载下评估了FIFO、优先级、加权、最短作业优先(SJF)和老化优先级调度策略。实验结果表明,各工作负载类别存在可测量的运行时令牌漂移。自适应偏差校正将工作负载估计误差平均降低38.8%(MAE)和40.5%(RMSE),提高了工作负载分类稳定性和调度准确性。在所有评估的调度器中,SJF实现了最佳整体性能,在持续GPU争用下,相对于FIFO,中位端到端延迟降低了约42%,P99延迟降低了约16%。该工作贡献了一个自适应漂移感知调度架构、一个运行时令牌漂移补偿机制,以及一个用于评估共享GPU基础设施上QoS感知LLM推理调度的可重复基准测试框架。

英文摘要

The rapid growth of large language model (LLM) inference services has increased the demand for efficient multi-tenant GPU scheduling. While modern inference runtimes such as vLLM improve throughput through continuous batching and optimized memory management, accurately estimating the runtime cost of heterogeneous inference requests remains a significant challenge. In practice, observed output lengths often deviate from admission-time estimates, creating runtime token drift that can lead to workload misclassification, queue imbalance, increased tail latency, and degraded Quality-of-Service (QoS). This paper presents DriftSched, an adaptive QoS-aware scheduling framework for multi-tenant LLM inference serving on NVIDIA L4 GPUs. DriftSched combines workload classification, token-budget estimation, tenant-aware queue management, and runtime feedback-driven drift compensation to improve admission-time scheduling decisions. The framework evaluates FIFO, Priority, Weighted, Shortest-Job-First (SJF), and Aging Priority scheduling policies under heterogeneous multi-tenant workloads. Experimental results demonstrate measurable runtime token drift across workload categories. Adaptive bias correction reduces workload estimation error by an average of 38.8% (MAE) and 40.5% (RMSE), improving workload classification stability and scheduling accuracy. Among all evaluated schedulers, SJF achieves the best overall performance, reducing median end-to-end latency by approximately 42% and P99 latency by approximately 16% relative to FIFO under sustained GPU contention. The work contributes an adaptive drift-aware scheduling architecture, a runtime token-drift compensation mechanism, and a reproducible benchmarking framework for evaluating QoS-aware LLM inference scheduling on shared GPU infrastructure.

2606.02967 2026-06-03 cs.ET cs.AI cs.AR cs.SY eess.SY

Glass Box at Orbit: A Constitutional AI Verification Framework for Trustworthy Autonomous CubeSat Intelligence

轨道上的玻璃盒:面向可信自主立方星智能的宪法AI验证框架

Karthik Barma, Anil Sanneboyina, V C Premchand Yadav

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出玻璃盒框架,通过运行时宪法AI验证层拦截自主航天器决策,利用六项物理约束和七项线性时序逻辑安全不变式确保安全,并证明其验证开销与模型规模无关。

Comments 12 pages, 2 figures, 2 tables, 32 references. Paper 1 of the Project October series on autonomous orbital intelligence

详情
AI中文摘要

航天工业正在悄然构建一个尚未被充分认识的事物:在地球上空550公里处运行数千个自主AI工作负载的轨道数据中心,且无人类参与。微软、AWS以及越来越多的轨道计算企业正在将云规模处理从地面转移到轨道。然而,它们都尚未回答治理问题——当轨道数据中心规模的自主AI系统在太空中做出错误决策时,如何在决策变得不可逆转之前阻止它们?我们引入玻璃盒:一个运行时宪法AI验证层,在单个命令到达任何航天器子系统之前,拦截来自机载AI策略的每个候选动作,并根据六项基于物理的宪法约束和七项线性时序逻辑(LTL)安全不变式对其进行评估。每个批准的动作都附带一个加权可解释性分数E(a_t)(范围[0,1])和完整的宪法审计日志。我们在Project October中演示了玻璃盒:一个针对CubeSat级航天器的完全模拟的五层自主轨道智能架构。我们证明玻璃盒的验证开销为O(N_c),其中N_c是宪法规则的数量,与模型大小或航天器状态维度无关。我们提供了宪法约束语法的完整形式规范、通过Z3和NuSMV模型检查验证的七项LTL安全不变式,以及一个详细的工作示例,展示玻璃盒在电池状态退化的日食入口处拦截不安全推理请求。随着轨道计算向数据中心基础设施规模发展,运行时宪法验证不再是研究上的新奇事物——它是每个自主轨道平台最终将需要的任务关键型安全基础设施。

英文摘要

The space industry is quietly building toward something nobody has fully reckoned with: orbital data centers running thousands of autonomous AI workloads with no human in the loop, 550 km above the Earth. Microsoft, AWS, and a growing list of orbital computing ventures are moving cloud-scale processing off the ground and into orbit. What none of them have answered yet is the governance question -- when autonomous AI systems at orbital data center scale make wrong decisions in space, what stops those decisions before they become irreversible? We introduce Glass Box: a runtime constitutional AI verification layer that intercepts every candidate action from an onboard AI policy and evaluates it against six physics-grounded constitutional constraints and seven Linear Temporal Logic (LTL) safety invariants before a single command reaches any spacecraft subsystem. Every approved action carries a weighted explainability score E(a_t) in [0,1] and a complete constitutional audit log. We demonstrate Glass Box within Project October: a fully simulated five-layer autonomous orbital intelligence architecture for CubeSat-class spacecraft. We prove that Glass Box verification overhead is O(N_c) in the number of constitutional rules, independent of model size or spacecraft state dimension. We present a complete formal specification of the constitutional constraint grammar, seven LTL safety invariants verified by Z3 and NuSMV model checking, and a detailed worked example of Glass Box intercepting an unsafe inference request at eclipse-entry under degraded battery state. As orbital computing scales toward data center infrastructure, runtime constitutional verification is no longer a research novelty -- it is mission-critical safety infrastructure that every autonomous orbital platform will eventually require.

2606.02964 2026-06-03 cs.AR cs.CL cs.LG

Multi-Segment Attention: Enabling Efficient KV-Cache Management for Faster Large Language Model Serving

多段注意力:实现高效KV缓存管理以加速大型语言模型服务

Chunan Shi, Yilei Chen, Yilin Chen, Xupeng Miao, Bin Cui

发表机构 * Peking University(北京大学)

AI总结 提出AsymCache,一种计算延迟感知的KV缓存管理系统,通过多段注意力、缓存驱逐策略和自适应分块调度器,在保持无损精度的同时显著降低TTFT和TPOT。

详情
AI中文摘要

大型语言模型(LLM)推理依赖键值(KV)缓存以避免冗余的注意力计算。虽然近似KV缓存保留技术通过牺牲模型精度来减少内存使用,但无损方法则从GPU内存中驱逐KV缓存块并按需重建以保留精确输出。现有的无损KV缓存管理系统主要基于访问频率或位置启发式做出驱逐决策,而不考虑不同KV缓存块如何影响GPU注意力内核的执行效率。在本文中,我们提出了AsymCache,一种用于LLM推理的计算延迟感知KV缓存管理系统,它明确地将缓存驻留决策与GPU注意力内核性能对齐,包括三个关键组件:用于高效非连续KV上下文处理的多段注意力(MSA)、联合优化命中率和位置感知重计算成本的缓存驱逐策略,以及用于高硬件利用率的自适应分块调度器。实验表明,与最新基线相比,AsymCache将TTFT降低了高达1.90-2.03倍,每输出令牌时间(TPOT)降低了1.62-1.71倍,证实了该方法在常见工作负载中的有效性,并验证了其平衡计算效率与缓存命中率的设计目标。此外,AsymCache的低级设计允许无缝集成到诸如Continuum的代理服务系统中,进一步将平均作业延迟降低高达18.1%。

英文摘要

Large Language Model (LLM) inference relies on key-value (KV) caches to avoid redundant attention computation. While approximate KV cache retention techniques reduce memory usage by sacrificing model accuracy, lossless approaches instead evict KV cache blocks from GPU memory and reconstruct them on demand to preserve exact outputs. Existing lossless KV cache management systems primarily base eviction decisions on access frequency or positional heuristics, without considering how different KV cache blocks affect the execution efficiency of GPU attention kernels. In this paper, we propose AsymCache, a computation-latency-aware KV cache management system for LLM inference that explicitly aligns cache residency decisions with GPU attention kernel performance, including three key components: Multi-Segment Attention (MSA) for efficient non-contiguous KV context processing, a cache eviction policy that jointly optimizes hit rate and position-aware recomputation cost, and an adaptive chunking scheduler for high hardware utilization. Experiments show that AsymCache reduces TTFT by up to 1.90-2.03x and time-per-output-token (TPOT) by 1.62-1.71x over latest baselines, confirming the effectiveness of the method in common workloads and validating its design goal of balancing computational efficiency with cache hit rate. Moreover, the low-level design of AsymCache allows seamless integration into agent serving systems such as Continuum, where it further reduces average job latency by up to 18.1%.

2606.02958 2026-06-03 cs.CR cs.AI

Echelon: Auditable Aggregate-Only Language-Model Adaptation Across Privacy Boundaries

Echelon: 跨隐私边界的可审计聚合专用语言模型适配

Hina Dixit, Punit Kumar, Irene Tenison, Nevasini Sasikumar

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出Echelon架构,通过强制设备级模型状态不可导出为系统不变量,仅允许聚合后的跨边界数据传输,并结合缓冲半异步安全聚合、陈旧感知加权等机制,在1B参数LoRA适配中实现低通信开销下的稳定训练。

详情
AI中文摘要

跨组织语言模型适配日益面临严格的治理约束:在许多部署中,设备级模型状态(参数、激活值、优化器状态及每设备更新)无法导出到管理边界之外。现有的分布式和联邦学习栈通常假设跨站点模型交换,然后改造隐私机制,这使合规性复杂化并导致审计脆弱。我们提出Echelon,一种边界优先的训练架构,将设备级模型状态不可导出作为系统不变量强制执行。设备在每个边界内本地训练;唯一的跨边界负载是安全聚合的边界级增量加上O(1)协调元数据,并通过具体的审计接口暴露。将交换限制为聚合改变了优化问题:系统必须在广域网延迟、异构参与、节点波动和非独立同分布数据下保持稳定,尽管全局层面从未看到每设备更新。Echelon结合了缓冲半异步安全聚合、陈旧感知加权、参与窗口、近端局部目标以及漂移感知外同步控制器。在M=2个边界上的1B参数LoRA适配中,预算匹配的竞赛(三个种子,24.88M tokens)达到验证损失3.887 +/-0.010,并在固定token、固定字节、固定挂钟时间和固定同步次数预算下,在调优的低通信基线中表现最佳或并列最佳。在OpenWebText压力测试中,Echelon在评估的广域网和非独立同分布处理下维持2,139-2,176 tokens/s的吞吐量;Echelon-DA在广域网延迟下相对于隐私对等的DiLoCo+SA基线改善了达到目标的时间,并且在200ms模拟延迟或严重非独立同分布分区下质量最多下降2.2%。

英文摘要

Cross-organization language-model adaptation increasingly faces hard governance constraints: in many deployments, device-level model state-parameters, activations, optimizer state, and per-device updates-cannot be exported outside an administrative boundary. Existing distributed and federated stacks typically assume cross-site model exchange and then retrofit privacy mechanisms, which complicates compliance and makes auditing brittle. We present Echelon, a boundary-first training architecture that enforces device-level model-state non-export as a systems invariant. Devices train locally inside each boundary; the only cross-boundary payloads are securely aggregated boundary-level deltas plus O(1) coordination metadata, exposed through a concrete audit surface. Restricting exchange to aggregates changes the optimization problem: the system must remain stable under WAN delay, heterogeneous participation, churn, and non-IID data even though the global plane never sees per-device updates. Echelon combines buffered semi-asynchronous secure aggregation, staleness-aware weighting, participation windows, proximal local objectives, and a drift-aware outer synchronization controller. In 1B-parameter LoRA adaptation across M= 2 boundaries, a budget-matched contest over three seeds (24.88M tokens) reaches validation loss 3.887 +/-0.010 and is best or tied-best among tuned low-communication baselines under fixed-token, fixed-bytes, fixed-wall-clock, and fixed-sync-count budgets. In OpenWebText stress tests, Echelon sustains 2,139-2,176 tokens/s across evaluated WAN and non-IID treatments, Echelon-DA improves time-to-target under WAN latency relative to a privacy-parityDiLoCo+SA baseline, and quality degrades by at most 2.2% under 200ms emulated latency or severe non-IID partitioning.

2606.02902 2026-06-03 cs.CY cs.LG

Fairness Definitions and Metrics in Deep Reinforcement Learning for Drug Discovery in Healthcare: A Rapid Evidence Review

医疗保健中深度强化学习的公平性定义与指标:药物发现的快速证据综述

Esmaeil Shakeri, Ronnie de Souza Santos, Behrouz Far

发表机构 * Department of Electrical and Software Engineering, Schulich School of Engineering, University of Calgary(电气与软件工程系,Schulich工程学院,卡尔加里大学)

AI总结 本文通过快速证据综述,系统总结了深度强化学习在药物分子生成中公平性的定义、测量指标,并分析了数据集组成、奖励设计对公平性的影响。

Comments 10 pages, 6 figures, 3 tables. Accepted as a full paper at a symposium of IEEE COMPSAC 2026

详情
AI中文摘要

深度强化学习(DRL)越来越多地应用于从头分子设计,但数据、奖励和评估的选择可能导致在不同疾病区域和化学类型上的性能不均。尽管如此,目前尚无关于DRL药物发现中公平性如何定义、测量和测试的简明综合。在这篇快速证据综述中,我们综合了医疗保健中DRL驱动分子生成的公平性定义和指标。我们关注三个问题:(i)数据集组成和划分策略(特别是支架划分与随机划分)如何影响评估和分布偏移;(ii)奖励设计(如QED、对接、毒性、合成可及性)如何产生或减轻偏差,重点关注癌症靶点;(iii)哪些可测量指标最能捕捉公平性。这包括癌症与非癌症适应症之间以及癌症亚型之间的均等性。还包括关键物理化学描述符的分布平衡、支架/化学类型多样性、组间有效性、毒性和合成可及性。从2017年起,我们检索了主要的生物医学、计算机科学和工程文献数据库,并使用arXiv进行地平线扫描。记录通过PRISMA式程序筛选,并通过内容编码分析,将报告的均等性结果与数据集和奖励选择联系起来。我们的综述为DRL分子生成提供了一套简洁的公平性定义和指标。它为报告分布均等性和结果均等性提供了实用指南。它还总结了数据集和奖励选择如何与观察到的均等性效应相关,并指出了与可信、癌症相关的DRL生成相关的未解决问题。

英文摘要

Deep reinforcement learning (DRL) is increasingly applied to de novo molecular design, but choices in data, rewards, and evaluation can yield uneven performance across disease areas and chemotypes. Despite this, there is no concise synthesis of how fairness is defined, measured, and tested in DRL-based drug discovery. In this rapid evidence review, we synthesize fairness definitions and metrics for DRL-driven molecule generation in healthcare. We focus on three questions: (i) how dataset composition and split strategies, especially scaffold versus random splits, affect evaluation and distribution shift; (ii) how reward design (e.g., QED, docking, toxicity, synthetic accessibility) can create or mitigate bias, with emphasis on cancer targets; and (iii) which measurable metrics best capture fairness. This includes parity across cancer versus non-cancer indications and across cancer subtypes. It also includes distributional balance in key physicochemical descriptors, scaffold/chemotype diversity, groupwise validity, toxicity, and synthetic accessibility. From 2017 onward, we searched major biomedical, computer science, and engineering literature databases and used arXiv for horizon scanning. Records were screened using PRISMA-style procedures and analyzed via content coding to link reported parity outcomes to dataset and reward choices. Our review provides a concise set of fairness definitions and metrics for DRL molecule generation. It offers practical guidance for reporting distribution parity and outcome parity. It also summarizes how dataset and reward choices relate to observed parity effects and identifies open gaps relevant to trustworthy, cancer-relevant DRL generation.

2606.02883 2026-06-03 cs.HC cs.AI cs.CY cs.IR

LLM-Assisted Reranking to Operationalize Nuanced Objectives in Recommender Systems

LLM辅助重排序以在推荐系统中实现细微目标

Amir Ghasemian, Homa Hosseinmardi, Upasana Dutta, Duncan J. Watts

发表机构 * Department of Communication, University of California, Los Angeles, CA 90095(通信系,加州大学洛杉矶分校,CA 90095) Department of Computer and Information Science, University of Pennsylvania, Philadelphia, PA 19104(计算机与信息科学系,宾夕法尼亚大学,Philadelphia, PA 19104) Amenenberg School of Communication, University of Pennsylvania, Philadelphia, PA 19104(安纳伯格通信学院,宾夕法尼亚大学,Philadelphia, PA 19104) Operations, Information, and Decisions Department, University of Pennsylvania, Philadelphia, PA 19104(运营、信息与决策系,宾夕法尼亚大学,Philadelphia, PA 19104)

AI总结 本研究通过零样本指令提示对YouTube侧边栏候选进行重排序,发现无约束的LLM辅助重排序会放大极端和阴谋论内容,而轻量级提示正则化可在轻微损失相关性的情况下减少极端内容并增加意识形态多样性。

Comments 30 pages total; 11 pages, 5 figures, 2 tables (main text); 19 pages, 11 figures, 9 tables (appendix)

详情
AI中文摘要

推荐系统已从内容组织工具发展为塑造日常行为的复杂系统。通过控制我们所看到的内容,它们塑造了我们的感知,引发了对过滤气泡、激进化、两极分化和社会不平等的担忧。大型语言模型(LLM)实现了更强大的个性化,加剧了这些动态。然而,大多数推荐系统针对参与度或有限的准确性指标进行调优,很少关注更广泛的社会影响,例如个性化如何重塑社会重要领域中的曝光度。我们研究了LLM辅助重排序在提高个性化的同时,是否无意中放大了对意识形态极端或阴谋论政治内容的曝光,这是一种在新闻推荐中理论上存在但尚未得到实证表征的风险。使用真实的新闻消费历史,我们通过零样本、基于指令的提示对YouTube侧边栏候选进行重排序。我们比较了基线提示与一个约束变体,该变体保持主题相关性并扩大意识形态曝光,同时减少阴谋论或极端内容。在没有约束的情况下,重排序加强了个性化,但增加了对历史中包含此类内容的用户的阴谋论和极端主义材料的曝光。轻量级提示级正则化减少了对极端内容的推广并增加了意识形态多样性,同时相关性损失较小。合成实验表明,LLM通过语言中的统计规律而非对意识形态的语义理解进行重排序,这解释了为什么朴素提示会放大这些模式,而正则化可以重塑它们。总之,我们的结果突显了LLM在高风险推荐中实现上下文细微差别的能力,以及评估LLM辅助个性化超越准确性并将提示设计视为有价值负载而非中性默认的必要性。

英文摘要

Recommender systems have grown from content-organization tools into sophisticated systems that shape daily behavior. By controlling what we see, they shape what we perceive, raising concerns about filter bubbles, radicalization, polarization, and social inequality. Large language models (LLMs) enable more powerful personalization, intensifying these dynamics. Yet most recommenders are tuned for engagement or limited accuracy metrics, with little attention to broader social implications, e.g. how personalization reshapes exposure in socially consequential domains. We investigate whether LLM-assisted reranking, while improving personalization, inadvertently amplifies exposure to ideologically extreme or conspiratorial political content, a risk theorized but not empirically characterized in news recommendation. Using real news-consumption histories, we rerank YouTube's sidebar candidates through zero-shot, instruction-based prompting. We compare a baseline prompt with a constrained variant that preserves topical relevance and broadens ideological exposure while reducing conspiratorial or extreme content. Without constraints, reranking strengthened personalization but increased exposure to conspiratorial and extremist material for users whose histories contained such content. Lightweight prompt-level regularization reduced promotion of extreme content and increased ideological diversity, with modest relevance loss. Synthetic experiments suggest that LLMs rerank via statistical regularities in language rather than semantic understanding of ideology, clarifying why naive prompts amplify these patterns and why regularization can reshape them. Together, our results highlight the power of LLMs to operationalize contextual nuance in high-stakes recommendation, and the need to evaluate LLM-assisted personalization beyond accuracy and treat prompt design as a value-laden rather than neutral default.

2606.02872 2026-06-03 eess.SY cs.MA cs.RO cs.SY

Terminal Time and Angle-Constrained Nonlinear Intercept Guidance

终端时间和角度约束的非线性拦截制导

Shivam Bajpai, Abhinav Sinha

发表机构 * University of California(加州大学)

AI总结 针对单一控制输入下的欠驱动非线性拦截问题,提出基于分层滑模的制导律,同时控制终端时间和角度,并扩展至常速目标拦截。

详情
AI中文摘要

本文考虑使用横向加速度作为唯一控制输入,同时控制拦截器的撞击时间和撞击角度的问题。由于单一控制输入,非线性交战运动学本质上是欠驱动的,这使得制导律综合变得复杂。为了克服这一挑战,开发了一种基于分层滑模的制导律,以同时调节两个终端约束。所提出的架构包括一个两层滑模流形。第一层由分别对应撞击时间和撞击角度误差动力学的两个子滑模面组成,而第二层引入了一个组合两个单独子滑模面的复合滑模流形。然后,设计了一种变增益自适应制导律,以确保对静止目标的带时间和角度约束的拦截,并进一步扩展至拦截常速目标。针对各种交战场景进行了仿真,以证明所提出方法的有效性。

英文摘要

This paper considers the problem of simultaneously controlling an interceptor's impact time and impact angle using its lateral acceleration as the sole control input. With a single control input, the nonlinear engagement kinematics is inherently underactuated, which complicates guidance law synthesis. To overcome this challenge, a hierarchical sliding mode-based guidance law is developed to concurrently regulate the two terminal constraints. The proposed architecture consists of a two-layer sliding manifold. The first layer comprises two sub-sliding surfaces corresponding to the impact time and impact angle error dynamics, respectively, while the second layer introduces a composite sliding manifold that combines the two individual sub-surfaces. Then, a variable-gain adaptive guidance law is designed to ensure time and angle-constrained interception against a stationary target, which is further extended to intercept a constant velocity target. Simulations are conducted for various engagement scenarios to attest to the efficacy of the proposed approach.

2606.02867 2026-06-03 cs.MA cs.AI q-bio.PE

The Epi-LLM Framework: probing LLM behavioral priors through epidemiological agent-based models

Epi-LLM框架:通过流行病学基于智能体的模型探究LLM行为先验

Petra Ferenz, Ava Keeling, Tobias O'Keefe, Lorenzo Stigliano, Francesco Di Lauro, Andres Colubri, Jasmina Panovska-Griffiths

发表机构 * Big Data Institute, Li Ka Shing Center for Health Information and Discovery, University of Oxford, Oxford, United Kingdom(大数据研究所、李嘉诚健康信息与发现中心、牛津大学、牛津、英国) Leverhulme Centre for Demographic Science, Nuffield Department of Population Health, University of Oxford, Oxford, United Kingdom(勒弗赫姆人口科学中心、努尔菲尔德人口健康系、牛津大学、牛津、英国) Pandemic Sciences Institute, Nuffield Department of Medicine, University of Oxford, Oxford, United Kingdom(流行病学科学研究所、努尔菲尔德医学系、牛津大学、牛津、英国) Department of Genomics and Computational Biology, UMass Chan Medical School, United States(基因组与计算生物学系、UMass Chan医学学校、美国) Broad Institute of Harvard and MIT, United States(哈佛大学和麻省理工学院Broad研究所、美国) The Queen’s College, University of Oxford, Oxford, United Kingdom(女王学院、牛津大学、牛津、英国)

AI总结 提出Epi-LLM框架,整合基于智能体的建模、真实流行病游戏和大语言模型,模拟疫情中智能体行为,发现LLM智能体减少峰值感染,感知健康严重性是隔离行为最强预测因子,且LLM架构影响疫情动态。

Comments Submitted to American Journal of Epidemiology

详情
AI中文摘要

流行病期间的人类行为会影响传染病动态,但量化这一点仍然极具挑战性。本文介绍了Epi-LLM框架:一种新颖的集成方法,结合了基于智能体的建模、真实流行病游戏和大语言模型(LLM),其中合成智能体社会在疫情接触网络上进行推理并动态适应。将合成智能体行为与无干预的SEIR基线和来自AUIB流行病游戏研究的人类参与者数据进行比较,我们发现四种不同架构的LLM智能体减少了峰值活跃感染,在15天模拟的第6天,隔离合规率达到58-65%。二项广义线性模型显示,感知健康严重性是隔离行为的最强预测因子(β = 0.33, p = 0.002),伪R²为0.055,与人类试验中观察到的0.072相当。LLM架构是疫情动态的关键决定因素:低方差架构为测试行为规则提供了更高的内部效度,而高方差模型可能更好地代表现实世界中的决策。仅凭地理标签无法诱导文化差异化的行为;需要明确的态参数化。这项原理验证工作为将Epi-LLM框架部署为可扩展、无风险的模拟环境用于大流行准备研究奠定了基础。

英文摘要

Human behaviour during epidemics affects infectious disease dynamics, but quantifying this remains deeply challenging. Here we introduce the Epi-LLM framework: a novel integration of agent-based modelling, real-life epigames, and large language models (LLMs) in which a synthetic society of agents reasons and adapts dynamically over an outbreak contact network. Comparing synthetic agent behaviour against a no-intervention SEIR baseline and human participant data from the AUIB epigame study, we find that LLM agents across four different architectures reduced peak active infections, with quarantine compliance peaking at 58-65% on day six of the 15-day simulation. A binomial generalised linear model showed that perceived health severity was the strongest predictor of quarantine behaviour ($β= 0.33, p = 0.002$), yielding a pseudo-$R^2$ of 0.055, comparable to the 0.072 observed in the human trial. LLM architecture is a key determinant of epidemic dynamics: low-variance architectures offer greater internal validity for testing behavioural rules, while high-variance models may better represent real-world decision-making. Geographic labels alone do not induce culturally differentiated behaviour; explicit attitudinal parameterisation is required. This proof-of-principle work lays the groundwork for deploying the Epi-LLM framework as a scalable, risk-free simulation environment for pandemic preparedness research.

2606.02834 2026-06-03 cs.CR cs.AI

Large Byte Model: Teaching Language Models About Compiled Code

大型字节模型:教会语言模型关于编译代码的知识

Florian Störtz, Catalin-Andrei Stan, Alexandru Dinu, Sandra Servia-Rodríguez, Mihaela Gaman, Calin Miron, Edward Raff

发表机构 * CrowdStrike U.K.(CrowdStrike英国分公司) CrowdStrike Romania(CrowdStrike罗马尼亚分公司) CrowdStrike USA(CrowdStrike美国分公司)

AI总结 本文提出首个字节原生大语言模型,通过定制字节分词器扩展词汇表,使其能直接处理可执行文件原始字节并回答恶意软件分析问题,在家族分类和架构分类上分别达到69%和98%的准确率。

详情
AI中文摘要

恶意软件分析始于可执行程序的原始字节,而将其“提升”到更高级表示(如汇编)的工具成本高昂且容易出错。大型语言模型(LLM)无法处理原始字节表示并回答相关问题。为此,我们提出了首个字节原生LLM。基于使用定制字节分词器的词汇扩展技术,该模型能够回答关于恶意软件二进制的复杂问题,准确率从恶意软件家族分类的69%到架构分类的98%不等。我们的发现表明,在训练过程中提供领域知识对此应用至关重要——现成的模型既缺乏准确性也缺乏洞察力。我们已将该新兴解决方案部署给有限数量的分析师,以收集反馈进行进一步改进。

英文摘要

Malware analysis starts with the raw bytes of an executable program, and tools to "lift" these to higher-level representations, such as assembly, are expensive and subject to error. Large Language Models (LLMs) cannot process raw byte representations and answer questions about them. To this end, we present the first byte-native LLM. Based on a vocabulary expansion technique using a bespoke byte tokenizer, such a model is capable of responding to complex questions about malware binaries, with accuracies ranging from 69% for malware family classification to 98% for architecture classification. Our findings indicate that providing domain knowledge during training is essential for this application -- off-the-shelf models lack both accuracy and insight. We've deployed this emerging solution to a limited number of analysts to gather feedback for further improvements.

2606.02822 2026-06-03 cs.CR cs.AI

Which Defense Closes Which Threat? Attributing OWASP-LLM-Top-10 Coverage and Its Brittleness Under Paraphrasing

哪种防御措施应对哪种威胁?归因OWASP-LLM-Top-10覆盖及其在释义下的脆弱性

Alexandre Cristovão Maiorano

发表机构 * Lumytics

AI总结 本文通过归因分析,测量了不同防御家族(拒绝过滤、预算控制等)对OWASP-LLM-Top-10威胁的覆盖情况,并揭示了拒绝防御在释义攻击下的脆弱性。

Comments 17 pages, 4 figures, 7 tables

详情
AI中文摘要

生产级LLM应用堆叠了多种防御家族——拒绝短语过滤器、令牌预算控制、模型白名单、速率限制、工具注册认证——然而现有的攻防模拟(BAS)基准报告单一的总体覆盖数字,隐藏了哪个家族应对哪种威胁。我们测量归因。我们将四个OWASP-LLM-Top-10感知的智能体添加到一个21智能体的基线扫描器中,并针对四个合成LLM端点的格点:$L_0$(无防御)、$L_1$(仅拒绝)、$L_2$(仅预算)和$L_3$(全栈)。$L_1$和$L_2$是兄弟单轴消融,互不为子集;$L_3$是它们的并集加上工具注册认证和凭证清洗。在$N=10$次重复中,每个OWASP的发现计数清晰:仅拒绝消除所有LLM01(越狱)和LLM07(系统提示泄露)发现;仅预算通过终止多步序列消除所有LLM02(敏感信息泄露)和LLM10(无限制消耗)发现;LLM06(过度代理)需要全栈。我们探测释义下的脆弱性:使用300个Gemini生成的释义(在60模板脆弱性语料库上$K=5$),$L_1$拒绝阻断率在LLM01上下降15个百分点,在LLM07上下降25个百分点。第五个目标$L_4$-real,将存根后端替换为Gemini-2.5-flash,使用相同的$L_3$正则表达式,并与$L_1$完全匹配,表明除了正则表达式外没有可测量的对齐贡献(不是关于对齐的一般性声明)。预算控制没有下降(在扣除速率限制下限后为0个百分点)。一个通过静态基准的拒绝白名单可以被LLM驱动的释义器击败而不改变攻击意图;预算控制抵抗相同的变异。

英文摘要

Production LLM applications stack several defense families -- refusal-phrase filters, token-budget controls, model allowlists, rate limits, tool-registry authentication -- yet existing breach-and-attack-simulation (BAS) benchmarks report a single aggregate coverage number, hiding which family closes which threat. We measure attribution. We add four OWASP-LLM-Top-10-aware agents to a 21-agent baseline scanner and target a lattice of four synthetic LLM endpoints: $L_0$ (no defenses), $L_1$ (refusal-only), $L_2$ (budget-only), and $L_3$ (full stack). $L_1$ and $L_2$ are sibling single-axis ablations, not subsets of each other; $L_3$ is their union plus tool-registry authentication and credential scrubbing. Across $N=10$ replications, the per-OWASP finding count is clean: refusal alone removes all LLM01 (jailbreak) and LLM07 (system-prompt leakage) findings; budget alone removes all LLM02 (sensitive-info disclosure) and LLM10 (unbounded consumption) findings by terminating multi-step sequences; LLM06 (excessive agency) requires the full stack. We probe brittleness under paraphrasing: with 300 Gemini-generated paraphrases ($K=5$ over a 60-template brittleness corpus), $L_1$ refusal block rate falls 15 pp on LLM01 and 25 pp on LLM07. A fifth target, $L_4$-real, swaps the stub backend for Gemini-2.5-flash behind the same $L_3$ regex and matches $L_1$ exactly, indicating no measurable alignment contribution beyond the regex (not a general claim about alignment). Budget controls show no drop (0 pp once the rate-limit floor is factored out). A refusal whitelist that clears a static benchmark can be defeated by an LLM-driven paraphraser without changing attack intent; a budget control resists the same mutation.

2606.02781 2026-06-03 cs.AR cs.AI cs.ET

CRAM-ER: Error-Resilient Spintronic Computational Random Access Memory for Scalable In-Memory Computation

CRAM-ER:面向可扩展存内计算的容错自旋计算随机存取存储器

Sohan Salahuddin Mugdho, Md. Shahedul Hasan, Brahmdutta Dixit, Yang Lv, Jian-Ping Wang, Cheng Wang

发表机构 * Electrical and Computer Engineering Iowa State University of Science and Technology(电气与计算机工程学院爱荷华州立大学科学与技术学院) Electrical and Computer Engineering University of Minnesota Twin Cities(电气与计算机工程学院明尼苏达大学双城分校)

AI总结 针对基于MRAM的计算随机存取存储器(CRAM)在加速深度神经网络时面临的概率性开关错误和低吞吐量问题,提出一种混合自旋-CRAM与CMOS加法器树的容错架构(CRAM-ER),通过硬件-软件协同设计实现高能效、高可靠性的矩阵向量乘法。

详情
AI中文摘要

深度神经网络(DNN)在多个领域取得了最先进的性能。然而,传统的冯·诺依曼计算范式面临严重的内存瓶颈。新兴的近内存和存内计算方法缓解了这一问题,但引入了显著的外围开销。基于MRAM的计算随机存取存储器(CRAM)能够实现无外围开销的原位逻辑,提供了一种密集、节能的解决方案。然而,概率性的MRAM开关会导致门级错误,限制了CRAM在加速DNN时的可扩展性和可靠性。此外,大量的顺序MRAM写入严重制约了CRAM的吞吐量。为了解决这些挑战,我们提出了一种容错CRAM(CRAM-ER)架构,用于可扩展的存内矩阵向量乘法(MVM)。我们的错误感知硬件-软件协同设计框架利用混合自旋-CRAM + CMOS加法器树架构来减轻器件级错误的影响,展示了具有高面积和能效的MVM功能。我们进一步开发了错误感知模型微调和细粒度纠错技术,以增强错误容限。在DNN基准测试上对CMOS+自旋混合架构的评估显示,在将CRAM延迟降低多达两个数量级的同时,实现了近乎无损的精度,在能效和能量延迟积方面均优于CPU/GPU+高带宽DRAM。

英文摘要

Deep neural networks (DNNs) have achieved state-of-the-art performance across diverse domains. However, typical Von Neumann compute paradigms face severe memory bottlenecks. Emerging near-memory and compute-in-memory approaches alleviate this but incur significant peripheral overhead. Computational Random Access Memory (CRAM) based on MRAM enables in-situ logic without peripheral overhead, offering a dense, energy-efficient solution. However, probabilistic MRAM switching induces gate-level errors that limit the scalability and reliability of CRAM for accelerating DNN. Moreover, the large number of sequential MRAM writes severely constrains CRAM throughput. To address these challenges, we propose an error-resilient CRAM (CRAM-ER) architecture for scalable in-memory matrix-vector multiplications (MVMs). Our error-aware hardware-software co-design framework leverages a hybrid spintronic-CRAM + CMOS adder-tree architecture to mitigate the impact of device-level errors, demonstrating MVM functionality with high area and energy efficiency. We further develop an error-aware model fine-tuning and fine-grained error correction for enhanced error resilience. Evaluations of the CMOS+spintronic hybrid architecture on DNN benchmarks show near-lossless accuracy while reducing CRAM latency by up to 2 orders of magnitude, outperforming CPU/GPU+high-bandwidth DRAM in both energy efficiency and energy-delay product.

2606.02737 2026-06-03 cs.IR cs.AI cs.CL

Attention Calibration for Position-Fair Dense Information Retrieval

面向位置公平的密集信息检索的注意力校准

Andrianos Michail, Elias Schuhmacher, Juri Opitz, Simon Clematide, Rico Sennrich

发表机构 * Department of Computational Linguistics University of Zurich(计算语言学系苏黎世大学)

AI总结 针对密集检索模型的位置偏差问题,提出在推理时通过注意力校准(引入强度系数λ插值原始与完全校准分布)来提升位置公平性,无需重新训练且不牺牲整体检索效果,在多个数据集和模型上验证了部分校准优于完全校准,并提供了默认配置。

详情
AI中文摘要

密集检索模型存在位置偏差:当相关信息出现在段落较后位置时,检索效果会下降(Zeng et al., 2025)。我们探究是否可以在推理时减少这种偏差,无需重新训练且不牺牲整体检索效果。为此,我们将推理时的注意力校准(Schuhmacher et al., 2026)适配到下游检索,并引入强度系数λ,在原始注意力分布和完全校准的注意力分布之间进行插值。在SQuAD-PosQ和FineWeb-PosQ上的三个嵌入模型上,我们考察了篮子大小、校准层集和强度如何影响位置公平性与检索效果之间的权衡,发现部分校准通常优于完全校准。单个配置(B=128, λ=0.5, 50%层深度)在FineWeb-PosQ上提升了所有三个模型跨位置组的nDCG@10的调和平均值,无需逐模型调参,并且适用于<s>-池化和最后token池化两种架构。该默认配置无需修改即可迁移到PosIR(涵盖10种语言和31个领域),在所有16种长度四分位×模型×检索设置组合中降低了位置敏感指数,同时保持或提升了整体nDCG@10。我们在以下网址发布扩展后的代码库:this https URL

英文摘要

Dense retrieval models exhibit positional bias: retrieval effectiveness degrades when relevant information appears later in a passage (Zeng et al., 2025). We ask whether this bias can be reduced at inference time, without retraining and without sacrificing overall retrieval effectiveness. To this end, we adapt inference-time attention calibration (Schuhmacher et al., 2026) to downstream retrieval and extend it with a strength coefficient lambda that interpolates between the original and fully calibrated attention distributions. Across three embedding models on SQuAD-PosQ and FineWeb-PosQ, we examine how basket size, calibrated layer set, and strength affect the trade-off between positional fairness and retrieval effectiveness, finding that partial calibration frequently outperforms full calibration. A single configuration (B=128, lambda=0.5, 50% layer depth) improves the harmonic mean of nDCG@10 across positional groups on FineWeb-PosQ for all three models without per-model tuning, and applies to both <s>-pooled and last-token-pooled architectures. This default configuration transfers without modification to PosIR, which spans 10 languages and 31 domains, reducing the Position Sensitivity Index in all 16 length-quartile x model x retrieval-setting combinations, while preserving or improving aggregate nDCG@10. We release our extended codebase at https://github.com/impresso/fair-sentence-transformers

2606.02644 2026-06-03 cs.CR cs.AI

A New Framework for Cybersecurity Refusals in AI Agents

AI代理中网络安全拒绝的新框架

Eliot Krzysztof Jones, Mateusz Dziemian, Matt Fredrikson, J Zico Kolter

发表机构 * Gray Swan Gray Swan AI Carnegie Mellon University(卡内基梅隆大学)

AI总结 提出首个针对AI代理在进攻性安全场景中建立拒绝边界的框架,包括拒绝原则、任务分类和评估方法,并发现8个前沿模型中6个拒绝率接近零。

详情
AI中文摘要

代理脚手架显著提升了LLM在复杂、长期任务上的表现,在网络安全等领域带来了广泛益处和放大风险。现有的AI代理网络安全基准主要关注能力测量——代理能多有效地完成进攻性安全任务——但忽略了一个关键问题:代理何时以及如何拒绝有害请求?我们提出了首个在进攻性安全场景中建立拒绝边界的框架。我们的框架定义了(1)任务应被拒绝的原则性标准,(2)应被拒绝的任务类别,以及(3)在良性和对抗条件下测量代理鲁棒性的评估方法。我们应用该框架评估当前基于LLM的代理在一系列基于Web的进攻性安全场景中是否遵守适当的拒绝边界,发现测试的8个前沿模型中有6个拒绝率接近零,只有2个模型(GPT-5.2和GPT-5.1 Codex)表现出任何有意义的拒绝行为。

英文摘要

Agentic scaffolds have dramatically improved LLM performance on complex, long-horizon tasks, yielding both broad benefits and amplified risks in domains like cybersecurity. Existing benchmarks for AI agents in cybersecurity focus mainly on measuring proficiency--how effectively agents can complete offensive security tasks--but neglect a critical question: when and how should agents refuse harmful requests? We present the first framework for establishing refusal boundaries in offensive security contexts. Our framework defines (1) principled criteria for when tasks should be refused, (2) categories of tasks that warrant refusal, and (3) evaluation methodology for measuring agent robustness under both benign and adversarial conditions. We apply this framework to assess how current LLM-powered agents adhere to appropriate refusal boundaries across a range of web-based offensive security scenarios, finding that 6 of 8 frontier models tested show near-zero refusal rates, with only 2 models (GPT-5.2 and GPT-5.1 Codex) demonstrating any meaningful refusal behavior.