arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 21516
2606.04329 2026-06-04 cs.CR cs.AI

From Untrusted Input to Trusted Memory: A Systematic Study of Memory Poisoning Attacks in LLM Agents

从不可信输入到可信内存:LLM智能体中内存投毒攻击的系统研究

Pritam Dash, Tongyu Ge, Aditi Jain, Tanmay Shah, Zhiwei Shang

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 本文系统研究了基于LLM的智能体中的内存投毒攻击,识别了四种内存写入通道和九种结构漏洞,提出了六类攻击的分类法,并设计了评估基准MPBench,发现更积极读写内存的智能体更易被利用,且现有提示注入防御无法覆盖内存投毒攻击。

详情
AI中文摘要

内存是AI智能体的核心组件,使其能够在交互中积累知识并提高性能。然而,持久性内存引入了内存投毒的风险,即单个对抗性内存写入可以对智能体行为产生长期影响。我们对基于LLM的智能体中的内存投毒进行了系统研究。我们识别了四种内存写入通道和九种模型能力、系统提示设计以及智能体系统架构中的结构漏洞,这些漏洞使得这些通道可被利用。基于这些漏洞,我们提出了六类内存投毒攻击的分类法。此外,我们设计了MPBench——一个用于评估内存投毒攻击的基准,并表明设计为更积极读写和检索内存的智能体更容易被利用。我们还表明,现有的提示注入防御无法覆盖内存投毒攻击。我们的发现为理解和缓解针对AI智能体的内存投毒攻击提供了基础。

英文摘要

Memory is a core component of AI agents, enabling them to accumulate knowledge across interactions and improve performance. However, persistent memory introduces the risk of memory poisoning, where a single adversarial memory write can exert long-term influence over agent behavior. We present a systematic study of memory poisoning in LLM-based agents. We identify four memory write channels and nine structural vulnerabilities in model capabilities, system prompt design, and agent system architecture that make these channels exploitable. Based on these vulnerabilities, we develop a taxonomy of six classes of memory poisoning attacks. Furthermore, we design MPBench -- a benchmark for evaluating memory poisoning attacks, and show that agents designed to write and retrieve memory more aggressively are more exploitable. We also show that existing prompt injection defenses fail to cover memory poisoning attacks. Our findings provide a foundation for understanding and mitigating memory poisoning attacks against AI agents.

2606.04328 2026-06-04 cs.NI cs.AI

Generalizable Multi-Task Learning for Wireless Networks Using Prompt Decision Transformers

基于提示决策变压器的无线网络可泛化多任务学习

Fatih Temiz, Shavbo Salehi, Melike Erol-Kantarci

发表机构 * IEEE University of California, Berkeley(加州大学伯克利分校)

AI总结 提出PromptDT框架,将多小区选择重构为序列建模问题,利用离线轨迹和任务特定提示实现跨异构网络配置的可扩展学习,在无需重训练的情况下提升多任务QoE达49%。

Comments Accepted paper at IEEE International Mediterranean Conference on Communications and Networking (MeditCom) 2026

详情
AI中文摘要

未来无线网络需要快速适应高度异构的环境和动态任务配置,这要求从传统的基于规则和优化的无线资源管理(RRM)转向人工智能(AI)驱动的RRM。AI驱动的方法可以学习复杂的非线性关系,泛化到不同的网络条件,并实现实时、可扩展和自主的决策。在RRM技术中,协调多点(CoMP)传输对于减轻小区间干扰和提升小区边缘性能至关重要,从而在密集部署中改善体验质量(QoE)。然而,最优多小区选择仍然是一个复杂的组合挑战,因为它需要在动态流量和信道条件下联合优化许多可能的服务小区组合。尽管取得了成功,但传统的深度强化学习(DRL)方法,如近端策略优化(PPO),在状态和动作空间变化时存在样本效率低、泛化能力有限和重新训练成本高的问题。为了解决这些瓶颈,我们提出了一种基于提示决策变压器(PromptDT)的多任务学习框架,该框架能够跨不同网络配置学习,并将多小区选择重构为序列建模问题。通过利用离线轨迹和任务特定提示,PromptDT实现了跨不同网络配置(包括变化的基站和用户设备数量以及调度策略)的可扩展学习。实验结果表明,与基线相比,PromptDT在多任务设置中将QoE提高了高达49%,且性能随模型容量正向扩展。此外,PromptDT能有效泛化到未见过的任务,实现对新网络配置的鲁棒少样本适应,无需重新训练或微调。

英文摘要

Future wireless networks demand rapid adaptation to highly heterogeneous environments and dynamic task configurations, necessitating a shift from conventional rule-based and optimization-driven radio resource management (RRM) toward artificial intelligence (AI)-driven RRM. AI-driven approaches can learn complex nonlinear relationships, generalize across diverse network conditions and enable real-time, scalable and autonomous decision-making. Among RRM techniques, coordinated multipoint (CoMP) transmission is pivotal for mitigating inter-cell interference and enhancing cell-edge performance, thereby improving quality of experience (QoE) in dense deployments. However, optimal multi-cell selection remains a complex combinatorial challenge as it requires jointly optimizing over many possible serving-cell combinations under dynamic traffic and channel conditions. Despite their success, conventional deep reinforcement learning (DRL) methods such as proximal policy optimization (PPO) suffer from poor sample efficiency, limited generalization, and costly retraining when state and action spaces change. To address these bottlenecks, we propose a Prompt Decision Transformer (PromptDT) based multi-task learning framework capable of learning across diverse network configurations and reformulating multi-cell selection as a sequence modeling problem. By leveraging offline trajectories and task-specific prompts, PromptDT enables scalable learning across diverse network configurations, including varying base stations and user equipment counts, and scheduler policies. Experimental results demonstrate that PromptDT improves QoE by up to 49% in multi-task settings compared to baselines, with performance scaling positively alongside model capacity. Moreover, PromptDT generalizes effectively to unseen tasks, achieving robust few-shot adaptation to new network configurations without retraining or fine-tuning.

2606.04319 2026-06-04 cs.GR cs.CV

PureLight: Learning Complex Luminaires with Light Tracing

PureLight: 使用光线追踪学习复杂光源

Pedro Figueiredo, Zixuan Li, Beibei Wang, Miloš Hašan, Nima Khademi Kalantari

发表机构 * Texas A&M University(德克萨斯大学) Nankai University(南开大学) Nanjing University of Science and Technology(南京理工大学) NVIDIA(NVIDIA公司)

AI总结 提出一种基于神经网络的公式,通过光线追踪和归一化流网络学习复杂光源的辐射分布,并蒸馏为轻量级MLP以实现高效渲染。

Comments 9 pages, 10 figures

详情
AI中文摘要

我们提出了一种神经公式来估计复杂光源的外观。我们专注于具有复杂光传输(例如,被多个镜面层包围的小型发射器)的具有挑战性的光源,这些光源对于(双向)路径追踪来说很难处理。为此,我们使用光线追踪从发射器构建路径到出射表面,并将外观估计公式化为一个分布学习问题。具体来说,我们使用一个大型归一化流网络对出射表面上的出射辐射概率密度函数(pdf)进行建模,并将出射辐射恢复为估计的pdf与通量的乘积。为了实现高效推理,我们将学习到的外观蒸馏到一个轻量级MLP中,该MLP直接估计出射表面上的辐射。我们还训练了一个采样网络用于从光源进行有效的直接照明计算,以及一个混合网络将光源合成到场景中。我们的公式使得在任意场景中使用低样本数渲染具有挑战性的光源成为可能。

英文摘要

We propose a neural formulation for estimating the appearance of complex luminaires. We focus on challenging luminaires with complex light transport (e.g., small emitters enclosed by multiple specular layers) that are difficult for (bidirectional) path tracing. To this end, we use light tracing to construct paths from emitters to the exit surfaces and formulate appearance estimation as a distribution learning problem. Specifically, we model the probability density function (pdf) of outgoing radiance on the exit surfaces using a large normalizing flow network, and recover the outgoing radiance as the product of the estimated pdf and flux. To enable efficient inference, we distill the learned appearance into a lightweight MLP that directly estimates radiance on the exit surfaces. We additionally train a sampling network for effective direct illumination computation from the luminaire, and a blending network to composite the luminaire into the scene. Our formulation makes it feasible to render challenging luminaires using low sample counts in arbitrary scenes.

2606.04317 2026-06-04 cs.CR cs.LG cs.SE

Toward a Generalized Defense Across Sparse, Continuous, and Structured Parameter Attacks

面向稀疏、连续和结构化参数攻击的通用防御

Bin Duan, Zeyu Bai, Guowei Yang

发表机构 * School of Electrical Engineering and Computer Science, The University of Queensland, Australia(电气工程与计算机科学学院,昆士兰大学,澳大利亚)

AI总结 提出 ParDef 框架,通过密钥通道重参数化、QC-LDPC 量化和自适应鲁棒推理,实现对多种参数攻击的通用防御,在保持高性能的同时降低攻击成功率。

详情
AI中文摘要

深度神经网络越来越多地部署在异构和部分不可信的环境中,模型通过云存储、CI/CD 流水线、容器化服务和边缘执行平台进行分发。这种广泛的部署场景使模型参数面临各种完整性风险。与输入空间对抗攻击不同,参数攻击直接篡改模型的内部参数,并持续影响所有后续推理。现有防御要么需要重新训练,要么导致显著的精度下降,或者仅限于特定的攻击类别。然而,在实际部署场景中,参数攻击的形式往往不可预测。为了解决这一挑战,我们提出了 ParDef,一种针对深度神经网络面向多种类型参数攻击的通用防御。ParDef 集成了密钥通道重参数化(隐藏敏感参数方向)、QC-LDPC 量化(嵌入冗余并支持纠错)以及自适应鲁棒推理(在不确定性下稳定预测)。我们在 CIFAR-10、CIFAR-100 和 Tiny-ImageNet 上使用 ResNet 和 VGG 模型的评估表明,ParDef 在不同参数攻击下持续降低攻击成功率,同时保持较高的模型性能,且仅引入适度的部署开销。这些结果凸显了 ParDef 是一种实用且通用的 DNN 部署防御方案。

英文摘要

Deep neural networks are increasingly deployed across heterogeneous and partially untrusted environments, where models are distributed through cloud storage, CI/CD pipelines, containerized services, and edge execution platforms. This broad deployment landscape exposes model parameters to various integrity risks. Unlike input-space adversarial attacks, parameter attacks directly tamper with the model's internal parameters and persist across all subsequent inferences. Existing defenses either require retraining, incur significant accuracy degradation, or are limited to specific attack classes. However, in real-world deployment scenarios, the forms of parameter attacks are often unpredictable. To address this challenge, we present ParDef, a generalized defense for deep neural networks against diverse types of parameter attacks. ParDef integrates keyed channel reparameterization, which obscures sensitive parameter directions, QC-LDPC quantization, which embeds redundancy and supports error correction, and adaptive robust inference, which stabilizes predictions under uncertainty. Our evaluation on CIFAR-10, CIFAR-100, and Tiny-ImageNet using ResNet and VGG models demonstrates that ParDef consistently reduces attack success rates across different parameter attacks while maintaining high model performance and incurring only moderate deployment overhead. These results highlight that ParDef is a practical and generalized defense for DNN deployments.

2606.04266 2026-06-04 cs.CR cs.LG

Long-Term and Short-Term Transistor Aging in Deep Neural Networks: Impact and Mitigation

深度神经网络中的长期与短期晶体管老化:影响与缓解

Alireza Sarmadi, Virinchi Roy Surabhi, Prashanth Krishnamurthy, Hussam Amrouch, Ramesh Karri, Farshad Khorrami

发表机构 * Dept. of Electrical and Computer Engineering, New York University (NYU) Tandon School of Engineering(纽约大学电气与计算机工程系(Tandon工程学院)) School of Computation, Information and Technology, Technical University of Munich (TUM)(慕尼黑技术大学计算、信息与技术学院)

AI总结 本文研究了长期和短期晶体管老化对深度神经网络推理精度的影响,并提出了一种老化感知重训练方法来缓解性能下降。

Comments 28 pages, 16 figures

详情
AI中文摘要

深度神经网络(DNN)被用于各种实际应用,例如图像分类和语音识别。在集成电路(IC)的硬件上实现的DNN的推理精度会在晶体管老化等现象下下降。老化会减慢晶体管的开关速度,由于时钟无法维持而导致系统级时序违规。为了在整个预期寿命内保持可靠性,设计人员添加保护带以防止时序违规;然而,添加大的时序保护带会导致性能(速度或吞吐量)损失。本章详细讨论了长期和短期晶体管老化对DNN推理精度的影响。此外,为了减轻老化对DNN精度的影响并控制它们,提出了一种老化感知重训练方法,以生成即使在激进(即小于所需)保护带下也具有弹性的DNN。这提高了DNN在老化引起的退化情况下的推理精度。本章在用于图像分类的DNN硬件实现上,使用现成的图像数据集讨论了这些影响以及缓解策略。还简要讨论了短期老化作为检测集成电路中硬件木马的激励机制的应用。

英文摘要

Deep neural networks (DNNs) are used in a variety of real-world applications including, for example, image classification and speech recognition. The inference accuracy of DNN implemented on hardware in integrated circuits (ICs) degrades under phenomena such as transistor aging. Aging slows down the switching speed of transistors, resulting in system-level timing violations due to unsustainable clocks. To maintain reliability for the entire projected lifetime, designers add guardbands to prevent timing violations; however, adding large timing guardbands causes losses in performance (speed or throughput). This chapter provides a detailed discussion of the effects of long-term and short-term transistor aging on DNN inference accuracy. Furthermore, to mitigate aging effects on DNN's accuracy and keep them at bay, a methodology for aging-aware retraining is presented in order to generate a resilient DNN even when aggressive (i.e., smaller than required) guardbands are used. This improves the inference accuracy of the DNNs even in the presence of aging-induced degradation. These effects are discussed in this chapter along with mitigation strategies on a hardware implementation of a DNN for image classification on an off-the-shelf image dataset. The application of short-term aging as an excitation mechanism for the detection of hardware Trojans in integrated circuits is also briefly discussed.

2606.04265 2026-06-04 math.OC cs.LG cs.NA math.NA

Nonlocal Mean Field Schrödinger Bridge with Learned Interactions

具有学习相互作用的非局部平均场薛定谔桥

Daisuke Inoue, Mathieu Laurière, Dante Kalise

发表机构 * Department of Mathematics, Imperial College London(伦敦帝国学院数学系) Shanghai Frontiers Science Center of Artificial Intelligence and Deep Learning(上海前沿人工智能与深度学习科学中心) NYU-ECNU Institute of Mathematical Sciences, NYU Shanghai(纽约大学上海数学科学研究所)

AI总结 本文提出一种使用神经网络代理近似非局部相互作用的平均场薛定谔桥方法,将推理时的每步计算成本从二次降低到线性,并推导了代理误差传播的稳定性界限。

Comments 31 pages, 15 figures

详情
AI中文摘要

薛定谔桥问题构建一个以最小能量连接初始分布和终端分布的随机过程。本文考虑其平均场扩展,即平均场薛定谔桥,用于相互作用粒子系统。对于非局部相互作用,评估产生的依赖于粒子的分布项的计算量随种群规模呈二次增长,这使得大规模问题难以处理。我们通过使用神经网络代理近似非局部相互作用来解决这一瓶颈。由此产生的四阶段交替算法将推理时每步成本从种群规模的二次降低到线性。我们还推导了Grönwall型稳定性界限,显示代理误差如何传播到生成的轨迹。在导航和意见动力学任务的数值实验中,所提出的方法再现了通过解析评估获得的轨迹,并减少了训练时间。

英文摘要

The Schrödinger Bridge Problem constructs a stochastic process that connects an initial distribution to a terminal distribution with minimum energy. This work considers its mean-field extension, the Mean-Field Schrödinger Bridge, for interacting particle systems. With nonlocal interactions, evaluating the resulting particle-dependent distributional terms can scale quadratically with the population size, which makes large-scale problems intractable. We address this bottleneck by approximating the nonlocal interactions with neural network surrogates. The resulting four-stage alternating algorithm reduces the per-step cost from quadratic to linear in the population size at inference. We also derive Grönwall-type stability bounds that show how surrogate errors propagate to the generated trajectories. In numerical experiments on navigation and opinion-dynamics tasks, the proposed method reproduces trajectories obtained with analytical evaluation and reduces training time.

2606.04210 2026-06-04 eess.AS cs.LG cs.SD

Representation Matters in Randomized Smoothing for Audio Classification

表示在音频分类的随机平滑中至关重要

Jong-Ik Park, Shreyas Chaudhari, José M. F. Moura, Carlee Joe-Wong

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 研究随机平滑在音频分类中的表示问题,通过实验揭示预处理和表示选择对认证鲁棒性的影响,并提出报告规范。

详情
AI中文摘要

随机平滑(RS)在添加高斯噪声的向量空间中认证鲁棒性。在音频分类中,该空间通常不是唯一确定的,因为标准流程会对波形进行归一化、范围控制,并将其转换为log-mel或其他频谱特征。我们表明,除非认证对象和预处理策略明确,否则直接RS是欠定义的。在两个音频基准(关键词识别和环境声音分类)上,我们研究了波形、特征空间和后处理平滑。我们的诊断显示了为什么表示感知的报告是必要的:在相同的平滑水平$σ=0.0025$下,两个数据集共享相同的中位数原始半径$.007996$,但不同的波形能量产生不同的SNR等效尺度($83.98$ vs. $90.97$ dB);log-mel平滑在环境声音上给出更高的正半径认证准确率($68.42\%$ vs. $65.53\%$),认证了更多具有非零半径的样本,但基于特征而非波形;裁剪或峰值归一化将有效扰动范数改变约$230$--$351\times$。因此,我们建议音频RS研究选择并报告任务特定的认证对象和扰动模型,包括扰动位置、增益策略、原始半径以及任何噪声后的几何变化。

英文摘要

Randomized smoothing (RS) certifies robustness in the vector space where Gaussian noise is added. In audio classification, this space is often not uniquely defined as standard pipelines normalize, range-control, and transform waveforms into log-mel or other spectral features. We show that direct RS is therefore under-specified unless the certified object and preprocessing policy are explicit. On two audio benchmarks, keyword spotting and environmental-sound classification, we study waveform, feature-space, and post-processed smoothing. Our diagnostics show why representation-aware reporting is necessary: at the same smoothing level $σ=0.0025$, the two datasets share the same median raw radius $.007996$, but different waveform energies yield different SNR-equivalent scales ($83.98$ vs. $90.97$ dB); log-mel smoothing gives higher positive-radius certified accuracy on environmental sounds ($68.42\%$ vs. $65.53\%$), certifying more examples with nonzero radius but over features rather than waveforms; and clipping or peak normalization changes the effective perturbation norm by roughly $230$--$351\times$. We therefore recommend that audio RS studies choose and report the task-specific certified object and perturbation model, including the perturbation location, gain policy, raw radius, and any post-noise geometry changes.

2606.04205 2026-06-04 cs.MM cs.AI cs.CL cs.CV cs.LG cs.SD

DetectZoo: A Unified Toolkit for AI-Generated Content Detection Across Text, Audio, and Image Modalities

DetectZoo:一个用于跨文本、音频和图像模态的AI生成内容检测的统一工具包

Sajad Ebrahimi, Nima Jamali, Bardia Shirsalimian, Kelly McConvey, Wentao Zhang, Jalehsadat Mahdavimoghaddam, Maksym Taranukhin, Maura Grossman, Vered Shwartz, Yuntian Deng, Ebrahim Bagheri

发表机构 * University of Toronto(多伦多大学) University of Waterloo(滑铁卢大学) Toronto Metropolitan University(多伦多 Metropolitan 大学) University of British Columbia(不列颠哥伦比亚大学) Vector Institute(向量研究所)

AI总结 提出DetectZoo,一个首个统一的多模态AI生成内容检测工具包,通过标准化数据预处理、评估流程和集成61个检测器与22个基准数据集,实现公平可重复的基准测试。

详情
AI中文摘要

生成模型的日益普及和能力提升模糊了人类与机器生成内容之间的界限,推动了跨文本、图像和音频检测领域的大量研究。大多数现有的检测器要么是商业软件,要么是开源但带有不兼容的代码库、定制化的预处理、评估协议和评估指标,这使得它们的采用、公平比较和复现变得相当困难。为了解决这一关键差距,我们引入了DetectZoo,这是首个可扩展的工具包,旨在为跨文本、音频和图像模态的AI生成内容检测提供统一接口。DetectZoo标准化了从数据摄取和预处理到模型评估的完整实证流程,为研究人员提供了一个统一的框架来系统地基准测试最先进的检测器。通过将多样的公共数据集和基线检测算法集成到单一的统一API下,我们的工具包促进了严格且可重复的评估。DetectZoo提供了61个检测器的参考实现、22个基准数据集的原生加载器,以及一个标准化的评估流程,通过通用接口报告多个指标。每个检测器都是自包含的,但可通过同一接口访问,自动缓存预训练权重,并复现原始发表的结果。DetectZoo降低了多模态AI取证的入门门槛,使研究人员能够识别跨领域的性能差距,并加速开发鲁棒、可泛化的检测技术。开源仓库和全面文档可在https://github.com/sadjadeb/DetectZoo 获取,且可通过pip install detectzoo安装该包。

英文摘要

The growing popularity and capacity of generative models have eroded the distinction between human and machine-generated content, motivating a growing body of work on detection across text, images, and audio. Most available detectors are either commercial software or, if open-source, come with incompatible codebases with bespoke preprocessing, evaluation protocols, and evaluation metrics, which make their adoption, fair comparison, and reproduction quite difficult. To address this critical gap, we introduce DetectZoo, a first-of-its-kind, extensible toolkit designed to provide a unified interface for AI-generated content detection across text, audio, and image modalities. DetectZoo standardizes the complete empirical pipeline, from data ingestion and preprocessing to model assessment, offering researchers a cohesive framework to benchmark state-of-the-art detectors systematically. By integrating diverse public datasets and baseline detection algorithms under a single, unified API, our toolkit facilitates rigorous and reproducible evaluation. DetectZoo provides reference implementations of 61 detectors, native loaders for 22 benchmark datasets, and a standardized evaluation pipeline that reports multiple metrics through a common interface. Each detector is self-contained yet accessible through the same interface, automatically caches pretrained weights, and reproduces the original published results. DetectZoo lowers the barrier to entry for multi-modal AI forensics, enabling researchers to identify performance gaps across domains and accelerating the development of robust, generalizable detection techniques. The open-source repository and comprehensive documentation are publicly available at https://github.com/sadjadeb/DetectZoo, and the package can be installed via pip install detectzoo.

2606.04197 2026-06-04 cs.MA cs.CL cs.SI physics.soc-ph

Exploring the Topology and Memory of Consensus: How LLM Agents Agree, Fragment, or Settle When Forming Conventions

探索共识的拓扑与记忆:LLM智能体在形成惯例时如何达成一致、分裂或稳定

Aliakbar Mehdizadeh, Martin Hilbert

发表机构 * Department of Communication, University of California, Davis(通信系,加州大学戴维斯分校)

AI总结 研究LLM多智能体系统中记忆深度与通信拓扑的交互作用,发现记忆对协调的影响符号会因网络中心化程度而反转,并揭示了记忆介导的速度-统一性权衡。

Comments Submitted to the Journal of Artificial Societies and Social Simulation (JASSS)

详情
AI中文摘要

一个LLM智能体应该记住多少,以及多智能体系统在试图达成共识时应该如何连接?我们展示了这两个设计选择以某种方式交互,使得记忆对协调的影响符号发生翻转。通过对八个固定的16智能体拓扑上的网络化命名游戏进行432次模拟运行,我们改变了记忆深度和网络结构。更长的记忆在去中心化网络中减缓了达到稳态的时间,但在中心化网络中加速了这一过程;相同的参数根据拓扑将系统推向相反的方向。关键的是,中心化网络中的“更快稳定”意味着更快地锁定到一个碎片化的平台,而不是达到系统范围的共识,这可以用来产生分歧的意见。我们进一步记录了一种记忆介导的速度-统一性权衡:中心化网络始终比去中心化网络保留更多竞争性惯例,但它们的稳定速度严重依赖于记忆。在智能体层面,网络内分析表明,高中介性的桥梁遭受中介惩罚,而局部聚类邻域中的智能体实现更高的协调成功。最后,为了寻找可解析的生成机制,我们发现智能体的选择被虚拟博弈很好地捕捉,表明是基于信念而非基于奖励的适应。实际意义:记忆深度和通信拓扑应共同设计,而不是孤立优化。

英文摘要

How much should an LLM agent remember, and how should multi-agent systems be connected when trying to reach consensus? We show these two design choices interact in a way that flips the sign of memory's effect on coordination. Across 432 simulation runs of a networked Naming Game on eight fixed 16-agent topologies, we vary memory depth and network structure. Longer memory slows the time to reach steady state in decentralized networks but accelerates it in centralized ones; the same parameter pushes the system in opposite directions depending on topology. Critically, "faster settling" in centralized networks means locking in to a fragmented plateau more quickly, not reaching system-wide consensus, which can be used to generate diverging opinions. We further document a memory-mediated speed-unity trade-off: centralized networks consistently preserve more competing conventions than decentralized networks, but their settling speed depends sharply on memory. At the agent level, within-network analyses show that high-betweenness bridges suffer a brokerage penalty while agents in locally clustered neighborhoods achieve higher coordination success. Finally, in search of analytically tractable generative mechanisms, we find that agents' choices are well captured by Fictitious Play, indicating belief-based rather than reward-based adaptation. The practical implication: memory depth and communication topology should be co-designed, not optimized in isolation.

2606.04193 2026-06-04 cs.CR cs.AI cs.DC

Notarized Agents: Receiver-Attested Confidential Receipts for AI Agent Actions

公证代理:面向AI代理行为的接收方认证保密收据

Juan Figuera

发表机构 * Independent Researcher, Sello Project(独立研究者,Sello项目)

AI总结 针对AI代理日志自审计的信任缺陷,提出接收方签名收据协议Sello,通过HPKE加密、JWS绑定和Merkle日志实现防篡改追踪。

Comments 22 pages. Reference implementation at https://github.com/juanfiguera/sello

详情
AI中文摘要

当前AI代理的可观测性在结构上存在缺陷:生成活动日志的实体与被记录活动的实体是同一个。被攻破或有缺陷的代理可以省略、篡改或伪造自身的追踪记录,而运行代理的操作员没有独立的方法检测篡改。我们提出了一类协议来解决这个问题,通过反转信任边界:接收代理调用的服务使用自己的密钥对观察到的内容签名收据,将收据加密给代理的所有者,并将其发布到公共透明度日志中。所有者可以在不信任代理或其操作员的情况下重建防篡改追踪。我们将该类协议实例化为Sello,一种结合了当前任何系统都不具备的四个属性的协议:(P1)接收方签名,(P2)通过JWS将HPKE加密绑定到所有者公钥的授权令牌,(C3)发布到见证人共同签名的Merkle日志,以及(P4)通过令牌引用进行所有者端发现。我们描述了该协议,分析了在攻击者控制代理及其操作员的情况下的安全性,给出了加密操作的微基准测试,并将Sello与相邻的收据协议工作(Signet、AgentROA、Agent Passport System、draft-farley-acta、SCITT)进行了比较。我们讨论了已知的限制,包括抑制攻击、服务合谋和采用激励问题。

英文摘要

Current AI agent observability is structurally compromised: the entity producing the activity log is the same entity whose activity is being logged. A compromised or buggy agent can omit, alter, or fabricate its own traces, and the operator running the agent has no independent way to detect tampering. We propose a class of protocols that resolves this by inverting the trust boundary: the service that receives an agent's call signs a receipt of what it observed using its own key, encrypts the receipt to the agent's owner, and publishes it to a public transparency log. The owner reconstructs a tamper-evident trail without trusting the agent or its operator. We instantiate the class as Sello, a protocol combining four properties absent in any current system: (P1) receiver-side signing, (P2) HPKE encryption to an owner public key bound to the authorization token via JWS, (P3) publication to a witness-cosigned Merkle log, and (P4) owner-side discovery by token reference. We describe the protocol, analyze its security under an adversary that controls the agent and its operator, present microbenchmarks of the cryptographic operations, and situate Sello among adjacent receipt-protocol work (Signet, AgentROA, Agent Passport System, draft-farley-acta, SCITT). We discuss known limitations including the suppression attack, service collusion, and the adoption-incentive problem.

2606.04165 2026-06-04 hep-ex cs.LG hep-ph physics.ins-det

CaloTrilogy: Toward a Breakthrough in One-Step, End-to-End, Physics-Guided Shower Generation for Modern Calorimeters

CaloTrilogy:迈向现代量热器一步式端到端物理引导簇射生成的突破

Cheng Jiang, Sitian Qian, Kevin Pedro, Oz Amram, Huilin Qu, Maggie Voetberg

发表机构 * School of Physics and Astronomy, University of Edinburgh(爱丁堡大学物理与天文学学院) Department of Physics, University of Wisconsin-Madison(威斯康星大学麦迪逊分校物理系) Fermi National Accelerator Laboratory(费米国家加速器实验室) State Key Laboratory of Dark Matter Physics, Tsung-Dao Lee Institute & School of Physics and Astronomy, Shanghai Jiao Tong University(上海交通大学暗物质物理国家重点实验室、李政道研究所及物理与天文学学院) Key Laboratory for Particle Astrophysics and Cosmology (MOE) & Shanghai Key Laboratory for Particle Physics and Cosmology, Shanghai Jiao Tong University(教育部粒子天体物理与宇宙学重点实验室及上海粒子物理与宇宙学重点实验室,上海交通大学)

AI总结 提出一种结合平均速度场积分器、学习生成先验和物理引导损失项的框架,实现一步或少量评估步骤的高质量簇射生成,性能与最先进的流和扩散模型相当。

详情
AI中文摘要

当前和未来对撞机的高精度量热器模拟对计算资源的需求快速增长,促使开发机器学习替代传统蒙特卡洛工具(如Geant4)。流匹配和基于扩散的生成模型因其样本质量而成为高维快速模拟的主流方法,但通常在推理时需要${\cal O}(100)$次函数评估,并常依赖辅助网络约束全局可观测量,损害了简化的端到端生成。我们引入了一个统一框架,改进了速度、簇射质量和物理保真度之间的平衡。该方法结合了:(i)平均速度场积分器,实现一步或少量评估的采样;(ii)从数据而非随机噪声构建的簇射空间学习生成先验;(iii)训练期间对关键可观测量施加归纳偏置的物理引导损失项。这些元素是训练时的正则化器,保持了端到端推理且无额外成本。仅需一步或少量评估步骤,该模型在多个公开的高粒度量热器数据集上达到了与最先进的流和扩散模型竞争的簇射质量。结果表明层间簇射结构与底层物理一致,为未来的快速模拟工作流提供了有力候选。

英文摘要

High-precision calorimeter simulation at current and future colliders imposes rapidly growing computational demands, motivating the development of machine-learning surrogates for traditional Monte Carlo tools such as Geant4. Flow matching and diffusion-based generative models have become leading approaches for high-dimensional fast simulation because of their sample quality, but typically require ${\cal O}(100)$ function evaluations at inference and often rely on auxiliary networks to constrain global observables, compromising streamlined end-to-end generation. We introduce a unified framework that improves the balance between speed, shower quality, and physics fidelity. The method combines: (i) an average velocity field integrator that enables sampling in one or a few evaluations; (ii) a learned generative prior in shower space, constructed from data rather than random noise; and (iii) physics-guided loss terms that impose inductive biases on key observables during training. These elements are training time regularizers, preserving end-to-end inference with no additional cost. With only one or a few evaluation steps, the model achieves shower quality competitive with state-of-the-art flow and diffusion approaches, tested on several public high granularity calorimeter datasets. The results demonstrate inter-layer shower structure consistent with the underlying physics, providing a strong candidate for future fast simulation workflows.

2606.04155 2026-06-04 cs.HC cs.CL cs.CY

SocialCoach: Personalized Social Skill Learning with RL-based Agentic Tutoring and Practice

SocialCoach: 基于强化学习的智能辅导与练习的个性化社交技能学习

Tianfu Wang, Max Xiong, Jianxun Lian, Hongyuan Zhu, Zhengyu Hu, Yuxuan Lei, Linxiao Gong, Xiaofang Li, Peiting Tsai, Nicholas Jing Yuan, Qi Zhang

发表机构 * HKUST (GZ)(香港科技大学(广州)) Duke University(杜克大学) MSRA Beijing(微软研究院北京) Microsoft Beijing(微软北京)

AI总结 提出SocialCoach系统,利用多智能体管道构建知识语料库、强化学习优化自适应练习调度,并结合沉浸式实践与反思辅导,以解决社交技能学习中专家辅导稀缺和知行差距问题。

详情
AI中文摘要

社交技能如谈判和领导力在当今互联世界中对于个人和职业成功至关重要。然而,由于专家辅导的稀缺,可扩展且有效的培训仍然是一个重大挑战。在本文中,我们介绍了SocialCoach,一个全面的LLM驱动的智能辅导系统,用于大规模个性化社交技能发展。首先,SocialCoach利用多智能体管道,从多样化的专家来源自动构建一个基于教学法的理论到实践知识语料库。其次,为了个性化学习旅程,它采用了一个自适应练习调度模块,遵循处方-检索-适应过程。为了在克服冷启动问题的同时最大化长期学习体验,该策略通过强化学习在学习者模拟环境中进行优化。最后,SocialCoach整合了沉浸式目标驱动练习、因果驱动能力评估和基于知识的反思辅导,以帮助解决知行差距。我们在产品EQoach中部署了该系统,并进行了广泛实验。结果表明,SocialCoach在模拟路径质量和评委评估的辅导质量上优于基线方法,而早期用户反馈表明其具有强烈的感知参与度和有用性。这些发现为个性化、游戏化的软技能学习教学平台提供了一种实用架构。

英文摘要

Social skills such as negotiation and leadership are crucial for personal and professional success in today's interconnected world. However, scalable and effective training remains a significant challenge due to the scarcity of expert coaching. In this paper, we introduce SocialCoach, a holistic LLM-powered agentic tutoring system for personalized social skill development at scale. First, SocialCoach automatically constructs a pedagogically-grounded, theory-to-practice knowledge corpus from diverse expert sources, leveraging a multi-agent pipeline. Second, to personalize the learning journey, it employs an adaptive practice scheduling module that follows a prescription-retrieval-adaptation process. To maximize the long-term learning experience while overcoming the cold-start problem, this policy is optimized within a learner simulation environment through reinforcement learning. Finally, SocialCoach integrates immersive, goal-driven practice, causality-driven proficiency assessment and knowledge-grounded, reflective tutoring to help address the knowing-doing gap. We deploy it in our product, EQoach, and conduct extensive experiments. The results show that SocialCoach improves simulated pathway quality and judge-rated tutoring quality over baseline approaches, while early user feedback indicates strong perceived engagement and usefulness. These findings suggest a practical architecture for personalized and gamified pedagogical platforms on soft skill learning.

2606.04154 2026-06-04 q-bio.QM cs.LG

EpiFormer: Learning Antigen-Antibody Interactions for Epitope Prediction via Geometric Deep Learning

EpiFormer: 通过几何深度学习学习抗原-抗体相互作用进行表位预测

Mansoor Ahmed, Huirong Chai, Haoxin Wang, Hemanth Venkateswara, Murray Patterson

发表机构 * Georgia State University(佐治亚州立大学) Georgia Institute of Technology(佐治亚理工学院)

AI总结 提出EpiFormer编码器-解码器框架,通过GNN层间交叉注意力实现抗原-抗体双向信息流,结合稀疏感知目标,在表位预测任务上F1分数提升超40%。

详情
AI中文摘要

抗体通过结合称为表位的特定表面区域来中和外来抗原。计算表位预测对于理解免疫识别和指导抗体工程至关重要。然而,现有方法面临三个基本挑战:抗体感知模型独立编码每条链并在后期才进行组合,无法捕捉定义结合界面的共依赖结构特征;而严重的类别不平衡和已知抗体-抗原复合物的稀缺使得标准训练目标无效。我们提出EpiFormer,一个通用的编码器-解码器框架,联合解决这些挑战。我们的关键设计原则是在GNN编码层内进行交错交叉注意力,使得抗原-抗体信息流贯穿整个表示学习过程,而不仅仅在输出时。这种早期融合原则与主干无关,从简单的GCN到等变模型,在各种GNN架构上都能提供一致的改进。我们进一步表明,当与早期融合架构配对时,稀疏感知目标对于表位预测任务是有效的。EpiFormer在标准基准上的F1分数比之前的最佳方法提高了40%以上,展示了泛化能力和跨数据集迁移性。值得注意的是,EpiFormer发现已知的生物学原理作为端到端训练的涌现行为,其中学习到的交叉注意力门控倾向于抗原到抗体的信息流,与两条链在结合界面的不对称角色一致,并且模型对几何特征而非进化特征的偏好与已建立的发现(表位残基并非进化保守)一致。源代码可在https://github.com/mansoor181/epiformer.git获取。

英文摘要

Antibodies neutralize foreign antigens by binding to specific surface regions called epitopes. Computational epitope prediction is critical for understanding immune recognition and guiding antibody engineering. However, existing methods face three fundamental challenges: antibody-aware models encode each chain independently and combine them only at a late stage, failing to capture co-dependent structural features that define binding interfaces, whereas severe class imbalance and scarcity of known antibody-antigen complexes render standard training objectives ineffective. We propose EpiFormer, a general encoder-decoder framework that addresses these challenges jointly. Our key design principle is interleaved cross-attention within GNN encoding layers, enabling bidirectional antigen-antibody information flow throughout representation learning rather than only at the output. This early-fusion principle is backbone-agnostic, providing consistent gains across GNN architectures from simple GCNs to equivariant models. We further show that sparsity-aware objectives are effective when paired with early-fusion architectures for the epitope prediction task. EpiFormer improves over the previous best method by over 40% in F1 score on standard benchmarks, demonstrating generalizability and cross-dataset transferability. Notably, EpiFormer discovers known biological principles as emergent behaviors of end-to-end training, where the learned cross-attention gates favor antigen-to-antibody information flow, consistent with the asymmetric roles of the two chains at the binding interface, and the model's preference for geometric over evolutionary features aligns with the established finding that epitope residues are not evolutionarily conserved. The source code is available at: https://github.com/mansoor181/epiformer.git

2606.04141 2026-06-04 cs.CR cs.AI

Caught in the Act(ivation): Toward Pre-Output and Multi-Turn Detection of Credential Exfiltration by LLM Agents

当场抓获(激活):面向LLM智能体的凭证泄露预输出和多轮检测

Kargi Chauhan, Pratibha Revankar

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 研究通过激活探针、蜜令令牌和累积信息流追踪三种互补防御方法,在预输出和多轮对话中检测LLM智能体的凭证泄露。

详情
AI中文摘要

LLM智能体通常将敏感凭证与不可信检索内容置于同一上下文窗口中,为间接提示注入诱导凭证泄露提供了直接途径。我们通过三种互补防御研究这种失效模式。首先,我们探究激活探针能否在输出令牌发出前检测凭证访问。其次,我们从格式特定的字符模型构建蜜令令牌,并使用分裂共形预测校准检测。第三,我们将多轮泄露视为累积信息流问题,并跨对话轮次追踪估计的泄露预算。在开放权重模型的受控实验中,激活特征能够高精度区分良性提示和凭证窃取提示,包括在保留编码变换下。在一个小型合成多轮测试集中,累积会计检测到了每轮检测器遗漏的攻击。这些结果是初步的:多轮基准测试为内部小型数据集,激活方法需要白盒访问,信息估计器提供的是实用信号而非正式上界。尽管如此,结果表明凭证泄露防御应结合预输出监控、校准的金丝雀检测和时间泄露会计,而非仅依赖文本级输出过滤器。

英文摘要

LLM agents often place sensitive credentials in the same context window as untrusted retrieved content, creating a direct path for indirect prompt injection to induce credential exfiltration. We study this failure mode through three complementary defenses. First, we ask whether activation probes can detect credential access before output tokens are emitted. Second, we construct honeytokens from format-specific character models and calibrate detection with split conformal prediction. Third, we treat multi-turn exfiltration as a cumulative information-flow problem and track an estimated leakage budget across conversation turns. In controlled experiments on open-weight models, activation features separate benign and credential-seeking prompts with high accuracy, including under held-out encoding transformations. In a small synthetic multi-turn suite, cumulative accounting detects attacks that per-turn detectors miss. These results are preliminary: the multi-turn benchmark is in-house and small, the activation method requires white-box access, and the information estimator provides a practical signal rather than a formal upper bound. Still, the results suggest that credential-exfiltration defenses should combine pre-output monitoring, calibrated canary detection, and temporal leakage accounting rather than relying only on text-level output filters.

2606.04126 2026-06-04 cs.AR cs.AI cs.SE

HighTide: An Agent-Curated Open-Source VLSI Benchmark Suite

HighTide:一个由智能体策划的开源VLSI基准测试套件

Benjamin Goldblatt, Paolo Pedroso, Farhad Modaresi, Ethan Sifferman, Matthew R. Guthaus

发表机构 * University of California, Santa Cruz(加州大学圣克鲁兹分校)

AI总结 提出HighTide,一个由AI辅助策划的开源VLSI基准测试套件,通过12种智能体技能覆盖设计生命周期,并集成Bazel增量编译和远程缓存。

详情
AI中文摘要

我们介绍HighTide,一个不断演进的AI辅助基准测试套件。具体贡献包括:(i) 一个涵盖多种设计语言和技术节点的多样化开源套件,(ii) 基于Bazel的增量RTL到GDS编译,支持远程缓存,(iii) 通过十二种智能体技能进行AI辅助设计策划,覆盖设计生命周期、流程优化、工具参考和元维护,并配有每个设计的决策日志,作为跨套件调优理由的长期记忆,以及(iv) 一个包含RTL编译验证的基础设施,用于稳定发布。该套件公开可用,并旨在与开源硬件生态系统共同成长。

英文摘要

We introduce HighTide, an evolving AI-assisted benchmark suite. Specifically, the contributions are: (i) a diverse open-source suite spanning multiple design languages and technology nodes, (ii) Bazel-based incremental RTL-to-GDS compilation with remote caching, (iii) AI-assisted design curation through twelve agent skills covering the design lifecycle, flow optimization, tool reference, and meta-maintenance, backed by per-design decision logs that serve as long-term memory of tuning rationale across the suite, and (iv) an infrastructure with RTL compilation verification for stable releases. The suite is publicly available and designed to grow with the open-source hardware ecosystem.

2606.04123 2026-06-04 math.OC cs.AI cs.RO

Semantic Constraint Synthesis for Adaptive Trajectory Optimization via Large Language Models

基于大语言模型的语义约束综合用于自适应轨迹优化

Eleanor Brosius, Yuji Takubo, Daniele Gammelli, Simone D'Amico, Marco Pavone

发表机构 * Stanford University(斯坦福大学)

AI总结 提出利用大语言模型将自然语言描述的任务需求转化为可执行的轨迹优化代码和数学公式,在航天器交会场景中实现了从语义需求重构凸轨迹优化问题的高成功率。

Comments 7 pages, 4 figures, Presented as a short paper at IEEE CVPR 2026, AI4Space Workshop

详情
AI中文摘要

轨迹优化是实现太空探索中安全可靠自主操作的关键组成部分。随着太空任务在频率、复杂性和范围上的增加,迫切需要快速制定数学上合理的轨迹优化问题,以准确反映任务目标和操作约束。然而,将任务意图转化为易于处理的轨迹优化分析公式需要大量的领域专业知识。本文提出一个框架,利用大语言模型(LLMs)将任务需求和约束的自然语言描述转化为可执行的轨迹优化代码及相应的数学公式。在航天器交会场景中的实验表明,从语义任务需求重构凸轨迹优化问题具有高成功率。最终,这项工作凸显了LLMs在连接高层意图与形式化优化模型方面的潜力,从而实现更灵活高效的航天器轨迹设计。

英文摘要

Trajectory optimization is a critical component for enabling safe and reliable autonomous operations in space exploration. As space missions increase in frequency, complexity, and scope, there is a growing need to rapidly formulate mathematically sound trajectory optimization problems that accurately reflect mission objectives and operational constraints. However, translating mission intent into tractable analytical formulations for trajectory optimization requires substantial domain expertise. This paper presents a framework that leverages large language models (LLMs) to translate natural language descriptions of mission requirements and constraints into executable trajectory optimization code and corresponding mathematical formulations. Experiments in spacecraft rendezvous scenarios demonstrate a high success rate in reconditioning a convex trajectory optimization problem from semantic mission requirements. Ultimately, this work highlights the potential of LLMs to bridge high-level intent and formal optimization models, enabling more flexible and efficient trajectory design of spacecraft.

2606.04121 2026-06-04 cs.LO cs.LG cs.SE

veriFIRE: an Industrial Case Study in Verifying Consistency Properties for a DNN-Based Wildfire Detection System

veriFIRE:基于DNN的野火检测系统一致性属性验证的工业案例研究

Idan Refaeli, Maya Swisa, Itay Buchnik, Alon Zada, Guy Amir, Elad Mandelbaum, Ziv Freund, Guy Katz

发表机构 * The Hebrew University of Jerusalem(海法大学) Elbit Systems - ISTAR & EW - Elisra L.T.D Cornell University(康奈尔大学)

AI总结 本文提出一种端到端方法,通过将应用需求编码为求解器兼容查询,利用现有神经网络验证器验证野火检测系统中的单调性和有界响应等一致性属性,并在真实背景样本上评估,展示了工业系统可获得有意义的领域特定保证。

Comments To appear in The 9th International Symposium on AI Verification (SAIV)

详情
AI中文摘要

我们介绍了veriFIRE项目的当前工作:一个工业界与学术界的合作项目,旨在应用验证来提高一个真实世界安全关键系统的可靠性。具体来说,我们针对一个用于野火检测的机载平台,该平台包含两个深度神经网络。我们提出了一种端到端的方法来验证该系统中的 extit{一致性属性}。我们的方法将基于应用的需求编码为现有神经网络验证器可求解的查询。我们研究了关键操作场景下的感兴趣属性:(i) 检测器置信度随目标强度增加而单调递增;(ii) 在传感器物理上合理的模糊下,检测器响应有界。我们使用最先进的神经网络验证后端实例化这些编码,并在真实背景样本上大规模评估。对于第一个属性,所有验证查询在五分钟内解决。对于第二个属性,验证难度显著增加,突出了更丰富、更高维规格的关键可扩展性挑战。总体而言,结果表明可以为工业系统获得有意义的、领域特定的保证。

英文摘要

We present our ongoing work on the veriFIRE project: a collaboration between industry and academia, aimed at applying verification to increase the reliability of a real-world, safety-critical system. Specifically, we target an airborne platform for wildfire detection, which incorporates two deep neural networks. We present an end-to-end methodology for verifying \textit{consistency properties} in this system. Our approach encodes application-grounded requirements into solver-compatible queries for existing neural network verifiers. We study properties of interest over critical operational scenarios: (i) monotonicity of detector confidence as target intensity increases; and (ii) bounded detector response under physically plausible blur over the sensor. We instantiate these encodings using state-of-the-art neural network verification backends and evaluate them at scale on real background samples. For the first property, all verification queries are solved in under five minutes. For the second property, verification is substantially harder, highlighting key scalability challenges for richer, higher-dimensional specifications. Overall, the results demonstrate that meaningful, domain-specific guarantees can be obtained for industrial systems.

2606.04108 2026-06-04 cs.GR cs.AI cs.CV cs.LG

SymTRELLIS: Symmetry-Enforced Voxel Latents for 3D Generation

SymTRELLIS: 对称性增强的体素潜变量用于3D生成

Guangda Ji, Qimin Chen, Qinchan Li, Mingrui Zhao, Kai Wang, Hao Zhang

发表机构 * Simon Fraser University(西蒙 Fraser大学)

AI总结 提出SymTRELLIS方法,通过在流模型生成过程中对预测速度进行对称化平均,强制任意有限点群对称性,无需重新训练VAE或流模型,显著降低对称性误差。

详情
AI中文摘要

单视图3D生成模型已取得令人印象深刻的视觉质量,但它们并非为满足结构或功能需求而设计,在实践中常常存在不足。对称性就是这样一个需求:违反对称性,即使是微小的违反,也可能使模型在物理上不可用。我们提出SymTRELLIS,一种在TRELLIS.2的基于流的3D生成过程中强制任意有限点群对称性(旋转、反射和多面体对称)的方法,无需重新训练底层的VAE或流模型。我们的关键思想是将空间变换在潜空间中的作用近似为体素潜变量上的学习线性算子,通过一个轻量级的空间变换潜映射器实现,该映射器在通用的非对称3D数据上训练。在生成时,我们通过在每一步ODE中对所有对称等价变换的预测流速度进行平均来强制对称性,这一过程称为速度对称化。对称性规格可以从初始TRELLIS.2生成中自动估计,或由用户提供,从而实现超越输入图像暗示的刻意折叠操作。在一个包含266个严格对称物体的基准测试上(涵盖2到20倍旋转和多面体对称群),与TRELLIS.2、Hunyuan3D-2.1和TripoSG相比,SymTRELLIS显著降低了所有对称性误差指标,同时保持了与基础模型相当的重建精度。

英文摘要

Single-view 3D generative models have achieved impressive visual quality, yet they are not designed to satisfy structural or functional requirements, and in practice, often fall short. Symmetry is one such requirement: violations, even subtle ones, on symmetry can render a model physically unusable. We present SymTRELLIS, a method that enforces arbitrary finite point group symmetries (rotational, reflectional, and polyhedral) during the flow-based 3D generation of TRELLIS.2, without retraining the underlying VAE or flow model. Our key idea is to approximate the latent-space action of spatial transformations as a learned linear operator on voxel latents, implemented as a lightweight spatial-transform latent mapper trained on generic, non-symmetric 3D data. At generation time, we enforce symmetry by averaging predicted flow velocities across all symmetry-equivalent transformations at each ODE step, a process we call velocity symmetrization. The symmetry specification can be estimated automatically from an initial TRELLIS.2 generation or supplied by the user, enabling deliberate fold manipulation beyond what the input image suggests. On a curated benchmark of 266 strictly symmetric objects spanning 2- to 20-fold rotations and polyhedral symmetry groups, SymTRELLIS substantially reduces all symmetry error metrics compared to TRELLIS.2, Hunyuan3D-2.1, and TripoSG, while maintaining reconstruction accuracy comparable to the base model.

2606.04104 2026-06-04 cs.SE cs.AI cs.CR

Proof-Carrying Agent Actions: Model-Agnostic Runtime Governance for Heterogeneous Agent Systems

证明携带型智能体动作:异构智能体系统的模型无关运行时治理

Zexun Wang

发表机构 * Ond Holdings Inc(Ond控股公司)

AI总结 提出一种运行时无关的治理模型PCAA,通过动作证书和五个检查点实现异构智能体系统的统一授权与审计,并在参考实现中验证其可移植性和有效性。

Comments 25 pages, 2 tables, 3 figures. Implementation-informed systems paper with bounded public validation

详情
AI中文摘要

智能体系统通过具有非常不同控制点的运行时执行:本地编码工具、框架SDK、托管智能体平台、API网关和仅观察集成。因此,一个高风险动作(如外部发布数据)可能在一个运行时中表现为shell命令,在另一个运行时中表现为工具调用,在第三个运行时中表现为托管会话转换。这使得难以一致地回答一个基本的治理问题:什么动作被授权,由谁授权,具有什么批准语义,以及执行后有什么证据? 本文提出了证明携带型智能体动作(PCAA),这是一种以动作证书而非供应商原生会话记录为中心的运行时无关治理模型。PCAA围绕五个检查点组织控制:动作前可接受性、动作开放、假设捕获、批准和结果关闭。它将这些检查点绑定到一个可移植的动作信封、运行时和批准收据以及可重放证明。该模型以两种实际方式扩展:证书是外部性感知的,携带边界事实(如目标可见性和账户来源),并且批准由明确的可执行性类别描述,而不是由单一的已审查或未审查位描述。 我们通过异构智能体控制平面中的参考实现和披露受限的评估协议来研究该模型。在一个从24个可执行种子扩展到跨四个运行时家族的96个轨迹的保护基准上,PCAA在消融下暴露不同故障模式的同时保持了路由质量。本文贡献了围绕证书携带动作的运行时治理的系统公式化,以及一个基于实现的说明,说明该公式化如何在运行时变更下保持可移植性,而不会崩溃为供应商特定的控制面。

英文摘要

Agent systems execute through runtimes with very different control points: local coding tools, framework SDKs, managed agent platforms, API gateways, and observer-only integrations. A high-risk action such as publishing data externally may therefore appear as a shell command in one runtime, a tool call in another, and a hosted session transition in a third. This makes it difficult to answer a basic governance question consistently: what action was authorized, under whose authority, with what approval semantics, and with what evidence after execution? This paper presents Proof-Carrying Agent Actions (PCAA), a runtime-neutral governance model centered on an action certificate rather than on a vendor-native session record. PCAA organizes control around five checkpoints: pre-action admissibility, action open, assumption capture, approval, and outcome closure. It binds these checkpoints to a portable action envelope, runtime and approval receipts, and replay-ready proof. The model is extended in two practical ways: the certificate is externality-aware, carrying boundary facts such as destination visibility and account provenance, and approval is described by explicit enforceability classes rather than by a single reviewed or unreviewed bit. We study the model through a reference implementation in a heterogeneous agent control plane and a disclosure-bounded evaluation protocol. On a protected benchmark expanded from 24 executable seeds to 96 traces across four runtime families, PCAA preserves route quality while exposing distinct failure modes under ablation. The paper contributes a systems formulation of runtime governance around certificate-bearing actions and an implementation-grounded account of how that formulation can remain portable under runtime churn without collapsing into vendor-specific control surfaces.

2606.04071 2026-06-04 cs.CR cs.CL cs.LG

Covert Influence Between Language Models

语言模型之间的隐蔽影响

Avidan Shah, Jay Chooi, Jinghua Ou, Shi Feng

发表机构 * MATS New York University(纽约大学) Harvard University(哈佛大学) George Washington University(乔治华盛顿大学)

AI总结 本文研究语言模型间通过微调、蒸馏和上下文学习三种接口实现隐蔽影响的风险,并提出使用逐点归因分数选择载体以放大训练时影响,发现自然语言载体相比数字载体更难被人类检测且跨模型迁移性更差。

详情
AI中文摘要

随着语言模型越来越多地消费彼此的输出,隐蔽影响——即发送者的载荷(其被条件化传播的行为倾向)通过人类无法检测的载体转移到接收者的现象——成为一种日益增长的风险。我们通过三种接口(监督微调、在线策略蒸馏和上下文学习)刻画了这一风险,并发现它们在实现不留下人类可见痕迹的影响规模上有所不同。利用推理时逐样本归因分数,我们研究了所有三种接口下的隐蔽影响,并具备选择能够放大训练时影响的载体的能力,解锁了先前工作无法实现的载荷转移。我们进一步提供证据表明,使用自然语言载体的隐蔽影响与先前使用数字载体的研究是不同的现象,因为前者更难以被人类检测且跨模型家族的迁移性更差。这些结果共同表明,隐蔽影响的风险面比先前认识到的更广,我们研究了逐点归因评分方法作为调查和缓解该风险的工具。

英文摘要

As language models increasingly consume one another's outputs, covert influence -- a phenomenon where a sender's payload (the behavioral disposition it is conditioned to propagate) transfers to a receiver through carriers undetectable by humans -- becomes a growing risk. We characterize this risk across three interfaces: supervised fine-tuning, on-policy distillation, and in-context learning, and find that they vary in the scale of influence achievable without leaving behind human-visible traces. Using inference-time per-sample attribution scores, we study covert influence across all three interfaces with the ability to select carriers that amplify training-time influence, unlocking payload transfers that prior work could not achieve. We further provide evidence that covert influence with natural-language carriers is a distinct phenomenon from prior studies using number carriers, as the latter is more resistant to human detection and less portable across model families. Together, these results suggest that the risk surface for covert influence is broader than previously recognized, and we study pointwise attribution scoring methods as a tool to investigate and mitigate it.

2606.04069 2026-06-04 cs.CR cs.LG

Bayesian Membership Privacy for Graph Neural Networks

图神经网络的贝叶斯成员隐私

Sinan Yıldırım, Megha Khosla

发表机构 * University of California, Berkeley(加州大学伯克利分校)

AI总结 针对图神经网络中结构相关性和随机训练图采样导致的成员推断问题,提出贝叶斯成员隐私(BMP)框架,通过贝叶斯假设检验量化节点级成员隐私,并设计采样感知审计机制以评估隐私泄露。

详情
AI中文摘要

现有的图神经网络(GNN)隐私分析很大程度上继承了非图设置中的假设,忽略了结构相关性和随机训练图采样。特别是,节点相关的先验使得仅凭第一类和第二类错误不足以刻画最优的成员推断测试。为了解决这个问题,我们引入了贝叶斯成员隐私(BMP),这是一种采样感知的节点级成员隐私公式,它结合了节点相关的先验,并将图采样概率视为对手知识的一部分。BMP将成员推断视为贝叶斯假设检验,并据此以后验成员概率来量化成员隐私。我们探讨了BMP与文献中现有定义相关的理论性质。我们进一步提出了一种实用的、采样感知的审计机制,用于估计BMP的参数,作为GNN中节点级隐私泄露的度量。我们在基准图数据集上进行了实验,结果表明BMP提供了细粒度的隐私洞察,而这些洞察仅通过全局攻击准确率是无法看到的。

英文摘要

Existing privacy analyses for Graph Neural Networks (GNNs) largely inherit assumptions from non-graph settings, overlooking structural correlations and stochastic training-graph sampling. In particular, node-dependent priors make type-I and type-II errors alone insufficient to characterize the best membership inference test. To address this, we introduce Bayesian Membership Privacy (BMP), a sampling-aware formulation of node-level membership privacy that incorporates node-dependent priors and treats graph sampling probabilities as part of the adversary's knowledge. BMP casts membership inference as a Bayesian hypothesis test and accordingly quantifies membership privacy in terms of posterior membership probability. We explore theoretical properties of BMP in relation to the existing definitions in the literature. We further propose a practical, sampling-aware auditing mechanism to estimate the parameters of BMP as a measure of node-level privacy leakage in GNNs. We conduct experiments on benchmark graph datasets and show that BMP yields fine-grained privacy insights that are not visible through global attack accuracy alone.

2606.04067 2026-06-04 cs.CR cs.AI

Need to Know: Contextual-Integrity-Grounded Query Rewriting for Privacy-Conscious LLM Delegation

须知:基于语境完整性的隐私意识LLM委托查询重写

Xinyue Huang, Xiaochun Cao, Wenyuan Yang

发表机构 * Sun Yat-sen University(中山大学)

AI总结 针对LLM委托中查询隐私泄露问题,提出基于语境完整性的查询重写框架,通过CI引导的强化学习训练重写器,在保留任务关键信息的同时抑制非必要敏感披露,实现最佳隐私-效用权衡。

详情
AI中文摘要

随着LLM日益融入日常工作流程,发送到云端LLM的用户查询通常混合了任务必需内容和任务非必需的敏感披露,但基于类型的PII编辑是上下文无关的,可能引发两个问题:过度披露未类型化的敏感上下文和过度移除承载答案的片段。我们在语境完整性下重新定义隐私保护查询重写:只有当某个片段对任务必要时才应转发。我们引入了DelegateCI-Bench,这是首个基于任务的语境完整性基准,用于隐私意识委托,包含3,167个样本,结合了涵盖11个任务和20种任务类型的高质量合成数据、基于WildChat的真实用户查询以及一个包含密集敏感信息的医学挑战集。基于此基准,我们提出了一个CI引导的强化学习框架,将必要和非必要的敏感片段转化为可验证的优化信号,并训练一个查询重写器,以保留任务关键信息同时抑制不必要的敏感披露。实验表明,我们学习的重写器实现了最佳的隐私-效用权衡,与设备端基线相比,平均效用提升高达+10.1。

英文摘要

As LLMs become increasingly woven into everyday workflows, user queries sent to cloud hosted LLMs routinely mix task-essential content with task non-essential sensitive disclosures, yet type based PII redaction is context agnostic and may raise two issues: over disclosing untyped sensitive context and over removing answer bearing spans. We recast privacy preserving query rewriting under Contextual Integrity: a span should be forwarded only if it is necessary for the task. We introduce DelegateCI-Bench, the first task based Contextual Integrity benchmark for privacy-conscious delegation, comprising 3,167 samples that combine high quality synthetic data spanning 11 tasks and 20 task types, WildChat based real user queries, and a medical challenge set with dense sensitive information. Building on this benchmark, we propose a CI-guided reinforcement learning framework that converts essential and non-essential sensitive spans into verifiable optimization signals, and train a query rewriter to preserve task critical information while suppressing unnecessary sensitive disclosure. Experiments show that our learned rewriter achieves the best privacy-utility tradeoff, achieving up to +10.1 average utility over on-device baselines.

2606.04066 2026-06-04 q-bio.NC cs.LG

SC-TauPath: A Structural Connectivity Attribution Framework for Mapping Tau Propagation Pathways in Alzheimer's Disease

SC-TauPath:一种用于映射阿尔茨海默病中tau蛋白传播路径的结构连接归因框架

Jing Zhang, Norman Scheel, Minheng Chen, Tong Chen, Yanjun Lyu, David C. Zhu, Rong Zhang, Dajiang Zhu

发表机构 * University of Texas at Arlington(德克萨斯理工大学) Michigan State University(密歇根州立大学) University of Texas Southwestern Medical Center(德克萨斯西南医学中心)

AI总结 提出SC-TauPath框架,结合网络扩散模型增强的多层感知机和梯度×输入归因方法,从体内神经影像数据中映射tau蛋白传播路径,并验证了与Braak分期解剖学的一致性。

详情
AI中文摘要

理解结构连接如何与阿尔茨海默病(AD)中的tau蛋白传播相关联仍然是一个核心未解问题,然而现有的计算模型要么严重依赖生物物理假设,要么缺乏神经生物学可解释的路径图。我们提出了SC-TauPath,一个结构连接(SC)归因框架,用于从体内神经影像数据中映射tau蛋白传播路径。SC-TauPath将网络扩散模型(NDM)增强的多层感知机与梯度×输入归因相结合,以评分每个SC边对tau预测的贡献,然后将这些归因分数转化为多尺度路径图(骨干边、高流量路径和枢纽ROI),这验证了已建立的Braak分期解剖学。应用于234名ADNI参与者,这些参与者具有配对的DTI SC和18F-Flortaucipir PET数据,SC-TauPath实现了强交叉验证的tau预测,并产生了与已建立的Braak分期解剖学一致的基于归因的路径图,表明SC编码了AD中区域tau分布的特定空间信息。

英文摘要

Understanding how structural connections are associated with tau propagation in Alzheimer's disease (AD) remains a central open question, yet existing computational models either rely heavily on biophysical assumptions or lack neurobiologically interpretable pathway maps. We present SC-TauPath, a structural connectivity (SC) attribution framework that maps tau propagation pathways from in vivo neuroimaging data. SC-TauPath combines a Network Diffusion Model (NDM)-augmented multilayer perceptron with gradient $\times$ input attribution to score each SC edge's contribution to tau prediction, then translates these attribution scores into multi-scale pathway maps (backbone edges, high-traffic routes, and hub ROIs), which validates established Braak staging anatomy. Applied to 234 ADNI participants with paired DTI SC and 18F-Flortaucipir PET, SC-TauPath achieves strong cross-validated tau prediction and yields attribution-based pathway maps consistent with established Braak staging anatomy, demonstrating that SC encode spatially specific information about regional tau distribution in AD.

2606.04065 2026-06-04 stat.ML cs.LG math.ST stat.TH

Finite-Iteration Local Dynamics and Warm Starts for Alternating Power Iteration in Spiked Tensor PCA

尖峰张量PCA中交替幂迭代的有限迭代局部动力学与热启动

Yanjin Xiang, Zhihua Zhang

发表机构 * Peking University(北京大学)

AI总结 研究固定阶非对称秩一张量模型中同步交替幂迭代的有限迭代局部理论,提出与初始化无关的误差分解和热启动机制。

Comments 67 pages, 0 figures. The paper studies local dynamics and warm-start analysis for alternating power iteration in spiked tensor PCA

详情
AI中文摘要

我们研究了固定阶非对称秩一张量模型中的同步交替幂迭代。主要贡献是一个与任何特定初始化无关的有限迭代局部理论。一旦迭代进入种植秩一方向的足够小邻域,其误差分解为几何衰减的瞬态部分和由种植点处固定正交噪声收缩引起的内在噪声基底。确定性有限样本条件被明确陈述,但在粗粒度的固定阶多线性噪声事件下,它们简化为固定或缓慢扩展局部半径的保守高信号区域。然后,我们将热启动机制与任何特定谱构造分离。一个通用的单扫描原理表明,如果符号兼容的初始器具有相关性γ_N,第一扫描噪声水平a_N,且a_N/(γ_N^{d-1}ω_{N,d})→0,则可以选择一个扩展半径r_N=o(ω_{N,d}),使得第一扫描进入局部盆地。进入后,局部仿射收缩导致收敛到该盆地中唯一的信息性局部不动点。对于中心Gram初始化,我们通过信号保持的仅噪声留一比较和平均留一片收缩估计(称为压回估计),在独立同分布有限四阶矩噪声下验证了所需的相关性和同一样本第一扫描噪声界。留一比较保持尖峰固定并对删除坐标取平均,因此种植坐标通过ℓ₂加权和而非最坏情况非相干界进入。

英文摘要

We study simultaneous alternating power iteration for fixed-order asymmetric rank-one spiked tensor models. Our main contribution is a finite-iteration local theory that is independent of any particular initialization. Once the iterates enter a sufficiently small neighborhood of the planted rank-one direction, their error decomposes into a geometrically decaying transient and an intrinsic noise floor caused by fixed orthogonal noise contractions at the planted point. The deterministic finite-sample conditions are stated explicitly, but under a coarse fixed-order multilinear noise event they reduce to a conservative high-signal regime for fixed or slowly expanding local radii. We then separate the warm-start mechanism from any specific spectral construction. A generic one-sweep principle shows that, if a sign-compatible initializer has correlation \(γ_N\), first-sweep noise level \(a_N\), and \(a_N/(γ_N^{d-1}ω_{N,d})\to0\), then one can choose an expanding radius \(r_N=o(ω_{N,d})\) for which the first sweep enters the local basin. After entry, the local affine contraction yields convergence to the unique informative local fixed point in that basin. For centered-Gram initialization, we verify the required correlation and same-sample first-sweep noise bound under i.i.d. finite-fourth-moment noise by a signal-preserving noise-only leave-one comparison and an averaged leave-one slice-contraction estimate, which we call a pressed-back estimate. The leave-one comparison keeps the spike fixed and averages over the deleted coordinate, so planted coordinates enter through \(\ell_2\)-weighted sums rather than worst-case incoherence bounds.

2606.04057 2026-06-04 cs.SE cs.AI cs.LG

The Invisible Lottery: How Subtle Cues Steer Algorithm Choice in LLM Code Generation

隐形彩票:微妙线索如何引导LLM代码生成中的算法选择

Akanksha Narula, Mofasshara Binte Rafique, Laurent Bindschaedler

发表机构 * University of Washington(华盛顿大学) Google Research(谷歌研究院)

AI总结 通过大量控制实验,发现提示中的偶然线索(如上下文词或元数据)会系统性地改变LLM在代码生成中选择的算法族分布,影响性能、安全性和可维护性,而直接命名算法是最可靠的缓解措施。

详情
AI中文摘要

大型语言模型(LLM)现在生成大量生产代码,通常用于具有多个有效算法解决方案的任务。偶然的提示线索,即任务规范之外的上下文词或元数据,可以引导模型选择哪个算法,即使所有输出都通过相同的测试。提示敏感性作为提高输出质量的工具已被广泛研究。这里,输出策略意味着在固定正确性下的算法选择。我们将算法引导定义为线索引起的算法族分布变化,并在11个任务、19种线索类型(18个通道加上一个记忆化语义与表面消融,在改变排版和标点的同时保留含义)以及15个模型配置上进行了46,535次控制实验。我们发现算法族分布存在大规模、系统性的变化(高达100个百分点),与线索语义基本一致,包括在速率限制等应用任务中。直接命名算法是我们测试的最可靠的缓解措施。因此,偶然的上下文在性能、安全性和可维护性上创造了一个“隐形彩票”。

英文摘要

Large language models (LLMs) now generate substantial production code, often for tasks with multiple valid algorithmic solutions. Incidental prompt cues, meaning contextual words or metadata outside the task specification, can steer which algorithm the model selects, even when all outputs pass the same tests. Prompt sensitivity is well studied as a tool to improve output quality. Here, output policy means algorithm choice under fixed correctness. We define algorithm steering as cue-induced shifts in algorithm-family distributions and run 46,535 controlled experiments across 11 tasks, 19 cue types (18 channels plus a memoization semantic-vs-surface ablation that preserves meaning while changing typography and punctuation), and 15 model configurations. We find large, systematic shifts in algorithm-family distributions (up to 100 pp), largely consistent with cue semantics, including in applied tasks such as rate limiting. Direct algorithm naming is the most reliable mitigation we tested. Accidental context therefore creates an "invisible lottery" over performance, security, and maintainability.

2606.04035 2026-06-04 cs.SE cs.AI cs.LG

Unpredictable Safety: Domain-Dependent Compliance and the Transparency Gap in Open-Weight LLMs

不可预测的安全性:开放权重大语言模型中领域依赖的合规性与透明度差距

Zacharie Bugaud

发表机构 * Astera Institute(Astera研究院)

AI总结 通过7个伦理领域的标准化实验,发现开放权重大语言模型的合规率在14.7%到85.7%之间波动,且同一模型在不同领域表现高度不一致,揭示了安全机制缺乏透明度和一致性。

详情
AI中文摘要

我们对开放权重大语言模型中领域依赖的安全行为进行了系统研究:在7个伦理领域进行了7项标准化实验,测试了5个模型(12B--70B),共4200次交互,并采用双法官验证。使用双条件方法,每个场景在分析框架(识别危害)和操作框架(帮助实施危害)下进行测试,我们发现合规率从14.7%(人口贩卖)到85.7%(监控设计)不等,跨度达71个百分点,且非重叠的聚类自助法95%置信区间。可信部署需要可预测的安全行为,但我们发现合规性高度依赖于上下文:同一模型(Mistral Nemo 12B)在100%的请求中提供监控设计,但仅在26.7%的请求中协助贩卖。这种不可预测性对部署者来说是不透明的:技术框架绕过,即有害请求被重新定义为工程问题,从而覆盖安全训练,而没有任何外部信号表明拒绝阈值已改变。领域内异质性高达84.4个百分点,意味着即使在领域层面也无法预测安全行为。在通过GitHub Copilot CLI部署产品界面访问的五个前沿封闭模型(GPT-4.1/5.2, Claude Haiku/Sonnet/Opus 4.x;n=4,163个响应)上进行的复制实验,再现了相同的领域分层,绝对水平有所减弱但形状相同,其中两个低规范化领域(科学欺诈、监控)再次最为宽松。这些结果表明,当前的安全机制缺乏可信AI部署所需的透明度和一致性。

英文摘要

We present a systematic study of domain-dependent safety behavior in open-weight LLMs: 7 standardized experiments across 7 ethical domains, testing 5 models (12B--70B) in 4,200 interactions with dual-judge validation. Using a dual-condition methodology, each scenario tested in both an analytical framing (identify the harm) and an operational framing (help commit the harm), we find compliance rates vary from 14.7% (human trafficking) to 85.7% (surveillance design), a 71-percentage-point span with non-overlapping cluster-bootstrapped 95% CIs. Trustworthy deployment requires predictable safety behavior, yet we find compliance is highly context-dependent: the same model (Mistral Nemo 12B) provides surveillance designs in 100% of requests but assists with trafficking in only 26.7%. This unpredictability is opaque to deployers: the technical framing bypass, where harmful requests reframed as engineering problems override safety training without any external signal that refusal thresholds have shifted. Within-domain heterogeneity reaches 84.4pp, meaning safety behavior cannot be predicted even at the domain level. A replication on five frontier closed models (GPT-4.1/5.2, Claude Haiku/Sonnet/Opus 4.x; n=4,163 responses) accessed via the GitHub Copilot CLI deployed-product surface reproduces the same domain stratification, attenuated in absolute level but identical in shape, with the two low-codification domains (science fraud, surveillance) again the most permissive. These results show that current safety mechanisms lack the transparency and consistency required for trustworthy AI deployment.

2606.04027 2026-06-04 cs.CR cs.AI

MaskForge: Structure-Aware Adaptive Attacks for Jailbreaking Diffusion Large Language Models

MaskForge:用于越狱扩散大语言模型的结构感知自适应攻击

Yingzi Ma, Zhengyue Zhao, Xiaogeng Liu, Minhui Xue, Yue Zhao, Chaowei Xiao

发表机构 * University of Wisconsin-Madison(威斯康星大学麦迪逊分校) Johns Hopkins University(约翰霍普金斯大学) University of Southern California(南加州大学) Responsible AI Research (RAIR) Centre, The University of Adelaide(阿德莱德大学负责任人工智能研究中心)

AI总结 提出MaskForge,一种全黑盒自适应攻击方法,通过优化结构模式库实现扩散大语言模型的红队测试,平均攻击成功率达79.3%。

Comments 28 pages, 7 figures, 11 tables. Preprint

详情
AI中文摘要

扩散大语言模型(dLLMs)通过在双向上下文下迭代去噪部分掩码序列来生成文本,展现出与自回归LLMs不同的安全表面。由于掩码令牌是原生输入,且令牌由置信度而非位置决定,因此可以通过填充和在受监控前缀之外诱导有害内容。现有的越狱方法要么忽略了这种原生填充能力,要么依赖于低多样性的掩码模板,这些模板统一应用于所有目标,缺乏结构适应性或累积攻击经验。我们提出MaskForge,一种全黑盒自适应攻击,将dLLM红队测试转化为对不断增长的结构模式库的优化搜索。MaskForge将成功的尝试抽象为可重用的模式,使用UCB bandit选择与目标兼容的模式,并在当前库失败时调用评分器引导的备用方案。成功的尝试被蒸馏回模式库,使得经验能够跨目标累积。在五个公开dLLM和三个基准测试中,MaskForge实现了79.3%的平均攻击成功率,相比最强的竞争dLLM基线相对提升17.6%。成熟的模式库进一步迁移到AdvBench而无需任何更新,实现了88.2%的攻击成功率和相比最强竞争基线67%的相对提升。

英文摘要

Diffusion large language models (dLLMs) generate text by iteratively denoising partially masked sequences under bidirectional context, exposing a safety surface distinct from autoregressive LLMs. Because mask tokens are native inputs and tokens are committed by confidence rather than position, harmful content can be induced through infilling and outside the monitored prefix. Existing jailbreaks either miss this native infill capability or rely on low-diversity mask-bearing templates applied uniformly across goals, with little structural adaptation or accumulated attack experience. We propose MaskForge, a fully black-box adaptive attack that casts dLLM red-teaming as optimized search over a growing library of structural patterns. MaskForge abstracts successful attempts into reusable schemas, selects goal-compatible patterns with a UCB bandit, and invokes a scorer-guided fallback when the current library fails. Successful attempts are distilled back into the pattern library, enabling experience to accumulate across goals. Across five public dLLMs and three benchmarks, MaskForge achieves an average attack success rate of 79.3%, a 17.6% relative improvement over the strongest competing dLLM baseline. The matured pattern library further transfers to AdvBench without any updates, achieving a 88.2% attack success rate and a 67% relative improvement over the strongest competing baseline.

2606.04025 2026-06-04 cs.SE cs.AI

The Biomimetic Architecture of Software 4.0

软件4.0的仿生架构

Philip Sheldrake, Dirk Scheffler

发表机构 * Unnamed Labs Amsterdam(阿姆斯特丹无名实验室) Unnamed Labs Karlsruhe(卡尔斯鲁厄无名实验室)

AI总结 本文提出软件4.0范式,通过自创生异质架构融合人类智能、神经AI与反射符号基底,解决概率-符号阻抗不匹配问题,并介绍实现该架构的编程语言Recognitive。

Comments 14 pages

详情
AI中文摘要

主流编程范式继承了一种为单个人脑指导本地机器的过去时代优化的执行模型,使得当代系统背负着历史路径依赖。当被迫承载多维连接主义智能时,这种脆弱的组装模型在深刻的概率-符号阻抗不匹配的重压下断裂。虽然当代软件3.x框架试图通过将大型语言模型(LLM)封装在日益复杂的外部框架中来修补这种不匹配,但这种螺旋上升的架构复杂性只会增加静态代码组装的开销。为了从根源而非表象解决问题,本文引入了软件4.0——一个由人类智能、神经AI和原生反射符号基底构成的自创生异质架构。在此范式下,软件从待解析的惰性语料转变为自我调节的代谢网络,原生地验证、修改和演化自身的结构完整性。我们提出了Recognitive,即实现该架构的编程语言和平台。通过将结构验证的负担卸载到确定性基底上,它解锁了一种优越的推理时扩展机制——其中连接主义计算完全转化为深度语义探索和假设遍历,而非以毁灭性的计算和财务成本来概率性地模拟结构约束。超越传统的“软件工厂”思维,我们概述了将连接主义意图落地并全面进入智能时代所需的理论基础。这是一篇基础性愿景论文;类型系统和操作语义的经验评估及形式化规范是未来工作的主题。

英文摘要

Dominant programming paradigms inherit an execution model optimised for a bygone era of a single human mind instructing a local machine, leaving contemporary systems burdened with historical path dependencies. When forced to host multi-dimensional, connectionist intelligence, this brittle assembly model fractures under the weight of a profound probabilistic-symbolic impedance mismatch. While contemporary Software 3.x frameworks attempt to patch the mismatch by encasing large language models (LLMs) in increasingly complicated external harnesses, this spiralling architectural complexity only compounds the carrying cost of static code assembly. To address the cause rather than the effects, this paper introduces Software 4.0 -- an autopoietic heterarchy of human intelligence, neural AI, and natively reflective symbolic substrate. Under this paradigm, software is transformed from an inert corpus to be parsed into a self-regulating metabolic network that natively verifies, modifies, and evolves its own structural integrity. We present Recognitive, the programming language and platform that materialises this architecture. By offloading the burden of structural verification to a deterministic substrate, it unlocks a superior inference-time scaling regime -- one where connectionist compute translates entirely into deep semantic exploration and hypothesis traversal rather than the ruinous computational and financial cost of simulating structural constraints probabilistically. Moving beyond the legacy 'Software Factory' mindset, we outline the theoretical foundations required to ground connectionist intent and arrive fully in the intelligence age. This is a foundational vision paper; empirical evaluation and formal specification of the type system and operational semantics are the subject of future work.

2606.04023 2026-06-04 cs.SE cs.AI

CodegenBench: Can LLMs Write Efficient Code Across Architectures?

CodegenBench: 大型语言模型能否跨架构编写高效代码?

Jie Li, Wenzhao Wu, Junqi Hu, Qinrui Zheng, Bowen Wu, Juepeng Zheng, Yutong Lu, Haohuan Fu

发表机构 * Sun Yat-sen University(中山大学) National Supercomputing Center in Wuxi(无锡国家超级计算机中心) National Supercomputing Center in Shenzhen(深圳国家超级计算机中心) University of the Chinese Academy of Sciences(中国科学院大学) Tsinghua Shenzhen International Graduate School(清华大学深圳国际研究生院)

AI总结 提出CodegenBench基准测试,评估LLMs在x86_64、Sunway和Kunpeng三种架构上生成高效并行代码的能力,发现其在通用架构上表现良好,但在领域特定架构上性能显著下降。

Comments 29 pages, 22 figures

详情
AI中文摘要

尽管大型语言模型(LLMs)在通用编程和GPU加速环境(如PyTorch、CUDA)的代码生成任务中得到了广泛评估,但它们在面向CPU的高性能计算(HPC)跨不同架构上的能力仍未充分探索。为填补这一空白,我们引入了CodegenBench,这是一个全面的基准测试套件,旨在评估在三种不同硬件平台(x86_64、Sunway和Kunpeng)上生成高效并行代码的能力。我们的基准测试包含106个标准基本线性代数子程序(BLAS)例程,建立了一个基础基线,以及20个针对每个独特超级计算架构(LeetSunway和LeetKunpeng)改编的专门计算内核。我们的广泛评估揭示,虽然最先进的LLMs能够为像x86_64这样的普遍架构生成优化代码,但在公共文档和训练数据有限的领域特定架构上,它们表现出显著的性能下降,突显了跨平台泛化的关键局限性。此外,我们对影响代码质量的因素(如实现长度和任务复杂度)的分析表明,当前LLMs在需要简洁代码片段的中等难度问题上最为有效。我们开源了我们的数据集和自动化评估基础设施,以促进未来在LLM驱动的高性能代码生成方面的研究。资源可在https://anonymous.4open.science/r/CodegenBench-EDE1/和https://anonymous.4open.science/r/CodegenBenchDataset-2551/获取。

英文摘要

While large language models (LLMs) have been extensively evaluated on code generation tasks for general-purpose programming and GPU-accelerated environments (e.g., PyTorch, CUDA), their capabilities in CPU-oriented high-performance computing (HPC) across diverse architectures remain underexplored. To bridge this gap, we introduce CodegenBench, a comprehensive benchmark suite designed to evaluate the generation of efficient parallel code across three distinct hardware platforms: x86_64, Sunway, and Kunpeng. Our benchmark comprises 106 standard Basic Linear Algebra Subprograms (BLAS) routines establishing a fundamental baseline, alongside 20 specialized computational kernels adapted for each of the unique supercomputing architectures (LeetSunway and LeetKunpeng). Our extensive evaluation reveals that while state-of-the-art LLMs can generate optimized code for ubiquitous architectures like x86_64, they exhibit significant performance degradation on domain-specific architectures with limited public documentation and training data, highlighting critical limitations in cross-platform generalization. Furthermore, our analysis of factors influencing code quality such as implementation length and task complexity indicates that current LLMs are most effective for moderately difficult problems requiring concise code snippets. We open-source our dataset and automated evaluation infrastructure to facilitate future research in LLM-driven high-performance code generation. The resources are available at https://anonymous.4open.science/r/CodegenBench-EDE1/ and https://anonymous.4open.science/r/CodegenBenchDataset-2551.

2606.04021 2026-06-04 q-bio.QM cs.LG

Structure-Aware Prediction of PROTAC-Mediated Protein Degradability via Graph Neural Networks

通过图神经网络进行PROTAC介导的蛋白质降解性的结构感知预测

Bryan Cheng, Austin Jin

发表机构 * Independent Researcher(独立研究者)

AI总结 提出DegradoMap,一种仅利用蛋白质结构和E3连接酶身份预测PROTAC降解性的图神经网络,在目标未见和E3未见评估中优于基线,并推荐最优E3连接酶。

Comments 10 pages, 5 figures, ACM-BCB 2026 Main Conference Full Paper

详情
AI中文摘要

蛋白水解靶向嵌合体(PROTACs)可以选择性降解致病蛋白,然而预测哪些靶点适合降解仍然是一个关键瓶颈:现有计算方法需要完整的PROTAC分子结构,而该信息在合成前不可用。我们提出DegradoMap,一种图神经网络,仅从蛋白质结构和E3连接酶身份预测PROTAC介导的降解性——这是靶点选择阶段可用的最小信息。该模型通过赖氨酸加权图池化(每蛋白质归一化)编码生物物理先验,通过交叉注意力建模蛋白质-E3兼容性,并整合来自癌症依赖性图谱的细胞环境。在PROTAC-8K基准(3,101个样本,155个靶点,10种E3连接酶)上,DegradoMap在靶点未见评估中达到0.646±0.124 AUROC(最佳种子:0.7449),在CRBN→VHL E3未见迁移中达到0.811 AUROC,优于GNN和机器学习基线。该模型还以74%的Hit@3准确率推荐最优E3连接酶。两个发现具有更广泛的意义:对于此标量预测任务,E(3)-等变架构的性能低于更简单的不变设计;ESM-2嵌入仅在仔细正则化下提升峰值性能——简单集成失败。DegradoMap为降解性评估提供合成前的计算指导;其良好校准的置信度分数(ECE=0.029,靶点未见)使从业者能够优先选择高置信度预测进行实验验证。然而,高种子方差(std=0.124)和有限的E3覆盖范围需要集成以实现可靠部署。

英文摘要

Proteolysis-targeting chimeras (PROTACs) can selectively degrade disease-causing proteins, yet predicting which targets are amenable to degradation remains a critical bottleneck: existing computational methods require the complete PROTAC molecular structure, information unavailable before synthesis. We present DegradoMap, a graph neural network that predicts PROTAC-mediated degradability from protein structure and E3 ligase identity alone -- the minimal information available at the target selection stage. The model encodes biophysical priors through lysine-weighted graph pooling with per-protein normalization, models protein-E3 compatibility via cross-attention, and integrates cellular context from the Cancer Dependency Map. On the PROTAC-8K benchmark (3,101 samples, 155 targets, 10 E3 ligases), DegradoMap achieves 0.646+-0.124 AUROC on target-unseen evaluation (best seed: 0.7449) and 0.811 AUROC on CRBN->VHL E3-unseen transfer, outperforming GNN and machine learning baselines. The model additionally recommends optimal E3 ligases with 74% Hit@3 accuracy. Two findings carry broader implications: E(3)-equivariant architectures underperform the simpler invariant design for this scalar prediction task, and ESM-2 embeddings improve peak performance only with careful regularization -- naive integration fails. DegradoMap provides pre-synthesis computational guidance for degradability assessment; its well-calibrated confidence scores (ECE = 0.029, target-unseen) enable practitioners to prioritize high-confidence predictions for experimental follow-up. However, the high seed variance (std = 0.124) and limited E3 coverage require ensembling for reliable deployment.