arXivDaily arXiv每日学术速递 周一至周五更新
重置
cs.CR密码安全83
2606.12395 2026-06-11 cs.CR 新提交

MARCIM-WG: A cyber wargame proposal based on math modeling applied in a naval scenario

MARCIM-WG:基于数学建模的海军场景网络兵棋推演方案

Diego Cabuya-Padilla, Daniel Díaz-López, Carlos Castaneda-Marroquín

AI总结 提出MARCIM-WG学习型网络兵棋,基于北约方法论设计,结合实体棋盘与计算仿真,通过高低级设计规范在虚构海战场景中验证,干预组态势感知能力提升34个百分点。

详情
Comments
8 pages, 5 figures, 2 tables, paper in proceedings of the XI National Cybersecurity Research Conference (JNIC) in Barcelona, Spain, May, 2026
AI中文摘要

随着海上行动日益依赖互联的数字生态系统,网络事件可能通过海上网络传播并降低关键服务。因此,加强战略网络态势感知(CSA)需要培训机制,使决策者能够应对不断变化的攻击动态、有限的资源以及需要与事件响应程序协调行动的需求。本文介绍了MARCIM-WG,一种面向学习的海上网络防御兵棋,按照北约兵棋方法论设计,并作为混合桌面体验实现,结合物理棋盘(令牌、指示器和特殊卡片)与由计算仿真模型支持的分析辅助裁决。该方案通过高层设计(HLD)和低层设计(LLD)规范进行说明,并在虚构的海上网络危机场景中实例化,以实现结构化决策周期、摩擦和可衡量的后果。验证结合了(i)在三种配置(悲观、中性/最可能、乐观)下基于操作场景的评估,以验证决策敏感性和结果一致性,以及(ii)使用与等效对照组比较设计的CSA能力和学习成果评估。结果显示干预组提高了34.0个百分点,其中理解相关能力提升最大。

英文摘要

As maritime operations increasingly depend on interconnected digital ecosystems, cyber incidents can propagate across maritime networks and degrade critical services. Strengthening strategic Cyber Situational Awareness (CSA) therefore requires training mechanisms that expose decision-makers to evolving attack dynamics, constrained resources, and the need to align actions with incident-response procedures. This paper introduces MARCIM-WG, a learning-oriented maritime cyberdefense wargame designed following the NATO wargaming methodology and implemented as a hybrid tabletop experience combining a physical board (tokens, indicators, and special cards) with analytically-assisted adjudication supported by a computational simulation model. The proposal is specified through High-Level Design (HLD) and Low-Level Design (LLD) specifications and instantiated in a fictional maritime cyber crisis scenario to enable structured decision cycles, friction, and measurable consequences. Validation combines (i) an operational scenario-based assessment under three configurations (pessimistic, neutral/most likely, optimistic) to verify decision sensitivity and outcome coherence, and (ii) a CSA competency and learning-outcome evaluation using a comparative design against an equivalent control group. Results show a +34.0 percentage-point improvement in the intervention group, with the largest gains in comprehension-related competencies.

2606.12354 2026-06-11 cs.CR 新提交

ECYSAP EYE: From Cyber Situational Awareness to Mission-Centric Decision Support for Enhanced Cyberspace Operations

ECYSAP EYE:从网络态势感知到以任务为中心的决策支持,增强网络空间行动

Pantaleone Nespoli, Daniel Díaz-López, Sergio Lopez Bernal, Francisco Oliva Bermejo, Pedro González Megías, Jorge Maestre Vidal, Víctor Sobrino García, Gregorio Martínez Pérez

AI总结 提出ECYSAP EYE系统之系统架构,通过七类任务相关制品(如RCyP、CySRs等)实现从感知到决策再到执行的过渡,支持增量部署与验证,提升网络空间任务规划与执行中的态势感知与决策支持能力。

详情
Comments
4 pages, 1 figure, 1 table, paper in proceedings of the XI National Cybersecurity Research Conference (JNIC) in Barcelona, Spain, May, 2026
AI中文摘要

运营组织越来越需要超越孤立技术警报的网络态势感知(CySA)能力,提供可嵌入异构工具链和网络安全或网络防御流程的任务相关制品。ECYSAP EYE通过一种面向采用的系统之系统(SoS)架构满足这一需求,该架构围绕七组以任务为中心的制品:识别网络空间图(RCyP)、网络态势报告(CySRs)、假设分析报告(WIAR)、选项建议(OPRE)、操作员仪表板/人机界面(DSH)、行动执行(AE)和事后报告(AAR)。ECYSAP EYE架构构建了从感知(全频谱RCyP视图)到面向决策的推理(WIAR/CySRs/OPRE),再到操作执行和学习(DSH/AE/AAR)的过渡,具有支持增量部署和验证的明确集成面。本文从技术转移角度介绍这一创新项目,总结了更新后的架构、七组制品的功能角色,以及在任务规划与执行背景下网络态势对决策过程的预期影响。

英文摘要

Operational organizations increasingly require Cyber Situational Awareness (CySA) capabilities that go beyond isolated technical alerts, providing mission-relevant artefacts that can be embedded into heterogeneous toolchains and cyber security or cyber defense processes. ECYSAP EYE addresses this need through an adoption-oriented System-of-Systems (SoS) architecture centered on seven groups of mission-focused artefacts: the Recognized Cyberspace Picture (RCyP), Cyber Situational Reports (CySRs), the What-If Analysis Report (WIAR), Option Recommendations (OPRE), an operator Dashboard/HMI (DSH), Action Enforcement (AE), and After-Action Reports (AAR). The ECYSAP EYE architecture structures the transition from perception (full-spectrum RCyP views), to decision-oriented reasoning (WIAR/CySRs/OPRE), and to operational execution and learning (DSH/AE/AAR), with explicit integration surfaces that support incremental deployment and validation. This paper presents this innovative project from a technology transfer perspective, summarizing the updated architecture, the functional role of seven groups of artefacts, and the expected impact of cyber situations on the decision-making process in the context of a mission planning and execution.

2606.12341 2026-06-11 cs.CR 新提交

OCELOT: Inference-Leakage Budgets for Privacy-Preserving LLM Agents

OCELOT:面向隐私保护LLM代理的推理泄露预算

Jin Xie, Songze Li

AI总结 提出OCELOT运行时中介,通过后验风险控制和证人验证降级机制,在代理轨迹中预算对手信念更新,平衡任务效用与隐私泄露。

详情
AI中文摘要

大型语言模型(LLM)代理越来越多地代表用户行事——读取个人文件、调用工具、与外部服务交易——每一步都可能跨信任边界泄露个人身份信息(PII)。这里的隐私不是单个输出的属性,而是整个轨迹的属性,三个特性使其难以处理:泄露是累积的,因为单独无害的发布在诚实但好奇或共谋的接收者处累积成关于受保护秘密的推断;双向的,因为恶意观察可以注入指令,将代理自身的推理模型转而针对用户;任务相关的,因为同一字段对某个接收者是必要的,对另一个却是多余的。每次发布的上下文完整性过滤器、信息流控制和后验泄露监视器各自解决了部分问题,但没有一个能在运行时控制基于推断的累积泄露。我们将代理隐私重新定义为后验风险控制,并提出了OCELOT,一种运行时中介,它预算对手关于秘密的信念在轨迹中可能改进的程度,而不是过滤输出。其机制“证人验证降级”将判断与信任分离:一个不受信任的、本地微调的防御模型检查每个候选发布并发出结构化证据——标记原子和提议的降级操作符——然后由确定性验证器审计,为所选变体收取认证的最小熵成本,并在防篡改账本上记录接收者信任加权预算,授权最小披露的有用发布。在多样化的代理基准测试和近期防御中,OCELOT在更高任务效用下实现了显著更低的泄露,抵抗自适应注入、越狱、累积推断和接收者共谋,且仅增加适度开销。

英文摘要

Large language model (LLM) agents increasingly act on a user's behalf -- reading personal files, calling tools, transacting with external services -- possibly leaking personally identifiable information (PII) across trust boundaries at every step. Privacy here is a property not of a single output but of an entire trajectory, and three properties make it hard: leakage is cumulative, as individually innocuous releases accumulate across honest-but-curious or colluding sinks into inferences about a protected secret; bidirectional, as a malicious observation can inject instructions that turn the agent's own reasoning model against the user; and task-dependent, as the same field is necessary for one recipient yet gratuitous for another. Per-release contextual-integrity filters, information-flow controls, and posterior-leakage monitors each address part of this but none controls cumulative, inference-based leakage at runtime. We recast agent privacy as \emph{posterior-risk control} and present OCELOT, a runtime mediator that budgets how much an adversary's belief about a secret may improve across a trajectory, rather than filtering outputs. Its mechanism, \emph{Witness-Verified Declassification}, separates judgment from trust: an untrusted, locally fine-tuned defender model inspects each candidate release and emits structured evidence -- labeled atoms and proposed declassification operators -- which a deterministic verifier audits, charging a certified min-entropy cost for the chosen variant and authorizing the least-disclosing useful release under a sink-trust-weighted budget recorded on a tamper-evident ledger. Across diverse agent benchmarks and recent defenses, OCELOT attains significantly lower leakage at higher task utility, resists adaptive injection, jailbreak, cumulative inference, and sink collusion, and adds only modest overhead.

2606.12320 2026-06-11 cs.AI cs.CC cs.CR cs.SE 新提交

A Five-Plane Reference Architecture for Runtime Governance of Production AI Agents

生产AI代理运行时治理的五平面参考架构

Krti Tallam

发表机构 * Kamiwaza

AI总结 针对生产AI代理打破传统数据边界治理假设的问题,提出由推理平面和四个执行平面组成的五平面参考架构,通过可组合原语实现运行时治理,阻断七种威胁并验证四个正确性不变式。

详情
Comments
65 pages, 3 figures, 5 tables. Reference architecture with a reference implementation of the policy-engine core and microbenchmark results; full-system evaluation identified as future work
AI中文摘要

企业安全旨在治理数据边界:受保护表面是静态和传输中的数据,控制措施——访问控制、数据丢失防护、边界检查——治理该边界的穿越。生产AI代理瓦解了这一假设。代理代表企业读取上下文、调用工具、调用连接器并修改记录系统,因此风险转移到工作流内部,进入一系列单独允许但可能转变未经授权业务流程的动作序列。现有策略引擎无法扩展到这种机制:它们根据原子主体评估请求时决策,而代理系统需要对复合主体进行状态化评估,这些主体的权限通过委托链衰减。我们提出了一种用于生产代理运行时治理的参考架构,由四个可组合原语构建:五平面分解(一个裁决意图的推理平面,以及四个执行平面——网络、身份、端点、数据——实现决策)、任意停止中介、具有能力衰减的复合主体,以及作为结构化证据基础的审计。我们定义了六种中断原语的分类,这些原语泛化了允许和拒绝,陈述并论证了四个正确性不变式,并展示了在五个具体工作流中阻断七种生产代理威胁。策略引擎核心的参考实现提供了测量证据:衰减正确性和证据可重构性在每次试验中成立,裁决运行在个位数微秒内,审计基础的防篡改行为完全符合设计。我们明确范围:该架构治理委托行为,而非模型行为,针对实时代理基准的全系统评估是下一步工作。

英文摘要

Enterprise security was built to govern data boundaries: the protected surface was data at rest and in transit, and the controls -- access control, data-loss prevention, perimeter inspection -- governed crossings of that boundary. Production AI agents dissolve this assumption. An agent reads context, calls tools, invokes connectors, and modifies systems of record on an enterprise's behalf, so risk moves inside the workflow, into sequences of individually-permitted actions that may transform a business process no one authorized. Existing policy engines do not extend to this regime: they evaluate request-time decisions against atomic principals, where agentic systems require stateful evaluation against composite principals whose authority attenuates through delegation chains. We present a reference architecture for the runtime governance of production agents, built from four composable primitives: a five-plane decomposition (a reasoning plane that adjudicates intent, and four enforcement planes -- network, identity, endpoint, data -- that realize the decision), stop-anywhere mediation, composite principals with capability attenuation, and audit as a structured evidence substrate. We define a taxonomy of six interruption primitives that generalize allow and deny, state and argue for four correctness invariants, and demonstrate the foreclosure of seven production-agent threats across five concrete workflows. A reference implementation of the policy-engine core supplies measured evidence: attenuation correctness and evidence reconstructability hold on every trial, adjudication runs in single-digit microseconds, and the audit substrate's tamper-evidence behaves exactly as designed. We are explicit about scope: the architecture governs delegated action, not model behavior, and a full-system evaluation against a live agent benchmark is the invited next step.

2606.12290 2026-06-11 cs.CR 新提交

Selection Integrity for LLM Graph Memory: An Accumulability Criterion for Information-Flow-Blind Retrieval

LLM图记忆的选择完整性:面向信息流盲检索的可累积性准则

Zeming Fei, Hongming Fei, Xiaoyang Wang, Yang yang, Prosanta Gope, Biplab Sikdar, Ying Zhang

AI总结 针对图记忆检索中信息流控制盲区,提出可累积性准则,证明无源结构写入可导致不可逆转账被误导,并通过重分配性而非依赖性预测漏洞,提出认证子图重计算防御。

详情
AI中文摘要

智能体记忆正在转向图结构,目前为其构建的溯源防御都检查一件事:智能体检索到的记录的来源。我们证明,这类防御在构造上是盲的。长期图记忆在可写图结构上运行全局选择步骤,因此不可信主体写入的结构会改变哪些认证事实被选中,而引用的证据保持完全认证;忠实的信息流控制(IFC)检查读者所用内容的来源(全部已认证),在文档问答基板和真实多会话智能体记忆上,做出与无防御完全相同的字节级决策。在最严重的实例中,无源结构写入在499个实时操作中静默地误导28次不可逆账本转移:忠实IFC允许每一次,而\authselect\\阻止每一次。然后我们精确刻画哪些记忆暴露:当选择器的结构项可以重新分配top-$k$成员中$\Omega(1)$份额越过所选事实的边界时,该通道被允许。个性化PageRank可以,因为无源写入重新路由了守恒的随机游走质量;内容固定的重排序器不能,而Graphiti的节点距离(比PageRank更依赖结构)保持免疫。可重分配性(而非依赖性)是预测指标。我们证明了一般情况下的免疫情况,以及在验证的瓶颈条件下的开放情况。关闭该通道迫使任何溯源防御在认证子图上重新计算选择,这正是\authselect\\所做的,零超额阻塞和2-3%延迟。

英文摘要

Agent memory is moving to graphs, and the provenance defenses now being built for it all check one thing: the provenance of the records an agent retrieves. We show that this entire class of defense is blind by construction. A long-term graph memory runs a global selection step over writable graph structure, so structure that an untrusted principal writes changes \emph{which} authenticated facts are selected while the cited evidence stays fully authenticated; faithful information-flow control (IFC), checking the provenance of what the reader uses (all of it authenticated), makes the byte-identical decision to no defense at all, across document-QA substrates and real multi-session agent memory. In the most consequential instance, a no-source structural write silently misdirects $28$ irreversible ledger transfers over $499$ live actions: faithful IFC permits every one, and \authselect\ prevents every one. We then characterize exactly which memories are exposed: a selector admits the channel when its structural term can reallocate an $\Omega(1)$ share of top-$k$ membership past a selected fact's margin. Personalized PageRank can, since a sourceless write reroutes conserved random-walk mass; a content-fixed reranker cannot, and Graphiti's node-distance, which leans on structure \emph{more} than PageRank does, stays immune. Reallocatability, not reliance, is the predictor. We prove the immune case in general and the open case under a chokepoint condition we verify. Closing the channel forces any provenance defense to recompute selection on the authenticated subgraph, which is what \authselect\ does, at zero over-block and $2$--$3\%$ latency.

2606.12259 2026-06-11 cs.CR cs.AR 新提交

Partitioned Tags, Shared Data: Reconciling Strict Cache Isolation with Write-Shared Coherence

分区标签,共享数据:严格缓存隔离与写共享一致性的调和

Kartik Ramkrishnan, Stephen McCamant, Antonia Zhai, Pen Chung Yew

AI总结 提出SCP方法,通过仅分区标签、共享数据池并调整大小避免容量驱逐,结合时序混淆和写泄漏阈值,在严格隔离下实现写共享一致性,有效防御Prime+Probe和Flush+Reload攻击。

详情
AI中文摘要

缓存分区是针对基于驱逐的缓存侧信道攻击最强大的结构性防御之一,然而一个存在十年的设计问题阻碍了其在安全共享操作系统环境中的广泛部署。该问题是写共享一致性在严格分区下会崩溃。我们提出SCP(安全且一致的分区),它通过仅分区标签、共享单个数据池,并调整数据池大小以避免容量驱动的跨分区驱逐,从而将严格的驱逐隔离与写共享一致性结合起来。时序混淆将保护扩展到分区间的查找路径。通过将写操作在泄漏阈值超过后路由到LLC,减轻了共享可写行上的基于一致性的泄漏,这使得攻击者的写探测延迟与受害者活动无关。使用gem5实现,SCP缓解了Prime+Probe和Flush+Reload攻击,这些是更复杂缓存攻击的基础。我们还展示了一个共享可写行攻击被缓解。所有这些攻击的结果都不优于随机猜测。SCP的硬件成本是LLC SRAM适度增加2.8%。在我们评估的SPEC CPU2017基准测试中,性能在IPC上与DAWG相差在0.3%以内。共享密集型微基准测试展示了基于系统指定泄漏阈值的可调安全-性能权衡。

英文摘要

Cache partitioning is among the strongest structural defenses against eviction-based cache side channels, yet a decade-old design issue has blocked its widespread deployment in secure shared-OS settings. The issue is that write-shared coherence collapses under strict partitioning. We present SCP (Secure and Coherent Partitioning), which combines strict eviction isolation with write-shared coherence by partitioning only the tags, sharing a single data pool, and sizing the data pool so capacity-driven cross-partition eviction cannot occur. Timing obfuscation extends protections to the inter-partition lookup path. Coherence-based leakage on shared-writeable lines is mitigated by routing those writes through to the LLC once a leakage threshold is crossed, which makes attacker write probe latency independent of victim activity. Using gem5 for implementation, SCP mitigates Prime+Probe and Flush+Reload, which are the basis for more sophisticated cache attacks. We also demonstrate that a shared-writeable-line attack is mitigated. All these attacks yield results no better than random guessing. SCP's hardware cost is a modest +2.8% LLC SRAM. Performance matches DAWG within 0.3% IPC on the SPEC CPU2017 benchmarks that we evaluated. Sharing-intensive microbenchmarks demonstrate a tunable security-performance tradeoff based on a system-specified leakage threshold.

2606.12251 2026-06-11 cs.LG cs.AI cs.CR 新提交

Reinforcement Learning Disrupts Gradient-Based Adversarial Optimization

强化学习破坏基于梯度的对抗优化

Xinhai Zou, Chang Zhao, Alireza Aghabagherloo, Dave Singelée, Robin Degraeve, Bart Preneel

发表机构 * COSIC, KU Leuven(鲁汶大学COSIC) Imec Brubotics, VUB(布鲁塞尔自由大学Brubotics) DistriNet, KU Leuven(鲁汶大学DistriNet)

AI总结 研究通过强化学习训练图像分类器以破坏攻击者使用的梯度结构,发现RL作为隐式正则化器产生不稳定梯度方向和较小梯度幅度,使基于梯度的攻击失效,并与对抗训练结合实现双重防御。

详情
AI中文摘要

基于梯度的对抗攻击仍然是对深度神经网络(DNN)的主要威胁,因为它们利用梯度信息高效优化对抗扰动。为了解决这个问题,我们研究了强化学习(RL)训练是否可以通过使用策略梯度目标和epsilon-贪婪探索来训练图像分类器,从而破坏攻击者使用的梯度结构。通过在CIFAR-10、CIFAR-100和ImageNet-100上使用多种架构进行系统实验,我们发现RL训练的分类器显著破坏了基于梯度的对抗优化。为了解释这一点,我们使用损失景观可视化、静态和动态梯度指标以及预测熵进行了全面的机制分析。我们的分析揭示,RL充当隐式正则化器,产生具有高度不稳定梯度方向和较小梯度幅度的模型。这种组合使得每个PGD步骤在方向上不可靠且幅度有限,导致基于梯度的攻击在实际迭代预算内失败。我们进一步表明,将RL与对抗训练(RL-adv)结合提供了在两个互补层面运作的双层防御:RL退化攻击者可用的梯度信息(梯度级防御),而对抗训练强化决策边界(边界级防御)。RL-adv在所有评估的主要攻击类型(包括基于梯度的PGD、AutoAttack、基于迁移和基于查询的攻击)中实现了最高的鲁棒性,显著优于SL-adv。这些发现将RL诱导的梯度破坏识别为一种互补的鲁棒性机制,并激励未来研究结合SL效率与RL梯度正则化特性的混合SL-RL训练调度。

英文摘要

Gradient-based adversarial attacks remain a dominant threat to deep neural networks (DNNs), as they exploit gradient information to efficiently optimize adversarial perturbations. To address this, we investigate whether reinforcement learning (RL) training can disrupt the gradient structure used by attackers by training image classifiers with policy-gradient objectives and epsilon-greedy exploration. Through systematic experiments across CIFAR-10, CIFAR-100, and ImageNet-100 with multiple architectures, we find that RL-trained classifiers significantly disrupt gradient-based adversarial optimization. To explain this, we conduct a comprehensive mechanism analysis using loss landscape visualization, static and dynamic gradient indicators, and predictive entropy. Our analysis reveals that RL acts as an implicit regularizer, producing models with highly unstable gradient directions and smaller gradient magnitudes. This combination makes each PGD step both unreliable in direction and limited in magnitude, causing gradient-based attacks to fail within practical iteration budgets. We further show that combining RL with adversarial training (RL-adv) provides a dual-layer defense operating at two complementary levels: RL degrades gradient information available to attackers (gradient-level defense), while adversarial training strengthens decision boundaries (boundary-level defense). RL-adv achieves the highest robustness across all major attack types evaluated, including gradient-based (PGD, AutoAttack), transfer-based, and query-based attacks, outperforming SL-adv by a significant margin. These findings identify RL-induced gradient disruption as a complementary robustness mechanism and motivate future research on hybrid SL-RL training schedules that combine SL's efficiency with RL's gradient-regularization properties.

2606.12225 2026-06-11 cs.CR 新提交

Bridging the Smart City Cybersecurity Data Gap Through AI-Driven Synthetic Dataset Generation

弥合智慧城市网络安全数据鸿沟:基于AI驱动的合成数据集生成

Stephanie Polczynski, John D. Hastings, Varghese Vaidyan, Kyle Korman

AI总结 提出AI合成数据生成框架,利用生成模型产生高保真网络安全数据集,解决真实数据稀缺问题,支持智慧城市安全工具开发与评估。

详情
Comments
10 pages, 1 figure, 2 tables
AI中文摘要

智慧城市依赖于互联的网络物理系统,这些系统集成了传感器、物联网设备、云平台以及AI驱动的服务和决策。虽然这些系统增强了城市服务,但由于其庞大的攻击面、异构的数据流和不断演变的威胁向量,也引入了复杂的网络安全挑战。为智慧城市开发和验证网络安全工具需要能够准确代表真实运行条件的高质量数据集。然而,真实世界的数据集往往不完整、包含隐私敏感数据、难以获取,或者缺乏足够的恶意活动来支持工具开发。本研究通过提出一个专门为智慧城市网络安全研究设计的基于AI的合成数据生成(SDG)框架,解决了这一关键差距。所提出的框架利用生成式人工智能模型来生成高保真的合成网络安全数据集,这些数据集复制了真实的设备行为、网络交互和网络攻击场景。合成数据集根据协议标准的一致性、与原始数据集的统计相似性以及在常见安全工具中的实用性进行评估。由此产生的合成数据生成框架和评估指标有望通过使研究人员能够更有效地建模威胁和更全面地评估防御技术,从而推进智慧城市网络安全,更好地保护关键智慧城市基础设施。

英文摘要

Smart cities rely on interconnected cyber-physical systems that integrate sensors, IoT devices, cloud platforms, and AI-driven services and decision-making. While these systems enhance city services, they also introduce complex cybersecurity challenges due to their large attack surfaces, heterogeneous data flows, and evolving threat vectors. Developing and validating cybersecurity tools for smart cities requires high-quality datasets that accurately represent real operational conditions. However, real-world datasets are often incomplete, contain privacy-sensitive data, are difficult to access, or lack sufficient malicious activity to support tool development. This research addresses this critical gap by proposing an AI-based synthetic data generation (SDG) framework designed specifically for smart city cybersecurity research. The proposed framework leverages generative artificial intelligence models to produce high-fidelity synthetic cybersecurity datasets that replicate realistic device behaviors, network interactions, and cyber-attack scenarios. The synthetic datasets are evaluated for conformity to protocol standards, statistical similarity to original datasets, and utility in common security tools. The resulting synthetic data generation framework and evaluation metrics are expected to advance smart city cybersecurity by enabling researchers to model threats more effectively and evaluate defensive techniques more comprehensively to better protect critical smart city infrastructures.

2606.12212 2026-06-11 cs.SE cs.CR 新提交

Mind your key: An Empirical Study of LLM API Credential Leakage in iOS Apps

注意你的密钥:iOS应用中LLM API凭证泄露的实证研究

Pinran Gao, Lingxiang Wang, Ying Zhang, Fan Yang

AI总结 本研究首次系统调查iOS应用中LLM API密钥泄露问题,通过动态分析框架LLMKeyLens检测444个应用,发现282个存在可被利用的凭证泄露,并识别出三种泄露模式,三个月后仅28%完成修复。

详情
Comments
12 pages, 4 figures, 4 tables
AI中文摘要

大型语言模型(LLM)快速集成到移动应用中引入了一类新的凭证安全风险:泄露的凭证允许未经授权访问LLM推理服务,给开发者造成经济损失。先前关于凭证泄露的工作主要集中在Android应用上;迄今为止,尚无实证研究系统调查iOS应用中的LLM API密钥泄露。我们首次对集成LLM的应用中的API密钥泄露进行了深入的实证研究。我们构建了一个包含444个iOS应用的高质量数据集,这些应用通过标准化流程从1092个候选应用中筛选出来,并开发了LLMKeyLens,一个动态分析框架,通过流量拦截、特定于提供商的密钥提取和主动有效性确认来检测LLM API密钥泄露,无需源代码访问或二进制解密。我们的分析显示,282个应用在网络流量中暴露了可利用的LLM API凭证,涉及至少十个提供商。我们识别出三种泄露模式:基于JWT的令牌泄露(48%)、未经身份验证的后端代理访问(33%)和明文API密钥传输(19%)。为评估修复情况,我们在负责任披露三个月后重新分析了相同的282个易受攻击的应用;只有28%修复了报告漏洞,而72%仍然可利用,问题持续源于未经身份验证的后端和损坏的JWT实现。我们的发现表明,LLM API密钥泄露在iOS生态系统中既普遍又持久,暴露出开发者实践与安全集成原则之间的系统性差距,并表明安全的LLM集成不仅需要开发者意识,还需要提供商明确的安全指导和平台级强制执行。

英文摘要

The rapid integration of large language models (LLMs) into mobile applications has introduced a new class of credential security risk: leaked credentials that grant unauthorized access to LLM inference services, causing financial damage to developers. Prior work on credential leakage has focused primarily on Android apps; to date, no empirical study has systematically investigated LLM API key leakage in iOS applications. We present the first in-depth empirical study of API key leakage in LLM-integrated apps. We construct a high-quality dataset of 444 iOS applications, filtered from 1092 candidates through a standardized process, and develop LLMKeyLens, a dynamic analysis framework that detects LLM API key leakage via traffic interception, provider-specific key extraction, and active validity confirmation, requiring neither source code access nor binary decryption. Our analysis reveals that 282 applications expose exploitable LLM API credentials in network traffic, spanning at least ten providers. We identify three leakage patterns: JWT-based token leakage (48%), unauthenticated backend proxy access (33%), and plaintext API key transmission (19%). To assess remediation, we re-analyzed the same 282 vulnerable applications three months after responsible disclosure; only 28% had remediated the reported vulnerability, while 72% remained exploitable, with persistent issues stemming from unauthenticated backends and broken JWT implementations. Our findings show that LLM API key leakage is both prevalent and persistent in the iOS ecosystem, exposing a systemic gap between developer practice and secure integration principles, and suggest that secure LLM integration requires not only developer awareness but also explicit security guidance from providers and platform-level enforcement.

2606.12075 2026-06-11 cs.CR cs.LG 新提交

Categorical Robustness Assessment for Machine Learning based Network Intrusion Detection Systems

基于机器学习的网络入侵检测系统的分类鲁棒性评估

Mayank Raj, Nathaniel D. Bastian, Lance Fiondella, Gokhan Kul

AI总结 本文系统比较了CNN、LSTM和随机森林三种分类器在对抗攻击下的鲁棒性,发现随机森林基线准确率虽高但极易被攻破,而CNN表现最稳健。

详情
AI中文摘要

网络入侵检测系统(NIDS)广泛使用机器学习(ML),但ML模型可能受到对抗性攻击的操纵。这些攻击向网络流量数据添加精心设计的扰动,导致误分类。虽然先前的工作已经证明了孤立环境下的对抗性漏洞,但在受控攻击条件下,跨架构以及基于攻击类别和类型的系统比较仍然有限,这使得从业者在对抗性环境中部署哪些模型缺乏明确指导。本文提出了一个简单的问题:当攻击者试图操纵系统时,哪种分类器架构实际上能够保持稳定?我们对三种流行架构进行了测试:一维卷积神经网络(CNN)、长短期记忆网络(LSTM)和随机森林(RF)集成。使用ACI-IoT-2023数据集(超过120万个样本,涵盖12种攻击类型),我们使用FGSM和PGD对抗攻击对每个模型进行攻击,这些攻击在归一化特征空间中应用基于梯度的扰动,符合既定的对抗性ML评估协议,扰动预算范围为$\epsilon=0.01$到$\epsilon=0.1$。令人惊讶的是,随机森林实现了近乎完美的基线准确率(99.98%),但在攻击下灾难性地崩溃,在我们测试的最小扰动下下降了73个百分点。另一方面,CNN在$\epsilon=0.01$时保持了95.5%的准确率,并且随着扰动的增加而优雅地退化。LSTM介于两者之间。这些发现颠覆了传统观念:如果模型在对抗压力的第一个迹象下就崩溃,那么高基线准确率毫无意义。对于在对抗性环境中部署入侵检测的从业者,我们推荐基于CNN的架构,并提供特定场景的部署指导。

英文摘要

Network Intrusion Detection Systems (NIDS) heavily utlize Machine Learning (ML) but ML models can be manipulated via adversarial attacks. These attacks add carefully crafted perturbations to network traffic data that leads to misclassifications. While prior work has demonstrated adversarial vulnerabilities in isolated settings, systematic cross-architecture as well as class and category of attack based comparisons under controlled attack conditions remain limited, leaving practitioners without clear guidance on which models to deploy in adversarial environments. This paper asks a simple question: what type of classifier architectures actually hold up when attackers try to manipulate the systems? We put three popular architectures through their paces: a 1D Convolutional Neural Network, a Long Short-Term Memory (LSTM) network, and a Random Forest (RF) ensemble. Using the ACI-IoT-2023 dataset (over 1.2 million samples spanning 12 attack types), we subject each model with FGSM and PGD adversarial attacks, which apply gradient-based perturbations in normalized feature space consistent with established adversarial ML evaluation protocols, at perturbation budgets ranging from $\epsilon=0.01$ to $\epsilon=0.1$. Surprisingly, Random Forest achieved near-perfect baseline accuracy (99.98\%), yet collapsed catastrophically under attack, dropping 73 percentage points at the smallest perturbation we tested. CNN, on the other hand, retained 95.5\% accuracy at $\epsilon=0.01$ and degraded gracefully as perturbations increased. LSTM fell somewhere in between. These findings flip the conventional wisdom where high baseline accuracy means nothing if a model shatters at the first sign of adversarial pressure. For practitioners deploying intrusion detection in adversarial environments, we recommend CNN-based architectures and provide scenario-specific deployment guidance.

2606.12064 2026-06-11 cs.SE cs.CR 新提交

Undefined Behavior in C and C++: An Experiment With Desktop Use Cases

C和C++中的未定义行为:桌面使用场景的实验

Jukka Ruohonen, Krzysztof Sierszecki

AI总结 通过编译器实现的未定义行为检测器,实验发现Linux桌面环境下C/C++程序普遍存在未定义行为,59个任务产生近1.1万条警告,多数来自Mesa图形库和GUI交互。

详情
Comments
Submitted
AI中文摘要

未定义行为是C和C++编程中的惯用现象;这类行为是指使用了语言不施加任何要求的错误程序构造,例如整数溢出。本文通过实证实验,探究在Linux发行版的典型桌面使用中,底层执行的未定义行为的程度。分析基于编译器中实现的未定义行为检测器。根据结果,未定义行为很常见。通过完成59个简单的实验任务,由32个用C或C++编写的独特程序和库生成了近1.1万条独特的未定义行为警告。其中,大多数警告与Mesa图形库相关,并通过与图形用户界面交互产生。仅登录GNOME桌面环境就生成了超过500条独特警告。在所有警告中,绝大多数是关于虚表指针的。相关的堆栈跟踪通常也很长。凭借这些及其他结果,本文为关于C和C++的实证文献做出了贡献。

英文摘要

Undefined behavior is idiomatic to C and C++ programming; such behavior is a use of an erroneous program construct for which the languages impose no requirements, such as integer overflows. The paper presents an empirical experiment seeking to probe the extent of undefined behavior executing underneath typical desktop use of a Linux distribution. The analysis is based on an undefined behavior sanitizer implemented in a compiler. According to the results, undefined behavior is common. By completing 59 simple experimental tasks, nearly 11 thousand unique undefined behavior warnings were generated by 32 unique programs and libraries written in C or C++. Of these warnings, most were associated with the Mesa graphics library and generated by interacting with graphical user interfaces. Merely logging into the GNOME desktop environment generated over 500 unique warnings. Of all warnings, the clear majority was about virtual table pointers. The associated stack traces were also lengthy in general. With these and other results, the paper contributes to the empirical literature on C and C++.

2606.12011 2026-06-11 cs.CR 新提交

InjectV: Modeling Fault Injection Attacks in RISC-V Simulation Environment

InjectV:在RISC-V仿真环境中建模故障注入攻击

Niccolò Lentini, Giorgio Fardo, Stefano Di Carlo, Alessandro Savino

AI总结 提出InjectV框架,基于gem5模拟器在RISC-V平台上实现精确、引导式的故障注入,支持寄存器和存储器瞬态故障攻击,实验表明相比传统方法节省95.8%时间。

详情
AI中文摘要

故障注入攻击(FIA)对硬件安全构成重大威胁,能够通过在计算或存储中诱导恶意故障来破坏系统。由于物理故障实验的高成本、复杂性和有限可用性,特别是在硅前开发阶段,评估对此类攻击的韧性具有挑战性。架构级仿真提供了一种面向开发者的白盒视角,用于系统性的漏洞评估。本文介绍了InjectV,一个基于gem5模拟器构建的RISC-V平台故障注入攻击框架。InjectV能够在安全关键执行点(如控制流决策、计数器和比较)实现精确、引导式的故障注入,从而系统性地探索攻击向量。它目前支持寄存器和存储器中的瞬态故障攻击,拓宽了模拟多种攻击场景的能力。在FISSC套件(包括VerifyPIN应用的强化变体)的安全基准测试上的实验结果表明,InjectV能够有效识别故障注入点,相比传统故障注入方法节省了95.8%的时间。

英文摘要

Fault Injection Attacks (FIAs) are a significant threat to hardware security, capable of compromising systems by inducing malicious faults in computation or storage. Evaluating resilience against such attacks is challenging due to the high cost, complexity, and limited availability of physical fault experiments, particularly during pre-silicon development. Architectural-level simulation offers a developer-oriented, white-box perspective for systematic vulnerability assessment. This paper introduces InjectV, a fault injection attack framework for RISC-V platforms built on the gem5 simulator. InjectV enables precise, guided fault injection at security-critical execution points, such as control-flow decisions, counters, and comparisons, allowing systematic exploration of attack vectors. It currently supports transient fault attacks in registers and memory, broadening its ability to simulate diverse attack scenarios. Experimental results on security benchmarks from the FISSC suite, including hardened variants of the VerifyPIN application, demonstrate InjectV's ability to effectively identify fault-injection points, achieving a 95.8% time-saving advantage over traditional fault injection approaches.

2606.11967 2026-06-11 cs.CR cs.IT math.CO 新提交

Quadratic APN Functions in Dimension 8 via Gröbner Basis Search in a Self-Equivalence Subspace

通过自等价子空间中的Gröbner基搜索发现8维二次APN函数

Oleksandr Kuznetsov

AI总结 本文在8维自等价子空间中通过Gröbner基搜索发现566个二次APN函数,其中4个新CCZ等价类(500个函数)未被现有数据库收录,并验证了搜索管道的正确性。

详情
AI中文摘要

我们描述了一种在结构化自等价子空间内对8维二次APN(几乎完美非线性)函数的计算搜索。搜索空间是一个40维二元线性子空间,由所有与5阶线性自同构(Beierle、Brinkmann和Leander 2021年分类中的第22类)交换的函数组成,此前报道该子空间不含APN函数。我们的方法结合了通过显式RREF参数化的随机采样(每核心小时约600次新的APN阳性评估)和Magma中的Gröbner基计算,以枚举每个中心点24维超平面中的所有APN函数(每个超平面约10分钟)。在覆盖全部65,536个超平面中0.65%的428次超平面计算中,我们获得了566个二次APN函数,它们在正交导数不变量下形成6个CCZ等价类。其中4个类(包含500个函数)与2025年数据库中的3,775,599个二次APN函数或2020年前的12,921个实例汇编中的任何条目均不匹配。两个类(66个函数)与Gold函数x^3和x^9 CCZ等价,证实了搜索管道的正确性。成员分析表明,三个新类(B、C、D)完全位于原始子空间之外,且仅出现在以Gold函数为中心的切片中,展示了Gröbner基阶段的关键作用。在532次以数据库函数为切片中心的实验和20次以随机中心进行的实验中,未发现APN邻居,表明网关现象是搜索空间自等价结构特有的。由于正交导数不变量是二次APN函数的完全CCZ不变量,缺失匹配签名提供了CCZ不等价的严格证明。

英文摘要

We describe a computational search for quadratic APN (Almost Perfect Nonlinear) functions in dimension 8 within a structured self-equivalence subspace. The search space is a 40-dimensional binary linear subspace consisting of all functions commuting with a linear automorphism of order 5 (class 22 in the taxonomy of Beierle, Brinkmann, and Leander, 2021), previously reported to contain no APN functions. Our approach combines random sampling via an explicit RREF parameterization (approximately 600 fresh APN-positive evaluations per core-hour) with Gröbner basis computation in Magma to enumerate all APN functions in a 24-dimensional hyperplane through each center (approximately 10 minutes per hyperplane). From 428 hyperplane computations, covering 0.65% of all 65,536 hyperplanes, we obtained 566 quadratic APN functions forming six CCZ-equivalence classes under the ortho-derivative invariant. Four classes, comprising 500 functions, match no entry in the 2025 database of 3,775,599 quadratic APN functions or in the pre-2020 compilation of 12,921 instances. Two classes (66 functions) are CCZ-equivalent to the Gold functions x^3 and x^9, confirming the correctness of the search pipeline. A membership analysis shows that the three new classes (B, C, D) lie entirely outside the original subspace and occur only in Gold-centered slices, demonstrating the essential role of the Gröbner basis stage. In 532 experiments using database functions as slice centers and 20 experiments with random centers, no APN neighbors were found, indicating that the gateway phenomenon is specific to the self-equivalence structure of the search space. Since the ortho-derivative invariant is a complete CCZ-invariant for quadratic APN functions, the absence of matching signatures provides a rigorous proof of CCZ-inequivalence.

2606.11949 2026-06-11 cs.LG cs.CR stat.ML 新提交

Online Shift Detection and Conformal Adaptation for Deployed Safety Classifiers

已部署安全分类器的在线漂移检测与共形自适应

Jun Wen Leong

AI总结 提出在线监测系统,使用校准序列统计检测分布漂移,并通过共形弃权层自适应阈值恢复目标错误率,在800个实验单元中实现86.6%有效检测。

详情
Comments
16 pages, 4 figures, 7 tables. Code and data at this https URL
AI中文摘要

我们提出了一种在线监测系统,用于检测已部署安全分类器中的分布漂移,使用校准的序列统计量来检测分类器何时移出分布。一旦检测到,共形弃权层会自适应调整决策阈值,以恢复目标错误率ε=0.1。在一项预注册的析因评估(4个分类器×5种漂移条件×20个种子×2个窗口大小,共800个单元)中,该系统实现了86.6%的有效检测(693/800,95% CI [84.1%, 88.8%]),平均延迟为39.5步。检测在三种真实标签机制下均有效:合成发作(86.6%)、真实时间越狱(85%,17/20)和GCG对抗攻击。加权共形预测为DeBERTa恢复了高达39个百分点的丢失覆盖率(ESS=46/300),但所有其他分类器均崩溃(ESS≈300):逻辑密度比估计在高维嵌入空间中实现了完美的源/目标可分离性,将所有重要性权重裁剪至下限。DeBERTa显示出从有效校正(释义,ESS=46)到几乎完全崩溃(对抗后缀,ESS=206)的梯度。PCA降至32维打破了崩溃,为Llama Guard恢复了33个百分点,为ShieldGemma恢复了21个百分点。方差分解显示分类器(η²=0.243)、漂移类型(η²=0.237)及其交互作用(η²=0.185)均对检测延迟方差有显著贡献(所有p<0.001),表明需要针对每个分类器的监测配置文件。

英文摘要

We present an online monitoring system for distributional shift in deployed safety classifiers, using calibrated sequential statistics to detect when a classifier has moved out of distribution. Upon detection, a conformal abstention layer adapts decision thresholds to recover a target error rate epsilon=0.1. In a pre-registered factorial evaluation (4 classifiers x 5 shift conditions x 20 seeds x 2 window sizes, 800 cells), the system achieves 86.6% valid detection (693/800, 95% CI [84.1%, 88.8%]) with mean latency of 39.5 steps. Detection holds across three ground-truth regimes: synthetic onset (86.6%), real temporal jailbreaks (85%, 17/20), and GCG adversarial attacks. Weighted conformal prediction recovers up to 39 pp of lost coverage for DeBERTa (ESS=46/300) but collapses for all other classifiers (ESS~300): logistic density ratio estimation achieves perfect source/target separability in high-dimensional embedding spaces, clipping all importance weights to the floor. DeBERTa shows a gradient from effective correction (paraphrase, ESS=46) to near-total collapse (adversarial suffix, ESS=206). PCA to 32 dimensions breaks the collapse, recovering 33 pp for Llama Guard and 21 pp for ShieldGemma. Variance decomposition reveals classifier (eta^2=0.243), shift type (eta^2=0.237), and their interaction (eta^2=0.185) all contribute substantially to detection latency variance (all p<0.001), indicating per-classifier monitoring profiles are necessary.

2606.11884 2026-06-11 cs.CV cs.CR 新提交

Image Quality Assessment of Identity Cards Using Measures from Open Face Image Quality

使用开放人脸图像质量度量对身份证进行图像质量评估

Gregor Grote, Juan E. Tapia, Christian Rathgeb

发表机构 * da/sec - Biometrics and Internet Security Research Group, Hochschule Darmstadt(达姆施塔特应用科学大学生物识别与互联网安全研究组)

AI总结 本文通过将OFIQ标准中的捕获相关质量度量应用于身份证图像,提出一种预处理流程,并分析这些度量与三种呈现攻击检测算法性能的相关性,表明基于某些OFIQ度量的质量评估可显著提升PAD性能。

详情
Comments
Presented on IWBF 2026 (14th International Workshop on Biometrics and Forensics)
AI中文摘要

本文通过将开放人脸图像质量(OFIQ)标准中的捕获相关质量度量应用于身份证图像,解决了远程验证系统中身份证图像质量评估的挑战。我们的预处理流程包括角点检测、透视归一化和全面的前景掩码,以确保准确且无偏的质量度量计算。我们通过分析这些度量与三种呈现攻击检测(PAD)算法在四个不同身份证数据集上的性能相关性来评估其有效性,其中两个数据集包含真实(即原始)图像,两个包含打印的模拟身份证。我们的结果表明,基于某些OFIQ度量的质量评估可以显著提升PAD性能。

英文摘要

This paper addresses the challenge of assessing image quality in ID cards in remote verification systems by applying capture-related quality measures from the Open Face Image Quality (OFIQ) standard to ID card images. Our preprocessing pipeline includes corner detection, perspective normalization, and comprehensive foreground masking to ensure accurate and unbiased quality measure computation. We evaluate the effectiveness of these measures by analyzing their correlation with the performance of three presentation attack detection (PAD) algorithms across four diverse ID card datasets, where two datasets contain bona fide, i.e. pristine, images and two contain printed mock ID cards. Our results suggest that quality assessment based on some OFIQ measures can significantly improve PAD performance.

2606.11878 2026-06-11 cs.CR 新提交

Gerrymandering the Warp: Non-Control-Data Attacks on CUDA Collective Decision

扭曲 Warp:针对 CUDA 集体决策的非控制数据攻击

Igor Santos-Grueiro

AI总结 本文提出集体语义破坏(CSC)攻击,利用 CUDA 集体操作中的参与元数据(如掩码、谓词等)绕过安全决策,并引入集体完整性契约(CIC)防御机制。

详情
Comments
17 pages
AI中文摘要

CUDA 集体操作通常位于安全决策路径上:投票接受批次、归约聚合证据、洗牌选择代表、屏障在使用前检查状态。这些决策不仅依赖于计算值,还依赖于哪些通道被代表、它们贡献了什么证据、哪个通道代表群体、以及哪个检查过的状态到达提交。我们将这些参与元数据识别为决策性的非控制数据。我们定义了集体语义破坏(CSC),一种非控制数据攻击家族,其中范围有效的掩码、谓词、源通道、描述符、组标签或时期导致符合 CUDA 规范的集体在错误的成员、贡献、角色或验证使用状态上授权决策。内核到达预期的集体站点并执行预期的原语;原语代表了错误的授权集合。我们使用站点本地的参与-授权契约对 CSC 进行建模。受保护的集体在授权前派生、重新计算、检查或冻结成员、贡献、角色和时间状态。我们在 NVIDIA CUDA 集体原语、触发通道、紧凑工作负载风格内核、简化习语桥和准入保护框架上评估 CSC。在涵盖四个授权维度的 CUDA 定义的契约一致性套件中,损坏的参与元数据导致 102/102 实例中的可信参考不匹配,而强化变体在 102/102 中保留了该参考。我们单独报告了 13 个同步敏感实例。然后,我们引入了集体完整性契约(CIC),一种包装规范,在集体使用前绑定参与元数据。对于 CUDA 集体决策,安全性既依赖于计算的值,也依赖于代表的参与者。

英文摘要

CUDA collective operations often sit on security decision paths: votes accept batches, reductions aggregate evidence, shuffles select representatives, and barriers order checked state before use. Such decisions depend not only on computed values, but also on which lanes are represented, what evidence they contribute, which lane speaks for the group, and which checked state reaches commit. We identify this participation metadata as decision-making non-control data. We define Collective Semantic Corruption (CSC), a non-control-data attack family in which range-valid masks, predicates, source lanes, descriptors, group labels, or epochs cause a CUDA-conforming collective to authorize a decision over the wrong membership, contribution, role, or validation-to-use state. The kernel reaches the intended collective site and executes the expected primitive; the primitive represents the wrong authority set. We model CSC with a site-local participation-authority contract. A protected collective derives, recomputes, checks, or freezes membership, contribution, role, and temporal state before authorization. We evaluate CSC across NVIDIA CUDA collective primitives, trigger channels, compact workload-style kernels, reduced idiom bridges, and admission-guard harnesses. In a CUDA-defined contract-conformance suite spanning the four authority dimensions, corrupted participation metadata causes a trusted-reference mismatch in 102/102 instances, while hardened variants preserve that reference in 102/102. We report 13 synchronization-sensitive instances separately. We then introduce Collective Integrity Contracts (CIC), a wrapper discipline that binds participation metadata before collective use. For CUDA collective decisions, security depends on both the values computed and the participants represented.

2606.11871 2026-06-11 cs.CR 新提交

WarpGuard: Protected-Site Control-Flow Integrity for CUDA SASS Binaries

WarpGuard: CUDA SASS 二进制程序的安全点控制流完整性

Igor Santos-Grueiro

AI总结 针对 GPU 内存漏洞可导致设备端控制流破坏的问题,提出 WarpGuard,首个在已执行 SASS 上实施安全点 CFI 的系统,通过认证返回地址、验证前向目标等机制,在 77 个 CUDA 程序上分类 51621 个控制流点并执行 5220 万次动态检查,有效防御控制流攻击。

详情
AI中文摘要

最近的 CUDA 利用工作表明,GPU 内存漏洞可以升级为设备端控制流破坏,因为内核随后会消耗被破坏的返回延续、函数指针、调度表条目或分支目标。对于已部署的 CUDA 二进制程序,相关的安全边界是执行的 NVIDIA SASS,经过 PTX 降级、内联、ABI 决策、寄存器分配、溢出、谓词和 SIMT 执行后;源代码或 PTX 级别的策略不捕获此边界。我们提出 WarpGuard,据我们所知,这是第一个针对在已执行 SASS 上运行的 CUDA 设备二进制程序的安全点 CFI 系统。WarpGuard 在安全点强制执行:恢复的 SASS 指令或序列,这些指令或序列消耗控制流状态,提供足够的二进制证据以推导策略,在发布前进行检查,并在违反时失败关闭。它认证仪器化返回的后向边缘延续状态,验证每个安全点的可恢复前向目标,并报告安全分母之外的固定边缘、不支持、配置文件排除、回退和无表面结果。在 77 个 CUDA 程序上,WarpGuard 分类了 51621 个 SASS 控制流点,包括 1343 个返回和 154 个受支持的前向目标集条目,并记录了 5220 万次动态检查。在代表性的后向和前向边缘破坏攻击中,原生执行达到攻击者选择的行为,仅检测模式记录预期的违规,而强制措施在发布无效的受保护传输之前失败关闭。公共代码证据表明,相同的 SASS 消耗模式出现在真实的 CUDA 系统中,包括运行时调度表、cuFFT 回调、生成的可调用表和上传的设备函数指针。WarpGuard 为 CUDA SASS 提供了可审计的安全点 CFI,并将动态仪器化强制与无回调的 SASS 时序和补丁缓存可行性分开。

英文摘要

Recent CUDA exploitation work shows that GPU memory bugs can escalate into device-side control-flow corruption, as kernels later consume corrupted return continuations, function pointers, dispatch-table entries, or branch targets. For deployed CUDA binaries, the relevant security boundary is executed NVIDIA SASS, after PTX lowering, inlining, ABI decisions, register allocation, spills, predication, and SIMT execution; source- or PTX-level policies do not capture this boundary. We present WarpGuard, to our knowledge the first protected-site CFI system for CUDA device binaries operating on executed SASS. WarpGuard enforces at protected sites: recovered SASS instructions or sequences that consume control-flow state, provide sufficient binary evidence to derive policy, are checked before release, and fail closed on violation. It authenticates backward-edge continuation state for instrumented returns, validates recoverable forward targets per site, and reports fixed-edge, unsupported, profile-excluded, fallback, and no-surface outcomes outside the protected denominator. On 77 CUDA artifacts, WarpGuard classifies 51,621 SASS control-flow sites, including 1,343 returns and 154 supported forward target-set entries, and records 52.2 million dynamic checks. In representative backward- and forward-edge corruption attacks, native execution reaches attacker-selected behavior, detect-only mode records the expected violation, and enforcement fails closed before releasing the invalid protected transfer. Public-code evidence shows that the same SASS consumption patterns occur in real CUDA systems, including runtime dispatch tables, cuFFT callbacks, generated callable tables, and uploaded device-function pointers. WarpGuard delivers auditable protected-site CFI for CUDA SASS and separates dynamic-instrumentation enforcement from callback-free SASS timing and patch-cache feasibility.

2606.11839 2026-06-11 cs.CR 新提交

Systematic Cybersecurity Risk Analysis of European Rail Traffic Management System

欧洲铁路交通管理系统的系统性网络安全风险分析

Kacper Darowski, Sebastian N. Peters, Lukas Lautenschlager

AI总结 本研究系统建模ERTMS组件,分析其面临的安全威胁,发现遗留标准引入的漏洞在多种部署场景中持续存在,并指出全面转向ETCS 2级是提升网络安全的关键措施。

详情
Comments
Extended version of the paper accepted at ARES 2026 CPRA. First three authors contributed equally to this work
AI中文摘要

欧洲铁路交通管理系统(ERTMS)是欧盟广泛采用的统一列车管理标准。尽管该标准支持完全自动驾驶等用例,但网络安全一直是事后考虑。风险分析能够系统评估威胁和缓解措施并确定优先级。迄今为止,尚不清楚ERTMS中哪些威胁最为重要。本研究系统建模ERTMS组件,并根据底层技术中识别的威胁分析其安全性。结果表明,尽管ERTMS在铁路安全中发挥关键作用,但其安全状况令人担忧。使用EuroBalises和GSM-Railway(GSM-R)等遗留标准引入了漏洞,这些漏洞在最小ERTMS实现、采用各种可选安全措施的部署以及系统未来演进(例如采用未来铁路移动通信系统(FRMCS))中持续存在。全面过渡到欧洲列车控制系统(ETCS)2级被认为是推进ERTMS网络安全的最重要措施。结果表明,ERTMS需要向安全方向转变,以确保可用性和安全运行。虽然所选方法证明了其可行性并显示了ERTMS的剩余弱点,但未来需要开展以铁路为中心的适应性研究,以改进计算风险的量化和评估。

英文摘要

European Rail Traffic Management System (ERTMS) is a widely adopted standard unifying train management in the EU. While the standard allows for use cases like fully autonomous driving, cybersecurity has been an afterthought. Risk analysis enables the systematic assessment and prioritization of threats and mitigations. To date, it remains unclear which threats are most significant in ERTMS. This study systematically models components of ERTMS and analyzes their security in light of threats identified in the underlying technologies. The results suggest a concerning state of ERTMS, despite its critical role in railway safety. The use of legacy standards like EuroBalises and GSM-Railway (GSM-R) introduces vulnerabilities that persist across minimal ERTMS implementations, deployments incorporating various optional safety measures, and prospective future evolutions of the system, e.g., adopting Future Railway Mobile Communication System (FRMCS). Fully transitioning to European Train Control System (ETCS) level 2 was identified as the most significant measure for advancing ERTMS cybersecurity. The results indicate that a shift of ERTMS toward security is required to ensure availability and safe operation. While the chosen methodology proved its feasibility and shows remaining weaknesses of ERTMS, future work is needed to develop railway-centric adaptations to improve the quantification and evaluation of the computed risks.

2606.11828 2026-06-11 cs.SD cs.AI cs.CR cs.MM 新提交

Feature-Aligned Speech Watermarking for Robustness to Reconstruction Distortions

特征对齐的语音水印技术以抵抗重建失真

Haiyun Li, Shuhai Peng, Zhisheng Zhang, Jingran Xie, Xiaofeng Xie, Hanyang Peng, Zhiyong Wu

发表机构 * Shenzhen International Graduate School, Tsinghua University(清华大学深圳国际研究生院) Shenzhen Key Laboratory of Intelligent Media and Content Understanding(深圳市智能媒体与内容理解重点实验室) Tencent AI Lab(腾讯人工智能实验室)

AI总结 提出特征对齐水印方法,通过将水印与原始语音特征分布对齐,在保持不可感知性的同时提高水印能量,增强对语音重建模型的鲁棒性。

详情
Comments
Accepted by ICME2026
AI中文摘要

音频水印旨在将可识别信息嵌入音频中同时保持不可感知性。现有方法采用高保真、低能量设计以保持感知质量,但由此产生的水印在语音重建模型的抑制下缺乏鲁棒性。由于现有设计中固有的鲁棒性-保真度权衡,提高鲁棒性具有挑战性,增加水印能量会提高鲁棒性但降低保真度。为解决此问题,我们提出一种特征对齐的水印方法,将水印与原始语音特征分布对齐,允许更高的水印能量以提高鲁棒性同时保持不可感知性。我们使用预训练的语音编解码器生成伪语音水印,并将其融合到输入音频的频谱图中,通过VAD损失和感知损失引导在浊音区域嵌入。实验表明,我们的方法在保持与现有方法相当的不可感知性的同时,在见过和未见过的语音重建模型下均显著提高了鲁棒性。

英文摘要

Audio watermarking aims to embed identifiable information into audio while remaining imperceptible. Existing methods adopt high-fidelity, low-energy designs to preserve perceptual quality, but the resulting watermarks lack robustness under suppression by speech reconstruction models. Improving robustness is challenging due to the inherent robustness-fidelity trade-off in existing designs, where increasing watermark energy improves robustness but reduces fidelity. To address this problem, we propose a feature-aligned watermarking method that aligns the watermark with the original speech feature distribution, allowing higher watermark energy to improve robustness while preserving imperceptibility. We use a pretrained speech codec to generate a pseudo-speech watermark and fuse it into the spectrogram of the input audio, with VAD loss and perceptual losses guiding embedding within voiced regions. Experiments show that our method maintains imperceptibility comparable to existing approaches while substantially improving robustness under both seen and unseen speech reconstruction models.

2606.11827 2026-06-11 cs.CR 新提交

Jaguar: Fast Private CNN Inference with Power-of-Two Homomorphic Arithmetic

Jaguar: 基于2的幂同态算术的快速私有CNN推理

Yewon Jeong, Nayoung Jung, Hyeri Roh, Woo-Seok Choi

AI总结 提出Jaguar系统,采用2的幂密文环设计,通过SPA-Conv卷积核和精确密文侧截断,消除NTT瓶颈和后ReLU截断协议,在ImageNet模型上实现2-3.7倍延迟降低。

详情
Comments
29 pages, 8 figures, including appendix
AI中文摘要

混合HE/2PC私有CNN推理仍然受到卷积中素数模同态算术以及精度流的瓶颈,该精度流在调用单独的截断协议之前以双倍位宽运行ReLU。我们提出Jaguar,一个基于单一设计选择——2的幂密文环——的系统,解决了这两个问题。该选择实现了SPA-Conv,一种系数域卷积核,用标量-多项式累加取代以NTT为中心的多项式乘法,以及通过本地右移实现的精确密文侧截断,使得ReLU直接以目标定点精度运行,并消除了后ReLU截断协议。在NTT仍然真正有用的情况下——在客户端,用于解密过程中的单个多项式乘法——我们通过一个辅助NTT素数恢复它,在保持解密为O(N log N)的同时保留了2的幂协议基础。在禁用AVX的ImageNet规模ResNet-18、ResNet-50和MobileNetV2上,Jaguar的端到端延迟比Cheetah低2.07-3.72倍,比Rhombus低2.16-3.36倍,通信量比Cheetah低1.16-1.76倍。

英文摘要

Hybrid HE/2PC private CNN inference remains bottlenecked by prime-modulus homomorphic arithmetic in convolution and by a precision flow that runs ReLU at doubled bitwidth before invoking a separate truncation protocol. We present Jaguar, a system built on a single design choice--a power-of-two ciphertext ring--that addresses both. The choice enables SPA-Conv, a coefficient-domain convolution kernel that replaces NTT-centric polynomial multiplication with scalar-polynomial accumulation, and an exact ciphertext-side truncation by local right shifts that lets ReLU run directly at the target fixed-point precision and eliminates the post-ReLU truncation protocol. Where NTT remains genuinely useful--at the client, for the single polynomial multiplication during decryption--we recover it through an auxiliary NTT prime, preserving the power-of-two protocol substrate while keeping decryption O(N log N). On ImageNet-scale ResNet-18, ResNet-50, and MobileNetV2 with AVX disabled, Jaguar achieves 2.07-3.72x lower end-to-end latency than Cheetah and 2.16-3.36x lower than Rhombus, with 1.16-1.76x lower communication than Cheetah.

2606.11817 2026-06-11 cs.CR cs.AI cs.CL cs.SE 新提交

Grammar-Constrained Decoding Can Jailbreak LLMs into Generating Malicious Code

语法约束解码可诱使大语言模型生成恶意代码

Yitong Zhang, Shiteng Lu, Jia Li

AI总结 本文发现语法约束解码(GCD)可被利用发起名为CodeSpear的越狱攻击,使LLM生成恶意代码;并提出安全对齐方法CodeShield,通过生成蜜罐代码防御该攻击。

详情
AI中文摘要

大型语言模型(LLM)越来越多地用于代码生成,引发了对它们可能被滥用来生成恶意代码的担忧。与此同时,语法约束解码(GCD)已被广泛采用,通过强制语法有效性来提高LLM生成代码的可靠性。在本文中,我们揭示了一个反直觉的风险:这种面向可靠性的技术本身可能成为攻击面。我们发现了一种新的越狱攻击,称为CodeSpear,它利用GCD诱导LLM生成恶意代码。我们的实验表明,仅应用良性代码语法约束即可有效越狱LLM。为了解决这一漏洞,我们提出了CodeShield,一种安全对齐方法,即使在攻击者控制的语法约束下也能稳健地保持安全行为。CodeShield通过在代码模态中对齐模型,教其在GCD下生成蜜罐代码。这种代码在语义上是无害的,因此不会实现恶意请求,并且在结构上是多样化的,因此难以通过语法收紧来抑制。同时,当自然语言可用时,CodeShield仍然保留自然语言的拒绝。在4个基准测试中对10个流行LLM的实验表明,CodeSpear优于代表性的越狱基线,平均攻击成功率提高了30个百分点以上。CodeShield在CodeSpear下恢复了安全性,同时保持了良性实用性。我们的发现揭示了GCD的一个基本风险,并呼吁对其潜在安全影响给予更多关注。

英文摘要

Large Language Models (LLMs) are increasingly used for code generation, raising concerns that they may be misused to produce malicious code. Meanwhile, Grammar-Constrained Decoding (GCD) has been widely adopted to improve the reliability of LLM-generated code by enforcing syntactic validity. In this paper, we reveal a counterintuitive risk: this reliability-oriented technique can itself become an attack surface. We uncover a new jailbreak attack, termed CodeSpear, that exploits GCD to induce LLMs into generating malicious code. Our experiments show that simply applying a benign code grammar constraint can effectively jailbreak LLMs. To address this vulnerability, we propose CodeShield, a safety alignment approach that robustly preserves safe behavior even under attacker-controlled grammar constraints. CodeShield aligns the model in the code modality by teaching it to generate honeypot code under GCD. Such code is semantically harmless, so it does not implement the malicious request, and structurally diverse, so it is difficult to suppress through grammar tightening. At the same time, CodeShield still preserves natural-language refusals when natural language is available. Experiments on 10 popular LLMs across 4 benchmarks show that CodeSpear outperforms representative jailbreak baselines and increases the attack success rate by more than 30 percentage points on average. CodeShield also restores safety under CodeSpear while preserving benign utility. Our findings reveal a fundamental risk of GCD and call for greater attention to its potential security implications.

2606.11804 2026-06-11 cs.AI cs.CR cs.LG 新提交

Toward Trustworthy AI: Multi-Target Adversarial Attacks and Robust Defenses for Continuous Data Summarization

迈向可信赖的人工智能:针对连续数据摘要的多目标对抗攻击与鲁棒防御

Yuefang Lian, Longkun Guo, Zhongrui Zhao, Zhigang Lu, Yanan Cai, Shuchao Pang, Dachuan Xu, Jason Xue

发表机构 * Nankai University(南开大学) James Cook University(詹姆斯库克大学) Western Sydney University(西悉尼大学) Beijing University of Technology(北京工业大学) Fuzhou University(福州大学) Nanjing University of Science and Technology(南京理工大学) CSIRO's Data 61(澳大利亚联邦科学与工业研究组织Data61) The University of Adelaide(阿德莱德大学)

AI总结 研究通过DR-子模优化在相似性层面扰动下对连续数据摘要进行对抗攻击,提出多目标攻击生成和鲁棒防御的近似算法,实验表明攻击有效且防御能改善鲁棒性-缓解权衡。

详情
Comments
Submitted to IEEE Transactions on Information Forensics and Security (IEEE TIFS)
AI中文摘要

可信赖的人工智能需要可靠的数据处理管道,而不仅仅是鲁棒的下游预测模型。作为上游组件,数据摘要决定了哪些信息被保留并传递给后续的学习或决策模块。因此,对摘要过程的对抗性扰动可能以上游方式损害可信赖的人工智能:它们可能改变所选摘要,降低其代表性,并进一步降低后续学习任务的效用。在本文中,我们通过DR-子模优化研究相似性层面扰动下的连续数据摘要对抗攻击。我们证明了一类多分辨率图像摘要目标可以表示为非负子模集函数的多线性扩展,并满足具有$m$-弱单调性的DR-子模性。然后,我们将多目标攻击生成表述为一个最小-最大问题,其中优化相似性结构的一个可容许扰动以降低多个目标摘要模型。为了缓解此类扰动,我们将针对混合攻击类型的鲁棒防御表述为一个正则化的最大-最小问题。对于这两个问题,我们开发了具有理论保证的近似算法。在真实数据和受控聚类基准上的实验表明,所提出的攻击在代表性的低到中等预算范围内是有效的,并且可以导致下游任务性能损失。所提出的防御在结构化设置中改善了鲁棒性-缓解权衡,同时也揭示了真实数据上鲁棒保护的参数敏感性。

英文摘要

Trustworthy AI requires reliable data-processing pipelines, not only robust downstream predictive models. As an upstream component, data summarization determines which information is retained and passed to subsequent learning or decision modules. Therefore, adversarial perturbations to the summarization process can compromise trustworthy AI in an upstream manner: they may alter the selected summary, reduce its representativeness, and further degrade the utility of subsequent learning tasks. In this paper, we study adversarial attacks on continuous data summarization under similarity-level perturbations through DR-submodular optimization. We show that a class of multi-resolution image summarization objectives can be formulated as multilinear extensions of non-negative submodular set functions and satisfy DR-submodularity with $m$-weak monotonicity. We then formulate multi-target attack generation as a min-max problem, where one admissible perturbation of the similarity structure is optimized to degrade multiple target summarization models. To mitigate such perturbations, we formulate robust defense against mixed attack types as a regularized max-min problem. For both problems, we develop approximation algorithms with theoretical guarantees. Experiments on real-data and controlled clustered benchmarks show that the proposed attack is effective in representative low-to-moderate budget regimes and can induce downstream task-performance loss. The proposed defense improves the robustness--mitigation trade-off in structured settings, while also revealing the parameter sensitivity of robust protection on real data.

2606.11803 2026-06-11 cs.CR cs.NI 新提交

SwarmSense-DNN: A Trustworthy and Decentralized Neural Framework for Proactive Anomaly Defense in Consumer IoT

SwarmSense-DNN:面向消费物联网中主动异常防御的可信去中心化神经框架

Jing Yang, Vijay Govindarajan, Saad Arif, Xu Xu, Mohamed Kallel, Zaffar Ahmed Shaikh, Zhe Liu, Chunhong Yuan, Lip Yee Por

AI总结 提出SwarmSense-DNN,一种结合群体智能与深度神经网络的去中心化框架,通过分层联邦学习与图注意力机制实现分布式IoT环境中的协同异常检测,在五个基准数据集上达到95.44%平均检测精度并降低67%通信开销。

详情
Comments
11 pages, 14 figures
AI中文摘要

消费物联网设备的快速增长带来了针对AI网络威胁的可信异常检测的前所未有的挑战,需要实时、隐私保护和可扩展的防御机制。传统的集中式策略在处理分布式消费数据时面临关键限制,包括通信瓶颈、单点故障和隐私漏洞。我们提出SwarmSense-DNN,一种新颖的去中心化神经框架,采用群体智能在分布式IoT环境中进行安全、协作的异常检测。该框架将自主智能体与深度神经网络集成,形成一个自组织的防御系统,无需集中协调即可检测不断演变的异常。它利用带有图神经网络和注意力机制的分层联邦学习来捕获局部和全局异常行为,同时确保数据隐私。大量实验证明了SwarmSense-DNN的优越性能:它在五个基准数据集上实现了95.44%的平均检测精度,同时将通信开销降低了67%。该框架通过差分隐私保障对对抗性威胁保持稳健的弹性,并在节点故障和AI攻击下表现出强大的容错能力。

英文摘要

The rapid growth of consumer IoT devices has introduced unprecedented challenges in trustworthy anomaly detection against AI-enabled cyber threats, requiring real-time, privacy-preserving, and scalable defense mechanisms. Traditional centralized strategies face critical limitations, including communication bottlenecks, single points of failure, and privacy vulnerabilities when processing distributed consumer data. We propose SwarmSense-DNN, a novel decentralized neural framework employing swarm intelligence for secure, cooperative anomaly detection across distributed IoT environments. The framework integrates autonomous agents with deep neural networks to form a self-organizing defense system that detects evolving anomalies without centralized coordination. It utilizes hierarchical federated learning with graph neural networks and attention mechanisms to capture local and global anomaly behaviors while ensuring data privacy. Extensive experiments demonstrate SwarmSense-DNN's superior performance: it achieves 95.44% average detection accuracy across five benchmark datasets while reducing communication overhead by 67%. The framework maintains robust resilience against adversarial threats through differential privacy safeguards and demonstrates strong fault tolerance under node failures and AI-enabled attacks.

2606.11760 2026-06-11 cs.DS cs.CR cs.DB 新提交

A Fast Gaussian Mechanism under Continual Observation, with Applications

持续观测下的快速高斯机制及其应用

Rasmus Pagh, Sia Sejer

AI总结 针对持续更新场景下的私有向量发布问题,提出一种基于布朗桥的常数时间采样方法,实现高斯噪声的快速生成,并应用于差分隐私计数草图,提升正交范围计数查询和连接大小估计的性能。

详情
AI中文摘要

我们考虑在更新下私有发布$k$维向量的问题:从零向量开始,在时间$t_1, t_2,\dots$,向量分别加上$x^{(1)}, x^{(2)},\dots$。对于正整数$T, k$,我们将更新建模为数据集$\{(t_i, x^{(i)})\}_i$,其中$t_i \in [T]$且$x^{(i)} \in B_k$($k$维单位球)。如果两个这样的数据集的对称差大小至多为$1$,则称它们为相邻的。持续发布包括每个时间步$t=1,\dots,T$的和$A^{(t)} = \sum_{i \;: \; t_i \leq t} x^{(i)}$。经典的持续发布技术允许我们以$\text{polylog}(T)$的加性噪声幅度发布$A^{(1)},\dots,A^{(T)}$的近似,计算时间为$O(kT)$,即使在在线自适应情况下(数据持续揭示当前时间步)也是如此。受私有草图技术的启发,我们考虑在时间步$t$仅需发布$A^{(t)}$中条目的\emph{子集}的设置。我们的新结果是,可以在\emph{常数时间}内采样给定噪声向量中的任何所需条目,同时精确再现具有高斯噪声的二叉树机制的分布。对已知$O(\log T)$时间界的改进来自一种新的数据结构,它允许我们使用布朗桥在常数时间内以正确的相关性采样新的噪声值。我们提出了两个独立感兴趣的数据管理应用,它们将我们的技术与差分隐私CountSketch结合使用:1)正交范围计数查询的动态数据结构,具有比先前数据结构更好的隐私/准确性/空间权衡;2)连接大小估计,其中我们还展示了改进的高概率界。

英文摘要

We consider the problem of privately releasing a $k$-dimensional vector under updates: Starting with a zero vector, at times $t_1, t_2,\dots$ the vector is updated by adding $x^{(1)}, x^{(2)},\dots$, respectively. For positive integers $T$, $k$ we model the updates as a data set $\{(t_i, x^{(i)})\}_i$, where $t_i \in [T]$ and $x^{(i)} \in B_k$ (the $k$-dimensional unit ball). Two such data sets are said to be neighboring if their symmetric difference has size at most $1$. The continual release consists of the sum $A^{(t)} = \sum_{i \;: \; t_i \leq t} x^{(i)}$ for each time step $t=1,\dots,T$. Classical continual release techniques allow us to release an approximation of $A^{(1)},\dots,A^{(T)}$ with additive noise of magnitude $\text{polylog}(T)$, computed in time $O(kT)$, even in the on-line, adaptive case where data is continually revealed for the current time step. Motivated by private sketching techniques, we consider the setting where only a \emph{subset} of entries in $A^{(t)}$ need to be released at time step $t$. Our new result is that it is possible to sample any desired entry in a given noise vector in \emph{constant time} while reproducing exactly the distribution of the binary tree mechanism with Gaussian noise. The improvement on the known time bound of $O(\log T)$ comes from a new data structure that allows us to sample a new noise value with the correct correlations in constant time using Brownian bridges. We present two data management applications, of independent interest, that use our technique in conjunction with differentially private CountSketches: 1) A dynamic data structure for orthogonal range counting queries with a better privacy/accuracy/space trade-off than previous data structures, and 2) Join size estimation, where in addition we show improved high-probability bounds.

2606.11736 2026-06-11 cs.CR cs.DC cs.ET 新提交

MHOT: Height-Optimized Authenticated Data Structure for Blockchain State Commitment

MHOT:面向区块链状态承诺的高度优化认证数据结构

Sipeng Xie, Qianhong Wu, Minghang Li, Qiyuan Gao, Bo Qin, Qin Wang

AI总结 针对Merkle Patricia Trie树高增长及Nurgle攻击问题,提出MHOT,通过区分位索引实现自适应扇出和最小高度,并引入分层证明降低证明开销,在以太坊主网负载下实现9倍写吞吐量提升和0%攻击成功率。

详情
Comments
Usenix Sec'26
AI中文摘要

状态根计算占区块链区块处理时间的78%。以太坊的规范认证数据结构,即Merkle Patricia Trie(MPT),遭受严重的树高增长问题,并容易受到\textit{Nurgle攻击}(SP'24),其中攻击者通过哈希碰撞膨胀路径深度,以可忽略的成本降低系统性能。现有防御措施通过增加节点扇出(跨度)来限制树高,但更高的扇出会指数级增加证明大小。先前的工作使用向量承诺来缓解这种权衡,但代价是需要可信设置或昂贵的验证。我们提出\textsc{Mhot},一种用于区块链状态承诺的高度最优认证数据结构,它保留了基于哈希的标准验证,无需可信设置。与MPT的固定前缀索引(将跨度和扇出指数级耦合)不同,\textsc{Mhot}通过实际区分键的区分位进行索引,实现了具有线性扇出耦合的自适应跨度和可证明的最小高度。为了防止高扇出膨胀证明,我们引入了分层证明,一种两层Merkle结构,将每节点证明开销从O(k)降低到O(log k)。在以太坊主网负载下,\textsc{Mhot}相比MPT实现了高达9倍的写吞吐量、4倍低的写放大和2倍小的证明。在Nurgle攻击下,即使攻击者消耗了整个区块的gas预算,\textsc{Mhot}仍保持0%的攻击成功率(相比之下,MPT为99.97%)。我们的结果有些令人惊讶地表明,高度最优性(而非新的密码学原语!)是可扩展且抗攻击的区块链状态承诺的关键抽象。

英文摘要

State root computation dominates (78%) blockchain block processing time. Ethereum's canonical authenticated data structure, i.e., Merkle Patricia Trie (MPT), suffers from severe tree-height growth and is vulnerable to \textit{Nurgle attacks} (SP'24), where adversaries inflate path depth via hash collisions and degrade system performance at negligible cost. Existing defenses increase node fanout (span) to bound tree height, but higher span inflates proof size exponentially. Prior work mitigates this trade-off using vector commitments, at the cost of trusted setup or expensive verification. We present \textsc{Mhot}, a height-optimal authenticated data structure for blockchain state commitment that preserves standard hash-based verification without trusted setup. Unlike MPT's fixed-prefix indexing, which couples span and fanout exponentially, \textsc{Mhot} indexes by discriminative bits that actually distinguish keys, achieving adaptive span with linear fanout coupling and provably minimal height. To prevent high fanout from inflating proofs, we introduce hierarchical proofs, a two-layer Merkle construction that reduces per-node proof overhead from O(k) to O(log k). On Ethereum mainnet workloads, \textsc{Mhot} achieves up to 9X higher write throughput, 4X lower write amplification, and 2X smaller proofs than MPT. Under Nurgle attacks, even when the adversary consumes an entire block's gas budget, \textsc{Mhot} maintains a 0% attack success rate (v.s., 99.97% for MPT). Our results, somewhat surprisingly, show that height optimality (not new crypto primitives!) is the key abstraction for scalable and attack-resilient blockchain state commitment.

2606.11729 2026-06-11 cs.CR cs.NI 新提交

A VPN-as-a-Service Tailored Enabler for Computing-constrained Environments

面向计算受限环境的VPN即服务定制化使能器

Carolina Fernández-Martínez, César Cajas Parra, Shuaib Siddiqui

AI总结 提出一种云原生VPN即服务(VPNaaS),可动态编排为每个租户部署独立隧道,集成IAM工具,并适应计算或熵受限环境,支持RSA或椭圆曲线密钥算法选择。

详情
Comments
Proc. 2025 IEEE 11th International Conference on Network Softwarization (NetSoft), 2025
AI中文摘要

工业界已采用零信任(Zero Trust, ZT)架构原则和实现用于云原生环境,遵循更严格的安全要求,面向内部和外部租户。这些方法结合了细粒度身份管理和监控,用于清单编制和更好地分析设备安全态势以实现整体保护,同时通过严格关注点分离和隔离来强制执行最小权限。在网络方面,ZT方法也依赖隔离和最小权限;通过每个租户连接到给定基础设施的独立安全隧道来实现。此类实现也可应用于实验基础设施内部及与之的连接。在此意义上,本工作贡献了一种云原生VPN即服务(VPNaaS)的设计和评估,该服务可以:(i) 轻松编排以动态部署每个远程连接到基础设施的租户的独立隧道;(ii) 与常见的身份和访问管理(IAM)工具集成,这是ZT部署的关键;(iii) 适应计算或熵受限环境。该解决方案是可定制的,允许选择RSA或椭圆曲线(EC)作为密钥生成算法及其参数,以实现更安全的密钥并适应资源受限环境。

英文摘要

Industry has embraced Zero Trust (ZT) architectural tenets and implementations for cloud-native environments, following stricter security requirements to both internal and external tenants. Among others, these approaches combine fine-grained identity management and monitoring for both inventorying and better analysing the devices' security posture for overall protection, along with strict separation of concerns and isolation to enforce minimal privilege. Networking-wise, ZT approaches rely as well on isolation and least privilege; enacted by separate, secure tunnels per tenant connecting to a given infrastructure. Such implementations can also be applied to the connectivity within and towards experimental infrastructures. In this sense, this work contributes the design and evaluation of a cloud-native VPN-as-a-Service (VPNaaS) that can be (i) easily orchestrated to deploy on-the-fly, separate tunnels per each tenant remotely connecting to the infrastructure; (ii) integrated with common Identity and Access Management (IAM) tools, key to ZT deployments; and (iii) adapt to computing- or entropy- constrained environments. This solution is customisable and allows, among others, to select from RSA or Elliptic Curves (EC) as key generation algorithm and their parameters to achieve more secure keys and adapt to resource-constrained environments.

2606.11698 2026-06-11 cs.CR cs.AI 新提交

T2S: A Rehearsal-Based Approach for Extraction-Resistant Model Watermarking

T2S:一种基于排练的防提取模型水印方法

Jian-Ping Mei, Weibin Zhang, Ao Yao, Tiantian Zhu, Jie Xiao

AI总结 针对模型提取攻击,提出一种基于排练的水印嵌入框架,通过模拟提取过程并利用被盗模型在触发集上的损失微调水印知识,增强水印的迁移性和鲁棒性。

详情
AI中文摘要

模型水印通过嵌入独特知识来诱导独特行为特征,从而保护AI模型的知识产权。主要技术挑战在于确保水印对水印模型的各种后处理攻击具有鲁棒性。模型提取攻击是最严重的威胁,攻击者利用预测输出训练替代模型,非法复制原始模型的功能。在这项工作中,我们提出了一种基于排练的水印嵌入框架,以增强模型水印对模型提取攻击的鲁棒性。通过模拟提取过程,我们的方法利用\textit{模拟被盗模型}在触发集上的损失作为训练信号,微调目标模型中的水印知识。这个微调步骤鼓励水印以增强可迁移性的方式嵌入,从而增加其在被盗模型中持续存在并保持可检测的机会。在不同设置下进行的全面实验表明,所提出的方法显著提高了模型水印对模型提取和后续水印移除攻击的鲁棒性。

英文摘要

Model watermarking safeguards AI model intellectual property by embedding distinctive knowledge that induces unique behavioral signatures. The primary technical challenge lies in ensuring watermark robustness against various post-processing attacks on the watermarked model. Model extraction attacks emerge as the most severe threat, where adversaries exploit prediction outputs to train surrogate models that illegally replicate the original model's functionality. In this work, we propose a rehearsal-based watermark embedding framework to enhance the robustness of model watermarks against model extraction attacks. By simulating the extraction process, our method leverages the loss of a \textit{simulated stolen model} on a trigger set as a training signal to fine-tune the watermark knowledge within the target model. This fine-tuning step encourages the watermark to be embedded in a way that boosts transferability, thereby increasing its chances of persisting and remaining detectable in stolen models. Comprehensive experiments conducted under diverse settings demonstrate that the proposed method significantly improves the robustness of model watermarks against both model extraction and subsequent watermark removal attacks.

2606.11672 2026-06-11 cs.CR cs.AI 新提交

Can Open-Source LLM Agents Replace Static Application Security Testing Tools? An Empirical Assessment

开源LLM代理能否取代静态应用安全测试工具?一项实证评估

Derek Yohn, Luke Flancher, Mirajul Islam, Khaled Slhoub

AI总结 评估基于开源LLM的代理在静态应用安全测试中的性能,与SAST工具Bandit对比,发现当前不适合实际应用。

详情
Comments
Keywords: Agentic AI, Cybersecurity, Large Language Models, Static Application Security Testing, Model performance evaluation
AI中文摘要

本文探讨了代理式AI工具在网络安全领域的价值。我们评估了基于通用GenAI大语言模型(LLM)的代理在三种不同Ollama托管的通用开源模型驱动下的有效性。我们使用精确率、召回率、误报数以及基于捕获指标交互计算的综合得分,评估每个代理的性能,并与现有经过验证的静态应用安全测试(SAST)工具Bandit的基线性能进行比较。我们的研究结果驳斥了现代开源GenAI LLM代理在当前现实条件下适用于SAST扫描这一专门任务的看法。

英文摘要

This paper explores the value of agentic AI tools for cybersecurity purposes. We evaluate the efficacy of a general-purpose GenAI Large Language Model- (GenAI-) based agent when powered by three different Ollama-hosted general-purpose open source models. We assess each agent's performance using precision, recall, false positive count, and a calculated composite score based upon the interplay of the captured metrics, against the baseline performance of an existing, vetted Static Application Security Testing (SAST) tool, Bandit. Our findings refute the notion that a modern open-source GenAI LLM-based agent is currently suitable for the specialized task of SAST scanning under realistic conditions.

2606.11671 2026-06-11 cs.CR cs.AI 新提交

Runtime Skill Audit: Targeted Runtime Probing for Agent Skill Security

运行时技能审计:针对智能体技能安全的目标运行时探测

Tu Lan, Chaowei Xiao

AI总结 提出运行时技能审计(RSA)动态分析方法,通过目标运行时条件探测技能行为,在100个技能上达到90.0%准确率,优于静态基线。

详情
AI中文摘要

智能体技能让LLM智能体能够复用指令、资源、工具和工作流,但也为恶意行为提供了新的隐藏场所。一个技能在其文档或代码中可能看起来无害,但只有在与特定用户请求、本地资产、持久状态或多步骤工具交互调用时才会变得有害。这使得纯静态审查变得脆弱。我们提出运行时技能审计(RSA),一种动态分析方法,通过询问技能介导的智能体在目标运行时条件下实际做了什么来审计技能。RSA不是用相同的通用任务测试每个技能,而是分析风险相关接口,准备执行上下文以触发这些接口,并根据产生的跟踪证据分配安全标签。我们在OpenClaw上实现RSA,并在100个技能上针对代表性静态基线进行评估。RSA达到90.0%的准确率,88.0%的真阳性率和8.0%的假阳性率,比最佳静态基线提高13.0个百分点。在自进化攻击下,静态检测器在一两轮后崩溃,而RSA在每轮中持续检测出19-20个恶意技能。

英文摘要

Agent skills let LLM agents reuse instructions, resources, tools, and workflows, but they also create a new place for malicious behavior to hide. A skill may look benign in its documentation or code while becoming harmful only when it is invoked with particular user requests, local assets, persistent state, or multi-step tool interactions. This makes purely static vetting brittle. We present Runtime Skill Audit (RSA), a dynamic analysis method that audits skills by asking what the skill-mediated agent actually does under targeted runtime conditions. Instead of testing every skill with the same generic tasks, RSA profiles risk-relevant interfaces, prepares the execution context needed to exercise them, and assigns security labels from the resulting trace evidence. We instantiate RSA on OpenClaw and evaluate it on 100 skills against representative static baselines. RSA achieves 90.0\% accuracy with an 88.0\% true positive rate and an 8.0\% false positive rate, improving accuracy by 13.0 percentage points over the best static baseline. Under self-evolving attacks, static detectors collapse after one or two rounds, while RSA continues to detect 19--20 out of 20 malicious skills across rounds.

2606.11667 2026-06-11 cs.CR 新提交

A Robust Framework for Sybil Attack Detection in Vehicular Ad Hoc Networks

车载自组织网络中女巫攻击检测的鲁棒框架

Md. Sadmin Tahmid Khan, Md. Saim Ahmmed Utsho, Mosarrat Jahan

AI总结 提出一种利用GPS数据和DBSCAN聚类的鲁棒框架,通过构建精确轨迹和自动参数选择,在稀疏和密集区域分别降低68%和70%的误报率,同时大幅降低检测时间。

详情
AI中文摘要

女巫攻击通过使用虚假身份制造交通拥堵的假象,破坏了车载自组织网络(VANETs)的可靠和安全运行。现有的检测机制难以有效处理女巫攻击,因为它们(i)由于女巫车辆和合法车辆的轨迹重叠,容易产生高误报率(FPR),(ii)由于需要手动校准地面数据,不适合实际部署,(iii)由于严重依赖路边单元(RSU)和车辆的存在,在稀疏分布下效果不佳,以及(iv)由于计算开销而效率低下。本文针对这些缺陷,提出了一种鲁棒的框架来解决这些问题。该方案通过利用GPS位置数据,构建更准确且可区分的轨迹,从而降低FPR。此外,它采用DBSCAN聚类来识别女巫车辆,实现了无监督的参数选择。GPS数据消除了对RSU和车辆的依赖,使得该方案在稀疏和密集区域都有效。此外,该方案轻量级且在不同容量的车辆上保持一致。实验结果表明,该方案在密集区域将FPR降低了约68%,在稀疏区域降低了70%。此外,在稀疏区域将假阴性率(FNR)降低了67%,并在密集和稀疏区域都实现了与现有方法相当的检测率。另外,该方案在密集区域将检测时间减少了近80%,在稀疏区域减少了43%。

英文摘要

Sybil attacks create an illusion of traffic congestion by utilizing fake identities, which undermines the reliable and safe operation of vehicular ad hoc networks (VANETs). Existing detection mechanisms struggle to effectively handle Sybil attacks as they are (i) susceptible to high false positive rates (FPR) due to the overlapping trajectories of both Sybil and legitimate vehicles, (ii) not practical for real-world deployment due to manual calibrations with ground data, (iii) ineffective for sparse distribution of roadside units (RSUs) and vehicles as they depend heavily on the presence of both, and (iv) inefficient due to computational overheads. This paper addresses these shortcomings and proposes a robust framework to tackle these issues. The proposed scheme reduces the FPR by utilizing GPS location data, enabling the construction of more accurate and distinguishable trajectories. Besides, it employs DBSCAN clustering to identify Sybil vehicles, facilitating unsupervised parameter selection. GPS data eliminates the dependency on RSUs and vehicles, making this scheme effective in both sparse and dense regions. Additionally, the proposed scheme is lightweight and consistent across vehicles with heterogeneous capacities. Experimental results demonstrate that the proposed scheme reduces the FPR by approximately 68% in dense regions and 70% in sparse areas. Furthermore, it lowers the false negative rate (FNR) by 67% in the sparse region and achieves a competitive detection rate compared to the existing methods in both dense and sparse regions. Additionally, the proposed scheme decreases the detection time by almost 80% in dense regions and 43% in sparse ones.