arXivDaily: arXiv daily academic digest, updated Monday through Friday
2606.19081 2026-06-18 q-bio.NC cs.HC New submission

Retrieval-Based Brain Decoding by Alignment, not Complexity

Matteo Ciferri, Matteo Ferrante, Nicola Toschi

AI summary: Experiments across multiple datasets show that linear contrastive decoders outperform ridge regression and standard non-linear methods in brain decoding, indicating that decoding gains come more from the training objective than from architectural complexity.

Abstract

A prominent theory in cognitive science suggests that concepts in the brain are organized as high-dimensional vectors, with semantic meaning captured by directions and relative angles in this space. Brain decoding is the effort of reconstructing or retrieving stimuli (or their representations) from neural activity and involves finding a function that approximates how the brain represents concepts. This motivates the investigation of contrastive objectives as biologically plausible candidates to reverse the brain loss function. In this work, we study how functional MRI (fMRI) activity can generally be mapped with the embedding spaces of foundation models in vision, language, and audio. Although neural computations are highly non-linear at the microscale, fMRI measurements average signals across space and time, further smoothed by noise, effectively linearizing the observable representation. Consistent with these views, our experiments across multiple datasets demonstrate that linear contrastive decoders consistently outperform ridge regression and standard non-linear alternatives, and that these results generalize across images, text, and sound. These findings indicate that decoding gains arise more from the choice of training objective than from architectural complexity, pointing to contrastive-linear models as a principled strategy for brain decoding.
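The difference between an angle-based contrastive objective and a regression objective can be sketched in a few lines. This is a minimal illustration rather than the authors' implementation: the InfoNCE form, the cosine similarity, the `temperature` value, and every function name are assumptions, since the abstract does not specify them.

```python
import math

def cosine(u, v):
    """Cosine similarity between two vectors given as lists of floats."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def linear_decode(W, x):
    """Apply a linear map W (one row per embedding dimension) to one fMRI vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def info_nce(preds, targets, temperature=0.1):
    """Mean InfoNCE loss: each prediction should be closer (in angle) to its
    own target than to the other targets in the batch."""
    total = 0.0
    for i, p in enumerate(preds):
        logits = [cosine(p, t) / temperature for t in targets]
        m = max(logits)  # subtract the max for a stable log-sum-exp
        log_z = m + math.log(sum(math.exp(l - m) for l in logits))
        total += log_z - logits[i]
    return total / len(preds)

def mse(preds, targets):
    """Mean squared error, the data term that ridge regression minimizes."""
    n = sum(len(p) for p in preds)
    return sum((a - b) ** 2 for p, t in zip(preds, targets) for a, b in zip(p, t)) / n

# Hypothetical toy data: the decoder gets every direction right but the scale wrong.
W = [[5.0, 0.0], [0.0, 5.0]]
voxels = [[1.0, 0.0], [0.0, 1.0]]
targets = [[1.0, 0.0], [0.0, 1.0]]
preds = [linear_decode(W, x) for x in voxels]
```

On this toy batch the regression error is large because the predictions are five times too long, while the contrastive loss is close to zero because only the angles matter for retrieval.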

2606.18969 2026-06-18 stat.ME cs.MS stat.ML New submission

Balanced Twins: Causal Inference on Time Series with Hidden Confounding

Ouali Maha, Ghattas Badih, Flachaire Emmanuel, Charpentier Philippe, Bozzi Laurent

AI summary: Proposes a neural framework that jointly learns low-dimensional latent representations of individual time series and propensity scores, recovers counterfactuals through flexible matching, and estimates the average treatment effect for the treated; it suits settings with staggered interventions and hidden confounding.

Abstract

Accurately estimating treatment effects in time series is essential for evaluating interventions in real-world applications, especially when treatment assignment is biased by unobserved factors. In many practical settings, interventions are adopted at different times across individuals, leading to staggered treatment exposure and heterogeneous pre-treatment histories. In such cases, aggregating outcome trajectories across treated units is ill-defined, making individual treatment effect (ITE) estimation a prerequisite for reliable causal inference. We therefore study the problem of estimating the average treatment effect for the treated (ATT) by first recovering individual-level counterfactuals. We introduce a neural framework that simultaneously learns low-dimensional latent representations of individual time series and propensity scores. These estimates are then used to approximate the individual treatment effects through a flexible matching procedure that avoids the classical convexity constraints commonly used in synthetic control methods. By operating at the individual level, our approach naturally accommodates staggered interventions and improves counterfactual estimation under latent bias, without relying on explicit temporal modeling assumptions. We illustrate our approach on both real-world energy consumption data and clinical time series, including high-frequency electricity demand-response programs and semi-synthetic data for individuals in intensive care units (ICUs), where hidden confounding, staggered treatment adoption, and non-stationary dynamics are prevalent.

2606.18574 2026-06-18 econ.TH cs.GT New submission

Stable and Fair Random Allocations in a Two-Sided Discrete-Concave Market

Kenzo Imamura, Yasushi Kawase

AI summary: Addressing the stability and fairness problems of random allocations in two-sided environments, the paper uses discrete concave (M$^\natural$-concave) valuations to prove that an ex ante stable and fair allocation exists and, via a generalization of the Birkhoff–von Neumann theorem, decomposes ex ante stable fractional allocations into lotteries over stable deterministic allocations.

Comments: Appears in the Twenty-Seventh ACM Conference on Economics and Computation (EC'26)

Abstract

Random allocations are widely used to handle ties and indifferences in two-sided environments. In such environments, commonly used procedures such as random tie-breaking may fail to ensure stability and fairness from an ex ante perspective. We show that when agents have discrete concave (M$^\natural$-concave) valuations, there exists an ex ante stable and fair allocation. To establish this result, we relate our framework to the model of stability introduced by Alkan and Gale. In particular, we show that ex ante stable and fair fractional allocations are exactly characterized as Alkan–Gale stable outcomes under choice functions induced from concave closures together with a symmetric strictly convex tie-breaking rule. We further prove that any ex ante stable fractional allocation can be decomposed into a lottery over stable deterministic allocations, using a generalization of the Birkhoff–von Neumann theorem. Finally, we study a setting that does not rely on cardinal valuations and instead assumes ordinal preferences. Within this ordinal framework, we establish the existence of an ex ante stable and fair fractional allocation. This setting is formulated within the matching-with-contracts framework under matroid constraints. The resulting class includes existing models, such as one-to-many random allocation with responsive choice correspondences, and captures a wide range of applications, including controlled school choice with lotteries.

2606.18536 2026-06-18 stat.AP cs.SE New submission

Analytics for Quality Assurance for Item Pools (AQuAP): Monitoring and Maintaining Item Bank Health in AI-Driven Assessment Systems

Alina A. von Davier, Xiaowan Zhang, Yigal Attali, Yena Park, Jacqueline Church, Andrew Runge, Geoff T. LaFlair, Alexander Tsigler

AI summary: Presents the AQuAP dashboard environment, which monitors item bank quality through metrics such as Effective Bank Size and supports large-scale item development that combines automation with human input, to keep item banks healthy in high-stakes testing.

Comments: 11 pages, 4 figures

Abstract

The large-scale digitization of educational assessment has made the continuous oversight of item banks both essential and complex. This paper presents Analytics for Quality Assurance for Item Pools (AQuAP), a dashboard environment for monitoring item quality and item bank health. AQuAP supports the operational implementation of the large-scale item generation procedures for high-stakes tests that are part of the Item Factory, a framework for automated and human-supported test development. The paper describes AQuAP in relation to the process of item development, outlines the broader metric framework for item-pool quality assurance, and highlights the Effective Bank Size (EBS) as one central indicator of pool vitality. EBS quantifies how many independent test sessions can be constructed before content repetition occurs and, when coupled with exposure and usage metrics, provides insight into item bank security, diversity, and efficiency. We further introduce bank-health metrics, such as maximum exposure, maximum conditional exposure, adjusted effective bank size, and the rarely-administered fraction, all of which extend this picture of item utilization. AQuAP illustrates how operational analytics can translate psychometric concepts into quality assurance tools for high-volume, AI-enabled testing programs. This work is illustrated with the Duolingo English Test (DET) processes.

2606.19324 2026-06-18 cs.GT New submission

Mean-Payoff-Parity and Lifting Strategies from MDPs to 2-Player Stochastic Games

Mohan Dantam, Richard Mayr

AI summary: Studies the complexity of optimal strategies in 2-player zero-sum stochastic games, proves that exponential memory is necessary when lifting strategies from MDPs, and settles the strategy complexity of the mean-payoff-parity objective, showing that randomized strategies need linear memory.

Comments: Full version of a paper presented at CONCUR 2026

Abstract

We consider the strategy complexity (i.e., memory and randomization) of optimal strategies in turn-based 2-player zero-sum stochastic games. Results in [Gimbert,Kelmendi:2023] show how to lift optimal memoryless strategies for shift-invariant inverse-submixing objectives from MDPs to 2-player stochastic games with an exponential increase in the number of memory modes. We show the corresponding lower bound, i.e., the extra exponential memory is required in general, even for randomized strategies. Moreover, we solve the strategy complexity of the well-studied mean-payoff-parity objective in 2-player stochastic games. This objective is also shift-invariant inverse-submixing, but easier than the worst case for this class. In MDPs, Maximizer has optimal memoryless randomized strategies, while optimal deterministic strategies require exponential memory. However, in stochastic games, optimal randomized strategies require, at least and at most, linear memory (equal to the number of even colors). Finally, we show that a different construction for lifting memoryless (resp. finite-memory) deterministic strategies from MDPs (resp. 1-player games) to 2-player games cannot be generalized even to memoryless randomized strategies. We construct a shift-invariant objective where Max and Min each have optimal memoryless randomized strategies in all MDPs, but optimal (randomized) Max strategies still require infinite memory in deterministic 2-player games.

2606.19312 2026-06-18 cs.PL New submission

QDSV: A Semantic Problem Representation and Multi-Backend Execution Framework for Quantum-Oriented Computation

Jaime Alexander Jimenez Lozano, Sebastian Jimenez Giraldo

AI summary: Proposes the QDSV framework, which uses a semantic representation to separate the problem specification from its backend realization, supports execution modes that are not authored as circuits, and validates stable execution across simulators and hardware in an EEG classification case study.

Comments: 12 pages, 1 figure, 6 tables

Abstract

Predicate-based computation over state spaces separates a problem specification from the backend that realizes it. Building on the model introduced in arXiv:2606.15027, this paper studies QDSV as a semantic, multi-backend execution framework for quantum-oriented computation. We describe how QDSV, QIntent, and Qruba connect declarative problem intent to a structured semantic representation, realize that representation under heterogeneous backend constraints, and report execution trace outputs that separate model-level semantic outputs from backend-specific observations. The framework supports execution modes that do not require the original problem to be authored as a circuit, while still allowing circuit-compatible artifacts when required. As a case study, we evaluate EEG ictal/interictal classification using prepared signal features from the Bonn and Delhi datasets. The study compares classical machine-learning references, a circuit-first variational quantum classifier baseline, QDSV simulator executions, and controlled IBM Quantum hardware runs. The paper does not claim general quantum advantage or superiority over classical machine learning. Its contribution is a semantic execution validation showing how a problem-first representation can remain stable across simulator and hardware realizations while retaining interpretable execution trace outputs.

2606.19311 2026-06-18 eess.SY cs.SY New submission

A Lyapunov-Based Perspective on Absolute Stability

Tessina H. Scholl

AI summary: Proposes a Lyapunov-like explanatory framework for the nonscalar circle criterion and its small-gain and strict-passivity special cases, avoids strict definiteness conditions by strengthening the sector constraint, and derives quadratic solutions via an LMI, an algebraic Riccati equation, and a matrix equation.

Comments: 16 pages, 7 figures; preprint version of a manuscript accepted for at - Automatisierungstechnik (special issue 08/2026)

Abstract

This article presents a unifying perspective on absolute stability concepts. In particular, it develops a Lyapunov-like explanatory framework for a nonscalar circle criterion with its small-gain and strict-passivity special cases. To this end, a general defining inequality for a Lyapunov-like function is proposed that avoids strict definiteness conditions, enabled by a strengthening of the sector constraint. We discuss different ways to derive a quadratic solution: via a linear matrix inequality (LMI), an algebraic Riccati equation, and a matrix equation. By exploiting the Kalman-Yakubovich-Popov (KYP) lemma, classical frequency-domain results are recovered. A passivity-index-based result is derived that simplifies the evaluation. Overall, the presented interrelations may be useful for both analysis and teaching.

2606.19271 2026-06-18 cs.DC New submission

TurboServe: Serving Streaming Video Generation Efficiently and Economically

Youhe Jiang, Haoxu Wang, Haotong Bao, Kai Jiang, Jianfei Chen, Jun Zhu, Fangcheng Fu, Jintao Zhang

AI summary: To handle heterogeneity in session duration and user demand in streaming video generation, proposes the TurboServe system, which jointly optimizes session placement and GPU provisioning through online scheduling, using migration-aware placement and load-driven autoscaling to reduce latency and cost.

Abstract

Streaming video generation is emerging as a new serving workload in which users interact with long-lived sessions that generate video progressively, chunk by chunk. Unlike offline video generation or typical LLM serving, streaming video generation must preserve session state across active and idle periods, repeatedly schedule ongoing sessions, and deliver each chunk under a tight latency target. This creates two key serving challenges in multi-user, multi-GPU environments: session duration heterogeneity, where long-running sessions make placement decisions suboptimal over time, and temporal user-demand heterogeneity, where the number of active sessions fluctuates sharply across bursts and idle periods. We present TurboServe, the first serving system designed specifically for streaming video generation workloads. TurboServe formulates serving as an online scheduling problem that jointly coordinates session placement and GPU provisioning. Its closed-loop scheduling algorithm combines a migration-aware placement controller, which rebalances sessions across GPUs to reduce the maximum per-chunk latency, with a load-driven autoscaling controller, which adapts the GPU budget to workload variation for improved cost efficiency. To support these decisions at runtime, TurboServe implements coalesced chunk processing for batching concurrent active sessions on the same GPU, GPU-CPU offloading for session suspension and resumption, and NCCL-based GPU-GPU migration for online rebalancing. We evaluate TurboServe on real-world production traces from Shengshu Technology across multiple model sizes and GPU clusters with up to 64 NVIDIA B300 GPUs. Compared with baseline serving configurations, TurboServe reduces worst-case per-chunk latency by 37.5% and total GPU operating cost by 37.2% on average. Our code is publicly available at https://github.com/shengshu-ai/TurboServe.

2606.19263 2026-06-18 cs.SI cs.CY cs.MA econ.GN q-fin.EC New submission

Digital Speech Acts Retain Control of Copyright with People, Not Platforms

James Golike, Ehud Shapiro

AI summary: Introduces the notion of a "digital speech act", in which a person cryptographically signs content with their own private key on their own device and thereby establishes attribution, accountability, and authorship; argues that this act qualifies for protection under U.S. copyright law, keeps control of the content with the person, and lays a foundation for digital sovereignty and democratic self-governance.

Abstract

Legal precedents protect computer code as copyrightable expression. They have enabled centralized digital platforms, which operate from corporate servers that hold all user data, to construct private governance regimes through the interaction of copyright, contract, and technical architecture: people who create virtually all platform value must surrender effective copyright control through Terms of Service agreements as a condition of participation. In contrast, grassroots platforms consist of cryptographically identified people operating their networked smartphones independently of any server or global resource; each person holds their own data on their own device, with no third party in possession or intermediation. Here, we define the notion of a digital speech act: a deliberate volitional act by a person of cryptographically signing personal content with the person's private key, carried out on the person's own device, through which the person simultaneously establishes attribution, accountability, and authorship over the signed content. We contend that (a) digital speech acts qualify for copyright protection under existing U.S. precedent: Burrow-Giles locates authorship in volitional creative choices despite mechanical or algorithmic processes, Feist supplies the minimal-creativity threshold, and persistent device storage satisfies the Copyright Act's fixation requirement; (b) the digital social contract underlying grassroots platforms preserves this copyright by design (signed content cannot be unbundled from its signature, and the full provenance chain accumulates as content is forwarded), so that ownership and possession coalesce in the person; and (c) copyright in digital speech acts is a prerequisite for digital sovereignty and democratic self-governance.

2606.19242 2026-06-18 cs.SE New submission

Runtime Compliance Verification for AI Agents

Nafiseh Kahani, Masoud Barati, Diana Addae

AI summary: Proposes the C-Trace framework, which uses runtime monitoring and formal policy predicates to keep AI agents compliant with GDPR rules during tool calls and dialogue, holding the attack success rate at or below 12 percent.

Abstract

AI agents now handle personal data through tool use, function calls, and multi-turn dialogue, which can create obligations under the General Data Protection Regulation (GDPR). Current testing practices rely mainly on offline red teaming or static prompt review, which do not guarantee at runtime that agent behavior follows regulatory rules. We propose C-Trace (Compliance Trace based Runtime Agent Conformance Enforcement), a verification framework that: (i) expresses a subset of GDPR requirements, including consent, purpose limitation, data minimization, and the right to erasure, as formal policy predicates over agent execution traces; (ii) uses a runtime monitor that intercepts every tool invocation and model output and rejects non-compliant actions; and (iii) tests the agent with attack dialogues, including DSPy-generated prompts and verbatim prompts from red-teaming corpora, that try to induce violations. We evaluate the framework on four case studies reframed to GDPR. Under 10 percent per-category extractor noise, including drop-out and over-typing, the monitor keeps the attack success rate (ASR) at or below 12 percent, lower than the baselines we compare against, and the false-positive rate at or below 16 percent; under perfect extraction it reaches 0 percent ASR.
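The idea of policy predicates evaluated by a runtime monitor over an execution trace can be sketched as follows. This is an illustrative toy under stated assumptions, not C-Trace itself: the `Action` and `Trace` types, the three predicate functions, and the `ALLOWED_FIELDS` table are all hypothetical names, and the abstract does not describe how the real framework encodes its predicates.

```python
from dataclasses import dataclass, field

@dataclass
class Action:
    tool: str            # name of the tool the agent wants to invoke
    purpose: str         # declared processing purpose
    fields: frozenset    # personal-data fields the call would touch

@dataclass
class Trace:
    consented_purposes: set = field(default_factory=set)
    erased_fields: set = field(default_factory=set)
    actions: list = field(default_factory=list)

# Hypothetical data-minimization table: fields each purpose may touch.
ALLOWED_FIELDS = {"shipping": {"name", "address"}}

def consent_ok(trace, action):
    """Consent and purpose limitation: the purpose must have been consented to."""
    return action.purpose in trace.consented_purposes

def erasure_ok(trace, action):
    """Right to erasure: erased fields must not be touched again."""
    return not (action.fields & trace.erased_fields)

def minimization_ok(trace, action):
    """Data minimization: only fields needed for the purpose are allowed."""
    return action.fields <= ALLOWED_FIELDS.get(action.purpose, set())

POLICIES = [consent_ok, erasure_ok, minimization_ok]

def monitor(trace, action):
    """Intercept one action; record it if compliant, otherwise reject it
    and report the names of the violated predicates."""
    violated = [p.__name__ for p in POLICIES if not p(trace, action)]
    if violated:
        return False, violated
    trace.actions.append(action)
    return True, []
```

Because rejected actions never enter the trace, later predicates only ever reason about behavior that was allowed to happen, which is one simple way to keep the checks composable.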

2606.19235 2026-06-18 cs.CR New submission

CodeSentinel: A Three-Layer Defense Against Indirect Prompt Injection in Code Contexts

Po-Han Cheng, Chia-Mu Yu, Ying-Dar Lin, Yu-Sung Wu, Wei-Bin Lee

AI summary: Against the indirect prompt injection attacks that code LLMs face when retrieving external code, proposes CodeSentinel, a three-layer inference-time sanitizer that combines syntax-guided pre-filtering, CST-guided Dynamic Min-K% scoring, and node perturbation analysis; it reaches an average node-level F1 of 0.80, outperforming existing methods.

Abstract

Code large language models increasingly retrieve external code context from repositories, documentation, issue threads, and coding-agent environments, creating an indirect prompt-injection surface where attackers hide instructions in comments, strings, identifiers, or decoy code. We propose CodeSentinel, a three-layer inference-time sanitizer. It uses Tree-sitter to extract high-risk model-facing CST nodes, then combines syntax-guided pre-filtering, CST-guided Dynamic Min-K% scoring, and node perturbation analysis to detect adversarial and natural-looking semantic triggers. Detected nodes are removed or neutralized before reaching the downstream Code LLM. Across six recent attack families, CodeSentinel achieves 0.80 average node-level F1, outperforming CodeGarrison, DePA, and KillBadCode.
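Min-K% scoring in its plain form averages the log-probabilities of the least likely K percent of tokens in a span. The sketch below shows only that baseline statistic applied per node; the "Dynamic" and CST-guided parts are the paper's own and are not reproduced here. The node names, the log-probability values, the threshold, and the choice to flag low scores are hypothetical assumptions for illustration.

```python
def min_k_score(token_logprobs, k_percent=20):
    """Average log-probability of the k_percent least likely tokens.
    Lower scores mean the span contains tokens the model found surprising."""
    # Ceiling division in integers, with at least one token selected.
    n = max(1, (len(token_logprobs) * k_percent + 99) // 100)
    lowest = sorted(token_logprobs)[:n]
    return sum(lowest) / n

def flag_nodes(node_logprobs, k_percent=20, threshold=-3.0):
    """Return the names of nodes whose Min-K% score falls below the threshold
    (assumed direction: unusually surprising nodes are treated as suspicious)."""
    return sorted(
        name
        for name, logprobs in node_logprobs.items()
        if min_k_score(logprobs, k_percent) < threshold
    )

# Hypothetical per-node token log-probabilities from some code LLM.
nodes = {
    "comment": [-0.2, -6.0, -7.5, -0.1, -0.3],
    "identifier": [-0.1, -0.4, -0.2],
}
suspicious = flag_nodes(nodes, k_percent=40)
```

In a real pipeline the log-probabilities would come from the model being protected, and the flagged nodes would feed the removal or neutralization step the abstract mentions.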

2606.19216 2026-06-18 cs.SE cs.HC New submission

No Two Developers Think Alike: How Problem-Solving Styles and Experience Shape Needs in Conversational Interaction with Copilot

Jonan Richards, Bruno Alves de Oliveira, Iury Oliveira, Igor Wiese, Mairieli Wessel

AI summary: Through a mixed-methods think-aloud study, identifies 5 interaction modes and 10 needs, builds a conceptual model, and shows how cognitive diversity shapes developers' interactions with GitHub Copilot.

Comments: Accepted at the International Conference on Software Maintenance and Evolution (ICSME), 2026

Abstract

Conversational LLM-based "programming assistants" provide a range of benefits to developers. However, recent studies demonstrate the variety in individual developers' needs regarding programming assistants, and challenges encountered by only specific groups of developers. In this study, we explore the role of cognitive diversity in shaping interactions with GitHub Copilot chat. Through a mixed-methods think-aloud study with 27 professional developers and students, we characterize 5 distinct "interaction modes" and 10 underlying needs in developers' interactions, forming a conceptual model. We characterize links between these modes, needs, and developers' problem-solving styles and experience profiles, showing how cognitive diversity may shape developers' interactions. We provide insights and recommendations for researchers and practitioners on how to design, research, and employ programming assistants to better account for diverse developer needs.

2606.19210 2026-06-18 cs.DS New submission

Guarded Epoch Bloom Filters for Sliding-Window Membership

Faruk Alpay, Levent Sarioglu

AI summary: Proposes guarded epoch Bloom filters, which partition the bit budget, rotate epochs, and add a guard epoch to give deterministic live coverage and bounded stale retention within a sliding window; at 14 bits per live item, the median synthetic false-positive rate falls from 0.191 to 0.02225 with zero measured live-key false negatives.

Comments: 6 pages, 1 table; ancillary files provided with code, tests, experiment scripts, CSV outputs, and validation logs

Abstract

Approximate membership queries in streams often need recent-window semantics rather than membership over all items ever seen. This paper studies guarded epoch Bloom filters, a sliding-window alternative to counting and stable Bloom filters. The structure partitions a fixed bit budget into rotating epochs, inserts only into the current epoch, clears whole segments at epoch boundaries, and keeps one additional guard epoch. This guard yields a deterministic live-window invariant: every item inserted in the last W positions remains represented, while rotation-induced stale retention is bounded by one epoch beyond the target window. We give the construction, prove its live-coverage and bounded-staleness properties, derive a false-positive approximation, and include a blocked variant that improves locality by confining probes to one block per epoch. Experiments cover 225 synthetic streaming configurations and 45 configurations from a timestamp-ordered web-server access-log stream. At 14 bits per live item, the guarded epoch filter reduces median synthetic false positives from 0.191 for a four-bit counting Bloom baseline to 0.02225 while preserving zero measured live-key false negatives. The method is not a replacement for exact deletion; it targets systems where bounded stale positives are acceptable but false negatives inside the live window are not.
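The construction described above (rotating epoch segments, inserts into the current epoch only, whole-segment clearing, one extra guard epoch) can be sketched as follows. This is a minimal reading of the abstract, not the authors' code: the class name, the parameters, the use of `hashlib.blake2b` for probes, and the choice to measure epochs by insertion count are assumptions.

```python
import hashlib

class GuardedEpochBloom:
    def __init__(self, window, epochs, bits_per_segment, num_hashes):
        if window % epochs:
            raise ValueError("window must be a multiple of epochs")
        self.epoch_len = window // epochs
        self.num_segments = epochs + 1  # the extra segment is the guard epoch
        self.m = bits_per_segment
        self.k = num_hashes
        self.segments = [bytearray((self.m + 7) // 8) for _ in range(self.num_segments)]
        self.t = 0  # number of insertions so far (stream position)

    def _probes(self, item):
        """Yield k bit positions for an item, one salted hash per probe."""
        data = repr(item).encode()
        for i in range(self.k):
            h = hashlib.blake2b(data, digest_size=8, salt=i.to_bytes(2, "little"))
            yield int.from_bytes(h.digest(), "little") % self.m

    def insert(self, item):
        epoch = self.t // self.epoch_len
        seg = self.segments[epoch % self.num_segments]
        if self.t % self.epoch_len == 0:
            seg[:] = bytes(len(seg))  # clear the whole segment at an epoch boundary
        for p in self._probes(item):
            seg[p >> 3] |= 1 << (p & 7)
        self.t += 1

    def __contains__(self, item):
        probes = list(self._probes(item))
        return any(
            all(seg[p >> 3] & (1 << (p & 7)) for p in probes)
            for seg in self.segments
        )
```

With `epochs + 1` segments, the oldest retained epoch always starts before the live window does, so the last `window` insertions are never cleared; the price is that an expired item can linger for up to one extra epoch.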

2606.19191 2026-06-18 cs.CR New submission

PhantomSkill: Malicious Code Injection in Agent Skill Ecosystems

Yu-Ting Lin, Chia-Mu Yu

AI summary: Presents the PhantomSkill attack framework, whose VulMask technique hides malicious behavior in a skill's auxiliary resources and uses vulnerability-shaped implementations to evade detection, preserving benign utility while reducing warning and malware-level detection rates.

Abstract

Agent skills allow LLM-based coding agents to acquire domain-specific capabilities from third-party packages, but they also introduce a new supply-chain attack surface. We present PhantomSkill, an attack framework that hides malicious behavior in a skill's auxiliary resources rather than in its textual description. Its core technique, VulMask, rewrites overt malicious scripts into vulnerability-shaped implementations whose malicious behavior is activated only under attacker-controlled trigger conditions. This design shifts the visible signal from explicit malicious intent to ordinary-looking insecure code. Across representative host skills, attack goals, coding agents, generation models, and automated reviewers, VulMask preserves benign utility while reducing warning and malware-level detection compared with overt malicious scripts. Our results show that skill ecosystems require resource-level vetting, execution-time containment, and security policies that treat exploitable vulnerabilities in agent skills as potential malicious payloads.

2606.19169 2026-06-18 cs.GR cs.SY eess.SY New submission

RespGeomLib: A Reproducible Parametric Engine for Generating Analysis-Ready Human Airway Lumen Geometry

Nichula Wasalathilaka, Parakrama Ekanayake, Roshan Godaliyadda

AI summary: Presents RespGeomLib, a reproducible parametric engine driven by YAML specifications that generates seamless airway lumen surfaces through port-based assembly and implicit smooth-min blending, avoids global voxelization, yields quantitatively cleaner bifurcations at lower cost, and supports morphometry-guided generation and CFD simulation.

Comments: Accepted for publication at 2026 IEEE MERCon

Abstract

CT-derived airway models support pulmonary morphometry and airflow simulation, but are often limited by distal scan resolution and the need for substantial cleanup near bifurcations. Procedural alternatives are reproducible, yet many rely on stitched tubular primitives that introduce non-smooth junctions and poorly defined open boundaries. We present RespGeomLib, a reproducible parametric engine for generating analysis-ready human airway lumen surfaces from compact YAML specifications. The framework combines port-based assembly with implicit smooth-min junction blending to produce seamless junctions, while avoiding full-tree voxelization through analytic segments and local implicit extraction around bifurcations. Quantitatively, RespGeomLib yields cleaner junctions than a Boolean/stitch baseline and is substantially faster and more memory-efficient than whole-tree global implicit extraction. We further demonstrate morphometry-guided tree generation, controlled synthetic airway variants, and CFD-ready export with stable airflow simulation. RespGeomLib targets biomedical workflows requiring reproducible morphometry, controlled synthetic variants, and simulation-ready lumen geometry. The code is publicly available at https://nichula01.github.io/Respgeomlib/
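Smooth-min blending of implicit surfaces can be illustrated with two capsule-shaped branches joined at a bifurcation. The sketch uses the widely known polynomial smooth minimum; whether RespGeomLib uses this exact variant is not stated in the abstract, and the function names, branch coordinates, radii, and blend width `k` are hypothetical.

```python
import math

def capsule_sdf(p, a, b, radius):
    """Signed distance from point p to a capsule with axis a->b (negative inside)."""
    ab = [bi - ai for ai, bi in zip(a, b)]
    ap = [pi - ai for ai, pi in zip(a, p)]
    denom = sum(c * c for c in ab)
    # Parameter of the closest point on the axis, clamped to the segment.
    t = 0.0 if denom == 0 else max(0.0, min(1.0, sum(x * y for x, y in zip(ap, ab)) / denom))
    closest = [ai + t * c for ai, c in zip(a, ab)]
    return math.dist(p, closest) - radius

def smooth_min(d1, d2, k):
    """Polynomial smooth minimum; equals min(d1, d2) when |d1 - d2| >= k
    and rounds the crease between the two surfaces otherwise."""
    h = max(k - abs(d1 - d2), 0.0) / k
    return min(d1, d2) - h * h * k / 4.0

def junction_sdf(p, k=0.4):
    """Blend a parent branch and one child branch into a single lumen field."""
    parent = capsule_sdf(p, (0.0, 0.0, 0.0), (0.0, 0.0, 4.0), 1.0)
    child = capsule_sdf(p, (0.0, 0.0, 4.0), (3.0, 0.0, 7.0), 0.7)
    return smooth_min(parent, child, k)
```

Away from the junction the blended field coincides with the nearer branch, so only a local region around the bifurcation needs implicit surface extraction, which matches the motivation for avoiding whole-tree voxelization.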

2606.19167 2026-06-18 cs.SE New submission

Teaching Software Engineering with LLM and MCP Integration: From Classroom to Industry Practice

Kehui Chen, Jacky Keung, Weining Li, Xiangbing Shao, Yishu Li, Xiaoxue Ma

AI summary: Integrates LLMs and MCP into a collaborative teaching model for software engineering, embedding LLM- and MCP-driven tools into teaching, code assistance, and engineering simulations to bridge the gap between traditional instruction and industrial workflows and to improve students' programming, problem-solving, and intelligent-tool skills.

Comments: Accepted by the International Symposium on Educational Technology (ISET) 2026

Abstract

The rapid integration of Large Language Models (LLMs) and the Model Context Protocol (MCP) into industrial software engineering has created a pressing need to update software engineering education to align with emerging technologies and evolving industry demands. This study investigates an innovative approach that integrates LLMs and MCP into a collaborative teaching model for software engineering education, aiming to build a practical learning framework closely connected to real-world engineering practices. By embedding LLM- and MCP-driven tools into daily teaching, code assistance, and engineering simulations, the model effectively bridges the gap between traditional instruction and industrial workflows. This integration enhances students' programming competence, practical problem-solving abilities, and proficiency in using intelligent engineering tools. Furthermore, through partnerships with industry internships, students can apply these technologies in real-world settings, further strengthening the connection between academic preparation and professional practice. Overall, this research offers a practical pathway for reforming and innovating software engineering education in the era of artificial intelligence.

2606.19163 2026-06-18 cs.DC 新提交

Pulse: Training Acceleration for Large Diffusion Models with Automatic Pipeline Parallelism

Pulse: 面向大规模扩散模型的自动流水线并行训练加速

Boran Sun, Guoyong Jiang, Lin Zhang, Chen Chen, Yuechen Tao, Zhishu Che, Jieling Yu, Shan Chang, Huaxi Gu, Fangming Liu, Bo Li

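AI summary Proposes PULSE, an automatic pipeline-parallel strategy that places skip-connected layers on the same device and caches activations locally, eliminating skip-induced communication across pipeline stages. Combined with a dynamic-programming partitioner, an ILP schedule synthesizer, and a hybrid parallelism tuner, it achieves up to 2.3x higher throughput on communication-bound hardware.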

Comments Accepted by the International Conference on Distributed Computing Systems (ICDCS'26)

AI中文摘要

扩散模型目前是高保真图像和视频生成的主流方法,但在GPU集群上扩展其训练仍具挑战。与仅含Transformer的架构不同,扩散骨干通常采用具有异构层和长距离跳跃连接的UNet风格编码器-解码器结构。在传统流水线并行下,这些非局部依赖迫使大型跳跃激活值及其梯度穿越多个流水线边界,使得点对点(P2P)通信成为主要瓶颈,并显著降低流水线效率。本文提出PULSE,一种自动流水线并行训练策略,将跳跃局部性作为首要优化目标。PULSE通过将跳跃连接的编码器-解码器层放置在同一设备上,并在本地缓存跳跃激活值以供反向传播使用,从而消除跳跃引起的通信。为了在保持高流水线利用率的同时实现这种放置,PULSE协同设计了:(1)一种跳跃感知的动态规划分区器,在对称共置约束下平衡异构阶段负载;(2)一种基于ILP的调度合成器,为生成的阶段到设备映射生成气泡高效的波调度;(3)一种混合并行调优器,在内存和网络约束下选择流水线/数据并行度及微批次大小。大量实验表明,与最先进的并行策略相比,通信量可减少89%,在通信受限硬件上训练吞吐量可提升高达2.3倍。

英文摘要

Diffusion models are now a dominant approach for high-fidelity image and video generation, yet scaling their training across GPU clusters remains challenging. Unlike transformer-only architectures, diffusion backbones commonly adopt UNet-style encoder-decoder structures with heterogeneous layers and long-range skip connections. Under conventional pipeline parallelism, these non-local dependencies force large skip activations and their gradients to traverse multiple pipeline boundaries, making peer-to-peer (P2P) communication a dominant bottleneck and substantially reducing pipeline efficiency. In this paper, we present PULSE, an automatic pipeline-parallel training strategy that makes skip locality a first-class optimization objective. PULSE eliminates skip-induced communication by collocating skip-connected encoder-decoder layers on the same device and caching skip activations locally for later use in backpropagation. To realize this placement while maintaining high pipeline utilization, PULSE co-designs: (1) a skip-aware dynamic-programming partitioner that balances heterogeneous stage workloads under symmetric collocation constraints, (2) an ILP-based schedule synthesizer that generates bubble-efficient wave schedules for the resulting stage-to-device mapping, and (3) a hybrid parallelism tuner that selects pipeline/data-parallel degrees and microbatch sizes under memory and network constraints. Our extensive experiments show that the volume of communication can be reduced by 89 percent, and the training throughput can be increased by up to 2.3x on communication-bound hardware, compared with state-of-the-art parallelism strategies.
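The collocation idea can be illustrated with a small sketch. It assumes a symmetric UNet with `n_levels` encoder layers followed by `n_levels` decoder layers, where encoder layer `i` feeds a skip connection to decoder layer `2 * n_levels - 1 - i`. The function names and the even split of layers are illustrative assumptions; the sketch ignores the load balancing, ILP scheduling, and parallelism tuning that the abstract attributes to PULSE.

```python
def contiguous_placement(n_layers, n_devices):
    """Assign layers to devices in order, in equal-sized contiguous chunks."""
    per_device = -(-n_layers // n_devices)  # ceiling division
    return [layer // per_device for layer in range(n_layers)]


def collocated_placement(n_levels, n_devices):
    """Place each encoder layer and its mirrored decoder layer on one device."""
    per_device = -(-n_levels // n_devices)
    encoder = [level // per_device for level in range(n_levels)]
    # Decoder layers run in reverse order, so mirroring the encoder mapping
    # puts both ends of every skip connection on the same device.
    return encoder + encoder[::-1]


def cross_device_skips(placement, n_levels):
    """Count skip connections whose two endpoints sit on different devices."""
    last = 2 * n_levels - 1
    return sum(placement[i] != placement[last - i] for i in range(n_levels))


if __name__ == "__main__":
    levels, devices = 8, 4
    naive = contiguous_placement(2 * levels, devices)
    mirrored = collocated_placement(levels, devices)
    print("contiguous:", cross_device_skips(naive, levels), "skips cross devices")
    print("collocated:", cross_device_skips(mirrored, levels), "skips cross devices")
```

In the mirrored mapping every device hosts two stages (one encoder chunk and one decoder chunk), so activations travel down the device chain and back. That is consistent with the abstract's mention of wave schedules, although the actual schedule construction is not reproduced here.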

2606.19119 2026-06-18 cs.AR cs.DC 新提交

PuDGhost: Experimental Analysis of Computation Result Corruption in Processing-using-DRAM Operations on Real DRAM Chips and Implications for Future Systems

PuDGhost:真实DRAM芯片上处理即内存操作计算结果错误的实验分析及对未来系统的影响

Daichi Tokuda, İsmail Emir Yüksel, Tatsuya Kubo, Ataberk Olgun, Haocong Luo, Nisa Bostanci, Jikun Wang, A. Giray Yağlıkçı, Shinya Takamaeda-Yamazaki, Onur Mutlu

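AI summary Reveals PuDGhost, an interference phenomenon in which data in non-activated DRAM rows and in other columns corrupts the results of Processing-using-DRAM operations. The paper quantifies the effect experimentally on 96 real DDR4 chips and proposes mitigations such as column screening and a compute row layout.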

Comments To appear at ISCA 2026 (June 2026)

AI中文摘要

处理即内存(PuD)是一种有前景的计算范式,通过同时多行激活(SiMRA)将每个DRAM列作为计算引擎,从而减轻主存与处理单元之间的频繁数据移动。不幸的是,DRAM密度缩放可能阻碍PuD的优势:更密集的单元阵列使行和列更接近,使得常规DRAM操作容易受到相邻单元的噪声和干扰。然而,先前没有工作研究未参与计算的行或列的干扰是否会损害PuD的鲁棒性。在这项工作中,我们揭示了PuDGhost,一种干扰现象,其中给定列中的PuD操作由于以下干扰而产生错误结果:1)非激活DRAM行中的数据,以及2)在同一SiMRA操作下同时计算的其他列中的数据。PuDGhost违背了每列计算仅依赖于其自身操作数数据的理想图景,威胁到未来的PuD系统。我们首次使用来自12个模块的96个真实DDR4 DRAM芯片对PuDGhost进行了广泛表征,量化了各种条件下这两种干扰源。在我们的15个新实证观察中,我们强调两个主要结果:1)相邻非激活行中的数据对SiMRA输出的影响高达10%(随机输入),2)同时计算列中的数据对SiMRA输出的影响高达48%(随机输入)。在这些发现的指导下,我们提出了跨PuD计算栈多个层的对策。具体来说,我们在真实DDR4 DRAM芯片上评估了:1)鲁棒的列筛选,降低了在存在PuDGhost时使用不可靠列的风险,以及2)一种计算行布局,通过在计算行之间使用专用行来缓解PuDGhost。我们的解决方案大大提高了PuD计算精度,并为未来鲁棒的PuD系统奠定了基础。

英文摘要

Processing-using-DRAM (PuD) is a promising computation paradigm that alleviates frequent data movement between main memory and processing units by using each DRAM column as a computation engine via simultaneous multiple-row activation (SiMRA). Unfortunately, DRAM density scaling may hinder PuD's benefits: denser cell arrays bring rows and columns closer, making regular DRAM operations susceptible to noise and interference from neighboring cells. Yet no prior work investigates whether interference from rows or columns not intended to participate in computation can compromise PuD robustness. In this work, we reveal PuDGhost, an interference phenomenon where a PuD operation in a given column produces erroneous results due to interference from 1) data in non-activated DRAM rows and 2) data in other columns that compute concurrently under the same SiMRA operation. PuDGhost violates the ideal picture that each column's computation depends solely on its own operand data, threatening future PuD systems. We present the first extensive characterization of PuDGhost using 96 real DDR4 DRAM chips from 12 modules, quantifying these two interference sources under various conditions. Among our 15 new empirical observations, we highlight two major results: 1) data in adjacent non-activated rows affects SiMRA outputs by up to 10% for random inputs, and 2) data in concurrently computing columns affects SiMRA outputs by up to 48% for random inputs. Guided by these findings, we propose countermeasures across multiple layers of the PuD computing stack. Specifically, we evaluate on real DDR4 DRAM chips: 1) robust column screening that reduces the risk of using unreliable columns in the presence of PuDGhost, and 2) a compute row layout that mitigates PuDGhost via dedicated rows between compute rows. Our solutions greatly improve PuD computation accuracy and provide a foundation for robust future PuD systems.

2606.19106 2026-06-18 cs.CR cs.CY 新提交

Quantifying Compromise Risk in Exceptional Access Architectures Under Sparse and Indirect Evidence

在稀疏和间接证据下量化特殊访问架构中的泄露风险

Alan Woodward

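AI summary To address the absence of public compromise data for exceptional access systems, builds a structured uncertainty framework that uses historical analogues, Monte Carlo scenarios, a channel-independence decomposition, and a Bayesian structural risk model to quantify systemic compromise risk in transmission-layer and platform-layer EA architectures. Both classes carry higher risk than the no-EA baseline, with differently shaped risk distributions.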

AI中文摘要

合法的特殊访问(EA)系统持有用于授权方解密受保护通信的加密密钥。关于其风险的争论长期且定性,因两个问题而复杂化:不存在EA特定泄露事件的公开数据集,因此评估必须使用稀疏的间接证据;先前的工作将结构不同的设计视为等效,尽管运营商基础设施中的传输层EA(T-EA)和平台层的覆盖层EA(OTT-EA)在加密密钥与密文数据的关系上有所不同。本文构建了一个结构化不确定性框架,用于评估EA架构中的系统性泄露风险。它不产生预测性预测,因为证据无法支持;它将稳健的发现与依赖于校准的发现分开。对T-EA和OTT-EA应用了四个分析层:三个实证支柱(历史类比、蒙特卡洛场景层、信道独立性分解)加上一个并行子图攻击图上的贝叶斯结构风险模型。核心发现是结构性的。首先,任何类别的配备EA的架构都承担比无EA反事实更高的建模风险,这一顺序独立于校准。其次,类别在分布形状上不同:T-EA风险由中心趋势主导,OTT-EA风险在关联活动下由尾部主导。第三,在结构化判断的目标溢价区间内,T-EA的校准条件年概率范围从1.4%到12.9%。在数十年时间跨度上,累积泄露远高于零;关键材料外泄是不可逆的,对OTT-EA更大的用户群体影响严重。该框架量化泄露概率,而非预期危害;后果建模和收益估计不在其范围内。

英文摘要

Lawful exceptional access (EA) systems hold the cryptographic keys that decrypt protected communications for authorised parties. The debate over their risks has been long and qualitative, complicated by two problems: no public dataset of EA-specific compromise events exists, so assessment must use sparse, indirect evidence; and prior work has treated structurally different designs as equivalent, though transmission-layer EA in carrier infrastructure (T-EA) and over-the-top EA at the platform layer (OTT-EA) differ in how cryptographic keys relate to ciphertext data. This paper builds a structured uncertainty framework for evaluating systemic compromise risk in EA architectures. It does not produce predictive forecasts, which the evidence cannot support; it separates findings robust to assumptions from those that depend on calibration. Four analytical layers are applied to T-EA and OTT-EA: three empirical pillars (historical analogues, a Monte Carlo scenario layer, a channel-independence decomposition) plus a Bayesian Structural Risk Model on a parallel-subgraph attack graph. The central findings are structural. First, EA-equipped architectures of either class carry strictly higher modelled risk than their no-EA counterfactual, an ordering independent of calibration. Second, the classes differ in distribution shape: T-EA risk is dominated by central tendency, OTT-EA by the tail under correlated campaigns. Third, calibration-conditional annual probability ranges span 1.4% to 12.9% for T-EA across the structured-judgement targeting-premium interval. Over multi-decade horizons, cumulative compromise is well above zero; key-material exfiltration is irreversible, weighing heavily on OTT-EA's larger user populations. The framework quantifies compromise probability, not expected harm; consequence modelling and benefit estimation are outside its scope.
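To see why even the lower end of the reported annual range matters over multi-decade horizons, the annual probabilities can be compounded. The sketch below assumes a constant annual probability and independence between years. Both are simplifications introduced here for illustration and are not necessarily how the paper aggregates risk; the 20-year horizon is likewise an arbitrary example.

```python
def cumulative_probability(annual_probability, years):
    """Probability of at least one compromise within `years`.

    Assumes a constant annual probability and independent years
    (an illustrative simplification, not the paper's model).
    """
    if not 0.0 <= annual_probability <= 1.0:
        raise ValueError("annual_probability must lie in [0, 1]")
    if years < 0:
        raise ValueError("years must be non-negative")
    return 1.0 - (1.0 - annual_probability) ** years


if __name__ == "__main__":
    # 1.4% and 12.9% are the ends of the T-EA range quoted in the abstract.
    for p in (0.014, 0.129):
        print(f"annual {p:.1%} over 20 years: {cumulative_probability(p, 20):.1%}")
```

Under these assumptions the cumulative figure grows quickly with the horizon, which is the qualitative point the abstract makes about multi-decade exposure.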

2606.19080 2026-06-18 eess.SY cs.SY 新提交

Byzantine-Resilient Federated Multi-Agent Optimization Framework for Cyber-Secure Interconnected Microgrids

面向网络安全互联微电网的拜占庭弹性联邦多智能体优化框架

Ali Peivand, Seyyed Mostafa Nosratabadi

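AI summary Proposes the BR-FedMAPPO framework, which combines a triple-surface moving target defense with an adaptive isolation strategy and uses a two-stage Byzantine-resilient aggregation rule to withstand stealthy false data injection attacks, protecting distributed learning channels while maintaining economic dispatch performance.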

AI中文摘要

配电网络日益数字化,使得互联微电网集群面临隐蔽虚假数据注入攻击,这些攻击绕过不良数据检测器,通过联络线耦合和共享学习通道传播。本文提出BR-FedMAPPO,一种拜占庭弹性联邦多智能体近端策略优化框架,学习三重表面移动目标防御和自适应隔离策略以实现网络安全运行。每个微电网托管一个本地Actor-Critic智能体,其策略被划分为全局联邦共享编码器和私有保留动作头,因此没有微电网暴露其D-FACTS线路、电池储能单元或联络线容量的配置、基数或位置。动作向量扰动D-FACTS电抗、重定向BES注入、重塑微电网间交换,并包含连续孤岛信号。两阶段拜占庭弹性聚合规则结合了修剪均值滤波和奖励加权更新。该方案基于F1分数和假阳性率纳入检测质量分数,以惩罚引起误报的客户端。在基于IEEE 30节点和118节点测试系统的四个互联微电网上的仿真结果表明,该框架能有效缓解协调的S-FDI攻击,通过自适应隔离遏制级联中断,保护分布式学习通道免受恶意模型操纵,同时保持成本感知的调度性能。

英文摘要

The escalating digitalization of distribution networks has exposed interconnected Microgrid (MG) clusters to Stealthy False Data Injection (S-FDI) attacks that bypass Bad Data Detectors and propagate through tie-line couplings and shared learning channels. This paper proposes BR-FedMAPPO, a Byzantine-Resilient Federated Multi-Agent Proximal Policy Optimization framework that learns a triple-surface Moving Target Defense and an adaptive isolation strategy for cyber-secure operation. Each MG hosts a local Actor-Critic Agent whose policy is partitioned into a globally federated shared encoder and a privately retained action head, so no MG exposes the configurations, cardinality, or locations of its D-FACTS lines, Battery Energy Storage (BES) units, or tie-line capacities. The action vector perturbs D-FACTS reactances, redirects BES injections, reshapes inter-MG exchanges, and includes a continuous islanding signal. A two-stage Byzantine-resilient aggregation rule combines trimmed-mean filtering with reward-weighted updates. This scheme incorporates a detection-quality score based on the F1-score and False Positive Rate to penalize clients causing false alarms. Simulation results on four interconnected MGs based on the IEEE 30- and 118-bus test systems demonstrate effective mitigation of coordinated S-FDI attacks, containment of cascading disruptions through adaptive isolation, and protection of distributed learning channels against malicious model manipulations while maintaining cost-aware dispatch performance.
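A two-stage rule of the kind described above might look like the following sketch. The abstract does not specify how the two stages are combined, so this is one plausible reading: per coordinate, the `trim` largest and `trim` smallest client values are discarded, and the survivors are averaged with weights given by a non-negative per-client score. The function name, the score values, and the fallback to a plain mean are assumptions made for illustration.

```python
def trimmed_weighted_aggregate(updates, scores, trim):
    """Aggregate client updates coordinate by coordinate.

    Stage 1: drop the `trim` smallest and `trim` largest values per coordinate.
    Stage 2: average the surviving values, weighted by each client's score.
    """
    n_clients = len(updates)
    if n_clients != len(scores):
        raise ValueError("one score per client is required")
    if trim < 0 or 2 * trim >= n_clients:
        raise ValueError("trim must leave at least one client per coordinate")

    dim = len(updates[0])
    aggregated = []
    for j in range(dim):
        ranked = sorted(range(n_clients), key=lambda c: updates[c][j])
        kept = ranked[trim:n_clients - trim]
        total = sum(scores[c] for c in kept)
        if total > 0:
            value = sum(scores[c] * updates[c][j] for c in kept) / total
        else:
            # All surviving scores are zero: fall back to an unweighted mean.
            value = sum(updates[c][j] for c in kept) / len(kept)
        aggregated.append(value)
    return aggregated


if __name__ == "__main__":
    client_updates = [[1.0, 1.0], [1.2, 0.8], [0.8, 1.2], [100.0, -100.0]]
    client_scores = [1.0, 1.0, 1.0, 1.0]
    print(trimmed_weighted_aggregate(client_updates, client_scores, trim=1))
```

The last client plays the role of a Byzantine participant: its extreme values are trimmed in both coordinates regardless of the score it reports, which is why trimming is applied before the weighting.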

2606.19076 2026-06-18 cs.CR 新提交

Compute-Budgeted Exploitability Evidence Graphs for Prospective Vulnerability Triage

计算预算下的可利用性证据图用于前瞻性漏洞分类

Faruk Alpay, Taylan Alpay

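AI summary Proposes a compute-budgeted vulnerability triage method that uses a temporal evidence graph to limit the number of evidence documents per CVE, improving leakage-safe prospective recall, and introduces auditable evidence certificates.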

Comments 11 pages, 3 figures, 1 table; ancillary files provided; artifacts: https://huggingface.co/datasets/Lightcap/cbeeg-evidence-graphs

AI中文摘要

防御者无法同时修补所有新披露的漏洞,因此可利用性预测必须以前瞻性而非回顾性的方式进行评估。我们研究了计算预算下的漏洞分类,其中每个CVE仅根据固定决策时间可见的公开证据进行评分。将公告、漏洞利用档案、修复提交和黑客社区讨论表示为时间证据图;预算选择器仅允许每个CVE少量证据文档,每个分数都配有一个可审计的证书,列出支持信号、时间戳、来源层和泄漏标志。在来自公开来源的12012个前瞻性CVE上,预算证据选择将泄漏安全的前瞻性召回@50从仅严重性基线的0.010提高到0.026,而每个CVE两个证据文档捕获了大部分价值。强大的交叉编码器重排序器将前瞻性召回降低到0.016,表明与CVE的语义相关性并不等同于可利用性证据。最重要的是,使用未过滤证据的朴素随机分割将表观前瞻性召回夸大了8.5倍,EPSS高召回夸大了5.0倍。主要贡献是泄漏安全的评估协议和可重复的证据证书,用于可争议的漏洞优先级排序声明。

英文摘要

Defenders cannot patch every newly disclosed vulnerability at once, so exploitability prediction must be evaluated prospectively rather than retrospectively. We study compute-budgeted vulnerability triage in which each CVE is scored only from public evidence visible by a fixed decision time. Advisories, exploit archives, fix commits, and hacker-community discourse are represented as a temporal evidence graph; a budgeted selector admits only a few evidence documents per CVE, and every score is paired with an auditable certificate listing the supporting signals, timestamps, source layers, and leakage flags. On 12012 prospective CVEs from public sources, budgeted evidence selection raises leakage-safe prospective recall@50 from 0.010 for a severity-only baseline to 0.026, while two evidence documents per CVE capture most of the value. A strong cross-encoder reranker lowers prospective recall to 0.016, showing that semantic relevance to a CVE is not the same as evidence of exploitation. Most importantly, a naive random split with unfiltered evidence inflates apparent prospective recall by 8.5x and EPSS-high recall by 5.0x. The main contribution is a leakage-safe evaluation protocol and reproducible evidence certificates for contestable vulnerability-prioritization claims.
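The two mechanics mentioned above, restricting evidence to what was visible at decision time and measuring recall at a fixed cutoff, can be sketched as follows. The dictionary fields (`published`, `weight`), the ranking of evidence by a precomputed weight, and the standard recall@k definition are assumptions made for illustration; the abstract does not describe the paper's selector or metric at this level of detail.

```python
from datetime import datetime


def select_evidence(documents, decision_time, budget):
    """Keep at most `budget` documents published no later than `decision_time`."""
    visible = [d for d in documents if d["published"] <= decision_time]
    visible.sort(key=lambda d: d["weight"], reverse=True)
    return visible[:budget]


def recall_at_k(scores, exploited, k):
    """Fraction of exploited CVEs that appear among the k highest-scored CVEs."""
    if not exploited:
        raise ValueError("recall is undefined without positive examples")
    top_k = sorted(scores, key=scores.get, reverse=True)[:k]
    return len(set(top_k) & set(exploited)) / len(exploited)


if __name__ == "__main__":
    docs = [
        {"id": "advisory", "published": datetime(2025, 1, 2), "weight": 0.4},
        {"id": "fix-commit", "published": datetime(2025, 1, 5), "weight": 0.7},
        # Published after the decision time: using it would leak the future.
        {"id": "exploit-writeup", "published": datetime(2025, 3, 1), "weight": 0.9},
    ]
    chosen = select_evidence(docs, datetime(2025, 1, 31), budget=2)
    print([d["id"] for d in chosen])
```

Dropping the timestamp filter in `select_evidence` is the kind of leakage the abstract warns about: the highest-weighted document here only exists after the decision has to be made.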

2606.19063 2026-06-18 cs.CR 新提交

PYPILINE: Malicious PyPI Package Detection via Suspicious API Knowledge and Agent Workflow

PYPILINE:通过可疑API知识和Agent工作流检测恶意PyPI包

Siyuan Pang, Zhengwei Jiang, Yepeng Yao, Zijing Fan, Haozhe Li, Baoxu Liu

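AI summary Proposes PYPILINE, which combines a suspicious API knowledge base with an Agent workflow: the knowledge base is built through static analysis, and malicious PyPI packages are then detected automatically, with precision, recall, and F1-score significantly better than those of existing tools.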

AI中文摘要

恶意PyPI包的检测对于维护开源软件供应链的安全至关重要。现有方法主要依赖规则或传统机器学习,存在可解释性差且难以适应新型攻击的问题。为此,我们提出PYPILINE,一种结合可疑API知识库与Agent工作流的新型检测方法。PYPILINE首先对已知恶意包进行静态分析,提取抽象语法树并生成API调用图,从中自动提取并构建结构化的可疑API知识库。在检测阶段,利用该知识库增强推理能力。通过Agent工作流,PYPILINE对未知包进行深度语义分析,并输出结构化的、可解释的恶意性评估报告。实验结果表明,PYPILINE在精度96.7%、召回率99.6%和F1分数98.1%上显著优于现有最先进工具,其精度比基线工具高出5.7至24.2个百分点。此外,我们对恶意包进行了实证研究,系统揭示了常见的攻击策略以及最常被滥用的API。通过配备工具调用的AI Agent工作流,实现可疑API知识的自动向量数据库检索和通过邮件服务器发送分析报告,PYPILINE提供了一种实用、高效且便捷的恶意包检测解决方案,以增强开源生态系统安全。

英文摘要

The detection of malicious PyPI packages is crucial for maintaining the security of the open source software supply chain. Existing methods, which primarily rely on rules or traditional machine learning, suffer from poor interpretability and difficulty in adapting to novel attacks. To address this, we propose PYPILINE, a novel detection method that combines a suspicious API knowledge base with an Agent workflow. PYPILINE first conducts static analysis on known malicious packages, extracting abstract syntax trees and generating API call graphs, from which it automatically extracts and constructs a structured suspicious API knowledge base. During the detection phase, this knowledge base is used to enhance reasoning capabilities. Through an Agent workflow, PYPILINE performs in-depth semantic analysis of unknown packages and outputs a structured, interpretable maliciousness assessment report. The experimental results show that PYPILINE significantly outperforms existing state-of-the-art tools, achieving a precision of 96.7%, a recall of 99.6%, and an F1-score of 98.1%, with its precision surpassing baseline tools by 5.7 to 24.2 percentage points. Additionally, we conducted an empirical study on malicious packages, systematically revealing prevalent attack strategies, as well as the most commonly abused APIs. Equipped with tool-calling AI agent workflows for automated vector database retrieval of suspicious API knowledge and mail server delivery of analysis reports, PYPILINE delivers a practical, efficient, and convenient malicious package detection solution to strengthen open-source ecosystem security.
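The static-analysis step can be pictured with Python's standard `ast` module. The sketch below parses source text, collects the dotted names of called functions, and intersects them with a small list of suspicious APIs. The list `SUSPICIOUS_APIS` and the helper names are hypothetical; PYPILINE's actual knowledge base, call-graph construction, and agent workflow are not described in enough detail in the abstract to reproduce.

```python
import ast

# Hypothetical example entries; a real knowledge base would be mined from data.
SUSPICIOUS_APIS = {"os.system", "subprocess.Popen", "base64.b64decode", "exec", "eval"}


def dotted_name(node):
    """Return 'a.b.c' for Name/Attribute chains, or None for anything else."""
    parts = []
    while isinstance(node, ast.Attribute):
        parts.append(node.attr)
        node = node.value
    if isinstance(node, ast.Name):
        parts.append(node.id)
        return ".".join(reversed(parts))
    return None  # e.g. a call on the result of another call


def called_apis(source):
    """Collect the dotted names of all call targets found in `source`."""
    names = set()
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Call):
            name = dotted_name(node.func)
            if name is not None:
                names.add(name)
    return names


def suspicious_calls(source):
    """Return the called APIs that also appear in the suspicious list."""
    return called_apis(source) & SUSPICIOUS_APIS


if __name__ == "__main__":
    sample = "import os, base64\nos.system(base64.b64decode(payload).decode())\n"
    print(sorted(suspicious_calls(sample)))
```

This matches names only as written in the source, so aliased imports (`import os as o`) or dynamically built names would slip through. Handling those cases is presumably part of what the deeper semantic analysis is for.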

2606.19058 2026-06-18 cs.NI 新提交

Atomic Handover for 6G Nomadic Non-Public Networks Using Edge-Based Spectrum Brokering

基于边缘频谱代理的6G游牧式非公共网络原子切换

Daniel Lindenschmitt, Hans D. Schotten

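AI summary To address the connectivity challenges that nomadic non-public networks face when moving across heterogeneous infrastructures, proposes a zero-touch atomic handover approach that jointly considers mobility management and dynamic spectrum coordination. An edge-based Spectrum Broker and a Cognitive Spectrum Manager perform network selection and spectrum allocation in a single step, and the approach is evaluated in a simulated mobile healthcare scenario.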

Comments 9 pages, 4 figures, IEEE PIMRC 2026

AI中文摘要

游牧式非公共网络(NNPN)预计将在未来的6G系统中发挥重要作用,通过为应急响应或临时活动等场景提供移动且可快速部署的网络基础设施。在此类环境中,保持无缝连接具有挑战性,因为当跨异构基础设施移动时,网络附着和频谱接入可能需要同时调整。本文研究了NNPN的切换机制,并提出了一种零接触方法,该方法联合考虑移动管理和动态频谱协调。所提出的架构引入了基于边缘的频谱代理(Spectrum Broker)与认知频谱管理器(Cognitive Spectrum Manager)相结合,以支持原子切换过程,其中网络选择和频谱分配在单一步骤中完成。该概念通过基于MATLAB的移动医疗场景仿真进行评估,其中一辆配备NNPN的救护车在公共陆地移动网络(PLMN)和非公共网络(NPN)之间过渡。

英文摘要

Nomadic Non-Public Networks (NNPN) are expected to play an important role in future 6G systems by enabling mobile and rapidly deployable network infrastructures for scenarios such as emergency response or temporary events. In such environments, maintaining seamless connectivity is challenging, as both network attachment and spectrum access may need to be adapted simultaneously when moving across heterogeneous infrastructures. In this paper, we investigate handover mechanisms for NNPN and propose a zero-touch approach that jointly considers mobility management and dynamic spectrum coordination. The proposed architecture introduces an edge-based Spectrum Broker in combination with a Cognitive Spectrum Manager to support an atomic handover procedure, where network selection and spectrum allocation are performed in a single step. The concept is evaluated using a MATLAB-based simulation of a mobile healthcare scenario, where an ambulance with its NNPN transitions between Public Land Mobile Networks (PLMN) and Non-Public Networks (NPN).

2606.19055 2026-06-18 cs.AR cs.CR 新提交

CHERI-D: Secure and efficient inline object ID for CHERI temporal memory safety

CHERI-D:用于CHERI时间内存安全的安全高效内联对象ID

Yuecheng Wang, Jonathan Woodruff, Alfredo Mazzinghi, Peter Rugg, Samuel W. Stark, Alexandre Joannou, Robert N. M. Watson, Simon W. Moore

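AI summary Proposes CHERI-D, an architectural extension that achieves efficient temporal memory safety through inline object ID metadata, significantly reducing revocation overhead and supporting strict use-after-free mitigation.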

AI中文摘要

我们提出CHERI-D,一种对CHERI的架构扩展,支持高效的时间内存安全。高效的内存安全对于编程语言、操作系统和硬件设计日益重要,而CHERI是一种领先的硬件/软件系统,提供原生空间安全性和时间内存安全性的基础。由于CHERI缺乏对时间内存安全性的内在架构支持,最先进的CHERI时间安全解决方案Cornucopia Reloaded是一种基于软件的解决方案,提供重新分配后使用(UAR)保护而非更强的释放后使用(UAF)缓解,并且由于延迟重新分配和撤销而遭受性能开销。CHERI-D将对象标识(ID)元数据与能力指针关联,以提供分配的时间完整性。CHERI的空间安全性允许CHERI-D将对象ID安全地内联存储在分配数据中,可能利用未使用的碎片。通过仿真和硬件评估,CHERI-D显著降低了Cornucopia Reloaded的撤销开销,同时使其能够支持严格的释放后使用缓解。

英文摘要

We propose CHERI-D, an architectural extension to CHERI that supports efficient temporal memory safety. Efficient memory safety is an increasing priority for programming languages, operating systems, and hardware designs, and CHERI is a leading hardware/software system that provides native spatial safety and a foundation for temporal memory safety. Due to CHERI lacking intrinsic architectural support for temporal memory safety, the state-of-the-art CHERI temporal safety solution, Cornucopia Reloaded, is a software-based solution that provides use-after-reallocation (UAR) protections instead of the stronger use-after-free (UAF) mitigation, and suffers performance overhead due to delayed reallocation and revocation. CHERI-D associates object identification (ID) metadata with capability pointers to provide temporal integrity of allocations. CHERI spatial safety allows CHERI-D to store object IDs safely inline with allocation data, potentially within unused fragmentation. Evaluated in simulation and in hardware, CHERI-D significantly reduces the revocation overhead of Cornucopia Reloaded while allowing it to support strict use-after-free mitigation.
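The general idea of pairing a pointer with an object ID that is stored inline next to the data can be modelled in a few lines. The following is a software toy under stated assumptions, not the CHERI-D design: `Capability`, `ToyHeap`, the one-word header, and the bump allocator that never reuses memory are all invented for illustration, and real capabilities are enforced in hardware.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Capability:
    base: int       # first usable word of the allocation
    length: int     # number of usable words (spatial bounds)
    object_id: int  # ID the allocation had when the capability was created


class ToyHeap:
    FREED = 0  # ID written on free; malloc never hands this value out

    def __init__(self, size):
        self.memory = [0] * size
        self.next_free = 0
        self.next_id = 1

    def malloc(self, length):
        header = self.next_free
        if header + 1 + length > len(self.memory):
            raise MemoryError("toy heap exhausted")
        self.memory[header] = self.next_id  # inline ID, just before the data
        cap = Capability(header + 1, length, self.next_id)
        self.next_id += 1
        self.next_free = header + 1 + length
        return cap

    def free(self, cap):
        self._check_id(cap)
        self.memory[cap.base - 1] = self.FREED

    def load(self, cap, offset):
        self._check(cap, offset)
        return self.memory[cap.base + offset]

    def store(self, cap, offset, value):
        self._check(cap, offset)
        self.memory[cap.base + offset] = value

    def _check_id(self, cap):
        if self.memory[cap.base - 1] != cap.object_id:
            raise PermissionError("temporal violation: stale object ID")

    def _check(self, cap, offset):
        # The bounds check also keeps user code away from the inline ID word.
        if not 0 <= offset < cap.length:
            raise IndexError("spatial violation: offset out of bounds")
        self._check_id(cap)
```

In this toy, a load through a capability whose allocation has been freed fails immediately because the inline ID no longer matches, and the bounds check is what prevents a program from forging the ID word. That loosely mirrors the abstract's remark that spatial safety is what makes inline storage safe.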

2606.19037 2026-06-18 cs.IR 新提交

Querit-Reranker: Training Compact Multilingual Rerankers via Efficient Label-Free Distribution Adaptation

Querit-Reranker: 通过高效无标签分布适应训练紧凑型多语言重排序器

Yunfei Zhong, Jun Yang, Wei Huang, Yinqiong Cai, Haosheng Qian, Yixing Fan, Ruqing Zhang, Lixin Su, Daiting Shi, Jiafeng Guo

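AI summary Proposes the Querit-Reranker family of multilingual cross-encoder rerankers, trained with a data-centric, label-free adaptation pipeline that adapts to target distributions through synthetic-query mining and teacher soft labels and merges checkpoints via spherical linear interpolation. It significantly improves nDCG@10 on BEIR and MIRACL and reaches state-of-the-art performance on MTEB multilingual reranking.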

AI中文摘要

可部署的多语言重排序器必须能够跨语言、领域和目标排序任务进行泛化,同时保持足够的效率以用于第二阶段重排序。然而,将它们适应新的目标分布通常需要大量特定任务的相关性标注,这获取成本高昂。我们提出了Querit-Reranker,这是一个多语言交叉编码器重排序器家族,通过数据中心的管道进行标签高效适应。我们将其实例化为Querit-Reranker-A0.4B(从内部MoE骨干网络初始化,具有0.4B激活参数)和Querit-Reranker-4B(从Qwen3-Embedding-4B初始化)。我们的管道首先从大规模面向排序的数据中学习通用相关性建模,然后通过合成查询挖掘和教师分数作为连续软标签来适应目标分布。为了巩固互补的任务适应优势,我们进一步通过球面线性插值合并检查点,获得一个无需运行时集成开销即可部署的单一模型。使用Qwen3-Embedding-0.6B作为共享的第一阶段检索器,Querit-Reranker-A0.4B在BEIR上将平均nDCG@10从54.11提升到59.28,在MIRACL上从59.87提升到67.70。在MTEB Multilingual v2 Reranking上,它也显著优于更大的基于嵌入的基线,而Querit-Reranker-4B在公开可用模型中进一步实现了最先进的性能。我们在Hugging Face上发布了这两个模型。

英文摘要

Deployable multilingual rerankers must generalize across languages, domains, and target ranking tasks while remaining efficient enough for second-stage reranking. However, adapting them to new target distributions typically requires extensive task-specific relevance annotations, which are costly to obtain. We present Querit-Reranker, a family of multilingual cross-encoder rerankers trained with a data-centric pipeline for label-efficient adaptation. We instantiate it as Querit-Reranker-A0.4B, initialized from an in-house MoE backbone with 0.4B activated parameters, and Querit-Reranker-4B, initialized from Qwen3-Embedding-4B. Our pipeline first learns general relevance modeling from large-scale ranking-oriented data, then adapts to target distributions through synthetic-query mining with teacher scores as continuous soft labels. To consolidate complementary task-adapted strengths, we further merge checkpoints via spherical linear interpolation, obtaining a single deployable model without runtime ensembling overhead. Using Qwen3-Embedding-0.6B as the shared first-stage retriever, Querit-Reranker-A0.4B improves average nDCG@10 from 54.11 to 59.28 on BEIR and from 59.87 to 67.70 on MIRACL. On MTEB Multilingual v2 Reranking, it also substantially outperforms larger embedding-based baselines, while Querit-Reranker-4B further achieves state-of-the-art performance among publicly available models. We release both models on Hugging Face.
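Spherical linear interpolation (slerp), the merging operation named above, interpolates along the arc between two vectors instead of along the straight line between them. The sketch below applies it to two flat lists of floats. Treating a checkpoint as a single flattened vector, the fallback to linear interpolation for nearly parallel vectors, and the parameter names are assumptions made for illustration; the abstract does not say how the authors apply slerp per layer or per tensor.

```python
import math


def slerp(a, b, t, eps=1e-8):
    """Spherically interpolate between vectors a and b at fraction t in [0, 1]."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))

    def lerp():
        return [(1.0 - t) * x + t * y for x, y in zip(a, b)]

    if norm_a < eps or norm_b < eps:
        return lerp()  # the angle is undefined for a zero vector

    cosine = sum(x * y for x, y in zip(a, b)) / (norm_a * norm_b)
    theta = math.acos(max(-1.0, min(1.0, cosine)))
    sin_theta = math.sin(theta)
    if sin_theta < eps:
        return lerp()  # nearly parallel (or antiparallel): avoid dividing by ~0

    weight_a = math.sin((1.0 - t) * theta) / sin_theta
    weight_b = math.sin(t * theta) / sin_theta
    return [weight_a * x + weight_b * y for x, y in zip(a, b)]


if __name__ == "__main__":
    checkpoint_a = [1.0, 0.0]
    checkpoint_b = [0.0, 1.0]
    print(slerp(checkpoint_a, checkpoint_b, 0.5))
```

For two unit vectors the result stays on the unit sphere, whereas plain averaging would shrink the norm. The merged weights form a single model, so no ensembling is needed at inference time.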

2606.19034 2026-06-18 cs.DB 新提交

Evaluating Learned Spatial Indexes

评估学习型空间索引

Sachith Pai, Jun Yang, Michael Mathioudakis

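AI summary An experimental study of learned index performance on spatial range queries that analyzes the effects of block size, data skew, selectivity, and other factors, and proposes a workload-based decision tree for index selection.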

AI中文摘要

学习型索引通过使搜索结构适应数据和负载分布来提高查询性能。尽管已提出许多学习型索引,但它们在空间范围查询中的权衡仍未被充分理解,因为性能不仅取决于模型精度,还取决于数据和查询倾斜、布局粒度、选择性和存储行为。在这项工作中,我们对空间范围查询的学习型索引进行了实验研究。我们考察了一组代表性索引,并回答了七个基本问题:(1) 块大小如何影响查询延迟,以及在变化的选择性下哪些配置能产生最优性能?(2) 倾斜的数据和查询分布如何影响索引性能?(3) 索引如何平衡细化和扫描成本,哪些设计更偏向其中一种?(4) 与内存设置相比,基于磁盘的存储条件如何改变最优块大小和延迟权衡?(5) 不同索引的构建成本是多少,在多少查询量下这些成本能被摊销?(6) 对于给定的数据和查询负载,哪种索引预期表现最佳?(7) 从合成数据中学到的索引选择见解能否推广到真实数据分布?为了进行分析,我们使用了一个具有通用存储后端、标准化查询执行管道以及受控的数据和查询倾斜变化的框架。我们的实验揭示了关于细化与扫描权衡、块大小的影响以及选择性与布局有效性之间相互作用的关键见解。我们将这些发现综合成一个基于工作负载的索引选择决策树,并在真实的OpenStreetMap点集上使用合成查询进行验证,确认其建议表现出最小的决策遗憾,通常能产生接近最优的查询性能。

英文摘要

Learned indexes improve query performance by adapting search structures to data and workload distributions. Although many learned indexes have been proposed, their trade-offs remain insufficiently understood for spatial range queries, where performance depends not only on model accuracy but also on data and query skew, layout granularity, selectivity, and storage behavior. In this work, we perform an experimental study of learned indexes for spatial range queries. We examine a representative set of indexes and address seven fundamental questions: (1) How does block size influence query latency, and what configurations yield optimal performance under varying selectivities? (2) How do skewed data and query distributions impact index performance? (3) How do indexes balance refinement and scan costs, and which designs favor one over the other? (4) How do disk-based storage conditions alter optimal block size and latency trade-offs compared to in-memory settings? (5) What are the construction costs of different indexes, and under what query volumes are these costs amortized? (6) For a given data and query workload, which index is expected to perform best? (7) Do index-selection insights learned from synthetic data generalize to real-world data distributions? To enable the analysis, we use a framework with a common storage backend, standardized query execution pipelines, and controlled variations in data and query skew. Our experiments reveal critical insights into refinement vs. scan trade-offs, the impact of block size, and the interplay between selectivity and layout effectiveness. We synthesize these findings into a workload-based decision tree for index selection and validate it on real OpenStreetMap point sets with synthetic queries, confirming that its recommendations exhibit minimal decision regret and typically yield near-optimal query performance.
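Question (5), on when construction cost is amortized, comes down to a break-even calculation between two indexes. The sketch below assumes a fixed build cost and a constant average per-query latency for each index, both in integer microseconds. The function name and all numbers in the example are hypothetical and are not measurements from the study.

```python
def break_even_queries(build_a_us, query_a_us, build_b_us, query_b_us):
    """Smallest query count at which index A's total cost is no more than B's.

    Total cost is modelled as build cost + queries * per-query latency.
    Returns None if A never catches up with B.
    """
    extra_build = build_a_us - build_b_us
    saved_per_query = query_b_us - query_a_us
    if extra_build <= 0:
        return 0  # A is already no more expensive before any query runs
    if saved_per_query <= 0:
        return None  # A costs more to build and is not faster per query
    return -(-extra_build // saved_per_query)  # ceiling division


if __name__ == "__main__":
    # Hypothetical: a learned index that is slow to build but fast to query,
    # compared with a baseline that is quick to build but slower per query.
    queries = break_even_queries(
        build_a_us=120_000_000, query_a_us=2_000,
        build_b_us=20_000_000, query_b_us=6_000,
    )
    print("break-even after", queries, "queries")
```

Integer microseconds are used so that the ceiling division is exact. A real workload would also need to account for latency that varies with selectivity and skew, which is the subject of the study's other questions.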

2606.19021 2026-06-18 cs.CE 新提交

On a variational model for phase transformation in SiO2 glass

关于SiO2玻璃中相变的一个变分模型

Sarah Dinkelacker-Steinhoff, Klaus Hackl

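AI summary For the compaction mechanisms of SiO2 glass under pressure, proposes a variational framework that explains the stress response through a binary phase transformation, matches experimental data, and reveals the relationship between the elastic moduli and the volume change.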

AI中文摘要

SiO2玻璃在压力下的压实机制包括在某些条件下弹性模量的特定降低和复杂的非弹性行为,其性质尚未完全理解。在我们的工作中,我们建立了一个描述SiO2玻璃在静水压力下演化的变分框架。基于先前提出的非弹性材料多相转变模型,我们假设压实过程中等温条件,并将典型的S形应力响应解释为二元相变的指标。在此过程中,两个体积分数宏观共存,并在其间发展出诸如剪切带或向错图案等微结构。我们限制我们的方法仅解析体积分数,而不解析相应的微结构。尽管如此,所得模型与实验结果非常吻合。数值示例成功说明了弹性模量的变化与相应体积随压力变化之间的关系。

英文摘要

The compaction mechanisms of SiO2 glass under pressure include, under certain conditions, a specific reduction of the elastic moduli and a complex inelastic behavior whose nature is not yet fully understood. In our work, we establish a variational framework describing the evolution of SiO2 glass under hydrostatic pressure. Building on previous work that presents a model for multi-phase transformations in inelastic materials, we assume isothermal conditions during the compaction process and interpret the typical sigmoidal stress response as an indicator of a binary phase transformation. During the process, two volume fractions coexist macroscopically, and microstructures such as shear bands or disclination patterns develop in between. We restrict our approach to resolving only the volume fractions, not the corresponding microstructures. Nevertheless, the resulting model is shown to match experimental findings very well. Numerical examples successfully illustrate the relationship between the changes in the elastic moduli and the corresponding change in volume with respect to pressure.

2606.19003 2026-06-18 cs.DC 新提交

On the Notions of Bounded Bypass, and How to Make any Deadlock-Free MUTEX Protocol Satisfy One of Them

关于有界旁路的概念,以及如何使任何无死锁的MUTEX协议满足其中之一

Rob van Glabbeek, Daniele Gorla, Myrthe Spronck

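AI summary Formally defines bounded bypass and its weakened variants (intermittent bounded bypass and post-doorway bounded bypass), and proves that the Bar-David algorithm upgrades any deadlock-free mutual exclusion protocol to satisfy bounded bypass (or intermittent bounded bypass) with a bound that is quadratic in the number of processes.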

Journal ref Distributed Computing Journal, Springer 2026

AI中文摘要

在互斥文献中,有界旁路长期以来被用作饥饿自由性的强化,但据我们所知,它仍然缺乏作为独立活性属性的令人满意的定义。此外,我们遇到了一些MUTEX协议,这些协议需要稍微弱化该概念才能满足。为了解决这些问题,我们首先提供了有界旁路的形式化定义(同时也修正了Raynal之前的定义),然后引入了门后和间歇有界旁路的概念,这是两种介于饥饿自由和有界旁路之间的活性属性。本质上,间歇有界旁路通过忽略在共享寄存器的一组有限写操作执行期间可能发生的旁路来弱化有界旁路。正交地,门后有界旁路忽略了在锁协议的有限初始阶段可能发生的旁路。此外,我们研究了Yoah Bar-David在1998年提出的一种算法,该算法用于增强任何无死锁MUTEX协议的活性属性,并证明:(1) 在原子寄存器设置中,该算法将任何无死锁互斥协议升级为有界旁路协议,其界限与进程数的二次方成正比;(2) 在安全寄存器和规则寄存器设置中,同一算法确保了有界旁路的间歇版本,仍然具有二次(但略有不同)的界限。最后,我们为本文定义的不同有界旁路概念提供了逻辑公式,并通过模型检查确认了本文的所有主张。这对工作的理论发展产生了积极影响,因为它使我们能够识别并纠正定义和证明中的小错误/歧义。

英文摘要

In the literature on mutual exclusion, bounded bypass has been used for a long time as a strengthening of starvation-freedom, but, to the best of our knowledge, it still lacks a satisfying definition as a liveness property on its own. Moreover, we have encountered MUTEX protocols for which this notion needs to be slightly weakened in order to be met. To solve these issues, we first provide a formal definition of bounded bypass (that also corrects a previous definition from Raynal) and then introduce the notions of post-doorway and intermittent bounded bypass, two liveness properties that lie between starvation-freedom and bounded bypass. Essentially, intermittent bounded bypass weakens bounded bypass by ignoring the possible bypasses that may happen during the execution of a certain finite set of write operations to shared registers. Orthogonally, post-doorway bounded bypass ignores the bypasses that may happen during a finite initial phase of the lock protocol. Furthermore, we study an algorithm proposed by Yoah Bar-David in 1998 to enhance the liveness properties of any deadlock-free MUTEX protocol and prove that: (1) in the setting of atomic registers, this algorithm upgrades any deadlock-free mutual exclusion protocol to a bounded bypass one, with a bound that is quadratic in the number of processes; and (2) in the setting of safe and regular registers, the very same algorithm ensures the intermittent version of bounded bypass, still with a quadratic (but slightly different) bound. Finally, we provide logical formulae for the different notions of bounded bypass defined in this paper and use them to confirm all claims made here, by using model checking. This had a positive impact on the theoretical development of the work, since it allowed us to identify and correct small mistakes/ambiguities in definitions and proofs.
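As a rough intuition for what a bypass bound measures, the following sketch counts, for a finite trace, how many critical-section entries by other processes occur while some process is waiting. It follows the informal textbook notion and not the paper's formal definition (which the abstract says corrects an earlier one); the event names `request` and `enter` and the trace format are assumptions made for illustration.

```python
def max_bypass(trace):
    """Largest number of times a waiting process is overtaken in `trace`.

    `trace` is a list of (process, event) pairs, where event is "request"
    (the process starts waiting) or "enter" (it enters the critical section).
    """
    waiting = {}  # process -> entries by others since its request
    worst = 0
    for process, event in trace:
        if event == "request":
            waiting[process] = 0
        elif event == "enter":
            waiting.pop(process, None)
            for other in waiting:
                waiting[other] += 1
                worst = max(worst, waiting[other])
        else:
            raise ValueError(f"unknown event: {event!r}")
    return worst


if __name__ == "__main__":
    example = [
        ("p", "request"),
        ("q", "request"), ("q", "enter"),
        ("r", "request"), ("r", "enter"),
        ("q", "request"), ("q", "enter"),
        ("p", "enter"),
    ]
    print("p was bypassed", max_bypass(example), "times")
```

A protocol satisfies a bypass bound `B` in this informal sense if no reachable trace makes the function return more than `B`. Checking all traces, rather than one, is what the model checking mentioned in the abstract is for.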

2606.18981 2026-06-18 cs.ET 新提交

Not Your Usual FFT: QFT$\rightarrow$FFT via Classical Quantum-Circuit Simulation

非寻常FFT:通过经典量子电路模拟从QFT到FFT

Stefano Markidis, Gilbert Netzer, Luca Pennati, Frej Larssen, Ivy Peng

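AI summary Proposes QFT→FFT, an HPC FFT library that computes the discrete Fourier transform by executing a quantum Fourier transform circuit on a classical quantum computer simulator, with performance on par with or better than FFTW.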

Comments 12 pages, 6 figures, to be published in the Proceedings of the International Conference on Computational Science 2026

AI中文摘要

我们介绍了QFT→FFT,这是一族HPC FFT库,通过在经典量子计算机模拟器上执行量子傅里叶变换(QFT)电路来计算离散傅里叶变换。输入数组直接映射到状态振幅,并带有显式的归一化/索引,使QFT成为FFT原语的即插即用替代品。一个与后端无关的规划器构建融合门调度和内存布局适配器,以提高算术强度并减少内存数据移动。我们在Google的C++ \texttt{qsim}之上实现了这一设计,并评估了OpenMP、AVX和CUDA后端。在AMD EPYC Zen2处理器上,我们的AVX性能与使用64线程的多线程FFTW相当。在NVIDIA A100上,CUDA后端在较大规模下比AMD EPYC Zen2上的AVX和FFTW快4倍以上。我们还采用了近似QFT(AQFT),它截断截止值$k$以上的小角度受控旋转,从而减少电路深度和运行时间,同时保持精度。

英文摘要

We introduce QFT$\rightarrow$FFT, a family of HPC FFT libraries that compute the discrete Fourier transform by executing a quantum Fourier transform (QFT) circuit on classical quantum computer simulators. Input arrays are mapped directly to state amplitudes with explicit normalization/indexing, making QFT a drop-in replacement for FFT primitives. A backend-agnostic planner builds a fused-gate schedule and memory layout adapters to increase arithmetic intensity and reduce memory data movement. We implement this design on top of Google's C++ \texttt{qsim} and evaluate OpenMP, AVX, and CUDA backends. On an AMD EPYC Zen2 processor, our AVX performance is on par with that of multithreaded FFTW, utilizing 64 threads. On an NVIDIA A100, the CUDA backend achieves more than $4\times$ lower time than both AVX and FFTW on AMD EPYC Zen2 at larger sizes. We also employ an approximate QFT (AQFT) that truncates small-angle controlled rotations beyond a cutoff $k$, reducing circuit depth and runtime while preserving accuracy.
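The role of the "explicit normalization/indexing" can be seen from the textbook relation between the two transforms: the QFT is a unitary that acts on a unit-norm amplitude vector with a positive phase sign, while the usual forward DFT is unnormalized and uses a negative sign. The sketch below checks this relation with dense O(N^2) sums; it does not simulate gates. The sign convention for the QFT and the conjugation trick are assumptions based on the standard definitions, and the library's own conventions may differ.

```python
import cmath
import math


def dft(x):
    """Reference O(N^2) DFT with the usual forward sign, exp(-2*pi*i*m*k/N)."""
    n = len(x)
    return [
        sum(x[m] * cmath.exp(-2j * cmath.pi * m * k / n) for m in range(n))
        for k in range(n)
    ]


def qft_amplitudes(state):
    """Apply the N x N QFT unitary (positive sign, 1/sqrt(N)) to amplitudes."""
    n = len(state)
    return [
        sum(state[m] * cmath.exp(2j * cmath.pi * m * k / n) for m in range(n))
        / math.sqrt(n)
        for k in range(n)
    ]


def dft_via_qft(x):
    """Compute the forward DFT of x using only the QFT on a normalized state."""
    norm = math.sqrt(sum(abs(v) ** 2 for v in x))
    if norm == 0.0:
        return [0j] * len(x)
    # Conjugating input and output flips the phase sign of the transform.
    state = [complex(v).conjugate() / norm for v in x]
    out = qft_amplitudes(state)
    scale = norm * math.sqrt(len(x))  # undo both normalizations
    return [scale * v.conjugate() for v in out]


if __name__ == "__main__":
    signal = [1.0, 2.0, 3.0, 4.0]
    for direct, simulated in zip(dft(signal), dft_via_qft(signal)):
        print(direct, simulated)
```

A circuit-based implementation would replace `qft_amplitudes` with Hadamard and controlled-phase gates applied to the state vector, which is where the fused-gate scheduling described in the abstract would come in.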

2606.18978 2026-06-18 cs.ET cs.AR 新提交

Nanoscale memristive devices: Threats and solutions

纳米级忆阻器件:威胁与解决方案

Amir M. Hajisadeghi, Javad Talafy, Hamid R. Zarandi

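AI summary Focuses on two memristive memory technologies, ReRAM and STT-RAM, analyzes reliability threats such as read/write errors and soft errors, surveys existing solutions, and finally discusses the potential of memristors as computing units that perform logic and arithmetic operations in crossbar arrays.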

AI中文摘要

由于其激励特性,忆阻器有望取代基于CMOS的存储器,后者在深亚微米工艺技术中面临各种功能挑战。与基于CMOS的存储器(如SRAM)相比,忆阻器具有非易失性、低泄漏和高密度等优点。在这方面,电阻式RAM(ReRAM)和自旋转移矩RAM(STT-RAM)忆阻器因其在工艺成熟度以及存储操作能量、存储延迟和面积等指标上的优越性,在其他基于忆阻器的存储技术中脱颖而出。因此,本章重点介绍这两种基于忆阻器的存储技术。尽管这些类型的存储器具有良好特性,但它们仍面临一些可靠性威胁。可靠性参数相互影响,检查其正面和负面效应对所提出解决方案的有效性具有重要影响。从一种观点来看,威胁可分为两类:(1)读/写错误和(2)软错误。在本章中,我们全面描述了这些威胁,并介绍了最先进的解决方案,这些方案使得忆阻器(特别是ReRAM和STT-RAM)能够在不同应用中得到广泛使用。最后,我们介绍了忆阻器作为计算单元的新兴能力,旨在最小化计算中的数据恢复,并展示了如何在交叉阵列中执行逻辑和算术计算。

英文摘要

Due to their attractive features, memristors are promising candidates for replacing CMOS-based memories, which face various functional challenges in deep submicron process technologies. Memristors are nonvolatile, have low leakage, and are dense in comparison to CMOS-based memories like SRAM. In this regard, resistive RAM (ReRAM) and spin-transfer-torque RAM (STT-RAM) memristors are distinguished among other memristor-based memory technologies, due to their superiority in process maturity and metrics such as memory operation energy, memory latency, and area. Hence, this chapter focuses on these two memristor-based memory technologies. Despite the good features of these types of memory, they suffer from some reliability threats. Reliability parameters affect each other, and examining their positive and negative effects has a significant impact on the effectiveness of the proposed solutions. In one view, the threats can be categorized into two classes: (1) read/write errors and (2) soft errors. In this chapter, we comprehensively describe these threats and present the state-of-the-art solutions that enable the widespread use of memristors, particularly ReRAM and STT-RAM, in different applications. Finally, we introduce the emerging ability of memristors to act as computing units, aiming to minimize data restoration in computing, and we show how to perform logic and arithmetic computation in a crossbar array.