arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 1971
专题追踪
2606.18977 2026-06-18 cs.LO 新提交

PaSTTeL: Parallel analysiS framework for Termination and non-Termination of Lasso programs

PaSTTeL:用于Lasso程序终止与非终止性分析的并行框架

Anissa Kheireddine, Souheib Baarir, Hugo De Sa Pereira Pinto

AI总结 提出PaSTTeL,一个模块化、通用的并行组合框架,统一了Lasso程序终止性分析的前沿方法,支持新算法集成、并发执行和嵌入外部项目,实验表现竞争力强。

详情
AI中文摘要

证明Lasso程序的终止性或非终止性是程序验证中的一个具有挑战性的问题。为了在统一的执行框架下整合当前最先进的方法,我们提出了PaSTTeL,一个用于Lasso程序终止和非终止性分析的模块化、通用的并行组合框架。PaSTTeL的设计目标是:(1) 促进新分析算法集成到组合中,(2) 并发执行注册的策略,以及(3) 作为一个自包含的库组件,可以无缝嵌入到任何需要(非)终止性分析的外部项目中。初步实验表明,PaSTTeL的一个实例在与最先进工具的竞争中表现出色。

英文摘要

Proving termination or non-termination of lasso programs is a challenging problem in program verification. To unify state-of-the-art approaches under a common execution framework, we present PaSTTeL, a modular and generic parallel portfolio framework for termination and non-termination analysis of lasso programs. PaSTTeL is designed to: (1) facilitate the integration of new analysis algorithms into the portfolio, (2) execute registered strategies concurrently, and (3) act as a self-contained library component that can be seamlessly embedded into any external project requiring (non-)termination analysis. Initial experiments demonstrate that an instantiation of PaSTTeL performs competitively against state-of-the-art tools.

2606.18966 2026-06-18 cs.DC cs.CR 新提交

A Composable CRDT Layer for Byzantine-Resilient Deterministic Reconstruction

一种可组合的拜占庭容错确定性重建CRDT层

Amos Brocco

AI总结 提出Melda,一种非侵入式增量状态CRDT,通过确定性状态重建实现拜占庭容错,确保即使存在任意更新注入也能收敛。

详情
AI中文摘要

无冲突复制数据类型(CRDT)无需协调即可保证强最终一致性,但通常假设参与者良性,并依赖验证或排除来处理拜占庭行为。我们通过确定性状态重建解决这个问题:不决定哪些更新是可接受的,而是将所有接受的更新纳入,仅一部分贡献于重建状态。我们在Melda中实例化该方法,Melda是一种用于JSON文档的非侵入式增量状态CRDT,并表明其重建模型即使在任意更新注入下也能保证收敛:对抗性更新要么被结构拒绝,要么作为重建过程的输入。我们形式化该模型,并证明从同一组更新派生状态的副本不会因歧义、遗漏或消息重排序而发散。我们进一步表明,认证、授权和机密性可以分层而不影响收敛。总体而言,该方法表明,通过将更新传播与状态派生解耦,可以实现拜占庭容错,允许外部传播或共识机制独立处理更新的一致性。

英文摘要

Conflict-free Replicated Data Types (CRDTs) ensure Strong Eventual Consistency without coordination, but typically assume benign participants and rely on validation or exclusion to handle Byzantine behavior. We address this problem through deterministic state reconstruction: rather than deciding which updates are admissible, all accepted updates are incorporated, while only a subset contributes to the reconstructed state. We instantiate this approach in Melda, a non-intrusive delta-state CRDT for JSON documents, and show that its reconstruction model guarantees convergence even under arbitrary update injection: adversarial updates are either structurally rejected or treated as inputs to the reconstruction process. We formalize this model and prove that replicas deriving state from the same set of updates cannot diverge despite equivocation, omission, or message reordering. We further show that authentication, authorization, and confidentiality can be layered without affecting convergence. Overall, this approach suggests that Byzantine tolerance can be achieved by decoupling update propagation from state derivation, allowing agreement on updates to be handled independently by external dissemination or consensus mechanisms.

2606.18965 2026-06-18 cs.GT 新提交

Convergence of Replicator Dynamics in the Repeated Prisoner's Dilemma with Restarts

重启重复囚徒困境中复制动力学的收敛性

Benedict Russell, Chin-wing Leung, Paolo Turrini

AI总结 研究在触发重启机制下自私智能体群体进行重复囚徒困境博弈时,复制动力学的收敛性,发现增加策略长度可促进合作稳定,并推导出稳定策略的解析条件。

详情
AI中文摘要

我们研究了一群自私智能体在触发重启机制下进行重复囚徒困境博弈。在该机制下,智能体与对手进行一系列对称博弈,如果行动不一致则重启交互。我们的工作聚焦于充分混合的智能体群体中复制动力学的收敛性,其中合作的出现受到个体剥削动机的挑战。通过构建对应的参数化标准型博弈,每个智能体采用长度为m的策略序列,我们证明增加策略长度能够使合作出现并稳定。我们为受限策略长度提供了精确的收敛保证,并在一般收益配置下给出了合作策略稳定性所需的参数条件。通过推导稳定序列数量的精确公式,我们发现了稳定性所需的结构性质:智能体必须学会先背叛——即所谓的“欺辱期”——然后才能无限合作。我们的分析表明,虽然存在最优合作序列,但智能体倾向于选择具有更长欺辱期的次优序列,这些序列拥有更大的吸引域。

英文摘要

We investigate a population of self-interested agents playing a repeated Prisoner's Dilemma under the trigger-restart mechanism. Under such a mechanism, agents play a sequence of symmetric games with their partner, and restart the interaction if their actions disagree. Our work focuses on the convergence of replicator dynamics in a well-mixed population of agents, where the emergence of cooperation is challenged by the individual incentive for exploitation. Formulating the corresponding parametrised normal-form game, with agents each adopting a length-m strategy sequence, we show that increasing the strategy length enables cooperation to emerge and stabilise. We provide exact convergence guarantees for restricted strategy lengths and, in the general payoff configuration, provide the necessary parametric conditions for the stability of cooperative strategies. By deriving an exact formula for the number of stable sequences, we find structural properties necessary for stability, as agents must learn to initially defect - the so-called "hazing period" - before cooperating indefinitely. Our analysis shows that, while optimal cooperative sequences exist, agents favour less-optimal sequences with a longer hazing period, which possess larger basins of attraction.

2606.18958 2026-06-18 cs.DC cs.OS 新提交

LiveStack: OS Support for Cluster-Scale Full-Stack Live Simulation

LiveStack: 集群规模全栈实时仿真的操作系统支持

Yiliang Wan, Haifeng Sun, Yihan Yang, Jonas Kaufmann, Antoine Kaufmann, Jialin Li

AI总结 提出LiveStack,一种基于Linux虚拟化栈的操作系统级方法,通过四个子系统实现集群规模全栈仿真的高保真与高性能,将仿真控制作为操作系统核心职责。

详情
AI中文摘要

集群规模的全栈仿真对于在部署前评估分布式软件栈和新兴硬件组件至关重要。这种仿真必须同时实现未修改生产栈的全栈保真度和迭代配置探索所需的仿真性能。然而,现有方法无法同时满足这两点。我们提出LiveStack,一种基于Linux虚拟化栈构建的集群规模全栈仿真的操作系统级方法。LiveStack包含四个子系统:面向仿真的调度、实时内存层次管理、仿真感知的IPC以及分布式仿真编排。它们共同在共享的仿真时间下协调实时和建模组件,同时控制共置实时主机之间的干扰。这些机制指向仿真原生的操作系统支持,其中仿真控制和编排成为操作系统的核心职责。

英文摘要

Cluster-scale full-stack simulation is essential for evaluating distributed software stacks and emerging hardware components before deployment. Such simulation must achieve both full-stack fidelity for the unmodified production stack and the simulation performance required for iterative configuration exploration. However, no existing method achieves both. We present LiveStack, an OS-level approach to cluster-scale full-stack simulation built on top of the Linux virtualization stack. LiveStack comprises four subsystems: simulation-oriented scheduling, live memory hierarchy management, simulation-aware IPC, and distributed simulation orchestration. Together, they coordinate live and modeled components under shared simulated time while controlling interference among co-located live hosts. These mechanisms point toward simulation-native OS support, where simulation control and orchestration become core OS responsibilities.

2606.18940 2026-06-18 cs.DC 新提交

Urban Limits as Design Constraints: Identifying Suitable Locations for Distributed, Photovoltaic-Powered Servers

城市限制作为设计约束:确定分布式光伏供电服务器的合适位置

Justin Chikhaoui, Thomas Leduc, Daniel Siret, Abdoulaye Gamatie

AI总结 本文提出一种方法,在结构、环境和社会限制下识别适合部署分布式服务器的城市位置,结合法律框架、城市项目、公民咨询和科学文献,评估可行场地类型,并以法国蒙彼利埃为例展示城市约束和本地资源如何影响分散式太阳能数字基础设施的可行性。

Comments Paper in Proceedings of LIMITS 2026: 12th Workshop on Computing within Limits, 2026-06-23-25, Online

详情
AI中文摘要

城市地区面临着日益增长的数字需求、有限的资源和社会约束的建筑环境之间的紧张关系。尽管边缘计算和雾计算等分布式计算范式被广泛认为是减少延迟和能耗的解决方案,但科学文献在很大程度上忽视了这些基础设施在城市中可以在物理和社会上部署的位置,并且通常忽略了城市限制、环境影响和公平性考虑。本文提出了一种方法,用于在结构、环境和社会限制下识别适合部署分布式服务器的城市位置。它完全依赖现有基础设施和人工表面,结合法律框架、正在进行的城市项目、公民咨询和科学文献,构建了一个基于地点的可行场地类型词汇表,并通过能源、空间和定性标准进行评估。应用于法国城市蒙彼利埃,我们的结果说明了城市限制和本地资源如何塑造分散式太阳能数字基础设施的可行性,并强调了在地化方法对于在城市限制内重新思考数字服务的价值。

英文摘要

Urban territories face growing tensions between increasing digital demand, limited resources, and socially constrained built environments. Although distributed computing paradigms such as edge and fog computing are widely presented as solutions for reducing latency and energy dissipation, the scientific literature largely overlooks where such infrastructures can be physically and socially deployed within cities, and typically neglects urban constraints, environmental impacts, and equity considerations. This paper proposes a methodology for identifying suitable urban locations for deploying distributed servers under structural, environmental, and social limits. Relying exclusively on existing infrastructures and anthropised surfaces, it combines legal frameworks, ongoing urban projects, citizen consultations, and scientific literature to construct a place-based glossary of viable site typologies, evaluated through energy, spatial, and qualitative criteria. Applied to the French city of Montpellier, our results illustrate how urban constraints and local resources shape the feasibility of decentralised, solar-powered digital infrastructures, and highlight the value of territorialised approaches for rethinking digital services within urban limits.

2606.18921 2026-06-18 cs.GT 新提交

Epistemic Pairwise Maximin Share

认知成对最大最小份额

Michal Feldman, Amos Fiat, Yael Nissan, Tomasz Ponitka

AI总结 提出认知成对最大最小份额(EPMMS)这一新的公平分配概念,研究其存在性与计算,在加性估值、双值估值等场景下取得存在性及高效算法结果。

Comments 41 pages, 5 figures

详情
AI中文摘要

我们引入了认知成对最大最小份额(EPMMS),这是不可分割物品公平分配中的一个新公平性概念。该设定中的两个基本概念是至任意物品无嫉妒(EFX)和成对最大最小份额(PMMS),其中PMMS比EFX更强。虽然EFX已被广泛研究,但对PMMS的了解要少得多。最近的工作表明,通过认知视角放松EFX可以在EFX问题上取得重大进展,这引发了一个问题:类似的方法是否也能增进我们对PMMS的理解。受此启发,我们启动了EPMMS的研究,即PMMS的认知松弛。EPMMS比EEFX更具挑战性:最近在认知EFX上取得进展的关键方法本质上无法扩展到EPMMS。我们建立了以下结果。(1)对于加性估值,$4/5$-EPMMS分配存在且可以高效计算。(2)对于双值估值,EPMMS分配存在且可以高效计算;事实上,我们获得了更强的认知群体最大最小份额(EGMMS)保证,这也加强了该设定下MMS分配的存在性。(3)我们证明了在两种MMS分配不一定存在的设定中EPMMS分配存在:具有三个加性主体或两类加性主体的实例。

英文摘要

We introduce epistemic pairwise maximin share (EPMMS), a new fairness notion for fair division of indivisible goods. Two fundamental notions in this setting are envy-freeness up to any item (EFX) and pairwise maximin share (PMMS), with PMMS being stronger than EFX. While EFX has been extensively studied, far less is known about PMMS. Recent work shows that relaxing EFX via an epistemic perspective leads to substantial progress on the EFX problem, raising the question of whether a similar approach can advance our understanding of PMMS. Motivated by this, we initiate the study of EPMMS, the epistemic relaxation of PMMS. EPMMS is more challenging than EEFX: the key approaches underlying recent progress on epistemic EFX inherently fail to extend to EPMMS. We establish the following results. (1) For additive valuations, $4/5$-EPMMS allocations exist and can be efficiently computed. (2) For bivalued valuations, EPMMS allocations exist and can be efficiently computed; in fact, we obtain the stronger guarantee of epistemic groupwise maximin share (EGMMS), which also strengthens the existence of MMS allocations for this setting. (3) We prove that EPMMS allocations exist in two settings where MMS allocations need not exist: instances with three additive agents or two types of additive agents.

2606.18900 2026-06-18 cs.DC 新提交

Compressed-Resident Genomics: Full-Pipeline Device-Resident GPU LZ77 Decode with Position-Invariant Random Access

压缩驻留基因组学:全流水线设备驻留GPU LZ77解码与位置不变随机访问

Yakiv Shavidze

AI总结 提出全设备驻留GPU解码流水线,实现高达260GB/s的FASTQ解码速度,并支持位置不变随机访问(0.362ms内解码任意读取),索引比.fai小6.3倍,同时通过范围解码策略避免VRAM溢出。

Comments 5 pages, 3 tables. ACEAPEX and aceapex_cuda included in lzbench 2.3

详情
AI中文摘要

基因组档案的增长速度超过了解压缩的跟进速度:欧洲核苷酸档案库持有数十PB的压缩数据,而gzip本质上是顺序的。GPU解压缩器(nvCOMP DEFLATE在A100上约50GB/s)解码整个文件,不支持随机访问;CPU基因组工具(CRAM、samtools)支持区域查找,但仅限于CPU速度。我们扩展了ACEAPEX(一种绝对偏移并行LZ77编解码器,包含在官方lzbench 2.3版本中),增加了我们先前工作中没有的三项贡献。首先,一个完整的设备驻留GPU解码流水线(熵和匹配解析均在设备上),在FASTQ上达到高达260GB/s的速度,弥补了早期论文中仅匹配阶段的差距。其次,位置不变随机访问,带有紧凑的坐标索引:任意读取在0.362ms内解码,比热samtools faidx快约6倍,读取到块索引比.fai小6.3倍。第三,一种范围解码策略,将输出大小与VRAM解耦,在50GB基因组上保持165.7GB/s的速度,而整个文件解码会耗尽内存。所有结果都是比特完美的。我们还测量了Meta的开源DietGPU ANS在H100上的解码速度为592GB/s,比我们目前使用的专有熵阶段更快,表明完全开源的高吞吐量堆栈是可行的。代码采用MIT许可证。

英文摘要

Genomic archives grow faster than decompression keeps up: the European Nucleotide Archive holds tens of petabytes of fastq.gz, and gzip is fundamentally sequential. GPU decompressors (nvCOMP DEFLATE at ~50GB/s on A100) decode whole files with no random access; CPU genomic tools (CRAM, samtools) support region seeks but only at CPU speed. We extend ACEAPEX, an absolute-offset parallel LZ77 codec included in the official lzbench 2.3 release, with three contributions absent from our prior work. First, a full device-resident GPU decode pipeline (entropy and match resolution both on-device) reaching up to 260GB/s on FASTQ, closing the match-phase-only gap of the earlier paper. Second, position-invariant random access with a compact coordinate index: an arbitrary read decodes in 0.362ms, ~6x faster than warm samtools faidx, with a read-to-block index 6.3x smaller than a .fai. Third, a range-decode strategy that decouples output size from VRAM, sustaining 165.7GB/s on a 50GB genome where whole-file decode runs out of memory. All results are bit-perfect. We also measure Meta's open DietGPU ANS on H100 at 592GB/s decode, faster than the proprietary entropy stage we currently use, showing a fully open high-throughput stack is viable. Code is MIT-licensed.

2606.18878 2026-06-18 cs.DS cs.DB cs.FL 新提交

Tractable Gap-Constraint Languages for Complex Event Recognition

复杂事件识别的可处理间隙约束语言

Antoine Amarilli, Florin Manea, Tina Ringleb, Markus L. Schmid

AI总结 研究带间隙约束的子序列匹配问题,提出左凸语言类,可在O(|D|(|u|+|C|))时间内求解,并用于复杂事件识别中的高效枚举。

Comments 50 pages

详情
AI中文摘要

对于字符串 $u, D \in \Sigma^*$,$u$ 在 $D$ 中的子序列嵌入是一个函数 $e \colon \{1, 2, \ldots, |u|\} \to \{1, 2, \ldots, |D|\}$,满足对每个 $i \in \{1, 2, \ldots, |u|-1\}$ 有 $e(i) < e(i+1)$,且 $u$ 的第 $i$ 个符号等于 $D$ 的第 $e(i)$ 个符号。$u$ 的间隙约束是一个三元组 $(i, j, L)$,其中 $1 \leq i < j \leq |u|$,$L$ 是 $\Sigma$ 上的正则语言。如果 $D$ 中严格位于 $e(i)$ 和 $e(j)$ 之间的因子是 $L$ 中的单词,则嵌入 $e$ 满足间隙约束 $(i, j, L)$。我们研究带间隙约束的子序列匹配问题,该问题与复杂事件识别(CER)相关:给定 $u, D \in \Sigma^*$ 和一组间隙约束 $C$,找到 $u$ 在 $D$ 中满足 $C$ 中所有间隙约束的嵌入。通常,子序列匹配是NP完全的,唯一已知的可处理变体限制了间隙约束的区间结构。在这项工作中,我们表明,如果间隙约束语言满足我们称之为左凸性的性质:只要 $u v w \in L$ 且 $v \in L$,则也有 $uv \in L$,那么我们可以相当高效地(实际上,在SETH下是最优的)在时间 $O(|D| (|u| + |C|))$ 内解决具有任意区间结构的间隙约束子序列匹配。左凸语言足够表达CER中考虑的有趣现实场景,例如长度约束 $L = \{w \mid a \leq |w| \leq b\}$,其中 $a, b \in \mathbb{N}$。我们还展示了如何使用我们的算法高效枚举所有满足条件的嵌入,这对于CER中的可能应用尤为重要。最后,我们展示了非左凸语言如何导致难解性,即如果除了长度约束外,还允许 $\{aa, \epsilon\}$ 作为唯一的非左凸约束语言,那么问题再次变为NP完全的。

英文摘要

For strings $u, D \in Σ^*$, a subsequence embedding of $u$ in $D$ is a function $e \colon \{1, 2, \ldots, |u|\} \to \{1, 2, \ldots, |D|\}$ with $e(i) < e(i+1)$ for every $i \in \{1, 2, \ldots, |u|-1\}$ and the $i$-th symbol of $u$ equals the $e(i)$-th symbol of $D$. A gap-constraint for $u$ is a triple $(i, j, L)$ with $1 \leq i < j \leq |u|$ and $L$ is a regular language over $Σ$. An embedding $e$ satisfies a gap-constraint $(i, j, L)$ if the factor of $D$ strictly between positions $e(i)$ and $e(j)$ is a word from $L$. We investigate the subsequence matching problem with gap-constraints, which is relevant in the context of complex event recognition (CER): given $u, D \in Σ^*$ and a set $C$ of gap-constraints, find an embedding of $u$ in $D$ that satisfies all gap-constraints from $C$. In general, subsequence matching is NP-complete and the only known tractable variants restrict the interval structure of the gap-constraints. In this work, we show that we can solve subsequence matching with gap-constraints with an arbitrary interval structure rather efficiently (in fact, optimally under SETH) in time $O(|D| (|u| + |C|))$ if the gap-constraint languages satisfy a property which we dub left-convexity: whenever $u v w \in L$ and $v \in L$, then also $uv \in L$. Left-convex languages are sufficiently expressive to model interesting real-world scenarios considered in CER, e.g., length constraints $L = \{w \mid a \leq |w| \leq b\}$ for $a, b \in \mathbb{N}$. We also show how our algorithm can be used in order to efficiently enumerate all satisfying embeddings, which is particularly relevant for possible applications in CER. Finally, we show how non-left-convex languages can lead to intractability, i.e., if in addition to length constraints we allow $\{aa, ε\}$ as the only non-left-convex constraint language, then the problem is NP-complete again.

2606.18855 2026-06-18 cs.SE 新提交

Toward Semantically-Seeded, Graph-Propagated Impact Analysis Across Software Artifacts: A Vision

面向语义种子与图传播的跨软件制品影响分析:一个愿景

Momil Seedat

AI总结 提出一种无需训练、可解释的融合方法,结合语义相似性与结构依赖,通过异构制品图与传播机制覆盖两种方法的盲点,实现跨需求-配置-服务-测试链的影响分析。

详情
AI中文摘要

当单个软件制品发生变化——一个需求、一个配置值或一个函数——工程师必须确定还有什么受到影响。现有的变更影响分析(CIA)工具往往孤立地依赖两种信号之一:从文本中恢复的语义相似性(信息检索可追溯性、代码搜索、嵌入),或结构依赖跟踪(调用图、IDE“查找用法”、测试影响选择)。每种方法都有其特有的盲点。语义驱动的方法会遗漏与变更没有共享词汇的受影响制品;结构驱动的方法会遗漏在意义上相关但未被边连接的制品,并且大多数仅对代码而非需求-配置-服务-测试链进行操作。我们主张一种无需训练且可解释的分析器,它在同一嵌入上融合两种信号。我们将系统建模为一个异构制品图,其类型化边通过静态分析恢复,通过余弦相似度计算相对于变更制品的语义先验,通过行归一化的传播矩阵进行多跳衰减传播,并通过单个可调权重λ融合两者。在一个支付子系统(5个标记的变更场景)上进行的小型但完整的概念验证显示了我们关心的机制:与变更没有文本重叠的制品仍然通过传播被恢复,而单独传播无法到达的辅助函数则通过语义层被恢复。融合是唯一覆盖两个盲点的配置,λ充当显式的精确率/召回率控制。借鉴四个公开记录的生成故障,我们认为相同的公式可以扩展到仅靠代码分析无法触及的操作制品(镜像、指标、仪表盘、数据模式)。

英文摘要

When a single software artifact changes - a requirement, a configuration value, or a function - engineers must determine what else is impacted. Existing change-impact-analysis (CIA) tooling tends to rely on one of two signals in isolation: semantic similarity recovered from text (information-retrieval traceability, code search, embeddings), or structural dependency following (call graphs, IDE "find usages", test-impact selection). Each has a characteristic blind spot. A semantically driven tool misses an impacted artifact whose text shares no vocabulary with the change; a structurally driven tool misses artifacts related in meaning but not joined by an edge, and most operate only over code rather than the Requirement-Config-Service-Test chain. We argue for a training-free and interpretable analyzer that fuses both signals over the same embeddings. We model the system as a heterogeneous artifact graph with typed edges recovered by static analysis, compute a semantic prior by cosine similarity to the changed artifact, propagate impact multi-hop with decay over a row-normalized propagation matrix, and blend the two with a single tunable weight lambda. A small but complete proof-of-concept on a payment subsystem (5 labelled change scenarios) shows the mechanism we care about: artifacts with zero textual overlap with the change are still recovered through propagation, and helper functions that propagation alone cannot reach are recovered through the semantic layer. The fusion is the only configuration that covers both blind spots, and lambda acts as an explicit precision/recall control. Drawing on four publicly documented production failures, we argue that the same formulation extends to operational artifacts (images, metrics, dashboards, data schemas) that code-only analysis cannot reach.

2606.18851 2026-06-18 eess.SY cs.SY 新提交

From Tokens to Energy Flexibility: Quantization-Enabled Demand Response for Data Centers with LLM Inference Workloads

从令牌到能量灵活性:面向LLM推理工作负载的数据中心量化使能需求响应

Bojun Du, Xiaoyi Fan, Ershun Du, Long Chen, Jianpei Han, Qingchun Hou, Ning Zhang, Chongqing Kang

AI总结 提出一种量化使能的能量管理框架,通过建立量化-功率模型和两阶段需求响应模型,实现多园区协同优化,降低数据中心运营成本34.3%。

Comments 10 pages, 7 figures

详情
AI中文摘要

大型语言模型(LLM)推理的快速增长正在造成显著的数据中心负载,在日益紧张的电网条件和需求响应(DR)要求下,这些负载面临着越来越多的能量管理挑战。传统的数据中心能量管理主要依赖于时间和空间上的工作负载转移以及园区级能量资产调度,但通常将LLM推理需求视为聚合负载。因此,这些方法未能利用LLM服务的内部特性,从而忽视了模型量化等LLM特定技术所提供的灵活性。为了释放这种灵活性,本文提出了一种面向电网响应型LLM推理数据中心的量化使能能量管理框架。首先,建立了一个量化-功率模型,将每个模型-量化配置映射到一个紧凑的可调度参数集。其次,开发了一个两阶段量化使能的需求响应模型,以考虑模型实例切换、请求路由和精度选择。第三,引入了一种多园区协同优化方法,通过将电网侧电力和碳信号与量化使能的需求响应模型相结合,参与需求响应。案例研究表明,所提出的框架在不减少服务令牌量的情况下,将数据中心总运营成本降低了34.3%,验证了模型量化作为电网响应型LLM数据中心能量管理的有效灵活性杠杆。

英文摘要

The rapid growth of large language model (LLM) inference is creating significant data-center loads that face increasing energy-management challenges under tightening grid conditions and demand response (DR) requirements. Conventional data-center energy management mainly relies on temporal and spatial workload shifting and campus-level energy asset scheduling, but it usually treats LLM inference demand as an aggregate load. As a result, these approaches fail to exploit the internal characteristics of LLM serving and therefore overlook the flexibility offered by LLM-specific techniques such as model quantization. To unlock this flexibility, this paper proposes a quantization-enabled energy management framework for grid-responsive LLM inference data centers. First, a quantization-to-power model is established to map each model--quantization configuration to a compact set of dispatchable parameters. Second, a two-stage quantization-enabled DR model is developed to account for model instance switching, request routing, and precision selection. Third, a multi-campus co-optimization method is introduced for DR participation by integrating grid-side electricity and carbon signals with the quantization-enabled DR model. Case studies show that the proposed framework reduces total data-center operating cost by 34.3\% without curtailing served token volume, validating model quantization as an effective flexibility lever for grid-responsive LLM data-center energy management.

2606.18822 2026-06-18 eess.SY cs.SY 新提交

Bridging Data-Driven and Model-Based Methods: A Learn-to-Optimize Architecture for Distributed Optimal Power Flow

桥接数据驱动与基于模型的方法:一种用于分布式最优潮流的学习优化架构

Yibo Ding, Zhao Xu, Yuhong Zhao, Jian Zhao, Jiaqi Ruan, Zhaoyang Dong

AI总结 提出一种将ADMM展开为深度神经网络并嵌入可微优化层的学习优化架构,实现分布式最优潮流的近瞬时可解释分布式决策,在最优性和可行性上优于现有数据驱动方法。

Comments This work has been submitted to the IEEE for possible publication on May 2026

详情
AI中文摘要

本文提出了一种学习优化(LTO)架构,用于分布式最优潮流(D-OPF),作为数据驱动和基于模型方法之间的桥梁。通过将交替方向乘子法(ADMM)展开为深度神经网络(NN)并嵌入可微优化层,我们的架构实现了近瞬时的可解释分布式决策。对于主流的D-OPF松弛形式,我们的架构做出的决策在最优性上与最先进的求解器相当,并且在可行性上优于现有的数据驱动方法。对比案例研究验证了我们的架构在最优性和可行性方面的有效性。

英文摘要

This letter proposes a learn-to-optimize (LTO) architecture for distributed optimal power flow (D-OPF) as the nexus between data-driven and model-based methods. By unfolding alternating direction method of multipliers (ADMM) into a deep neural network (NN) and embedding differentiable optimization layers, our architecture realizes near-instantaneous interpretable distributed decision-making. For mainstream relaxed formulations of D-OPF, the decisions from our architecture achieve comparable optimality with that of state-of-the-art solvers and excelled feasibility compared with existing data-driven approaches. Comparative case studies underpin the effectiveness of our architecture regarding the optimality and feasibility.

2606.18814 2026-06-18 cs.IR 新提交

LensKit-Auto

LensKit-Auto的改进与增强

Max Breit, Anass Amezian El Idrissi, Rishikesh Giriraj Kulkarni, Luca Quade

AI总结 本文改进了LensKit-Auto框架,使其能自动寻找适合数据集的推荐算法和超参数组合,增强了易用性和可视化功能,并适配了最新版本的LensKit框架。

详情
AI中文摘要

推荐系统在视频流、社交媒体和数字市场等领域有广泛应用,但选择合适的算法和超参数是一个持续挑战。本文改进了LensKit-Auto框架,使其能够自动寻找适合数据集的推荐算法和超参数组合。LensKit-Auto的主要优势在于其易用性,用户可将其作为黑箱输入数据集,获取最适合该数据集的算法和超参数信息。本文还更新了LensKit-Auto以适配最新版本的LensKit框架,实现了Tree Parzen Estimator等新功能,更新了文档,并增加了优化过程的可视化能力。此外,本文还适配了现有的元学习框架,生成适合LensKit-Auto的元数据集,以未来可能整合元学习。这些改进使LensKit-Auto更加完善,甚至非专业用户也能找到适合其应用场景的算法。

英文摘要

Recommender systems have a wide area of application, e.g. in fields like video streaming, social media, or digital marketplaces. But, for a recommender-system, finding the right algorithm with the right hyperparameters is a reoccurring challenge. There is no one-fits-all solution, since the performance of one algorithm can vary immensely on different data sets. Due to the challenges of finding the right algorithm and the broad use of recommender-systems, it is of interest to create an Automated Recommender System (AutoRecSys) that takes on the task of finding the right algorithm-hyperparameter-combination for a given data set. In this work, we present the enhancement of LensKit-Auto, a framework introduced by Vente et al., that solves exactly this task of finding a fitting algorithm-hyperparameter-combination. LensKit-Auto's biggest strength lies in its ease of use, where it operates as a black-box, into which the user can feed their data set and receive the information of which algorithm and hyperparameters work best on this data set. In this work, we bring LensKit-Auto up to date, so that it works with the new version of its underlying framework, LensKit. We also implement further functionalities, such as the Tree Parzen Estimator as an additional optimization method, the ability to reuse the found algorithm, updated documentation, and the ability to visualize the optimization process. We also adapt an existing meta-learning framework to generate a suitable meta-dataset for LensKit-Auto, which could enable the integration of meta-learning into LensKit-Auto in the future. The presented changes bring LensKit-Auto up to date and enhance its usability, so that even non-experts in the field can find the right algorithm for their use case.

2606.18795 2026-06-18 cs.SI 新提交

Opinion Polarization in LLM-Based Social Networks: Manipulation and Mitigation

基于LLM的社交网络中的意见极化:操纵与缓解

Ali Safarpoor Dehkordi, Mohammad Shirzadi, Ahad N. Zehmakan

AI总结 研究在基于大语言模型模拟的社交网络中,对手如何通过有限预算操纵意见极化,并评估两种防御机制(反应性和主动性)的效果,发现两者均无法完全恢复基线极化状态。

Comments 14 pages, 7 figures

详情
AI中文摘要

在线社交网络在面对试图通过操纵意见来放大意见极化的对手时有多脆弱?缓解这种操纵有多困难?现有研究使用意见动态的数学模型来探讨这一问题。虽然这些模型提供了有价值的理论见解,但它们依赖于关于交互、消息内容和意见更新的简化假设,限制了它们能够捕捉的对抗策略及其发现在现实环境中的适用性。基于大语言模型的模拟提供了一种更丰富的替代方案:智能体可以被赋予多样化的角色,通过自然语言进行交流,并以上下文相关的方式回应说服性或对抗性内容。这使得研究难以用经典数学模型表示的操纵策略成为可能。据我们所知,本研究首次在基于LLM的模拟社交网络框架中系统分析了极化的放大和缓解。在我们的框架中,具有多样化角色的LLM智能体通过交换自然语言帖子在社交网络上进行交互,并相应地更新他们的意见。我们表明,即使预算有限的对手也能显著增加极化。然后,我们研究了两类防御机制:反应性缓解(指派特定用户主动对抗操纵)和主动性干预(通过不针对特定用户的一般机制增加抵抗力)。我们的结果表明,尽管这些机制减少了对抗攻击的影响,但它们通常无法将网络恢复到其基线极化状态。这些发现表明,这两种方法都不能完全克服网络的脆弱性,凸显了此类攻击的潜在风险。

英文摘要

How vulnerable are online social networks to adversaries who seek to amplify opinion polarization by manipulating opinions, and how difficult is it to mitigate such manipulation? Existing studies have examined this question using mathematical models of opinion dynamics. While these models offer valuable theoretical insights, they rely on simplified assumptions about interactions, message content, and opinion updates, limiting the adversarial strategies they can capture and the applicability of their findings to real-world settings. Large language model (LLM)-based simulations provide a richer alternative: agents can be assigned diverse personas, communicate through natural language, and respond to persuasive or adversarial content in a context-dependent way. This enables the study of manipulation strategies that are difficult to represent using classical mathematical models. To the best of our knowledge, this study provides the first systematic analysis of polarization amplification and mitigation in an LLM-based simulated social network framework. In our framework, LLM agents with diverse personas interact over a social network by exchanging natural language posts and updating their opinions accordingly. We show that even an adversary with a limited manipulation budget can considerably increase polarization. We then study two classes of defense mechanisms: reactive mitigations, which assign specific users to actively counter manipulation, and proactive interventions, which increase resistance through general mechanisms not tied to particular users. Our results show that although these mechanisms reduce the impact of adversarial attacks, they generally do not restore the network to its baseline polarization state. These findings suggest that neither approach fully overcomes the vulnerability of the network, highlighting the potential risk of such attacks.

2606.18789 2026-06-18 eess.SY cs.SY 新提交

PowerAgentBench-SS: A Benchmark for Agentic AI in Power System Steady-State Studies

PowerAgentBench-SS:电力系统稳态研究中智能体AI的基准测试

Costas Mylonas, Magda Foti, Andrea Pomarico, Matheus Duarte, Qian Zhang, Emmanouel Varvarigos

AI总结 提出PowerAgentBench-SS基准框架,用于评估LLM智能体在电力系统稳态研究中执行工程工作流的能力,通过工具API、验证预算和风险敏感指标区分智能体性能。

详情
AI中文摘要

电力系统基准测试通常评估数值求解器、预测模型或顺序控制器。这些基准是必要的,但它们不直接测试大型语言模型(LLM)智能体是否能执行工程工作流:检查电网案例、选择工具、调用模拟器、筛选 contingencies、提出可接受的缓解措施、验证结果并生成可审计的证据链。本文介绍了PowerAgentBench-SS,一个用于评估电力系统运行和规划研究中工具使用智能体的稳态基准框架。该基准向智能体公开案例数据、动作约束、工具API和验证预算,同时隐藏的评估器重新计算物理有效性并对提交的报告进行评分。我们定义了智能体接口、工具契约、证据日志和风险敏感指标,包括提交召回率、证据支持召回率、发现召回率、假安全惩罚、严重性遗憾、残余违规分数、动作成本、工具使用效率和工作流诊断。为了使框架具体化,我们在可复现的直流热N-2 contingency搜索试点中实例化该协议,使用确定性IEEE 39节点运行点变体,包括脚本基线、LLM JSON命令适配器、三个本地托管的Ollama LLM智能体和一个OpenAI API智能体。结果表明为什么仅求解器或仅答案评估是不够的:智能体不仅通过顶级contingency发现来区分,还通过验证预算使用、显式提交、类型强制、重复验证、证据支持报告和缓解行为来区分。

英文摘要

Power system benchmarks usually evaluate numerical solvers, prediction models, or sequential controllers. These benchmarks are necessary, but they do not directly test whether a Large Language Model (LLM) agent can execute an engineering workflow: inspect a grid case, select tools, call simulators, screen contingencies, propose admissible mitigations, validate results, and produce an auditable evidence trail. This paper introduces PowerAgentBench-SS, a steady-state benchmark framework for evaluating tool-using agents in power system operation and planning studies. The benchmark exposes public case data, action constraints, a tool API, and a validation budget to an agent, while a hidden evaluator recomputes physical validity and scores the submitted report. We define the agent interface, tool contract, evidence log, and risk-sensitive metrics, including submitted recall, evidence-backed recall, found recall, false-safe penalties, severity regret, residual violation score, action cost, tool-use efficiency, and workflow diagnostics. To make the framework concrete, we instantiate the protocol in a reproducible DC thermal N-2 contingency-search pilot on deterministic IEEE 39-bus operating-point variants, with scripted baselines, an LLM JSON-command adapter, three locally hosted Ollama LLM agents, and one OpenAI API agent. The results show why solver-only or answer-only evaluation is insufficient: agents are distinguished not only by top-contingency discovery, but also by validation-budget use, explicit submission, type coercions, duplicate validations, evidence-backed reporting, and mitigation behavior.

2606.18771 2026-06-18 cs.CR 新提交

A Predictive Neural Network Architecture for Early Detection of Low-Rate Cyberattacks

一种用于早期检测低速率网络攻击的预测性神经网络架构

Mert Nakıp

AI总结 提出IDQS框架,结合RTP-QoS预测神经网络和PDM决策模型,通过预测与实际的QoS差异早期检测LDoS攻击,在SDN-SlowRate-DDoS和CIC-IDS2017数据集上分别达到79%和91%的检测准确率,推理时间仅0.28秒。

Journal ref Nakıp, M. (2026). A predictive neural network architecture for early detection of low-rate cyberattacks. Knowledge-Based Systems, 343, 115995

详情
AI中文摘要

低速率拒绝服务(LDoS)攻击因其隐蔽和持久的特性,对物联网网络构成重大挑战,常常逃避传统入侵检测系统。本文提出IDQS(通过QoS预测进行入侵检测),一种轻量级且主动的早期LDoS攻击检测框架。IDQS集成了两个新的关键组件:(i)RTP-QoS,一种递归趋势预测神经网络,基于历史流量模式学习和预测未来的服务质量(QoS),以及(ii)PDM,一种成对决策模型,评估预测QoS与实际QoS之间的差异以识别潜在攻击。在公开的SDN-SlowRate-DDoS和CIC-IDS2017数据集上评估,IDQS在大多数攻击场景中分别实现了超过79%和91%的检测准确率,具有高召回率和低假阴性率,同时保持仅0.28秒的端到端推理时间。结果证明了IDQS在资源受限的物联网环境中实时部署的有效性和效率。

英文摘要

Low-Rate Denial of Service (LDoS) attacks pose a significant challenge to IoT networks due to their subtle and prolonged nature, often evading traditional intrusion detection systems. This paper presents IDQS (Intrusion Detection via QoS Prediction), a lightweight and proactive framework for early LDoS attack detection. IDQS integrates two new key components: (i) RTP-QoS, a Recurrent Trend Predictive Neural Network that learns and forecasts future Quality of Service (QoS) based on historical traffic patterns, and (ii) PDM, a Pairwise Decision Model that evaluates discrepancies between predicted and actual QoS to identify potential attacks. Evaluated on the public SDN-SlowRate-DDoS and CIC-IDS2017 datasets, IDQS respectively achieves over 79% and 91% detection accuracy across most attack scenarios with high recall and low false negatives, while maintaining an end-to-end inference time of just 0.28 seconds. The results demonstrate the effectiveness and efficiency of IDQS for real-time deployment in resource-constrained IoT environments.

2606.18764 2026-06-18 cs.NI 新提交

Direct-V2X Support with 5G Network-based Communications: Performance, Challenges and Solutions

基于5G网络的直连V2X支持:性能、挑战与解决方案

M. C. Lucas-Estañ, B. Coll-Perales, T. Shimizu, J. Gozálvez, T. Higuchi, S. Avedisov, O. Altintas, M. Sepulcre

AI总结 研究分析了5G V2N2V通信支持关键V2X服务的可行性,评估了端到端延迟,并提出了应对非对称部署挑战的解决方案。

Journal ref IEEE Network, vol. 37, no. 4, pp. 200-207, July/August 2023

详情
AI中文摘要

本研究分析了使用基于5G网络的车辆到网络到车辆(V2N2V)通信支持关键V2X服务的可行性。研究评估了在单运营商和多运营商场景下不同网络部署中5G V2N2V通信的端到端延迟。研究表明,基于MEC的网络部署可以通过5G V2N2V通信支持关键V2X服务。然而,这需要使用本地对等点和共享数据中心或MEC联盟来应对非对称网络部署带来的挑战。这为V2N2V通信补充直连车辆到车辆(V2V)连接以提高可靠性或在侧链路网络拥塞时卸载流量提供了可能性。

英文摘要

This study analyzes the feasibility of supporting critical V2X services using 5G network-based Vehicle-to-Network-to-Vehicle (V2N2V) communications. The study evaluates the end-to-end latency of 5G V2N2V communications under different network deployments in single and multi-operator scenarios. The study shows that critical V2X services can be supported using 5G V2N2V communications over MEC-based network deployments. However, this requires the use of local peering points and shared data centers or MEC federation to address challenges arising from asymmetric network deployments. This opens the possibility for V2N2V communications to complement direct Vehicle-to-Vehicle (V2V) connections for increased reliability or to offload traffic under sidelink network congestion.

2606.18763 2026-06-18 cs.NI 新提交

An open-source implementation and validation of 5G NR Configured Grant for URLLC in ns-3 5G LENA: a scheduling case study in Industry 4.0 scenarios

面向URLLC的5G NR配置授权在ns-3 5G LENA中的开源实现与验证:工业4.0场景下的调度案例研究

Ana Larrañaga, M. Carmen Lucas-Estañ, Sandra Lagén, Zoraze Ali, Imanol Martinez, Javier Gozálvez

AI总结 本文首次在开源5G NR仿真器ns-3 5G-LENA中实现了配置授权(CG),并通过工业4.0场景验证了其在不同调度策略下的时延性能,结果表明与先前分析研究一致,且高效资源利用对满足关键工业服务需求至关重要。

Journal ref Journal of Network and Computer Applications, Volume 215, 2023, 103638

详情
AI中文摘要

工厂正经历向高成本效益、零缺陷制造的数字化转型,这需要能够满足严格时延和可靠性要求的通信网络。5G及未来网络被设计用于支持超可靠低时延通信(URLLC)。然而,上行传输采用动态调度会因请求和分配无线资源的信令而引入额外时延。为降低时延,5G NR定义了配置授权(CG),它预分配上行资源,无需每包调度请求。实现URLLC功能的精确仿真工具对于评估5G网络支持时间关键型服务的能力以及研究新解决方案至关重要。然而,此类工具的可用性有限,据作者所知,目前尚无开源5G NR仿真器支持CG。为弥补这一空白,本文提出了在开源5G NR仿真器中首次实现CG。具体而言,CG已集成到ns-3 5G-LENA系统级仿真器中,并公开发布。此外,正交频分多址(OFDMA)实现得到增强,以更好地反映5G NR的灵活性。该实现通过工业4.0场景进行验证,分析了不同调度策略下CG实现的时延性能。结果表明,5G-LENA获得的时延与先前分析研究报告的结果高度吻合。此外,研究强调了高效无线资源利用对于降低时延和满足关键工业服务需求的重要性。

英文摘要

Factories are undergoing a digital transformation towards cost-efficient, zero-defect manufacturing, creating the need for communication networks capable of meeting stringent latency and reliability requirements. 5G and beyond networks are being designed to support Ultra-Reliable Low-Latency Communications (URLLC). However, the use of dynamic scheduling for uplink transmissions introduces additional latency due to the signaling required to request and allocate radio resources. To reduce this latency, 5G NR defines Configured Grant (CG), which pre-allocates uplink resources and eliminates the need for per-packet scheduling requests. Accurate simulation tools implementing URLLC features are essential to evaluate the capability of 5G networks to support time-critical services and to investigate new solutions. Nevertheless, the availability of such tools is limited and, to the best of the authors' knowledge, no open-source 5G NR simulator currently supports CG. To address this gap, this paper presents the first implementation of CG in an open-source 5G NR simulator. Specifically, CG has been integrated into the ns-3 5G-LENA system-level simulator and made publicly available. In addition, the Orthogonal Frequency Division Multiple Access (OFDMA) implementation has been enhanced to better reflect the flexibility of 5G NR. The implementation is validated through Industry 4.0 scenarios, where the latency performance achieved with CG under different scheduling policies is analyzed. Results show that the latency obtained with 5G-LENA closely matches that reported in previous analytical studies. Furthermore, the study highlights the importance of efficient radio resource utilization to reduce latency and satisfy the requirements of critical industrial services.

2606.18762 2026-06-18 cs.NI 新提交

5G UE and Network Asset Administration Shells for the Integration of 5G and Industry 4.0 Systems

5G用户设备和网络资产管理壳用于5G与工业4.0系统的集成

Jorge Gómez-Jerez, Jorge Cañete-Martín, M. Carmen Lucas-Estañ, Javier Gozalvez

AI总结 提出首个公开的完整5G系统资产管理壳设计,包括UE和网络AAS,遵循5G-ACIA、工业4.0平台和3GPP标准,以简化5G与工业4.0系统的集成。

Journal ref Proceedings of 2024 IEEE International Conference on Emerging Technologies and Factory Automation (IEEE ETFA 2024), Sept. 2024, Padova, Italy

详情
AI中文摘要

5G是实现智能制造全面数字化的基础技术。资产管理壳(AAS)的使用可以促进5G与工业4.0系统和应用的集成,同时最小化与5G网络管理相关的复杂性。本研究提出了首个完整5G系统AAS的设计与实现,并已向社区公开[1]。它包括一个5G UE(用户设备)AAS和一个5G NW(网络)AAS,这些AAS按照5G-ACIA指南以及工业4.0平台和3GPP标准设计。这些AAS旨在提供和暴露5G所需的数据和能力,以促进5G与工业4.0系统和应用的集成。

英文摘要

5G is a fundamental technology for the full digitalization of smart manufacturing. The use of Asset Administration Shells (AAS) can facilitate the integration of 5G with Industry 4.0 systems and applications while minimizing the complexities associated with the 5G network management. This study presents the design and implementation of the first full 5G system AAS that is openly released to the community [1]. It includes a 5G UE (User Equipment) AAS and a 5G NW (network) AAS that have been designed following the 5G-ACIA guidelines as well as the Plattform Industrie 4.0 and 3GPP standards. The AASs have been defined to provide and expose the data and capabilities of 5G necessary to facilitate the integration of 5G with Industry 4.0 systems and applications.

2606.18761 2026-06-18 eess.SY cs.SY 新提交

LQR based stabilization of an 1D heat equation with advection and memory effects

基于LQR的具有对流和记忆效应的一维热方程镇定

Bhargav Pavan Kumar Sistla, Vivek Natarajan

AI总结 针对含指数记忆项和对流的一维热方程,通过LQR方法设计状态反馈律实现指数镇定,并验证了可镇定性条件。

详情
AI中文摘要

我们推导了一个在运动流体中热传递的一维模型,该模型包含傅里叶传导、指数衰减记忆项以及在绝热边界条件下的对流。我们数值构造了一个有界状态反馈律,使得闭环解对于每个初始状态以至少$\omega>0$的衰减率指数衰减到零,即我们解决了$\omega$-镇定问题。我们明确描述了状态算子$A$的特征值,其中一部分收敛到一个有限的负聚点,该聚点设定了可达到衰减率的上界。由于$A$缺乏紧预解式,我们证明了谱是其特征值的闭包,每个特征值具有有限代数重数,并利用这一点验证了可镇定性。对于低于聚点界的$\omega$,只要控制算子$B$满足非正交条件,问题可解。为了计算增益,我们构造了一个LQR问题并求解有限维近似:对于每个$n$,我们构造逼近$A$、$B$的$A_n$、$B_n$,并求解相关的代数Riccati方程得到增益$K_n$。我们证明,对所有足够大的$n$,可以选择$K_n$使得$A_n+B_nK_n$的每个特征值满足$\operatorname{Re}\lambda<-\omega$,并且我们建立了$(A_n+\omega I,B_n)$关于$n$的一致可镇定性。因此,对于大的$n$,这些增益解决了原系统的$\omega$-镇定问题。我们通过一个数值例子验证了结果。

英文摘要

We derive a one-dimensional model for heat transfer in a moving fluid incorporating Fourier conduction, an exponentially decaying memory term, and advection under thermally insulated boundary conditions. We numerically construct a bounded state feedback law driving the closed-loop solution to zero exponentially with decay rate at least $ω>0$ for every initial state, i.e., we solve the $ω$-stabilization problem. We explicitly describe the eigenvalues of the state operator $A$, a subset of which converges to a finite negative accumulation point that sets the upper bound on the achievable decay rate. Since $A$ lacks compact resolvent, we show that the spectrum is the closure of its eigenvalues, each of finite algebraic multiplicity, and use this to verify stabilizability. For $ω$ below the accumulation bound, the problem is solvable provided the control operator $B$ satisfies a non-orthogonality condition. To compute gains, we formulate an LQR problem and solve finite-dimensional approximations: for each $n$ we construct $A_n$, $B_n$ approximating $A$, $B$ and solve the associated algebraic Riccati equation for a gain $K_n$. We show that, for all sufficiently large $n$, $K_n$ can be chosen so every eigenvalue of $A_n+B_nK_n$ satisfies $\operatorname{Re}λ<-ω$, and we establish stabilizability of $(A_n+ωI,B_n)$ uniformly in $n$. Hence, for large $n$, these gains solve the $ω$-stabilization problem for the original system. We validate the results numerically with an example.

2606.18741 2026-06-18 cs.DC 新提交

ReMP: Low-Downtime Runtime Model-Parallelism Reconfiguration for LLM Serving

ReMP:面向LLM服务的低停机时间运行时模型并行重配置

Haipeng Yuan, Kaining Zheng, Yongshu Bai, Yuchen Zhang, Yunquan Zhang, Baodong Wu, Xiang Gao, Daning Cheng

AI总结 提出ReMP框架,通过解耦拓扑与运行时状态、二维KV缓存迁移等技术,实现LLM推理服务中模型并行拓扑的在线动态调整,将重配置停机时间从分钟级降至1-7秒。

详情
AI中文摘要

当前大语言模型(LLM)推理系统普遍采用张量并行(TP)和流水线并行(PP)的组合来部署超大规模模型。然而,现有系统将模型并行拓扑视为静态配置,无法在运行时灵活调整。这种刚性设计与实际场景中动态变化的推理负载存在根本矛盾。最先进的系统缺乏在线重配置能力,只能通过重启服务来切换配置,导致数分钟的服务中断、KV缓存丢失以及高昂的重计算开销。为解决此问题,本文提出ReMP,一种支持低停机时间的运行时模型并行重配置框架。ReMP通过三项关键技术实现动态调整:(1)将模型并行拓扑与运行时状态解耦,避免完全重建服务;(2)设计二维KV缓存迁移机制,在TP/PP变化后保留可复用的缓存状态;(3)实现端到端的在线重配置。实验表明,ReMP能在7B到70B参数规模的模型上,在1-7秒内完成大多数拓扑切换,相比重启方法实现数十至上百倍的加速。此外,在动态负载下,ReMP显著优于固定配置,在TTFT、TPOT和输出吞吐量方面表现出更优性能。

英文摘要

Current large language model (LLM) inference systems universally deploy ultra-large-scale models using a combination of Tensor Parallelism (TP) and Pipeline Parallelism (PP). However, existing systems treat the model parallelism topology as a static configuration that cannot be flexibly adjusted at runtime. This rigid design creates a fundamental contradiction with the dynamically changing inference workloads in real-world scenarios. State-of-the-art systems lack online reconfiguration capabilities and can only switch configurations by restarting the service, resulting in several minutes of service interruption, KV cache loss, and prohibitive recomputation overhead. To address this problem, this paper presents ReMP, a runtime model parallelism reconfiguration framework that supports low downtime. ReMP achieves dynamic adjustment through three key techniques: (1) decoupling the model parallelism topology from runtime state to avoid full service reconstruction; (2) designing a two-dimensional KV cache migration mechanism to preserve reusable cache states after TP/PP changes; and (3) implementing end-to-end online reconfiguration. Experiments demonstrate that ReMP can complete most topology switches within 1-7 seconds on models ranging from 7B to 70B parameters, achieving speedups of tens to over a hundred times compared to the restart approach. Moreover, ReMP significantly outperforms fixed configurations under dynamic workloads, delivering superior performance in terms of TTFT, TPOT, and output throughput.

2606.18737 2026-06-18 cs.NI 新提交

Robustness Analysis of Australia's Internet Using a Multilayer Network Model

澳大利亚互联网的鲁棒性分析:基于多层网络模型

Benjamin Lang, Matthew Roughan, Mengbin Ye

AI总结 利用多层网络模型分析澳大利亚互联网中独立提供商网络的交互,研究对等互联提供的冗余以及共享风险链路组引入的脆弱性,发现多样性即使在共享风险下也能提供冗余。

Comments 7 pages, 5 figures, data and codebase available at https://github.com/lang-b/au-multilayer-model/

详情
AI中文摘要

澳大利亚依赖由多个长途链路网络构建的互联网。我们研究这些独立提供商网络的交互,以探究这些网络之间的对等互联如何为单个网络故障提供冗余,以及共享风险链路组(SRLGs)故障引入的潜在脆弱性——即不同提供商的看似独立的链路由于共享管道等共同物理依赖而同时失效。我们引入一个广义的多层网络模型,其中每一层代表一个互联网服务提供商(ISP)的网络,并包含一个促进ISP网络之间互连的互联网交换点(IXP)层。我们构建了一个澳大利亚特定模型,包含六个主要ISP。对该网络进行故障分析,发现多样性即使在存在共享风险的情况下也能提供冗余,表明多样化网络生态系统的重要性。

英文摘要

Australia depends on an Internet built from multiple networks of long-haul links. We study the interactions of these independent provider networks to investigate how the peering between these networks provides redundancy for failures on a single network, as well as the potential vulnerabilities introduced by failures of Shared Risk Link Groups (SRLGs), whereby ostensibly independent links of different providers fail simultaneously due to joint physical dependencies such as shared conduits. We introduce a generalised multilayer network model in which each layer represents the network of an individual Internet Service Provider (ISP), along with an Internet Exchange Point (IXP) layer that facilitates interconnections between ISP networks. We construct an Australia-specific model, consisting of six major ISPs. A failure analysis is performed on this network, revealing that diversity provides redundancy, even in the presence of shared risks, indicating the importance of a diverse network ecosystem.

2606.18710 2026-06-18 cs.CR 新提交

Image Prompt Reconstruction Attacks on Distributed MLLM Inference Frameworks

分布式多模态大模型推理框架上的图像提示重建攻击

Xinjian Luo, Hongyan Chang, Jianxin Wei, Yuncheng Wu, Xiaofeng Gao, Meikang Qiu, Ting Yu, Xue Liu

AI总结 研究分布式MLLM推理中中间嵌入泄露图像提示的风险,提出两种被动黑盒攻击方法MPAA和IEDA,实现像素级和语义级图像重建。

详情
AI中文摘要

分布式大语言模型(LLM)推理框架将孤立的消费级设备连接起来进行大规模模型推理,大幅降低了硬件限制。然而,最近的研究表明,参与者之间传输的中间嵌入可能会泄露私有提示。随着LLM演变为多模态LLM(MLLM),这种风险已扩展到文本之外:图像提示包含丰富的视觉和语义信息,使其中间嵌入高度隐私敏感。然而,分布式MLLM推理中的图像提示泄露问题在很大程度上尚未被探索。在本文中,我们研究了分布式MLLM框架中由中间嵌入引起的输入图像隐私风险。我们首先分析了从图像像素到中间表示的信息流。由于图像和文本嵌入通常在MLLM各层中交织,我们设计了一种图像嵌入提取算法作为重建攻击的前提,在我们的实验中,该算法在几乎所有MLLM层上实现了100%的提取准确率。在此基础上,我们开发了两种被动的黑盒图像重建攻击:MPAA和IEDA,反映了来自知识有限、能力有限的正常参与者的现实威胁。MPAA通过逐块信息提取和组装进行细粒度像素级重建,而IEDA通过嵌入引导的扩散生成进行粗粒度语义重建。我们在四个代表性的MLLM系列上评估了我们的攻击:Gemma 3、Phi 4 Multimodal、Qwen 2.5 VL和Llama 4 Scout。结果显示在各种设置下均具有一致优越的重建性能。我们进一步分析了MoE架构、图像预处理、模型大小和文本-图像依赖关系对攻击性能的影响。据我们所知,这是对MLLM图像重建攻击的首次研究。

英文摘要

Distributed large language model (LLM) inference frameworks connect isolated consumer-grade devices for large-scale model inference, substantially reducing hardware constraints. However, recent studies show that intermediate embeddings transmitted among participants can leak private prompts. As LLMs evolve into multimodal LLMs (MLLMs), this risk extends beyond text: image prompts contain rich visual and semantic information, making their intermediate embeddings highly privacy-sensitive. Yet, image-prompt leakage in distributed MLLM inference remains largely unexplored. In this paper, we investigate privacy risks to input images caused by intermediate embeddings in distributed MLLM frameworks. We first analyze the information flow from image pixels to intermediate representations. Since image and text embeddings are often intertwined across MLLM layers, we design an image embedding extraction algorithm as a prerequisite for reconstruction attacks, achieving 100% extraction accuracy across almost all MLLM layers in our experiments. Building on this, we develop two passive black-box image reconstruction attacks, MPAA and IEDA, reflecting realistic threats from normal participants with limited knowledge and capability. MPAA performs fine-grained pixel-level reconstruction via patch-wise information extraction and assembly, while IEDA performs coarse-grained semantic reconstruction through embedding-guided diffusion generation. We evaluate our attacks on four representative MLLM families: Gemma 3, Phi 4 Multimodal, Qwen 2.5 VL, and Llama 4 Scout. Results show consistently superior reconstruction performance in various settings. We further analyze the effects of MoE architecture, image preprocessing, model size, and text-image dependency on attack performance. To our knowledge, this is the first study of image reconstruction attacks on MLLMs.

2606.18696 2026-06-18 cs.DS 新提交

Compact multi-text index for circular Cartesian tree matching

紧凑的多文本索引用于循环笛卡尔树匹配

Roman Pauli, Eric Osterkamp, Dominik Köppl

AI总结 提出基于BWT的笛卡尔扩展BWT索引结构,支持动态扩展和静态压缩两种变体,用于高效循环笛卡尔树匹配。

Comments Implementation of https://doi.org/10.1007/s00224-025-10257-4

详情
AI中文摘要

笛卡尔树匹配(CTM)是一种结构模式匹配方法,用于识别具有相同笛卡尔树拓扑的序列,适用于具有自然变异性的数据,其中精确比较几乎没有语义意义。虽然CTM的理论算法已被广泛研究,但实际实现的系统实证评估仍然很少。本文提出了笛卡尔扩展Burrows-Wheeler变换(ceBWT)的实现,这是一种基于BWT的CTM索引结构。该实现支持动态可扩展和静态压缩两种索引变体。

英文摘要

Cartesian tree matching (CTM) is a structural pattern matching approach that identifies sequences with the same Cartesian tree topology, making it suitable for data with natural variability where exact comparisons carry little semantic meaning. While theoretical algorithms for CTM have been studied extensively, systematic empirical evaluations of practical implementations remain rare. This article presents an implementation of the Cartesian Extended Burrows-Wheeler Transform (ceBWT), a BWT-based index structure for CTM. The implementation supports both a dynamically extendable and a statically compressed index variant.

2606.18692 2026-06-18 cs.CY cs.HC 新提交

Through the WordStream Glass: Revisiting Quantitative Encoding for Qualitative Learning Analytics

透过WordStream之镜:重新审视用于定性学习分析的定量编码

Huyen N. Nguyen, Kathleen Bowe, Minh-Huyen Nguyen, Kit Thompson, Caleb M. Trujillo

AI总结 通过混合方法专家研究,探讨定量可视化工具如何支持定性教育研究,发现频率可视化作为定性分析入口的潜力与局限性,并提出整合定量与定性方法的设计启示。

Comments 9 pages, 5 figures

详情
AI中文摘要

数据驱动的学习分析可以揭示学生群体随时间变化的趋势,帮助教师改善学习环境。WordStream是一种用于主题演变的可视化习语,已在两个平台中实现:Journal Data Dashboard(用于分析形成性评估)和WordStream Maker(用于创建自定义可视化)。先前的工作为教育构建了这些平台(Vis4Ed),本文则探讨反向方向(Ed4Vis):定性教育研究能告诉我们关于构建更好可视化工具的什么?我们进行了一项混合方法专家研究(n=10),其中具有定性方法和课堂评估专业知识的STEM教育研究人员使用这两个平台分析来自数据可视化课程的学生日志回答。经过两轮主题分析和确认性检查,我们报告了涵盖工具体验、学科使用背景以及最重要的核心认识论分歧的主题。一些教师-研究人员认为基于频率的可视化是定性分析的有效切入点;其他人则警告说,它可能会掩盖罕见但关键的反应。我们将这些发现综合为未来工具的设计启示,以更好地整合定量技术与定性探究。所有补充材料均可在此https URL获取。

英文摘要

Data-driven learning analytics can surface trends across a student cohort over time, helping instructors improve the learning environment. WordStream, a visualization idiom for topic evolution, has been instantiated in two platforms toward this goal: the Journal Data Dashboard, for analyzing formative assessments, and WordStream Maker, for authoring custom visualizations. Where the prior work built these platforms for education (Vis4Ed), here we examine the reverse direction (Ed4Vis): what can qualitative education research tell us about building better visualization tools? We conducted a mixed-methods expert study (n=10) in which STEM education researchers with expertise in qualitative methods and classroom assessment used both platforms to analyze student journal responses from a data visualization course. Across two cycles of thematic analysis with confirmatory checking, we report themes spanning tool experience, disciplinary context of use, and, most importantly, a core epistemological dissensus. Some instructor-researchers regarded frequency-based visualization as a productive entry point to qualitative analysis; others cautioned it can obscure rare but critical responses. We synthesize these findings into design implications for future tools that better integrate quantitative technique with qualitative inquiry. All Supplementary Materials are available at https://osf.io/z2f8d.

2606.18673 2026-06-18 cs.CR 新提交

Understanding and Mitigating Prompt Leaking Attacks in Real-World LLM-Based Applications

理解并缓解真实世界基于LLM的应用中的提示泄露攻击

Yong Yang, Chong Fu, Tong Zhang, Rui Zeng, Qingming Li, Tianyu Du, Zonghui Wang, Shouling Ji, Wenzhi Chen

AI总结 本研究系统测量了1200个真实世界基于LLM的应用,发现超过80%会泄露系统提示,并提出了基于注意力漂移分析的AREA防御方法,在保持可用性的同时有效防止泄露。

Comments Accepted at ACM CCS 2026

详情
AI中文摘要

基于大型语言模型(LLM)的应用依赖系统提示来编码核心逻辑和开发者定义的约束,使得这些提示成为重要的知识产权。然而,系统提示容易受到提示泄露攻击。尽管先前的工作在受控环境中展示了此类攻击,但其在真实世界部署中的普遍性、原因和防御措施仍不清楚。本文对真实世界基于LLM的应用中的提示泄露进行了系统研究。我们测量了六个主要商业平台上的1200个应用,发现超过80%的部署在现实对抗性查询下泄露了系统提示,有时会暴露敏感信息,如第三方API密钥。我们还表明,现有防御措施往往无法在不降低可用性的情况下防止泄露。为了解释这些失败,我们进行了注意力层面的机制分析,并识别出注意力漂移,其中查询-键对齐偏差和softmax放大导致LLM逐渐忽略防御约束。基于这一洞察,我们提出了AREA,一种实用的防御方法,通过可优化的软提示重新锚定模型的注意力。实验和真实世界案例研究表明,AREA在匹配最先进防御措施的防泄露能力的同时,将平均可用性提高了33%以上,并将优化开销降低了近3倍。我们的负责任披露导致两家受影响的供应商将这些泄露归类为中危漏洞。

英文摘要

Large language model (LLM)-based applications rely on system prompts to encode core logic and developer-defined constraints, making these prompts important intellectual property. However, system prompts are vulnerable to prompt leaking attacks. Although prior work has shown such attacks in controlled settings, their prevalence, causes, and defenses in real-world deployments remain unclear. This paper presents a systematic study of prompt leaking in real-world LLM-based applications. We measure 1,200 applications across six major commercial platforms and find that over 80% of deployments leak system prompts under realistic adversarial queries, sometimes exposing sensitive information such as third-party API keys. We also show that existing defenses often fail to prevent leakage without degrading usability. To explain these failures, we conduct an attention-level mechanistic analysis and identify attention drift, where query-key alignment bias and softmax amplification cause LLMs to progressively ignore defensive constraints. Guided by this insight, we propose AREA, a practical defense that re-anchors the model's attention using an optimizable soft prompt. Experiments and real-world case studies show that AREA matches the leakage resistance of state-of-the-art defenses while improving average usability by over 33% and reducing optimization overhead by nearly 3x. Our responsible disclosure led two affected vendors to classify these leaks as medium-severity vulnerabilities.

2606.18671 2026-06-18 cs.HC 新提交

HANSEL: Extracting Breadcrumbs from Web Agent Trajectories for Interactive Verification

HANSEL: 从Web智能体轨迹中提取面包屑用于交互式验证

Yujin Zhang, Daye Nam

AI总结 提出HANSEL系统,从AI智能体轨迹中提取可交互验证的证据,减少用户审查负担,在基准测试中达到83.7%精确率和88.9%召回率,用户研究显示显著降低任务完成时间和感知努力。

Comments 13 pages, 6 figures

详情
AI中文摘要

AI Web智能体可以代表用户执行复杂的多步骤任务,例如搜索产品、比较选项和进行购买。然而,验证智能体输出的正确性仍然困难。现有的透明机制,包括完整轨迹日志、源链接、截图和LLM生成的摘要,将验证视为被动阅读任务,让用户筛选大量日志或信任可能不忠实的解释。我们提出HANSEL(突出显示智能体导航步骤作为证据链接),一个从Web智能体轨迹中提取交互式、可验证证据的系统。给定一个智能体轨迹,HANSEL提取证据页面和片段,并将其呈现为可导航、交互式的视图,并保留相关页面状态(例如,应用的过滤器、搜索查询和滚动位置),使用户能够验证智能体如何得出其答案。当智能体的答案无法追溯到任何访问过的页面时,HANSEL明确标记此缺口。在来自AssistantBench和Online-Mind2Web的45个任务上的技术评估显示,HANSEL在识别证据页面方面达到83.7%的精确率和88.9%的召回率,同时将轨迹量减少61.6%。在14名参与者的受控用户研究中,与标准智能体界面相比,HANSEL显著减少了任务完成时间和感知努力,而参与者在可用性、验证易用性和错误识别方面对其评价显著更高。我们的结果表明,将验证重新定义为交互式活动,而不是被动消费智能体解释,可以导致对AI智能体更高效的人工监督。

英文摘要

AI web agents can perform complex, multi-step tasks such as searching for products, comparing options, and making purchases on behalf of users. However, verifying the correctness of an agent's output remains difficult. Existing transparency mechanisms, including full trajectory logs, source links, screenshots, and LLM-generated summaries, treat verification as a passive reading task, leaving users to sift through overwhelming logs or trust potentially unfaithful explanations. We present HANSEL (Highlighting Agent Navigation Steps as Evidence Links), a system that extracts interactive, verifiable evidence from web-agent trajectories. Given an agent trajectory, HANSEL extracts evidence pages and snippets and presents them as navigable, interactive views with relevant page state preserved (e.g., applied filters, search queries, and scroll positions), enabling users to verify how the agent arrived at its answer. When the agent's answer cannot be traced to any visited page, HANSEL explicitly flags this gap. A technical evaluation on 45 tasks from AssistantBench and Online-Mind2Web shows that HANSEL achieves 83.7% precision and 88.8% recall in identifying evidence pages, while reducing trajectory volume by 61.6%. In a controlled user study with 14 participants, HANSEL significantly reduced task completion time and perceived effort compared to a standard agent interface, while participants rated it significantly higher on usability, verification ease, and error identification. Our results demonstrate that reframing verification as an interactive activity, rather than passive consumption of agent explanations, leads to more efficient human oversight of AI agents.

2606.18665 2026-06-18 cs.GT 新提交

EFX Allocations Exist on Multi-Graphs

多图上EFX分配的存在性

Mahyar Afshinmehr, Arash Ashuri, Pouria Mahmoudkhan, Kurt Mehlhorn, Amir Mohammad Shahrezaei

AI总结 研究不可分割物品的公平分配,证明在多图且可取消估值下EFX分配存在,并给出多项式时间算法。

详情
AI中文摘要

我们研究不可分割物品在智能体间的公平分配,重点关注嫉妒限制。核心公平概念是无嫉妒至任意物品(EFX),要求任何对其他智能体的嫉妒在移除后者束中的任意单个物品后消失。EFX分配的存在性被认为是公平分配中的一个重大开放问题。迄今为止,它仅在有限设置中被确立。Christodoulou等人[2023]证明了图估值下EFX分配的存在性。在该设置中,智能体对应底层图的节点,物品对应边,且任何物品仅对相应边的端点有正值。他们的证明关键依赖于图是简单的限制,即对于任意一对智能体,至多有一个物品对两者都有价值。对于多图估值,其中多个物品可能被同一对智能体估值,仅已知部分结果。Amanatidis等人[2024]和Kaviani等人[2025]分别获得了EFX的2/3和√2/2近似;Kaviani等人[2024]在受限可加估值下建立了存在性;Afshinmehr等人[2025a]在假设包含非平行边的最短环长度至少为4时证明了存在性。在本文中,我们通过证明在可取消估值(可加估值函数的严格超类)下多图实例中EFX分配的存在性,解决了这一开放问题。我们的证明是算法性的,并在估值函数可取消时多项式时间内计算出此类分配。这项工作为适用于任意数量智能体的少数EFX存在性结果做出了贡献。

英文摘要

We study the fair allocation of indivisible goods among agents, with a focus on limiting envy. A central fairness notion is envy-freeness up to any good (EFX), which requires that any envy toward another agent vanishes after the removal of any single good from the latter's bundle. The existence of EFX allocations is considered a major open problem in fair division. So far, it has only been established in limited settings. Christodoulou et al. [2023] proved the existence of EFX allocations for graphical valuations. In this setting, the agents correspond to the nodes of an underlying graph, and the goods correspond to the edges, and any good has positive value only for the endpoints of the corresponding edge. Their proof crucially relies on the restriction that the graph is simple, meaning that for any pair of agents, there is at most one good that has value to both. For multigraph valuations, where multiple goods may be valued by the same pair of agents, only partial results are known. Amanatidis et al. [2024] and Kaviani et al. [2025] obtained 2/3 and sqrt(2)/2 approximations of EFX, respectively; Kaviani et al. [2024] established existence under restricted additive valuations; and Afshinmehr et al. [2025a] proved existence under the assumption that the shortest cycle containing non-parallel edges has length at least 4. In this paper, we resolve this open problem by proving the existence of EFX allocations for multigraph instances under cancelable valuations, a strict superclass of additive valuation functions. Our proof is algorithmic and computes such allocations in polynomial time when the valuation functions are cancelable. This work contributes to the small number of EFX existence results that apply to an arbitrary number of agents.

2606.18662 2026-06-18 cs.DS cs.CC 新提交

On (Non-)Isomorphism of Self-Dual Lattices and Codes

自对偶格与自对偶码的(非)同构问题

Huck Bennett, Kyle Fridberg

AI总结 研究自对偶格上的格同构问题(LIP),提出2^{n/2+o(n)}时间随机算法和coNP协议,利用特征向量等结构性质。

详情
AI中文摘要

最近一系列由密码学应用驱动的工作研究了格同构问题(LIP)的复杂性。本文研究自对偶格 $\cal{L} \subset \mathbb{R}^n$ 上的LIP,这类格在许多应用中自然出现。我们的主要结果是针对自对偶格上LIP的 $2^{n/2 + o(n)}$ 时间随机算法和一个 $\mathsf{coNP}$ 协议。这些结果扩展了最近关于ZLIP(判定一个格是否与 $\mathbb{Z}^n$ 同构的问题)的工作。特别地,前者扩展了Bennett、Ganju、Peetathawachai和Stephens-Davidowitz(Eurocrypt, 2023)以及Ducas(Des. Codes Cryptogr., 2024)的 $2^{n/2 + o(n)}$ 时间算法。后者扩展了Hunkenschröder(Math. Prog. Series A, 2024)的 $\mathrm{ZLIP} \in \mathsf{coNP}$ 结果。我们的结果利用了自对偶格 $\cal{L} \subset \mathbb{R}^n$ 的两个关键结构性质:(1)每个这样的格 $\cal{L}$ 同构于 $\cal{L}_0 \oplus \mathbb{Z}^r$,其中 $\cal{L}_0$ 是自对偶格且 $\lambda_1(\cal{L}_0)^2 \geq 2$;(2)每个这样的格 $\cal{L}$ 具有特征向量,即存在向量 $\mathbf{w} \in \cal{L}$ 使得对每个 $\mathbf{v} \in \cal{L}$,有 $\langle\mathbf{v}, \mathbf{w}\rangle \equiv \langle\mathbf{v}, \mathbf{v}\rangle \pmod{2}$。我们的结果使用了Elkies和Gaulter关于具有长最短特征向量的格的一系列工作,并且如果对Elkies(Math. Res. Lett., 1995)的一个相关问题给出肯定回答,结果可以加强。我们还研究了自对偶码上的置换码等价问题(PCE),并观察到类似的结构性质意味着对某些此类码上的PCE存在多项式时间算法。这给出了一个具有大核的码的自然类,其中PCE是容易的。

英文摘要

A recent line of work motivated by cryptographic applications has studied the complexity of the Lattice Isomorphism Problem (LIP). In this work, we study LIP on self-dual lattices $\cal{L} \subset \mathbb{R}^n$, which appear naturally in many applications. Our main results are a $2^{n/2 + o(n)}$-time randomized algorithm for LIP and a $\mathsf{coNP}$ protocol for LIP on a broad class of self-dual lattices. These results extend recent work on ZLIP, the problem of deciding whether a lattice is isomorphic to $\mathbb{Z}^n$. In particular, the former result extends the $2^{n/2 + o(n)}$-time algorithms for ZLIP of Bennett, Ganju, Peetathawachai, and Stephens-Davidowitz (Eurocrypt, 2023) and of Ducas (Des. Codes Cryptogr., 2024). The latter result extends the $\mathrm{ZLIP} \in \mathsf{coNP}$ result of Hunkenschröder (Math. Prog. Series A, 2024). Our results leverage two key structural properties of self-dual lattices $\cal{L} \subset \mathbb{R}^n$: (1) every such lattice $\cal{L}$ is isomorphic to $\cal{L}_0 \oplus \mathbb{Z}^r$ for some self-dual lattice $\cal{L}_0$ with $λ_1(\cal{L}_0)^2 \geq 2$, and (2) every such lattice $\cal{L}$ has \emph{characteristic vectors}, i.e., there exist vectors $\mathbf{w} \in \cal{L}$ such that for every $\mathbf{v} \in \cal{L}$, $\langle\mathbf{v}, \mathbf{w}\rangle \equiv \langle\mathbf{v}, \mathbf{v}\rangle \pmod{2}$. Our results use a line of work by Elkies and Gaulter on lattices with long shortest characteristic vectors, and can be strengthened assuming a positive answer to a related question of Elkies (Math. Res. Lett., 1995). We also study Permutation Code Equivalence (PCE) on self-dual codes, and we observe that similar structural properties imply a polynomial-time algorithm for PCE on certain such codes. This gives a natural class of codes with large hull for which PCE is easy.

2606.18651 2026-06-18 cs.CR 新提交

TGCM: Topic-Guided Generative Disentanglement of Interleaved APT Technique Sequences

TGCM: 主题引导的生成式解缠交错APT技术序列

Guo-Wei Wong, Ming-Chuan Yang, Shou-De Lin, Wang-Chien Lee, Meng~Chang Chen

AI总结 提出TGCM框架,利用一致性模型和MITRE ATT&CK主题先验,从交错的多源APT技术序列中直接解缠出单源攻击链,在合成和真实数据集上优于15种基线方法。

Comments 13 pages,

详情
AI中文摘要

在企业环境中,多个高级持续性威胁(APT)活动经常同时展开,产生审计日志,其中不同行为者(来源)的攻击技术随时间交错出现。这种设置自然引出一个未知K交错序列分离(UKISD)问题:从交错的技术序列中恢复多个潜在活动,同时推断其数量和技术级分配。现有方法从统计模式挖掘到基于溯源的分析,通常假设单活动设置或依赖刚性启发式,限制了其在重叠活动、共享技术和可变执行长度等现实条件下的有效性。我们提出主题引导的一致性建模(TGCM),一个生成式解缠框架来解决UKISD问题。TGCM利用一致性模型学习从交错的多活动观察到结构化单活动序列的直接逆映射,仅需单步推理。为了支持语义连贯的攻击链,TGCM整合了源自MITRE ATT&CK叙述的主题引导先验,在分解过程中提供高层战术约束。我们在合成数据集、已建立的混合数据集以及来自DARPA TC-E3和TC-E5的事件轨迹上评估TGCM,并与15个代表性基线(涵盖模式挖掘、深度学习和基于LLM的方法)进行比较。结果表明,在高度交错和技术共享情况下,TGCM的分离鲁棒性优于基线,并且TGCM能够零样本泛化到自然交错的野外基准(DARPA TC-E5),无需重新训练。

英文摘要

In enterprise environments, multiple Advanced Persistent Threat (APT) campaigns often unfold concurrently, producing audit logs in which attack techniques across actors (sources) are interleaved over time. This setting naturally gives rise to an Unknown-K Interleaved Sequence Demixing (UKISD) problem: recovering multiple latent campaigns from an interleaved technique sequence while jointly inferring their number and technique-level assignments. Existing approaches, ranging from statistical pattern mining to provenance-based analysis, typically assume single-campaign settings or rely on rigid heuristics, limiting their effectiveness under realistic conditions involving overlapping campaigns, shared techniques, and variable execution lengths. We present Topic-Guided Consistency Modeling (TGCM), a generative disentanglement framework to tackle the UKSID problem. TGCM leverages Consistency Models to learn a direct inverse mapping from interleaved multi-campaign observations to structured single-campaign sequences in a single inference step. To favor semantically coherent attack chains, TGCM incorporates a topic-guided prior derived from MITRE ATT\&CK narratives, providing high-level tactical constraints during decomposition. We evaluate TGCM on synthetic datasets, established mixed datasets, and incident traces from DARPA TC-E3 and TC-E5, comparing against 15 representative baselines spanning pattern mining, deep learning, and LLM-based methods. Results indicate improved separation robustness over baselines under heavy interleaving and technique sharing, and show that TGCM generalizes zero-shot to a naturally interleaved in-the-wild benchmark (DARPA TC-E5) without retraining.

2606.18642 2026-06-18 cs.DC 新提交

HI-HCQC: A Tightly-Coupled Hardware Interface with High-Efficiency Communication for Hybrid Classical-Quantum Computing

HI-HCQC:面向混合经典-量子计算的高效通信紧耦合硬件接口

Shibo Liang, Junchao Wang, Zeyuan Wang, Feng Wang, Xiaoyu Li, Lei Li, FuDong Liu, Zheng Shan

AI总结 针对经典-量子混合计算中数据交换延迟高、吞吐量低的问题,提出基于RFSoC的紧耦合硬件接口HI-HCQC,集成PCIe Gen3 x8接口,实现微波脉冲合成、量子比特读出和高吞吐数据传输,实验表明能降低延迟并提升任务吞吐量。

详情
AI中文摘要

混合经典-量子计算需要在经典处理器和量子控制硬件之间频繁进行数据交换。然而,现有的超导量子控制系统通常通过以太网等松耦合接口连接,导致通信延迟高、任务吞吐量受限。为解决这一问题,我们提出了HI-HCQC,一种基于RFSoC的硬件接口,用于紧耦合的混合经典-量子计算。HI-HCQC集成了高速RF-DAC、RF-ADC、可编程逻辑、嵌入式处理器、时钟同步电路和PCIe Gen3 x8接口,支持直接微波脉冲合成、量子比特读出以及主机服务器与量子测量控制单元之间的高吞吐数据传输。实验结果表明,HI-HCQC支持六个控制通道和一个复用读出通道,实现了稳定的微波生成和采集,并成功执行了量子比特光谱学、拉比振荡、T1测量、单发读出、随机基准测试和CZ门表征。与传统控制系统相比,HI-HCQC降低了代表性量子门和电路任务的端到端执行延迟,并显著提高了任务吞吐量。这些结果表明,PCIe耦合的RFSoC控制硬件为可扩展且高效的混合经典-量子计算系统提供了实用基础。

英文摘要

Hybrid classical-quantum computing requires frequent data exchange between classical processors and quantum control hardware. However, existing superconducting quantum control systems are commonly connected through loosely coupled interfaces such as Ethernet, resulting in high communication latency and limited task throughput. To address this issue, we present HI-HCQC, an RFSoC-based hardware interface for tightly coupled hybrid classical-quantum computing. HI-HCQC integrates high-speed RF-DACs, RF-ADCs, programmable logic, embedded processors, clock synchronization circuits, and a PCIe Gen3 x8 interface, enabling direct microwave pulse synthesis, qubit readout, and high-throughput data transfer between host servers and quantum measurement-control units. Experimental results show that HI-HCQC supports six control channels and one multiplexed readout channel, achieves stable microwave generation and acquisition, and successfully performs qubit spectroscopy, Rabi oscillation, T1 measurement, single-shot readout, randomized benchmarking, and CZ-gate characterization. Compared with a conventional control system, HI-HCQC reduces end-to-end execution latency for representative quantum gate and circuit tasks and significantly improves task throughput. These results demonstrate that PCIe-coupled RFSoC control hardware provides a practical foundation for scalable and efficient hybrid classical-quantum computing systems.