arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

检索范围排序方式

检索时间范围

重置

HOT 人工智能、机器人等 9

cs.AI 人工智能 cs.CV 计算机视觉 cs.CL 自然语言处理 cs.RO 机器人 cs.LG 机器学习 cs.SD 声音 cs.ET 新兴技术 eess.AS 音频语音 eess.IV 图像视频

CS 计算机 41

cs 计算机 cs.AI 人工智能 cs.AR 硬件架构 cs.CC 计算复杂性 cs.CE 计算工程 cs.CG 计算几何 cs.CL 自然语言处理 cs.CR 密码安全 cs.CV 计算机视觉 cs.CY 计算机与社会 cs.DB 数据库 cs.DC 分布式计算 cs.DL 数字图书馆 cs.DM 离散数学 cs.DS 数据结构 cs.ET 新兴技术 cs.FL 形式语言 cs.GL 综述文献 cs.GR 图形学 cs.GT 博弈论 cs.HC 人机交互 cs.IR 信息检索 cs.IT 信息论 cs.LG 机器学习 cs.LO 计算机逻辑 cs.MA 多智能体 cs.MM 多媒体 cs.MS 数学软件 cs.NA 数值分析 cs.NE 神经进化 cs.NI 网络架构 cs.OH 其他计算机 cs.OS 操作系统 cs.PF 性能 cs.PL 编程语言 cs.RO 机器人 cs.SC 符号计算 cs.SD 声音 cs.SE 软件工程 cs.SI 社会信息网络 cs.SY 系统控制

ECON 经济学 4

econ 经济学 econ.EM 计量经济 econ.GN 一般经济 econ.TH 理论经济

EESS 电气与系统 5

eess 电气与系统 eess.AS 音频语音 eess.IV 图像视频 eess.SP 信号处理 eess.SY 系统控制

MATH 数学 33

math 数学 math.AC 交换代数 math.AG 代数几何 math.AP 偏微分方程 math.AT 代数拓扑 math.CA 经典分析 math.CO 组合数学 math.CT 范畴论 math.CV 复变函数 math.DG 微分几何 math.DS 动力系统 math.FA 泛函分析 math.GM 一般数学 math.GN 一般拓扑 math.GR 群论 math.GT 几何拓扑 math.HO 历史综述 math.IT 信息论 math.KT K理论 math.LO 逻辑 math.MG 度量几何 math.MP 数学物理 math.NA 数值分析 math.NT 数论 math.OA 算子代数 math.OC 优化控制 math.PR 概率 math.QA 量子代数 math.RA 环与代数 math.RT 表示论 math.SG 辛几何 math.SP 谱理论 math.ST 统计理论

PHYSICS 物理 55

astro-ph 天体物理 astro-ph.CO 宇宙学 astro-ph.EP 地球行星 astro-ph.GA 星系物理 astro-ph.HE 高能天体 astro-ph.IM 天文仪器 astro-ph.SR 太阳恒星 cond-mat 凝聚态 cond-mat.dis-nn 无序神经 cond-mat.mes-hall 介观纳米 cond-mat.mtrl-sci 材料科学 cond-mat.other 其他凝聚态 cond-mat.quant-gas 量子气体 cond-mat.soft 软凝聚态 cond-mat.stat-mech 统计力学 cond-mat.str-el 强关联电子 cond-mat.supr-con 超导 gr-qc 广义相对论 hep-ex 高能实验 hep-lat 格点高能 hep-ph 高能唯象 hep-th 高能理论 math-ph 数学物理 nlin 非线性科学 nlin.AO 自适应系统 nlin.CD 混沌动力学 nlin.CG 胞自动机 nlin.PS 斑图孤子 nlin.SI 可积系统 nucl-ex 核物理实验 nucl-th 核物理理论 physics 物理 physics.acc-ph 加速器物理 physics.ao-ph 大气海洋 physics.app-ph 应用物理 physics.atm-clus 原子分子团簇 physics.atom-ph 原子物理 physics.bio-ph 生物物理 physics.chem-ph 化学物理 physics.class-ph 经典物理 physics.comp-ph 计算物理 physics.data-an 数据分析 physics.ed-ph 物理教育 physics.flu-dyn 流体动力学 physics.gen-ph 普通物理 physics.geo-ph 地球物理 physics.hist-ph 物理史哲 physics.ins-det 仪器探测 physics.med-ph 医学物理 physics.optics 光学 physics.plasm-ph 等离子体 physics.pop-ph 科普物理 physics.soc-ph 物理与社会 physics.space-ph 空间物理 quant-ph 量子物理

Q-BIO 定量生物 11

q-bio 定量生物 q-bio.BM 生物分子 q-bio.CB 细胞行为 q-bio.GN 基因组学 q-bio.MN 分子网络 q-bio.NC 神经认知 q-bio.OT 其他定量生物 q-bio.PE 种群进化 q-bio.QM 定量方法 q-bio.SC 亚细胞过程 q-bio.TO 组织器官

Q-FIN 定量金融 10

q-fin 定量金融 q-fin.CP 计算金融 q-fin.EC 经济学 q-fin.GN 一般金融 q-fin.MF 数学金融 q-fin.PM 投资组合 q-fin.PR 证券定价 q-fin.RM 风险管理 q-fin.ST 统计金融 q-fin.TR 交易微观结构

STAT 统计 7

stat 统计 stat.AP 统计应用 stat.CO 统计计算 stat.ME 统计方法 stat.ML 机器学习 stat.OT 其他统计 stat.TH 统计理论

2606.20453 2026-06-19 cs.CY cs.HC 新提交

Directors Duties in the Age of Agentic Artificial Intelligence

代理人工智能时代的董事职责

Deirdre Ahern

AI总结探讨董事在采纳代理AI时如何平衡股东与员工利益，分析四种公司治理模型，主张通过更广泛的法律视角促进员工福利。

Journal ref Cambridge Forum on AI: Law and Governance 2, e7 (2026)

详情

DOI: 10.1017/cfl.2026.10049

AI中文摘要

随着董事会采用包括代理AI在内的人工智能以提高运营效率，这为利润最大化提供了新机会。AI的采用越来越与员工角色替代相关联，在公司中，员工作为利益相关者的利益需要探讨。一个新颖的问题是，在AI崛起的时代，当AI在公司中的角色接近或超越人类员工时，AI是否应被赋予利益相关者地位。本文探讨了董事履行公司最佳利益职责时的四种公司目的模型：股东至上模型、开明股东价值模型、利益相关者友好模型和利益相关者价值模型，强调了董事在董事会围绕AI的决策中容纳员工利益的可用空间。结论是，鉴于董事在其最佳利益职责方面免受法律审查的程度，采取更广泛的法律视角来促进员工福利将有利于员工、董事和公司的利益。这将使董事与员工进行有意义的接触，并提供再培训机会以适应AI时代。

英文摘要

As boards engage with the adoption of Artificial Intelligence including agentic AI to drive operational efficiencies, this presents new opportunities for profit maximisation. AI adoption is increasingly identified with employee role displacement and in companies, and the interests of employees as stakeholders require exploration. A novel question posed is whether in an age of AI ascendancy AI may warrant being given stakeholder status as its role in the company approximates or eclipses that of human employees. The article probes four distinct models of corporate purpose within the duty on directors to act in the best interests of the company, the shareholder primacy model, the Enlightened Shareholder value model, the stakeholder friendly model, and the stakeholder value model, highlighting the available scope for directors to accommodate the interests of employees around AI adoption in decision-making by boards around AI. It is concluded that given the degree to which directors are insulated from legal scrutiny in relation to their best interests duty, adopting a wider law in context approach to promote employee welfare would serve the interests of employees, directors and companies alike. This would see directors engaging meaningfully with employees and providing opportunities for reskilling to adapt to the age of AI.

URL PDF HTML ☆

赞 0 踩 0

2606.20444 2026-06-19 cs.CR cs.SE 新提交

Image Encryption Algorithm Based on Convolutional Neural Networks and Dynamic S-Box Generation

基于卷积神经网络和动态S盒生成的图像加密算法

Ans Ibrahim, Fadhil Abbas Fadhil, Mahameed Reza Feizi Derakhshi, Maryam Mahdi Alhusseini, Nikolai Safiullin

AI总结提出一种结合CNN与经典密码学的动态图像加密方法，通过CNN学习特征生成自适应S盒，增强非线性、唯一性和输入依赖性，提高抗攻击能力。

2606.20414 2026-06-19 cs.AR 新提交

ExSpike: A General Full-Event Neuromorphic Architecture for Exploiting Irregular Sparsity with Event Compression

ExSpike: 一种利用事件压缩开发不规则稀疏性的通用全事件神经形态架构

Yuehai Chen, Farhad Merchant

AI总结提出ExSpike通用全事件神经形态架构，通过数据流优化实现纯事件驱动执行，并引入相邻位置事件压缩减少冗余累加，在FPGA上实现高能效SNN加速。

Comments Accepted by the 36th International Conference on Field-Programmable Logic and Applications (FPL 2026); 9 pages, 9 figures

详情

AI中文摘要

脉冲神经网络（SNN）因其稀疏的时空活动而有望实现节能计算。然而，有效将这种不规则稀疏性转化为实际的性能和能耗增益仍然具有挑战性，因为全事件计算架构尚未得到充分探索。本文提出ExSpike，一种通用的全事件神经形态架构，充分利用SNN中的不规则稀疏性。为了实现纯事件驱动执行，我们首先提出一组数据流优化，确保每个SNN层的输入保持基于脉冲，从而在整个网络中实现全事件执行。然后，我们设计了一种硬件高效的全事件架构，命名为ExSpike，它支持优化的纯事件驱动数据流以及用于脉冲驱动自注意力的额外注意力核心。为了进一步提高计算效率，我们引入了相邻位置事件压缩，以减少跨空间相邻脉冲序列的冗余累加。ExSpike在AMD Xilinx Virtex-7 FPGA上实现，并在分类和分割任务上进行了评估。实验结果表明，ExSpike在保持竞争性精度的同时，在多种SNN模型上实现了高归一化能效，最高可达479.15 GOPS、281.85 GOPS/W和0.80 GOPS/W/PE。特别是，ExSpike的PE归一化能效比最先进的基于FPGA的SNN加速器（FireFly-T）高出10倍。ExSpike的代码可在\url{this https URL}获取。

英文摘要

Spiking neural networks (SNNs) promise energy-efficient computing due to their sparse spatio-temporal activity. However, effectively translating such irregular sparsity into practical performance and energy gains remains challenging, as full-event computing architectures are still underexplored. This paper proposes ExSpike, a general full-event neuromorphic architecture that fully exploits irregular sparsity in SNNs. To realize pure event-driven execution, we first propose a set of dataflow optimizations to ensure that the inputs to each SNN layer remain spike-based, thereby enabling full-event execution throughout the network. We then design a hardware-efficient full-event architecture, named ExSpike, which supports the optimized pure event-driven dataflow and an additional Attention Core for spike-driven self-attention. To further improve computing efficiency, we introduce adjacent-position event compression to reduce redundant accumulations across spatially adjacent spike sequences. ExSpike is implemented on an AMD Xilinx Virtex-7 FPGA and evaluated on both classification and segmentation workloads. Experimental results show that ExSpike achieves high normalized energy efficiency across diverse SNN models while maintaining competitive accuracy, delivering up to 479.15 GOPS, 281.85 GOPS/W, and 0.80 GOPS/W/PE. In particular, ExSpike achieves up to 10$\times$ higher PE-normalized energy efficiency than the SOTA FPGA-based SNN accelerator (FireFly-T). The code for ExSpike is available at \url{https://github.com/xiaoyuehai/ExSpike}.

URL PDF HTML ☆

赞 0 踩 0

2606.20410 2026-06-19 cs.MS 新提交

MaRDI Open Interfaces for Interoperable Nonlinear Optimization

MaRDI 开放接口：实现可互操作的非线性优化

Dmitry I. Kabanov, Stephan Rave, Mario Ohlberger

AI总结提出MaRDI开放接口软件包，通过统一数值问题接口和自动数据编组，提升非线性优化中不同求解器和编程语言间的互操作性，减少代码修改和测试成本。

Comments 12 pages, 1 figure, 1 table, deRSE2026

2606.20401 2026-06-19 eess.SY cs.SY 新提交

PowerAgentBench-Dyn: A Benchmark for Agentic AI in Power System Dynamic Studies

PowerAgentBench-Dyn：电力系统动态研究中智能体AI的基准测试

Qian Zhang, Andrea Pomarico, Costas Mylonas, Magda Foti, Alberto Berizzi, Le Xie

AI总结提出PowerAgentBench-Dyn基准，用于评估基于LLM的智能体在电力系统动态分析任务中的能力，涵盖模型质量审查和安全风险筛选两个任务。

详情

AI中文摘要

基于大型语言模型（LLM）的智能体越来越多地被用于通过与软件工具交互、解释中间结果以及自主规划后续行动来自动化多步骤工程工作流。电力系统动态研究是这些智能体一个特别有前景但尚未充分探索的应用领域。与静态计算任务不同，动态研究通常需要更多时间进行模型参数校准、工程判断以及在受限动作空间下的决策。本文介绍了PowerAgentBench-Dyn，一个旨在评估智能体AI系统在电力系统动态分析任务上的基准测试。该基准针对那些不能简化为单一优化或编码任务的问题，而是需要经验丰富的电力系统工程师日常执行的那种推理、工具使用和迭代实验。所提出的框架包括两个初始基准任务。第一个是动态模型质量审查基准，评估智能体根据系统运营商指定的模型质量合规标准验证和诊断动态模型的能力。第二个是动态安全风险筛选基准，评估智能体利用语义记忆和有限的仿真预算从未见故障数据集中识别、排序和分析最关键短路事故，并提出和评估可能的缓解措施的能力。对于每个任务，我们定义了仿真环境、观测和动作空间以及评估指标。该基准在基于度量的意义上是可复现的：发布案例和仿真器设置定义了确定性评估器，而随机智能体行为通过重复运行使用成功率和其他指标进行评估。该基准支持未来用于电力系统运行和规划的智能体AI的开发。

英文摘要

Large Language Model (LLM)-based agents are increasingly being used to automate multi-step engineering work flows by interacting with software tools, interpreting intermediate results, and autonomously planning subsequent actions. Power system dynamic studies represent a particularly promising yet largely unexplored application domain for these agents. Unlike static computational tasks, dynamic studies often require more time on model parameter calibration, engineering judgment, and decision making under constrained action spaces. This paper introduces PowerAgentBench-Dyn, a benchmark designed to evaluate Agentic AI systems on power system dynamic-analysis tasks. The benchmark targets problems that cannot be reduced to a single optimization or coding task, but instead require a type of reasoning, tool usage, and iterative experimentation routinely performed by experienced power system engineers. The proposed framework includes two initial benchmark tasks. The first, the Dynamic Model Quality Review Benchmark, evaluates agents' ability to validate and diagnose dynamic models based on model-quality compliance criteria specified by system operators. The second, the Dynamic Security Risk Screening Benchmark, assesses agents' capability to leverage semantic memory and a limited simulation budget to identify, rank, and analyze the most critical short-circuit contingencies from an unseen fault dataset, as well as propose and evaluate possible mitigation measures. For each task, we define the simulation environment, observation and action spaces, and evaluation metrics. The benchmark is reproducible in a metric-based sense: released cases and simulator settings define a deterministic evaluator, while stochastic agent behavior is assessed over repeated runs using success rates and other metrics. The benchmark supports the development of future Agentic AI for power system operation and planning.

URL PDF HTML ☆

赞 0 踩 0

2606.20399 2026-06-19 cs.CC 新提交

Linked Fates: How Small of an Ambiguity Increase Can Make the Difference Between Equaling and Separating from P?

关联的命运：歧义增加多小才能区分P与等于P？

Benjamin Carleton, Michael C. Chavrimootoo, Lane A. Hemaspaandra, David E. Narváez, Conor Taliancich, Melissa Welsh

AI总结研究NP的歧义有界版本UP_{≤f(n)}是否与P相等，通过路径毒化和填充技术，证明了某些歧义范围下P=UP_{≤f1(n)}蕴含P=UP_{≤f2(n)}，并给出了其他情况下不成立的相对化结果。

详情

AI中文摘要

NP的歧义有界版本，记为$\mathrm{UP}_{\leq f(n)}$，通过$f(n)$限制非确定性多项式时间图灵机在长度为$n$的输入上接受路径的数量。这些类别从Valiant的完全无歧义（$f(n)=1$）类$\mathrm{UP}$到$\mathrm{NP}$本身，其中没有界限或等价地有指数界限（$f(n) = 2^{n^{O(1)}}$）。本文旨在理解这些类别中哪些在是否等于确定性多项式时间的问题上共存亡。通俗地说，哪些歧义范围具有关联的命运？即，对于满足$(\forall n)[f_1(n) \leq f_2(n)]$的非递减函数对$(f_1,f_2)$，何时有$\mathrm{P} = \mathrm{UP}_{\leq f_1(n)} \implies \mathrm{P} = \mathrm{UP}_{\leq f_2(n)}$。更具体地，哪些对鲁棒地成立，即在现实世界和所有相对化世界中成立？哪些对不鲁棒地成立，即存在一个谕示$A$使得$\mathrm{P}^A = \mathrm{UP}_{\leq f_1(n)}^A \subsetneq \mathrm{UP}_{\leq f_2(n)}^A$？先前唯一已知的正面结果是Watanabe 1988年的结果：$\mathrm{P} = \mathrm{UP}_{\leq 1} \implies (\forall k \geq 1)[\mathrm{P} = \mathrm{UP}_{\leq k}]$，该结果甚至鲁棒地成立。他的结果虽然优美，但仅适用于常数有界歧义。作为我们的正面结果，我们提出了一个适用于更高歧义水平的新情况类（定理3.8），且甚至鲁棒地适用。为了给出我们的情况类，我们利用了两种方法：一种新颖的路径毒化方法，即使在超常数歧义上也有效（定理3.5），以及填充技术的新应用（定理3.3/3.4）。作为负面结果，我们表明在几乎所有其他情况下，没有关联鲁棒地成立。

英文摘要

Ambiguity-bounded versions of $\mathrm{NP}$, denoted $\mathrm{UP}_{\leq f(n)}$, bound by $f(n)$ the number of accepting paths the nondeterministic polynomial-time Turing machine can have on inputs of length $n$. Such classes range from Valiant's completely unambiguous ($f(n)=1$) class $\mathrm{UP}$ to $\mathrm{NP}$ itself, where there is no bound or, equivalently, there is the toothless exponential bound ($f(n) = 2^{n^{O(1)}}$). This paper seeks to understand which of these classes stand and fall together as to whether they equal deterministic polynomial time. Informally put, what ranges of ambiguities have linked fates? That is, for which pairs of nondecreasing functions, $(f_1 ,f_2)$, satisfying $(\forall n)[f_1(n) \leq f_2(n)]$, does it hold that $\mathrm{P} = \mathrm{UP}_{\leq f_1(n)} \implies \mathrm{P} = \mathrm{UP}_{\leq f_2(n)}$. More particularly, for which pairs does that hold robustly, i.e., it holds in the real world and every relativized world? And for which pairs does that implication fail to hold robustly, i.e., there is an oracle $A$ such that $\mathrm{P}^A = \mathrm{UP}_{\leq f_1(n)}^A \subsetneq \mathrm{UP}_{\leq f_2(n)}^A$? The only previously known positive result is Watanabe's 1988 result that $ \mathrm{P} = \mathrm{UP}_{\leq 1} \implies (\forall k \geq 1)[\mathrm{P} = \mathrm{UP}_{\leq k}]$, which even holds robustly. His result, though lovely, applies only to constant-bounded ambiguities. As our positive result, we present a new class of cases (Theorem 3.8) that apply (and even robustly apply) at greater ambiguity levels. To give our class of cases, we leverage two approaches: a novel path-poisoning approach that works even on superconstant ambiguities (Theorem 3.5) and a new application of the power of padding (Theorems 3.3/3.4). As negative results, we show that for essentially all other cases, no linkage holds robustly.

URL PDF HTML ☆

赞 0 踩 0

2606.20375 2026-06-19 cs.HC cs.CY 新提交

Organizing in the Digital Age: Understanding Community, Challenges, and Consequences in Digitally-facilitated Labor Organizing

数字时代的组织：理解数字辅助劳工组织中的社区、挑战与后果

Frederick Reiber, Alishah Chator, Dana Calacci, Allison McDonald

AI总结本研究通过17次定性访谈，分析劳工组织如何使用Discord、WhatsApp和Slack等数字平台进行组织，揭示了技术安全、信息过载和信任建立等挑战与机遇。

Comments To appear in CSCW 2026

2606.20374 2026-06-19 cs.DC 新提交

ARGUS: Production-Scale Tracing and Performance Diagnosis for over 10,000-GPU Clusters

ARGUS：面向超过10,000 GPU集群的生产级追踪与性能诊断

Jiasheng Zhou, Longbin Zeng, Clavis Chen, Ruiming Lu, Qinwei Yang, Leyi Ye, Ray Ying, Key Zhang

AI总结提出低开销、细粒度的始终在线追踪与实时分析系统ARGUS，通过分解训练调用层次、统一数据管道和渐进式诊断框架，在超过10,000 GPU集群上实现<2%开销的持续故障检测与性能优化。

详情

AI中文摘要

大规模LLM训练需要始终在线、细粒度的可观测性以实现有效的规模性能诊断。粗粒度的资源监控器无法定位根本原因，而细粒度的分析器会产生高昂（5%-30%）的开销和海量追踪数据，使得在大型生产集群中始终在线部署不切实际。我们提出ARGUS，一个面向10,000+ GPU规模生产集群中训练工作负载的低开销、细粒度、始终在线的追踪与实时分析系统。ARGUS将沿训练调用层次的观测分解为CPU调用栈、框架语义和GPU内核执行，始终在线收集的总开销低于2%。它构建统一数据管道，将原始内核事件压缩约3,700倍，从每个rank每步10 MB降至2.7 KB。其渐进式诊断框架通过迭代时间、阶段级和内核级分析自动隔离异常窗口、落后rank和性能下降的内核。在超过10,000 GPU的生产集群上部署超过六个月，ARGUS持续支持故障慢速检测和性能优化。我们的案例研究进一步展示了其在代表性异常中的有效性，包括计算落后、链路退化、流水线气泡放大、FlashAttention JIT停滞以及被通信症状掩盖的计算落后。

英文摘要

Large-scale LLM training requires always-on, fine-grained observability for effective performance diagnosis at scale. Coarse resource monitors alone cannot localize root causes, and fine-grained profilers incur prohibitive (5%-30%) overheads and massive trace volumes, making always-on deployment impractical in large production clusters. We propose ARGUS, a low-overhead, fine-grained, always-on tracing and real-time analysis system for training workloads in 10,000+ GPU-scale production clusters. ARGUS decomposes observation along the training call hierarchy into CPU call stacks, framework semantics, and GPU kernel execution, with always-on collection under a combined overhead of less than 2%. It builds a unified data pipeline and compresses raw kernel events by approximately 3,700x from 10 MB to 2.7 KB per rank per step. Its progressive diagnosis framework automatically isolates anomalous windows, straggler ranks, and degraded kernels through iteration-time, phase-level, and kernel-level analysis. Deployed for over six months on a 10,000+ GPU production cluster, ARGUS has supported continuous fail-slow detection and performance optimization. Our case studies further demonstrate its effectiveness across representative anomalies, including compute stragglers, link degradation, pipeline-bubble amplification, FlashAttention JIT stalls, and compute stragglers masked by communication symptoms.

URL PDF HTML ☆

赞 0 踩 0

2606.20361 2026-06-19 eess.SY cs.SY 新提交

Sparse add-on controller design: A Youla approach to system-level performance

稀疏附加控制器设计：一种面向系统级性能的Youla方法

M. van der Hulst, N. Dirkx, R. A. González, K. Tiels, J. van de Wijdeven, T. oomen

AI总结提出一种基于Youla参数化的稀疏附加控制器设计框架，通过凸优化求解稀疏H2综合问题，实现系统级性能与互联复杂度的最优权衡。

2606.20351 2026-06-19 cs.LO cs.PL 新提交

A cubical formalisation of conditional independence, Bayesian conditioning, and Pearl's d-separation soundness

条件独立性、贝叶斯条件化和Pearl的d-分离正确性的立方形式化

Karen Sargsyan

AI总结本文在Cubical Agda中形式化概率单子，提出一种高阶归纳类型，通过引入贝叶斯公式修正标准凸代数交换公理的不足，并验证了半图oid公理、do-演算规则和d-分离定理。

详情

AI中文摘要

自Stone以来概率单子形式化中常见的标准凸代数交换公理被证明不足以支持完整的贝叶斯条件化。我们在Cubical Agda中精确地说明了这一点：有限分布作为高阶归纳类型，条件独立性作为核之间的立方路径，递归贝叶斯条件化作为全支撑片段上的全函数。将条件化提升到完整的HIT暴露了一个结构上的不匹配——重新排列的四叶混合的两半携带由贝叶斯公式关联的不同贝叶斯权重，而不是标准公理提供的单个共享内部权重。我们展示了解决这一问题的最小推广，并证明了标准形式是退化情况，其中两个内部权重重合。基于这一观察，我们在抽象有序域接口之上以构造性方式验证了代数上下文，无需任何公设：绑定交换性、四个半图oid公理、交（通过结构性Σ-见证简化为收缩，无需正性）、Pearl的do-演算规则1、2和3的核形式、有限类型贝叶斯条件化，以及任意n顶点有限有向无环图（DAG）上干预和贝叶斯形式的Pearl的d-分离定理（正确性）。概率单子也被验证为马尔可夫范畴；抽象接口在Q处实现。

英文摘要

The standard convex-algebra interchange axiom, common to probability-monad formalisations since Stone, is provably too weak to support full Bayesian conditioning. We make this precise in Cubical Agda: finite distributions as a higher inductive type, conditional independence as a cubical path between kernels, recursive Bayesian conditioning as a total function on a full-support fragment. Lifting conditioning to the full HIT exposes a structural mismatch -- the two halves of the rearranged 4-leaf mix carry distinct Bayesian weights related by Bayes' formula, not the single shared inner weight the standard axiom provides. We exhibit the minimal generalisation that resolves this and prove the standard form is the degenerate case where the two inner weights coincide. Around this observation we verify the algebraic context constructively, with zero postulates above an abstract ordered-field interface: bind commutativity, the four semi-graphoid axioms, intersection (reduced to contraction via structural $Σ$-witnesses, without positivity), Pearl's do-calculus Rules~1, 2, and~3 in kernel form, finite-type Bayesian conditioning, and Pearl's d-separation theorem (soundness) on arbitrary $n$-vertex finite directed acyclic graphs (DAGs) in both interventional and Bayesian forms. The probability monad is also verified as a Markov category; the abstract interface discharges at $Q$.

URL PDF HTML ☆

赞 0 踩 0

2606.20331 2026-06-19 cs.DS cs.CC 新提交

Computing Twin-Width via Treedepth and Vertex Integrity

通过树深度和顶点完整性计算双宽度

Robert Ganian, Mathis Rocton

AI总结本文证明，当参数化为树深度时，近似双宽度是固定参数可解的；当参数化为顶点完整性时，精确计算双宽度是固定参数可解的，首次为非平凡参数化算法提供最优收缩序列。

Comments A short version of this preprint appeared at STACS 2026

详情

DOI: 10.4230/LIPIcs.STACS.2026.42

AI中文摘要

双宽度是一个图参数，已成为解释一阶模型检验在许多图类上固定参数可解性的核心。尽管其算法重要性，计算双宽度仍然知之甚少：甚至识别双宽度至多为4的图是NP难的，并且没有已知的以双宽度本身为参数的固定参数近似。最近突破这一障碍的方法侧重于首先开发以不同于双宽度的参数化来计算或近似双宽度的固定参数算法。我们的第一个结果表明，当以树深度为参数时，近似双宽度是固定参数可解的，从而打破了所有先前可处理的参数化都基于删除距离的长期障碍。证明通过有向双宽度进行，首次提供了该变体可能在算法上更易处理的构造性证据。作为第二个主要结果，我们表明，以顶点完整性为参数时，精确计算双宽度是固定参数可解的。这构成了计算最优收缩序列的第一个非平凡参数化算法。

英文摘要

Twin-width is a graph parameter that has become central to explaining the fixed-parameter tractability of first-order model checking across many graph classes. Despite its algorithmic importance, computing twin-width remains poorly understood: even recognizing graphs of twin-width at most four is NP-hard, and no fixed-parameter approximations parameterized by twin-width itself are known. A recent approach towards breaking this barrier focuses on first developing fixed-parameter algorithms for computing or approximating twin-width under parameterizations distinct from twin-width. Our first result establishes that approximating twin-width is fixed-parameter tractable when parameterized by treedepth, thereby breaking the long-standing barrier that all previous tractable parameterizations were based on deletion distance. The proof proceeds via oriented twin-width, yielding the first constructive evidence that this variant may be easier to handle algorithmically. As our second main result, we show that computing twin-width exactly is fixed-parameter tractable with respect to vertex integrity. This constitutes the first non-trivial parameterized algorithm for computing optimal contraction sequences.

URL PDF HTML ☆

赞 0 踩 0

2606.20318 2026-06-19 cs.DB 新提交

AgenticDB: Agentic Performance Reconfiguration for Database Workloads

AgenticDB: 面向数据库工作负载的代理式性能重配置

Xinyue Yang, Chaozheng Wang, Chen Zheng, Heng Zhang, Yanjun Wu

AI总结提出AgenticDB框架，通过运行时交互实现数据库系统级和操作系统级重配置，诊断瓶颈并积累经验，在MySQL和PostgreSQL上平均性能提升118.1%。

详情

AI中文摘要

数据库配置调优对工作负载性能至关重要，但在实际部署中进行实用调优仍然困难。现有的自动调优器大多将调优视为对DBMS旋钮值的迭代搜索。这种形式导致执行成本高，过早缩小配置空间，并且未能充分解决实际需求：从系统反馈中诊断运行时瓶颈，探索操作系统级重配置机会，稳健地执行更改，以及从先前的试验和任务中学习。我们提出AgenticDB，一个用于数据库工作负载重配置的代理式框架。AgenticDB实现了一个上下文驱动的工具，通过与目标数据库环境交互，提出DBMS级和操作系统级更改，在安全约束下应用它们，观察工作负载性能和运行时状态，并使用执行反馈来指导后续决策。这种运行时交互使AgenticDB能够诊断瓶颈，探索更广泛的DBMS和操作系统级重配置空间，避免不安全或不支持的操作，并在重配置任务内部和之间积累经验。因此，AgenticDB将数据库调优转变为一种自我改进的重配置过程，其中运行时反馈迭代地改进后续决策。我们在MySQL和PostgreSQL上使用YCSB、Sysbench和TPC-H工作负载进行了广泛实验。结果表明，AgenticDB在所有评估的工作负载上实现了最佳最终性能，平均比最强基线提高118.1%，并将总到达最佳时间减少22.6%。结果还表明，其操作系统级动作空间、稳健的执行生命周期和增强记忆的规划有助于实现更有效和实用的数据库重配置。

英文摘要

Database configuration tuning is critical for workload performance, but practical tuning on real deployments remains difficult. Existing automatic tuners mostly formulate tuning as iterative search over DBMS knob values. This formulation leads to high execution cost, prematurely narrows the configuration space, and leaves practical requirements insufficiently addressed: diagnosing runtime bottlenecks from system feedback, exploring OS-level reconfiguration opportunities, executing changes robustly, and learning from previous trials and tasks. We propose AgenticDB, an agentic framework for database workload reconfiguration. AgenticDB implements a context-grounded harness that interacts with the target database environment by proposing DBMS- and OS-level changes, applying them under safety constraints, observing workload performance and runtime states, and using execution feedback to guide subsequent decisions. This runtime interaction enables AgenticDB to diagnose bottlenecks, explore a broader DBMS- and OS-level reconfiguration space, avoid unsafe or unsupported actions, and accumulate experience within and across reconfiguration tasks. As a result, AgenticDB turns database tuning into a self-refining reconfiguration process in which runtime feedback iteratively improves later decisions. We conduct extensive experiments on MySQL and PostgreSQL using YCSB, Sysbench, and TPC-H workloads. The results show that AgenticDB achieves the best final performance on all evaluated workloads, improving over the strongest baseline by 118.1% on average and reducing aggregate time-to-best by 22.6%. The results also demonstrate that its OS-level action space, robust execution lifecycle, and memory-enhanced planning contribute to more effective and practical database reconfiguration.

URL PDF HTML ☆

赞 0 踩 0

2606.20301 2026-06-19 eess.SY cs.SY 新提交

Data-Driven Control from Poisoned Data: Fundamental Limitations and Secure DeePC

来自中毒数据的数据驱动控制：基本局限性与安全DeePC

Takumi Shinohara, Henrik Sandberg, Karl Henrik Johansson

AI总结针对任意数据中毒攻击，提出安全DeePC算法，通过截断输出和在线重建实现有限时间内的MPC等效性能。

详情

AI中文摘要

我们研究了存在任意数据中毒攻击时的数据驱动控制问题。假设一部分离线输出数据存储在未受保护的位置，可能被对手篡改。我们首先建立了由这种中毒数据引起的数据驱动控制的基本局限性：仅从数据集无法检测/识别中毒攻击；未受保护的数据对于具有最坏情况保证的控制器设计是非信息性的；未受保护输出的硬约束是不可认证的。受这些局限性和数据使能预测控制（DeePC）技术的启发，我们提出了安全DeePC，一种能够抵御中毒攻击的数据驱动控制算法。它首先仅使用受保护数据集运行输出截断的DeePC，直到在线输入变得持续激励。然后利用在线测量重建部分离线数据集，最后返回到全输出DeePC。安全DeePC在特定条件下几乎必然在有限时间内实现MPC等效性能。仿真结果证明了所提框架对抗中毒攻击的有效性。

英文摘要

We study a data-driven control problem in the presence of arbitrary data poisoning attacks. We assume that a subset of offline output data is stored in unprotected locations and may be poisoned by an adversary. We first establish fundamental limitations for data-driven control arising from such poisoned data: poisoning attacks are not detected/identified from the dataset alone; unprotected data are non-informative for controller design with worst-case guarantees; and hard constraints on unprotected outputs are not certifiable. Motivated by these limitations and the data-enabled predictive control (DeePC) technique, we propose Secure DeePC, a data-driven control algorithm that is resilient against poisoning attacks. It first runs output-truncated DeePC using only the protected dataset until the online input becomes persistently exciting. It then uses online measurements to reconstruct the partial offline dataset, and finally returns to full-output DeePC. Secure DeePC achieves MPC-equivalent performance in finite time almost surely under certain conditions. Simulation results illustrate the efficacy of the proposed framework against poisoning attacks.

URL PDF HTML ☆

赞 0 踩 0

2606.20254 2026-06-19 cs.CR 新提交

Quantization as a Malicious Task: Removing Quantization-Conditioned Backdoors via Task Arithmetic

量化作为恶意任务：通过任务算术移除量化条件后门

Kaihsun Yang, Min-Yan Tsai, Chia-Mu Yu

AI总结提出QVec方法，通过将量化引起的权重变化视为恶意任务向量，在部署前进行参数校正，无需重训练或触发样本即可防御量化条件后门。

详情

AI中文摘要

模型量化被广泛采用，以在资源受限设备上部署深度神经网络时减少内存使用和推理成本。然而，最近的研究揭示了一种新的安全威胁，称为量化条件后门（QCBs），其中模型在全精度下行为正常，但仅在量化后激活恶意行为。现有的防御通常修改量化过程或校正激活统计，往往引入额外的计算开销或依赖特定的量化设置。在这里，我们提出QVec，一种从参数空间角度防御QCBs的方法。我们观察到，全精度模型与其量化版本之间的权重差异编码了一种结构化的行为偏移，可以解释为恶意任务向量，而非随机量化噪声。基于这一见解，QVec通过在部署前进行受控的参数校正来抵消这一恶意方向。QVec无需重新训练，无需触发样本，仅需一次量化传递来估计参数偏移，以及轻量级的超参数搜索。在图像分类基准和多个大型语言模型（LLM）攻击场景中的大量实验表明，QVec在保持干净性能的同时，持续抑制后门激活。

英文摘要

Model quantization is widely adopted to reduce memory usage and inference cost when deploying deep neural networks on resource-constrained devices. However, recent studies have revealed a new security threat known as Quantization-Conditioned Backdoors (QCBs), where a model behaves normally in full precision but activates malicious behavior only after quantization. Existing defenses typically modify quantization procedures or correct activation statistics, often introducing additional computational overhead or relying on specific quantization settings. Here, we present QVec, a parameter-space perspective for defending against QCBs. We observe that the weight difference between a full-precision model and its quantized counterpart encodes a structured behavioral shift, which can be interpreted as a malicious task vector rather than random quantization noise. Based on this insight, QVec counteracts this malicious direction through controlled parameter correction prior to deployment. QVec requires no retraining, no trigger samples, and only a single quantization pass to estimate the parameter shift, together with a lightweight hyperparameter search. Extensive experiments across image classification benchmarks and multiple Large Language Model (LLM) attack scenarios demonstrate that QVec consistently suppresses backdoor activation while preserving clean performance.

URL PDF HTML ☆

赞 0 踩 0

2606.20251 2026-06-19 cs.CR 新提交

TrustMix: How to Mix Messages in a Mobile Ad-hoc Network

TrustMix：如何在移动自组织网络中混合消息

Yu Shen, Aiswarya Walter, Stefanie Roos

AI总结提出TrustMix协议，通过分组转发和洗牌实现无中心信任的匿名通信，利用可链接环签名限制速率，在随机预言机模型下证明安全性，仿真和Android实现验证了匿名性和吞吐量。

Comments Accepted at ICDCS 2026, 11 pages

详情

AI中文摘要

混合网络是实现匿名性、防御各种流量分析攻击的高效方法。然而，混合网络通常为基础设施网络设计，无法直接应用于移动自组织网络（MANET）。现有的少数MANET解决方案需要预先了解拓扑结构或依赖可信中心方。本文提出TrustMix，一种无需任何中心可信方的MANET混合协议。在TrustMix中，各方加入组，消息通过多个组转发以提供匿名性。用户只需在附近找到一个他们认为可信的方，然后将消息转发到该方的组，该方在转发到其他组之前对消息进行洗牌，从而无法将原始消息与转发消息关联。此外，即使所选方是敌对的，只有当其组内所有方都是敌对的时，他们才能破坏匿名性，因为所有方都参与洗牌。除了匿名性，TrustMix还通过可链接环签名对消息数量实施速率限制，从而能够在不泄露身份的情况下检测到各方发送超过允许数量的消息。我们在随机预言机模型下证明了协议的安全性。我们使用现有的混合网络模拟器评估其匿名性，表明TrustMix显著提高了消息匿名性。最后，我们展示了基于Android的概念验证实现，并表明TrustMix在5个移动设备上实现了可接受的吞吐量。

英文摘要

Mix networks are a highly effective way to achieve anonymity, defending against a wide range of traffic-analysis attacks. However, mix networks are usually designed for infrastructure networks and cannot be directly applied in the context of mobile ad hoc networks (MANETs). The few existing solutions for MANETs require advance knowledge of the topology or a trusted central party. In this paper, we present TrustMix, a mix protocol for MANETs that operates without any central trusted party. In TrustMix, parties join groups and then messages are forwarded via multiple groups to provide anonymity. With TrustMix, users only need to find a party nearby that they consider trusted. They then forward the message to this party's group, and the party shuffles messages before forwarding to other groups, meaning that the original message and the forwarded message cannot be linked. Furthermore, even if the chosen party is adversarial, they can only break the anonymity if all parties in their group are adversarial as all of them contribute to the shuffling. In addition to anonymity, TrustMix also enforces rate limits on the number of messages through the use of linkable ring signatures, which allows detecting that parties send more messages that allowed without revealing identities. We prove the security of our protocol in the random oracle model. We evaluate its anonymity using an existing mix-network simulator and show that TrustMix significantly improves message anonymity. Finally, we present a proof-of-concept Android implementation and show that TrustMix achieves acceptable throughput with 5 mobile devices.

URL PDF HTML ☆

赞 0 踩 0

2606.20243 2026-06-19 cs.SE cs.MA 新提交

Phoenix: Safe GitHub Issue Resolution via Multi-Agent LLMs

Phoenix: 通过多智能体LLM实现安全的GitHub问题解决

Kipngeno Koech, Muhammad Adam, Baimam Boukar Jean Jacques, Joao Barros

AI总结提出多智能体LLM系统Phoenix，通过六个专业智能体和七层安全控制，在SWE-bench Lite子集上达到75%的解决率，并在真实问题中保持100%正确性。

详情

AI中文摘要

我们提出Phoenix，一个多智能体LLM系统，能够从分类到拉取请求创建解决GitHub问题，结合了七层安全控制与基线感知测试评估策略。Phoenix将工作分解给六个专业智能体：规划器、复现器、编码器、测试器、故障分析器和拉取请求（PR）智能体，所有智能体由基于标签的GitHub webhook状态机协调。在打开拉取请求之前，每次更改都会与基线测试运行进行对比。在SWE-bench Lite的24个实例子集上，在生产webhook路径上运行，Phoenix oracle解决了75%的实例，且成功运行中没有出现通过到通过的回归；这个精心挑选的子集不能直接与完整分割排行榜结果比较，我们讨论了比较的局限性。在14个仓库的42个真实问题上的补充试点实现了100%的正确性保持（CP；硬级别平均122秒）。人工检查显示，大约一半的拉取请求是定位良好的修复。另一半将代码放置在错误路径上，这是规划器定位的局限性，我们正在通过检索来解决。我们还报告了部署失败模式（WAF过滤、令牌过期、权限边界、不稳定的CI），这些模式促使了每种安全机制的引入。

英文摘要

We present Phoenix, a multi-agent LLM system that resolves GitHub issues from triage through pull-request creation, combining seven layered safety controls with a baseline-aware test evaluation strategy. Phoenix decomposes the work across six specialized agents. Planner, reproducer, coder, tester, failure analyst and Pull Request (PR) agent, all coordinated by a label-based GitHub webhook state machine. Every change is checked against a baseline test run before a pull request is opened. On a 24-instance slice of SWE-bench Lite. run on the production webhook path, Phoenix oracle-resolves 75% of instances with no pass-to-pass regressions on successful runs; this curated slice is not directly comparable to full-split leaderboard results, and we discuss the limits of the comparison. A complementary pilot on 42 real issues across 14 repositories yields 100% correctness preservation (CP; mean 122s on the hard tier). Manual inspection shows that about half of the resulting pull requests are well-targeted fixes. The other half place code at incorrect paths, a planner localization limitation we are addressing with retrieval. We also report the deployment failure modes (WAF filtering, token expiry, permission boundaries, flaky CI) that motivated each safety mechanism.

URL PDF HTML ☆

赞 0 踩 0

2606.20230 2026-06-19 cs.SE 新提交

SysML Modeling of Digital Twins for Renewable Energy Communities

可再生能源社区数字孪生的SysML建模

Mohammad Samadi, Luís Miguel Pinho, Andrey Sadovykh, Gabriela Lucas

AI总结针对可再生能源社区数字孪生工程中的异构性挑战，提出基于SysML的MBSE工作流，通过设备分类和社区组织视图建模，并引入SAREF4ENER本体弥补语义鸿沟。

Comments Presented at the Workshop on Digital Twin Experiences and Model-Based Testing Methods, 12 June 2026, Västerås, Sweden, co-located with the 30th Ada-Europe International Conference on Reliable Software Technologies (AEiC 2026)

详情

AI中文摘要

可再生能源社区（REC）正成为本地和全球共享可再生能源发电、存储和灵活负载的关键组织模型。由于涉及设备、合同和运行时数据的异构性，REC数字孪生的工程变得困难。在本文中，我们朝着REC数字孪生的基于模型的系统工程（MBSE）工作流迈出了第一步。从经过工业验证的REC领域模型出发，我们使用开源Modelio工具在SysML中重新表达了一个代表性的房屋子集，生成了两个块定义图——一个设备分类和一个社区组织视图。然后，我们讨论了普通SysML留下的四个语义鸿沟，并概述了如何将SAREF4ENER本体作为参考包导入以弥合这些鸿沟。将SysML与基于SAREF的智能能源数字孪生语义相结合在很大程度上仍未探索，我们将本文定位为沿着这条线的第一步。

英文摘要

Renewable Energy Communities (RECs) are emerging as a key organizational model for local and global sharing of renewable generation, storage, and flexible loads. Engineering Digital Twins of RECs is made difficult by the heterogeneity of devices, contracts, and runtime data involved. In this paper, we take a first step toward a Model-Based Systems Engineering (MBSE) workflow for REC's Digital Twins. Starting from an industrially-validated REC domain model, we re-express a representative house subset in SysML using the open-source Modelio tool, yielding two Block Definition Diagrams - a device taxonomy and a community organizational view. We then discuss four semantic gaps that plain SysML leaves open and sketch how the SAREF4ENER ontology could be imported as a reference package to close them. Combining SysML with SAREF-based semantics for smart-energy Digital Twins remains largely unexplored, and we position this paper as a first step along that line.

URL PDF HTML ☆

赞 0 踩 0

2606.20215 2026-06-19 cs.CR 新提交

GNSS Spoofing Threat for V2X communications

GNSS欺骗对V2X通信的威胁

Adolfo P. Jimenez, Juan Arquero-Gallego, Mario P. Luna, Jose E. Naranjo, Felipe Jimenez Alonso

AI总结本文提出利用廉价软件定义无线电（SDR）对V2X通信实施GNSS欺骗攻击的方法，并在真实设备上验证了攻击效果，揭示了V2X通信易受欺骗且难以检测的安全漏洞。

Comments 2026 IEEE\@. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

详情

AI中文摘要

全球导航卫星系统（GNSS）是车联网（V2X）领域提供关键定位、导航和授时（PNT）服务的核心技术，对于生成维护网络可靠性和车辆安全性的协作感知消息（CAM）不可或缺。然而，GNSS信号极易受到欺骗攻击，这是一种高级攻击，攻击者发送模拟合法卫星特征的精心构造信号，误导接收器计算出错误位置。本文提出了一种使用廉价软件定义无线电（SDR）进行物理欺骗的方法，描述了一个坐标生成流水线，该流水线采用基于Haversine的距离计算、时间离散化以模拟恒定速度，以及线性插值来生成高保真GPS基带信号。所提出的攻击在真实的Commsignia车载单元（OBU）和路侧单元（RSU）设备上，使用HackRF One在三种场景下进行了实验验证，这些场景模拟了90 km/h、145 km/h和200 km/h稳定速度下的合成轨迹。本文最重要的贡献是证明了V2X通信并不安全，因为它们容易受到GNSS欺骗攻击，导致服务降级而未被检测到。

英文摘要

Global Navigation Satellite Systems (GNSS) constitute a core technology for delivering crucial positioning, navigation, and timing (PNT) services in the Vehicle-to-Everything (V2X) domain, where they are indispensable for generating Cooperative Awareness Messages (CAM) that uphold network reliability and vehicular safety. Yet, GNSS signals are acutely exposed to spoofing, an advanced attack in which an adversary transmits crafted signals that replicate legitimate satellite characteristics, misleading the receiver into computing a false position. This work presents a methodology for conducting physical spoofing with inexpensive Software Defined Radio (SDR), describing a coordinate generation pipeline that employs Haversine-based distance calculations, temporal discretization to emulate constant velocity, and linear interpolation to produce high-fidelity GPS baseband signals. The proposed attack is experimentally validated on real Commsignia OnBoard Unit (OBU) and RoadSide Unit (RSU) devices using a HackRF One across three scenarios that emulate synthetic trajectories at steady speeds of 90 km/h, 145 km/h, and 200 km/h. The most significant contribution of this paper is the demonstration that V2X communications are not secured, as they are susceptible to GNSS spoofing attacks, which cause service degradation without being detected.

URL PDF HTML ☆

赞 0 踩 0

2606.20214 2026-06-19 cs.CR 新提交

Accelerating Trust Convergence in IIoT: A ML Approach for Dynamic Network Conditions

加速工业物联网中的信任收敛：一种针对动态网络条件的机器学习方法

Aymen Bouferroum, Valeria Loscri, Abderrahim Benslimane

AI总结针对工业物联网中网络质量波动导致信任收敛慢的问题，提出基于机器学习的信任收敛加速方法，通过预测收敛时间并动态调整转移概率，在挑战性条件下将收敛时间减少28.6%，并提升恶意节点场景下的评估准确性。

Comments Symposium: Communication \& Information Systems Security (CISS)

Journal ref IEEE Global Communications Conference (GLOBECOM) 2025, Dec 2025, Taipei, Taiwan. pp.4427-4432

详情

AI中文摘要

在工业物联网（IIoT）环境中，信任管理在保障系统安全方面起着至关重要的作用，尤其是在处理资源受限设备时。传统的信任模型往往忽视了网络质量波动的影响，导致信任收敛速度慢且评估不准确。在本文中，我们提出了一种动态信任管理解决方案，称为信任收敛加速（TCA）方法，该方法集成了机器学习（ML）以在恶劣网络条件下加速信任收敛。我们的模型基于关键网络指标预测信任收敛所需的时间单位，并动态调整信任模型中的转移概率以提高收敛速度。通过使用基于IEEE 802.11标准的模拟框架，该框架包含真实的Wi-Fi信道条件，我们展示了基于TCA方法的有效性，在挑战性条件下实现了高达28.6%的信任收敛时间减少。此外，所提出的解决方案在涉及恶意节点的场景中表现出韧性，提高了信任评估的准确性。这项工作为动态工业环境中的IIoT系统提供了一个可扩展且自适应的信任框架，确保了在不同网络条件下的稳健性能。

英文摘要

In Industrial Internet of Things (IIoT) environments, trust management plays a vital role in securing systems, especially when dealing with resource-constrained devices. Traditional trust models often overlook the impact of fluctuating network quality, leading to slower trust convergence and inaccurate assessments. In this paper, we propose a dynamic trust management solution, known as the Trust Convergence Acceleration (TCA) approach, which integrates Machine Learning (ML) to accelerate trust convergence under poor network conditions. Our model predicts the number of time units needed for trust convergence based on key network metrics and dynamically adapts transition probabilities in the trust model to enhance convergence speed. Using a simulation framework that incorporates realistic Wi-Fi channel conditions based on the IEEE 802.11 standard, we demonstrate the effectiveness of the TCA-based approach, achieving up to a 28.6% reduction in trust convergence time under challenging conditions. Furthermore, the proposed solution exhibits resilience in scenarios involving malicious nodes, improving trust evaluation accuracy. This work provides a scalable and adaptive trust framework for IIoT systems in dynamic industrial environments, ensuring robust performance under varying network conditions.

URL PDF HTML ☆

赞 0 踩 0

2606.20202 2026-06-19 cs.DS 新提交

Tight Algorithm and Hardness for Submodular Linear Ordering

子模线性排序的紧致算法与难度

Evan Abboud, Roy Schwartz

AI总结针对一般子模函数的最小线性排序问题，提出多项式时间O(√(n/ln n))近似算法，并证明信息论下界匹配，任何多项式时间算法无法达到o(√(n/ln n))近似比。

Comments 25 pages. Accepted to the 53rd International Colloquium on Automata, Languages, and Programming (ICALP 2026)

详情

AI中文摘要

我们考虑最小线性排序问题：给定基数为$n$的集合$N$和非负集函数$f\colon 2^N\rightarrow \mathbb{R}_{\geq 0}$，目标是找到$N$的一个排列$\pi$，使得$\pi$的所有前缀上$f$值的和最小。该问题已被研究用于各种集函数类，其中子模$f$的情况特别受关注，因为它涵盖了经典问题，包括最小线性排列和最小包含区间图。在这项工作中，我们通过建立匹配的上界和下界，解决了一般子模$f$的最小线性排序问题的近似性，并给出：$(1)$一个多项式时间算法，实现$O(\sqrt{n/\ln n})$-近似；以及$(2)$一个匹配的信息论难度结果，表明任何对$f$进行多项式次数求值的算法都无法实现$o(\sqrt{n/\ln n})$-近似。此前，已知的最佳近似难度为$2$，而$O(\sqrt{n/\ln n})$-近似仅对$f$既是子模又是对称的特殊情况已知。

英文摘要

We consider the Minimum Linear Ordering Problem: given a ground set $N$ of cardinality $n$ and a non-negative set function $f\colon 2^N\rightarrow \mathbb{R}_{\geq 0}$, the goal is to find an ordering $π$ of $N$ that minimizes the sum of the values of $f$ over all prefixes of $π$. This problem has been studied for various classes of set functions, and the case of a submodular $f$ is of special interest, as it captures classic problems including Minimum Linear Arrangement and Minimum Containing Interval Graph. In this work, we resolve the approximability of the Minimum Linear Ordering Problem for a general submodular $f$ by establishing matching upper and lower bounds and present: $(1)$ a polynomial-time algorithm achieving an $O(\sqrt{n/\ln n})$-approximation; and $(2)$ a matching information-theoretic hardness result, showing that no algorithm evaluating $f$ a polynomial number of times can achieve an $o(\sqrt{n/\ln n})$-approximation. Previously, the best known hardness of approximation was $2$, and an $O(\sqrt{n/\ln n})$-approximation was known only for the special case where $f$ is both submodular and symmetric.

URL PDF HTML ☆

赞 0 踩 0

2606.20173 2026-06-19 cs.SE 新提交

Qiskit Code Migration with LLMs

使用大语言模型进行Qiskit代码迁移

Jose Manuel Suarez, Luis Mariano Bibbo, Joaquin Bogado, Alenandro Fernandez

AI总结针对量子软件开发套件版本演进导致的代码维护问题，提出结合大语言模型与检索增强生成（RAG）的混合方法，利用自动生成的迁移场景分类体系引导模型，实现Qiskit代码跨版本自动迁移，有效减少幻觉并提升迁移建议质量。

详情

AI中文摘要

量子开发套件（QDK）的快速演进引入了一种特定形式的技术债务，损害了代码可维护性并阻碍了软件复用。在量子软件工程（QSE）这一专业领域，高质量训练数据的稀缺和新兴框架的高波动性加剧了这一挑战，常导致通用大语言模型（LLM）产生不可靠或幻觉结果。本文提出一种将LLM与检索增强生成（RAG）相结合的混合方法，用于自动化Qiskit代码的跨版本迁移。所提方法通过利用自动生成的迁移场景分类体系作为结构化、版本特定的知识源来指导模型，从而提升迁移建议的精度和可靠性。该方法通过一个自动化、可扩展的工作流实现，评估了不同检索方案（无约束和限制性）下的LLM（Google Gemini Flash-2.5和OpenAI Gpt-oss-20b）。结果表明，基于分类体系的RAG架构，特别是在限制性方案下，显著减少了幻觉并提高了描述质量，其中Google Gemini Flash-2.5在检测复杂重构场景方面表现出更优性能。这些发现证实了这种以数据为中心的方法在促进技术独立性、提供缓解API过时问题的鲁棒智能助手方面的潜力，从而确保量子算法在快速变化的生态系统中的长期可用性，并降低量子软件工程（QSE）的学习曲线。

英文摘要

The rapid evolution of Quantum Development Kits (QDKs) introduces a specific form of technical debt that compromises code maintainability and hinders software reuse. In the specialized domain of Quantum Software Engineering (QSE), this challenge is intensified by the scarcity of high-quality training data and the high volatility of emerging frameworks, which often lead general-purpose Large Language Models (LLMs) to produce unreliable or hallucinated results. This paper proposes a hybrid approach integrating LLMs with Retrieval-Augmented Generation (RAG) to automate the migration of Qiskit code across versions. The proposed methodology enhances the precision and reliability of migration suggestions by leveraging an automatically generated taxonomy of migration scenarios as the structured, version-specific knowledge source to guide the models. The approach is implemented through an automated, extensible workflow evaluating LLMs (Google Gemini Flash-2.5 and OpenAI Gpt-oss-20b) under different retrieval schemes (unconstrained and restrictive). Results demonstrate that the taxonomy-based RAG architecture, particularly under the restrictive scheme, significantly reduces hallucinations and improves descriptive quality, with Google Gemini Flash-2.5 showing superior performance in detecting complex refactoring scenarios. These findings confirm the potential of this data-centric methodology to foster technological independence and provide robust, intelligent assistants that mitigate API obsolescence, ensuring the long-term availability of quantum algorithms within a rapidly shifting ecosystem and flattening the learning curve within Quantum Software Engineering (QSE).

URL PDF HTML ☆

赞 0 踩 0

2606.20163 2026-06-19 eess.SY cs.SY 新提交

Techno-Economic Analysis of Shared Mobile Storage for Demand Charge Reduction

用于需求费用削减的共享移动储能技术经济分析

B Hari Kiran Reddy, Ge Chen, Junjie Qin

AI总结本文提出一个高保真车队管理框架，通过混合整数线性规划模型和启发式算法，评估共享电动汽车在考虑实际物流和运营约束下削减需求费用的技术经济可行性。

Comments 22 pages, 26 figures, journal

详情

AI中文摘要

本文研究了在实际物流和运营约束下，共享电动汽车车队用于削减需求费用的技术经济可行性。与忽略运输开销的理想化模型不同，我们提出了一个高保真车队管理框架，明确考虑了能源消耗的时空耦合、电动汽车驾驶员的人工成本和电池退化。我们将调度问题表述为混合整数线性规划，共同最小化需求费用和总拥有成本。为了解决路径依赖约束带来的计算复杂性，我们开发了一种基于边际价值的启发式算法，该算法以高计算效率实现了接近最优的性能。使用旧金山的真实数据，我们的分析表明，适度数量的电动汽车可以实现显著的需求费用节省，足以收回拥有和运营成本。我们的结果还显示了电价结构、车队规模和成本组成部分如何影响整体盈利能力。

英文摘要

This paper investigates the techno-economic viability of shared electric vehicle (EV) fleets for demand charge reduction under practical logistical and operational constraints. Unlike idealized models that overlook transit overheads, we propose a high-fidelity fleet management framework that explicitly accounts for the spatio-temporal coupling of energy consumption, labor costs for EV drivers, and battery degradation. We formulate the dispatch problem as a mixed-integer linear program (MILP) that jointly minimizes demand charges and total cost of ownership. To address the computational complexity arising from path-dependent constraints, we develop a marginal-value-based heuristic algorithm that achieves near-optimal performance with high computational efficiency. Using real-world data from San Francisco, our analysis reveals that a modest number of EVs can achieve significant demand charge savings, sufficient to recover the ownership and operational expenses. Our results also show how tariff structures, fleet size, and cost components influence overall profitability.

URL PDF HTML ☆

赞 0 踩 0

2606.20158 2026-06-19 cs.SE 新提交

N-Version Programming with Coding Agents

使用编码代理的N版本编程

Javier Ron, Benoit Baudry, Martin Monperrus

AI总结本文在当代AI编码代理背景下重新审视N版本编程，通过Knight-Leveson实验评估代理系统、模型和实现语言的多样性对故障模式的影响，发现常见模式故障，但多数投票三版本单元显著降低故障数，证明该策略的工程实用性。

详情

AI中文摘要

本文在当代AI编码代理背景下重新审视N版本编程这一经典概念。通过重访开创性的Knight-Leveson实验，我们研究了代理系统、模型和实现语言之间的多样性是否会产生多样化的故障模式。使用Knight-Leveson的发射拦截器程序规范，我们在共享的预言机和100万个随机测试输入的测试集上评估了48个代理生成的实现。结果显示，与Knight-Leveson的发现一致，存在大量的共模故障。进一步分析表明，许多这些同时发生的故障可以追溯到规范中特别困难或模糊的地方。我们还证明了编码代理的多样性带来了实际效益：在多数投票的三版本单元中，平均故障数从单版本的387.44下降到三版本的130.99，并且有11,844个N版本单元表现出零观测故障。我们的原始结果是迄今为止最强的证据，表明使用编码代理的N版本编程是一种有用的工程策略。

英文摘要

This paper revisits the classical concept on N-version programming in the setting of contemporary AI coding agents. Revisiting the seminal Knight-Leveson experiment, we study whether diversity across agent systems, models, and implementation languages creates diverse failure modes. Using the Knight-Leveson's, Launch Interceptor Program Specification, we evaluate 48 agent-generated implementations on a shared oracle and a campaign of 1,000,000 randomized test inputs. The results show substantial common-mode failure, along the findings of Knight-Leveson. Further analysis that many of those co-occuring failures can be traced to where is specification is particularly hard or ambiguous. We also demonstrate that diversity from coding agents provides practical benefit: across majority voting three-version units, the mean failure count drops from 387.44 for single versions to 130.99 for triples, and 11,844 N-version units exhibit zero observed failures. Our original results is the strongest evidence to date that N-Version Programming with coding agents is a useful engineering strategy.

URL PDF HTML ☆

赞 0 踩 0

2606.20134 2026-06-19 cs.LO cs.PL 新提交

An MSO Framework for Weak-Memory Verification and Robustness

弱内存验证与鲁棒性的MSO框架

Giovanna Kobus Conrado, Andreas Pavlogiannis

AI总结本文研究单子二阶逻辑作为弱内存元理论，证明顺序一致性执行有界树宽而TSO无界，展示多种模型可MSO公理化，并引入读自鲁棒性概念，实现统一验证算法。

Comments Accepted at CONCUR 2026

详情

AI中文摘要

内存模型是并发程序执行的形式化规范，解释了编译器和架构优化引入的弱行为。其数量和复杂性的增加促使人们通过在适当的元理论中公理化模型来统一验证整个模型类别。本文正式研究单子二阶逻辑（MSO）作为弱内存的元理论，通过证明各种流行弱内存模型的树宽和MSO可表达性结果，使得我们能够统一处理多个验证问题。总结如下：首先，我们证明顺序一致性（$\mathsf{SC}$）下的执行具有有界树宽，而总存储顺序（$\mathsf{TSO}$）下的执行则无界。其次，我们证明包括Release/Acquire和完整RC20在内的广泛模型是MSO可公理化的，而其他模型如Strong Release/Acquire和$\mathsf{TSO}$则不可，除非正交向量问题（在SETH下需要二次时间）可以在线性时间内解决。最后，我们引入读自鲁棒性概念，作为对近期粗粒度鲁棒性准则工作的扩展。我们证明树宽界限（上界和下界）对任何MSO可公理化模型$\mathsf{MM}$具有深远的算法意义：存在一个算法，对于每个程序$\mathsf{P}$，要么验证$\mathsf{P}$在$\mathsf{MM}$下的正确性，要么报告$\mathsf{P}$对$\mathsf{MM}$不是读自鲁棒的。总体而言，我们的结果为弱内存验证和鲁棒性建立了一个丰富且多功能的理论框架。

英文摘要

Memory models are formal specifications of concurrent-program executions, accounting for weak behaviors introduced by compiler and architectural optimizations. The increase of their number and complexity has spawned efforts for uniform verification across whole classes of models, by axiomatizing the models in an adequate metatheory that admits a uniform treatment. In this work, we formally study Monadic Second-Order logic (MSO) as a metatheory for weak memory, by proving results on the treewidth and MSO-expressibility of various popular weak-memory models, as this combination allows us to uniformly tackle several verification problems. In summary, our results are as follows. First, we prove that executions under Sequential Consistency ($\mathsf{SC}$) have bounded treewidth, while already those under Total Store Order ($\mathsf{TSO}$) do not. Second, we prove that a broad range of models, including Release/Acquire and the full RC20, are MSO-axiomatizable, while others, such as Strong Release/Acquire and $\mathsf{TSO}$, are not, unless the Orthogonal Vectors problem $\unicode{x2013}$ which requires quadratic time under SETH $\unicode{x2013}$ can be solved in linear time. Finally, we introduce the notion of reads-from robustness, as an extension to recent work on coarse robustness criteria. We show that our treewidth bounds (both upper and lower) have far-reaching algorithmic implications for any of our MSO-axiomatizable models $\mathsf{MM}$: there is an algorithm that, for every program $\mathsf{P}$, either verifies $\mathsf{P}$ under $\mathsf{MM}$ or reports that $\mathsf{P}$ is not reads-from robust against $\mathsf{MM}$. Overall, our results establish a rich and versatile theoretical framework for weak-memory verification and robustness.

URL PDF HTML ☆

赞 0 踩 0

2606.20129 2026-06-19 cs.SE 新提交

Learning Critical Testing Literacy Through Puzzles: an Experience Report

通过谜题学习关键测试素养：经验报告

Niels Doorn, Bart Th. Knaack, Tanja E. J. Vos, Beatriz Marín

AI总结本文报告了使用谜题教授关键测试素养（CTL）的13次工作坊经验，发现参与者通过解谜、汇报和反思的完整序列学习效果显著，并开发了开源分析工具。

详情

AI中文摘要

在本文中，我们报告了使用谜题学习CTL的工作坊经验和收获。背景：软件测试重要但难以教授。我们引入了一个基于谜题的学习活动知识体系来教授CTL，该体系基于关键测试者认知模型，形成了P4TEST教学框架。我们与学生、测试人员、教师和小学生共举办了13次工作坊，评估基于谜题的关键测试素养教学。经验：在11次工作坊中，我们采用半结构化方法，变化谜题、材料和时长。在另外两次工作坊中，我们引入了工作手册和出声思考环节，以收集更多关于学习体验的数据。观察：参与者普遍认为自己在解谜时进行实验。学生倾向于收敛于解决方案，而专业人员继续探索。情绪在行为中可见，但难以通过书面反思单独浮现。出声思考环节揭示了即时推理；书面反思引发了更多元认知反思。主题“意义建构/行动中反思”捕捉了参与者如何构建问题、应对死胡同和转变策略。反思：谜题本身并非干预手段；解谜、汇报和反思的完整序列才是。更刻意地设计这一序列是未来的工作。我们还开发了一个带有内置分析功能的开源网络应用程序，用于定制工作坊。

英文摘要

In this paper, we report our experiences and takeaways from workshops using puzzles to learn CTL. Background: Software testing is important yet difficult to teach. We introduced a BoK of puzzle-based learning activities to teach CTL, based on a model of critical tester's cognition, leading to the pedagogical framework P4TEST. We conducted thirteen workshops with students, testers, teachers, and primary school pupils to assess puzzle-based teaching of critical testing literacy. Experience: Across eleven workshops, we used a semi-structured approach, varying puzzles, materials, and timing. In two additional workshops, we introduced workbooks and think-aloud sessions to gather more data on the learning experience. Observations: Participants consistently perceived themselves as experimenting while solving puzzles. Students tended to converge on solutions, while professionals continued exploring. Emotions were visible in behaviour but hard to surface through written reflection alone. Think-aloud sessions revealed immediate reasoning; written reflections elicited more meta-cognitive reflection. The theme Sensemaking / reflection-in-action captured how participants framed problems, navigated dead ends, and shifted strategies. Reflections: Puzzles are not the intervention: the entire sequence of solving, debriefing, and reflecting is. Designing that sequence more deliberately is the work ahead. We also developed an open-source web application with built-in analytics to customise workshops.

URL PDF HTML ☆

赞 0 踩 0

2606.20127 2026-06-19 eess.SY cs.SY 新提交

Contraction-based Neural Control for Cooperative Aerial Payload Transportation with Variable-length Cables

基于收缩的神经控制用于可变长度缆绳的协同空中载荷运输

Yi Lok Lo, Longhao Qian, Hugh H. T. Liu

AI总结提出一种多无人机吊挂载荷系统的神经非线性控制框架，通过解耦动力学结构，联合训练神经收缩度量控制器和反馈控制器实现载荷轨迹跟踪，并利用可变长度缆绳进行避障。

Comments Submitted for publication in AIAA Scitech 2027

2606.20121 2026-06-19 cs.LO 新提交

BARReL: a modern backend for Atelier B in Lean

BARReL：Atelier B 在 Lean 中的现代后端

Ghilain Bergeron, Vincent Trélat

AI总结 BARReL 是一个 Lean 4 库，桥接工业 B 方法工具 Atelier B 与 Lean 证明助手，支持在 Lean 中交互式进行 B 开发，通过显式良定义条件编码部分算子，并利用依赖类型保证良定义性，同时提供基本自动化。

详情

AI中文摘要

BARReL 是一个 Lean 4 库，桥接了工业 B 方法工具 Atelier B 与 Lean 证明助手，使用户能够在 Lean 中交互式地进行形式化 B 开发（直至机器精化和实现），同时保留标准 B 语法。B 部分算子通过生成显式的良定义条件进行仔细编码，利用 Lean 的依赖类型从构造上强制实施良定义性纪律。也就是说，证明义务和证明步骤不能静默地依赖于类型错误或定义不当的实例化。BARReL 还具备基本自动化功能，尝试自动处理此类良定义条件。该实现完全使用 Lean 元编程编写，并设计为模块化：扩展支持的 B 片段通常只需添加新的语法和编码子句。我们通过一个小型但具有代表性的案例研究说明了该方法，并论证 BARReL 可以作为迈向基于 Lean 证明助手的高度可靠的 Atelier B 工具链的垫脚石。

英文摘要

BARReL is a Lean 4 library bridging Atelier B, an industrial tool for the B method, and the Lean proof assistant by enabling users to conduct their formal B developments -- up to machine refinement and implementation -- interactively inside Lean, while retaining standard B syntax. B partial operators are carefully encoded by generating explicit well-definedness conditions, leveraging Lean's dependent types to enforce a well-definedness discipline by construction. That is, proof obligations and proof steps cannot silently rely on ill-typed or ill-defined instantiations. BARReL also features basic automation to try to discharge such well-definedness conditions automatically. The implementation is written entirely using Lean meta-programming and is designed to be modular: extending the supported B fragment typically requires only adding new syntax and encoding clauses. We illustrate the approach on a small but representative case study, and argue that BARReL can act as a stepping stone towards a strongly reliable Atelier B toolchain grounded in the Lean proof assistant.

URL PDF HTML ☆

赞 0 踩 0

2606.20117 2026-06-19 cs.CE 新提交

Autoregressive Modelling and Synthetic Generation of High-Fidelity, Statistically Equivalent 3D Microstructures for As-Manufactured Misalignments in Fiber-Reinforced Composites

面向纤维增强复合材料中制造偏差的高保真、统计等效三维微观结构的自回归建模与合成生成

Mohamad A. Raja, Clemens Dransfeld, Boyang Chen

AI总结提出一种集成框架，通过X射线μCT数据提取纤维错位特征，结合copula、自回归和极端值建模，经贝叶斯优化校准后，迭代生成约2400根非重叠合成纤维，统计偏差低于10%。

详情

AI中文摘要

本研究提出一个集成框架，用于从实验X射线-μCT观测中处理、建模和生成统计代表性的三维纤维微观结构。首先，引入一种解析的切片-段椭圆相交方法，沿纤维深度提取每切片和每纤维的面内和面外错位轮廓。然后利用这些描述符构建一个随机模型，通过基于copula的面内依赖性、潜在自回归连续性和罕见极端错位模式，捕获切片级错位分布及其沿深度的演变。模型超参数通过贝叶斯优化校准，与原始统计描述符达到高度一致，偏差通常低于10%。优化后的统计模型与物理生成策略相结合，该策略从可变半径纤维种子层开始，通过逐切片迭代的三维生长方案进行，其中统计层引导纤维演化，基于Delaunay的邻域构建与基于椭圆的接触分辨率确保非重叠、半径增强的合成微观结构。该框架成功生成约2400根合成纤维，同时保持对原始X射线-μCT数据的强统计保真度。所提出的管道为生成统计等效、几何可接受且可立即用于仿真的纤维复合材料微观结构提供了一条有前景且可扩展的途径，用于虚拟测试和分析。

英文摘要

This study presents an integrated framework for processing, modelling, and generating statistically representative three-dimensional fiber microstructures from experimental X-ray-$μ$CT observations. First, an analytical slice-segment ellipse-intersection method is introduced to extract per-slice and per-fiber in-plane and out-of-plane misalignment profiles along the fiber depth. These descriptors are then used to construct a stochastic model that captures slice-wise misalignment distributions and their depth-wise evolution through, copula-based in-plane dependence, latent autoregressive continuity, and rare extreme-misalignment motifs. The model hyperparameters are calibrated using Bayesian optimization, achieving close agreement with the original statistical descriptors, with deviations generally below 10\%. The optimized statistical model is coupled with a physical generation strategy that begins with variable-radius fiber seeding layer and proceeds through an iterative slice-by-slice 3D growth scheme, where the statistical layer guides fiber evolution and Delaunay-based neighbourhood construction with ellipse-based contact resolution ensures non-overlapping, radius-augmented synthetic microstructures. The framework successfully generates about 2400 synthetic fibers while preserving strong statistical fidelity to the original X-ray-$μ$CT data. The proposed pipeline provides a promising and scalable route for generating statistically equivalent, geometrically admissible, and simulation-ready fiber composite microstructures for virtual testing and analysis.

URL PDF HTML ☆

赞 0 踩 0

2606.20102 2026-06-19 cs.CY cs.CR 新提交

Artificial Intelligence as Game Changer in Cybersecurity: What We Learned in 2025-2026, and how this is relevant for Africa

人工智能作为网络安全游戏规则改变者：2025-2026年我们学到的，以及这对非洲的意义

Mikael Alemu Gorsky

AI总结本文通过2025-2026年两个事件论证前沿语言模型已成为网络作战决定性工具，而非洲在模型构建、运营和获取上被完全排除，面临技能、算力和投资三重赤字，并遭受AI欺诈攻击，建议在6-12个月内通过威胁情报共享、治理采纳和伙伴关系应对。

Comments International Conference on Cybersecurity in the Era of Digital Transformation and Artificial Intelligence

详情

AI中文摘要

在2025年和2026年，两个事件解决了此前仅是推测的问题。第一个事件中，一个大型语言模型独立执行了国家支持的网络间谍活动的大部分任务，人类操作员仅在少数决策点介入。第二个事件中，最强大的网络相关模型被置于一个受控访问计划之下，仅限于经过审查的美国科技公司、盟国政府和欧洲标准机构；该范围不包括任何非洲政府、运营商或大学。这两个事件共同确立了本文的论点：前沿语言模型已成为网络作战的决定性工具，而该工具在一个小圈子内建造、拥有和配给，非洲被排除在外。本文记录了非洲在每一方面的排斥。该大陆不构建前沿模型，尚无法运营它们，并且目前无法获得最强大的模型。运营赤字沿着三个轴心展开：技能人才、计算和电力、投资，每个都根据当前数据衡量；与此同时，针对非洲移动货币系统（该大陆领先的数字经济部分）的AI欺诈攻击已经在增加。由此产生两个约束：开发者对前沿模型的把关（非洲决策无法打开），以及对基础设施供应商的选择性依赖（现已陷入地缘政治限制）。由于可比较但不受把关的模型预计在6至12个月内扩散，本文主张通过威胁情报共享、治理采纳和伙伴关系，在非洲人自主条件下，在该窗口内采取应对措施。

英文摘要

In 2025 and 2026, two events settled questions that had until then been speculative. In the first, a large language model executed the great majority of a state-aligned cyber-espionage campaign on its own, with human operators intervening at only a few decision points. In the second, the most capable cyber-relevant model was placed under a controlled-access program limited to a vetted set of United States technology firms, allied governments, and European standards bodies; that perimeter included no African government, operator, or university. Together the two events establish the argument of this paper: frontier language models have become a decisive instrument of cyber operations, and that instrument is built, owned, and rationed within a small circle from which Africa is absent. The paper documents Africa's exclusion on every count. The continent does not build frontier models, cannot yet operate them, and cannot, for now, obtain the most capable ones. The operational deficit is set out along three axes, skilled people, compute and electrical power, and investment, each measured against current figures; meanwhile AI-enabled fraud is already mounting against African mobile-money systems, the part of the digital economy the continent leads. Two constraints follow: the gating of frontier models by their developers, which no African decision can open, and a chosen dependence on infrastructure vendors now caught in geopolitical restriction. Because comparable but ungated models are forecast to spread within six to twelve months, the paper argues for a response that operates inside that window through threat-intelligence sharing, governance adoption, and partnership, undertaken by Africans on their own terms.

URL PDF HTML ☆

赞 0 踩 0

2606.20096 2026-06-19 cs.CG q-bio.NC 新提交

Quadratic Forms for Measuring Geometric Trees in 3-dimensional Space

用于测量三维空间中几何树的二次型

Yossi Bokor Bleile, Emanuele Cortinovis, Herbert Edelsbrunner, Shota Uka

AI总结提出使用二次型测量几何树的方向分布，并引入基于Fisher度量的六边形图模型进行可视化和统计分析。

Comments 16 pages, 6 figures

AI 大模型

视觉与机器人

科学与医疗

Directors Duties in the Age of Agentic Artificial Intelligence

Image Encryption Algorithm Based on Convolutional Neural Networks and Dynamic S-Box Generation

ExSpike: A General Full-Event Neuromorphic Architecture for Exploiting Irregular Sparsity with Event Compression

MaRDI Open Interfaces for Interoperable Nonlinear Optimization

PowerAgentBench-Dyn: A Benchmark for Agentic AI in Power System Dynamic Studies

Linked Fates: How Small of an Ambiguity Increase Can Make the Difference Between Equaling and Separating from P?

Organizing in the Digital Age: Understanding Community, Challenges, and Consequences in Digitally-facilitated Labor Organizing

ARGUS: Production-Scale Tracing and Performance Diagnosis for over 10,000-GPU Clusters

Sparse add-on controller design: A Youla approach to system-level performance

A cubical formalisation of conditional independence, Bayesian conditioning, and Pearl's d-separation soundness

Computing Twin-Width via Treedepth and Vertex Integrity

AgenticDB: Agentic Performance Reconfiguration for Database Workloads

Data-Driven Control from Poisoned Data: Fundamental Limitations and Secure DeePC

Quantization as a Malicious Task: Removing Quantization-Conditioned Backdoors via Task Arithmetic

TrustMix: How to Mix Messages in a Mobile Ad-hoc Network

Phoenix: Safe GitHub Issue Resolution via Multi-Agent LLMs

SysML Modeling of Digital Twins for Renewable Energy Communities

GNSS Spoofing Threat for V2X communications

Accelerating Trust Convergence in IIoT: A ML Approach for Dynamic Network Conditions

Tight Algorithm and Hardness for Submodular Linear Ordering

Qiskit Code Migration with LLMs

Techno-Economic Analysis of Shared Mobile Storage for Demand Charge Reduction

N-Version Programming with Coding Agents

An MSO Framework for Weak-Memory Verification and Robustness

Learning Critical Testing Literacy Through Puzzles: an Experience Report

Contraction-based Neural Control for Cooperative Aerial Payload Transportation with Variable-length Cables

BARReL: a modern backend for Atelier B in Lean

Autoregressive Modelling and Synthetic Generation of High-Fidelity, Statistically Equivalent 3D Microstructures for As-Manufactured Misalignments in Fiber-Reinforced Composites

Artificial Intelligence as Game Changer in Cybersecurity: What We Learned in 2025-2026, and how this is relevant for Africa

Quadratic Forms for Measuring Geometric Trees in 3-dimensional Space