arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 2094
2606.19396 2026-06-19 q-bio.QM 新提交

BioHarness: Substrate-Aware Evidence Assembly for Biomedical Question Answering across Literature, Knowledge Bases, and Biological Atlases

BioHarness:面向生物医学问答的底物感知证据组装——跨文献、知识库和生物图谱

Meng Xiao, Chuan Qin, Jinmiao Chen, Yihang Cheng, Yuanchun Zhou, Hengshu Zhu

AI总结 提出BioHarness,通过级联控制机制在文献检索、知识库和生物图谱间选择性组装证据,提升生物医学问答准确率,在19,302个问答项上得分从65.9提升至71.0。

Comments 14 Pages, 11 Figures, Keywords: biomedical question answering; retrieval-augmented generation; large language models; evidence assembly; biomedical knowledge bases; biological atlases

详情
AI中文摘要

动机:生物医学问答通常需要超越主题检索文献的证据,包括基因别名解析、数据库标识符标准化以及来自图谱的生物测量值。然而,现有的检索增强生成(RAG)系统通常遵循固定工作流程,缺乏明确机制来决定何时检索文本足够、何时需要经过整理的生物医学知识、或何时应调用对结构化测量值的可执行证据组装。这激发了一种底物感知的大语言模型(LLM)框架,能够跨文献、知识库和生物图谱选择性地组装足够的证据。结果:我们引入BioHarness,一种用于分阶段生物医学证据组装的LLM框架,涵盖文献检索、经过整理的生物医学知识资源以及来自图谱的结构化测量值。BioHarness首先尝试根据重排序的文献证据回答问题,并通过基于接地级联控制,仅在当前证据不确定、接地不足或底物不匹配时升级到REPL风格的证据组装。在涵盖七种答案格式的19,302个生物医学问答项上,BioHarness将最强非预言基线的综合得分从65.9提升至71.0。消融实验、案例研究和骨干扩展分析表明,这些提升源于通过重排序、实体接地和结构化测量访问修复证据-底物不匹配,而非不加区分地调用更多推理步骤、检索更多文献或依赖特定答案模型规模。

英文摘要

Motivation: Biomedical question answering often requires evidence beyond topically retrieved literature, including gene alias resolution, database identifier normalization, and atlas-derived biological measurements. However, existing retrieval-augmented generation (RAG) systems typically follow a fixed workflow and lack an explicit mechanism for deciding when retrieved text is sufficient, when curated biomedical knowledge is required, or when executable evidence assembly over structured measurements should be invoked. This motivates a substrate-aware large language model (LLM) harness that selectively assembles sufficient evidence across literature, knowledge bases, and biological atlases. Results: We introduce BioHarness, an LLM harness for staged biomedical evidence assembly across literature retrieval, curated biomedical knowledge resources, and atlas-derived structured measurements. BioHarness first attempts to answer from reranked literature evidence and escalates through grounded cascade control to REPL-style evidence assembly only when the current evidence is uncertain, weakly grounded, or substrate-mismatched. Across 19,302 biomedical QA items spanning seven answer formats, BioHarness improves the pooled score from 65.9 to 71.0 over the strongest non-oracle baseline. Ablations, case studies, and backbone-scaling analyses show that these gains arise from repairing evidence-substrate mismatches through reranking, entity grounding, and structured measurement access, rather than from indiscriminately invoking more reasoning steps, retrieving additional literature, or relying on a particular answer-model scale.

2606.20315 2026-06-19 q-bio.GN cs.CR 新提交

bioETH-Beacon: A Confidential On-Chain Genomic Beacon with Encrypted Counts, Filters, and Bounded Noise over a Fully Homomorphic EVM

bioETH-Beacon: 基于全同态EVM的机密基因组信标,支持加密计数、过滤和有界噪声

Christos Galanopoulos, Kimon Antonios Provatas, Ilias Georgakopoulos-Soares

AI总结 提出基于全同态EVM的智能合约原型bioETH-Beacon,实现加密基因组信标查询,通过加密计数、有界噪声和访问控制抵御成员推理攻击,并优化查询成本。

Comments 11 pages, 6 figures, 8 tables. Research prototype for privacy-preserving genomics using Fully Homomorphic Encryption (FHE) on blockchain (fhEVM)

详情
AI中文摘要

全球基因组学与健康联盟(GA4GH)Beacon协议允许研究人员查询某个基因组变异是否在参与队列中被观察到,并返回聚合的变异级计数。随着Beacon网络的发展,两个隐私风险依然存在:宿主机构可以看到明文查询,而重复的罕见变异查询可能支持成员推理攻击。我们提出了bioETH-Beacon,一个智能合约原型,它在全同态以太坊虚拟机(fhEVM)上对加密数据执行Beacon“聚合计数”查询。医院上传加密的标记计数条目,授权研究人员提交加密的标记查询,合约返回加密答案,通过链下密钥管理服务仅释放给合约链上ACL中指定的请求者。该设计组织为一个3x4的层级-查询族网格,涵盖基因型、性别、年龄和表型查询,层级在更强的机密性和更低的查询成本之间进行权衡。对于基因型路径,原型可以添加链上有界噪声以减轻探测攻击。基于多基因评分(PGS)目录的合成面板实验显示了预期的扩展行为,并证明当公共标记存在是可接受的权衡时,预聚合可以显著降低查询gas成本。总体而言,bioETH-Beacon提供了一个无需可信计算评估者的机密Beacon式基因组查询研究原型。

英文摘要

The Global Alliance for Genomics and Health (GA4GH) Beacon protocol lets researchers ask whether a genomic variant has been observed in a participating cohort and receive aggregate variant-level counts. As Beacon networks grow, two privacy risks remain: host institutions can see plaintext queries, and repeated rare-variant queries can support membership-inference attacks. We present bioETH-Beacon, a smart-contract prototype that runs the Beacon "aggregate count" query over encrypted data on a fully homomorphic Ethereum Virtual Machine (fhEVM). Hospitals upload encrypted marker-count entries, authorized researchers submit encrypted marker queries, and the contract returns an encrypted answer that is released, via an off-chain key-management service, only to the requester named in the contract's on-chain ACL. The design is organized as a 3x4 tier-by-query-family grid spanning genotype, sex, age, and phenotype queries, with tiers that trade stronger confidentiality for lower query cost. For genotype paths, the prototype can add bounded on-chain noise to mitigate probing attacks. Experiments on synthetic panels derived from a Polygenic Score (PGS) catalog show the expected scaling behavior and demonstrate that pre-aggregation can substantially reduce query gas when public marker presence is an acceptable trade-off. Overall, bioETH-Beacon provides a research prototype for confidential Beacon-style genomic querying without a trusted compute evaluator.

2606.19794 2026-06-19 econ.GN cs.CY q-fin.EC 新提交

Forecasting AI-Era Productivity: The Intellectually Converged Human Framework and a Missing Cognitive Mediator in Production Function Theory

预测AI时代的生产率:智力融合人类框架与生产函数理论中缺失的认知中介

Kwan Soo Shin, In Seok Kang

AI总结 本文提出智力融合人类(ICH)框架,通过引入四维认知构念“融合能力”(C)作为AI与生产率之间的认知中介,解释了AI投资未能带来相应生产率增长的理论悖论,并基于20个OECD国家的数据分析验证了AI与C的交互作用对全要素生产率变异的解释力。

Comments 78 pages, 3 figures

详情
AI中文摘要

为什么大规模AI投资未能产生相应的生产率增长?我们认为这一悖论在理论上是生成的:主流生产函数框架通过将AI视为可分离的生产要素,而未建模AI产生生产性价值的认知中介,从而遇到了结构性边界。这导致投资倾向于部署,而生产率需要先发展我们称之为融合能力(C)的东西。我们提出了智力融合人类(ICH)框架,这是生产函数理论的第五阶段框架:H-hat = H[1 + phi(A,C)],其中有效生产能力等于人力资本(H)乘以一个增强因子[1 + phi],phi由AI利用强度(A)和融合能力(C)共同决定,C是一个四维认知构念,涵盖具身理解、元认知、时间整合和整合思维。生产函数Y = F(K, H-hat)为索洛的TFP残差提供了一个以人为中心的机制:A_Solow = [1 + phi(A,C)]^(1-alpha)。该框架预测了三种具有不同政策含义的增强机制。对20个OECD经济体的描述性跨国分析显示,AIxC交互作用与86%的TFP变异相关,而仅AI为31%,这是小n理论传统中模式一致的发现。韩国是国家级欠增强的例证:高H、大量A、低C导致phi=0。我们将融合能力与相邻构念——吸收能力、动态能力和人力资本——区分开来,并证明C构成了先前框架中隐含的特定认知中介。我们推导出C优先的政策建议,并提出了三个可实证检验的命题及一个可证伪的10年预测。

英文摘要

Why does massive AI investment fail to generate commensurate productivity gains? We argue the paradox is theoretically generated: prevailing production function frameworks encounter a structural boundary by treating AI as a separable factor of production without modeling the cognitive mediation through which AI generates productive value. This directs investment toward deployment when productivity requires prior development of what we term convergence capacity (C). We propose the Intellectually Converged Human (ICH) framework, a fifth-stage framework for production function theory: H-hat = H[1 + phi(A,C)], where effective productive capacity equals human capital (H) scaled by an augmentation factor [1 + phi], with phi jointly determined by AI utilization intensity (A) and convergence capacity (C), a four-dimensional cognitive construct encompassing embodied understanding, metacognition, temporal integration, and integrative thinking. The production function Y = F(K, H-hat) provides a human-centered mechanism for Solow's TFP residual: A_Solow = [1 + phi(A,C)]^(1-alpha). The framework predicts three augmentation regimes with distinct policy implications. Descriptive cross-national analysis of 20 OECD economies shows the AIxC interaction is associated with 86% of TFP variance versus 31% for AI alone, a pattern-consistent finding in the small-n theoretical tradition. South Korea exemplifies national-scale under-augmentation: high H, substantial A, low C produce phi = 0. We distinguish convergence capacity from adjacent constructs, absorptive capacity, dynamic capability, and human capital, and demonstrate that C constitutes the specific cognitive mediator that prior frameworks have left implicit. We derive C-first policy prescriptions and offer three empirically testable propositions with a falsifiable 10-year forecast.

2606.20553 2026-06-19 cs.CR 新提交

From Efficiency to Leakage -- Privacy Backdoor in Federated Language Model Fine-Tuning

从效率到泄露——联邦语言模型微调中的隐私后门

Shanghao Shi, Chaoyu Zhang, Heng Jin, Yang Xiao, Yevgeniy Vorobeychik, William Yeoh, Ning Zhang, Y. Thomas Hou, Wenjing Lou

AI总结 提出NeuroImprint攻击,恶意参数服务器在参数高效微调中植入隐私后门,通过为每个样本分配独立神经元并限制单次更新,实现高保真重建训练文本。

详情
AI中文摘要

联邦学习(FL)使多方能够协作微调语言模型以完成特定领域任务,而无需共享原始数据。由于完整模型微调对FL客户端而言通常过于昂贵,参数高效微调(PEFT)已成为实践中的事实标准,它冻结基础模型,仅训练少量适配器。在本文中,我们表明恶意参数服务器可以隐秘地将PEFT适配器破坏为隐私后门,该后门隐式记忆客户端的训练样本,作为存储在独立神经元中的隔离的每样本参数更新,而不降低模型效用。具体来说,我们的攻击NeuroImprint为每个训练样本分配一个专用的记忆神经元,并约束每个神经元在局部微调轨迹中最多更新一次。这种设计减轻了语言模型微调中由大批量和状态优化器(如Adam/AdamW)引入的跨样本碰撞和跨步混合。微调后,得到的隔离的每样本更新可以通过闭式解析逆变换恢复文本嵌入,然后确定性地映射回令牌序列。为了理解我们方法的通用性,我们在多个语言模型(BERT、GPT-2、Qwen2和Llama3.2)上实现了NeuroImprint,并在涵盖不同领域的四个微调数据集上进行了评估。结果表明,我们的攻击能够以高语义保真度重建59%至79%的所有微调样本。

英文摘要

Federated learning (FL) enables multiple parties to collaboratively fine-tune language models for domain-specific tasks without sharing raw data. Since full model fine-tuning is often prohibitively expensive for FL clients, parameter-efficient fine-tuning (PEFT) has become the de facto approach in practice, freezing the base model and training only a small set of adapters. In this paper, we show that a malicious parameter server can stealthily corrupt a PEFT adapter into a privacy backdoor that implicitly memorizes the client's training samples as isolated per-sample parameter updates stored in separate neurons, without degrading model utility. Concretely, our attack, NeuroImprint, assigns a dedicated memorization neuron to each training sample and constrains that each neuron is updated at most once along the local fine-tuning trajectory. This design mitigates both cross-sample collisions and cross-step mixing introduced by large local batches and stateful optimizers (e.g., Adam/AdamW) in language-model fine-tuning. After fine-tuning, the resulting isolated per-sample updates can be analytically inverted in closed form to recover text embeddings, which are then deterministically mapped back to token sequences. To understand the generality of our method, we implemented NeuroImprint on multiple language models (BERT, GPT-2, Qwen2, and Llama3.2) and evaluated it across four fine-tuning datasets spanning diverse domains. The results demonstrate that our attack can reconstruct 59% to 79% of all finetuning samples with high semantic fidelity.

2606.20550 2026-06-19 cs.DL cs.HC cs.IR 新提交

Easy Reads: A Python program for making Scientific Papers on arXiv more Reader Friendly and Accessible

Easy Reads: 一个使arXiv上的科学论文更易读和更易访问的Python程序

Vishal Verma

AI总结 针对科学论文排版紧凑、可读性差的问题,提出Easy Reads——一个自动化、端到端的开源Python程序,通过自定义字体大小和列数等格式,从arXiv获取论文并重新排版,提升可读性和可访问性。

Comments 9 pages. Open-source software project available at: https://github.com/Curious-flow/Easy-Reads

详情
AI中文摘要

科学论文通常排版紧凑,具有小字体、小行距、双栏文本和紧密排列的图表等特点。虽然这些特性使论文更紧凑,但会妨碍可读性,降低可访问性,并可能使读者感到疲劳。arXiv是一个跨学科的科学论文开放获取库,被包括物理学和天体物理学社区在内的研究人员广泛使用。Easy Reads是一个自动化、端到端的开源Python程序,通过使arXiv上的论文更易读和更易访问来帮助解决上述挑战。Easy Reads可以通过URL自动从arXiv获取论文,并处理源TeX文件,允许自定义论文的格式特性,主要是字体大小和使用的栏数。Easy Reads的主要目标是促进科学论文的易读性。

英文摘要

Scientific papers are frequently dense and characterized by features such as small fonts and line spacing, double columns of text, and tightly arranged figures. While these features make papers more compact, they can hinder readability, make them less accessible, and can strain the reader. arXiv is a premier open-access repository for scientific papers across different fields and is used extensively by researchers, including those in the physics and astrophysics communities. Easy Reads is an automated, end-to-end, open-source Python program that helps address the stated challenge by making papers from arXiv more reader-friendly and accessible. Easy Reads can automatically fetch a paper from arXiv via its URL and work with the source TeX file to allow custom formatting of the paper features, primarily the font size, and the number of columns used. The main goal of Easy Reads is to facilitate ease of reading of scientific papers.

2606.20539 2026-06-19 cs.DB cs.DS 新提交

Caching for Dollars, Not Hits: An Exact Offline Reference for Cloud-Egress Caching and the Crossover That Decides When It Pays

为美元缓存,而非命中率:云出口缓存的精确离线参考及决定何时值得的交叉点

Madhulatha Mandarapu, Sandeep Kunkunuru

AI总结 针对云存储出口费用而非延迟的缓存问题,提出多项式时间精确离线最优策略,发现LRU的美元后悔随成本分散度上升,而成本感知的GreedyDual可大幅降低,并给出决定何时需要成本感知缓存的闭合形式交叉点。

Comments 6 pages, 3 figures. Code, benchmarks, and full pre-registration: https://github.com/samyama-ai/cloud-egress-cache

详情
AI中文摘要

当缓存未命中从云对象存储获取数据时,计费基于每次GET请求和每字节出口流量,而非延迟。经典缓存最小化未命中率,这是错误的目标:一个很少但昂贵获取的对象可能比一个频繁但廉价获取的对象花费数千倍。广义缓存理论界定了未命中成本目标,但尚无公开基准衡量实际部署的启发式策略在真实云价格下与美元最优离线策略的差距。我们提供了该参考。对于具有异构未命中成本的统一大小页面缓存,离线美元最优可通过积分区间线性规划在多项式时间内精确求解——经暴力验证;可变大小是NP难的,因此我们将基于流的离线界从命中率目标扩展到美元(成本-FOO),误差约4%。基于此参考我们发现:(i) 异质性遗憾定律——LRU的美元遗憾随未命中成本分散度上升(Spearman 0.87),而成本感知的GreedyDual将其降至约十分之一;(ii) 竞争边界——当预算恰好覆盖昂贵工作集时,GreedyDual的残余遗憾降至接近零,否则为开放区间;(iii) 闭合形式交叉点 s* = GET费用/出口费率(S3上约4 KB,GCS上约330 B),可预测哪些部署需要成本感知缓存。在真实Twitter轨迹上,仅价格向量即可使工作负载跨越s*,按预测改变状态。该工件是一个可复现的计费忠实基准;其构建的启发式策略和界为先前工作,已致谢。

英文摘要

When a cache miss fetches from cloud object storage, the bill is per GET request and per byte of egress, not latency. Classic caching minimizes the miss rate, the wrong objective: a rarely but expensively fetched object can cost thousands of times more dollars than a frequently but cheaply fetched one. Generalized-caching theory bounds the miss-cost objective, but no reported benchmark measures how far deployed heuristics sit from the dollar-optimal offline policy on real cloud prices. We supply that reference. For uniform-size page caches with heterogeneous miss costs the offline dollar-optimum is exact in polynomial time via an integral interval linear program -- validated against brute force; variable sizes are NP-hard, so we extend the flow-based offline bound from the hit-ratio objective to dollars (cost-FOO), tight to about four percent. Against this reference we find: (i) a heterogeneity-regret law -- LRU's dollar-regret rises with miss-cost dispersion (Spearman 0.87) while cost-aware GreedyDual cuts it to roughly a tenth; (ii) a contention frontier -- GreedyDual's residual regret collapses to near zero exactly when the budget fits the expensive working set, and is the open slice otherwise; and (iii) a closed-form crossover s* = GET_fee/egress_rate (about 4 KB on S3, 330 B on GCS) that predicts which deployments need dollar-aware caching at all. On a real Twitter trace the price vector alone moves the workload across s*, shifting the regime as predicted. The artifact is a reproducible billing-faithful benchmark; heuristics and bounds it builds on are prior work, credited.

2606.20492 2026-06-19 cs.CR cs.LO 新提交

A-COMPASS: Formal Foundations for Anonymity Analysis in Microdata

A-COMPASS:微观数据匿名性分析的形式化基础

Tamara Tagliavia, Silvia Ghilezan

AI总结 本文修改COMPASS语言为A-COMPASS,使其适用于微观数据表,支持匿名条件检查与匿名化操作,并证明其语义的确定性和组合性,可用于验证k-匿名和l-多样性等属性。

详情
AI中文摘要

在信息时代,主要问题之一是如何确保个人隐私。根据考虑隐私的背景,出现了各种数据隐私模型。然而,即使对于最基本的模型,这些模型的形式化验证领域仍未得到充分探索。验证隐私需求的一种尝试是合规断言语言(COMPASS)。在COMPASS中,可以指定表需要满足的匿名条件,以及条件不满足时将修改表的操作。它设计用于对预处理后的表进行操作,形式为一条记录对应一组人。在本文中,我们修改COMPASS语言,使其以通常的一条记录对应一个人的形式对微观数据表进行操作。修改后的语言称为A-COMPASS。除了检查先前应用的匿名条件外,A-COMPASS还作为新功能支持执行匿名化操作。我们进一步提供了A-COMPASS语言的语法和语义。我们还证明了引入的语义的最重要属性,如确定性和组合性。最后,我们提供了一种验证匿名属性(如k-匿名和l-多样性)的机制。

英文摘要

In the information age, one of the leading problems is how to ensure individual's privacy. Depending on the context in which privacy is considered, various data privacy models have emerged. However, the domain of formal verification of these models is still not sufficiently explored even when it comes to the most basic models. An attempt to verify privacy requirements is the Compliance Assertion Language (COMPASS). In COMPASS, one can specify an anonymity condition that a table needs to satisfy, and an action that will modify the table if the condition is not satisfied. It is designed to operate on preprocessed tables in a form one record - one group of people. In this paper, we modify the COMPASS language in order to operate on microdata tables in their usual form of one record - one person. The modified language is called A-COMPASS. Along with checking of previously applied anonymity conditions, A-COMPASS enables the execution of anonymization actions as a new feature. We further provide the syntax and the semantics for the A-COMPASS language. We also prove the most important properties of the introduced semantics like determinism and compositionality. Finally, we provide a mechanism to verify anonymity properties, such as k-anonymity and l-diversity.

2606.20490 2026-06-19 cs.MS 新提交

Software package MaRDI Open Interfaces for improved interoperability in numerical optimization

软件包MaRDI开放接口:提升数值优化互操作性

Dmitry I. Kabanov, Stephan Rave, Mario Ohlberger

AI总结 提出MaRDI开放接口软件包,通过统一非线性优化接口减少编码与测试工作,并以物理信息神经网络求解粘性Burgers方程为例验证其互操作性。

Comments 15 pages, 1 figure, 1 table, GAMM2026

详情
AI中文摘要

为了解决计算科学中的互操作性挑战,我们介绍了软件包MaRDI Open Interfaces的最新更新。该软件包旨在减少计算科学家在编写数值求解器绑定以及将实验代码适配到同一问题类型(例如,基准测试哪个求解器更好)的不同求解器接口上所花费的时间和编码/测试工作。通过简化这些任务,该软件包帮助研究人员专注于其计算项目的实际本质。在这里,我们展示了一个最近开发的非线性优化接口,并说明了如何将其应用于优化问题的计算实验。作为此类问题的一个例子,我们考虑了训练物理信息神经网络以预测粘性Burgers方程的解。

英文摘要

To address the challenges of interoperability in computational science, we present the latest updates to the software package MaRDI Open Interfaces. This software package aims to decrease the time and coding/testing efforts spent by computational scientists on tasks such as writing bindings to numerical solvers and adapting experiment codes to the varying interfaces of solvers for the same problem type (e.g., for benchmarking, which solver is better). By streamlining these tasks, this software package helps researchers focus on the actual essence of their computational projects. Here, we demonstrate a recently developed interface for nonlinear optimization and illustrate how it can be applied for computational experiments with optimization problems. As an example of such problem, we consider training of physics-informed neural networks to predict the solutions of viscous Burgers' equation.

2606.20465 2026-06-19 cs.CY cs.SI 新提交

Farmer Connect: Improving Farmers' Access to Produce Markets

Farmer Connect:改善农民进入农产品市场的途径

Micheal Amanya, Darius Kainamura, Christine Namatovu, Lailah Kobugabe, Solomon Buwule Fortune, Adones Rukundo

AI总结 针对乌干达小农户面临的市场准入难、议价能力弱等问题,提出基于合作社的数字平台Farmer Connect,通过移动优先架构和云后端支持群体管理、市场协调和收益透明,实现约85%的用户需求。

详情
AI中文摘要

乌干达的小农户玉米种植者仍然面临有限的市场准入、薄弱的议价能力、低价格透明度以及对中间商的严重依赖。这些问题因农产品协调不善、付款延迟以及合作社交易可见性差而加剧。本文介绍了Farmer Connect,一个基于合作社的数字平台,旨在支持农民群体之间的农产品管理、市场协调和透明的收益跟踪。该系统支持四种用户角色:管理员、监督员、农民和客户。其核心功能包括农民群体管理、贡献记录和验证、市场列表、订单处理、基于先进先出的农产品分配、收益可见性、移动货币支付支持和通知服务。该平台采用移动优先架构,配备基于云的后端服务和行政网页仪表板。功能实现表明,该系统能够支持基于群体的玉米营销和合作社协调所需的主要工作流程,约85%的已识别用户需求得到实现。研究表明,以合作社为中心的数字平台可以为改善小农户的透明度、协调性和买家准入提供实用框架。

英文摘要

Smallholder maize farmers in Uganda continue to face limited market access, weak bargaining power, low price transparency, and heavy reliance on intermediaries. These challenges are compounded by poor produce coordination, delayed payments, and weak visibility into cooperative transactions. This paper presents Farmer Connect, a cooperative-based digital platform designed to support produce management, marketplace coordination, and transparent earnings tracking among farmer groups. The system supports four user roles: administrators, supervisors, farmers, and customers. Its core functions include farmer group management, contribution recording and verification, marketplace listing, order processing, First In First Out based produce allocation, earnings visibility, mobile money payment support, and notification services. The platform was implemented using a mobile-first architecture with cloud-based backend services and an administrative web dashboard. Functional implementation showed that the system was able to support the major workflows required for group-based maize marketing and cooperative coordination, with approximately 85% of identified user requirements implemented. The study shows that cooperative-centered digital platforms can provide a practical framework for improving transparency, coordination, and buyer access for smallholder farmers.

2606.20454 2026-06-19 cs.FL 新提交

Minimality of Random Moore Automata under Prefix-Dependent Congruences

随机摩尔自动机在前缀依赖同余下的极小性

Matías Carrasco, Sergio Yovine

AI总结 研究随机确定性迁移系统中前缀依赖同余的平凡性,证明在标签独立且每个标签至少有三个可接受符号时,同余高概率为平凡。

Comments 9 pages

详情
AI中文摘要

我们研究带有状态输出的随机确定性迁移系统的前缀依赖同余。在此设定下,用于比较两个状态的可接受延续可能依赖于观察到的前缀,并且只有当没有共同的可接受延续能区分它们的未来输出时,两个状态才被识别。该框架包括概率确定性有限自动机作为一个激励性的特例。我们分析随机迁移模型,其中所有迁移值是独立且均匀的。每个状态还被分配一个独立标签,该标签指定其输出及其可接受符号集。如果两个独立标签以严格小于1的概率一致,并且每个标签至少有三个可接受符号,则诱导的同余以高概率是平凡的。证明结合了配对上的剪枝过程、控制其早期演化的无碰撞探索,以及表明剩余配对无法组织成非平凡等价类的第一矩论证。

英文摘要

We study prefix-dependent congruences for random deterministic transition systems with state outputs. In this setting, the admissible continuations used to compare two states may depend on the observed prefix, and two states are identified only if no common admissible continuation distinguishes their future outputs. The framework includes probabilistic deterministic finite automata as a motivating special case. We analyze the random transition model in which all transition values are independent and uniform. Each state is also assigned an independent label that specifies both its output and its set of admissible symbols. If two independent labels agree with probability strictly less than one, and every label has at least three admissible symbols, then the induced congruence is trivial with high probability. The proof combines a pruning process on pairs, a collision-free exploration controlling its early evolution, and a first-moment argument showing that the remaining pairs cannot organize into nontrivial equivalence classes.

2606.20453 2026-06-19 cs.CY cs.HC 新提交

Directors Duties in the Age of Agentic Artificial Intelligence

代理人工智能时代的董事职责

Deirdre Ahern

AI总结 探讨董事在采纳代理AI时如何平衡股东与员工利益,分析四种公司治理模型,主张通过更广泛的法律视角促进员工福利。

Journal ref Cambridge Forum on AI: Law and Governance 2, e7 (2026)

详情
AI中文摘要

随着董事会采用包括代理AI在内的人工智能以提高运营效率,这为利润最大化提供了新机会。AI的采用越来越与员工角色替代相关联,在公司中,员工作为利益相关者的利益需要探讨。一个新颖的问题是,在AI崛起的时代,当AI在公司中的角色接近或超越人类员工时,AI是否应被赋予利益相关者地位。本文探讨了董事履行公司最佳利益职责时的四种公司目的模型:股东至上模型、开明股东价值模型、利益相关者友好模型和利益相关者价值模型,强调了董事在董事会围绕AI的决策中容纳员工利益的可用空间。结论是,鉴于董事在其最佳利益职责方面免受法律审查的程度,采取更广泛的法律视角来促进员工福利将有利于员工、董事和公司的利益。这将使董事与员工进行有意义的接触,并提供再培训机会以适应AI时代。

英文摘要

As boards engage with the adoption of Artificial Intelligence including agentic AI to drive operational efficiencies, this presents new opportunities for profit maximisation. AI adoption is increasingly identified with employee role displacement and in companies, and the interests of employees as stakeholders require exploration. A novel question posed is whether in an age of AI ascendancy AI may warrant being given stakeholder status as its role in the company approximates or eclipses that of human employees. The article probes four distinct models of corporate purpose within the duty on directors to act in the best interests of the company, the shareholder primacy model, the Enlightened Shareholder value model, the stakeholder friendly model, and the stakeholder value model, highlighting the available scope for directors to accommodate the interests of employees around AI adoption in decision-making by boards around AI. It is concluded that given the degree to which directors are insulated from legal scrutiny in relation to their best interests duty, adopting a wider law in context approach to promote employee welfare would serve the interests of employees, directors and companies alike. This would see directors engaging meaningfully with employees and providing opportunities for reskilling to adapt to the age of AI.

2606.20444 2026-06-19 cs.CR cs.SE 新提交

Image Encryption Algorithm Based on Convolutional Neural Networks and Dynamic S-Box Generation

基于卷积神经网络和动态S盒生成的图像加密算法

Ans Ibrahim, Fadhil Abbas Fadhil, Mahameed Reza Feizi Derakhshi, Maryam Mahdi Alhusseini, Nikolai Safiullin

AI总结 提出一种结合CNN与经典密码学的动态图像加密方法,通过CNN学习特征生成自适应S盒,增强非线性、唯一性和输入依赖性,提高抗攻击能力。

详情
AI中文摘要

本文提出了一种动态图像加密方法,结合卷积神经网络(CNN)和经典密码学,以提高图像加密的安全性和灵活性。主要概念是基于训练好的CNN学习到的特征创建自适应替换盒(S-box)。与传统固定S-box相比,基于CNN的S-box具有更强的非线性、唯一性和输入图像依赖性,因为它们容易受到线性攻击和差分攻击。这种动态行为增强了混淆特性,使其更能抵抗统计和结构攻击。加密算法包括基于CNN的特征提取和创建个性化S-box来替换像素。通过熵、直方图分析、相关性、NPCR和UACI对基于CNN生成的S-box进行安全评估,表明该方案比传统方案更具弹性和灵活性。

英文摘要

The paper proposes a dynamic approach to image encryption, combining the use of Convolutional Neural Networks (CNNs) and classical cryptography to improve the security and flexibility of image encryption. The main concept is to create adaptive Substitution boxes (S-boxes) based on characteristics that are learned by a trained CNN. The CNN-based S-boxes can be relied on for more non-linearity, uniqueness, and input image dependence than the conventional fixed S-boxes because they are susceptible to the linear and differential attacks. This dynamic behaviour enhances the confusion property and makes it more resistant to statistical and structural attacks. The encryption algorithm consists of CNN-based feature extraction and the creation of a personalised S-box to replace the pixels. Entropy, histogram analysis, correlation, NPCR, and UACI enable security assessment of generated S-boxes based on the CNN, indicating that the scheme is more resilient and flexible than traditional ones.

2606.20414 2026-06-19 cs.AR 新提交

ExSpike: A General Full-Event Neuromorphic Architecture for Exploiting Irregular Sparsity with Event Compression

ExSpike: 一种利用事件压缩开发不规则稀疏性的通用全事件神经形态架构

Yuehai Chen, Farhad Merchant

AI总结 提出ExSpike通用全事件神经形态架构,通过数据流优化实现纯事件驱动执行,并引入相邻位置事件压缩减少冗余累加,在FPGA上实现高能效SNN加速。

Comments Accepted by the 36th International Conference on Field-Programmable Logic and Applications (FPL 2026); 9 pages, 9 figures

详情
AI中文摘要

脉冲神经网络(SNN)因其稀疏的时空活动而有望实现节能计算。然而,有效将这种不规则稀疏性转化为实际的性能和能耗增益仍然具有挑战性,因为全事件计算架构尚未得到充分探索。本文提出ExSpike,一种通用的全事件神经形态架构,充分利用SNN中的不规则稀疏性。为了实现纯事件驱动执行,我们首先提出一组数据流优化,确保每个SNN层的输入保持基于脉冲,从而在整个网络中实现全事件执行。然后,我们设计了一种硬件高效的全事件架构,命名为ExSpike,它支持优化的纯事件驱动数据流以及用于脉冲驱动自注意力的额外注意力核心。为了进一步提高计算效率,我们引入了相邻位置事件压缩,以减少跨空间相邻脉冲序列的冗余累加。ExSpike在AMD Xilinx Virtex-7 FPGA上实现,并在分类和分割任务上进行了评估。实验结果表明,ExSpike在保持竞争性精度的同时,在多种SNN模型上实现了高归一化能效,最高可达479.15 GOPS、281.85 GOPS/W和0.80 GOPS/W/PE。特别是,ExSpike的PE归一化能效比最先进的基于FPGA的SNN加速器(FireFly-T)高出10倍。ExSpike的代码可在\url{this https URL}获取。

英文摘要

Spiking neural networks (SNNs) promise energy-efficient computing due to their sparse spatio-temporal activity. However, effectively translating such irregular sparsity into practical performance and energy gains remains challenging, as full-event computing architectures are still underexplored. This paper proposes ExSpike, a general full-event neuromorphic architecture that fully exploits irregular sparsity in SNNs. To realize pure event-driven execution, we first propose a set of dataflow optimizations to ensure that the inputs to each SNN layer remain spike-based, thereby enabling full-event execution throughout the network. We then design a hardware-efficient full-event architecture, named ExSpike, which supports the optimized pure event-driven dataflow and an additional Attention Core for spike-driven self-attention. To further improve computing efficiency, we introduce adjacent-position event compression to reduce redundant accumulations across spatially adjacent spike sequences. ExSpike is implemented on an AMD Xilinx Virtex-7 FPGA and evaluated on both classification and segmentation workloads. Experimental results show that ExSpike achieves high normalized energy efficiency across diverse SNN models while maintaining competitive accuracy, delivering up to 479.15 GOPS, 281.85 GOPS/W, and 0.80 GOPS/W/PE. In particular, ExSpike achieves up to 10$\times$ higher PE-normalized energy efficiency than the SOTA FPGA-based SNN accelerator (FireFly-T). The code for ExSpike is available at \url{https://github.com/xiaoyuehai/ExSpike}.

2606.20410 2026-06-19 cs.MS 新提交

MaRDI Open Interfaces for Interoperable Nonlinear Optimization

MaRDI 开放接口:实现可互操作的非线性优化

Dmitry I. Kabanov, Stephan Rave, Mario Ohlberger

AI总结 提出MaRDI开放接口软件包,通过统一数值问题接口和自动数据编组,提升非线性优化中不同求解器和编程语言间的互操作性,减少代码修改和测试成本。

Comments 12 pages, 1 figure, 1 table, deRSE2026

详情
AI中文摘要

MaRDI开放接口是一个旨在提高科学计算互操作性的软件包,特别是针对非线性优化。为此,该包具有两个主要特点。首先,它为典型的数值问题提供统一接口,以帮助在同一问题类型的求解器之间切换。其次,它自动处理编程语言之间的数据编组。因此,计算科学家可以通过使用该包更快地进行实验,减少代码修改和测试工作。本文描述了该软件包的总体结构,并展示了非线性优化接口的示例。

英文摘要

MaRDI Open Interfaces is a software package that aims to improve interoperability in scientific computing, particularly, for nonlinear optimization. To this end, this package holds two main characteristics. First, it provides unified interfaces for typical numerical problems to help switching between solvers for the same problem type. Second, it automates data marshalling between programming languages. Hence, computational scientists can conduct experiments faster by using the package, with fewer code-modification and testing efforts. In this work we describe the general structure of the software package and show examples with the interface for nonlinear optimization.

2606.20401 2026-06-19 eess.SY cs.SY 新提交

PowerAgentBench-Dyn: A Benchmark for Agentic AI in Power System Dynamic Studies

PowerAgentBench-Dyn:电力系统动态研究中智能体AI的基准测试

Qian Zhang, Andrea Pomarico, Costas Mylonas, Magda Foti, Alberto Berizzi, Le Xie

AI总结 提出PowerAgentBench-Dyn基准,用于评估基于LLM的智能体在电力系统动态分析任务中的能力,涵盖模型质量审查和安全风险筛选两个任务。

详情
AI中文摘要

基于大型语言模型(LLM)的智能体越来越多地被用于通过与软件工具交互、解释中间结果以及自主规划后续行动来自动化多步骤工程工作流。电力系统动态研究是这些智能体一个特别有前景但尚未充分探索的应用领域。与静态计算任务不同,动态研究通常需要更多时间进行模型参数校准、工程判断以及在受限动作空间下的决策。本文介绍了PowerAgentBench-Dyn,一个旨在评估智能体AI系统在电力系统动态分析任务上的基准测试。该基准针对那些不能简化为单一优化或编码任务的问题,而是需要经验丰富的电力系统工程师日常执行的那种推理、工具使用和迭代实验。所提出的框架包括两个初始基准任务。第一个是动态模型质量审查基准,评估智能体根据系统运营商指定的模型质量合规标准验证和诊断动态模型的能力。第二个是动态安全风险筛选基准,评估智能体利用语义记忆和有限的仿真预算从未见故障数据集中识别、排序和分析最关键短路事故,并提出和评估可能的缓解措施的能力。对于每个任务,我们定义了仿真环境、观测和动作空间以及评估指标。该基准在基于度量的意义上是可复现的:发布案例和仿真器设置定义了确定性评估器,而随机智能体行为通过重复运行使用成功率和其他指标进行评估。该基准支持未来用于电力系统运行和规划的智能体AI的开发。

英文摘要

Large Language Model (LLM)-based agents are increasingly being used to automate multi-step engineering work flows by interacting with software tools, interpreting intermediate results, and autonomously planning subsequent actions. Power system dynamic studies represent a particularly promising yet largely unexplored application domain for these agents. Unlike static computational tasks, dynamic studies often require more time on model parameter calibration, engineering judgment, and decision making under constrained action spaces. This paper introduces PowerAgentBench-Dyn, a benchmark designed to evaluate Agentic AI systems on power system dynamic-analysis tasks. The benchmark targets problems that cannot be reduced to a single optimization or coding task, but instead require a type of reasoning, tool usage, and iterative experimentation routinely performed by experienced power system engineers. The proposed framework includes two initial benchmark tasks. The first, the Dynamic Model Quality Review Benchmark, evaluates agents' ability to validate and diagnose dynamic models based on model-quality compliance criteria specified by system operators. The second, the Dynamic Security Risk Screening Benchmark, assesses agents' capability to leverage semantic memory and a limited simulation budget to identify, rank, and analyze the most critical short-circuit contingencies from an unseen fault dataset, as well as propose and evaluate possible mitigation measures. For each task, we define the simulation environment, observation and action spaces, and evaluation metrics. The benchmark is reproducible in a metric-based sense: released cases and simulator settings define a deterministic evaluator, while stochastic agent behavior is assessed over repeated runs using success rates and other metrics. The benchmark supports the development of future Agentic AI for power system operation and planning.

2606.20399 2026-06-19 cs.CC 新提交

Linked Fates: How Small of an Ambiguity Increase Can Make the Difference Between Equaling and Separating from P?

关联的命运:歧义增加多小才能区分P与等于P?

Benjamin Carleton, Michael C. Chavrimootoo, Lane A. Hemaspaandra, David E. Narváez, Conor Taliancich, Melissa Welsh

AI总结 研究NP的歧义有界版本UP_{≤f(n)}是否与P相等,通过路径毒化和填充技术,证明了某些歧义范围下P=UP_{≤f1(n)}蕴含P=UP_{≤f2(n)},并给出了其他情况下不成立的相对化结果。

详情
AI中文摘要

NP的歧义有界版本,记为$\mathrm{UP}_{\leq f(n)}$,通过$f(n)$限制非确定性多项式时间图灵机在长度为$n$的输入上接受路径的数量。这些类别从Valiant的完全无歧义($f(n)=1$)类$\mathrm{UP}$到$\mathrm{NP}$本身,其中没有界限或等价地有指数界限($f(n) = 2^{n^{O(1)}}$)。本文旨在理解这些类别中哪些在是否等于确定性多项式时间的问题上共存亡。通俗地说,哪些歧义范围具有关联的命运?即,对于满足$(\forall n)[f_1(n) \leq f_2(n)]$的非递减函数对$(f_1,f_2)$,何时有$\mathrm{P} = \mathrm{UP}_{\leq f_1(n)} \implies \mathrm{P} = \mathrm{UP}_{\leq f_2(n)}$。更具体地,哪些对鲁棒地成立,即在现实世界和所有相对化世界中成立?哪些对不鲁棒地成立,即存在一个谕示$A$使得$\mathrm{P}^A = \mathrm{UP}_{\leq f_1(n)}^A \subsetneq \mathrm{UP}_{\leq f_2(n)}^A$?先前唯一已知的正面结果是Watanabe 1988年的结果:$\mathrm{P} = \mathrm{UP}_{\leq 1} \implies (\forall k \geq 1)[\mathrm{P} = \mathrm{UP}_{\leq k}]$,该结果甚至鲁棒地成立。他的结果虽然优美,但仅适用于常数有界歧义。作为我们的正面结果,我们提出了一个适用于更高歧义水平的新情况类(定理3.8),且甚至鲁棒地适用。为了给出我们的情况类,我们利用了两种方法:一种新颖的路径毒化方法,即使在超常数歧义上也有效(定理3.5),以及填充技术的新应用(定理3.3/3.4)。作为负面结果,我们表明在几乎所有其他情况下,没有关联鲁棒地成立。

英文摘要

Ambiguity-bounded versions of $\mathrm{NP}$, denoted $\mathrm{UP}_{\leq f(n)}$, bound by $f(n)$ the number of accepting paths the nondeterministic polynomial-time Turing machine can have on inputs of length $n$. Such classes range from Valiant's completely unambiguous ($f(n)=1$) class $\mathrm{UP}$ to $\mathrm{NP}$ itself, where there is no bound or, equivalently, there is the toothless exponential bound ($f(n) = 2^{n^{O(1)}}$). This paper seeks to understand which of these classes stand and fall together as to whether they equal deterministic polynomial time. Informally put, what ranges of ambiguities have linked fates? That is, for which pairs of nondecreasing functions, $(f_1 ,f_2)$, satisfying $(\forall n)[f_1(n) \leq f_2(n)]$, does it hold that $\mathrm{P} = \mathrm{UP}_{\leq f_1(n)} \implies \mathrm{P} = \mathrm{UP}_{\leq f_2(n)}$. More particularly, for which pairs does that hold robustly, i.e., it holds in the real world and every relativized world? And for which pairs does that implication fail to hold robustly, i.e., there is an oracle $A$ such that $\mathrm{P}^A = \mathrm{UP}_{\leq f_1(n)}^A \subsetneq \mathrm{UP}_{\leq f_2(n)}^A$? The only previously known positive result is Watanabe's 1988 result that $ \mathrm{P} = \mathrm{UP}_{\leq 1} \implies (\forall k \geq 1)[\mathrm{P} = \mathrm{UP}_{\leq k}]$, which even holds robustly. His result, though lovely, applies only to constant-bounded ambiguities. As our positive result, we present a new class of cases (Theorem 3.8) that apply (and even robustly apply) at greater ambiguity levels. To give our class of cases, we leverage two approaches: a novel path-poisoning approach that works even on superconstant ambiguities (Theorem 3.5) and a new application of the power of padding (Theorems 3.3/3.4). As negative results, we show that for essentially all other cases, no linkage holds robustly.

2606.20375 2026-06-19 cs.HC cs.CY 新提交

Organizing in the Digital Age: Understanding Community, Challenges, and Consequences in Digitally-facilitated Labor Organizing

数字时代的组织:理解数字辅助劳工组织中的社区、挑战与后果

Frederick Reiber, Alishah Chator, Dana Calacci, Allison McDonald

AI总结 本研究通过17次定性访谈,分析劳工组织如何使用Discord、WhatsApp和Slack等数字平台进行组织,揭示了技术安全、信息过载和信任建立等挑战与机遇。

Comments To appear in CSCW 2026

详情
AI中文摘要

当代美国劳动力高度分散,需要使用数字通信工具来弥合工会组织中的空间和时间差距。本研究深入分析了不同工会中的工人如何利用基于文本的数字通信平台(包括Discord、WhatsApp和Slack)进行劳工组织。通过17次定性访谈,我们考察了数字组织带来的挑战和机遇,识别了技术和社会障碍。我们的研究结果表明,尽管数字工具对当代劳工成功至关重要,但它们也引入了新的复杂性,例如应对技术安全、管理信息过载以及建立信任和共识。基于这些见解,我们将数字组织与数字工具在工会中的作用联系起来,以更广泛地理解数字组织。

英文摘要

The contemporary American labor force is highly dispersed, necessitating the use of digital communication tools to bridge spatial and temporal gaps in union organizing. This study provides an in-depth analysis of how workers within various labor unions utilize digital, text-based communication platforms -- including Discord, WhatsApp, and Slack -- for labor organizing. Through 17 qualitative interviews, we examine the challenges and opportunities presented by digital organizing, identifying both technical and social obstacles. Our findings reveal that although digital tools are integral to contemporary labor successes, they also introduce new complexities, such as navigating technical security, managing information overload, and building trust and consensus. Based on these insights, we draw connections to broader understandings of digital organizing and the role of digital tools in unions.

2606.20374 2026-06-19 cs.DC 新提交

ARGUS: Production-Scale Tracing and Performance Diagnosis for over 10,000-GPU Clusters

ARGUS:面向超过10,000 GPU集群的生产级追踪与性能诊断

Jiasheng Zhou, Longbin Zeng, Clavis Chen, Ruiming Lu, Qinwei Yang, Leyi Ye, Ray Ying, Key Zhang

AI总结 提出低开销、细粒度的始终在线追踪与实时分析系统ARGUS,通过分解训练调用层次、统一数据管道和渐进式诊断框架,在超过10,000 GPU集群上实现<2%开销的持续故障检测与性能优化。

详情
AI中文摘要

大规模LLM训练需要始终在线、细粒度的可观测性以实现有效的规模性能诊断。粗粒度的资源监控器无法定位根本原因,而细粒度的分析器会产生高昂(5%-30%)的开销和海量追踪数据,使得在大型生产集群中始终在线部署不切实际。我们提出ARGUS,一个面向10,000+ GPU规模生产集群中训练工作负载的低开销、细粒度、始终在线的追踪与实时分析系统。ARGUS将沿训练调用层次的观测分解为CPU调用栈、框架语义和GPU内核执行,始终在线收集的总开销低于2%。它构建统一数据管道,将原始内核事件压缩约3,700倍,从每个rank每步10 MB降至2.7 KB。其渐进式诊断框架通过迭代时间、阶段级和内核级分析自动隔离异常窗口、落后rank和性能下降的内核。在超过10,000 GPU的生产集群上部署超过六个月,ARGUS持续支持故障慢速检测和性能优化。我们的案例研究进一步展示了其在代表性异常中的有效性,包括计算落后、链路退化、流水线气泡放大、FlashAttention JIT停滞以及被通信症状掩盖的计算落后。

英文摘要

Large-scale LLM training requires always-on, fine-grained observability for effective performance diagnosis at scale. Coarse resource monitors alone cannot localize root causes, and fine-grained profilers incur prohibitive (5%-30%) overheads and massive trace volumes, making always-on deployment impractical in large production clusters. We propose ARGUS, a low-overhead, fine-grained, always-on tracing and real-time analysis system for training workloads in 10,000+ GPU-scale production clusters. ARGUS decomposes observation along the training call hierarchy into CPU call stacks, framework semantics, and GPU kernel execution, with always-on collection under a combined overhead of less than 2%. It builds a unified data pipeline and compresses raw kernel events by approximately 3,700x from 10 MB to 2.7 KB per rank per step. Its progressive diagnosis framework automatically isolates anomalous windows, straggler ranks, and degraded kernels through iteration-time, phase-level, and kernel-level analysis. Deployed for over six months on a 10,000+ GPU production cluster, ARGUS has supported continuous fail-slow detection and performance optimization. Our case studies further demonstrate its effectiveness across representative anomalies, including compute stragglers, link degradation, pipeline-bubble amplification, FlashAttention JIT stalls, and compute stragglers masked by communication symptoms.

2606.20361 2026-06-19 eess.SY cs.SY 新提交

Sparse add-on controller design: A Youla approach to system-level performance

稀疏附加控制器设计:一种面向系统级性能的Youla方法

M. van der Hulst, N. Dirkx, R. A. González, K. Tiels, J. van de Wijdeven, T. oomen

AI总结 提出一种基于Youla参数化的稀疏附加控制器设计框架,通过凸优化求解稀疏H2综合问题,实现系统级性能与互联复杂度的最优权衡。

详情
AI中文摘要

高科技系统的性能通常由机器中运行的多个闭环控制子系统共享的少数性能目标决定,例如同步、协调和对齐,这需要明确处理这些目标以实现最优性能的控制方法。本文旨在引入一种框架,通过设计系统级控制器作为现有子系统控制结构的附加组件来提高系统性能。所开发的方法使用Youla框架参数化所有稳定的系统级附加控制器,从而能够凸形式化稀疏$\mathcal{H}_2$综合问题。结果是一个稀疏附加控制器,实现了组合性能与互联复杂度之间的最优权衡,如数值模拟所示。

英文摘要

The performance of high-tech systems is often dictated by a few performance objectives shared among the many closed-loop controlled subsystems operating in the machine, such as synchronization, coordination, and alignment, which necessitates control methods that explicitly address them to achieve optimal performance. The aim of this paper is to introduce a framework that improves system performance through system-level controllers designed to be implemented as add-ons to the existing subsystem control structure. The developed method parametrizes all stabilizing system-level add-on controllers using the Youla framework, enabling a convex formulation of the sparse $\mathcal{H}_2$ synthesis problem. The result is a sparse add-on controller that achieves the optimal trade-off between combined performance and interconnection complexity, as demonstrated through numerical simulations.

2606.20351 2026-06-19 cs.LO cs.PL 新提交

A cubical formalisation of conditional independence, Bayesian conditioning, and Pearl's d-separation soundness

条件独立性、贝叶斯条件化和Pearl的d-分离正确性的立方形式化

Karen Sargsyan

AI总结 本文在Cubical Agda中形式化概率单子,提出一种高阶归纳类型,通过引入贝叶斯公式修正标准凸代数交换公理的不足,并验证了半图oid公理、do-演算规则和d-分离定理。

详情
AI中文摘要

自Stone以来概率单子形式化中常见的标准凸代数交换公理被证明不足以支持完整的贝叶斯条件化。我们在Cubical Agda中精确地说明了这一点:有限分布作为高阶归纳类型,条件独立性作为核之间的立方路径,递归贝叶斯条件化作为全支撑片段上的全函数。将条件化提升到完整的HIT暴露了一个结构上的不匹配——重新排列的四叶混合的两半携带由贝叶斯公式关联的不同贝叶斯权重,而不是标准公理提供的单个共享内部权重。我们展示了解决这一问题的最小推广,并证明了标准形式是退化情况,其中两个内部权重重合。基于这一观察,我们在抽象有序域接口之上以构造性方式验证了代数上下文,无需任何公设:绑定交换性、四个半图oid公理、交(通过结构性Σ-见证简化为收缩,无需正性)、Pearl的do-演算规则1、2和3的核形式、有限类型贝叶斯条件化,以及任意n顶点有限有向无环图(DAG)上干预和贝叶斯形式的Pearl的d-分离定理(正确性)。概率单子也被验证为马尔可夫范畴;抽象接口在Q处实现。

英文摘要

The standard convex-algebra interchange axiom, common to probability-monad formalisations since Stone, is provably too weak to support full Bayesian conditioning. We make this precise in Cubical Agda: finite distributions as a higher inductive type, conditional independence as a cubical path between kernels, recursive Bayesian conditioning as a total function on a full-support fragment. Lifting conditioning to the full HIT exposes a structural mismatch -- the two halves of the rearranged 4-leaf mix carry distinct Bayesian weights related by Bayes' formula, not the single shared inner weight the standard axiom provides. We exhibit the minimal generalisation that resolves this and prove the standard form is the degenerate case where the two inner weights coincide. Around this observation we verify the algebraic context constructively, with zero postulates above an abstract ordered-field interface: bind commutativity, the four semi-graphoid axioms, intersection (reduced to contraction via structural $Σ$-witnesses, without positivity), Pearl's do-calculus Rules~1, 2, and~3 in kernel form, finite-type Bayesian conditioning, and Pearl's d-separation theorem (soundness) on arbitrary $n$-vertex finite directed acyclic graphs (DAGs) in both interventional and Bayesian forms. The probability monad is also verified as a Markov category; the abstract interface discharges at $Q$.

2606.20331 2026-06-19 cs.DS cs.CC 新提交

Computing Twin-Width via Treedepth and Vertex Integrity

通过树深度和顶点完整性计算双宽度

Robert Ganian, Mathis Rocton

AI总结 本文证明,当参数化为树深度时,近似双宽度是固定参数可解的;当参数化为顶点完整性时,精确计算双宽度是固定参数可解的,首次为非平凡参数化算法提供最优收缩序列。

Comments A short version of this preprint appeared at STACS 2026

详情
AI中文摘要

双宽度是一个图参数,已成为解释一阶模型检验在许多图类上固定参数可解性的核心。尽管其算法重要性,计算双宽度仍然知之甚少:甚至识别双宽度至多为4的图是NP难的,并且没有已知的以双宽度本身为参数的固定参数近似。最近突破这一障碍的方法侧重于首先开发以不同于双宽度的参数化来计算或近似双宽度的固定参数算法。我们的第一个结果表明,当以树深度为参数时,近似双宽度是固定参数可解的,从而打破了所有先前可处理的参数化都基于删除距离的长期障碍。证明通过有向双宽度进行,首次提供了该变体可能在算法上更易处理的构造性证据。作为第二个主要结果,我们表明,以顶点完整性为参数时,精确计算双宽度是固定参数可解的。这构成了计算最优收缩序列的第一个非平凡参数化算法。

英文摘要

Twin-width is a graph parameter that has become central to explaining the fixed-parameter tractability of first-order model checking across many graph classes. Despite its algorithmic importance, computing twin-width remains poorly understood: even recognizing graphs of twin-width at most four is NP-hard, and no fixed-parameter approximations parameterized by twin-width itself are known. A recent approach towards breaking this barrier focuses on first developing fixed-parameter algorithms for computing or approximating twin-width under parameterizations distinct from twin-width. Our first result establishes that approximating twin-width is fixed-parameter tractable when parameterized by treedepth, thereby breaking the long-standing barrier that all previous tractable parameterizations were based on deletion distance. The proof proceeds via oriented twin-width, yielding the first constructive evidence that this variant may be easier to handle algorithmically. As our second main result, we show that computing twin-width exactly is fixed-parameter tractable with respect to vertex integrity. This constitutes the first non-trivial parameterized algorithm for computing optimal contraction sequences.

2606.20318 2026-06-19 cs.DB 新提交

AgenticDB: Agentic Performance Reconfiguration for Database Workloads

AgenticDB: 面向数据库工作负载的代理式性能重配置

Xinyue Yang, Chaozheng Wang, Chen Zheng, Heng Zhang, Yanjun Wu

AI总结 提出AgenticDB框架,通过运行时交互实现数据库系统级和操作系统级重配置,诊断瓶颈并积累经验,在MySQL和PostgreSQL上平均性能提升118.1%。

详情
AI中文摘要

数据库配置调优对工作负载性能至关重要,但在实际部署中进行实用调优仍然困难。现有的自动调优器大多将调优视为对DBMS旋钮值的迭代搜索。这种形式导致执行成本高,过早缩小配置空间,并且未能充分解决实际需求:从系统反馈中诊断运行时瓶颈,探索操作系统级重配置机会,稳健地执行更改,以及从先前的试验和任务中学习。我们提出AgenticDB,一个用于数据库工作负载重配置的代理式框架。AgenticDB实现了一个上下文驱动的工具,通过与目标数据库环境交互,提出DBMS级和操作系统级更改,在安全约束下应用它们,观察工作负载性能和运行时状态,并使用执行反馈来指导后续决策。这种运行时交互使AgenticDB能够诊断瓶颈,探索更广泛的DBMS和操作系统级重配置空间,避免不安全或不支持的操作,并在重配置任务内部和之间积累经验。因此,AgenticDB将数据库调优转变为一种自我改进的重配置过程,其中运行时反馈迭代地改进后续决策。我们在MySQL和PostgreSQL上使用YCSB、Sysbench和TPC-H工作负载进行了广泛实验。结果表明,AgenticDB在所有评估的工作负载上实现了最佳最终性能,平均比最强基线提高118.1%,并将总到达最佳时间减少22.6%。结果还表明,其操作系统级动作空间、稳健的执行生命周期和增强记忆的规划有助于实现更有效和实用的数据库重配置。

英文摘要

Database configuration tuning is critical for workload performance, but practical tuning on real deployments remains difficult. Existing automatic tuners mostly formulate tuning as iterative search over DBMS knob values. This formulation leads to high execution cost, prematurely narrows the configuration space, and leaves practical requirements insufficiently addressed: diagnosing runtime bottlenecks from system feedback, exploring OS-level reconfiguration opportunities, executing changes robustly, and learning from previous trials and tasks. We propose AgenticDB, an agentic framework for database workload reconfiguration. AgenticDB implements a context-grounded harness that interacts with the target database environment by proposing DBMS- and OS-level changes, applying them under safety constraints, observing workload performance and runtime states, and using execution feedback to guide subsequent decisions. This runtime interaction enables AgenticDB to diagnose bottlenecks, explore a broader DBMS- and OS-level reconfiguration space, avoid unsafe or unsupported actions, and accumulate experience within and across reconfiguration tasks. As a result, AgenticDB turns database tuning into a self-refining reconfiguration process in which runtime feedback iteratively improves later decisions. We conduct extensive experiments on MySQL and PostgreSQL using YCSB, Sysbench, and TPC-H workloads. The results show that AgenticDB achieves the best final performance on all evaluated workloads, improving over the strongest baseline by 118.1% on average and reducing aggregate time-to-best by 22.6%. The results also demonstrate that its OS-level action space, robust execution lifecycle, and memory-enhanced planning contribute to more effective and practical database reconfiguration.

2606.20301 2026-06-19 eess.SY cs.SY 新提交

Data-Driven Control from Poisoned Data: Fundamental Limitations and Secure DeePC

来自中毒数据的数据驱动控制:基本局限性与安全DeePC

Takumi Shinohara, Henrik Sandberg, Karl Henrik Johansson

AI总结 针对任意数据中毒攻击,提出安全DeePC算法,通过截断输出和在线重建实现有限时间内的MPC等效性能。

详情
AI中文摘要

我们研究了存在任意数据中毒攻击时的数据驱动控制问题。假设一部分离线输出数据存储在未受保护的位置,可能被对手篡改。我们首先建立了由这种中毒数据引起的数据驱动控制的基本局限性:仅从数据集无法检测/识别中毒攻击;未受保护的数据对于具有最坏情况保证的控制器设计是非信息性的;未受保护输出的硬约束是不可认证的。受这些局限性和数据使能预测控制(DeePC)技术的启发,我们提出了安全DeePC,一种能够抵御中毒攻击的数据驱动控制算法。它首先仅使用受保护数据集运行输出截断的DeePC,直到在线输入变得持续激励。然后利用在线测量重建部分离线数据集,最后返回到全输出DeePC。安全DeePC在特定条件下几乎必然在有限时间内实现MPC等效性能。仿真结果证明了所提框架对抗中毒攻击的有效性。

英文摘要

We study a data-driven control problem in the presence of arbitrary data poisoning attacks. We assume that a subset of offline output data is stored in unprotected locations and may be poisoned by an adversary. We first establish fundamental limitations for data-driven control arising from such poisoned data: poisoning attacks are not detected/identified from the dataset alone; unprotected data are non-informative for controller design with worst-case guarantees; and hard constraints on unprotected outputs are not certifiable. Motivated by these limitations and the data-enabled predictive control (DeePC) technique, we propose Secure DeePC, a data-driven control algorithm that is resilient against poisoning attacks. It first runs output-truncated DeePC using only the protected dataset until the online input becomes persistently exciting. It then uses online measurements to reconstruct the partial offline dataset, and finally returns to full-output DeePC. Secure DeePC achieves MPC-equivalent performance in finite time almost surely under certain conditions. Simulation results illustrate the efficacy of the proposed framework against poisoning attacks.

2606.20254 2026-06-19 cs.CR 新提交

Quantization as a Malicious Task: Removing Quantization-Conditioned Backdoors via Task Arithmetic

量化作为恶意任务:通过任务算术移除量化条件后门

Kaihsun Yang, Min-Yan Tsai, Chia-Mu Yu

AI总结 提出QVec方法,通过将量化引起的权重变化视为恶意任务向量,在部署前进行参数校正,无需重训练或触发样本即可防御量化条件后门。

详情
AI中文摘要

模型量化被广泛采用,以在资源受限设备上部署深度神经网络时减少内存使用和推理成本。然而,最近的研究揭示了一种新的安全威胁,称为量化条件后门(QCBs),其中模型在全精度下行为正常,但仅在量化后激活恶意行为。现有的防御通常修改量化过程或校正激活统计,往往引入额外的计算开销或依赖特定的量化设置。在这里,我们提出QVec,一种从参数空间角度防御QCBs的方法。我们观察到,全精度模型与其量化版本之间的权重差异编码了一种结构化的行为偏移,可以解释为恶意任务向量,而非随机量化噪声。基于这一见解,QVec通过在部署前进行受控的参数校正来抵消这一恶意方向。QVec无需重新训练,无需触发样本,仅需一次量化传递来估计参数偏移,以及轻量级的超参数搜索。在图像分类基准和多个大型语言模型(LLM)攻击场景中的大量实验表明,QVec在保持干净性能的同时,持续抑制后门激活。

英文摘要

Model quantization is widely adopted to reduce memory usage and inference cost when deploying deep neural networks on resource-constrained devices. However, recent studies have revealed a new security threat known as Quantization-Conditioned Backdoors (QCBs), where a model behaves normally in full precision but activates malicious behavior only after quantization. Existing defenses typically modify quantization procedures or correct activation statistics, often introducing additional computational overhead or relying on specific quantization settings. Here, we present QVec, a parameter-space perspective for defending against QCBs. We observe that the weight difference between a full-precision model and its quantized counterpart encodes a structured behavioral shift, which can be interpreted as a malicious task vector rather than random quantization noise. Based on this insight, QVec counteracts this malicious direction through controlled parameter correction prior to deployment. QVec requires no retraining, no trigger samples, and only a single quantization pass to estimate the parameter shift, together with a lightweight hyperparameter search. Extensive experiments across image classification benchmarks and multiple Large Language Model (LLM) attack scenarios demonstrate that QVec consistently suppresses backdoor activation while preserving clean performance.

2606.20251 2026-06-19 cs.CR 新提交

TrustMix: How to Mix Messages in a Mobile Ad-hoc Network

TrustMix:如何在移动自组织网络中混合消息

Yu Shen, Aiswarya Walter, Stefanie Roos

AI总结 提出TrustMix协议,通过分组转发和洗牌实现无中心信任的匿名通信,利用可链接环签名限制速率,在随机预言机模型下证明安全性,仿真和Android实现验证了匿名性和吞吐量。

Comments Accepted at ICDCS 2026, 11 pages

详情
AI中文摘要

混合网络是实现匿名性、防御各种流量分析攻击的高效方法。然而,混合网络通常为基础设施网络设计,无法直接应用于移动自组织网络(MANET)。现有的少数MANET解决方案需要预先了解拓扑结构或依赖可信中心方。本文提出TrustMix,一种无需任何中心可信方的MANET混合协议。在TrustMix中,各方加入组,消息通过多个组转发以提供匿名性。用户只需在附近找到一个他们认为可信的方,然后将消息转发到该方的组,该方在转发到其他组之前对消息进行洗牌,从而无法将原始消息与转发消息关联。此外,即使所选方是敌对的,只有当其组内所有方都是敌对的时,他们才能破坏匿名性,因为所有方都参与洗牌。除了匿名性,TrustMix还通过可链接环签名对消息数量实施速率限制,从而能够在不泄露身份的情况下检测到各方发送超过允许数量的消息。我们在随机预言机模型下证明了协议的安全性。我们使用现有的混合网络模拟器评估其匿名性,表明TrustMix显著提高了消息匿名性。最后,我们展示了基于Android的概念验证实现,并表明TrustMix在5个移动设备上实现了可接受的吞吐量。

英文摘要

Mix networks are a highly effective way to achieve anonymity, defending against a wide range of traffic-analysis attacks. However, mix networks are usually designed for infrastructure networks and cannot be directly applied in the context of mobile ad hoc networks (MANETs). The few existing solutions for MANETs require advance knowledge of the topology or a trusted central party. In this paper, we present TrustMix, a mix protocol for MANETs that operates without any central trusted party. In TrustMix, parties join groups and then messages are forwarded via multiple groups to provide anonymity. With TrustMix, users only need to find a party nearby that they consider trusted. They then forward the message to this party's group, and the party shuffles messages before forwarding to other groups, meaning that the original message and the forwarded message cannot be linked. Furthermore, even if the chosen party is adversarial, they can only break the anonymity if all parties in their group are adversarial as all of them contribute to the shuffling. In addition to anonymity, TrustMix also enforces rate limits on the number of messages through the use of linkable ring signatures, which allows detecting that parties send more messages that allowed without revealing identities. We prove the security of our protocol in the random oracle model. We evaluate its anonymity using an existing mix-network simulator and show that TrustMix significantly improves message anonymity. Finally, we present a proof-of-concept Android implementation and show that TrustMix achieves acceptable throughput with 5 mobile devices.

2606.20243 2026-06-19 cs.SE cs.MA 新提交

Phoenix: Safe GitHub Issue Resolution via Multi-Agent LLMs

Phoenix: 通过多智能体LLM实现安全的GitHub问题解决

Kipngeno Koech, Muhammad Adam, Baimam Boukar Jean Jacques, Joao Barros

AI总结 提出多智能体LLM系统Phoenix,通过六个专业智能体和七层安全控制,在SWE-bench Lite子集上达到75%的解决率,并在真实问题中保持100%正确性。

详情
AI中文摘要

我们提出Phoenix,一个多智能体LLM系统,能够从分类到拉取请求创建解决GitHub问题,结合了七层安全控制与基线感知测试评估策略。Phoenix将工作分解给六个专业智能体:规划器、复现器、编码器、测试器、故障分析器和拉取请求(PR)智能体,所有智能体由基于标签的GitHub webhook状态机协调。在打开拉取请求之前,每次更改都会与基线测试运行进行对比。在SWE-bench Lite的24个实例子集上,在生产webhook路径上运行,Phoenix oracle解决了75%的实例,且成功运行中没有出现通过到通过的回归;这个精心挑选的子集不能直接与完整分割排行榜结果比较,我们讨论了比较的局限性。在14个仓库的42个真实问题上的补充试点实现了100%的正确性保持(CP;硬级别平均122秒)。人工检查显示,大约一半的拉取请求是定位良好的修复。另一半将代码放置在错误路径上,这是规划器定位的局限性,我们正在通过检索来解决。我们还报告了部署失败模式(WAF过滤、令牌过期、权限边界、不稳定的CI),这些模式促使了每种安全机制的引入。

英文摘要

We present Phoenix, a multi-agent LLM system that resolves GitHub issues from triage through pull-request creation, combining seven layered safety controls with a baseline-aware test evaluation strategy. Phoenix decomposes the work across six specialized agents. Planner, reproducer, coder, tester, failure analyst and Pull Request (PR) agent, all coordinated by a label-based GitHub webhook state machine. Every change is checked against a baseline test run before a pull request is opened. On a 24-instance slice of SWE-bench Lite. run on the production webhook path, Phoenix oracle-resolves 75% of instances with no pass-to-pass regressions on successful runs; this curated slice is not directly comparable to full-split leaderboard results, and we discuss the limits of the comparison. A complementary pilot on 42 real issues across 14 repositories yields 100% correctness preservation (CP; mean 122s on the hard tier). Manual inspection shows that about half of the resulting pull requests are well-targeted fixes. The other half place code at incorrect paths, a planner localization limitation we are addressing with retrieval. We also report the deployment failure modes (WAF filtering, token expiry, permission boundaries, flaky CI) that motivated each safety mechanism.

2606.20230 2026-06-19 cs.SE 新提交

SysML Modeling of Digital Twins for Renewable Energy Communities

可再生能源社区数字孪生的SysML建模

Mohammad Samadi, Luís Miguel Pinho, Andrey Sadovykh, Gabriela Lucas

AI总结 针对可再生能源社区数字孪生工程中的异构性挑战,提出基于SysML的MBSE工作流,通过设备分类和社区组织视图建模,并引入SAREF4ENER本体弥补语义鸿沟。

Comments Presented at the Workshop on Digital Twin Experiences and Model-Based Testing Methods, 12 June 2026, Västerås, Sweden, co-located with the 30th Ada-Europe International Conference on Reliable Software Technologies (AEiC 2026)

详情
AI中文摘要

可再生能源社区(REC)正成为本地和全球共享可再生能源发电、存储和灵活负载的关键组织模型。由于涉及设备、合同和运行时数据的异构性,REC数字孪生的工程变得困难。在本文中,我们朝着REC数字孪生的基于模型的系统工程(MBSE)工作流迈出了第一步。从经过工业验证的REC领域模型出发,我们使用开源Modelio工具在SysML中重新表达了一个代表性的房屋子集,生成了两个块定义图——一个设备分类和一个社区组织视图。然后,我们讨论了普通SysML留下的四个语义鸿沟,并概述了如何将SAREF4ENER本体作为参考包导入以弥合这些鸿沟。将SysML与基于SAREF的智能能源数字孪生语义相结合在很大程度上仍未探索,我们将本文定位为沿着这条线的第一步。

英文摘要

Renewable Energy Communities (RECs) are emerging as a key organizational model for local and global sharing of renewable generation, storage, and flexible loads. Engineering Digital Twins of RECs is made difficult by the heterogeneity of devices, contracts, and runtime data involved. In this paper, we take a first step toward a Model-Based Systems Engineering (MBSE) workflow for REC's Digital Twins. Starting from an industrially-validated REC domain model, we re-express a representative house subset in SysML using the open-source Modelio tool, yielding two Block Definition Diagrams - a device taxonomy and a community organizational view. We then discuss four semantic gaps that plain SysML leaves open and sketch how the SAREF4ENER ontology could be imported as a reference package to close them. Combining SysML with SAREF-based semantics for smart-energy Digital Twins remains largely unexplored, and we position this paper as a first step along that line.

2606.20215 2026-06-19 cs.CR 新提交

GNSS Spoofing Threat for V2X communications

GNSS欺骗对V2X通信的威胁

Adolfo P. Jimenez, Juan Arquero-Gallego, Mario P. Luna, Jose E. Naranjo, Felipe Jimenez Alonso

AI总结 本文提出利用廉价软件定义无线电(SDR)对V2X通信实施GNSS欺骗攻击的方法,并在真实设备上验证了攻击效果,揭示了V2X通信易受欺骗且难以检测的安全漏洞。

Comments 2026 IEEE\@. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

详情
AI中文摘要

全球导航卫星系统(GNSS)是车联网(V2X)领域提供关键定位、导航和授时(PNT)服务的核心技术,对于生成维护网络可靠性和车辆安全性的协作感知消息(CAM)不可或缺。然而,GNSS信号极易受到欺骗攻击,这是一种高级攻击,攻击者发送模拟合法卫星特征的精心构造信号,误导接收器计算出错误位置。本文提出了一种使用廉价软件定义无线电(SDR)进行物理欺骗的方法,描述了一个坐标生成流水线,该流水线采用基于Haversine的距离计算、时间离散化以模拟恒定速度,以及线性插值来生成高保真GPS基带信号。所提出的攻击在真实的Commsignia车载单元(OBU)和路侧单元(RSU)设备上,使用HackRF One在三种场景下进行了实验验证,这些场景模拟了90 km/h、145 km/h和200 km/h稳定速度下的合成轨迹。本文最重要的贡献是证明了V2X通信并不安全,因为它们容易受到GNSS欺骗攻击,导致服务降级而未被检测到。

英文摘要

Global Navigation Satellite Systems (GNSS) constitute a core technology for delivering crucial positioning, navigation, and timing (PNT) services in the Vehicle-to-Everything (V2X) domain, where they are indispensable for generating Cooperative Awareness Messages (CAM) that uphold network reliability and vehicular safety. Yet, GNSS signals are acutely exposed to spoofing, an advanced attack in which an adversary transmits crafted signals that replicate legitimate satellite characteristics, misleading the receiver into computing a false position. This work presents a methodology for conducting physical spoofing with inexpensive Software Defined Radio (SDR), describing a coordinate generation pipeline that employs Haversine-based distance calculations, temporal discretization to emulate constant velocity, and linear interpolation to produce high-fidelity GPS baseband signals. The proposed attack is experimentally validated on real Commsignia OnBoard Unit (OBU) and RoadSide Unit (RSU) devices using a HackRF One across three scenarios that emulate synthetic trajectories at steady speeds of 90 km/h, 145 km/h, and 200 km/h. The most significant contribution of this paper is the demonstration that V2X communications are not secured, as they are susceptible to GNSS spoofing attacks, which cause service degradation without being detected.

2606.20214 2026-06-19 cs.CR 新提交

Accelerating Trust Convergence in IIoT: A ML Approach for Dynamic Network Conditions

加速工业物联网中的信任收敛:一种针对动态网络条件的机器学习方法

Aymen Bouferroum, Valeria Loscri, Abderrahim Benslimane

AI总结 针对工业物联网中网络质量波动导致信任收敛慢的问题,提出基于机器学习的信任收敛加速方法,通过预测收敛时间并动态调整转移概率,在挑战性条件下将收敛时间减少28.6%,并提升恶意节点场景下的评估准确性。

Comments Symposium: Communication \& Information Systems Security (CISS)

Journal ref IEEE Global Communications Conference (GLOBECOM) 2025, Dec 2025, Taipei, Taiwan. pp.4427-4432

详情
AI中文摘要

在工业物联网(IIoT)环境中,信任管理在保障系统安全方面起着至关重要的作用,尤其是在处理资源受限设备时。传统的信任模型往往忽视了网络质量波动的影响,导致信任收敛速度慢且评估不准确。在本文中,我们提出了一种动态信任管理解决方案,称为信任收敛加速(TCA)方法,该方法集成了机器学习(ML)以在恶劣网络条件下加速信任收敛。我们的模型基于关键网络指标预测信任收敛所需的时间单位,并动态调整信任模型中的转移概率以提高收敛速度。通过使用基于IEEE 802.11标准的模拟框架,该框架包含真实的Wi-Fi信道条件,我们展示了基于TCA方法的有效性,在挑战性条件下实现了高达28.6%的信任收敛时间减少。此外,所提出的解决方案在涉及恶意节点的场景中表现出韧性,提高了信任评估的准确性。这项工作为动态工业环境中的IIoT系统提供了一个可扩展且自适应的信任框架,确保了在不同网络条件下的稳健性能。

英文摘要

In Industrial Internet of Things (IIoT) environments, trust management plays a vital role in securing systems, especially when dealing with resource-constrained devices. Traditional trust models often overlook the impact of fluctuating network quality, leading to slower trust convergence and inaccurate assessments. In this paper, we propose a dynamic trust management solution, known as the Trust Convergence Acceleration (TCA) approach, which integrates Machine Learning (ML) to accelerate trust convergence under poor network conditions. Our model predicts the number of time units needed for trust convergence based on key network metrics and dynamically adapts transition probabilities in the trust model to enhance convergence speed. Using a simulation framework that incorporates realistic Wi-Fi channel conditions based on the IEEE 802.11 standard, we demonstrate the effectiveness of the TCA-based approach, achieving up to a 28.6% reduction in trust convergence time under challenging conditions. Furthermore, the proposed solution exhibits resilience in scenarios involving malicious nodes, improving trust evaluation accuracy. This work provides a scalable and adaptive trust framework for IIoT systems in dynamic industrial environments, ensuring robust performance under varying network conditions.

2606.20202 2026-06-19 cs.DS 新提交

Tight Algorithm and Hardness for Submodular Linear Ordering

子模线性排序的紧致算法与难度

Evan Abboud, Roy Schwartz

AI总结 针对一般子模函数的最小线性排序问题,提出多项式时间O(√(n/ln n))近似算法,并证明信息论下界匹配,任何多项式时间算法无法达到o(√(n/ln n))近似比。

Comments 25 pages. Accepted to the 53rd International Colloquium on Automata, Languages, and Programming (ICALP 2026)

详情
AI中文摘要

我们考虑最小线性排序问题:给定基数为$n$的集合$N$和非负集函数$f\colon 2^N\rightarrow \mathbb{R}_{\geq 0}$,目标是找到$N$的一个排列$\pi$,使得$\pi$的所有前缀上$f$值的和最小。该问题已被研究用于各种集函数类,其中子模$f$的情况特别受关注,因为它涵盖了经典问题,包括最小线性排列和最小包含区间图。在这项工作中,我们通过建立匹配的上界和下界,解决了一般子模$f$的最小线性排序问题的近似性,并给出:$(1)$一个多项式时间算法,实现$O(\sqrt{n/\ln n})$-近似;以及$(2)$一个匹配的信息论难度结果,表明任何对$f$进行多项式次数求值的算法都无法实现$o(\sqrt{n/\ln n})$-近似。此前,已知的最佳近似难度为$2$,而$O(\sqrt{n/\ln n})$-近似仅对$f$既是子模又是对称的特殊情况已知。

英文摘要

We consider the Minimum Linear Ordering Problem: given a ground set $N$ of cardinality $n$ and a non-negative set function $f\colon 2^N\rightarrow \mathbb{R}_{\geq 0}$, the goal is to find an ordering $π$ of $N$ that minimizes the sum of the values of $f$ over all prefixes of $π$. This problem has been studied for various classes of set functions, and the case of a submodular $f$ is of special interest, as it captures classic problems including Minimum Linear Arrangement and Minimum Containing Interval Graph. In this work, we resolve the approximability of the Minimum Linear Ordering Problem for a general submodular $f$ by establishing matching upper and lower bounds and present: $(1)$ a polynomial-time algorithm achieving an $O(\sqrt{n/\ln n})$-approximation; and $(2)$ a matching information-theoretic hardness result, showing that no algorithm evaluating $f$ a polynomial number of times can achieve an $o(\sqrt{n/\ln n})$-approximation. Previously, the best known hardness of approximation was $2$, and an $O(\sqrt{n/\ln n})$-approximation was known only for the special case where $f$ is both submodular and symmetric.