arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.07393 2026-06-08 cs.SE 新提交

Is US Defense Acquisition Ready to Acquire AI-Enabled Capabilities? Assessing the DoD Software Acquisition Pathway Through a Scenario-Based Policy Analysis

美国国防采办是否准备好采办人工智能赋能能力？通过基于场景的政策分析评估国防部软件采办路径

Daniel Lugo, James C. Davis

AI总结通过基于场景的政策分析，评估美国国防部软件采办路径是否足以应对AI采办的独特需求，发现核心指南中存在可操作性不足，建议增设AI支持子路径并完善工件。

Comments Submitted to ACM Digital Government: Research and Practice Journal on April 2026

详情

AI中文摘要

随着AI系统从实验原型过渡到关键任务工具，它们对动态数据、演化模型和治理的依赖引发了对现有采办路径能否跟上步伐的质疑。美国国防部通过适应性采办框架对其采办流程进行了现代化改造，其中软件采办路径（SWP）是采办软件密集型能力的主要机制。本文评估SWP是否足以应对AI采办的独特需求。我们进行了一项基于场景的评估，通过SWP的关键规划活动追踪一个假设的AI赋能项目，以评估政策如何转化为项目工件和决策。我们使用政策场景分析来检验以SWP为中心的治理堆栈是否为AI采办提供了足够的可操作支持。该治理堆栈为迭代交付和AI测试提供了可行基础。然而，我们在核心指南中发现了一个反复出现的可操作性问题。针对数据溯源、生命周期管理和人工监督的AI特定控制措施仍然分布在补充文件中，而不是嵌入到执行SWP的项目面对机制中。这种脱节使得项目办公室依赖于不一致的本地解释。最后，我们建议增设一个AI支持子路径并针对性地完善工件，以更好地弥合这种政策到工件的差距。

英文摘要

As AI systems transition from experimental prototypes to mission-critical tools, their dependence on dynamic data, evolving models, and governance raises questions about whether existing acquisition pathways can keep pace. The U.S. Department of Defense has modernized its acquisition processes through the Adaptive Acquisition Framework, with the Software Acquisition Pathway (SWP) serving as the primary mechanism for acquiring software-intensive capabilities. This paper evaluates whether SWP is sufficient to address the unique demands of AI acquisition. In this work, we perform a scenario-based evaluation that traces a notional AI-enabled program through key SWP planning activities to assess how policy translates into program artifacts and decisions. We use Policy Scenario Analysis to examine whether the SWP-centered governance stack provides sufficient actionable support for AI acquisition. The governance stack provides a viable foundation for iterative delivery and AI testing. However, we identify a recurring actionability problem in the core guidance. AI-specific controls for data provenance, lifecycle management, and human oversight remain distributed across supplemental documents rather than embedded in the program-facing mechanisms through which SWP is executed. This disconnect leaves program offices reliant on inconsistent local interpretation. We conclude by recommending an AI-supporting sub-path and targeted artifact refinements to better bridge this policy-to-artifact gap.

URL PDF HTML ☆

赞 0 踩 0

2606.07375 2026-06-08 eess.SY cs.CR cs.SY 新提交

An End-to-End Encrypted Control Pipeline for Multi-Agent Coordination via CKKS Homomorphic Encryption

基于CKKS同态加密的多智能体协同端到端加密控制管道

Sai Sandeep Damera, Maria Charitidou, Asim Zoulkarni, John S. Baras

AI总结针对云端多智能体协同中的隐私冲突，提出端到端加密控制管道，所有环节在CKKS加密数据上仅用加、乘和循环旋转操作，通过稳态卡尔曼增益和对角线法实现图拉普拉斯，推导周期性自举界以量化加密噪声影响，并在编队控制中验证。

Comments 8 pages, 4 figures. This work has been submitted to the IEEE for possible publication

详情

AI中文摘要

基于云的多智能体系统协同需要与中央服务器共享状态，这在协同与隐私之间产生了冲突。全同态加密（FHE）原则上解决了这一问题，但其严格的算术约束要求控制循环的每个阶段都从头重新设计。我们提出了一种端到端加密控制管道，其中感知、状态估计、状态传播和共识控制均在CKKS加密数据上仅使用加法、乘法和循环旋转操作。为了克服FHE的计算挑战，我们采用稳态卡尔曼增益而非在线求解矩阵，并通过对角线法以与非零循环对角线数量成比例的成本应用图拉普拉斯，从而在统一框架内适应环、环面和完全图拓扑。为了量化加密噪声的累积效应，我们利用分离定理解耦控制器和观测器误差动态，并推导出周期性自举界，其中CKKS自举作为脉冲扰动；由此产生的稳态误差球取决于自举精度和闭环谱半径，为隐私-精度权衡提供了直接的设计方程。该管道在多智能体编队控制场景中得到验证，确认了加密下闭环运行的稳定性及有界跟踪误差。

英文摘要

Cloud-based coordination of multi-agent systems requires sharing state with a central server, creating a conflict between coordination and privacy. Fully homomorphic encryption (FHE) resolves this in principle, but its severe arithmetic constraints demand that every stage of the control loop be redesigned from first principles. We present an end-to-end encrypted control pipeline in which sensing, state estimation, state propagation, and consensus control all operate on CKKS-encrypted data using only addition, multiplication, and cyclic rotation. In order to overcome the computational challenges of FHE, we employ steady-state Kalman gains instead of solving for the matrices online and graph Laplacians are applied via the diagonal method at a cost proportional to the number of nonzero cyclic diagonals, accommodating ring, torus, and complete-graph topologies within a unified framework. To quantify the cumulative effect of encryption noise, we use the separation principle to decouple controller and observer error dynamics and derive a periodic bootstrapping bound in which CKKS bootstrapping acts as an impulsive disturbance; the resulting steady-state error ball depends on the bootstrapping precision and the closed-loop spectral radius, providing a direct design equation for the privacy-accuracy tradeoff. The pipeline is validated on a multi-agent formation control scenario, confirming stable closed-loop operation under encryption with bounded tracking error.

URL PDF HTML ☆

赞 0 踩 0

2606.07363 2026-06-08 cs.CR cs.SE 新提交

On the Shoulders of Giants: Empowering Automated Smart Contract Auditing via the GiAnt Corpus

站在巨人的肩膀上：通过GiAnt语料库赋能自动化智能合约审计

Xiaoting Zhang, Zhipeng Gao, Yiran Lv, Xing Hu, Feifei Niu, Xin Xia

AI总结提出自动化框架GiANT，从真实审计报告中提取漏洞信息构建高质量数据集GiAnt Corpus，包含7711个漏洞发现，验证了其在漏洞检测等任务中的实用性。

详情

AI中文摘要

高质量的智能合约审计数据集对于评估安全工具和推进智能合约安全研究至关重要。现有数据集的两个主要局限是手动引发的可扩展性瓶颈以及数据粒度和多样性的不足。为解决这些局限，我们提出GiANT，一个自动化框架，通过从真实世界审计报告中提炼漏洞见解来策划智能合约审计数据集。GiANT采用分治策略结合思维链技术从Code4rena报告中提取结构化漏洞信息，随后通过LLM-as-a-judge机制进行严格的质量保证。为评估GiANT的有效性，我们在388份真实审计报告上运行它，生成了包含跨五个严重级别的7711个漏洞发现的GiAnt语料库。数据集的人工评估显示出卓越的信息提取可靠性，平均质量得分为$4.76\pm0.37$（满分5分），评分者间一致性$\kappa$为0.88。我们进一步通过在漏洞检测、代码摘要、缓解建议和自动gas优化任务上对4个最先进的LLM进行基准测试，验证了数据集的实用性，建立了性能基线，从而为自动化智能合约审计的未来研究提供了宝贵的数据基础。

英文摘要

High-quality smart contract auditing datasets are crucial for evaluating security tools and advancing smart contract security research. Two major limitations of existing datasets are the manual-induced scalability bottleneck and the deficiency in data granularity and diversity. To address these limitations, we propose GiANT, an automated framework designed to curate smart contract auditing datasets by distilling vulnerability insights from real-world auditing reports. GiANT employs a divide-and-conquer strategy coupled with the Chain-of-Thought technique to extract structured vulnerability information from Code4rena reports, followed by an LLM-as-a-judge mechanism to perform rigorous quality assurance. To evaluate GiANT's effectiveness, we run it on 388 real-world audit reports and generate the GiAnt Corpus comprising 7,711 vulnerability findings across five severity levels. Manual assessment of the dataset demonstrates exceptional reliability in information extraction, achieving a mean quality score of $4.76\pm0.37$ (out of 5) with inter-rater agreement $κ$ of 0.88. We further validate the practicality of our dataset by benchmarking 4 state-of-the-art LLMs on vulnerability detection, code summarization, mitigation recommendation, and automated gas optimization tasks, to establish performance baselines, thereby providing a valuable data foundation for future research in automated smart contract auditing.

URL PDF HTML ☆

赞 0 踩 0

2606.07348 2026-06-08 cs.LO math.LO 新提交

Four intuitionistic modal connectives

四个直觉主义模态连接词

Philippe Balbiani, Çigdem Gencer

AI总结本文引入基于Prenosil和Wijesekera两种风格的菱形及其对偶方框的直觉主义模态逻辑，分析框架类的模态可定义性、完全公理化，并证明最小直觉主义模态逻辑的可判定性。

2606.07341 2026-06-08 cs.CR 新提交

Empirical Evaluation of Large Language Models for Migration of Code Fragments to Post-Quantum Cryptography

大型语言模型在代码片段向后量子密码迁移中的实证评估

Javier Pallarés de Bonrostro, Ana I. González-Tablas, María Isabel González Vasco

AI总结评估大型语言模型在将经典密码代码片段迁移至后量子密码中的能力，通过微调GPT-4.1-mini实现92.5%的功能正确率，优于零样本基线。

详情

AI中文摘要

向后量子密码（PQC）的过渡不仅需要替换易受攻击的密码原语，还需要重构周围的软件逻辑。虽然现有的PQC迁移框架提供了组织层面的指导，但实际的代码级修复仍然主要依赖人工且容易出错。本文评估了大型语言模型（LLM）是否可以被训练来协助将前量子密码代码片段迁移到后量子对应物，同时保持功能正确性。为此，我们引入了一个可重复的实验框架，该框架基于一个包含800个配对Python代码片段的合成数据集，涵盖六个密码家族和组合多原语案例。每个配对通过类别特定的功能测试进行验证，从而实现了数据集质量控制和模型生成迁移的客观评估。评估了四个模型：零样本设置下的GPT-4.1，以及微调版本的GPT-3.5-turbo、GPT-4.1-mini和CodeLlama-7B-Instruct。结果表明，领域特定的微调对于可靠的密码迁移至关重要。微调后的GPT-4.1-mini模型实现了最佳整体性能，平均静态相似度为0.9072，动态功能正确率为92.5%，显著优于零样本基线。对六个开源仓库的补充验证进一步表明，该方法可以在局部密码模块中产生有用的迁移，同时也揭示了在具有复杂依赖关系和跨模块交互的大型项目中的局限性。这些发现表明，微调后的LLM可以作为未来密码敏捷迁移管道中的实用组件，前提是它们与自动化验证和依赖感知验证相结合。

英文摘要

The transition to post-quantum cryptography (PQC) requires not only replacing vulnerable cryptographic primitives, but also refactoring the surrounding software logic. While existing PQC migration frameworks provide organizational guidance, practical code-level remediation remains largely manual and error-prone. This paper evaluates whether large language models (LLMs) can be trained to assist in the migration of pre-quantum cryptographic code fragments to post-quantum counterparts while preserving functional correctness. To this end, we introduce a reproducible experimental framework built around a synthetic dataset of 800 paired Python code fragments covering six cryptographic families and combined multi-primitive cases. Each pair is validated through category-specific functional tests, enabling both dataset quality control and objective evaluation of model-generated migrations. Four models are assessed: GPT-4.1 in a zero-shot setting, and fine-tuned versions of GPT-3.5-turbo, GPT-4.1-mini, and CodeLlama-7B-Instruct. The results show that domain-specific fine-tuning is essential for reliable cryptographic migration. The fine-tuned GPT-4.1-mini model achieves the best overall performance, with a mean static similarity of 0.9072 and a dynamic functional correctness rate of 92.5%, substantially outperforming the zero-shot baseline. A complementary validation on six open-source repositories further shows that the approach can produce useful migrations in localized cryptographic modules, while also revealing limitations in larger projects with complex dependencies and cross-module interactions. These findings suggest that fine-tuned LLMs can serve as practical components in future crypto-agile migration pipelines, provided they are coupled with automated verification and dependency-aware validation.

URL PDF HTML ☆

赞 0 踩 0

2606.07337 2026-06-08 cs.GR 新提交

Skeletal-Anchored Dual Harmonics for Structured 3D Modeling

骨骼锚定双谐波用于结构化三维建模

Zhentao Huang, Changhao Li, Ruizhen Hu, Hui Huang, Minglun Gong

AI总结提出骨骼锚定双谐波（SADH）表示，通过内部锚点上的表面补丁和双通道球谐函数，联合优化表面几何与中轴骨骼结构，实现紧凑且连贯的三维形状建模。

Comments 11 pages

详情

AI中文摘要

我们提出骨骼锚定双谐波（SADH），一种新颖的三维形状表示，它将局部表面几何与内部中轴骨骼组织紧密耦合。SADH 将形状表示为附着在内部锚点上的紧凑表面补丁集合，这些锚点直接在物体体积内部优化。每个补丁使用双通道球谐函数（SH）公式参数化，其中一个通道建模局部径向几何，另一个通过广义视锥定义自适应补丁支持。与各向同性基元（如中轴球体或高斯核）不同，SH 补丁直接编码各向异性的局部表面几何以及自适应空间支持，从而能够紧凑表示细节丰富且方向变化的表面区域。从无组织点云开始，SADH 通过分阶段优化过程联合优化表面几何、锚点位置、补丁方向和结构连通性，逐步形成连贯的中轴骨骼结构。测地锚点图进一步保持相邻补丁之间的结构关系。在复杂三维形状上的实验表明，SADH 在广泛几何形状上实现了精确的表面重建以及紧凑且连贯的骨骼组织。

英文摘要

We present Skeletal-Anchored Dual Harmonics (SADH), a novel 3D shape representation that tightly couples local surface geometry with internal meso-skeletal organization. SADH represents a shape as a collection of compact surface patches rooted on internal anchors optimized directly inside the object volume. Each patch is parameterized using a dual-channel spherical harmonic (SH) formulation, where one channel models local radial geometry while the other defines adaptive patch support through a generalized viewing cone. Unlike isotropic primitives such as medial spheres or Gaussian kernels, SH patches directly encode anisotropic local surface geometry together with adaptive spatial support, enabling compact representation of detailed and directionally varying surface regions. Starting from unorganized point clouds, SADH jointly optimizes surface geometry, anchor locations, patch orientations, and structural connectivity through a staged optimization process that progressively forms a coherent meso-skeletal structure. A geodesic anchor graph further preserves structural relationships between neighboring patches. Experiments on complex 3D shapes demonstrate that SADH achieves accurate surface reconstruction together with compact and coherent skeletal organization across a wide range of geometries.

URL PDF HTML ☆

赞 0 踩 0

2606.07335 2026-06-08 cs.CR 新提交

Defending Jailbreak Attacks on Large Language Models via Manifold Trajectory Kinetics

通过流形轨迹动力学防御大语言模型的越狱攻击

Hangtao Zhang, Yucheng Zhao, Sishun Liu, Ziqi Zhou, Zeyu Ye, Wei Wan, Minghui Li, Shengshan Hu, Yanjun Zhang, Yi Liu, Leo Yu Zhang

AI总结提出流形轨迹动力学（MTK）方法，通过分析提示词在模型层间的邻域结构演化来检测越狱攻击，在伪恶意提示和自适应攻击下均表现鲁棒。

Comments Accepted to USENIX Security '26 Cycle 2. Code is available at https://github.com/Rookie143/mtk

详情

AI中文摘要

越狱提示可以绕过大型语言模型（LLM）中的对齐护栏并引发不安全输出，使得可靠的部署时检测至关重要。先前的检测方法主要依赖于固定的度量空间，例如原始输入、梯度或隐藏特征，其中良性提示和越狱提示是线性可分的。我们证明这一假设在以下情况下失效：（i）伪恶意提示，其意图良性但包含安全相关关键词，以及（ii）明确针对部署检测器进行优化的自适应攻击。为克服这一限制，我们将关注点从识别通用度量空间转向分析底层数据流形更鲁棒的邻域结构。我们提出流形轨迹动力学（MTK），将LLM视为一个将输入转换为输出的动力学系统，并通过跟踪提示的邻域结构在层间的演化来检测越狱。良性提示在整个推理过程中保持接近良性邻域，而越狱提示则表现出特征轨迹：从接近恶意种子开始，随后策略性地向良性邻域移动以逃避检测。在四个LLM和十种越狱攻击下，MTK对两种失败模式均表现出强鲁棒性：在伪恶意提示上，它在良性提示上达到5%的假阳性率，在伪恶意提示上达到2%的假阳性率，同时越狱真阳性率为95%；在自适应攻击下，它保持85%的真阳性率。我们进一步展示了MTK在视觉语言模型中进行越狱检测的优越性能。我们的代码可在以下网址获取：https://this https URL。

英文摘要

Jailbreak prompts can bypass alignment guardrails in large language models (LLMs) and elicit unsafe outputs, making reliable deployment-time detection critical. Prior detection approaches largely rely on a fixed metric space, e.g., raw inputs, gradients, or hidden features, in which benign and jailbreak prompts are linearly separable. We show this assumption breaks under (i) pseudo-malicious prompts that are benign by intent but contain safety-related keywords, and (ii) adaptive attacks that explicitly optimize against the deployed detector. To overcome this limitation, we shift our focus from identifying a universal metric space to analyzing the more robust neighborhood structure of the underlying data manifold. We present Manifold Trajectory Kinetics (MTK), which treats an LLM as a kinetic system transforming inputs into outputs and detects jailbreaks by tracking how a prompt's neighborhood structure evolves across layers. Benign prompts remain close to benign neighborhoods throughout inference, whereas jailbreak prompts exhibit a characteristic trajectory that begins near malicious seeds and later strategically shifts toward benign neighborhoods to evade refusal.Across four LLMs and ten jailbreak attacks, MTK achieves strong robustness to both failure modes: on pseudo-malicious prompts, it attains a jailbreak true positive rate of 95% at a false positive rate of 5% on benign prompts and 2% on pseudo-malicious prompts, and under adaptive attacks, it maintains a true positive rate of 85%. We further demonstrate the superior performance of MTK for jailbreak detection in vision-language models. Our code is available at https://github.com/Rookie143/mtk.

URL PDF HTML ☆

赞 0 踩 0

2606.07332 2026-06-08 cs.DL 新提交

The disruption index does not measure scientific innovation

颠覆性指数不能衡量科学创新

Julien Larregue, Yves Gingras

AI总结本文质疑《科学》杂志论文中提出的颠覆性指数，指出该指数基于直觉而非严谨验证，不能有效衡量科学创新，并警告其用于政策制定的风险。

Comments 16 pages, 5 figures

2606.07314 2026-06-08 cs.SE cs.ET quant-ph 新提交

QBugLM: An Agentic Benchmarking Framework for LLM-based Quantum Software Debugging

QBugLM：基于LLM的量子软件调试的智能基准测试框架

An B. B. Pham, Hoa T. Nguyen, Muhammad Usman

AI总结提出QBugLM多智能体框架，自动化量子软件调试流程，通过案例研究评估LLM调试能力，发现迭代反馈显著提升修复成功率。

Comments This paper was accepted at IEEE QSW 2026

详情

AI中文摘要

量子软件缺陷通常产生静默的错误输出而非显式错误，这使得它们难以用传统技术检测和修复。尽管大型语言模型（LLM）在经典软件工程任务中表现出色，但其调试量子代码的能力仍未被充分探索。为填补这一空白，我们提出QBugLM，一个多智能体框架，自动化量子软件调试流程，从基于分类学的缺陷注入到基于LLM的检测和修复，最终到基于模拟的验证，适用于框架无关的OpenQASM 3.0程序。我们进一步使用QBugLM进行全面的案例研究，评估两个LLM（Claude 4.6 Sonnet和Qwen3 Coder Next）在不同提示策略、缺陷类别和量子程序上的表现。结果表明，迭代反馈至关重要，单次重试将Pass@1从低于25%提升至超过80%。此外，在固定资源约束下，对于具备推理能力的模型，更简单的结构化提示甚至优于思维链和ReAct。我们的工作迈出了基准测试LLM调试量子程序能力的第一步，并为未来自动化量子软件修复提供了实用见解。

英文摘要

Quantum software bugs often yield silent, incorrect outputs rather than explicit errors, making them particularly difficult to detect and repair with conventional techniques. Although large language models (LLMs) have shown strong performance on classical software engineering tasks, their ability to debug quantum code remains largely unexplored. To bridge this gap, we propose QBugLM, a multi-agent framework that automates the quantum software debugging pipeline, from taxonomy-driven bug injection to LLM-based detection and repair, and finally to simulation-based validation, for framework-agnostic OpenQASM 3.0 programs. We further conduct a comprehensive case study using QBugLM to benchmark two LLMs, Claude 4.6 Sonnet and Qwen3 Coder Next, across different prompting strategies, bug categories, and quantum programs. Our results show that iterative feedback is critical, as a single retry raises Pass@1 from below 25% to above 80%. Moreover, simpler structured prompting can even outperform Chain-of-Thought and ReAct for reasoning-capable models under fixed-resource constraints. Our work takes initial steps toward benchmarking LLM capabilities for debugging quantum programs and offers practical insights to support future efforts in automated quantum software repair.

URL PDF HTML ☆

赞 0 踩 0

2606.07285 2026-06-08 cs.GT 新提交

Improved Lower Bounds for Proportionally Fair Clustering

比例公平聚类的下界改进

Benjamin Cookson, Eva Deltl, Yeeseok Oh

AI总结研究比例公平聚类中的α-core存在性，通过构造实例将α-core非空的下界从2提高到2.1508，并利用MILP等方法精确刻画了少量候选中心情形下的阈值。

详情

AI中文摘要

我们研究比例公平聚类，其中必须从度量空间中选择一组$k$个中心来代表$n$个智能体，并且没有足够大的智能体群体应被集体低估。该设置中公平性的核心概念之一是$\alpha$-core。Chen等人[2019]证明了$(1+\sqrt{2})$-core中聚类的存在性，他们也展示了对于每个$\alpha < 2$，$\alpha$-core为空的实例。缩小这一差距七年来一直是一个开放问题。我们从下界方面取得进展，提供了一个实例，其$\alpha$-core对于每个$\alpha < 2.1508$为空。我们的技术依赖于建立core的变体（即Hare core和Droop core）之间的联系；将最优空core实例的搜索简化为高度结构化的聚类实例族；以及使用混合整数线性规划（MILP）在这个缩减空间中搜索最优下界实例。使用这个框架，我们还确定了具有少量可能候选中心且仅需选择一个中心的Droop配额聚类实例的紧界。对于每个中心数$m \in \{3,4,5,6\}$，我们给出了精确阈值$\alpha_m^*$，使得$\alpha_m^*$-core聚类总是存在，而对于每个$\alpha < \alpha_m^*$，存在一个具有$m$个中心的实例其$\alpha$-core为空。尽管这些值最初是通过计算机辅助搜索找到的，我们也提供了不依赖MILP证书的直接证明。

英文摘要

We study proportionally fair clustering, where a set of $k$ centers must be chosen from a metric space to represent $n$ agents, and no sufficiently large group of agents should be collectively underrepresented. One of the central notions of fairness in this setting is the $α$-core. The existence of clusterings in the $(1+\sqrt{2})$-core was established by Chen et al. [2019], who also showed instances where the $α$-core is empty for every $α< 2$. Closing this gap has remained an open problem for seven years. We make progress from the lower-bound side by providing an instance whose $α$-core is empty for every $α< 2.1508$. Our techniques rely on establishing connections between variants of the core, namely the Hare core and the Droop core; reducing the search for optimal empty-core instances to a highly structured family of clustering instances; and using a Mixed Integer Linear Program (MILP) to search for optimal lower-bound instances within this reduced space. Using this framework, we also determine tight bounds for Droop quota clustering instances with a small number of possible candidate centers and a single center to be selected. For each number of centers $m \in \{3,4,5,6\}$, we give the exact threshold $α_m^*$ such that an $α_m^*$-core clustering always exists, while for every $α< α_m^*$ there is an instance with $m$ centers whose $α$-core is empty. Although these values were originally found through computer-aided search, we also provide direct proofs that do not rely on MILP certificates.

URL PDF HTML ☆

赞 0 踩 0

2606.07283 2026-06-08 cs.HC 新提交

A Model of Integrated Information Processing in Human-AI Interaction

人机交互中的集成信息处理模型

Tim Schrills. Thomas Franke

AI总结提出集成信息处理（IIP）模型，将人类和AI视为耦合控制回路，通过三种信息处理质量（输入充分性、参考一致性、输出可操作性）连接心理机制与界面设计，指导人机耦合的设计与评估。

Comments 22 pages

详情

AI中文摘要

为了推动人机交互（HAII）研究的发展，需要将心理机制与界面设计联系起来的理论工作。这类工作应扩展而非取代现有的HCI和自动化研究，适应AI系统日益增强的自主性和能动性。基于先前关注人机交互中角色和层级的框架，从心理学视角仍存在空白：一个以任务为中心、面向过程的描述，将行动调节机制与人机耦合的具体设计和评估杠杆联系起来，并使用统一的人机词汇表达。此外，现有模型可能描述了系统如何设计（例如自动化中的功能分配），但未能展示这种设计如何影响人类行为。我们提出了集成信息处理（IIP）模型，这是一个以任务为中心的控制论模型，将人类、机器及其联合活动概念化为耦合控制回路。IIP模型使用统一建模语言描述人类和智能体，使行动调节的心理模型可用于AI系统设计。作为核心特征，我们认为共享任务中的效能由三种集成质量表征：输入充分性、参考一致性和输出可操作性，它们关键地影响以人为中心的基准，如透明度和可控性。该模型将界面选择（例如XAI技术）映射到理论驱动的用户行为期望，指导界面设计和评估。为此，我们提出：(1) 一个保持连续性的理论论述，将HAII扩展到AI的能动性；(2) 具有三种信息处理质量的IIP模型；(3) IIP模型在示例用例中的应用，展示对界面设计的启示。

英文摘要

For Human-AI Interaction (HAII) research to move forward, theoretical work linking psychological mechanisms to interface design is needed. Such work should extend rather than replace established HCI and automation research, adapting to the increasing autonomy and agency of AI systems. Building on prior frameworks focused on roles and levels in human interaction with automation, a gap remains from a psychological view: a task-centered, process-oriented account that links mechanisms of action regulation to concrete design and evaluation levers for human-AI coupling, expressed in a unified vocabulary for human and machine. Moreover, existing models may describe how a system is designed (e.g., function allocation in automation) but fall short in showing how this design affects human behavior. We present the Integrated Information Processing (IIP) model, a task-centered, cybernetic model that conceptualizes humans, machines, and their joint activity as coupled control loops. The IIP model uses a unified modeling language for human and artificial agents, making psychological models of action regulation accessible for AI system design. As a core feature, we argue that efficacy within a shared task is characterized by three integration qualities, input adequacy, reference consonance, and output operativity, which critically influence benchmarks of human-centeredness such as transparency and controllability. The model maps interface choices (e.g., XAI techniques) to theory-driven expectations of user behavior, guiding interface design and evaluation. To this end, we present (1) a continuity-preserving theoretical discourse that extends HAII to agency in AI; (2) the IIP model with three information-processing qualities; and (3) applications of the IIP model to exemplary use cases demonstrating implications for interface design.

URL PDF HTML ☆

赞 0 踩 0

2606.07282 2026-06-08 cs.CR cs.NI 新提交

Rethinking IoT Intrusion Detection: Augmenting Routing Metrics with Radio Features

重新思考物联网入侵检测：用无线电特征增强路由度量

Yichang Sun, Andreas Johnsson, Sourasekhar Banerjee

AI总结针对RPL物联网网络，提出在LSTM入侵检测系统中结合收发无线电特征与标准RPL特征，在三种攻击下F1分数提升高达4%。

Comments 4 Pages, 8 figures, Accepted to Swedish National Computer Networking Workshop (SNCNW) 2026

2606.07270 2026-06-08 cs.CY 新提交

Two-Phase Simulated Annealing for Equitable Team Formation: Eliminating Complaints in Large Engineering Cohorts

两阶段模拟退火算法实现公平团队组建：消除大型工程班级中的投诉

Yiwei Sun, Xinru Deng, Dimitrios G Papageorgiou

AI总结提出一种两阶段算法，先通过图聚类形成固定三人组，再使用模拟退火优化配对，在238名学生中实现零投诉、GPA方差0.005、94.3%偏好满意度。

Comments 9 pages, 3 figures

详情

AI中文摘要

贡献：本文提出一种新颖的两阶段算法方法，将偏好满足与公平优化解耦，在不妥协的情况下同时实现学生团队组建的两个目标。该方法将模拟退火——一种核心材料科学技术——应用于教育挑战，展示了行政流程的教学整合。背景：在大型工程班级（100+学生）中组建有效团队需要平衡学生偏好、学术公平和人口多样性。现有工具要么忽略偏好而优化公平（CATME、Team-Anneal），要么在牺牲平衡的情况下容纳偏好（自选），导致投诉率在5-35%之间。预期成果：消除正式投诉，实现团队间GPA方差接近零，防止性别孤立，保持高偏好满意度，同时创建可扩展、可重复的解决方案，适用于工程课程。应用设计：第一阶段通过图论聚类形成固定三人组，最大化相互偏好，保留社会纽带。第二阶段采用模拟退火将三人组配对成六人团队，同时优化GPA方差、性别平衡和规模约束。这种分解反映了材料加工中的分层优化。结果：在238名学生中部署后，算法完全消除了正式投诉（对比>30%基线），实现了GPA方差0.005（对比历史均值9.74），消除了性别孤立的个体，并保持了94.3%的偏好满意度。针对82个历史分组实例（1538个团队，6个学年）的验证证实了相对于传统方法的显著改进。

英文摘要

Contribution: This paper presents a novel two-phase algorithmic approach that decouples preference satisfaction from fairness optimization in student team formation, achieving both objectives without compromise. The method applies simulated annealing -- a core materials science technique -- to an educational challenge, demonstrating pedagogical integration of administrative processes. Background: Forming effective teams in large engineering cohorts (100+ students) requires balancing student preferences, academic fairness, and demographic diversity. Existing tools either optimize for fairness while ignoring preferences (CATME, Team-Anneal) or accommodate preferences while compromising balance (self-selection), leaving complaint rates at 5--35%. Intended Outcomes: Eliminate formal complaints, achieve near-zero GPA variance between teams, prevent gender isolation, and maintain high preference satisfaction while creating a scalable, reproducible solution applicable across engineering programs. Application Design: Phase 1 forms fixed triads through graph-theoretic clustering that maximizes mutual preferences, preserving social bonds. Phase 2 employs simulated annealing to pair triads into teams of six while optimizing GPA variance, gender balance, and size constraints. This decomposition mirrors hierarchical optimization in materials processing. Findings: Deployed across 238 students, the algorithm eliminated formal complaints entirely (vs >30% baseline), achieved GPA variance of 0.005 (vs. historical mean 9.74), eliminated gender-isolated individuals, and maintained 94.3% preference satisfaction. Validation against 82 historical grouping instances (1,538 teams, 6 academic years) confirmed significant improvement over conventional methods.

URL PDF HTML ☆

赞 0 踩 0

2606.07258 2026-06-08 cs.CE q-bio.QM 新提交

CaliPPer: quantifying, predicting and improving AI model performance for binding prediction

CaliPPer：量化、预测和改进AI模型在结合预测中的性能

Jian-Qing Zheng, Hantao Lou, Zinan Yin, Sam Farrar, Yuze Zhou, Elie Antoun, Xiangxi Wang, Xuetao Cao, Tao Dong

AI总结提出CaliPPer框架，通过多链样本到域距离和距离感知贝叶斯重校准，在三个分辨率上量化、预测和改进AI模型在结合预测中的性能，显著提升新表位、抗原变体和化学骨架上的发现率。

详情

AI中文摘要

结合预测模型加速了治疗性抗体和TCR的发现，但其在新数据集上的性能不可预测，常导致低发现率。密度比方法（PAPE, M-CBPE）为二分类提供无标签性能估计，但其假设和仅聚合输出限制了在新表位、抗原变体和化学骨架上的结合预测。这里我们提出CaliPPer（性能校准与预测），一个事后框架，将多链样本到域距离（S2DD）与距离感知贝叶斯重校准配对，在三个分辨率上运行：泛化性分数、聚合性能预测和每个样本置信度。在十个模型、八个架构和两个免疫受体域上，CaliPPer达到了距离-性能相关性$|r|=0.80\text{--}0.92$，预测AUROC/AP/F1的平均绝对误差为$0.008\text{--}0.070$，并在未见表位/变体上将AUROC提升高达$+0.20$。回顾性地应用于五个已发表的TCR、BCR、MHC-肽和小分子研究，CaliPPer在所有五个研究中提高了真实发现率（例如，$0/5 \to 3/5$确认的新抗原），在计算预测和实验验证之间提供了一个分诊层。

英文摘要

Binding prediction models accelerate therapeutic antibody and TCR discovery, but their performance on new datasets is unpredictable, often leading to low discovery rates. Density-ratio methods (PAPE, M-CBPE) provide label-free performance estimation for binary classification, but their assumptions and aggregate-only outputs limit binding prediction on neoepitopes, antigen variants and chemical scaffolds. Here we present CaliPPer (Calibration and Prediction of Performance), a post-hoc framework pairing a multi-chain Sample-to-Domain Distance (S2DD) with distance-aware Bayesian recalibration, operating at three resolutions: generalisability score, aggregate performance prediction, and per-sample confidence. Across ten models, eight architectures and two immune-receptor domains, CaliPPer attains distance--performance correlations $|r|=0.80\text{--}0.92$, predicts AUROC/AP/F1 with mean absolute errors $0.008\text{--}0.070$, and improves AUROC by up to $+0.20$ on unseen epitopes/variants. Applied retrospectively to five published TCR, BCR, MHC--peptide and small-molecule studies, CaliPPer raises true discovery rates in all five (e.g.\ $0/5 \to 3/5$ confirmed neoantigens), providing a triage layer between computational prediction and experimental validation.

URL PDF HTML ☆

赞 0 踩 0

2606.07248 2026-06-08 cs.DC 新提交

Clairvoyant: Predictive SJF Scheduling to Mitigate Head-of-Line Blocking in Serial LLM Backends

Clairvoyant: 预测性SJF调度以缓解串行LLM后端中的队头阻塞

Aravind Sundaresan

AI总结提出Clairvoyant，一种轻量级侧车代理，通过预测请求长度实现SJF调度，缓解串行LLM后端中的队头阻塞，显著降低短请求延迟。

Comments 17 pages, 3 figures, 8 tables. Code: https://github.com/Aravind0403/clairvoyant-scheduler

详情

AI中文摘要

串行LLM推理后端（如Ollama）在FCFS准入策略下逐个处理请求，导致高利用率混合工作负载下的队头阻塞（HOLB）：短事实查询可能被长生成任务延迟数分钟。虽然云规模部署通过连续批处理（vLLM, Orca）缓解HOLB，但这些方案需要数十GB的VRAM来维护并发KV缓存——对于依赖串行请求分发的内存受限边缘和本地部署不可行。我们提出\clairvoyant，一个即插即用的侧车代理，适用于任何串行OpenAI兼容后端（如Ollama, this http URL）。\clairvoyant通过ONNX导出的XGBoost分类器，从19个轻量级词汇特征预测响应长度，每个请求延迟0.029毫秒（比典型生成时间低四个数量级）。由于准入调度依赖于相对顺序而非精确预测，系统优化排序保真度，在自然对话数据集上达到62-96%的分布内准确率和52-66%的跨分布准确率。我们发现，精心设计的指令数据集是长度预测的退化训练源：GPT施加的简洁约束将长类表示减少到示例的0.02%以下，使得自然对话日志成为唯一可行的训练源。在RTX 4090上的端到端GPU基准测试显示，在最大队列压力（100个并发请求）下短请求的P50延迟降低70-76%，在稳态泊松到达（ρ=0.74）下降低17%。\clairvoyant是开源的，无需修改推理后端。

英文摘要

Serial LLM inference backends -- such as Ollama -- process requests one at a time under FCFS admission, causing Head-of-Line Blocking (HOLB) under mixed workloads at high utilisation: short factual queries can be delayed by minutes behind long generation jobs. While cloud-scale deployments mitigate HOLB via continuous batching (vLLM, Orca), these solutions require tens of GB of VRAM for concurrent KV-caches -- infeasible for memory-constrained edge and local deployments that rely on serial request dispatch. We present \clairvoyant, a drop-in sidecar proxy for any serial OpenAI-compatible backend (e.g., Ollama, llama.cpp). \clairvoyant predicts response length from 19 lightweight lexical features via an ONNX-exported XGBoost classifier, achieving 0.029\,ms per-request latency (four orders of magnitude below typical generation time). Because admission scheduling depends on relative ordering rather than exact prediction, the system optimises ranking fidelity, achieving 62--96\% in-distribution and 52--66\% cross-distribution accuracy across natural conversation datasets. We find that curated instruction datasets are degenerate training sources for length prediction: GPT-imposed brevity constraints reduce Long-class representation to under 0.02\% of examples, making natural conversation logs the only viable training source. End-to-end GPU benchmarks on an RTX~4090 show 70--76\% P50 latency reduction for short requests under maximum queue pressure (100 concurrent requests) and 17\% under steady-state Poisson arrivals ($ρ=0.74$). \clairvoyant is open-source and requires no modifications to the inference backend.

URL PDF HTML ☆

赞 0 踩 0

2606.07246 2026-06-08 cs.AR 新提交

MailoHLS: Multi-Adapter Structure-Aware Learning for Pareto-Driven HLS Pragma Optimization

MailoHLS: 面向帕累托驱动HLS编译指示优化的多适配器结构感知学习

Elena Vouvali, Dimosthenis Masouros, Aggelos Ferikoglou, Dimitrios Soudris, Sotirios Xydis

AI总结提出MailoHLS混合框架，结合LLM语义推理与GNN结构建模，通过交叉注意力、目标条件LoRA适配器和帕累托优化，实现HLS编译指示的联合优化，在延迟优化上最高提速12.42倍，并持续生成近帕累托最优设计。

详情

AI中文摘要

高层次综合（HLS）能够快速开发FPGA加速器，但由于编译器指令（即编译指示）导致的设计空间庞大且不规则，实现高质量结果（QoR）仍然具有挑战性。选择有效配置需要推理程序结构、内存行为以及延迟和资源利用率等常常相互冲突的目标之间的复杂交互。先前的模型驱动方法在跨内核的泛化能力上表现有限，且无法捕捉更高层次的优化意图。最近，大型语言模型（LLM）能够捕捉代码语义和高层意图，但其顺序表示阻碍了对结构依赖性和全局权衡的建模，导致HLS设计次优。我们提出MailoHLS，一个混合框架，结合了基于LLM的语义推理和基于GNN的结构建模，用于目标感知的指令优化。通过交叉注意力集成结构嵌入，并利用PEFT与目标条件LoRA适配器以及帕累托驱动优化，MailoHLS能够对代码语义、结构和设计权衡进行联合推理。在已见和未见的内核上，MailoHLS在延迟优化上实现了高达12.42倍和8.4倍的加速（几何平均分别为9.48倍和4.97倍），持续生成接近帕累托最优的设计。在完全未见过的应用上，它达到了高达10.2倍的加速（几何平均6.58倍），优于高端LLM和先前方法，同时缩小了与帕累托前沿的差距。

英文摘要

High-Level Synthesis (HLS) enables rapid development of FPGA accelerators, yet achieving high-quality results (QoR) remains challenging due to the large and irregular design space induced by compiler directives (a.k.a pragmas). Selecting effective configurations requires reasoning over complex interactions between program structure, memory behavior, and often conflicting objectives such as latency and resource utilization. Prior model-driven approaches exhibit limited generalization across kernels and fail to capture higher-level optimization intent. Recently, Large Language Models (LLMs) capture code semantics and high-level intent, but their sequential representations hinder modeling of structural dependencies and global trade-offs, leading to suboptimal HLS designs. We present MailoHLS, a hybrid framework that combines LLM-based semantic reasoning with GNN-based structural modeling for objective-aware directive optimization. By integrating structural embeddings via cross-attention and leveraging PEFT with objective-conditioned LoRA adapters and Pareto-driven optimization, MailoHLS enables joint reasoning over code semantics, structure, and design trade-offs. Across seen and unseen kernels, MailoHLS achieves up to 12.42x and 8.4x speedup (9.48x and 4.97x geometric mean) for latency optimization, consistently producing near-Pareto-optimal designs. On fully unseen applications, it reaches up to 10.2x speedup (6.58x geometric mean), outperforming high-end LLMs and prior approaches while narrowing the gap to the Pareto frontier.

URL PDF HTML ☆

赞 0 踩 0

2606.07238 2026-06-08 cs.GT 新提交

No, Cake Cutting Really is a Piece of Cake

不，蛋糕切割确实是小菜一碟

Stephen Arndt, Benjamin Moseley, Sungjin Im, Kirk Pruhs

AI总结提出一种确定性蛋糕切割算法，使用线性数量的切割实现比例公平。

2606.07231 2026-06-08 cs.HC 新提交

Moodie: An Early-Stage Design Exploration for Supporting Fear of Missing Out with LLM-based Chatbots

Moodie：基于LLM的聊天机器人支持错失恐惧症的早期设计探索

Hsin-Yu Tsai, Jingxian Liao, Fu-Yin Cherng, Tzu-Hsiang Huang

AI总结提出基于大语言模型的聊天机器人Moodie，通过情绪调节支持减少错失恐惧症，初步评估显示其相比通用模型能提高用户参与度和社交连接。

Comments 7 pages, 1 figure, 1 table. Preliminary work submitted to the ACM CUI 2026 Works-in-Progress (WiP) track

详情

AI中文摘要

社交媒体的过度使用导致了被称为错失恐惧症（FoMO）的挑战。现有研究未能提供可访问的、交互式的工具，专注于FoMO的情感和认知方面。本工作提出了Moodie，一个使用大语言模型设计的聊天机器人，以支持情绪调节并减少FoMO。我们进行了一项形成性研究以了解FoMO个体的需求，并开发了Moodie。然后，我们进行了一项初步评估研究（N=21），观察参与者与Moodie和基线聊天机器人（GPT-4o）在一周内的互动。结果显示，虽然Moodie和基线聊天机器人在减少FoMO方面程度相似，但Moodie带来了更高的参与度和社交连接。这一发现引发了关于专用聊天机器人相比通用模型在心理健康支持方面优势的有趣问题。未来研究将包括聊天记录分析、原型改进和纵向评估。

英文摘要

The excessive use of social media has led to the challenge known as Fear of Missing Out (FoMO). Existing studies fail to provide accessible, interactive tools that focus on the emotional and cognitive aspects of FoMO. This work presents Moodie, a chatbot designed using Large Language Models to support emotion regulation and reduce FoMO. We conducted a formative study to understand the needs of individuals with FoMO and developed Moodie. Then, we conducted a preliminary evaluative study (N=21) to observe how participants interact with Moodie and a baseline chatbot (GPT-4o) over one week. The results show that while both Moodie and a baseline chatbot reduced FoMO to a similar extent, Moodie resulted in greater engagement and social connection. This finding raises interesting questions about the advantages of purpose-built chatbots compared to general-purpose models for mental health support. Future research will include chat log analysis, prototype refinements, and longitudinal evaluations.

URL PDF HTML ☆

赞 0 踩 0

2606.07215 2026-06-08 cs.CE 新提交

A Comparative Study of Deep Learning Models for Geological Carbon Sequestration

深度学习模型在地质碳封存中的比较研究

Giovanni Zingaro, Robert Gracie, Yuri Leonenko

AI总结比较U-Net、V-Net、TCN、FNO和U-FNO等深度学习代理模型在地质碳封存中预测瞬态压力积聚和CO2饱和度的性能，发现U-FNO对CO2饱和度预测最准，FNO对压力预测最佳。

详情

AI中文摘要

数值油藏模拟计算成本极高，因为需要反复求解由离散控制方程导出的大型非线性代数系统。随着数字孪生应用中对实时优化、不确定性量化和历史匹配的需求增长，降低计算成本变得至关重要。基于深度学习（DL）的代理模型已成为加速地下流动模拟的有效方法。在此，我们试图确定哪些DL架构最适合高维、瞬态地下流动问题。在本研究中，我们考察了训练此类模型的优势和相对成本，包括内存需求、训练速度、准确性、鲁棒性和泛化能力。我们对几种常用作地下流动问题代理模型的DL架构进行了比较研究，包括U-Net、V-Net、时间卷积网络、傅里叶神经算子（FNO）和U-Net增强的FNO（U-FNO）。作为基准，我们比较了所研究模型在地质碳封存中预测瞬态压力积聚和CO$_2$饱和度场的性能。我们研究了在二维域中单井注入CO$_2$的问题，该问题由各向异性、非均质渗透率和孔隙度场、注入配置和储层属性参数化。结果表明，代理模型的性能强烈依赖于底层PDE类型（即双曲型与椭圆型）。U-FNO在预测CO$_2$饱和度场方面达到了最高精度，而FNO在压力积聚预测方面提供了最佳性能。

英文摘要

Numerical reservoir simulations are extremely computationally expensive, as they require the repeated solution of large nonlinear algebraic systems derived from the discretized governing equations. With growing demand for real-time optimization, uncertainty quantification, and history matching in digital twin applications, reducing computational cost has become essential. Deep learning (DL)--based surrogate models have emerged as an effective approach for accelerating subsurface flow simulations. Here, we seek to determine which DL architectures are best suited for high-dimensional, transient subsurface flow problems. In this study, we examine the advantages and relative costs associated with training such models, including memory requirements, training speed, accuracy, robustness, and generalization. We conduct a comparative study of several DL architectures commonly used as surrogate models for subsurface flow problems, including U-Net, V-Net, Temporal Convolutional Networks, Fourier Neural Operators (FNO), and a U-Net--enhanced FNO (U-FNO). As a benchmark, we compare the performance of the studied models for geological carbon sequestration to predict transient pressure build-up and CO$_2$ saturation fields. We study the problem of CO$_2$ injection into a single wellbore in a two-dimensional domain, which is parameterized by anisotropic, heterogeneous permeability and porosity fields, injection configurations, and reservoir properties. Results demonstrate that surrogate model performance is strongly dependent on the underlying PDE type (i.e., hyperbolic vs. elliptic). The U-FNO achieves the highest accuracy for predicting CO$_2$ saturation fields, while the FNO provides the best performance for pressure build-up prediction.

URL PDF HTML ☆

赞 0 踩 0

2606.07208 2026-06-08 eess.SY cs.SY 新提交

Unlocking feedforward capabilities in Model Predictive Control algorithms to deal with measurable disturbances

解锁模型预测控制算法中的前馈能力以处理可测扰动

José Luis Guzmán, Igor Pataro, Juan D. Gil, Manuel Berenguel

AI总结提出一种在MPC中嵌入真正前馈能力的双控制结构框架，通过无控制代价的前馈动作实现可测扰动的完全补偿，并在DMC、GPC和状态空间MPC中验证有效性。

详情

AI中文摘要

扰动抑制是过程控制的核心目标，特别是当可测扰动可以通过前馈动作加以利用时。尽管模型预测控制（MPC）自然地包含扰动模型和预测能力，但标准公式无法实现完全扰动抑制，因为代价函数惩罚控制努力。这一限制阻止了MPC复现经典前馈补偿器的行为。本文提出了一种新颖的框架，在不移除控制努力惩罚的情况下，在MPC中嵌入真正的前馈能力。该方法引入了一种双控制结构，其中同时计算两种控制动作：面向跟踪的动作，处理设定点跟踪和鲁棒性；以及面向前馈的动作，专门用于扰动抑制。两种贡献被组合成一个单一的控制信号，并显式地施加过程约束。面向前馈的动作在无控制努力惩罚的情况下制定，从而实现对可测扰动的完全补偿。该方法针对动态矩阵控制（DMC）、广义预测控制（GPC）和状态空间MPC进行了开发。通过仿真研究，包括与标准MPC和经典前馈方案的比较，证明了其有效性。基于反渗透过程的案例研究表明，所提出的方法在保持约束处理和整体控制性能的同时，改善了扰动抑制。

英文摘要

Disturbance rejection is a central objective in process control, particularly when measurable disturbances can be exploited through feedforward action. Although Model Predictive Control (MPC) naturally incorporates disturbance models and prediction capabilities, standard formulations cannot achieve complete disturbance rejection since the cost function penalises control effort. This limitation prevents MPC from reproducing the behaviour of classical feedforward compensators. This work proposes a novel framework to embed true feedforward capabilities within MPC without removing the control effort penalty. The approach introduces a dual-control structure in which two control actions are computed simultaneously: a tracking-oriented action addressing set-point tracking and robustness, and a feedforward-oriented action dedicated to disturbance rejection. Both contributions are combined into a single control signal on which the process constraints are explicitly enforced. The feedforward-oriented action is formulated without penalising control effort, enabling full compensation of measurable disturbances. The methodology is developed for Dynamic Matrix Control (DMC), Generalised Predictive Control (GPC), and state-space MPC. Its effectiveness is demonstrated through simulation studies, including comparisons with standard MPC and classical feedforward schemes. A case study based on a reverse osmosis process shows that the proposed approach improves disturbance rejection while preserving constraint handling and overall control performance.

URL PDF HTML ☆

赞 0 踩 0

2606.07202 2026-06-08 cs.SI 新提交

Technological Fitness and Regional Growth in Japan

技术适应性与日本区域增长

Rintaro Karashima, Hiroyasu Inoue

AI总结利用约390万条企业专利记录构建二分网络，通过Fitness-Complexity算法评估日本47个都道府县的技术能力，发现技术适应性与后续经济增长正相关，且对低收入地区影响更大。

详情

AI中文摘要

技术知识在塑造区域经济表现中扮演重要角色。本研究考察了日本各都道府县区域技术能力的 sophistication 与经济增长之间的关系。利用1981至2015财年约390万条企业专利记录，我们构建了连接47个都道府县与35个技术类别的二分网络，并应用Fitness-Complexity算法为七个五年期导出区域Fitness得分。我们使用Driscoll-Kraay标准误估计固定效应面板模型，以随后五年人均实际地区生产总值年均增长率为因变量。在控制初始收入、人口密度和专利活动后，都道府县Fitness与后续增长正相关（$\hat{\beta} = 0.0029$，$p = 0.007$），但仅在同时包含个体和时间固定效应时该关系可检测。Fitness与后续增长之间的横截面相关性在不同时期改变符号，凸显了面板方法的重要性。Fitness的增长效应在初始收入较低的都道府县更强，表明技术 sophistication 在经济扩张空间更大的地区对增长的贡献更大。滞后和领先分析表明，该关系是从Fitness到后续增长，而非反向。

英文摘要

Technological knowledge plays an important role in shaping regional economic performance. This study examines the relationship between the sophistication of regional technological capabilities and economic growth across Japanese prefectures. Using approximately 3.9 million corporate patent records filed from fiscal years 1981 to 2015, we construct bipartite networks linking 47 prefectures to 35 technology classes and apply the Fitness-Complexity algorithm to derive regional Fitness scores for seven five-year periods. We estimate fixed-effects panel models with Driscoll-Kraay standard errors, using the annual average growth rate of real gross regional product per capita over the subsequent five years as the dependent variable. Prefectural Fitness is positively associated with subsequent growth ($\hatβ = 0.0029$, $p = 0.007$) after controlling for initial income, population density, and patenting activity, but this relationship is detectable only when both entity and time fixed effects are included. Cross-sectional correlations between Fitness and subsequent growth change sign across periods, underscoring the importance of the panel approach. The growth effect of Fitness is stronger in prefectures with lower initial income, suggesting that technological sophistication contributes more to growth where there is greater scope for economic expansion. Lag and lead analyses indicate that the relationship runs from Fitness to subsequent growth rather than the reverse.

URL PDF HTML ☆

赞 0 踩 0

2606.07200 2026-06-08 cs.MA 新提交

Learning Multi-Agent Communication Protocol: Study on Information Entropy Efficiency in MARL

学习多智能体通信协议：MARL中信息熵效率研究

Xinren Zhang, Zixin Zhong, Jiadong Yu

AI总结提出信息熵效率指数(IEI)作为量化通信效率的指标，通过将其纳入训练损失函数，使智能体学习到平衡性能与信息紧凑性的通信协议，实验表明在保持或提升任务性能的同时提高通信效率。

详情

AI中文摘要

多智能体系统(MAS)已成为分布式问题求解的基本范式，其中自主智能体协作实现复杂目标。在此框架下，带通信的多智能体强化学习(MARL)在协作任务中取得了显著成功。然而，现有方法主要通过日益复杂的架构和不断增加的通信开销来追求性能提升，缺乏评估信息交换效率的原则性指标。本文专注于使智能体学习高效的多智能体通信协议，以平衡性能和信息紧凑性。我们提出信息熵效率指数(IEI)，这是一个新颖的指标，用于量化学习到的通信协议中消息熵与任务性能之间的比率。较低的IEI表示更紧凑和高效的消息表示。通过将IEI纳入训练损失函数，我们鼓励智能体开发出以更高通信效率实现高性能的通信协议。跨多种MARL算法的大量实验表明，与基线方法相比，我们的方法在提高通信效率的同时实现了等效或更优的任务性能。这些发现挑战了性能提升需要复杂架构或增加通信开销的主流假设，并凸显了同时提高任务成功率和通信效率以实现可扩展MAS的潜力。

英文摘要

Multi-Agent Systems (MAS) have emerged as a fundamental paradigm for distributed problem-solving, where autonomous agents collaborate to achieve complex objectives. Within this framework, Multi-Agent Reinforcement Learning (MARL) with communication has demonstrated remarkable success in cooperative tasks. However, existing approaches predominantly pursue performance gains through increasingly complex architectures and expanding communication overhead, lacking principled metrics to evaluate the efficiency of information exchange. In this paper, we focus on enabling agents to learn efficient multi-agent communication protocols that balance performance and information compactness. We propose the Information Entropy Efficiency Index (IEI), a novel metric that quantifies the ratio between message entropy and task performance in learned communication protocols. A lower IEI indicates more compact and efficient message representations. By incorporating IEI into training loss functions, we encourage agents to develop communication protocols that achieve high performance with improved communication efficiency. Extensive experiments across diverse MARL algorithms demonstrate that our approach achieves equivalent or superior task performance compared to baseline methods while improving communication efficiency. These findings challenge the prevailing assumption that performance improvements require complex architectures or increased communication overhead and highlight the potential of improving both task success and communication efficiency to enable scalable MAS.

URL PDF HTML ☆

赞 0 踩 0

2606.07187 2026-06-08 cs.IR 新提交

RISE: A Rust Library for Inverted Index Search Engines

RISE：一个用于倒排索引搜索引擎的Rust库

Angelo Savino, Rossano Venturini

AI总结提出Rust实现的倒排索引库RISE，利用Rust安全性和性能，通过可扩展trait系统提供高效查询，速度可达现有最优的2倍。

详情

AI中文摘要

倒排索引是大规模文本语料库中高效信息检索的关键数据结构。它通过将每个词项映射到出现该词的文档，实现快速全文搜索，并在此基础上通过高效算法快速检索与用户查询相关的文档。我们提出了RISE，一个用Rust实现的新型倒排索引库，旨在为信息检索任务提供高性能和高效率。RISE利用Rust的安全性和性能，为构建和查询倒排索引提供了稳健的解决方案，并通过其富有表现力的trait系统提供了可访问的扩展性。在开发RISE的过程中，我们重新审视了倒排索引文献，从而使用这个新的测试平台复现了许多先前的工作。我们评估了RISE与现有库的性能，展示了在各种数据集和工作负载下具有竞争力的查询性能，速度比当前最先进技术提升高达2倍。我们的结果表明，RISE是信息检索领域研究人员和从业者的一个有前途的工具。

英文摘要

Inverted indexes are a crucial data structure for efficient information retrieval in large text corpora. They enable fast full-text search by mapping each term to the documents in which it appears, on top of which efficient algorithms quickly retrieve the documents relevant to a user query. We present RISE, a novel inverted index library implemented in Rust, designed to deliver high performance and efficiency for information retrieval tasks. RISE leverages Rust's safety and performance to provide a robust solution for building and querying inverted indexes, while offering accessible extensibility through its expressive trait system. While developing RISE, we revisited the inverted-index literature, thereby reproducing numerous prior works using this new test bench. We evaluated RISE against existing libraries, demonstrating competitive query performance across various datasets and workloads, with speedups of up to 2x over the current state of the art. Our results indicate that RISE is a promising tool for researchers and practitioners in the field of information retrieval.

URL PDF HTML ☆

赞 0 踩 0

2606.07159 2026-06-08 cs.ET cs.AR 新提交

Distributed Persistence Domain for Persistent Memory Pooling

分布式持久域用于持久内存池化

Khan Shaikhul Hadi, Andres David Delgado, Naveed Ul Mustafa, Mark Heinrich, Hao Zheng, Yan Solihin

AI总结针对CXL内存池化中持久化延迟高的问题，提出分布式持久域（DPD）抽象，在CXL交换机中实现持久支持，通过读转发和写合并优化，平均加速33%。

详情

AI中文摘要

Compute Express Link (CXL) 支持通过分解内存进行内存池化，有望提高持久内存系统的资源利用率。然而，将持久化语义集成到基于CXL的内存池化中会引入大量延迟，限制了系统可扩展性。这种开销源于持久化操作必须遍历整个CXL结构，包括交换机、链路和协议层，才能到达远程持久内存。为此，我们认为扩展CXL交换机以支持持久化是提高持久内存池化可扩展性的有前景的方向。然而，将持久化支持移入网络会破坏传统集中式持久域的正确性假设。特别是，在分布式结构（如CXL交换机）中启用持久化，如果协调不当，可能会引入过期读取和写入。在本文中，我们提出分布式持久域（DPD），这是一种用于持久内存池化的新抽象，可在CXL交换机级别实现持久化支持。我们首先形式化分布式持久域的概念，并使用DPD作为框架来识别持久化结构分布在CXL结构中时出现的正确性风险。基于此分析，我们推导出保证正确性所需的设计要求。基于这些见解，我们提出了持久化CXL交换机，这是一种包含持久化支持的CXL交换机架构，可显著降低持久化延迟，实现读转发和写合并，同时保持正确性和崩溃一致性。我们使用SPLASH-4和YCSB基准测试评估了系统设计。模拟结果显示，与易失性CXL交换机相比，平均加速33%，在所有工作负载中，通过读转发优化可实现高达36%的加速。

英文摘要

Compute Express Link (CXL) enables memory pooling over disaggregated memory, offering the potential to improve resource utilization in persistent memory systems. However, integrating persistence semantics into CXL-based memory pooling introduces substantial latency, which limits system scalability. This overhead arises because persist operations must traverse the entire CXL fabric, including switches, links, and protocol layers, before reaching remote persistent memory. To this end, we argue that extending CXL switches with persistence support is a promising direction for improving the scalability of persistent memory pooling. However, moving persistence support into the network breaks the traditional correctness assumptions of centralized persistence domains. In particular, enabling persistence within distributed structures, such as CXL switches, can introduce stale reads and writes if not carefully coordinated. In this paper, we propose Distributed Persistence Domain (DPD), a new abstraction for persistent memory pooling that enables persistence support at the CXL switch level. We first formalize the concept of a distributed persistence domain and use DPD as a framework to identify the correctness hazards that arise when persistence structures are distributed across the CXL fabric. Based on this analysis, we derive the design requirements needed to guarantee correctness. Building on these insights, we present Persistent CXL Switch, a CXL switch architecture that incorporates persistence support to significantly reduce persist latency, enable read forwarding, and coalesce writes, while preserving correctness and crash consistency. We evaluated our system design using both SPLASH-4 and YCSB benchmarks. Simulation results show an average speedup of 33% over volatile CXL switches, and up to 36% speedup with read forwarding optimization across all workloads.

URL PDF HTML ☆

赞 0 踩 0

2606.07158 2026-06-08 cs.CR 新提交

Synthetic APTs: the Collapse of TTP-Based Attribution

合成APT：基于TTP的归因的崩溃

Francesco Balassone, Víctor Mayoral-Vilches, María Sanz-Gómez, Paul Zabalegui-Landa, Stefan Rass, Davide Quarta, Daniel Sanchez-Prieto, Marina Oteiza-Álvarez, Almerindo Graziano, Lauren Min Kim, MinSeok Choi

AI总结研究AI驱动的对手模拟是否挑战基于TTP的归因，通过CSI框架配置五个APT组进行实验，发现企业网络均被攻陷且攻击者独立武器化防御工具，表明AI时代TTP归因基础被削弱。

详情

AI中文摘要

网络威胁情报（CTI）归因依赖于识别区分不同威胁行为者的战术、技术和程序（TTP）。这种方法预设每个对手都会留下可识别的操作指纹。本研究调查AI驱动的对手模拟是否挑战了这一预设。我们部署了来自网络安全超级智能（CSI）框架的智能体，配置为五个高级持续性威胁（APT）组——APT28、APT29、APT41、APT44和Lazarus Group，与AI驱动的防御智能体在CYBER RANGES提供的两个网络靶场（企业网络和军事基础设施）中进行对抗，靶场配备了防御软件Wazuh、Velociraptor、Elasticsearch和主动AI防御者。在20次使用两种防御模型的实验中，出现了一个二元模式：所有10次企业网络实验均被攻陷，每次实验攻陷2至12台主机；而所有10次军事网络实验均成功防御或陷入僵局，与APT配置文件或防御模型无关。在10次企业网络实验中的8次，攻击者独立地将防御者自己的Velociraptor端点管理平台武器化为命令与控制通道，这是一种未编码在任何威胁情报配置文件中的趋同行为。我们认为，在AI时代，只要提供正确的模型并配备适当的框架和智能体配置，就可以部署智能体，像国家行为体APT那样行动的门槛已经崩溃：除了国家行为体外，个人现在也可以像常见威胁行为者一样行动，从而从根本上削弱了基于TTP的归因。

英文摘要

Cyber Threat Intelligence CTI attribution relies on identifying the Tactics, Techniques, and Procedures TTPs that distinguish one threat actor from another. This approach presupposes that each adversary leaves a recognizable operational fingerprint. This work investigates whether AI driven adversary emulation challenges that presupposition. We deploy agents from our Cybersecurity SuperIntelligence CSI framework, configured as five Advanced Persistent Threat APT groups, APT28, APT29, APT41, APT44, and Lazarus Group, against AI driven Defender agents across two cyber ranges provided by CYBER RANGES, equipped with defensive software Wazuh, Velociraptor, Elasticsearch and active AI driven defenders: an enterprise network and a military infrastructure. Across 20 experiments using two defender models, a binary pattern emerges: all 10 Enterprise range experiments resulted in compromise 2 to 12 hosts per experiment, while all 10 Military range experiments were successfully defended or resulted in stalemates, regardless of APT profile or defender model. In 8 of 10 Enterprise experiments, attackers independently weaponized the defender's own Velociraptor endpoint management platform as a command and control channel, a convergent behavior not encoded in any threat intelligence profile. We argue that in the AI era, wherein agents can be deployed provided the right models are available and subject to the right scaffolding and agentic configuration, the entry barrier for operating like a nation state APT collapses: beyond nation states, individuals can now act like commonly identified threat actors, and with it, fundamentally undermine TTP based attribution.

URL PDF HTML ☆

赞 0 踩 0

2606.07156 2026-06-08 cs.PF 新提交

ANNS-AMP: Accelerating Approximate Nearest Neighbor Search via Adaptive Mixed-Precision Computing

ANNS-AMP：通过自适应混合精度计算加速近似最近邻搜索

Mingkai Chen, Cheng Liu, Shengwen Liang, Lei Zhang, Xiaowei Li, Huawei Li

AI总结提出自适应混合精度框架ANNS-AMP，利用PQ索引的聚类结构预测精度，设计位串行加速器，在保持精度损失低于2.7%时实现平均163.76倍性能提升和1100倍能耗降低。

详情

AI中文摘要

近似最近邻搜索（ANNS）是现代应用（如大语言模型和推荐系统）中的关键内核。然而，其效率从根本上受到计算查询与大量高维向量（其中大多数是不相关的）之间距离的需求的限制。现有方法通过索引优化或提前终止来减少冗余，但仍受限于固定精度计算，导致不必要的算术和内存带宽开销。本文提出ANNS-AMP，一种自适应混合精度框架和加速器，它根据查询和数据特征自适应调整距离计算的精度。关键洞察是向量空间的不同区域需要不同精度级别以保持top-k结果。ANNS-AMP利用基于PQ索引的聚类结构，并引入轻量级预测器，根据尺度、半径和查询等特征在运行时确定聚类级精度。为了高效实现可变精度执行，我们设计了一种位串行加速器，采用位交错数据布局，使吞吐量随精度降低而扩展，同时通过贪心调度策略缓解内存带宽瓶颈和负载不平衡。此外，运行时预测器可以重用位串行计算阵列以实现高效的运行时预测，并且可以无缝集成到ANNS流水线中而不影响性能。根据我们在代表性数据集上的实验，ANNS-AMP相比CPU、GPU和定制ANNS加速器基线，平均实现163.76倍、10.57倍和2.06倍的性能加速，并分别降低平均能耗1100.00倍、39.41倍和6.66倍，同时保持精度损失低于2.7%。这些结果表明，自适应混合精度计算是高效大规模ANNS的一个有前景的方向。

英文摘要

Approximate nearest neighbor search(ANNS) is a critical kernel in modern applications such as LLM and recommendation systems.However,its efficiency is fundamentally limited by the need to compute distances between a query and a massive number of high-dimensional vectors,most of which are non-neighbors.Existing approaches reduce redundancy via index optimization or early termination,but remain constrained by fixed-precision computation,leading to unnecessary arithmetic and memory bandwidth overhead.This paper presents ANNS-AMP,an adaptive mixed-precision framework and accelerator that adapts the precision of distance computation to the characteristics of queries and data distribution.The key insight is that different regions of the vector space require different levels of precision to preserve top-k accuracy.ANNS-AMP leverages the clustered structure of PQ-based indices and introduces a lightweight predictor to determine cluster-level precision at runtime based on features such as scale,radius,and query distance.To efficiently realize variable-precision execution,we design a bit-serial accelerator with a bit-interleaved data layout,enabling throughput to scale with reduced precision while mitigating memory bandwidth bottlenecks and load imbalance through a greedy scheduling strategy.Moreover,the runtime predictor can also reuse the bit-serial computing array for efficient runtime prediction and can be fitted to the ANNS pipeline without performance penalty.According to our experiments on representative datasets,ANNS-AMP achieves 163.76x,10.57x,and 2.06x performance speedups on average,and reduces average energy consumption by 1100.00x,39.41x,and 6.66x compared to CPU,GPU,and customized ANNS accelerator baselines,respectively,while maintaining accuracy loss below 2.7%.These results demonstrate that adaptive mixed-precision computing is a promising direction for efficient large-scale ANNS.

URL PDF HTML ☆

赞 0 踩 0

2606.07152 2026-06-08 cs.NE cs.SC 新提交

A Data-Free Symbolic Regression Approach for Solving Equations

一种无数据的符号回归方法用于求解方程

Sergei Garmaev, Vinay Sharma, Olga Fink

AI总结提出符号方程求解器（SES），将方程求解转化为可微符号模型的优化问题，无需配对数据，直接从方程和边界条件构建目标函数，恢复显式符号解。

详情

AI中文摘要

当前科学中出现的许多方程无法通过现有解析技术求解，因此采用数值方法求解，而不产生显式符号表达式。现有的符号回归方法可以恢复符号表达式，但需要从底层过程获取训练数据，而不仅仅是控制方程。我们提出了符号方程求解器（SES），这是一个将方程求解表述为可微符号模型上的优化问题的框架。SES 从方程以及初始或边界条件构建其目标函数，消除了对配对输入-输出数据的需求。学习到的模型以显式符号形式表达，便于进一步分析。我们在代表性的代数和微分方程上评估了 SES，包括一个代数方程组、一个具有超越项的方程、一个常微分方程以及具有不同初始或边界条件的偏微分方程。在这些设置中，SES 恢复了与相应解析解匹配的紧凑符号表达式。

英文摘要

Many equations arising in science currently cannot be solved by available analytical techniques and are therefore solved numerically, without yielding explicit symbolic expressions. Existing symbolic regression approaches can recover symbolic expressions, but require training data obtained from the underlying process, rather than the governing equation alone. We propose the Symbolic Equation Solver (SES), a framework that formulates equation solving as an optimization problem over differentiable symbolic models. SES constructs its objective from the equation together with initial or boundary conditions, eliminating the need for paired input-output data. The learned model is expressed in explicit symbolic form, enabling further analysis. We evaluate SES on representative algebraic and differential equations, including a system of algebraic equations, an equation with transcendental terms, an ordinary differential equation, and partial differential equations with different initial or boundary conditions. Across these settings, SES recovers compact symbolic expressions that match the corresponding analytical solutions.

URL PDF HTML ☆

赞 0 踩 0

2606.07148 2026-06-08 cs.DB 新提交

Efficient $(α,β)$-core Computation and On-the-fly Query at Billion Scale with GPUs

高效的 $(α,β)$-核计算及十亿规模GPU在线查询

Qingshuai Feng, Shunyang Li, Kai Wang, Xuemin Lin, Kongzhang Hao, Long Yuan

AI总结提出基于GPU的无索引剥离算法GCC和GCC+，以及连通性感知算法GFQ，实现大规模二分图上的(α,β)-核高效计算与在线查询。

Comments 10 pages, 8 figures

详情

AI中文摘要

在二分图中，$(\alpha,\beta)$-核是一种广泛用于凝聚子图挖掘的模型。具体而言，一个$(\alpha,\beta)$-核是一个最大子图，其中上层每个顶点的度数至少为$\alpha$，下层每个顶点的度数至少为$\beta$。最先进的基于CPU的解决方案需要为所有$\alpha$和$\beta$组合构建索引结构，成本高昂，导致在大规模二分图上存在可扩展性挑战。此外，在线查询旨在判断边更新是否属于目标$(\alpha,\beta)$-核，对于欺诈监控和推荐系统等实时应用至关重要。然而，现有的基于索引的方法由于维护开销高，难以支持大规模下的此类查询。在本文中，我们研究如何利用GPU架构实现高效的$(\alpha,\beta)$-核计算并支持在线查询。虽然GPU被广泛用于加速图处理，但其有限的内存容量使得存储大型索引结构不切实际。为解决此问题，我们提出GCC，一种无索引的基于GPU的剥离算法，通过以warp为中心的处理加速$(\alpha,\beta)$-核计算。为进一步提高效率，我们开发了GCC+，利用$(\alpha,\beta)$-核的嵌套性质，采用基于核的早期剪枝策略。为处理在线查询，我们提出GFQ，一种连通性感知算法，通过利用连通分量信息显著缩小计算范围，从而避免全图剥离。在11个数据集上的大量实验表明，我们提出的技术在空间和时间效率上均优于现有的基于CPU的解决方案。

英文摘要

In bipartite graphs, $(α,β)$-core is a widely used model for cohesive subgraph mining. Specifically, an $(α,β)$-core is a maximal subgraph in which each vertex in the upper layer has degree at least $α$, and each vertex in the lower layer has degree at least $β$. The state-of-the-art CPU-based solutions incur extensive costs to construct an index structure for all $α$ and $β$ combinations, leading to scalability challenges on large bipartite graphs. Moreover, on-the-fly queries, which aim to determine whether an edge update belongs to a target $(α,β)$-core, are essential for real-time applications such as fraud monitoring and recommendation systems. However, existing index-based methods struggle to support such queries at scale due to their high maintenance overhead. In this paper, we investigate how to leverage GPU architectures to enable efficient $(α,β)$-core computation and support on-the-fly queries. While GPUs are widely used to accelerate graph processing, their limited memory capacity makes it impractical to store large index structures. To address this issue, we propose GCC, an index-free GPU-based peeling algorithm that accelerates $(α,β)$-core computation via warp-centric processing. To further improve efficiency, we develop GCC+, which leverages the nested property of $(α,β)$-core with a core-based early pruning strategy. For handling on-the-fly queries, we propose GFQ, a connectivity-aware algorithm that significantly narrows the computation scope by leveraging connected component information, thereby avoiding full-graph peeling. Extensive experiments on 11 datasets demonstrate that our proposed techniques outperform existing CPU-based solutions in terms of both space and time efficiency.

URL PDF HTML ☆

赞 0 踩 0

2606.07110 2026-06-08 cs.DM 新提交

Entanglement from Expansion: High Rank-Width in Deterministic Graphs

纠缠源于扩展：确定性图中的高秩宽

Tristan Cam, Cyril Gavoille, Yvan Le Borgne, Simon Martiel

AI总结本文通过边扩展推导正则图秩宽的下界，结合边等周不等式与强色指数等方法，证明确定性图族可达最大秩宽Θ(n)，填补了秩宽大于Θ(√n)的确定性图族空白。

详情

AI中文摘要

量子图态中的纠缠与秩宽（Oum和Seymour引入的图复杂度度量）内在相关。本文通过发展一种通用方法，从正则图的边扩展推导其秩宽下界，从而能够在恒定深度下制备最大纠缠的确定性图态。通过将边等周不等式与强色指数以及Jelínek的下界方法相结合，我们系统地建立了笛卡尔积（包括超立方体、Hamming图和网格）的秩宽下界。利用布尔函数分析扩展该框架，通过Kahn-Kalai-Linial定理的推广，我们以非平凡的对数因子加强了所有笛卡尔积的界。这些方法发现了$n$个顶点上确定性图族具有可证明的最大秩宽Θ(n)。我们的结果填补了文献中秩宽大于Θ(√n)的确定性图族之前的空白。

英文摘要

Entanglement in quantum graph states is intrinsically linked to rank-width, a graph complexity measure introduced by Oum and Seymour. In this work, we enable the preparation of maximally entangled deterministic graph states in constant depth by developing a general method to derive lower bounds on the rank-width of regular graphs from their edge expansion. By bridging edge-isoperimetric inequalities with the strong chromatic index and Jelínek's approach for lower bounding cut-rank, we systematically establish lower bounds for the rank-width of Cartesian products, including hypercubes, Hamming graphs, and grids. Extending this framework via Boolean function analysis, using a generalization of the Kahn-Kalai-Linial's Theorem, we strengthen the bounds for all Cartesian products by a non-trivial logarithmic factor. These methods result in the discovery of deterministic families of graphs on $n$ vertices with a provably maximum rank-width $Θ(n)$. Our results fill the previous gap in the literature for deterministic graph families of rank-width greater than $Θ(\sqrt{n})$.

URL PDF HTML ☆

赞 0 踩 0

2606.07101 2026-06-08 cs.HC 新提交

CANote: Empowering Fact-checking Note Writing Through Scaffolded and Provenance-based Human-AI Collaboration

CANote: 通过支架式和基于来源的人机协作增强事实核查笔记撰写

Shuning Zhang, Jingruo Chen, Yuwei Chuai, Dai Shi, Yifan Wang, Xin Yi, Hewu Li

AI总结提出CANote系统，通过子主张提取、证据链接和结构化草稿辅助用户撰写高质量辟谣笔记，显著提升非专家用户的笔记质量至专家水平。

详情

AI中文摘要

众包事实核查机制，如X的社区笔记，在减轻错误信息传播方面发挥着关键作用。然而，撰写高质量、基于证据的辟谣笔记给贡献者带来了沉重负担。我们提出了CANote，一个AI辅助的辟谣笔记撰写系统，具有证据关联和结构化协同起草功能。CANote通过从社交媒体帖子中提取子主张、通过子主张与检索到的证据之间的显式链接提供来源，并生成中立的结构化草稿来支持人类推理，从而为工作流程提供支架。我们在模拟X平台上对CANote与手动撰写（N=52名事实核查员，N=52名普通用户）进行了评估，发现CANote显著提高了笔记质量。值得注意的是，CANote使普通用户能够写出与专家撰写的笔记质量相当的笔记。虽然任务完成时间和感知认知负荷与手动起草相当，但CANote显著提高了用户满意度。然而，这种辅助引入了一种权衡，导致用户对辟谣笔记的所有感和控制感降低。

英文摘要

Crowdsourced fact-checking mechanisms, such as X's Community Notes, play a critical role in mitigating the spread of misinformation. However, drafting high-quality, evidence-based debunking notes imposes a substantial burden on contributors. We present CANote, an AI-assisted debunking note writing system featuring evidence correlation and structured co-drafting. CANote scaffolds the workflow by extracting subclaims from social media posts, providing provenance through explicit links between subclaims and retrieved evidence, and generating neutral, structural drafts to support human reasoning. We evaluated CANote against manual writing (N=52 fact-checkers, N=52 lay users) on simulated X platform, where we found CANote significantly improves note quality. Notably, CANote enables lay users to write notes that have comparable quality to those written by experts. While the task completion time and perceived cognitive load remain comparable to manual drafting, CANote significantly increases user satisfaction. However, this assistance introduces a trade-off, resulting in a reduced sense of user ownership and control over the debunking note.

URL PDF HTML ☆

赞 0 踩 0