arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.09800 2026-06-09 cs.SE cs.AI cs.MA 新提交

FASE: Fast Adaptive Semantic Entropy for Code Quality

FASE: 用于代码质量的快速自适应语义熵

Shizhe Lin, Ladan Tahvildari

发表机构 * University of Waterloo（滑铁卢大学）

AI总结提出快速自适应语义熵（FASE），通过最小生成树近似功能正确性，在HumanEval和BigCodeBench上相比现有语义熵方法在Spearman相关性和ROCAUC上分别提升25%和19%，且计算开销仅为传统方法的0.3%。

详情

AI中文摘要

多智能体代码生成通过模拟人类软件工程生命周期，为自主软件开发提供了一种有前景的范式。然而，系统可靠性仍然受到LLM幻觉和跨交互智能体错误传播的阻碍。虽然语义熵提供了一种无需真实答案即可量化不确定性的原则性方法，但当前方法通常依赖于成本高昂的LLM驱动的等价性检查。在这项工作中，我们引入了快速自适应语义熵（FASE），这是一种基于结构和语义不相似图的最小生成树来近似功能正确性的新型度量。在HumanEval和BigCodeBench上的评估表明，FASE优于通过LLM蕴含的最先进语义熵，在使用Qwen3-Embedding-8B模型时，与基于真实测试用例的Pass@1相比，Spearman相关性平均提升25%，ROCAUC分数提升19%。此外，通过消除成本高昂的LLM驱动的等价性评估，FASE的计算开销可忽略不计，其运行成本仅为传统语义熵方法的约0.3%。这些结果使FASE成为优化现实世界多智能体工作流中不确定性量化的实用且经济高效的解决方案。

英文摘要

Multi-agent code generation offers a promising paradigm for autonomous software development by simulating the human software engineering lifecycle. However, system reliability remains hindered by LLM hallucinations and error propagation across interacting agents. While semantic entropy provides a principled way to quantify uncertainty without ground-truth answers, current methods often rely on costly LLM-driven equivalence checks. In this work, we introduce Fast Adaptive Semantic Entropy (FASE), a novel metric that approximates functional correctness based on the minimum spanning tree of structural and semantic dissimilarity graphs. Evaluations on HumanEval and BigCodeBench demonstrate that FASE outperforms state-of-the-art semantic entropy by LLM entailment, achieving a 25% average improvement in Spearman correlation and a 19% increase in ROCAUC score against Pass@1 from ground-truth test cases when using the Qwen3-Embedding-8B model. Furthermore, by eliminating costly LLM-driven equivalence evaluation, FASE incurs negligible computational overhead, requiring only approximately 0.3% of the runtime cost of traditional semantic entropy approaches. These results position FASE as a practical, cost-effective solution for optimizing uncertainty quantification in real-world multi-agent workflows.

URL PDF HTML ☆

赞 0 踩 0

2606.09778 2026-06-09 quant-ph cs.AI 新提交

Who Earns the Safety? Intervention-Aware Quantum Predictive Control with Safety Attribution

谁赢得了安全？具有安全归因的干预感知量子预测控制

Yifan Wang

发表机构 * Yifan Wang（王一帆）

AI总结提出干预感知变分量子可微预测控制（IA-VQC-DPC），通过原始-对偶干预预算和安全性归因协议，量化并提升量子策略的固有安全性，避免保护层掩盖策略缺陷。

Comments 7 pages, 4 figures

详情

AI中文摘要

硬安全过滤器越来越多地部署在学习控制器的下游，以保证运行时约束满足。然而，一个从不违反约束的过滤控制器可能仍然没有学到任何关于安全性的知识：过滤器可以静默地修复一个不称职的上游策略，使得过滤后的成功衡量的是过滤器，而不是策略。我们认为，安全策略学习应该问谁赢得了安全——策略还是其保护层——并且我们使这个问题可测量。我们引入了干预感知变分量子可微预测控制（IA-VQC-DPC），它（i）在原始-对偶干预预算下训练一个紧凑的变分量子电路（VQC）策略，该预算惩罚对可微控制障碍函数（CBF）投影的依赖，并且（ii）通过一个安全性归因协议进行评估，该协议将执行轨迹修正分解为CBF项和部署运行时保护项，并通过关闭保护评估对策略进行压力测试。在闭环、高保真BOPTEST建筑控制模拟器上（5个种子，每种方法60个回合），干预感知训练显著降低了量子策略的原始预过滤违规和总安全层依赖（两者p < 10^-4），且没有显著的能耗回归；在约400个参数的相同预算下，量子策略比匹配的经典策略显著更安全、更舒适。关闭保护评估证实了改进是策略层面的，并揭示了一个有价值的负面结果：一个学习的可微能量头只有与分布感知的运行时保护配对时才安全。该归因协议在量子策略和建筑之外具有通用性。

英文摘要

Hard safety filters are increasingly placed downstream of learned controllers to guarantee constraint satisfaction at run time. Yet a filtered controller that never violates a constraint may still have learned nothing about safety: the filter can silently repair an incompetent upstream policy, so that post-filter success measures the filter, not the policy. We argue that safe policy learning should ask who earns the safety - the policy or its protective layers - and we make this question measurable. We introduce Intervention-Aware Variational Quantum Differentiable Predictive Control (IA-VQC-DPC), which (i) trains a compact variational quantum circuit (VQC) policy under a primal-dual intervention budget that penalizes reliance on a differentiable Control-Barrier-Function (CBF) projection, and (ii) is evaluated with a safety-attribution protocol that decomposes the executed-trajectory correction into a CBF term and a deployment runtime-guard term, and stress-tests the policy with guard-off evaluation. On closed-loop, high-fidelity BOPTEST building-control emulators (5 seeds, 60 episodes per method), intervention-aware training significantly lowers the quantum policy's raw pre-filter violation and total safety-layer reliance (both p < 10^-4) with no significant energy regression; at an equal approximately 400-parameter budget the quantum policy is significantly safer and more comfortable than a matched classical policy. Guard-off evaluation confirms the improvement is policy-level and exposes a valuable negative result: a learned differentiable energy head is only safe when paired with a distribution-aware runtime guard. The attribution protocol is general beyond quantum policies and buildings.

URL PDF HTML ☆

赞 0 踩 0

2606.09734 2026-06-09 quant-ph cs.LG 新提交

Adaptive directional gradients for parameterised quantum circuits

参数化量子电路的自适应方向梯度

Brian Coyle, Snehal Raj, Virag Umathe, El Amine Cherrat, Elham Kashefi

发表机构 * School of Informatics, University of Edinburgh（爱丁堡大学信息学院）； Fujitsu Research of Europe Ltd.（富士通欧洲有限公司）； LIP6, CNRS, Sorbonne Université（LIP6研究所，法国国家科学研究中心，索邦大学）； QC Ware ； Quantum Signals（量子信号）

AI总结提出基于前向自动微分的参数化量子电路梯度估计框架，通过平均随机方向导数得到无偏梯度，并导出自适应优化器QUIVER，在多达1770个参数的问题上比参数平移规则效率提升数个数量级。

Comments 37 pages, 13 figures

详情

AI中文摘要

在量子硬件上训练参数化量子电路（PQC）的瓶颈在于梯度估计的测量成本，在参数平移规则下，该成本与可训练参数数量呈线性关系，并主导了大规模训练的总预算。本文提出了一种基于前向自动微分模式的PQC前向梯度估计器框架，通过平均自由可调数量的随机方向导数得到梯度的无偏估计，并恢复SPSA、随机坐标下降和参数平移规则作为极限情况，无需辅助量子比特或受控门开销。我们证明随机量子前向梯度下降在标准假设下收敛，并给出了显式的二阶矩展开，该展开在SPSA的单方向极端和参数平移的全梯度极端之间插值。在该框架内，我们推导出QUIVER（量子迭代自适应估计器规则），这是一种参数化电路的自适应优化器，其更新规则遵循闭式最小测量成本分配。数值结果表明，在ECG5000和MNIST数据集上，前向梯度训练具有多达60个量子比特和1770个参数的汉明权重保持正交量子神经网络，比参数平移规则效率高数个数量级。我们还证明，我们提出的QUIVER优化器在使用量子近似优化算法和变分量子特征求解器的优化问题上，可以优于iCANS和gCANS等节省测量的优化器。

英文摘要

Training parameterised quantum circuits (PQCs) on quantum hardware is bottlenecked by the measurement cost of gradient estimation, which under the parameter-shift rule scales linearly in the number of trainable parameters and dominates the total shot budget of training at scale. In this work, we propose a framework of forward gradient estimators for PQCs, based on the forward mode of automatic differentiation, that yields an unbiased estimator of the gradient by averaging a freely tunable number of random directional derivatives and recovers SPSA, random coordinate descent, and the parameter-shift rule as limiting cases, with no ancilla qubits or controlled-gate overhead. We prove that stochastic quantum forward gradient descent converges under standard assumptions, with an explicit second-moment expansion that interpolates between the single-direction extreme of SPSA and the full-gradient extreme of parameter-shift. Within this framework we derive QUIVER (Quantum Iterative V-adaptive Estimator Rule), an adaptive optimiser for parameterised circuits whose update rule follows from a closed-form minimum measurement-cost allocation. We show numerically that forward gradients train Hamming-weight-preserving orthogonal quantum neural networks with up to 60 qubits and 1770 parameters on the ECG5000 and MNIST datasets orders of magnitude more efficiently than the parameter-shift rule. We also demonstrate that our proposed QUIVER optimiser can outperform iCANS and gCANS measurement-frugal optimisers on optimisation problems using the quantum approximate optimisation algorithm and quantum simulation with the variational quantum eigensolver.

URL PDF HTML ☆

赞 0 踩 0

2606.09700 2026-06-09 cs.CR cs.HC cs.LG 新提交

What the Eyes See, the LLMs Miss: Exploiting Human Perception for Adversarial Text Attacks

眼睛所见，大语言模型所不见：利用人类感知进行对抗性文本攻击

Qin Yang, Lu Malloy, Joshua Lee, Xiaohan Chang, Meisam Mohammady, Doowon Kim, Yuan Hong

发表机构 * University of Connecticut（康涅狄格大学）； University of Tennessee（田纳西大学）； University of California, Santa Barbara（加州大学圣芭芭拉分校）； Iowa State University（爱荷华州立大学）

AI总结针对LLM内容审核系统忽视人类视觉线索的缺陷，提出人类可感知对抗攻击（HPAA），通过排版操纵嵌入有害内容，在仅三次查询下实现86%人类识别率而机器检测率低于1%。

Comments This work has been accepted for publication at USENIX Security 2026. This paper includes examples of harmful, hateful, or abusive language for research purposes. Reader discretion is advised

详情

AI中文摘要

基于大型语言模型（LLM）的内容审核系统已成为对抗有害在线内容的关键防线。然而，这些系统主要基于分词文本运行，很大程度上忽略了人类在解释内容时自然依赖的视觉线索。我们表明，这种差异造成了根本性的感知不匹配：人类容易识别为有害的内容，对自动审核系统而言可能变得几乎不可见。为研究这一漏洞，我们引入了一类人类可感知对抗攻击（HPAA），其中有害表达通过视觉上显著的排版操纵嵌入到原本良性的文本中。我们的关键洞察是，排版特征（包括间距、视觉强调和空间排列）可以策略性地组合，以保留人类对有害内容的识别，同时大幅降低机器可检测性。在黑盒设置下，仅使用少量查询预算，我们的攻击自动生成规避内容，无需模型访问或梯度信息。我们在多个数据集和十个已部署的审核系统（包括商业API和最先进的开源防护）上评估了该攻击。结果揭示了人类与机器感知之间的显著差距：仅使用三次检测器查询，生成的攻击在评估系统中实现了超过86%的人类识别率，同时检测率低于1%。我们进一步进行消融研究，以识别驱动成功规避的排版因素，分析当前审核架构为何无法捕捉这些信号，并讨论实际防御措施。我们的发现暴露了当今基于LLM的审核生态系统中的根本盲点，并强调了需要以更符合人类感知理解的方式推理内容的审核系统。

英文摘要

Large language model (LLM)-powered content moderation systems have become a critical defense against harmful online content. However, these systems primarily operate on tokenized text and largely ignore the visual cues that humans naturally rely on when interpreting content. We show that this discrepancy creates a fundamental perceptual mismatch: content that is readily recognized as harmful by humans can become effectively invisible to automated moderation systems. To study this vulnerability, we introduce a class of Human-Perceptible Adversarial Attacks (HPAA), in which harmful expressions are embedded into otherwise benign text through visually salient typographic manipulations. Our key insight is that typographic features, including spacing, visual emphasis, and spatial arrangement, can be strategically combined to preserve human recognition of harmful content while substantially reducing machine detectability. Operating in black-box settings with only a small query budget, our attack automatically generates evasive content without requiring model access or gradient information. We evaluate the attack across multiple datasets and ten deployed moderation systems, including commercial APIs and state-of-the-art open-source guardrails. Results reveal a striking gap between human and machine perception: with only three detector queries, generated attacks achieve over 86\% human recognition while maintaining detection rates below 1\% across the evaluated systems. We further conduct ablation studies to identify the typographic factors driving successful evasion, analyze why current moderation architectures fail to capture these signals, and discuss practical defenses. Our findings expose a fundamental blind spot in today's LLM-based moderation ecosystem and highlight need for moderation systems that reason about content in a manner more consistent with human perceptual understanding.

URL PDF HTML ☆

赞 0 踩 0

2606.09692 2026-06-09 cs.CR cs.AI 新提交

Observability for Delegated Execution in Agentic AI Systems

自主AI系统中委托执行的可观测性

Abhinav Mishra, Kumar Sharad

发表机构 * Splunk ； Cisco Inc（思科公司）

AI总结针对基于LLM的自主系统中委托执行轨迹难以归因的问题，提出一种轻量级网关和通用信息模型，在运行时绑定委托上下文，实现跨工具委托范围的可靠重建和直接取证查询。

详情

AI中文摘要

委托范围的执行无法从标准可观测性中识别：审计日志和执行轨迹在多个不兼容的委托分配下可能完全相同。这一差距在基于LLM的自主系统中尤为严重，其中代理动态选择工具、针对相同指令的执行序列在不同运行中变化，并生成协作子代理。这些动态使轨迹碎片化和交错，使得仅从因果结构进行委托范围重建在结构上欠定。尽管单个操作被授权和记录，现有审计、追踪和安全模式缺乏语义来重建在异构系统中给定委托下发生的操作。我们关注委托范围的归因和访问/共享足迹重建，而非意图推断或推理重建。我们提出一种代理感知的可观测性基础，包括轻量级网关和通用信息模型，在运行时绑定委托上下文。这实现了跨工具委托范围的重建和直接取证查询，无需启发式时间窗口关联。

英文摘要

Delegation-scoped execution is not identifiable from standard observables: audit logs and execution traces can be identical under multiple incompatible delegation assignments. This gap is especially acute in LLM-based agentic systems, where agents dynamically select tools, vary execution sequences across runs for the same instruction, and spawn cooperating sub-agents. These dynamics fragment and interleave traces, making delegation-scoped reconstruction from causal structure alone structurally underdetermined. Although individual actions are authorized and logged, existing audit, tracing, and security schemas lack the semantics to reconstruct what actions occurred under a given delegation across heterogeneous systems. We focus on delegation-scoped attribution and access/share footprint reconstruction, not intent inference or reasoning reconstruction. We present an agent-aware observability substrate consisting of a lightweight gateway and a common information model that binds delegation context at execution time. This enables reliable cross-tool delegation-scoped reconstruction and direct forensic queries without heuristic time-window correlation.

URL PDF HTML ☆

赞 0 踩 0

2606.09686 2026-06-09 cs.AR cs.AI cs.MS cs.NA cs.PF math.NA 新提交

An 84-Format Numeric Catalog with Bit-Exact Conformance Vectors: A Vendor-Neutral Reference for FP8, BF16, MXFP4, and Microscaling Formats

84种数值格式的位精确一致性向量目录：FP8、BF16、MXFP4和微缩放格式的厂商中立参考

Dmitrii Vasilev

发表机构 * Trinity S 3 AI

AI总结针对机器学习硬件中数值格式激增问题，本文构建了涵盖13个家族84种格式的目录，提供6个位精确一致性包和IEEE P3109映射，作为厂商中立的参考基准。

Comments 17 pages. Source repository: https://github.com/gHashTag/paper3-methodology tag v4.0-trinity. Paper CC BY 4.0; code MIT. ORCID 0009-0008-4294-6159

详情

AI中文摘要

机器学习硬件中数值格式的激增——FP8（E4M3和E5M2）、BF16、MXFP4、微缩放块格式以及数十种研究变体——已经超过了厂商中立、位精确参考材料的可用性。工程师在跨加速器移植模型时遇到难以诊断的静默分歧，而缺乏共享的标尺。本文描述了一个涵盖13个家族84种数值格式的目录，一套包含GF16、MXFP4元素、BF16、FP8 E4M3、FP8 E5M2和E8M0块规模的6个位精确一致性包，以及一个IEEE P3109 v3.2.0交叉映射，将每个包映射到其对应的标准轨道配置格式。每个包是一个自包含的JSON文档，带有SHA-256指纹、共享行模式和一个锚向量，该向量编码3.0——恒等式phi^2 + 1/phi^2 = 3——作为跨包完整性检查。这些包已针对ml_dtypes 0.5.4（Google/JAX）进行交叉验证；任何差异都被明确记录，并解释为规范允许的解释差距，而非隐藏。这项工作被框架为注册表填充：它不提出新格式、不做模型精度声明，也不声称优于任何供应商的实现。所有工件均在开放许可下公开获取于https://github.com/gHashTag/t27。

英文摘要

Numeric format proliferation in machine learning hardware -- FP8 (E4M3 and E5M2), BF16, MXFP4, microscaling block formats, and dozens of research variants -- has outpaced the availability of vendor-neutral, bit-exact reference material. Engineers porting models across accelerators encounter silent divergences that are difficult to diagnose without a shared ruler. This paper describes a catalog of 84 numeric formats spanning 13 families, a suite of six bit-exact conformance packs covering GF16, MXFP4 element, BF16, FP8 E4M3, FP8 E5M2, and E8M0 block scale, and an IEEE P3109 v3.2.0 cross-walk that maps each pack to its corresponding standards-track configured format. Each pack is a self-contained JSON document with a SHA-256 fingerprint, a shared row schema, and an anchor vector that encodes 3.0 -- the identity phi^2 + 1/phi^2 = 3 -- as a cross-pack sanity check. Packs are cross-validated against ml_dtypes 0.5.4 (Google/JAX); any divergence is documented explicitly and interpreted as a spec-permitted interpretation gap rather than hidden. The work is framed as registry filling: it does not propose new formats, make model-accuracy claims, or assert superiority over any vendor's implementation. All artifacts are publicly available at https://github.com/gHashTag/t27 under an open license.

URL PDF HTML ☆

赞 0 踩 0

2606.09667 2026-06-09 eess.AS cs.CL cs.SD 新提交

Cross-Modal Masking for Robust Silent Speech Synthesis Using sEMG and Lipreading

基于sEMG和唇读的鲁棒无声语音合成的跨模态掩蔽

Eder del Blanco, David Gimeno-Gómez, Eva Navas, Carlos-D. Martínez-Hinarejos, Inma Hernáez

发表机构 * Aholab research group within the HiTZ Center at University of the Basque Country (UPV/EHU)（巴斯克大学HiTZ中心内Aholab研究组）； PRHLT research center, Universitat Politècnica de València (UPV)（瓦伦西亚理工大学PRHLT研究中心）

AI总结提出掩蔽多模态语音合成框架，联合表面肌电图和唇读信号，通过训练时模态掩蔽提升鲁棒性，在多说话人设置下词错误率降低14个百分点。

Comments 12 pages, 7 figures and 6 tables. Submitted to Transactions on Audio, Speech and Language Processing

详情

AI中文摘要

通过无声语音接口进行语音恢复已成为针对喉部发声受损或缺失个体的有前景的辅助技术。在非侵入式无声语音接口模态中，表面肌电图和基于视频的唇读提供了互补的发音信息，然而它们用于连续语音合成的集成仍未被充分探索。此外，现有的多模态方法很少考虑对模态退化或临时传感器故障的鲁棒性，限制了它们在现实场景中的适用性。在这项工作中，我们提出了一种掩蔽多模态语音合成框架，通过在训练期间进行模态掩蔽来联合利用表面肌电图和唇读信号。在多说话人设置下，与最强的单模态基线相比，所提出的方法将词错误率降低了多达14个绝对百分点。实验结果不仅表明掩蔽策略对于这些性能提升和低比特率条件下的鲁棒性至关重要，而且表明在模态缺失情况下，它们比针对退化的数据增强具有更好的泛化能力。音素级分析进一步揭示了跨模态的互补贡献，对元音和特定辅音组尤其有益。总体而言，这些发现证明了掩蔽多模态集成用于无声语音合成的有效性和鲁棒性，尽管适应喉切除说话者仍是一个开放的研究挑战。

英文摘要

Speech restoration through silent speech interfaces (SSIs) has emerged as a promising assistive technology for individuals with impaired or absent laryngeal voice production. Among non-invasive SSI modalities, surface electromyography (sEMG) and video-based lipreading provide complementary articulatory information, yet their integration for continuous speech synthesis remains underexplored. Moreover, existing multimodal approaches rarely address robustness to modality degradation or temporary sensor failure, limiting their applicability in realistic scenarios. In this work, we propose a masked multimodal speech synthesis framework that jointly leverages sEMG and lipreading signals through modality masking during training. Under multispeaker settings, the proposed approach reduces word error rate by up to 14 absolute percentage points compared to the strongest unimodal baseline. Experimental results not only show that masking strategies are critical for these performance gains and robustness under low-bitrate conditions, but also that they generalize better than degradation-specific data augmentations in the presence of modality absence conditions. Phone-level analyses further reveal complementary contributions across modalities, with particularly strong benefits for vowels and for specific consonant groups. Overall, these findings demonstrate the effectiveness and robustness of masked multimodal integration for silent speech synthesis, although adaptation to laryngectomized speakers remains an open research challenge.

URL PDF HTML ☆

赞 0 踩 0

2606.09648 2026-06-09 cs.DB cs.AI 新提交

ArtiFact: A Large-Scale Multi-Modal Cultural Heritage Dataset

ArtiFact: 大规模多模态文化遗产数据集

Luciano Duarte, Olga Ovcharenko, Sebastian Schelter

发表机构 * BIFOLD & TU Berlin（BIFOLD与柏林技术大学）

AI总结提出包含65万条博物馆记录的多模态文化遗产数据集ArtiFact，用于跨模态错误检测和语义查询处理，揭示现有系统在领域特定错误和文化语义查询上的挑战。

Comments Preprint

详情

AI中文摘要

多模态数据管理已成为数据库社区的核心研究课题，涵盖数据集成、语义查询处理和数据质量评估。尽管兴趣日益增长，但社区缺乏结合表格、文本和图像的大规模真实世界数据集。我们提出ArtiFact，一个多模态文化遗产数据集，包含从大都会艺术博物馆、芝加哥艺术学院和荷兰国立博物馆收集的651045条博物馆记录。我们通过两个下游任务展示了ArtiFact的实用性。对于跨模态错误检测，我们引入了一个精心策划的七类错误分类法，注入到130209条记录中，并表明可靠检测细微领域特定错误（如材料时代错位和时间偏移）仍然是一个开放挑战。对于语义查询处理，我们表明当前系统在处理涉及文化邻近性、模糊对象类型和历史依赖术语的查询时存在困难。我们的结果将ArtiFact定位为多模态数据管理研究的一个具有挑战性的基准。

英文摘要

Multi-modal data management has emerged as a central research topic in the database community, spanning data integration, semantic query processing, and data quality assessment. Despite this growing interest, the community lacks large-scale, real-world datasets combining tables, text, and images. We present ArtiFact, a multi-modal cultural heritage dataset of 651045 museum records collected from the Metropolitan Museum of Art, the Art Institute of Chicago, and the Rijksmuseum. We demonstrate the utility of ArtiFact through two downstream tasks. For cross-modal error detection, we introduce a curated taxonomy of seven error categories injected into 130209 records and show that reliably detecting subtle domain-specific errors such as material anachronisms and temporal shifts remain an open challenge. For semantic query processing, we show that current systems struggle with queries involving cultural proximity, ambiguous object types, and historically contingent terminology. Our results position ArtiFact as a challenging benchmark for multi-modal data management research.

URL PDF HTML ☆

赞 0 踩 0

2606.09643 2026-06-09 cs.DC cs.AI cs.LG cs.OS 新提交

FMplex: Model Virtualization for Serving Extensible Foundation Models

FMplex: 用于服务可扩展基础模型的模型虚拟化

Hetvi Shastri, Pragya Sharma, Walid A. Hanafy, David Irwin, Mani Srivastava, Prashant Shenoy

发表机构 * University of Massachusetts Amherst（马萨诸塞大学阿姆赫斯特分校）； University of California Los Angeles（加州大学洛杉矶分校）

AI总结提出FMplex系统，通过将基础模型作为虚拟化层实现多任务共享，结合批感知公平队列调度器，在7个基础模型和92个下游任务上降低延迟达80%，提升任务容量6倍。

详情

AI中文摘要

基础模型（FMs）越来越多地被用作语言、视觉、时间序列和多模态应用的下游任务骨干。然而，现有的模型服务系统将每个定制任务部署为独立的模型实例，从而复制了重型骨干，浪费了加速器内存，并失去了摊销批处理和加载成本的机会。本文提出了FMplex，一个将FM骨干视为部署共享的虚拟化层的服务系统。FMplex为每个任务提供一个虚拟基础模型（vFM），这是一个由共享物理FM支持的逻辑私有FM实例。这种抽象允许独立定制的任务共享一个骨干，同时保留任务特定的扩展、独立生命周期和任务级隔离。此外，我们提出了一种批感知公平队列调度器，该调度器结合了加权任务级共享以及跨共存任务的批内和批间批处理。我们实现了一个基于FMplex的服务栈，涵盖任务构建、共享感知部署和运行时执行。在7个FM骨干（16个变体）和92个下游任务上，FMplex相比空间分区延迟降低高达80%，相比尽力而为共置延迟降低33.3%，同时在集群规模上可托管多达6倍的任务。

英文摘要

Foundation models (FMs) are increasingly used as backbones for downstream tasks across language, vision, time-series, and multimodal applications. Yet existing model-serving systems deploy each customized task as an independent model instance, thereby replicating heavyweight backbones, wasting accelerator memory, and losing opportunities to amortize batching and loading costs. This paper presents FMplex, a serving system that treats FM backbones as a virtualization substrate for deployment sharing. FMplex presents each task with a virtual foundation model (vFM), a logically private FM instance backed by a shared physical FM. This abstraction lets independently customized tasks share a backbone while preserving task-specific extensions, independent lifecycles, and task-level isolation. In addition, we propose a batch-aware fair-queueing scheduler that combines weighted task-level sharing with inter- and intra-task batching across colocated tasks. We implement a FMplex-based serving stack spanning task construction, sharing-aware deployment, and runtime execution. Across 7 FM backbones (16 variants) and 92 downstream tasks, FMplex reduces latency by up to 80% over spatial partitioning and 33.3% over best-effort co-location, while hosting up to 6x more tasks at cluster scale.

URL PDF HTML ☆

赞 0 踩 0

2606.09617 2026-06-09 math.OC cs.AI cs.CY cs.SY eess.SY 新提交

Powering the Future of AI: Navigating the Trade-offs for Europe's Energy Transition and Net-Zero Goals

赋能AI未来：应对欧洲能源转型与净零目标的权衡

Mohammad Hemmati, Gbemi Oluleye, Vassilis M. Charitopoulos

发表机构 * Department of Chemical Engineering, Sargent Centre for Process Systems Engineering, University College London (UCL)（化学工程系、过程系统工程中心、伦敦大学学院（UCL））； Centre for Environmental Policy, Imperial College London（环境政策中心、伦敦帝国理工学院）

AI总结通过21种AI增长情景下的空间优化模型，量化AI对欧洲电力需求、容量、排放和运行的影响，发现AI到2050年可能增加73-723 TWh需求，导致2030-2050年累计排放超调67-181 MtCO2，且AI基础设施选址将更依赖稳定电源和系统灵活性。

详情

AI中文摘要

全球AI的快速扩张导致能源密集型超大规模数据中心激增，使其成为电力系统规划和运行中的结构性挑战。利用覆盖21种AI增长情景的欧洲空间显式优化模型，我们系统量化了数据中心的额外需求、容量要求、排放和运行影响。结果表明，到2050年，AI可能推动73-723 TWh的额外需求，导致2030年至2050年间累计排放超调67-181 MtCO2。我们的分析表明，2030年后，AI基础设施的地理分布将更多地由稳定电源和系统灵活性决定，而非仅仅依赖清洁能源的丰富程度。在中等情景下，AI需要额外200小时的稳定发电，这使关键枢纽的平准化电力成本增加35欧元/兆瓦时。我们表明，即使在悲观情景下，现有基础设施也需要额外70吉瓦的容量，而在受控增长路径下，这一扩张可能达到226吉瓦。我们进一步发现，数据中心的工作负载动态强烈影响能源调度、系统灵活性和排放，而效率提升显著降低了容量需求和系统峰值。虽然我们的研究结果表明2050年净零目标可能实现，但中期可能出现关键排放风险，除非政策适应这一加速的数字转型，否则欧盟可能危及其中性碳目标。

英文摘要

The rapid expansion of AI globally has led to the proliferation of energy-intensive hyperscale data centres (DCs), making them as a structurally challenging component in power system planning and operation. Using a spatially explicit optimisation model of Europe across 21 AI growth scenarios, we systematically quantify additional demand, capacity requirements, emissions, and operational impacts of DCs. Results indicate that AI could drive 73-723 TWh of extra demand by 2050, risking cumulative emissions overshoots of 67-181 MtCO2 between 2030 and 2050. Our analysis indicates that after 2030, the geography of AI infrastructure will be shaped more by firm power and system flexibility than by the mere abundance of clean energy. In moderate scenarios, AI requires an additional of 200 hours of firm generation, which increases LCOE by 35 EUR/MWh in key hubs. We show that even under the pessimistic scenarios, existing infrastructure would require 70 GW additional capacity, while under managed growth pathways, this expansion could reach 226 GW. We further find DCs workload dynamics strongly shape energy dispatch, system flexibility, and emissions, while improved efficiency significantly reduces capacity needs, and system peaks. While our findings suggest that net-zero targets for 2050 may be achieved, critical emission risks may appear in the intermediate years, and the EU may compromise its carbon-neutral goals unless policies adapt to this accelerating digital transformation.

URL PDF HTML ☆

赞 0 踩 0

2606.09589 2026-06-09 cs.CY cs.AI 新提交

I Was Scrolling and Then I Saw a Pregnant Strawberry

我正刷着手机，然后看到了一颗怀孕的草莓

Piera Riccio

发表机构 * University of Amsterdam（阿姆斯特丹大学）

AI总结研究AI迷你剧（水果剧）中性别化叙事与种族化逻辑，指出其通过生成式AI的美学洗白机制掩盖意识形态内容，并分析其对计算创造力的文化影响。

详情

AI中文摘要

AI迷你剧（又称水果剧）是算法分发的生成式AI短视频系列，以拟人化角色为特征，近期在社交媒体平台上成为普遍现象。本文认为，尽管这些视频看似无害的美学，但它们再现了深度性别化的叙事结构，其中女性角色被系统性地与道德越轨、性背叛和生殖能力相关联，且多个情节也编码了种族化的逻辑，即可见的身体差异被赋予道德负荷的过程。借鉴女性主义电影理论、批判种族理论和平台研究，本文进一步认为，这些视频的生成式AI美学——以柔软、圆润和视觉可爱为特征——作为一种美学洗白机制，中和了这些叙事的意识形态重量，并使其在内容审核系统下仍能流通。本文通过个人观察和细读来探讨这些问题，反思生成式AI的具体可供性，这些可供性使这一现象成为可能，并对计算创造力领域产生文化影响。

英文摘要

AI minidramas (also known as fruit dramas) are short, algorithmically distributed generative AI video series featuring anthropomorphized characters that have recently emerged as a widespread phenomenon on social media platforms. This paper argues that despite their seemingly innocuous aesthetic, these videos reproduce deeply gendered narrative structures in which female characters are systematically associated with moral transgression, sexual betrayal, and reproductive capacity, and that several plots also encode the logic of racialization, i.e., the process by which visible bodily difference is morally loaded. Drawing on feminist film theory, critical race theory, and platform studies, it further argues that the generative AI aesthetic of these videos, characterized by softness, roundness, and visual cuteness, functions as a mechanism of aesthetic laundering, neutralizing the ideological weight of these narratives and enabling their circulation despite content moderation systems. This paper approaches these questions through personal observation and close reading, reflecting on the specific affordances of generative AI that make this phenomenon both possible and culturally consequential for the field of computational creativity.

URL PDF HTML ☆

赞 0 踩 0

2606.09587 2026-06-09 cs.HC cs.AI 新提交

Seeing the Hivemind: A Consensus-Aware Interaction Technique for Mitigating AI Homogenization

看见蜂巢思维：一种缓解AI同质化的共识感知交互技术

Muhammad Haris Khan, Joel wester

发表机构 * University of Copenhagen（哥本哈根大学）

AI总结提出语义排斥技术（SRT），通过计算和用户研究证明其能显著提升AI生成内容的语义多样性，减少共识短语，且不损害有用性和连贯性。

Comments In review

详情

AI中文摘要

人们越来越多地使用AI进行写作等创造性任务。虽然采用率持续增长，但这种使用方式有可能在局部削弱个人创造力，并在整体上减少创造性输出的异质性。为此，我们引入了语义排斥技术（SRT），并通过计算评估和一项针对16名经常使用AI进行创造性任务的参与者的研究对其进行了评估。我们的计算评估显示，SRT在不同任务模式下将语义多样性提高了85--167%，同时将共识短语减少了43--95%。在用户研究中，SRT输出获得了更高的有用性（$p = .019$, $W = .208$）和连贯性评分（$p = .006$, $W = .260$）；68.8%的参与者愿意在多个任务中使用SRT-Strong，而基线仅为18.8%。所有系统中原创性和连贯性评分呈正相关（$ρ= +.40$ 到 $+.67$），表明发散性不必以可读性为代价。综合来看，这些初步发现可为设计旨在支持日常创造力而不助长同质化的AI系统提供参考。

英文摘要

People are increasingly using AI for creative tasks such as writing. While adoption continues to grow, this form of use risks undermining individual creativity locally and reducing the heterogeneity of creative output at scale. In response, we introduce the Semantic Repulsion Technique (SRT) and evaluate it both computationally and through a study with 16 participants who regularly use AI for creative tasks. Our computational assessment reveals that SRT increases semantic diversity by 85--167\% while reducing consensus phrases by 43--95\% across task modes. In the user study, SRT outputs received higher usefulness ($p = .019$, $W = .208$) and coherence ratings ( $p = .006$, $W = .260$); 68.8\% of participants were willing to use SRT-Strong for multiple tasks versus 18.8\% for baselines. Originality and coherence ratings were positively correlated across all systems ($ρ= +.40$ to $+.67$), suggesting that divergence need not compromise readability. Taken together, these preliminary findings can inform the design of AI systems that aim to support everyday creativity without contributing to homogenization.

URL PDF HTML ☆

赞 0 踩 0

2606.09558 2026-06-09 q-bio.GN cs.LG 新提交

Integrating gene regulatory priors into Transformer attention with scTransformer for interpretable scRNA-seq analysis

将基因调控先验知识整合到Transformer注意力中：scTransformer用于可解释的单细胞RNA-seq分析

Mikele Milia, Louis Fabrice Tshimanga, Henning Mueller, Manfredo Atzori, Barbara Di Camillo

发表机构 * Department of Information Engineering, University of Padova（信息工程系，帕多瓦大学）； Department of Neuroscience, University of Padova（神经科学系，帕多瓦大学）； Padova Neuroscience Center（帕多瓦神经科学中心）； Information Systems Institute, University of Applied Sciences Western Switzerland, HES-SO Valais（应用科学西瑞士信息系统研究所，HES-SO瓦莱大学）； Department of Comparative Biomedicine and Food Science, University of Padova（比较生物医学与食品科学系，帕多瓦大学）； Padua Center for Network Medicine, University of Padova（帕维亚网络医学中心，帕多瓦大学）

AI总结提出scTransformer，首次将基因调控先验知识嵌入Transformer注意力机制，通过约束信息流学习生物有意义的表示，在疾病相关单核RNA-seq数据上提升分类精度和细胞类型分离，注意力模式与已知调控程序一致。

详情

AI中文摘要

动机：基于Transformer的模型越来越多地应用于大规模单细胞转录组学，通过自监督学习在数百万个细胞上展现出强大性能。然而，大多数现有方法将基因视为独立特征，很大程度上忽略了先验生物学知识，这限制了可解释性和鲁棒性。在本文中，我们探讨了显式整合基因调控信息是否能同时提升模型性能和生物学洞察。结果：我们提出了scTransformer，这是第一个将生物机制的先验知识构建到模型注意力模式中的基于Transformer的方法。通过根据已知调控结构约束信息流，模型学习到更具生物学意义的表示。我们使用监督细胞类型分类在疾病相关的单核RNA-seq数据集上评估scTransformer。与标准Transformer相比，我们的方法提高了分类准确性，增强了嵌入空间中细胞类型的分离，并产生了与已知调控程序一致的注意力模式。总体而言，我们的结果表明，将生物结构嵌入Transformer模型可以在不牺牲性能的情况下增强可解释性，为单细胞组学的生物学基础模型迈出了原则性的一步。

英文摘要

Motivation: Transformer-based models are increasingly applied to large-scale single-cell transcriptomics, showing strong performance through self-supervised learning on millions of cells. However, most existing approaches treat genes as independent features, and largely ignore prior biological knowledge, which limits interpretability and robustness. In this paper, we explore whether explicitly incorporating gene regulatory information can improve both model performance and biological insight. Results: We present scTransformer, the first Transformer-based approach that builds a priori knowledge of biological mechanisms into the model's attention patterns. By constraining information flow according to known regulatory structures, the model learns representations that are more biologically meaningful. We evaluate scTransformer on a disease-relevant single-nucleus RNA-seq dataset using supervised cell-type classification. Compared to standard Transformers, our approach improves classification accuracy, enhances separation of cell types in embedding space, and produces attention patterns consistent with known regulatory programs. Overall, our results demonstrate that embedding biological structure into Transformer models can enhance interpretability without sacrificing performance, offering a principled step toward biologically grounded foundation models for single-cell omics.

URL PDF HTML ☆

赞 0 踩 0

2606.09551 2026-06-09 cs.CR cs.AI 新提交

FuseFSS: Efficient Secure LLM Inference with Function Secret Sharing

FuseFSS：基于函数秘密共享的高效安全LLM推理

Yuhan Ma, Yong Li, Stefan Schmid

发表机构 * University of Science and Technology of China（中国科学技术大学）

AI总结提出FuseFSS编译器，通过统一编译流水线替代逐算子协议设计，实现安全推理中非线性与辅助操作的高效处理，在BERT和GPT模型上取得1.24-1.50倍加速并减少通信与预处理开销。

Comments Accepted at the 43rd International Conference on Machine Learning (ICML 2026)

详情

AI中文摘要

双服务器安全推理允许客户端查询托管的大型语言模型（LLM）而不泄露提示或嵌入。基于函数秘密共享（FSS）的最新GPU系统使线性层高效，但定点非线性和辅助操作仍是瓶颈，因为每个算子通常通过自定义协议实现，包含各自的比较、回绕校正和预处理材料。我们提出FuseFSS，一个编译器，用单一编译流水线替代逐算子协议设计。对于每个定点算子，一个紧凑的规范列出其区间划分、低次算术片段和所需的谓词位。编译器在公开掩码值上执行两次批处理FSS评估：一次打包比较返回所有谓词位，一次向量区间查找返回活跃系数和常数。与当前最先进的基于FSS的GPU安全推理相比，FuseFSS在保持精度的同时，在BERT和GPT风格模型上实现了1.24倍至1.50倍的端到端加速，并将在线通信减少了9%至16%；预处理也更轻量，密钥生成时间降低14%至23%，密钥大小减小20%至24%。

英文摘要

Two-server secure inference allows a client to query a hosted large language model (LLM) without revealing prompts or embeddings. Recent GPU systems based on function secret sharing (FSS) make linear layers efficient, but fixed-point nonlinearities and helper operations remain a bottleneck because each operator is typically implemented as a bespoke protocol with its own comparisons, wrap-around corrections, and preprocessing material. We present FuseFSS, a compiler that replaces per-operator protocol design with a single compilation pipeline. For each scalar fixed-point operator, a compact specification lists its interval partition, low-degree arithmetic pieces, and required predicate bits. The compiler emits two batched FSS evaluations on the public masked value: one packed comparison that returns all predicate bits, and one vector interval lookup that returns the active coefficients and constants. Compared to the current state-of-the-art FSS-based GPU secure inference, FuseFSS preserves accuracy while achieving a $1.24\times$--$1.50\times$ end-to-end speedup and reducing online communication by $9\%$--$16\%$ on BERT and GPT-style models; preprocessing is also lighter, with $14\%$--$23\%$ lower key-generation time and $20\%$--$24\%$ smaller keys.

URL PDF HTML ☆

赞 0 踩 0

2606.09549 2026-06-09 cs.CR cs.AI 新提交

SecureClaw: Clawing Back Control of LLM Agents

SecureClaw: 夺回对LLM智能体的控制

Yuhan Ma, Stefan Schmid

发表机构 * TU Berlin（柏林技术大学）

AI总结针对工具使用型LLM智能体的双重安全漏洞，提出双边界架构SecureClaw，在效果汇点实施授权、在读边界实施明文隔离，通过预览-提交协议和可信网关实现安全控制，在多个基准上保持可用性的同时将攻击成功率降至接近零。

详情

AI中文摘要

使用工具的大型语言模型（LLM）智能体面临两种不同的安全漏洞：未经授权的外部操作以及在最终输出检查介入之前运行时内部敏感明文的暴露。现有防御通常只保护一个边界（规划器/运行时或动作汇点），因此本身无法同时保护两个表面。我们提出SecureClaw，一种双边界架构，在效果汇点实施授权，在读边界实施明文隔离。敏感读取通过一个可信网关，该网关用不透明句柄替换原始值，在评估部署中，还使用有界摘要作为显式解密接口。改变外部状态的写入遵循PREVIEW→COMMIT协议，其中只有可信执行者才能提交策略授权的确切规范请求。运行时仍然可以基于摘要和符号引用进行规划，但不能直接解引用秘密或执行副作用。在AgentDojo、AgentLeak和Agent Security Bench (ASB)上，SecureClaw是我们在通用测试框架中评估的唯一一种同时保持可用任务效用并在ASB上实现0%攻击成功率（ASR）、在AgentDojo上实现0.64% ASR、在AgentLeak的攻击并行通道上实现3.23%总体泄漏（衡量最终输出和内部中继泄漏）的防御方法。

英文摘要

Tool-using large language model (LLM) agents face two distinct security failures: unauthorized external actions and exposure of sensitive plaintext inside the runtime before any final output check can intervene. Existing defenses usually protect one boundary, either the planner/runtime or the action sink, and therefore do not by themselves secure both surfaces. We present SecureClaw, a dual-boundary architecture that places authorization at the effect sink and plaintext confinement at the read boundary. Sensitive reads pass through a trusted gateway that replaces raw values with opaque handles and, in the evaluated deployment, bounded summaries as an explicit declassification interface. Writes that change external state follow a PREVIEW$\rightarrow$COMMIT protocol in which only a trusted executor may commit the exact canonical request authorized by policy. The runtime can still plan over summaries and symbolic references, but cannot directly dereference secrets or perform side effects. Across AgentDojo, AgentLeak, and Agent Security Bench (ASB), SecureClaw is the only defense we evaluate in a common harness that simultaneously retains usable task utility and achieves 0\% attack success rate (ASR) on ASB, 0.64\% ASR on AgentDojo, and 3.23\% overall leak on AgentLeak's attacked parity lane, which measures final-output and internal-relay leakage.

URL PDF HTML ☆

赞 0 踩 0

2606.09548 2026-06-09 cs.CR cs.AI 新提交

Model Poisoning Against Federated Model Adaptation with Chain of Bit-Flips

基于比特翻转链的联邦模型自适应中毒攻击

Bastien Vuillod, Kevin Hector, Pierre-Alain Moellic, Jean-Max Dutertre, Olivier Potin

发表机构 * CEA-Leti, Mines Saint-Etienne, Equipe Commune SAS（CEA-莱蒂， Mines圣艾蒂安，共同团队SAS）； Univ. Grenoble Alpes, CEA-Leti（格勒诺布尔阿尔卑斯大学， CEA-莱蒂）； Mines Saint-Etienne, CEA-Leti, Centre CMP, Equipe commune SAS（Mines圣艾蒂安， CEA-莱蒂， CMP中心，共同团队SAS）

AI总结提出一种结合硬件故障攻击的模型中毒方法，在联邦学习训练阶段通过比特翻转注入后门，实现任务无关的后门攻击，在ResNet-18上仅需少量故障即可达到94%攻击成功率。

Comments Accepted at ACNS/AIHWS 2026

详情

AI中文摘要

联邦学习允许一组客户端在不共享本地训练数据的情况下共同训练全局模型。将训练责任交给去中心化的参与者可能导致中毒攻击：由恶意第三方控制的客户端可能毒化训练数据集，在神经网络中安装后门。在联邦学习中，这些后门攻击仅依赖算法方法，然而，硬件故障威胁（如Rowhammer）的最新进展拓宽了整体攻击面。在联邦模型自适应的背景下，我们引入了一种针对联邦学习系统的新型后门攻击类别，该攻击基于硬件故障攻击的模型中毒。更准确地说，我们提出了一种任务无关的后门攻击，通过在联邦训练期间诱导单个本地模型参数中的硬件故障（比特翻转）来植入后门。后门是在之前的离线阶段从联邦系统最初使用的预训练模型中精心制作的。我们的结果表明，后门可以成功应用于不同类型的模型和数据集。通常，每个恶意客户端出现最多10次故障，且总共出现19次故障，就足以在ResNet-18上达到94%的攻击成功率。最后，我们讨论了攻击潜在防御的实用性和鲁棒性，同时考虑了Rowhammer的实际约束，这是此类威胁的首选攻击向量。

英文摘要

Federated Learning (FL) allows a set of clients to collectively train a global model without sharing local training data. Giving the responsibility of the training to decentralized actors may lead to poisoning attacks: clients controlled by malicious third party potentially poison the training dataset to install a backdoor in neural networks. In FL, these backdoor attacks rely solely on algorithmic approach, however, recent advances in hardware faults threats (e.g, Rowhammer) have widen the overall attack surface. In the context of federated model adaptation, we introduce a novel category of backdoor attack against FL systems that relies on model poisoning based on hardware-fault attacks. More precisely, we propose a task-agnostic backdoor attack that is implanted during the FL training time by inducing hardware faults (bit-flips) in parameters of a single local model. The backdoor is crafted during a previous offline phase from the pretrained model initially used by the FL system. Our results show that a backdoor can be successfully applied on different type of models and datasets. Typically, with up to 10 faults per malicious client occurrence and 19 total occurrences on a ResNet-18 are enough to reach 94% of attack success rate. Finally, we discuss the practicality and the robustness of the attack potential defenses, while putting into perspective the practical constraints of Rowhammer, which is the preferred attack vector for this type of threats.

URL PDF HTML ☆

赞 0 踩 0

2606.09541 2026-06-09 physics.app-ph cs.LG 新提交

Automating the Expert Eye: A System-Agnostic Deep Learning Framework for Rare Event Discovery in Imbalanced Force Spectroscopy

自动化专家眼：用于非平衡力谱中稀有事件发现的系统无关深度学习框架

Jorge Rodriguez-Ramos

发表机构 * Independent Researcher（独立研究者）； Marseille, France（法国马赛）

AI总结提出一种系统无关的可解释深度学习框架，利用1D到2D光栅化几何矩阵和修改的ResNet18架构，结合非对称Focal Loss，在极端类别不平衡的力谱数据中实现高召回率（0.9231），并通过双阈值分诊系统减少90%以上人工审核工作量。

Comments 13 pages, 2 figures, 2 tables

详情

AI中文摘要

单分子力谱（SMFS）为生物分子力学提供了前所未有的见解，然而高通量生成的力-延伸轨迹造成了严重的数据筛选瓶颈。在数千条噪声主导的曲线中识别罕见的分子解绑事件传统上依赖于繁琐、不可扩展的人工审核。在这里，我们提出了一个系统无关、可解释的深度学习框架，专门用于克服自动SMFS分诊中的极端类别不平衡。利用1D到2D光栅化几何矩阵，我们部署了由非对称Focal Loss目标函数控制的修改版ResNet18架构。我们在R. champanellensis纤维小体的复杂机械解折叠路径上评估了该框架。在超不平衡测试条件下，目标相互作用仅占数据集的1.34%（970条轨迹中13个真实事件），模型实现了0.9196的整体准确率和0.9231的惊人真阳性率（召回率）。通过实施经验校准的双阈值分诊系统，该流程自动丢弃了880条明确的背景噪声轨迹，将人工审核工作量减少超过90%，同时安全地保留了高价值的稀有数据。最后，梯度加权类激活映射（Grad-CAM）可视化验证了网络的决策牢固地基于力曲线的相关几何特征，特别是定位于结构解绑区域，有效缓解了“黑箱”质疑。该开源工具专为免费云端执行而构建，使生物物理学社区能够民主化地实现可扩展、高精度的分子发现。

英文摘要

Single-Molecule Force Spectroscopy (SMFS) provides unprecedented insights into biomolecular mechanics, yet the high-throughput generation of force-extension trajectories creates a severe data curation bottleneck. Identifying rare molecular unbinding events within thousands of noise-dominated curves traditionally relies on tedious, non-scalable manual auditing. Here, we present a system-agnostic, interpretable deep learning framework tailored to overcome extreme class imbalance in automated SMFS triage. Utilizing 1D-to-2D rasterized geometric matrices, we deployed a modified ResNet18 architecture governed by an asymmetric Focal Loss objective function. We evaluated this framework on the complex mechanical unfolding pathways of the R. champanellensis cellulosome. Under hyper-imbalanced test conditions where the target interaction constituted only 1.34% of the dataset (13 true events out of 970 traces), the model achieved an overall accuracy of 0.9196 and a remarkable True Positive Rate (Recall) of 0.9231. By implementing an empirically calibrated dual-threshold triage system, the pipeline automatically discarded 880 unambiguous background noise traces , reducing the manual curation workload by over 90% while safely preserving high-value rare data. Finally, Gradient-weighted Class Activation Mapping (Grad-CAM) visually validated that the network's decisions are firmly anchored in the relevant geometric features of the force curves, specifically localizing on the structural unbinding regions, effectively mitigating 'black-box' skepticism. Built for free cloud-based execution, this open-source tool democratizes scalable, highly precise molecular discovery across the biophysics community.

URL PDF HTML ☆

赞 0 踩 0

2606.09532 2026-06-09 cs.CY cs.CL 新提交

Interpretable Crisis Behavior Analysis Using Mobility and Social Media Data

基于移动性和社交媒体数据的可解释危机行为分析

Muhammad Hamza Arshad Majeed, Sidahmed Benabderrahmane, Talal Rahwan

发表机构 * New York University (NYUAD)（纽约大学（NYUAD））

AI总结提出统一可解释流水线，融合移动性和社交媒体数据，通过形式概念分析和关联规则挖掘，识别危机中跨域行为模式，并在洛杉矶山火和COVID-19案例中验证，生成可操作的政策简报。

详情

AI中文摘要

危机改变了人们的移动方式和沟通方式。在野火和流行病等紧急情况下，移动模式的变化和在线情感话语共同演变，但通常被孤立研究。本文提出了一个统一且可解释的流水线，整合移动性和社交媒体数据，以识别危机环境中的跨域行为模式。该框架通过两个案例研究进行评估：2025年1月洛杉矶野火的短期分析（原型案例）和2020年3月至2021年12月阿联酋COVID-19行为的纵向分析（主要案例，671天）。该流水线对齐异构每日信号，将其转换为二元行为状态，应用形式概念分析（FCA）提取共现结构，挖掘关联规则，并通过时间顺序保留测试验证规则稳定性。一个结构化的政策翻译层将稳健规则转化为操作简报，指定触发条件、提前时间和行动方案。结果揭示了两种危机中清晰的跨域行为结构。在野火案例中，交通压力、恐惧/愤怒情绪和治理话语在33天窗口内紧密耦合，关键规则达到100%置信度，提升度高达2.5。在COVID案例中，重复的移动适应和情绪波动产生了8条稳定的同日规则（88%保留通过率）和40条清晰的预测规则，提前时间为2-7天。该工作表明，可解释的多模态融合可以产生既科学可信又政策可操作的危机情报。

英文摘要

Crises alter both how people move and how they communicate. During emergencies such as wildfires and pandemics, changes in mobility patterns and online emotional discourse evolve jointly, yet they are typically studied in isolation. This paper presents a unified and interpretable pipeline that integrates mobility and social media data to identify cross-domain behavioral patterns in crisis settings. The framework is evaluated through two case studies: a short-horizon analysis of the January 2025 Los Angeles wildfires (prototype case) and a longitudinal analysis of UAE COVID-19 behavior from March 2020 to December 2021 (primary case, 671 days). The pipeline aligns heterogeneous daily signals, transforms them into binary behavioral states, applies Formal Concept Analysis (FCA) to extract co-occurrence structure, mines association rules, and validates rule stability through chronological holdout testing. A structured policy-translation layer renders robust rules as operational briefs specifying triggers, lead times, and action playbooks. Results reveal clear cross-domain behavioral structure in both crises. In the wildfire case, traffic stress, fear/anger sentiment, and governance discourse are tightly coupled within a 33-day window, with key rules reaching 100\% confidence and lift scores up to 2.5. In the COVID case, repeated mobility adaptation and sentiment volatility yield 8 stable same-day rules (88\% holdout pass rate) and 40 clean predictive rules with 2--7 day lead horizons. The work demonstrates that interpretable multimodal fusion can produce both scientifically credible and policy-actionable crisis intelligence.

URL PDF HTML ☆

赞 0 踩 0

2606.09520 2026-06-09 physics.chem-ph cs.AI 新提交

Closing the Prior-Posterior Loop: Self-Reflective Molecular Design with Analysis-Driven LLM Iteration

闭合先验-后验循环：基于分析驱动LLM迭代的自反性分子设计

Junyi Gong, Zijie Qiu, Ben Zhong Tang

发表机构 * Faculty of Chemistry, Shenzhen MSU-BIT University（深圳MSU-BIT大学化学学院）； School of Science and Engineering, Chinese University of Hong Kong (Shenzhen)（香港中文大学（深圳）科学与工程学院）； Department of Chemistry, Hong Kong University of Science and Technology（香港科技大学化学系）

AI总结提出一种自反性分子设计框架，用第一性原理计算的完整物化理由替代标量反馈，使LLM从随机采样器转变为因果推理器，在HOMO-LUMO能隙任务中实现0.0003 eV偏差和100%成功率。

Comments 3 tables, 4 figures

详情

AI中文摘要

通用大语言模型能否像经验丰富的化学家一样精确设计分子？当前的LLM框架通过标量反馈循环（生成、评分、拒绝）来回答这个问题，这相当于有依据的试错。本文表明，用第一性原理计算的完整物化理由替代单一数字，可将LLM从随机采样器转变为因果推理器。我们的系统将检索增强生成与自反模块相结合，该模块将轨道能量、原子电荷和电子密度（而非压缩分数）反馈到设计循环中。在1.0至5.0 eV的HOMO-LUMO能隙目标上，这种结构-性质关系（SPR）反射实现了低至0.0003 eV的偏差，在中等任务上达到100%的成功率，显著优于标量反馈和非反射基线。该框架可无缝推广到偶极矩设计，并在五种不同的LLM骨干网络上表现出鲁棒性。这些结果建立了一个新范式：当模型不仅理解分子为何失败，而且理解失败原因时，迭代分子设计将变得真正具有机理性质。

英文摘要

Can a general-purpose large language model design molecules with the precision of a seasoned chemist? Current LLM-based frameworks answer this question with scalar feedback loops-generate, score, reject-that amount to informed trial-and-error. Here we show that replacing a single number with the full physicochemical rationale from first-principles calculations transforms the LLM from a stochastic sampler into a causal reasoner. Our system couples retrieval-augmented generation with a self-reflection module that feeds orbital energies, atomic charges, and electron densities-rather than compressed scores-back into the design loop. On HOMO-LUMO gap targets from 1.0 to 5.0 eV, this structure-property-relationship (SPR) reflection achieves a deviation as low as 0.0003 eV and a 100% success rate on moderate tasks, decisively outperforming scalar-feedback and non-reflective baselines. The framework generalizes seamlessly to dipole-moment design and proves robust across five distinct LLM backbones. These results establish a new paradigm: when the model understands not only that a molecule fails, but why, iterative molecular design becomes genuinely mechanistic.

URL PDF HTML ☆

赞 0 踩 0

2606.09473 2026-06-09 stat.ML cs.LG 新提交

Report the Floor: A Training-Free Conformal Interval Is a Mandatory Baseline for Probabilistic Time-Series Forecasting

报告基线：无训练共形区间是概率时间序列预测的强制性基准

Valery Manokhin

发表机构 * Independent researcher（独立研究者）

AI总结提出无参数、无训练的共形朴素区间作为概率预测的强基线，在2217个真实序列上击败了多种现有方法，并主张其应成为强制性基准。

详情

DOI: 10.5281/zenodo.20594484

AI中文摘要

概率预测器越来越多地通过学习得到，但它们所比较的基线往往较弱或被忽略。我们表明，最简单的共形区间——一个包裹在有限样本分割共形残差分位数中的最后值点预测，无参数且无需训练——是一个远比其在近期学习预测和共形时间序列比较中几乎完全缺失所暗示的更强大的基线。在来自九个公共来源（Monash、LOTSA、LTSF交通/电力/天气套件、METR-LA、BOOM、nips/probts）的2217个真实序列的单步在线预测中，这个ConformalNaive区间决定性地击败了朴素值分位数基线、整个NPTS系列（NPTS 73%，SeasonalNPTS 64%的序列）以及已发表的共形季节池（CSP）方法（71%的序列，bootstrap 95% CI [69,73]，配对Wilcoxon p约7.6e-135）；它与更简单的学习共形预测器（RCI，分位数回归；中位数相对Winkler在2%以内）相当，并且仅被跟踪分布偏移的自适应在线和集成方法（SPCI、ACI、AgACI）击败，后者在相对Winkler上领先9-33%。它也比训练过的神经预测器校准得更好：在引入DeepNPTS的六个数据集上，平凡的基线在名义95%下覆盖真实值84-85%的时间，而DeepNPTS为66%。在多步季节视界上，情况反转：随机游走基线是最弱的方法，季节池（CSP）获胜——我们描绘了这一边界。最后，我们给出了ConformalNaive+，一个一行代码、无训练、视界自适应的选择器，它在每个视界上达到两个互补基线中较好的一个，并恢复了覆盖。我们认为，每当学习概率预测器声称有改进时，匹配的共形朴素基线必须是一个强制性基准。

英文摘要

Probabilistic forecasters are increasingly learned, yet the baselines they are compared against are often weak or omitted. We show that the simplest possible conformal interval - a last-value point forecast wrapped in a finite-sample split-conformal residual quantile, with no parameters and no training - is a far stronger baseline than its near-total absence from recent learned-forecasting and conformal-time-series comparisons would suggest. In one-step-ahead online forecasting across 2,217 real series from nine public sources (Monash, LOTSA, the LTSF traffic/electricity/weather suites, METR-LA, BOOM, nips/probts), this ConformalNaive interval decisively beats the naive value-quantile baselines, the entire NPTS family (NPTS 73%, SeasonalNPTS 64% of series), and the published Conformal Seasonal Pools (CSP) method (71% of series, bootstrap 95% CI [69,73], paired Wilcoxon p approx 7.6e-135); it is on par with the simpler learned conformal predictors (RCI, quantile regression; median relative Winkler within 2%) and is beaten only by the adaptive-online and ensemble methods (SPCI, ACI, AgACI), which track distribution shift and lead by 9-33% relative Winkler. It is also better calibrated than a trained neural forecaster: on the six datasets that introduced DeepNPTS, the trivial floors cover the truth 84-85% of the time at a nominal 95%, versus DeepNPTS's 66%. At multi-step seasonal horizons the picture inverts: the random-walk floor is the weakest method and the seasonal pool (CSP) wins - a boundary we map. Finally we give ConformalNaive+, a one-line, training-free, horizon-adaptive selector that attains the better of two complementary floors at every horizon with restored coverage. We argue the matching conformal naive floor must be a mandatory baseline whenever a learned probabilistic forecaster claims gains.

URL PDF HTML ☆

赞 0 踩 0

2606.09419 2026-06-09 cond-mat.mtrl-sci cs.AI 新提交

Context-Aware Deep Learning for Defect Classification in Atomic-Resolution STEM

上下文感知深度学习用于原子分辨率扫描透射电镜中的缺陷分类

Jiadong Dan, Cheng Zhang, Leyi Loh, Ivan Verzhbitskiy, Yuan Chen, Goki Eda, Michel Bosman, N. Duane Loh

发表机构 * cond-mat.mtrl-sci（材料科学）

AI总结提出上下文感知学习框架，融合图像对比度与元数据（成分、束能、探测器几何），解决仅凭图像对比度进行缺陷分类的歧义性，在模拟数据上准确率超98%，实验数据接近人类水平。

Comments 6 figures

详情

AI中文摘要

人工智能正在快速推进材料表征，然而电子显微镜中的大多数应用仅依赖图像对比度，忽视了影响图像形成的化学和实验上下文。这一局限性使得缺陷分类本质上具有歧义性，因为相似的对比度可能来自不同的材料或成像条件。在此，我们开发了一个上下文感知学习框架，将图像导出的对比度与描述成分、束能和探测器几何的元数据相结合。利用系统构建的约5500万模拟补丁数据集，涵盖96种掺杂单层过渡金属二硫族化合物的576种情况，我们表明，以上下文变量为条件将缺陷分类从一个不适定的纯图像任务转变为一个适定的、基于物理的问题。该框架在模拟数据上实现了超过98%的准确率，在实验数据上达到了接近人类的一致性，后验熵降低了94%。通过强调上下文基础而非架构复杂性，该方法将实验图像对比度与潜在的化学和成像条件联系起来，支持基于物理的缺陷分配，并为自主材料表征的多模态AI模型提供了一条通用路径。

英文摘要

Artificial intelligence is rapidly advancing materials characterization, yet most applications in electron microscopy rely solely on image contrast, overlooking the chemical and experimental context that shapes image formation. This limitation makes defect classification inherently ambiguous, as similar contrasts can arise from different materials or imaging conditions. Here we develop a context-aware learning framework that integrates image-derived contrast with metadata describing composition, beam energy, and detector geometry. Using a systematically constructed dataset of ~55 million simulated patches spanning 576 cases across 96 doped monolayer transition-metal dichalcogenides, we show that conditioning on contextual variables transforms defect classification from an ill-posed image-only task into a well-posed, physically grounded problem. The framework achieves over 98% accuracy on simulations and near-human agreement on experimental data, with a 94% reduction in posterior entropy. By emphasizing contextual grounding over architectural complexity, this approach links experimental image contrast to the underlying chemical and imaging conditions, supporting physically grounded defect assignments and a general pathway toward multimodal AI models for autonomous materials characterization.

URL PDF HTML ☆

赞 0 踩 0

2606.09414 2026-06-09 cs.HC cs.AI 新提交

AI Assurance in UK Defence: Challenges in Operationalising JSP 936

英国国防中的人工智能保证：JSP 936 操作化的挑战

Callum Cockburn, Sam Farrow

发表机构 * Synoptix

AI总结本文通过结构化解释性审查，识别了英国国防中实施JSP 936进行AI保证的八大挑战，并指出其依赖未解决的技术、组织和保证问题。

详情

AI中文摘要

本报告审查了在英国国防中操作化JSP 936第1部分进行AI保证的实际挑战。通过对该指令要求的结构化解释性审查，分析确定了八个主题挑战领域：证据和论证的充分性、人类与AI交互的管理、操作环境的定义、AI在系统之系统中的集成、AI性能的评估和维护、安全性和安保分析、伦理性的测量以及AI固有复杂性的缓解。报告认为，JSP 936提供了有用的治理基础，但实施取决于未解决的技术、组织和保证问题。这些挑战源于AI赋能系统的社会技术性质、实际部署环境中的不确定性、当前保证方法的局限性以及性能、安全、人类监督、安保和伦理可接受性之间的紧张关系。报告指出了在国防领域实现雄心勃勃、安全且负责任的AI采纳所需进一步的方法、指南和组织能力领域。这与MOD自身将JSP 936描述为需要迭代实施和支持性指导的框架是一致的。

英文摘要

This report examines practical challenges in operationalising JSP 936 Part 1 for AI assurance in UK Defence. Using a structured interpretive review of the directive's requirements, the analysis identifies eight thematic challenge areas adequacy of evidence and argument, management of human interaction with AI, definition of the operational environment, integration of AI within systems of systems, assessment and maintenance of AI performance, analysis of safety and security, measurement of ethicality, and mitigation of the inherent complexities of AI. The report argues that JSP 936 provides a useful governance basis, but that implementation depends on unresolved technical, organisational, and assurance questions. These challenges stem from the socio-technical nature of AI-enabled systems, uncertainty in real-world deployment contexts, limitations in current assurance methodologies, and tensions between performance, safety, human oversight, security, and ethical acceptability. The report identifies areas where further methods, guidance, and organisational capability are needed for the ambitious, safe, and responsible adoption of AI across Defence. This is consistent with MOD's own framing of JSP 936 as requiring iterative implementation and supporting guidance.

URL PDF HTML ☆

赞 0 踩 0

2606.09411 2026-06-09 cs.CR cs.IT cs.LG math.IT 新提交

Now You (Still) See Me: Detecting Evasive Steganographic Payloads in LLMs

现在你（仍然）能看到我：检测大语言模型中的隐蔽隐写载荷

Charles Westphal, Timothy Douglas, Keivan Navaie, Tiago Pimentel, Fernando E. Rosas

发表机构 * UCL Centre for AI（UCL人工智能中心）； University College London（伦敦大学学院）； ML Alignment Theory Scholars（机器学习对齐理论学者）； Department of Computer Science（计算机科学系）； School of Computing and Communications（计算与通讯学院）； ETH Zürich（苏黎世联邦理工学院）； University of Sussex（Sussex大学）； Imperial College London & University of Oxford（伦敦帝国学院与牛津大学）

AI总结针对大语言模型隐写外泄风险，提出一种基于非线性MLP探针的对抗性微调方法可系统规避现有线性探针检测，但通过信息论指导的数据级干预可恢复检测能力。

详情

AI中文摘要

大型语言模型可以通过微调将提示中的秘密编码到流畅、看似良性的输出中。这造成了一种隐写外泄风险，难以通过输出级隐写分析检测。最近的工作提出使用线性探针从内部激活中恢复秘密的机制检测方法。我们表明这种防御可以被系统性地规避，但通过针对性的数据级干预可以恢复可检测性。首先，我们将检测设置扩展到包括非线性MLP探针。然后，我们在五个基础模型上对抗性微调隐写木马：Qwen3-8B、Llama-3.1-8B、Ministral-8B、Qwen3-14B和Phi-4-14B。得到的模型在规避岭回归和留出MLP探针的同时，保留了58%–79%的精确匹配秘密恢复，在六个基准测试中平均能力下降1%–8%。然后，我们给出了这种规避的信息论特征。成功的规避在保持可恢复性的同时，降低了从内容对齐表示中提取秘密的低阶可提取性，迫使载荷与剩余自由度产生协同交互。这激发了一个重新语境化数据集，限制了这些剩余自由度。在该分布上，所有五个规避木马的岭回归和MLP可检测性都得到恢复。总体而言，我们的发现表明基于激活的隐写检测容易受到自适应规避的影响，但理论指导的评估分布可以暴露原本隐藏的载荷。

英文摘要

Large language models can be fine-tuned to encode prompt-borne secrets into fluent, seemingly benign outputs. This creates a steganographic exfiltration risk that is difficult to detect with output-level steganalysis. Recent work proposes mechanistic detection using linear probes that recover the secret from internal activations. We show that this defense can be systematically evaded, but that detectability can be recovered through a targeted data-level intervention. First, we extend the detection setup to include a non-linear MLP probe. We then adversarially fine-tune steganographic trojans across five base models: Qwen3-8B, Llama-3.1-8B, Ministral-8B, Qwen3-14B, and Phi-4-14B. The resulting models retain $58$--$79\%$ exact-match secret recovery while evading both ridge and held-out MLP probes, with $1$--$8\%$ average capability degradation across six benchmarks. We then give an information-theoretic characterization of this evasion. Successful evasion preserves recoverability while reducing low-order extractability of the secret from the content-aligned representation, forcing the payload into synergistic interaction with residual degrees of freedom. This motivates a recontextualization dataset that restricts these residual degrees of freedom. On this distribution, both ridge and MLP detectability are restored across all five evasive trojans. Overall, our findings show that activation-based steganography detection is vulnerable to adaptive evasion, but also that theory-guided evaluation distributions can expose otherwise hidden payloads.

URL PDF HTML ☆

赞 0 踩 0

2606.09408 2026-06-09 cs.CY cs.AI cs.HC 新提交

Can Data Work be Reparative?

数据工作能否具有修复性？

Srravya Chandhiramowuli, Ding Wang, Alex Taylor

发表机构 * University of Edinburgh（爱丁堡大学）； Google Research（谷歌研究院）

AI总结通过民族志研究，探讨公民科技倡议如何从女性主义视角协作构建安全数据集，旨在将数据工作重塑为修复与补救的场所，并分析其中遇到的挑战与张力。

Comments To be presented at ACM FAccT, Montréal, Canada, June 25 to June 28, 2026

详情

AI中文摘要

我们展示了一项关于数据工作替代方法的民族志研究，该方法由一项公民科技倡议开发，该倡议构建用于训练和基准测试在线安全系统的数据集。他们旨在从女性主义视角回应在线安全问题，通过与受在线伤害影响最大的人协作构建安全数据集。在本文中，我们考察了这种方法如何试图将数据工作重新定位为修复和补救的场所，并追溯他们在这一过程中遇到的挣扎。具体来说，我们关注在推进数据工作的公正报酬和AI数据集的集体治理方面所面临的挑战和张力。通过STS视角下的修复正义和修复理论审视这些挑战，我们认为修复数据工作（以及AI）的工作从根本上在于重置责任关系。在当前强调安全评估和红队测试等努力以使AI更加负责任的背景下，我们强调需要面对基本问题：参与这些努力的人类如何与他们帮助产生的数据集和系统相关联。修复性视角要求我们打断数据工作的主流规范，并将那些因当前数据集生产模式中的忽视、疏忽和排斥而受害最深的人置于中心，而不是AI或数据集。我们认为，这为责任提供了大胆的愿景，并为构建数据和AI实践的替代未来贡献了批判性议程。

英文摘要

We present an ethnographic study of an alternative approach to data work, developed by a civic-tech initiative that builds datasets for training and benchmarking online safety systems. They aim to respond to online safety concerns from a feminist perspective, by building safety datasets collaboratively with those most impacted by online harms. In this paper, we examine how this approach aims to reorient data work as a site for repair and redress, and trace the struggles they encounter in the process. Specifically, we draw attention to the challenges and tensions involved in advancing just reward for data work and collective governance of AI datasets. Examining these challenges through an STS-informed lens of reparative justice and repair, we argue that the work of repairing data work (and AI) lies, fundamentally, in resetting the ties of accountability. At a time heightened emphasis on efforts like safety evaluations and red teaming to make AI more responsible, we highlight the need to confront foundational questions about how the humans involved in these efforts relate to the datasets and systems they help produce. A reparative lens demands that we interrupt prevailing norms of data work and place at their centre, not AI or datasets, but those most harmed by the neglect, oversight and exclusion animated in the current modes of dataset production. This, we argue, offers a bold vision for responsibility and contributes towards a critical agenda for building alternative futures of data and AI practice.

URL PDF HTML ☆

赞 0 踩 0

2606.09404 2026-06-09 stat.ML cs.AI cs.LG 新提交

SAILS: Surrogate-based Analysis of Interactions via Local Effect Smooths

SAILS: 基于局部效应平滑的交互作用代理分析

Timo Heiß, Julia Herbinger, Bernd Bischl, Giuseppe Casalicchio

发表机构 * Department of Statistics, LMU Munich（慕尼黑大学统计系）； Munich Center for Machine Learning (MCML)（慕尼黑机器学习中心）； Leibniz Institute for Prevention Research and Epidemiology（莱比锡预防研究与流行病学研究所）

AI总结提出SAILS框架，通过可解释的广义加性模型代理分析黑箱模型中的成对交互作用，实现交互检测、形式分类和可视化。

详情

AI中文摘要

特征交互驱动了机器学习模型的大部分预测能力，然而现有的解释方法仅能检测和量化交互作用，而无法揭示其函数形式，或者只能可视化受限的交互类型。我们提出了基于局部效应平滑的交互作用代理分析（SAILS），这是一个模型无关的框架，通过拟合黑箱模型局部效应的可解释广义加性模型（GAM）代理来分析成对交互作用。对于感兴趣特征的每个区间，代理平滑项在导数层面隔离交互成分，从而实现（i）通过对平滑项显著性检验的启发式方法进行交互检测，（ii）将交互形式分类为线性、乘积可分离和非乘积可分离类型，以及（iii）为每种交互类型提供定制化、可解释的可视化。我们通过受控模拟和实际任务实证验证了该框架，展示了其在成对交互作用上的有效性，但在强特征相关性和高阶交互作用下存在局限性。SAILS填补了XAI工具箱中的一个显著空白，超越了仅检测交互作用，进而表征其函数形式。

英文摘要

Feature interactions drive much of the predictive power of machine learning models, yet existing explanation methods only detect and quantify interactions without revealing their functional form, or visualize only restricted interaction types. We propose Surrogate-based Analysis of Interactions via Local effect Smooths (SAILS), a model-agnostic framework that analyzes pairwise interactions through interpretable generalized additive model (GAM) surrogates fitted to the local effects of a black-box model. For each interval of a feature of interest, the surrogate smooth terms isolate the interaction components on derivative level, enabling (i) interaction detection through a heuristic derived from significance tests on smooth terms, (ii) interaction form categorization into linear, product-separable, and non-product-separable types, and (iii) tailored, interpretable visualizations for each interaction type. We empirically validate the framework through controlled simulations and a real-world task, demonstrating its effectiveness for pairwise interactions, with limitations under strong feature correlations and higher-order interactions. SAILS fills a notable gap in the XAI toolbox, going beyond detection of interactions alone to characterizing their functional form.

URL PDF HTML ☆

赞 0 踩 0

2606.09331 2026-06-09 cs.MM cs.AI cs.LG 新提交

Conan-embedding-v3: Fusing Modality-Specific Models for Omni-Modal Embedding

Conan-embedding-v3: 融合模态特定模型实现全模态嵌入

Shiyu Li, Zhiyuan Hu, Yifan Wang, Peiming Li, Zheng Wei, Yang Tang

发表机构 * Tencent（腾讯）

AI总结提出解耦-融合-恢复框架，通过独立训练模态专家并融合任务向量，再使用投影器恢复和平衡多模态重演解决投影器漂移问题，实现单一骨干网络支持文本、图像、视频、文档和音频检索。

详情

AI中文摘要

全模态检索承诺为文本、图像、视频、文档和音频输入提供单一嵌入空间，但由于这些模态在数据分布、架构和优化动态上存在差异，构建这样一个统一的检索器十分困难。在这项工作中，我们提出了Conan-embedding-v3，一个用于全模态检索的解耦-融合-恢复框架。Conan-embedding-v3首先独立训练模态专家，然后将它们的任务向量融合到一个单一的密集骨干网络中，我们称这种策略为解耦专家融合。我们表明，这种融合组合了视觉、视频和文档检索能力，但也暴露了基于投影器的模态的一个失败模式：当通过外部编码器和投影器附加音频时，融合骨干网络会使投影器校准到音频专家骨干网络，导致尽管原封不动地复制了所有音频特定模块，音频检索性能仍大幅下降。我们将这种失败称为投影器漂移。为了修复它，Conan-embedding-v3应用了投影器恢复（即在保持骨干网络冻结的情况下对投影器进行全参数微调），随后进行平衡的多模态重演。得到的模型在一个骨干网络中支持这些检索路径，在MMEB上达到74.9分，同时在30任务的MAEB音频套件上获得55.61分。

英文摘要

Omni-modal retrieval promises a single embedding space for text, image, video, document, and audio inputs, but building such a unified retriever is difficult since these modalities differ in data distribution, architecture, and optimization dynamics. In this work, we present Conan-embedding-v3, a decouple--fuse--recover framework for omni-modal retrieval. Conan-embedding-v3 first trains modality specialists independently and fuses their task vectors into a single dense backbone, a strategy we call Decoupled Specialist Fusion. We show that this fusion composes visual, video, and document retrieval capabilities, but also exposes a failure mode for projector-based modalities: when audio is attached through an external encoder and projector, fusing the backbone leaves the projector calibrated to the audio-specialist backbone, causing a large audio retrieval regression despite copying all audio-specific modules unchanged. We call this failure Projector Drift. To repair it, Conan-embedding-v3 applies Projector Recovery (i.e., full-parameter fine-tuning of the projector while keeping the backbone frozen) followed by balanced multi-modal rehearsal. The resulting model supports these retrieval pathways in one backbone, achieving 74.9 scores on MMEB while obtaining 55.61 on the 30-task MAEB audio suite.

URL PDF HTML ☆

赞 0 踩 0

2606.09315 2026-06-09 cs.CR cs.AI 新提交

Brain-Prompt Injection: A Route-Safety Audit for BCI-LLM Agents

脑提示注入：BCI-LLM代理的路径安全审计

Jianwei Tai

发表机构 * University of California, Berkeley（加州大学伯克利分校）

AI总结提出路径安全审计契约，通过分离定理和共形校准量化BCI-LLM代理中脑信号注入攻击的风险，实验证明确认通道可降低路由风险。

详情

AI中文摘要

BCI到代理的管道将解码的神经活动转化为工具使用代理的授权通道，暴露了一个我们称之为\emph{脑提示注入}的新攻击面：信号侧扰动、上下文仅注入和自适应双解码器攻击都可以改变路由动作，而EEG侧或文本侧监控器仍然盲视。该堆栈中的路径安全取决于审计日志能观察到什么，而不仅仅是解码器准确性或一致性。我们定义了一个路径安全审计契约：一个最小的日志模式、分母层次结构和端点规范，并证明了一个审计模式分离定理以及一个C3攻击依赖分解；干净的一致性和边际鲁棒性不能识别控制C3路由的联合项。作为契约之上的校准层，我们将分裂共形校准应用于非预言机EEG确认通道，并在明确的威胁原型矩阵下报告由此产生的假接受边界。我们在EEGMMI原生左/右命令控制上实例化该契约，涉及5,400个事件、无害工具存根和种子/案例分母。来源阻止C2路由（$0.000$）；一致性加来源路由C3翻转（$1.000$）；确认加来源路由它们（$0.000$）。共形边界在采集隔离下，对于$α=.005$，在干净效用$0.150$时达到FAR $0.000$；对于$α=.10$，在干净效用$0.452$时达到FAR $0.119$；攻击者可控制的确认通道将界限打破至$\approx\!1$。受试者集群自助法在60名受试者上确认了这些区间；跨架构（TinyEEGNet, EEGNetV4）和容量扫描结果显示在区域内饱和。中介和确认降低了风险；它们不是意图证书。

英文摘要

BCI-to-agent pipelines turn decoded neural activity into an authorization channel for tool-use agents, exposing a new attack surface we call \emph{brain-prompt injection}: signal-side perturbations, context-only injections, and adaptive dual-decoder attacks can all change the routed action while EEG-side or text-side monitors remain blind. Route safety in this stack depends on what the audit log can observe, not on decoder accuracy or agreement alone. We define a Route-Safety Audit Contract: a minimal log schema, denominator hierarchy, and endpoint specification, and prove an audit-schema separation theorem together with a C3 attacked-dependence decomposition; clean agreement and marginal robustness do not identify the joint term that controls C3 routing. As a calibration layer on top of the contract, we apply split-conformal calibration to a non-oracle EEG confirmation channel and report the resulting false-accept frontier under an explicit threat-archetype matrix. We instantiate the contract on EEGMMI native left/right command-control over 5{,}400 events, harmless tool stubs, and seed/case denominators. Provenance blocks C2 routes ($0.000$); agreement-plus-provenance routes C3 flips ($1.000$); confirmation-plus-provenance routes them ($0.000$). The conformal frontier reaches FAR $0.000$ at clean utility $0.150$ for $α=.005$ and FAR $0.119$ at clean utility $0.452$ for $α=.10$ under acquisition isolation; an attacker-controllable confirmation channel breaks the bound to $\approx\!1$. Subject-cluster bootstrap confirms these intervals on $60$ subjects; cross-architecture (TinyEEGNet, EEGNetV4) and capacity-sweep results show within-regime saturation. Mediation and confirmation reduce risk; they are not intent certificates.

URL PDF HTML ☆

赞 0 踩 0

2606.09227 2026-06-09 cs.CR cs.AI cs.CE cs.CY cs.HC cs.SI 新提交

Trustworthy Smart Fabs via Professional Proxies: Scaling Safe and Sustainable by Design (SSbD) through Industrial Data Spaces

通过专业代理实现可信智能晶圆厂：通过工业数据空间扩展安全与可持续设计（SSbD）

Han-Teng Liao, Chang-Yi Kao, Karen Ang

发表机构 * Independent Researcher Dept. Computer Science and Independent Researcher Information Management（独立研究员计算机科学系及独立研究员信息管理）

AI总结针对欧盟SSbD等法规带来的治理瓶颈，提出基于零信任的社会技术编排框架，通过硬件隔离信任区中的专业代理工作流，在工业数据空间中实现自主治理，解决数据主权悖论。

Comments This work was accepted for presentation at the 32nd IEEE ICE/ITMC Conference, Porto, Portugal, 2026 but was subsequently withdrawn prior to publication due to submission volume limits. It is currently under consideration for publication elsewhere

详情

AI中文摘要

2026年欧盟安全与可持续设计（SSbD）框架、企业可持续发展尽职调查指令（CSDDD）和碳边境调节机制（CBAM）的融合，为先进半导体制造设施（“智能晶圆厂”）带来了严重的治理瓶颈。法规合规需求已超出人工企业报告的能力，在多利益相关方透明度与企业数据隐私之间造成了直接冲突。本文通过引入一个零信任的社会技术编排框架来应对这一挑战，该框架在可信工业数据空间中实现了六层SSbD参考架构的操作化。我们提出从被动自动化向自主治理的转变，通过“专业代理”——在硬件隔离信任区内执行的基于角色的代理工作流。该框架结构化为一个可互操作的网络协议栈，协调设施、工艺工程和财务代理团队之间的自动化“五步接力赛”，将工厂车间的良率模型与宏观可持续发展指令对齐。通过在基于硬件的可信执行环境（TEE）中执行虚拟量测（VM）预测和联邦机器学习（FML），该架构解决了数据主权悖论，展示了晶圆厂如何通过国际数据空间（IDS）连接器导出加密签名的合规令牌，而无需暴露专有工艺配方。最终，该框架为技术管理者提供了一条可验证、基于证据的路径，通向有韧性的净零工业5.0生态系统。

英文摘要

The convergence of the 2026 European Union Safe and Sustainable by Design (SSbD) framework, Corporate Sustainability Due Diligence Directive (CSDDD), and Carbon Border Adjustment Mechanism (CBAM) introduce a severe governance bottleneck for advanced semiconductor manufacturing facilities ("Smart Fabs"). Regulatory compliance demands have surpassed the capacity of manual corporate reporting, creating a direct conflict between multi-stakeholder transparency and corporate data privacy. This paper addresses this challenge by introducing a zero-trust socio-technical orchestration framework that operationalizes a six-layer SSbD reference architecture within trustworthy industrial data spaces. We propose a shift from reactive automation to autonomous governance through "Professional Proxies"-role-based agentic workflows executing within hardware-isolated trust zones. Structured as an interoperable network protocol stack, the framework coordinates an automated, five-step "relay race" between Facility, Process Engineering, and Finance proxy teams to align factory-floor yield models with macro-level sustainability mandates. By executing Virtual Metrology (VM) predictions and Federated Machine Learning (FML) inside hardware-rooted Trusted Execution Environments (TEEs), this architecture resolves the Data Sovereignty Paradox, demonstrating how fabs can export cryptographically signed compliance tokens via International Data Spaces (IDS) connectors without exposing proprietary process recipes. Ultimately, this framework provides technology managers with a verifiable, evidence-based pathway toward resilient, net-zero Industry 5.0 ecosystems.

URL PDF HTML ☆

赞 0 踩 0

2606.09213 2026-06-09 cs.PL cs.LG 新提交

SNN-MLIR: An MLIR Dialect for Compiling Neuromorphic SNNs from NIR to Bare-Metal C

SNN-MLIR：一种用于将神经形态SNN从NIR编译到裸机C的MLIR方言

Alejandro García Gener, Alvaro Rollón de Pinedo

发表机构 * INTERA-Group（INTERA小组）

AI总结提出SNN-MLIR，一种MLIR方言，通过NIR-MLIR-C编译桥将神经形态SNN模型从框架无关的NIR格式编译为可移植的C代码，支持浮点和量化数据，实现从仿真到硬件部署的统一中间表示。

Comments 8 pages, 5 figures, 5 tables

详情

AI中文摘要

脉冲神经网络（SNN）越来越多地在各种框架（SnnTorch、Lava、Norse等）中训练，每个框架都有自己的模型格式。神经形态中间表示（NIR）通过提供一种通用的、框架无关的格式来交换训练好的SNN模型，解决了碎片化问题。NIR解决了交换问题，但仅止于此。它提供了网络的描述，而非运行网络的路径。每个后端仍需自行实现部署，之间没有共享的、可转换的编译器表示。本文提出snn-mlir，一种用于SNN的树外MLIR方言，以及一个NIR-MLIR-C编译桥。该方言提供了一小组类型多态操作，这些操作在浮点（f32/f64）和量化数据上行为一致，因此单一的中间表示同时服务于仿真和面向硬件的部署。一个Python前端读取任何NIR文件并发出方言IR，自动插入重新缩放操作以保持各层量化尺度一致。一个参考降级过程将方言转换为标准的linalg和arith操作，工具链从中生成自包含、无依赖的C11代码，可在任何支持C的CPU或嵌入式目标上编译和运行。我们评估了数值精度与参考输出的匹配度、跨CPU目标的可移植性以及量化的代价。当前范围是前馈全连接网络，后端为CPU。snn-mlir以Apache-2.0许可证（含LLVM例外）开源发布，并已在GitHub上可用。

英文摘要

Spiking neural networks (SNNs) are increasingly trained in a wide range of frameworks (SnnTorch, Lava, Norse, and others) each with its own model format. The Neuromorphic Intermediate Representation (NIR) addresses this fragmentation by providing a common, framework-independent format for exchanging trained SNN models. NIR solves the exchange problem, but it stops there. It provides a description of a network, not a path to running one. Each backend is still left to implement deployment on its own, with no shared, transformable compiler representation in between. This paper presents snn-mlir, an outof-tree MLIR dialect for SNNs together with a NIR-MLIR-C compilation bridge. The dialect provides a small set of typepolymorphic operations that work identically on floating-point (f32/f64) and quantized data, so a single intermediate representation serves both simulation and hardware-oriented deployment. A Python front end reads any NIR file and emits dialect IR, automatically inserting rescaling operations to keep quantization scales consistent across layers. A reference lowering pass converts the dialect to standard linalg and arith operations, from which the toolchain produces self-contained, dependency free C11 code that compiles and runs on any C-capable CPU or embedded target. We evaluate numerical fidelity against reference outputs, portability across CPU targets, and the cost of quantization. The current scope is feedforward, fully-connected networks with a CPU backend. snn-mlir is released as open source under the Apache-2.0 license with LLVM-exception and it is already available on Github.

URL PDF HTML ☆

赞 0 踩 0

2606.09200 2026-06-09 cs.DC cs.AI 新提交

Resource-aware Computation-Communication Overlap for multi-GPU ML Workloads

面向多GPU机器学习工作负载的资源感知计算-通信重叠

Minyu Cui, Miquel Pericas

发表机构 * Chalmers University of Technology and University of Gothenburg（查尔姆斯理工大学和哥德堡大学）

AI总结针对多GPU训练中通信瓶颈，提出通过共享内存占用整形和通信流优先级提升实现计算与通信重叠，在多种GPU上减少执行时间达25.5%。

Comments To appear at the AI on HPC Workshop at ISC 2026, held in conjunction with ISC 2026

详情

AI中文摘要

大规模机器学习的快速增长使得跨多GPU的分布式训练成为现代ML系统的基本组成部分。随着模型大小和计算吞吐量的持续增加，通信开销已成为多GPU训练中的主要瓶颈，特别是在计算和通信顺序执行时。本文探索了使用两种可移植运行时控制实现计算和集体通信的并发执行：用于计算内核的共享内存驱动占用整形和用于通信内核的提升调度优先级。我们的方法通过每块共享内存分配来调节计算内核的驻留，为通信内核留下足够的片上资源以取得进展。此外，为通信流分配更高的优先级确保一旦资源可用，通信进展稳定。在NVIDIA A40、A100、H100和AMD MI250X GPU上的实验表明，所提出的方法能够实现有效的计算-通信重叠，并将总执行时间减少高达25.5%，而无需修改供应商库或内核实现。

英文摘要

The rapid growth of large-scale machine learning (ML) has made distributed training across multiple GPUs a fundamental component of modern ML systems. As model sizes and computational throughput continue to increase, communication overhead has become a dominant bottleneck in multi-GPU training, particularly when computation and communication are executed sequentially. This work explores concurrent execution of computation and collective communication using two portable runtime controls: shared-memory-driven occupancy shaping for computation kernels and elevated scheduling priority for communication kernels. Our approach regulates computation-kernel residency through per-block shared-memory allocation, leaving sufficient on-chip resources for communication kernels to make progress. In addition, assigning higher priority to communication streams ensures steady communication progress once resources become available. Experiments on NVIDIA A40, A100, H100, and AMD MI250X GPUs demonstrate that the proposed method enables effective computation-communication overlap and reduces total execution time by up to 25.5 percent, without modifying vendor libraries or kernel implementations.

URL PDF HTML ☆

赞 0 踩 0