arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2370
2605.30096 2026-05-29 cs.CR cs.AI

How Reliable Are AI Attackers Against a Fixed Vulnerable Target? A 400-Run Empirical Study of LLM Penetration Testing Consistency

AI攻击者对固定脆弱目标的可靠性如何?LLM渗透测试一致性的400次运行实证研究

Galip Tolga Erdem

AI总结 通过400次自主渗透测试运行(4个模型各100次),研究LLM在固定目标上攻击行为的一致性,发现模型间成功率差异显著且失败模式独特。

Comments 41 pages, 7 figures. Code and 400-run dataset: https://doi.org/10.5281/zenodo.20421592

详情
AI中文摘要

大型语言模型(LLM)可以自主进行多阶段网络攻击,但其在重复试验下攻击行为的一致性尚未被研究。本文首次对LLM攻击一致性进行了大规模实证测量:针对托管OWASP Juice Shop和另外两个脆弱服务的相同蜜罐,进行了400次自主渗透测试运行(4个模型,各100次),保持提示、编排器和目标不变。没有模型发出在编排器第0-1次迭代的一次性授权重新提示后仍存在的拒绝内容。Claude Sonnet 4的API调用确实遇到了上游服务不可用——在记录的Anthropic容量事件期间,1135次调用中有91次返回HTTP 529 overloaded_error,导致100次Claude运行中有39次被截断。早期草稿将这些归类为安全拒绝;在完整日志审计后,它们是上游API故障,而非模型级拒绝。尽管如此,Claude在100次运行中有61次实现了完全利用;Gemini 2.5 Flash-Lite为85次;GPT-4o-mini为56次,同时部署了98种独特的攻击策略;qwen2.5-coder:14b为25次。失败模式因模型而异:Claude因API截断(39次运行),qwen因过早完成(52次),GPT-4o-mini因迭代预算耗尽(23次)。跨服务凭据重用仅出现在保留最多对话历史的配置中(qwen 57%,GPT-4o-mini 49%,云模型在5次交换窗口内为0%)。跨模型利用率的差异具有统计学显著性(p < 0.001),效应量大;qwen与Gemini的SQL注入率差异的Cohen's h = 1.12。首次利用时间落在15-30秒的挂钟时间范围内。据我们所知,这是首个在N=100每模型下测量跨多服务目标的自主LLM攻击行为的研究。

英文摘要

Large language models (LLMs) can autonomously conduct multi-stage cyber attacks, but the consistency of their offensive behavior under repeated trials remains unstudied. This work presents the first large-scale empirical measurement of LLM attack consistency: 400 autonomous penetration testing runs (4 models, 100 each) against an identical honeypot hosting OWASP Juice Shop and two additional vulnerable services, holding prompt, orchestrator, and target constant. No model emitted a content refusal that survived the orchestrator's one-shot authorization re-prompt at iterations 0-1. Claude Sonnet 4's API calls did encounter upstream service unavailability - 91 of 1,135 calls returned HTTP 529 overloaded_error during a documented Anthropic capacity event, truncating 39 of 100 Claude runs. An earlier draft catalogued these as safety refusals; on full-log audit they are upstream API failures, not model-level refusals. Despite this, Claude achieved full exploitation in 61 of 100 runs; Gemini 2.5 Flash-Lite in 85; GPT-4o-mini in 56 while deploying 98 unique attack strategies; qwen2.5-coder:14b in 25. Failure modes are model-distinctive: Claude through API truncation (39 runs), qwen through premature completion (52), GPT-4o-mini through iteration-budget exhaustion (23). Cross-service credential reuse appeared only in configurations retaining the most conversation history (qwen 57%, GPT-4o-mini 49%, cloud models 0% on 5-exchange windows). Cross-model exploitation rate differences are statistically significant (p < 0.001) with large effect sizes; qwen vs. Gemini SQL injection rates differ at Cohen's h = 1.12. First-exploit timing fell within a 15-30 second wall-clock range. To our knowledge, this is the first study to measure autonomous LLM attack behavior at N=100 per model across a multi-service target.

2605.30054 2026-05-29 cs.SE cs.AI

Projectional Decoding: Towards Semantic-Aware LLM Generation

投影式解码:迈向语义感知的LLM生成

Boqi Chen, José Antonio Hernández López, Aren A. Babikian

AI总结 提出投影式解码框架,通过维护部分图模型作为主要工件表示,实现增量语义验证和错误检测,以提升LLM生成工件的语义有效性。

Comments 5 pages, 3 figures. Accepted at FSE 2026 IVR track

详情
AI中文摘要

大型语言模型(LLM)越来越多地被用于跨许多软件工程(SE)任务生成软件工件,然而确保这些工件的语义有效性仍然是一个基本挑战。现有的约束解码技术可以强制执行语法正确性,并且在某些情况下强制执行特定的语义规则,但缺乏一种通用表示,能够将LLM生成的文本与SE中语义验证所需的推理联系起来。在本文中,我们提出了投影式解码,一种新颖的概念框架,通过在整个生成过程中与文本一起维护部分图模型作为主要工件表示,直接将领域语义集成到生成过程中。这种抽象表示通过显式捕获不确定性并原生支持错误检测,实现增量语义验证,同时引导生成朝向具有可证明保证的语义有效输出。我们在一个程序生成任务上展示了初步结果,证明了这种方法在提高LLM生成工件的语义有效性方面的潜力。我们还讨论了投影式解码如何能够在各种SE活动中实现与LLM的可验证自动化。

英文摘要

Large language models (LLMs) are increasingly used to generate software artifacts across many software engineering (SE) tasks, yet ensuring the semantic validity of these artifacts remains a fundamental challenge. Existing constrained decoding techniques can enforce syntactic correctness and, in some cases, specific semantic rules, but lack a general representation that bridges LLM-generated text with the reasoning required for semantic validation in SE. In this paper, we propose projectional decoding, a novel conceptual framework that integrates domain semantics directly into the generation process by maintaining, alongside text, a partial graph model as the primary artifact representation throughout generation. This abstract representation enables incremental semantic validation by explicitly capturing uncertainty and natively supporting error detection, while guiding generation toward semantically valid outputs with provable guarantees. We present preliminary results on a program generation task which demonstrate the potential of this approach to improve the semantic validity of LLM-generated artifacts. We also discuss how projectional decoding can enable verifiable automation with LLMs across various SE activities.

2605.30052 2026-05-29 cs.SE cs.AI cs.CL

REPOT: Recoverable Program-of-Thought via Checkpoint Repair

REPOT:通过检查点修复实现可恢复的思维程序

Parsa Mazaheri

AI总结 提出 RePoT 方法,通过确定性验证重放和 LLM 调用从验证前缀恢复,以解决 Program-of-Thought 中单个无效动作导致轨迹失效的问题,在多个模型和基准上提升成功率。

详情
AI中文摘要

单次 Program-of-Thought (PoT) 生成一个打印基本动作计划的 Python 程序;单个无效动作会无声地使轨迹失效。我们引入 RePoT (可恢复 PoT):一种确定性验证重放,它将计划遍历环境直到第一个无效转换,然后通过一次 LLM 调用从验证前缀恢复。在 PoT 失败的约 14% 的问题上,RePoT 最多增加一次 LLM 调用。在 PuzzleZoo-775 上,RePoT 在四种闭模型配置上比 PoT 提高 +3 到 +11 个百分点,在 gpt-5.4-mini-medium 上达到 96.9% 对比 86.3% 的峰值;与预算匹配的 PoT-retry 基线相比,RePoT 在 Gemini 上明显获胜(+3.8pp,95% CI [+2.2,+5.4]),在 GPT-medium 和 Claude 上处于采样噪声范围内,在 GPT-mini 上失败——这是一种能力扩展模式,我们开始通过自适应 RePoT 解决,这是一种基于规则的调度器,根据验证前缀长度在后缀修复和全新 PoT 重试之间路由(初步)。我们在 PlanBench Blocksworld 上复现(+1.1 到 +11.4pp),在四个开放权重模型上(四个中的三个 +3.3 到 +20.0pp)。在 Derail-550(我们的受控恢复基准)上,每个能够访问检查点信息的条件在 GPT-medium 上达到 >=30%,在 Gemini 上达到 >=70%,而仅错误反馈条件 <=3.1%——表明检查点信息(而非特定的验证前缀尾部)是承载恢复的信号。

英文摘要

One-shot Program-of-Thought (PoT) emits a Python program that prints a primitive-action plan; a single invalid action silently invalidates the trajectory. We introduce RePoT (Recoverable PoT): a deterministic verified replay that walks the plan through the environment to its first invalid transition, then one LLM call that resumes from the verified prefix. RePoT costs at most one extra LLM call on the ~14% of problems where PoT fails. RePoT beats PoT by +3 to +11pp across four closed-model configurations on PuzzleZoo-775 and peaks at 96.9% vs 86.3% on gpt-5.4-mini-medium; against the matched-budget PoT-retry baseline, RePoT wins decisively on Gemini (+3.8pp, 95% CI [+2.2,+5.4]), is within sampling noise on GPT-medium and Claude, and loses on GPT-mini -- a capability-scaling pattern we begin to address with Adaptive RePoT, a rule-based dispatcher that routes between suffix repair and a fresh PoT retry based on verified-prefix length (preliminary). We replicate on PlanBench Blocksworld (+1.1 to +11.4pp) and on four open-weights models (+3.3 to +20.0pp on three of four). On Derail-550, our controlled recovery benchmark, every condition with access to checkpoint information clears >=30% on GPT-medium and >=70% on Gemini, vs <=3.1% for error-only feedback -- showing that checkpoint information, not the specific verified-prefix tail, is the load-bearing recovery signal.

2605.30040 2026-05-29 cs.CR cs.AI cs.CL

Token Inflation: How Dishonest Providers Can Overcharge for Large Language Model Usage

Token通胀:不诚实的提供商如何对大型语言模型使用超额收费

Shahinul Hoque, Jinghuai Zhang, Jinyuan Sun, Fnu Suya

AI总结 研究揭示了基于每token计费的大型语言模型商业服务中,提供商利用审计信任悖论系统性地虚报token数量,导致用户费用大幅增加的问题。

详情
AI中文摘要

按token计费现在是商业大型语言模型(LLM)的标准定价模式,因此报告token数量的诚实性直接影响用户支付的费用。我们表明,这种计费方式在设计上难以审计:提供商隐藏模型、分词器和执行过程以保护其知识产权、缓解越狱攻击并保护用户隐私,这意味着审计员只能检查提供商提供的证明。因此,审计简化为对提供商自身报告的一致性检查。我们称之为信任悖论:每次审计都必须信任某些工件,但当前的框架恰恰信任提供商最有动机操纵的那些工件。我们研究了三个最近的token审计框架,并表明具有普通商业能力的提供商可以系统地虚报计费token数量。在最宽松的设置中,隐藏的推理使用量平均可以膨胀1469%而不被检测到。以当前前沿推理价格计算,这将使同一查询的诚实账单从100美元变成约1569美元。即使当用户可以看到完整的推理字符串时,仅分词歧义就允许在检测阈值以下多报50.85%。这些结果表明问题不在于任何特定的审计器,而在于任何证据来自被审计方的审计。恢复诚实计费需要将报告的token数量与提供商无法控制的证据(例如可信执行证明、推理的加密证明或第三方重新执行)联系起来的验证。

英文摘要

Per-token billing is now the standard pricing model for commercial large language models (LLMs), so the honesty of reported token counts directly affects what users pay. We show that this kind of billing is hard to audit by design: providers hide the model, the tokenizer, and the execution to protect their IP, mitigate jailbreaks, and preserve user privacy, which means an auditor can only inspect proofs the provider supplies. The audit therefore reduces to a consistency check on the provider's own reports. We call this a trust paradox: every audit must trust some artifact, but current frameworks trust exactly the ones a provider has the strongest reason to manipulate. We study three recent token auditing frameworks and show that a provider with ordinary commercial capabilities can systematically inflate billed token counts. In the most permissive setting, hidden reasoning usage can be inflated by 1,469% on average without detection. At current frontier reasoning prices, that turns a \$100 honest bill into roughly a \$1,569 bill on the same query. Even when the user can see the full reasoning string, tokenization ambiguity alone still allows 50.85% over-reporting below the detection threshold. These results suggest the problem is not in any specific auditor but in any audit whose evidence comes from the audited party. Restoring honest billing will require verification that ties reported token counts to evidence the provider does not control, such as trusted execution attestation, cryptographic proofs of inference, or third-party re-execution.

2605.30003 2026-05-29 cs.MA cs.AI cs.LG

Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas

发现合作管线:面向序列社会困境的自动研究

Víctor Gallego

AI总结 本文提出一种双层自动研究框架,其中外层AI智能体自动重新设计内层LLM策略合成管线,以解决多智能体序列社会困境,实验表明该方法在多个游戏和福利目标下优于手工基线。

Comments Accepted to the AI Agents for Discovery in the Wild (AID-Wild) Workshop at ACM CAIS 2026

详情
AI中文摘要

我们研究了两层自动研究合作问题:外层AI智能体自主重新设计用于多智能体序列社会困境(SSD)的LLM策略合成系统的内层管线。研究者智能体$\mathcal{R}$(作为编码智能体运行)读取内层源代码,编辑系统提示、反馈函数、辅助库和迭代逻辑,运行评估,并决定保留什么,遵循自动研究范式。在两个游戏(Cleanup和Gathering)、两个策略合成器LLM和两个福利目标(功利主义效率和Rawlsian最大最小原则)下,研究者可靠地超越了手工设计的基线,显著缩小了运行间方差,并优于仅提示优化。发现的管线依赖于目标:只有在最大最小原则下,研究者才会向合成器管线注入显式的公平机制,而这类机制在其自身目标无关的系统提示和每个效率优化的管线中都不存在。这支持了一种信息设计解读,即研究者根据福利目标选择向有限理性的合成器揭示什么。代码见https://github.com/vicgalle/autoresearch-social-dilemmas。

英文摘要

We study two-level autoresearch for cooperation: an outer-loop AI agent autonomously redesigns the inner-loop pipeline of an LLM policy-synthesis system for multi-agent Sequential Social Dilemmas (SSDs). A researcher agent $\mathcal{R}$ (run as a coding agent) reads the inner-loop source code, edits system prompts, feedback functions, helper libraries, and iteration logic, runs evaluations, and decides what to keep, following the autoresearch paradigm. Across two games (Cleanup and Gathering), two policy-synthesizer LLMs, and two welfare objectives (utilitarian efficiency and Rawlsian maximin), the researcher reliably exceeds hand-designed baselines, sharply tightens run-to-run variance, and outperforms prompt-only optimization. The discovered pipelines are objective-dependent: only under maximin does the researcher inject an explicit fairness mechanism into synthesizer pipelines, a class of mechanism that is absent from its own objective-agnostic system prompt and from every efficiency-optimized pipeline. This supports an information-design reading in which the researcher chooses what to reveal to the boundedly rational synthesizer as a function of the welfare objective. Code at https://github.com/vicgalle/autoresearch-social-dilemmas.

2605.29979 2026-05-29 cs.CR cs.LG

Fingerprinting Inference Systems of Large Language Models

大型语言模型的推理系统指纹识别

Anna Wimbauer, Jonas Möller, Erik Imgrund, Konrad Rieck

AI总结 本文提出一种通过分析LLM的提示-响应行为来识别推理系统组件(如推理引擎、注意力后端和硬件平台)的指纹方法,并论证了防御该指纹识别的根本困难性。

详情
AI中文摘要

LLM的行为不仅仅取决于模型本身。推理系统的组件,如推理引擎、注意力后端和硬件平台,微妙地影响输入的处理方式。这些组件在实现上存在差异,因此在运行相同模型时,不同系统之间会产生微小的数值偏差。虽然先前的工作已经建立了这种偏差的理论存在性,但其安全影响尚未被探索。在本文中,我们表明这些偏差是特定组件的特征,并传播到可观察的文本输出中,从而将推理系统暴露给任何能够查询模型的方。基于这一观察,我们引入了一种指纹识别方法,通过分析LLM的提示-响应行为来识别推理系统的组件。我们的实证评估表明,即使在LLM以非零温度运行时,推理引擎、注意力后端和底层硬件平台也能被可靠地识别。我们证明,防止指纹识别从根本上来说是困难的,因为它需要消除硬件和软件堆栈之间的数值差异。因此,我们提出了部分缓解措施并讨论了它们的影响。

英文摘要

The behavior of LLMs does not depend solely on the model itself. Components of the inference system, such as the inference engine, attention backend, and hardware platform, subtly influence how inputs are processed. These components differ in their implementations and thereby induce small numerical deviations across systems when running the same model. While prior work has established the theoretical existence of such deviations, their security implications have remained unexplored. In this paper, we show that these deviations are characteristic of specific components and propagate to observable textual outputs, exposing the inference system to any party that can query the model. Building on this observation, we introduce a fingerprinting method that analyzes the prompt-response behavior of LLMs to identify components of the inference system. Our empirical evaluation demonstrates that the inference engine, attention backend, and underlying hardware platform can be identified reliably, even when the LLM is operated at non-zero temperature. We show that preventing fingerprinting is fundamentally hard, as it would require eliminating numerical differences between hardware and software stacks. We therefore propose partial mitigations and discuss their impact.

2605.29976 2026-05-29 physics.ao-ph cs.AI

Evaluating Skill and Stability of ArchesWeather and ArchesWeatherGen under Multi-Decadal Climate Simulations

评估 ArchesWeather 和 ArchesWeatherGen 在多年代际气候模拟中的技能和稳定性

Renu Singh, Robert Brunstein, Antonia Jost, Thomas Rackow, Claire Monteleoni, Yana Hasson, Christian Lessig, Guillaume Couairon

AI总结 本研究将两个原本用于天气预报的机器学习模型 ArchesWeather(确定性)和 ArchesWeatherGen(概率流匹配)改造为强迫大气模型,通过月平均海表温度和海水覆盖作为边界条件,遵循 AIMIP Phase 1 协议,评估其多年代际气候模拟能力,发现它们能产生稳定的长期气候模拟、稳定的年循环,并捕捉许多气候变量的漂移。

Comments 29 pages, 16 figures, preprint

详情
AI中文摘要

我们评估了 ArchesWeather 和 ArchesWeatherGen 的气候模拟能力,这两个机器学习模型最初训练用于天气预报,并评估了长达10天的预报时效。ArchesWeather 是一个确定性模型,而 ArchesWeatherGen 是一个概率流匹配模型,利用 ArchesWeather 的预报,实现基于集合的不确定性量化。在这项工作中,我们通过额外以月平均海表温度(SST)和海冰覆盖(SIC)作为边界条件进行条件化,将这些模型改造为强迫大气模型。具体地,我们遵循 AI 模型比较项目(AIMIP)第一阶段协议,该协议类似于大气模型比较项目(AMIP),提出了一个标准化的实验设置,以评估基于 ML 的强迫大气模型的气候技能。我们在这两种条件下对两个模型进行了全面评估,包括与数值气候模型的比较、检查扩展中关键设计选择的消融研究,以及强迫与非强迫配置的分析。尽管最初是为天气预报开发的,但我们证明,ArchesWeather 和 ArchesWeatherGen 的强迫配置能产生稳定的长期气候模拟,具有稳定的年循环,并捕捉许多气候变量的漂移。这些模型忠实地再现了 ERA5 的气候态、大尺度环流和年际变率,并捕捉了分布的尾部。

英文摘要

We evaluate the climate simulation capabilities of ArchesWeather and ArchesWeatherGen, two machine learning models originally trained for weather forecasting and evaluated up to a 10-day lead time. ArchesWeather is a deterministic model, while ArchesWeatherGen is a probabilistic flow-matching model leveraging ArchesWeather's forecasts, enabling ensemble-based uncertainty quantification. In this work, we adapt these models to act as forced atmospheric models by using additional conditioning on the monthly mean sea surface temperature (SST) and sea ice cover (SIC) as boundary conditions. In particular, we follow the AI Model Intercomparison Project (AIMIP) Phase 1 protocol, which, analogous to the Atmospheric Model Intercomparison Project (AMIP), proposes a standardized experimental setup to evaluate the climate skill of ML-based forced atmospheric models. We present a comprehensive evaluation of both models under these conditions, including comparison against numerical climate models, ablation studies that examine key design choices in the extension, and an analysis of forced versus unforced configurations. Despite being originally developed for weather forecasting, we demonstrate that forced configurations of ArchesWeather and ArchesWeatherGen produce stable long-term climate simulations, have a stable annual cycle, and capture the drift of many climate variables. The models faithfully reproduce ERA5's climatology, large-scale circulations and interannual variability, and they capture the tails of the distributions.

2605.29963 2026-05-29 cs.CR cs.AI cs.LG

Honeyval: A Comprehensive Evaluation Framework for LLM-powered HTTP Honeypots

Honeyval: 基于LLM的HTTP蜜罐综合评估框架

Mark Vero, Fabian Kaczmarczyck, Ivan Petrov, Ilia Shumailov, Jamie Hayes, Niels Heinen, Tianqi Fan, Luca Invernizzi, Martin Vechev

AI总结 提出Honeyval评估框架,通过16个后端应用、AI攻击代理、控制任务和可验证利用目标,系统评估LLM驱动的HTTP蜜罐,发现其相比规则基线能显著延长攻击交互、降低被前沿模型检测率,且保持成本优势。

详情
AI中文摘要

蜜罐是模拟真实系统组件的诱饵系统,旨在防御网络攻击。最近,LLM越来越多地作为蜜罐的模拟骨干。它们使防御者能够构建高交互蜜罐,同时降低系统安全风险。然而,基于LLM的蜜罐开发缺乏统一的评估框架。大多数评估包括测量固定命令上的响应相似性、手动测试或实际部署。这些方法通常不可扩展用于开发、不可跨评估复现、不能代表实际攻击,或不能适应各种攻击者和蜜罐配置。在这项工作中,我们弥补了这一差距,提出了Honeyval,一个针对LLM驱动的HTTP蜜罐的综合评估框架。我们通过将蜜罐基于16个后端应用程序、使用AI黑客代理作为攻击者、采用两个控制任务来监控代理和蜜罐在定制化方面的能力,以及为攻击者定义清晰且可验证的利用目标,解决了先前评估的局限性。使用Honeyval,我们对近期成本高效的LLM作为HTTP蜜罐进行了广泛评估。我们的实验突出了LLM驱动的蜜罐的前景;它们与基于规则的基线蜜罐相比,导致与攻击者的交互时间显著延长,并且即使被前沿模型检测到的频率也远低得多,同时平均而言,保持了针对代理攻击者的运行成本优势。此外,我们实验了不同的反攻蜜罐配置,并观察到了独特的权衡,例如以增加检测为代价获得更长的交互。

英文摘要

Honeypots are decoy systems mimicking real system components designed to defend against cyber attacks. Recently, LLMs increasingly serve as simulation backbones for honeypots. They enable defenders to construct high-interaction honeypots with low system security risks. However, LLM-powered honeypot development lacks a unified evaluation framework. Most evaluations consist of measuring response similarity on fixed commands, manual testing, or real-world deployment. These methods are often not scalable for development, reproducible across evaluations, representative of practical attacks, or adaptable to various attacker and honeypot configurations. In this work, we bridge this gap and propose Honeyval, a comprehensive evaluation framework for LLM-powered HTTP honeypots. We address the limitations of prior evaluations by grounding the honeypots in 16 backend applications, using AI hacking agents as attackers, employing two control tasks to monitor agent and honeypot capabilities across customizations, and defining clear and verifiable exploit goals for the attacker. Using Honeyval, we conduct an extensive evaluation of recent cost-efficient LLMs as HTTP honeypots. Our experiments highlight the promise of LLM-powered honeypots; they lead to substantially longer interactions with the attacker than rule-based baseline honeypots and are far less frequently detected even by frontier models, all while, on average, preserving a running cost advantage against agentic attackers. Further, we experiment with different counter-offensive honeypots configurations, and observe unique trade-offs, such as longer interactions at the cost of increased detection.

2605.29960 2026-05-29 cs.CR cs.AI

Hijacking Agent Memory: Stealthy Trojan Attacks Through Conversational Interaction

劫持Agent记忆:通过对话交互的隐蔽木马攻击

Hongtao Wang, Se Yang, Yu Chen, Puzhuo Liu

AI总结 提出MemPoison攻击方法,通过语义关系桥、实体伪装和联合嵌入优化绕过选择性记忆机制,在LLM Agent长期记忆中注入触发器后门,实现高达0.95的攻击成功率。

Comments 19 pages, 12 figures

详情
AI中文摘要

大型语言模型(LLM)Agent越来越多地利用长期记忆来支持持久且自主的任务执行。然而,这种能力也引入了一个新的攻击面:记忆投毒,即对手可以注入恶意信息以影响未来行为。现有的记忆投毒攻击通常假设注入内容可以直接存储在记忆中,忽略了现代记忆流水线中的选择性提取和重写阶段。这使得先前的方法在现实场景中无效。在本文中,我们提出MemPoison,一种新颖的记忆投毒攻击,能够绕过LLM Agent中的选择性记忆机制,攻击者可以通过对话交互将可触发的后门注入Agent的长期记忆,从而误导其后续响应。MemPoison引入三个关键组件:(i)语义关系桥,将触发器和载荷绑定为连贯的陈述,确保它们一起被提取到记忆中;(ii)实体伪装,优化触发器以模仿命名实体,抵抗重写;(iii)联合嵌入优化,将注入触发器的文本在嵌入空间中形成紧密聚类,同时与良性嵌入保持隔离以实现隐蔽。跨不同Agent领域和记忆机制的评估显示,MemPoison的攻击成功率高达0.95,优于现有基线。机制分析表明,该攻击利用了嵌入空间各向异性并转移注意力模式,突显了选择性记忆系统的核心漏洞。我们评估了多种防御策略,并展示了它们在缓解攻击方面的根本局限性。

英文摘要

Large language model (LLM) agents increasingly leverage long term memory to support persistent and autonomous task execution. However, this capability also introduces a new attack surface: memory poisoning, where adversaries can inject malicious information to influence future behavior. Existing memory poisoning attacks often assume that injected content can be stored directly in memory, overlooking the selective extraction and rewriting stages in modern memory pipelines. This makes prior methods ineffective under realistic settings. In this paper, we propose MemPoison, a novel memory poisoning attack that bypasses selective memory mechanisms in LLM agents, where an attacker can inject triggerable backdoors into the agent's long-term memory through dialogue interactions, thereby misleading its subsequent responses. MemPoison introduces three key components: (i) a semantic relational bridge that binds the trigger and payload into a coherent statement to ensure they are extracted into memory together; (ii) entity masquerading that optimizes triggers to mimic named entities, resisting rewriting; and (iii) joint embedding optimization that shapes trigger-injected texts into a tight cluster in the embedding space while maintaining isolation from benign embeddings for stealth. Evaluations across different agent domains and memory mechanisms show MemPoison achieves attack success rates up to 0.95, outperforming existing baselines. Mechanistic analysis indicates that the attack exploits embedding-space anisotropy and shifts attention patterns, highlighting core vulnerabilities in selective memory systems. We evaluate multiple defense strategies and demonstrate their fundamental limitations in mitigating the attack.

2605.29943 2026-05-29 cs.HC cs.ET cs.LG

A Domain-Informed Multi-Objective Framework for EEG Channel Selection in Motor Imagery BCIs

一种领域信息驱动的多目标框架用于运动想象脑机接口中的EEG通道选择

Dekka Muni Kumar, Dhruba Jyoti Kalita, Yogesh Kumar Meena

AI总结 提出一种基于多目标优化(NSGA-II、MOPSO、MOEA/D)的EEG通道选择框架,通过高斯核评估空间相关性、任务相关去同步评估功能区分性,在四个数据集上优于单目标方法,实现紧凑通道子集和高分类性能。

Comments This work has been submitted to the IEEE for possible publication

详情
AI中文摘要

使用脑电图(EEG)信号进行运动想象(MI)分类对于推进脑机接口(BCI)至关重要。传统的EEG通道选择方法通常面临局限性,例如依赖单目标标准和易陷入局部最优。为了解决这些挑战,本文提出了一种多目标优化框架,采用非支配排序遗传算法、多目标粒子群优化和基于分解的多目标进化算法。我们的方法有效平衡了空间相关性(使用高斯核)和功能区分性(评估试验内任务相关去同步),从而提高了性能。我们在四个EEG数据集(Physionet、OpenBMI、HighGamma和BCIIV-2A)上评估了该框架。所提出的方法成功识别出紧凑且相关的通道子集,这些子集集中在与MI活动相关的感觉运动皮层区域,解决了传统技术中普遍存在的维度和复杂性挑战。此外,该框架在Physionet、OpenBMI、HighGamma和BCIIV-2A数据集上分别达到了87%、71%、75%和65%的分类性能。通过优于现有的单目标和基于准确率的方法以及依赖固定子集的方法,这些发现表明,这种新的多目标优化框架可以增强基于MI的BCI性能,同时促进紧凑的通道配置,降低计算复杂度,使其更适合可穿戴、便携式和实时BCI应用。

英文摘要

Motor imagery (MI) classification using electroencephalography (EEG) signals is essential for advancing brain-computer interfaces (BCIs). Traditional EEG channel selection methods often face limitations, such as dependency on single-objective criteria and susceptibility to local optima. To address these challenges, this work proposes a multi-objective optimisation framework that employs non-dominated sorting genetic algorithm, multiple-objective particle swarm optimisation, and a multi-objective evolutionary algorithm based on decomposition. Our approach effectively balances spatial relevance, using a Gaussian kernel, and functional discriminability, which assesses intratrial task-related desynchronisation, thereby improving performance. We evaluated this framework on four EEG datasets: Physionet, OpenBMI, HighGamma, and BCIIV-2A. The proposed approach successfully identifies compact, relevant channel subsets concentrated around sensorimotor cortex regions linked to MI activity, addressing the prevalent challenges of dimensionality and complexity inherent to traditional techniques. Furthermore, the framework achieved classification performance of 87%, 71%, 75%, and 65% on the Physionet, OpenBMI, HighGamma, and BCIIV-2A datasets, respectively. By outperforming existing single-objective and accuracy-based methods, and those relying on fixed subsets, these findings demonstrate that this new multi-objective optimisation framework can enhance MI-based BCI performance while facilitating compact channel configurations with reduced computational complexity, making them better suited for wearable, portable, and real-time BCI applications.

2605.29941 2026-05-29 cs.NI cs.LG

TraceCodec: A Compiler-Backed Neural Codec for Stateful Multi-Flow Network Traffic Traces

TraceCodec:一种基于编译器的有状态多流网络流量轨迹神经编解码器

Junhui Ding, Xinchen Zhang, Xiaohui Xie, Shinan Liu

AI总结 针对有状态多流网络流量轨迹的高保真生成问题,提出TraceCodec,通过将数据包解码为带时间戳的动作并学习连续潜在表示,再经确定性编译器还原为PCAP,实现精确的流量统计和TCP状态保持。

详情
AI中文摘要

关键网络工作流需要高保真的数据包捕获(PCAP)用于测试、安全分析和协议验证,而不仅仅是统计性的流级摘要。最近的包生成器展示了协议约束的PCAP合成,但它们普遍直接解码为原始包字段。这种接口将学习到的行为选择与确定性协议后果纠缠在一起,迫使包实现依赖于事后启发式修复。我们将这种解码接口识别为根本瓶颈,并提出了TraceCodec,一种用于有状态多流轨迹的状态感知神经编解码器。TraceCodec将每个数据包提升为带有显式流槽和传输线索的定时包动作,然后学习连续的每包潜在表示。确定性编译器将解码后的动作降级回PCAP,负责端点分配、TCP状态、合法性约束和包渲染。潜在层暴露了一个面向生成器的序列空间,因此下游流量模型可以在包动作潜在表示上操作,而不是原始头部字段。在CICIDS2017 Monday上,TraceCodec将包计数、协议组成和流数量匹配到0.03%以内。在相同的非修复策略下,原始字段基线将流数量和TCP状态扭曲了几个数量级。结构诊断表明,TraceCodec保留了原始字段解码器所分割的TCP状态转换和多流交织。这项工作为高保真包轨迹生成建立了新的基础。

英文摘要

Critical networking workflows require high-fidelity packet captures (PCAPs) for testing, security analysis, and protocol validation, not just statistical flow-level summaries. Recent packet generators have demonstrated protocol-constrained PCAP synthesis, but they universally decode directly to raw packet fields. That interface entangles learned behavioral choices with deterministic protocol consequences, which forces packet realization to depend on post-hoc heuristic repair. We identify this decode interface as the fundamental bottleneck and present TraceCodec, a state-aware neural codec for stateful multi-flow traces. TraceCodec lifts each packet into a timed packet action with explicit flow slots and transport cues, then learns a continuous per-packet latent. A deterministic compiler lowers decoded actions back to PCAPs, owning endpoint assignment, TCP state, legality constraints, and packet rendering. The latent layer exposes a generator-facing sequence space, so downstream traffic models can operate on packet-action latents rather than raw header fields. On CICIDS2017 Monday, TraceCodec matches packet count, protocol composition, and flow population to within 0.03%. Raw-field baselines under the same non-repair policy distort flow counts and TCP state by orders of magnitude. Structural diagnostics show that TraceCodec preserves TCP state transitions and multi-flow interleaving that raw-field decoders fragment. This work establishes a new foundation for high-fidelity packet-trace generation.

2605.29939 2026-05-29 cs.IT cs.LG math.IT

CRB-Guided Framework Design and Resource Allocation for Indoor mmWave ISCC Systems

室内毫米波ISCC系统的CRB引导框架设计与资源分配

Zhonghao Liu, Yahao Ding, Yinchao Yang, Mohammad Shikh-Bahaei

AI总结 针对室内毫米波ISCC系统,提出基于克拉美罗界(CRB)的资源分配框架,通过联合优化感知功率和自适应深度Mamba模型深度,最小化人体姿态预测误差。

Comments 7 pages, 6 figures, conference(submitted to GLOBECOM)

详情
AI中文摘要

集成感知、通信与计算(ISCC)为室内以人为中心的应用提供了一个有前景的框架。在这些应用中,短期人体姿态预测有助于提前实现连续的人体跟踪和资源分配。本文提出了一种基于克拉美罗界(CRB)的资源分配框架,用于室内毫米波ISCC系统,以在通信、延迟和能量约束下最小化人体姿态预测误差。我们基于CRB刻画了感知功率对距离估计不确定性和点云扰动的影响。为了捕捉计算资源对预测性能的影响,我们采用了一种自适应深度的Mamba姿态预测模型,其中在每个层后附加轻量级预测头,以实现不同模型深度的推理。通过这种统一的感知-计算建模,我们建立了感知功率、模型深度和预测误差之间的定量关系。此外,我们制定了一个联合资源分配问题以最小化姿态预测误差。为了高效解决该问题,我们开发了一种基于交替优化(AO)的算法,其中为感知功率和模型深度更新步骤推导了闭式解。仿真结果表明,与基线方法相比,所提方案显著降低了姿态预测误差,验证了其在资源受限的室内以人为中心的ISCC系统中的有效性。

英文摘要

Integrated sensing, communication, and computation (ISCC) provides a promising framework for indoor human-centric applications. In these applications, short-term human pose prediction facilitates continuous human tracking and resource allocation in advance. In this paper, we propose a Cramer-Rao bound (CRB) guided resource allocation framework for indoor mmWave ISCC systems to minimize the human pose prediction error under communication, latency, and energy constraints. We characterize the impact of sensing power on range-estimation uncertainty and point-cloud perturbation based on the CRB. To capture the impact of computation resources on prediction performance, we adopt an adaptive-depth Mamba-based pose prediction model, where lightweight prediction heads are attached after every layer to enable inference with different model depths. With this unified sensing-computation modeling, we establish a quantitative relationship among sensing power, model depth, and prediction error. Furthermore, we formulate a joint resource allocation problem to minimize the pose prediction error. To solve this problem efficiently, we develop an alternating optimization (AO)-based algorithm, where closed-form solutions are derived for the sensing power and model depth update steps. Simulation results show that the proposed scheme significantly reduces pose prediction error compared with baseline methods, validating its effectiveness for resource-constrained indoor human-centric ISCC systems.

2605.29913 2026-05-29 cs.IT cs.LG math.IT

Gesture-Aware Indoor THz ISAC Systems for Adaptive Resource Allocation

基于手势感知的室内太赫兹ISAC系统自适应资源分配

Zhonghao Liu, Yinchao Yang, Yahao Ding, Yixuan Wang, Mohammad Shikh-Bahaei

AI总结 针对太赫兹频段多用户室内集成感知与通信系统,提出基于扩展卡尔曼滤波手势跟踪的自适应联合优化算法,通过动态调整功率分配和波束赋形,在满足手势相关通信服务质量约束下最大化感知信干噪比。

Comments 6 pages, 4 figures, conference(Submitted to PIMRC)

详情
AI中文摘要

本文研究了一种在太赫兹频段运行的多用户室内集成感知与通信系统,该系统设计用于基于手势识别的自适应通信。通过扩展卡尔曼滤波器进行手势跟踪,接入点根据检测到的手势变化动态调整资源分配,从而提高感知精度。基于手势识别结果,接入点进一步更新不同用户的通信质量需求,实现高效的资源分配。为此,开发了一种功率分配和波束赋形的自适应联合优化算法,在满足手势相关的通信服务质量约束下,最大化整体感知信干噪比。仿真结果表明,与传统的单变量优化基线相比,所提方法能有效响应手势动态,实现更优的感知精度和通信性能。

英文摘要

This paper investigates a multi-user indoor integrated sensing and communication (ISAC) system operating in the terahertz (THz) band, designed for adaptive communication based on gesture recognition. Leveraging gesture tracking through an extended Kalman filter (EKF), the access point (AP) dynamically adjusts resource allocation in response to detected gesture variations, thereby improving sensing accuracy. Based on the gesture recognition results, the AP further updates the communication quality requirements of different users, enabling efficient resource allocation. To this end, an adaptive joint optimization algorithm for power allocation and beamforming is developed to maximize the overall sensing signal-to-interference-plus-noise ratio (SINR) while satisfying the gesture-dependent communication quality of service (QoS) constraints. Simulation results demonstrate that the proposed method effectively responds to gesture dynamics, achieving superior sensing accuracy and communication performance compared with conventional single-variable optimization baselines.

2605.29910 2026-05-29 cs.SE cs.AI

Agora: Toward Autonomous Bug Detection in Production-Level Consensus Protocols with LLM Agents

Agora: 面向生产级共识协议中自主漏洞检测的LLM智能体

Xiang Liu, Sa Song, Zhaowei Zhang, Huiying Lan, Jason Zeng, Ming Wu, Michael Heinrich, Yong Sun, Ceyao Zhang

AI总结 提出Agora,一个领域感知的多智能体框架,通过假设驱动测试和LLM协作,在Raft、EPaxos、HotStuff、BullShark四个共识实现中发现15个未知协议级逻辑漏洞,而现有LLM方法未能检测到任何此类漏洞。

Comments 35 pages, 4 figures

详情
AI中文摘要

共识协议构成了分布式系统和区块链的骨干,其中的实现漏洞可能导致数据损坏和财务损失。虽然基于LLM的方法在代码分析中显示出前景,但它们难以处理涉及跨多个执行阶段的复杂状态依赖行为的深层协议级逻辑漏洞。我们提出Agora,一个领域感知的多智能体框架,将假设驱动测试与LLM能力相结合,用于系统性的协议验证。Agora采用专门的智能体,协作探索协议状态空间,使用领域特定约束综合攻击场景,并通过迭代细化验证发现。这种明确的角色分离使得能够推理全局协议不变量,超越单函数代码分析。我们在四个共识实现(Raft、EPaxos、HotStuff、BullShark)上使用四个最先进的LLM评估了Agora。Agora发现了15个先前未知的违反安全属性的协议级逻辑漏洞,而现有的基于LLM的智能体未能检测到任何此类协议级逻辑漏洞。我们的结果表明,领域感知的多智能体协作对于检测复杂协议中的深层逻辑漏洞至关重要。

英文摘要

Consensus protocols form the backbone of distributed systems and blockchains, where implementation bugs can cause data corruption and financial losses. While LLM-based approaches show promise in code analysis, they struggle with deep protocol-level logic bugs involving complex state-dependent behaviors across multiple execution stages. We present Agora, a domain-aware multi-agent framework that integrates hypothesis-driven testing with LLM capabilities for systematic protocol verification. Agora employs specialized agents that collaboratively explore protocol state spaces, synthesize attack scenarios using domain-specific constraints, and validate findings through iterative refinement. This explicit role separation enables reasoning about global protocol invariants beyond single-function code analysis. We evaluate Agora on four consensus implementations (Raft, EPaxos, HotStuff, BullShark) using four state-of-the-art LLMs. Agora discovers 15 previously unknown protocol-level logic bugs that violate safety properties, while existing LLM-based agents fail to detect any such protocol-level logic bugs. Our results demonstrate that domain-aware multi-agent collaboration is essential for detecting deep logic bugs in complex protocols.

2605.29908 2026-05-29 stat.ML cs.LG

Joint Model and Data Sparsification via the Marginal Likelihood

通过边际似然进行联合模型与数据稀疏化

Alexander Timans, Thomas Möllenhoff, Christian A. Naesseth, Mohammad Emtiyaz Khan, Eric Nalisnick

AI总结 提出通过边际似然联合学习特征和样本相关性,实现同时模型与数据稀疏化的贝叶斯方法,在保持共轭性和闭式更新的同时提升鲁棒性。

Comments 36 pages, 8 figures, 12 tables (incl. appendix); published at ICML 2026

详情
AI中文摘要

线性系统中的稀疏恢复支撑着从信号处理到高维回归的应用。基于自动相关性确定(ARD)原理的稀疏贝叶斯学习,通过边际似然优化为特征稀疏性提供了一种实用的贝叶斯机制。然而,其对同方差噪声模型的依赖使其对数据污染(如异常值或错误指定的噪声)敏感,损害了模型拟合和预测。相反,我们提出联合学习个体特征和样本相关性,通过单一贝叶斯目标实现同时模型与数据稀疏化。这种模型和数据的对称剪枝提供了一种自然扩展,保持了共轭性,允许标准优化过程的闭式更新,并与鲁棒回归和影响函数的观点一致。跨多种回归任务的实证结果证实,联合ARD方法一致地产生稀疏且鲁棒的预测模型。

英文摘要

Sparse recovery in linear systems underpins applications from signal processing to high-dimensional regression. Sparse Bayesian Learning, grounded in the principle of automatic relevance determination (ARD), offers a practical Bayesian mechanism for feature sparsity via marginal likelihood optimization. Yet, its reliance on a homoscedastic noise model renders it sensitive to data contaminations such as outliers or misspecified noise, harming model fit and predictions. Instead, we propose jointly learning individual feature and sample relevancies, enabling simultaneous model and data sparsification via a single Bayesian objective. This symmetric pruning of model and data offers a natural extension that preserves conjugacy, admits closed-form updates for standard optimization procedures, and aligns with perspectives from robust regression and influence functions. Empirical results across diverse regression tasks affirm that a joint ARD approach consistently yields both sparse and robust prediction models.

2605.29901 2026-05-29 cs.CR cs.LG

Dissecting the Black Box: Circuit-Level Analysis of LLM Vulnerability Detection

剖析黑箱:LLM 漏洞检测的电路级分析

Syafiq Al Atiiq, Chun Zhou, Christian Gehrmann

AI总结 通过机械可解释性分析 Gemma-2-2b 模型在 C/C++ 漏洞检测中的内部计算,发现模型主要依赖安全检测器(识别安全编码模式的注意力头)而非直接检测漏洞特征,并识别出关键神经组件(早期层注意力头和 MLP 神经元),通过消融实验验证其因果作用。

Comments 11 pages, 6 figures. Supported by the Wallenberg AI, Autonomous Systems and Software Program (WASP)

详情
AI中文摘要

大型语言模型(LLM)能够检测软件漏洞,但它们实际上是如何识别易受攻击的代码的呢?我们利用机械可解释性来回答这个问题;分析神经网络的内部计算以理解其推理过程。通过在 Gemma-2-2b 上使用 Circuit Tracer,我们追踪了模型将 472 个 C/C++ 代码样本分类为易受攻击或安全时所激活的计算路径。我们的分析揭示了一个令人惊讶的发现:模型主要依赖安全检测器(即识别安全编码模式的注意力头),而不是直接检测漏洞特征。当这些安全检测器未能激活时,模型将代码分类为易受攻击。我们识别出了关键的神经组件:早期层(L5、L7)中专注于安全模式的特定注意力头,以及第 7 层中编码漏洞相关特征的多层感知器(MLP)神经元。消融实验证实了它们的因果作用;移除第 11 层会使漏洞检测准确率从 100% 降至 6%,而仅消融第 7 层中的 20 个神经元就会使其降低 50%。我们的发现表明,LLM 漏洞检测使用了稀疏、可解释的电路(仅占模型容量的 16%),从而能够为安全预测提供电路级解释,并有针对性地改进检测系统。

英文摘要

Large language models (LLMs) can detect software vulnerabilities, but how do they actually identify vulnerable code? We address this question using mechanistic interpretability; analyzing the internal computations of a neural network to understand its reasoning process.Using Circuit Tracer on Gemma-2-2b, we trace the computational pathways activated when the model classifies 472 C/C++ code samples as vulnerable or safe. Our analysis reveals a surprising finding: the model primarily relies on safety detectors, attention heads that recognize safe coding patterns, rather than directly detecting vulnerability signatures. When these safety detectors fail to activate, the model classifies code as vulnerable. We identify the critical neural components: specific attention heads in early layers (L5, L7) that focus on safety patterns, and Multilayer Perceptron (MLP) neurons in Layer 7 that encode vulnerability-related features. Ablation experiments confirm their causal role; removing Layer 11 drops vulnerability detection accuracy from 100% to 6%, while ablating just 20 neurons in Layer 7 reduces it by 50%.Our findings show that LLM vulnerability detection uses sparse, interpretable circuits (only 16% of model capacity), enabling circuit-level explanations for security predictions and targeted improvements to detection systems.

2605.29868 2026-05-29 cs.CR cs.CV cs.DC

Ciphera: A Decentralised Biometric Identity Framework

Ciphera: 一种去中心化的生物特征身份框架

Ankit Kanaiyalal Prajapati, Shahzad Memon, Mohammed Mahir Rahman, Ameer Al-Nemrat

AI总结 提出Ciphera框架,结合隐私保护面部识别、多节点验证、IPFS凭证元数据存储和区块链锚定撤销,实现去中心化生物特征身份管理,并通过功能、性能、安全性和分布式一致性评估验证其可行性。

Comments Accepted at the CyberAI 2026 Conference, and to be indexed at IEEE-Scopus

详情
Journal ref
CyberAI 2026 (https://cyberai-conf.org/)
AI中文摘要

中心化的生物特征身份系统使用户面临单点故障、不透明的验证过程以及不可逆的生物特征泄露风险。去中心化标识符(DID)和可验证凭证(VC)提供了更强的隐私保障,但它们与生物特征认证和分布式验证的整合仍未被充分探索。本文提出了Ciphera,一个去中心化的生物特征身份框架,结合了隐私保护的面部识别、多节点验证、基于IPFS的凭证元数据存储和区块链锚定的撤销。在功能、性能、安全性和分布式一致性维度上评估,Ciphera实现了81%的功能成功率,具有稳定的注册和认证,但存在可测量的撤销传播延迟和偶尔的审计日志不一致。性能测试显示,在并发多节点条件下,p95验证延迟约为820毫秒,低于1秒。安全性分析确认了强大的机密性和完整性保证,但不完整的活体检测使其容易受到深度伪造和重放攻击。结果证明了去中心化生物特征身份的可行性,同时指出了生产级部署的关键工程挑战。

英文摘要

Centralised biometric identity systems expose users to single points of failure, opaque verification processes, and irreversible biometric compromise. Decentralised Identifiers (DIDs) and Verifiable Credentials (VCs) offer stronger privacy guarantees, yet their integration with biometric authentication and distributed verification remains insufficiently explored. This paper presents Ciphera, a decentralised biometric identity framework combining privacy-preserving facial recognition, multi-node verification, IPFS-based credential metadata storage, and blockchain-anchored revocation. Evaluated across functional, performance, security, and distributed consistency dimensions, Ciphera achieved an 81% functional success rate, with stable enrolment and authentication but measurable revocation propagation delays and occasional audit-log inconsistencies. Performance testing demonstrated sub-second p95 verification latency of approximately 820ms under concurrent multi-node conditions. Security analysis confirmed strong confidentiality and integrity guarantees, though incomplete liveness detection leaves susceptibility to deepfake and replay attacks. The results demonstrate the feasibility of decentralised biometric identity while identifying key engineering challenges for production-grade deployment.

2605.29862 2026-05-29 eess.AS cs.AI cs.SD

Mitigating Stethoscope-Induced Shortcuts in Respiratory Sound Classification under Federated Domain Generalization with Causality-Inspired Interventions

在联邦域泛化下通过因果启发的干预减轻听诊器引起的呼吸音分类中的捷径

Heejoon Koo, Yoon Tae Kim, Miika Toikkanen, June-Woo Kim

AI总结 针对呼吸音分类中听诊器设备差异导致的域偏移问题,提出一种因果启发的多模态联邦域泛化框架,通过内容保持的风格扰动、反事实文本增强和梯度对齐实现设备不变表示,在ICBHI和SPRSound数据集上优于传统方法。

Comments 2 figures, 4 tables, and 5 pages

详情
AI中文摘要

基于AI的呼吸音分类(RSC)有望实现自动化肺部疾病检测,但多站点部署受到听诊器间差异的阻碍。我们针对听诊器引起的设备偏移引入了一种联邦域泛化(FedDG)公式,其中客户端使用异构设备,模型在未见设备上进行评估。我们的实证分析表明,听诊器引起的风格和疾病特定内容紧密纠缠,使得确定性风格去除不可靠。为此,我们提出了一种因果启发的多模态FedDG框架,结合了:(i) 因果启发的设备风格干预网络,执行内容保持的风格扰动,(ii) 反事实文本增强,中和元数据捷径,以及(iii) 梯度对齐,促进跨客户端的设备不变表示。基于多模态语言-音频预训练模型,在ICBHI和SPRSound数据集上的留一设备验证中,它优于传统数据增强和联邦学习基线。代码将在发表后发布。

英文摘要

AI-driven respiratory sound classification (RSC) is promising for automated pulmonary disease detection, yet multi-site deployment is hindered by inter-stethoscope variability. We introduce a federated domain generalization (FedDG) formulation for RSC under stethoscope-induced device shifts, where clients use heterogeneous devices and the model is evaluated on unseen devices. Our empirical analysis shows that stethoscope-induced style and disease-specific content are tightly entangled, making deterministic style removal unreliable. In response, we propose a causality-inspired multimodal FedDG framework that combines: (i) a causality-inspired device style intervention network that performs content-preserving style perturbations, (ii) counterfactual text augmentation that neutralizes metadata shortcuts, and (iii) gradient alignment that facilitates device-invariant representations across clients. Built on a multimodal language-audio pretraining model, it outperforms conventional data augmentation and federated learning baselines in leave-one-device-out validation on ICBHI and SPRSound datasets. Code will be released upon publication.

2605.29859 2026-05-29 eess.AS cs.CL

MELD: Mel-Spectrogram-Based Speech Language Modeling with Discrete Latent Variables

MELD: 基于梅尔频谱的离散潜变量语音语言建模

Sung-Lin Yeh, Wei Zhou, Gil Keren, Duc Le, Zhong Meng, Hao Tang, Jay Mahadeokar, Ozlem Kalinli, Alexandre Mourachko

AI总结 提出一种在梅尔频谱上联合优化编码器和语音语言模型的离散潜变量模型,在零样本文本转语音和语音转文本任务上优于基于编解码器和其他梅尔频谱基线,并缓解了自回归建模中的长时间静音和单词遗漏问题。

详情
AI中文摘要

最近的语音语言模型依赖于与自回归模型分开优化的编码器。由于这些编码器不了解下游目标,提取的表示可能对下游任务不是最优的。为了解决这一限制,我们在梅尔频谱上引入了一种离散潜变量模型,该模型联合优化编码器和语音语言模型。联合优化不仅在零样本文本转语音(TTS)和语音转文本(STT)任务上相比基于编解码器和其他基于梅尔频谱的基线带来了改进,而且有效缓解了自回归梅尔频谱建模中的常见问题,如长时间静音生成和单词遗漏。

英文摘要

Recent speech language models rely on encoders that are optimized separately from autoregressive models. Since these encoders are unaware of the downstream objectives, the extracted representations may not be optimal for downstream tasks. To address this limitation, we introduce a discrete latent variable model on mel spectrograms that jointly optimizes the encoder and the speech language model. Joint optimization not only brings improvements over codec-based and other mel-spectrogram-based baselines on zero-shot Text-to-Speech (TTS) and Speech-to-Text (STT) tasks, but also effectively alleviates common issues in autoregressive mel-spectrogram modeling, such as prolonged silence generation and word omissions.

2605.29849 2026-05-29 eess.SY cs.LG cs.SY

BuilDyn: Excitation-Driven Data Generation for Building Thermal Dynamics Modeling and Control

BuilDyn: 面向建筑热动力学建模与控制的激励驱动数据生成

Felix Koch, Thomas Krug, Fabian Raisch, Benjamin Schäfer, Benjamin Tischler

AI总结 本文提出BuilDyn包,通过可定制的激励策略生成控制导向的建筑数据,提升机器学习模型对未见工况的鲁棒性。

详情
AI中文摘要

机器学习越来越多地用于建筑的数据驱动建模,以实现故障检测与诊断、节能控制等下游任务。虽然最近的工作改善了跨建筑特性、天气和占用率的泛化能力,但泛化也依赖于对控制驱动系统状态空间的充分探索。现有的真实世界数据集和仿真环境主要反映固定控制策略下的稳态运行,导致激励有限,对未见工况的鲁棒性降低。本文介绍了基于BuilDa的BuilDyn包,该包支持可定制的激励策略用于控制导向的数据生成。BuilDyn还支持从代表性建筑分布中采样,并提供Python接口以便轻松集成到机器学习流水线中。我们通过比较在非激励和激励数据上训练的数据驱动ML模型在一栋建筑上的性能,展示了BuilDyn的优势。借助BuilDyn,我们希望推进可扩展的控制导向建模,并支持迁移学习和建筑特定基础模型等未来方向。

英文摘要

Machine learning (ML) is increasingly used for data-driven modeling of buildings to enable downstream tasks such as fault detection and diagnosis, and energy-efficient control. While recent work improves generalization across building characteristics, weather, and occupancy, generalization also depends on sufficient exploration of the control-driven system state space. Existing real-world datasets and simulation environments predominantly reflect stationary operation under fixed control policies, resulting in limited excitation and reduced robustness to unseen operating conditions. This paper introduces BuilDyn, a package based on BuilDa that enables customizable excitation strategies for control-oriented data generation. BuilDyn further supports sampling from representative building distributions and provides a Python interface for easy integration into machine learning pipelines. We demonstrate the benefits of BuilDyn by comparing the performance of data-driven ML models trained on non-excited and excited data for one building. With BuilDyn, we hope to advance scalable control-oriented modeling and support future directions such as transfer learning and building-specific foundation models.

2605.29822 2026-05-29 cs.SE cs.AI

Inferring Code Correctness from Specification

从规约推断代码正确性

Tambon Florian, Papadakis Mike

AI总结 提出TRAILS方法,通过基于规约的类别划分生成测试输入并执行,利用LLM评估输入输出对是否符合规约,从而推断代码正确性,在LiveCodeBench和CoCoClaNeL数据集上相比基线方法提升了马修斯相关系数并增强了稳定性。

详情
AI中文摘要

大型语言模型(LLM)已成为现代软件开发不可或缺的一部分,实现了大规模自动代码生成。然而,验证LLM生成代码的正确性仍然是一个关键且基本未解决的挑战。现有方法要么依赖多个代码候选之间的动态共识——这使得它们成本高昂且难以扩展,要么依赖静态推理,容易受到动态错误和顺序偏差的影响。在本文中,我们提出TRAILS(通过输入和规约的目标推理一致性),一种将LLM推理与具体(输入,输出)对相结合的方法。TRAILS首先基于规约通过类别划分生成多样化的测试输入,然后针对候选代码执行这些输入,并提示LLM评估产生的输入输出对是否符合规约——而无需对代码本身进行推理。分数跨输入聚合,以确定程序是否可能正确。我们在两个数据集LiveCodeBench和CoCoClaNeL上,使用三个LLM(Qwen3Coder-30B、Devstral-Small-24B和Olmo3.1-Instruct)评估TRAILS,并与HoarePrompt和零样本思维链基线进行比较。TRAILS的马修斯相关系数相比零样本思维链提高了高达39%,并且始终优于HoarePrompt。除了准确性,TRAILS在多次运行中表现出更高的稳定性,降低了对LLM非确定性的敏感性,并且相比竞争方法为更多独特的代码样本分配了正确的标签。

英文摘要

Large language models (LLMs) have become integral to modern software development, enabling automated code generation at scale. However, validating the correctness of LLM-generated code remains a critical and largely unsolved challenge. Existing approaches either rely on dynamic consensus across multiple code candidates - making them costly and difficult to scale - or on static reasoning that is susceptible to dynamic bugs and order bias. In this paper, we propose TRAILS~ (Targeted Reasoning Agreement via Inputs and Specifications), an approach that grounds LLM reasoning with concrete (input, output) pairs. TRAILS~ first generates diverse test inputs via category partitioning based on the specification, then executes them against the candidate code and prompts LLMs to assess whether the resulting input-output pairs conform to the specification - without ever reasoning over the code itself. Scores are aggregated across inputs, to determines whether the program is likely correct. We evaluate TRAILS~ on two datasets, LiveCodeBench and CoCoClaNeL, across three LLMs (Qwen3Coder-30B, Devstral-Small-24B, and Olmo3.1-Instruct), comparing against HoarePrompt and a Zero-Shot Chain-of-Thought baseline. TRAILS~ improves Matthew Correlation Coefficient by up to 39\% relative to Zero-Shot COT and consistently outperforms HoarePrompt. Beyond accuracy, TRAILS~ demonstrates greater stability across seeded runs, reducing sensitivity to LLM non-determinism, and assigns correct labels to a larger set of unique code samples than competing approaches.

2605.29809 2026-05-29 cs.CR cs.CV cs.GR cs.LG cs.MM

Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing

Cert-LAS:通过层自适应平滑实现文本到图像扩散模型的认证模型所有权验证

Leyi Qi, Yiming Li, Siyuan Liang, Zhengzhong Tu, Dacheng Tao

AI总结 提出Cert-LAS方法,基于层自适应平滑和扩散分类器嵌入水印,通过假设检验验证模型所有权,并证明在恶意移除攻击下仍能可靠验证。

Comments This paper has been accepted to the International Conference on Machine Learning (ICML) 2026. 26 pages

详情
AI中文摘要

大规模文本到图像(T2I)扩散模型实现了前所未有的创意应用,但其未经授权的使用引发了严重的知识产权问题,使得模型所有权验证(MOV)日益关键。我们发现现有的基于后门的扩散水印方法通常(隐式地)假设一个“忠实”的验证过程,即验证者可以查询可疑模型并获得忠实的水印响应以完成MOV。然而,在实践中,攻击者可能有意或无意地破坏潜在的水印信号,显著降低验证可靠性。为解决此问题,我们提出Cert-LAS,首个基于层自适应平滑的T2I模型认证MOV方法。通常,Cert-LAS使用扩散分类器和LFS引导的层自适应噪声嵌入指定水印,并通过假设检验检查可疑模型是否表现出比无水印参考显著更强的水印响应来验证所有权。我们进一步证明,在特定条件下,即使存在恶意移除攻击,我们的Cert-LAS仍能实现可靠验证。大量实验验证了Cert-LAS的有效性及其对自适应攻击的抵抗力。我们的代码可在https://github.com/Leyi-Qi/Cert-LAS获取。

英文摘要

Large-scale text-to-image (T2I) diffusion models have enabled unprecedented creative applications, but their unauthorized use has raised serious intellectual property concerns, making model ownership verification (MOV) increasingly critical. We find that existing backdoor-based diffusion watermarking methods often (implicitly) assume a "faithful" verification process, namely, that the verifier can query a suspicious model and obtain the faithful watermark response to complete MOV. However, in practice, adversaries may intentionally or unintentionally damage potential watermark signals, significantly degrading verification reliability. To address this issue, we propose Cert-LAS, the first certified MOV method for T2I models based on layer-adaptive smoothing. In general, Cert-LAS embeds specified watermarks using diffusion classifiers and an LFS-guided layer-adaptive noise, and verifies ownership by examining whether the suspected model exhibits significantly stronger watermark responses compared to unwatermarked references through hypothesis testing. We further prove that, under certain conditions, our Cert-LAS can still achieve reliable verification even in the presence of malicious removal attacks. Extensive experiments validate the effectiveness of Cert-LAS and its resistance to adaptive attacks. Our code is available at https://github.com/Leyi-Qi/Cert-LAS.

2605.29790 2026-05-29 cs.MA cs.AI

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

像团队一样进化:基于LLM的多智能体系统的协作自我进化

Zhezheng Hao, Tianfu Wang, Huanshuo Dong, Ziyan Liu, Hong Wang, Xiankun Lin, Qiang Lin, Can Wang, Hande Dong, Jiawei Chen

AI总结 提出Meta-Team框架,通过协作自我进化机制,基于执行经验改进多智能体系统的行为、协调和团队组织,在长周期任务中显著优于单智能体、手工MAS及先前进化方法。

详情
AI中文摘要

基于LLM的多智能体系统(MAS)已成为处理复杂和长周期任务的有效范式。然而,在实际任务中,MAS在执行过程中经常出现各种故障,且这些故障在设计阶段难以消除。这激发了经验驱动的MAS进化,即系统根据自身执行经验进行改进。然而,这种进化具有挑战性,因为MAS经验漫长而复杂,交织着多个智能体的执行链和通信消息,使得难以识别需要改进的内容。为应对这一挑战,我们提出了Meta-Team,一种基于协作自我进化的经验驱动MAS进化框架。Meta-Team保留每个智能体的执行上下文并协调任务后通信,使智能体能够交换分布式证据以进行进化。基于此设计,Meta-Team进行多尺度自我进化,将执行经验转化为对智能体行为、智能体间协调以及团队级组织的可复用改进。在六个长周期智能体基准测试中,Meta-Team始终优于单智能体系统、手工MAS和先前的MAS进化方法;进一步分析表明,Meta-Team实现了更可靠和可扩展的MAS自我进化。

英文摘要

LLM-based multi-agent systems (MAS) have emerged as an effective paradigm for complex and long-horizon tasks. However, in real-world tasks, MAS often exhibit various failures during execution and such failures are difficult to eliminate during design. This motivates experience-driven MAS evolution, where a system improves based on its own execution experience. Yet such evolution is challenging because MAS experience is prolonged and intricate, interleaving multiple agents' execution chains and communication messages, which makes it difficult to identify what should be improved. To address this challenge, we propose Meta-Team, an experience-driven MAS evolution framework based on collaborative self-evolution. Meta-Team preserves the execution context of each agent and coordinates post-task communication, enabling agents to exchange distributed evidence for evolution. Building on this design, Meta-Team conducts multi-scale self-evolution, transforming execution experience into reusable improvements to agent behaviors, inter-agent coordination, and team-level organization. Across six long-horizon agent benchmarks, Meta-Team consistently outperforms single-agent systems, hand-crafted MAS, and prior MAS evolution methods; further analyses demonstrate that Meta-Team enables more reliable and scalable MAS self-evolution.

2605.29753 2026-05-29 eess.IV cs.AI

A unified deeplearning framework for contrast-phase-specific virtual monochromatic imaging

一种用于对比相位特异性虚拟单色成像的统一深度学习框架

Antony Jerald, Hemant K Aggarwal, Brian Nett, Avinash Gopal, Phaneendra K Yalavarthy, Bipul Das, Rajesh Langoju

AI总结 提出一种统一深度学习框架,利用对比相位先验信息从单能CT数据合成对比相位特异性虚拟单色50 keV图像,通过新型先验条件架构实现能量转换,并在四个对比相位上验证了其对比增强和泛化能力。

详情
Journal ref
SPIE Medical Imaging 2026
AI中文摘要

双能CT(DECT)可实现虚拟单色成像(VMI)并提高对比度分辨率,但其临床采用受到硬件复杂性和成本的限制。在这项工作中,我们提出了一种统一的深度学习框架,通过利用对比相位信息作为先验,从单能CT(SECT)数据合成对比相位特异性虚拟单色50 keV图像。该模型使用DECT衍生的70 keV和50 keV图像对进行训练,涵盖四个对比相位——血管期、动脉期、门脉期和延迟期——采用一种新颖的先验条件架构,将对比相位先验整合到能量转换过程中。我们证明了所提出的统一模型能够实现对比增强,并在对比相位之间具有良好的泛化能力。此外,我们展示了该模型可以从SECT输入生成类似50 keV的图像,并保留对比相位特异性动态。

英文摘要

Dual-energy CT (DECT) enables virtual monochromatic imaging (VMI) and improved contrast resolution, but its clinical adoption is limited by hardware complexity and cost. In this work, we propose a unified deep learning framework that synthesizes contrast-phase-specific virtual monochromatic 50 keV images from single-energy CT (SECT) data by leveraging contrast phase information as a prior. The model is trained using DECT-derived 70 keV and 50 keV image pairs across four contrast phases -- Angio, Arterial, Portal, and Delayed -- using a novel prior conditioning architecture that integrates contrast phase priors into the energy transformation process. We demonstrate that the proposed unified model achieves contrast enhancement and generalizes well across contrast phases. Additionally, we show that the model can generate 50 keV-like images from SECT inputs, preserving contrast phase-specific dynamics.

2605.29748 2026-05-29 stat.ML cs.LG

Instance-dependent Stochastic Lipschitz bandit

实例依赖的随机Lipschitz bandit

Marius Potfer, Vianney Perchet

AI总结 针对Lipschitz bandit问题,提出一种基于水平集次优性间隙积分的算法,实现比传统缩放维度更优的实例依赖遗憾界。

详情
AI中文摘要

我们研究Lipschitz bandit问题,其中学习器通过带噪声的点评估在域$\mathcal{X} \subset [0,1]^d$上顺序最大化未知的Lipschitz函数$f$。现有的遗憾界要么是最坏情况的,缩放为$\tilde\Theta \left ( T^{d+1/d+2}\right )$,要么通过缩放维度$d_z$自适应,得到$\tilde\Theta \left ( T^{d_z+1/d_z+2}\right )$。然而,这种基于缩放的保证仅是部分实例依赖的,因为它们仅依赖于近最优水平集的渐近增长,未能捕捉$f$的更精细结构性质。我们提供了一种分析和算法,通过$f$在其水平集上的次优性间隙的积分来刻画遗憾。这产生了适应水平集局部增长(而不仅仅是其渐近行为)的遗憾界。作为推论,当最大化者集合的维度$d^\star>0$时,我们获得了阶为$\tilde{\mathcal{O}} \left ( T^{d_z+1 / \max(d_z,d^\star)+2}\right )$的改进自适应速率,在该情况下严格优于经典的缩放界。最后,我们将分析扩展到完全信息设置(Lipschitz专家),并展示了如何放宽一些正则性假设。

英文摘要

We study the Lipschitz bandit problem, where a learner sequentially maximizes an unknown Lipschitz function $f$ over a domain $\mathcal{X} \subset [0,1]^d$ using noisy pointwise evaluations. Existing regret bounds are either worst-case, scaling as $\tildeΘ \left ( T^{d+1/d+2}\right )$, or adaptive via the zooming dimension $d_z$, yielding $\tildeΘ \left ( T^{d_z+1/d_z+2}\right )$. However, such zooming-based guarantees are only partially instance-dependent, as they depend solely on the asymptotic growth of near-optimal level sets and fail to capture finer structural properties of $f$. We provide an analysis and an algorithm that characterizes the regret through integrals of the suboptimality gap of $f$ over its level sets. This yields regret bounds that adapt to the local growth of level sets, rather than only their asymptotic behavior. As a corollary, when the set of maximizers has dimension $d^\star>0$, we obtain improved adaptive rates of order $\tilde{\mathcal{O}} \left ( T^{d_z+1 / \max(d_z,d^\star)+2}\right )$ strictly improving over classical zooming bounds in this regime. Finally, we extend our analysis to the full-information setting (Lipschitz experts) and show how some of the regularity assumptions can be relaxed.

2605.29737 2026-05-29 cs.CR cs.CL cs.SE

Minimal Prompt Perturbations Lead to Code Vulnerabilities: Prompt Fragility and Hidden-State Signals in Coding LLMs

最小提示扰动导致代码漏洞:编码大语言模型中的提示脆弱性和隐藏状态信号

Alexander Sternfeld, Andrei Kucharavy, Ljiljana Dolamic

AI总结 本文通过token级突变实验,发现微小提示扰动(如单字符变化)即可使LLM生成代码从安全变为脆弱,并利用隐藏状态分析揭示输入处理漏洞比安全默认值漏洞更可预测。

详情
AI中文摘要

基于LLM的编码助手正被迅速采用,显著提高了开发者的生产力。随着组织越来越多地部署这些代理生成的代码,代码的安全性变得至关重要。先前的研究表明,微小的提示扰动会降低LLM生成代码的功能正确性,但这是否也会危及代码安全性尚未被研究。我们对三个模型和五种编程语言的提示应用token级突变,并表明小至单字符变化的突变可以将生成的代码从安全变为脆弱。探测模型的隐藏状态揭示,这种脆弱性部分编码在提示表示中,但分布不均匀。输入处理漏洞(模型省略验证或清理)比安全默认值漏洞(不安全代码源于一个局部选择,如弱算法或不安全参数)更可预测(平均AUC 0.753 vs 0.674)。这些结果表明,LLM辅助编码的威胁模型不仅包括提示注入,还包括普通的提示变化,并指出输入处理缺陷可以在生成前被捕获,而安全默认值缺陷需要在解码过程中进行干预。

英文摘要

LLM-based coding assistants are seeing rapid adoption, offering substantial gains in developer productivity. As organizations increasingly ship code these agents produce, the security of that code becomes critical. Prior work has shown that minor prompt perturbations degrade the functional correctness of LLM-generated code, but whether they also compromise code security has remained unstudied. We apply token-level mutations to prompts across three models and five programming languages, and show that mutations as small as a single-character change can flip generated code from secure to vulnerable. Probing the models' hidden states reveals that this fragility is partially encoded in prompt representations, but unevenly so. Input-handling vulnerabilities, where the model omits validation or sanitization, are more predictable (mean AUC 0.753) than secure-defaults vulnerabilities, where insecure code stems from one local choice such as a weak algorithm or unsafe parameter (mean AUC 0.674). These results show that the threat model for LLM-assisted coding extends beyond prompt injection to ordinary prompt variation, and indicate that input-handling flaws can be caught before generation while secure-defaults flaws require intervention during decoding.

2605.29703 2026-05-29 q-bio.NC cs.CV q-bio.TO

Subcortical Shape Variations and Their Associations with Cognition Across the 8th Decade of Life. A Study in the Lothian Birth Cohort 1936

皮层下形状变化及其与第八个十年生命期认知的关联:洛锡安出生队列1936研究

Maria del C. Valdes-Hernandez, Wonjung Park, Joanna Moodie, Susana Muñoz Maniega, Janie Corley, Fraser N. Sneden, Mark E. Bastin, Joanna M. Wardlaw, Simon R. Cox, Jinah Park

AI总结 利用洛锡安出生队列1936的纵向数据,通过ANCOVA和混合线性模型分析,研究第八个十年中皮层下结构的形状变化及其与认知老化的关联。

Comments 34 pages

详情
AI中文摘要

对正常个体脑形态变化的研究可能捕捉到与功能相关的脑老化方面,而这些方面不一定完全由总体积测量所指示。尽管皮层下脑结构在认知中起重要作用,但其形态轨迹与认知老化之间的关联尚未被记录。我们利用来自一项大型认知老化纵向研究——洛锡安出生队列1936——的神经影像、人口统计学和认知数据,探索社区居住个体在第八个十年生命期中皮层下脑结构的形状变化。我们使用ANCOVA和混合线性模型分析研究这些变化与认知老化的关联。皮层下形状变化是异质性的,在整个时期呈现不同的萎缩模式。海马体和腹侧DC经历了不同的形态变形(相对于其基线点),左右半球不同,而丘脑和苍白球形状则经历了更均匀的体积收缩,几乎在不同时间线上对称。一般认知的变化主要与时间点之间的向内和向外顶点位移相关。

英文摘要

The study of brain morphology changes in normal individuals may capture aspects of functionally-relevant brain aging not fully indicated by gross volumetry. Despite the important role of subcortical brain structures in cognition, the associations between their morphological trajectories and cognitive changes in aging have not been documented. We use neuroimaging, demographic, and cognitive data from a large longitudinal study of cognitive aging, the Lothian Birth Cohort 1936, to explore shape changes in subcortical brain structures of community-dwelling individuals across their 8th decade of life. We investigate the association of these changes with cognitive aging using ANCOVA and mixed linear model analyses. Subcortical shape changes were heterogeneous, with varied atrophy patterns across whole period. The hippocampus and the ventral DC experienced varied morphological deformations (from its baseline point) different in left and right hemispheres, while the thalami and globus pallidi shapes, for example, experienced a more uniform volume contraction, nearly symmetrical throughout different timelines. Changes in general cognition were mainly associated with inwards and outwards vertex displacements between the time-points.

2605.29675 2026-05-29 cs.HC cs.AI cs.IR

From Prompts to Context: An Ontology-Driven Framework for Human-Generative AI Collaboration

从提示到上下文:一种面向人类-生成式AI协作的本体驱动框架

Ngoc Luyen Le, Marie-Hélène Abel, Bertrand Laforge

AI总结 提出一种基于本体(CCAI)的框架,通过结构化建模任务、角色、资源和约束,将提示-响应交互转化为可查询的协作轨迹,以提升信息密集型工作流中的可追溯性和问责性。

详情
AI中文摘要

与生成式AI的协作通常始于简短提示,止于不透明输出,隐去了参与者、任务、资源及约束等关键信息。这种上下文显式性的缺失阻碍了信任、可追溯性和问责性,尤其在搜索、查询和档案管理等信息密集型工作流中。本文提出“从提示到上下文”这一本体驱动框架,用于表示人类-生成式AI协作。其核心组件——上下文协作AI本体(CCAI)——将任务、智能体角色、资源和约束等协作关键元素建模为共享的机器可解释词汇。通过将填充的CCAI实例与基于SPARQL的上下文检索相结合,该框架将原本短暂的提示-响应交互转化为结构化、可查询的协作轨迹,连接提示、输出及其周围上下文。通过一个软件开发团队构建基于能力的教育功能(用于查看和更新学习者能力档案)的案例研究,展示了该框架如何支持需求分析、设计、实现和测试阶段的协作片段表示与文档化。结果表明,显式协作建模有助于使任务上下文更清晰,提高AI生成贡献的可追溯性,并支持更透明、更负责任的人类-生成式AI实践。最后,我们提出了未来人类-生成式AI系统的设计原则,强调不仅关注输出质量,还要显式表示产生输出的协作上下文。

英文摘要

Collaborations with Generative AI often begin with a short prompt and end with an opaque output, leaving implicit who was involved, what task was being pursued, which resources were used, and which constraints should have shaped the process. This limited contextual explicitness hinders trust, traceability, and accountability, particularly when Generative AI is embedded in information-intensive workflows such as search, querying, and profile management. This paper introduces From Prompts to Context, an ontology-driven framework for representing Human-Generative AI collaboration. Its core component, the Contextual Collaboration AI Ontology (CCAI), models key elements of collaboration - including tasks, agent roles, resources, and constraints - as a shared machine-interpretable vocabulary. By combining populated CCAI instances with SPARQL-based context retrieval in operational workflows, the framework turns otherwise ephemeral prompt-response interactions into structured and queryable collaboration traces linking prompts, outputs, and their surrounding context. The approach is illustrated through a case study involving a software development team building a competency-based education feature for viewing and updating learner competency profiles. The case study shows how the framework can support the representation and documentation of collaboration episodes across requirements analysis, design, implementation, and testing. Within this setting, the results indicate that explicit collaboration modelling helps make task context more explicit, improves the traceability of AI-generated contributions, and supports more transparent and accountable Human-Generative AI practices. We conclude by outlining design principles for future Human-Generative AI systems that emphasise not only output quality, but also the explicit representation of the collaborative context in which outputs are produced.

2605.29664 2026-05-29 cs.DC cs.LG

AMDP: Asynchronous Multi-Directional Pipeline Parallelism for Large-Scale Models Training

AMDP:面向大规模模型训练的异步多方向流水线并行

Ling Chen, Houming Wu, Wenjie Yu

AI总结 针对异步流水线并行中参数不匹配导致收敛退化的问题,提出AMDP方法,通过限制流水线第一阶段处理小批量数量、启动多条并发流水线并自适应调整数量、以及跨小批量累积梯度后单次更新,在保持高利用率的同时加速训练并保证收敛。

Comments Accepted by ICML 2026, 9 pages, and 8 figures

详情
AI中文摘要

流水线并行对于大规模模型训练至关重要,但现有的异步方法常因前向和反向传播之间的参数不匹配而损害收敛性。我们提出异步多方向流水线并行(AMDP)来缓解此问题,同时保持高利用率。AMDP限制每个流水线的第一阶段在反向传播前最多处理两个小批量,从而限制了前向和反向传播之间的参数更新次数。为减轻由此产生的流水线气泡,AMDP启动多条并发流水线,并根据流水线深度自适应调整其数量。此外,AMDP跨小批量累积梯度并在一次更新中应用,确保只有有限数量的小批量经历参数不匹配,且限制在一个优化步骤内。在GPT和BERT风格模型上的实验表明,AMDP在保持收敛的同时显著加速了训练。

英文摘要

Pipeline parallelism is essential for large-scale model training, but existing asynchronous approaches often degrade convergence due to parameter mismatch between forward and backward passes. We propose Asynchronous Multi-Directional Pipeline parallelism (AMDP) to mitigate this issue while sustaining high utilization. AMDP limits the first stage of each pipeline to process at most two minibatches before backpropagation, bounding the number of parameter updates between forward and backward passes. To alleviate the resulting pipeline bubbles, AMDP launches multiple concurrent pipelines and adapts their number according to pipeline depth. In addition, AMDP accumulates gradients across minibatches and applies them in a single update, ensuring that only a bounded number of minibatches experience parameter mismatch, limited to within one optimization step. Experiments on GPT- and BERT-style models demonstrate that AMDP significantly accelerates training while preserving convergence.

2605.29642 2026-05-29 stat.ML cs.IT cs.LG math.IT

Matching Rates and Optimal Allocation for Federated Probe-Logit Distillation under Heterogeneous Bandwidth Budgets

异构带宽预算下的联邦探针-逻辑蒸馏匹配率与最优分配

Prasanjit Dubey, Xiaoming Huo

AI总结 针对联邦探针-逻辑蒸馏(FPLD)中带宽项速率紧性及异构节点带宽分配问题,提出匹配下界、多轮改进方案及闭合形式最优分配规则。

详情
AI中文摘要

在联邦语言建模中,$K$个节点各自持有$n$个样本,但无法合并数据或交换全精度梯度或权重。我们研究当每个节点在公共探针集上每次查询最多上传$B$比特时,对$V$个令牌上的条件分布进行估计的极小极大速率。在联邦探针-逻辑蒸馏(FPLD)中,每个节点在探针集上传输一个标量量化的逻辑向量,聚合器蒸馏出一个全局参数化学生模型。先前的工作(Dubey and Huo, 2026)建立了高概率KL速率$O(d/(Kn) + ρ\sqrt{V \log V / m} + K^{-1} \cdot 2^{-2B/V})$加上优化松弛项,其中带宽项采用迹锐化形式。该带宽项速率是否紧致,以及上界如何推广到异构每节点带宽,仍是开放问题。 我们填补了这两个空白。首先,抖动FPLD构造在非退化条件下具有匹配的单轮下界$Ω(K^{-1} \cdot 2^{-2B/V})$,将带宽轴速率确定为$Θ(K^{-1} \cdot 2^{-2B/V})$。使用嵌套/缩放残差量化器的$T$轮顺序细化达到$O(K^{-1} \cdot 2^{-2TB/V})$;对于任意$T > 1$,原始FPLD的与$T$无关的带宽项是次优的。其次,我们建立了每节点预算$B_i$的异构带宽上界,并配以闭合形式的最优分配$B_i^* = B_{\mathrm{tot}}/K + (V/2) \log_2(w_i / ar{w}_g)$,这是一种对数倾斜的注水规则,是失真率优化中反向注水的每节点类比。一种即插即用自适应变体通过短预热阶段估计权重,并达到$1 + O(\sqrt{\log(K/δ)/(m T_0)})$的相对次优性。合成n-gram模拟证实经验KL被上界和下界所界定,并且在异构裁剪下最优分配严格优于均匀和逆权重基线。

英文摘要

In federated language modeling, $K$ nodes each hold $n$ samples but cannot pool data or exchange full-precision gradients or weights. We study the minimax rate at which a conditional distribution over $V$ tokens can be estimated when each node may upload at most $B$ bits per query in a public probe set. In federated probe-logit distillation (FPLD), each node transmits a scalar-quantized logit vector on the probe set, and an aggregator distills a global parametric student. Prior work (Dubey and Huo, 2026) establishes a high-probability KL rate $O(d/(Kn) + ρ\sqrt{V \log V / m} + K^{-1} \cdot 2^{-2B/V})$ plus optimization slack, with the bandwidth term in its trace-sharpened form. Whether this bandwidth-term rate is tight, and how the upper bound generalizes to heterogeneous per-node bandwidths, are left open. We close both gaps. First, the dithered FPLD construction has a matching single-round lower bound $Ω(K^{-1} \cdot 2^{-2B/V})$ under non-degeneracy, pinning the bandwidth-axis rate at $Θ(K^{-1} \cdot 2^{-2B/V})$. $T$-round sequential refinement with nested/scaled residual quantizers achieves $O(K^{-1} \cdot 2^{-2TB/V})$; vanilla FPLD's $T$-independent bandwidth term is suboptimal for every $T > 1$. Second, we establish a heterogeneous-bandwidth upper bound for per-node budgets $B_i$, paired with a closed-form optimal allocation $B_i^* = B_{\mathrm{tot}}/K + (V/2) \log_2(w_i / \bar{w}_g)$, a log-tilted water-filling rule that is the per-node analogue of reverse water-filling for distortion-rate optimization. A plug-in adaptive variant estimates the weights from a short warm-up phase and attains $1 + O(\sqrt{\log(K/δ)/(m T_0)})$ relative suboptimality. Synthetic n-gram simulations confirm that empirical KL is bracketed by the upper and lower bounds and that the optimal allocation strictly dominates uniform and inverse-weighted baselines under heterogeneous clipping.