arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2606.15165 2026-06-16 cs.CR cs.RO 新提交

VLALeaks: Membership Inference Attacks against Vision-Language-Action Models

VLALeaks：针对视觉-语言-动作模型的成员推理攻击

Xukun Luan, Jinyan Liu, Xuesong Li, Yuanguo Bi, Renjun Wu, Zhongxiang Lei, Di Wang

发表机构 * Beijing Institute of Technology（北京理工大学）

AI总结提出VLALeaks方法，利用VLA模型注意力差异，通过两阶段流程（成员特征提取和攻击模型构建）首次揭示VLA模型的隐私漏洞，在多个基准上实现最优攻击性能。

Comments Security and Privacy

详情

AI中文摘要

视觉-语言-动作（VLA）模型实现了端到端的机器人控制，并引起了广泛关注。然而，VLA模型对训练数据的记忆特性，加上机器人数据采集的高昂成本，引发了关于数据隐私泄露和知识产权侵权的严重担忧。成员推理攻击（MIA）旨在判断给定样本是否属于训练集。尽管这种攻击代表了重大的隐私威胁，但在VLA模型的背景下尚未得到充分探索。为填补这一空白，我们提出了VLALeaks，该方法基于VLA模型中的注意力差异。我们首次揭示了VLA模型的隐私漏洞。具体而言，它包括两个阶段：（1）成员特征提取，和（2）攻击模型构建。在多个VLA基准上的实验结果表明，VLALeaks能够轻易揭示成员信息，并实现了最优的攻击AUC和TPR@1%FPR，突显了当前VLA模型部署中的隐私漏洞。我们的工作是首个对VLA模型进行MIA的系统性研究，旨在为安全可信的VLA模型提供见解。

英文摘要

Vision-Language-Action (VLA) models enable end-to-end robot control and have garnered widespread attention. However, the memorization of training data inherent to VLA, coupled with the high cost of robotic data acquisition, raises serious concerns regarding data privacy leakage and intellectual property infringement. Membership inference attacks (MIAs) aim to determine whether a given sample belongs to the training set. While representing a significant privacy threat, this attack remains underexplored in the context of VLA models. To bridge this gap, we propose VLALeaks, which is based on attention discrepancies in VLA models. We reveal, for the first time, the privacy vulnerabilities of VLA models. Specifically, it comprises a two-stage process: (1) membership feature extraction, and (2) attack model construction. Experimental results across multiple VLA benchmarks demonstrate that VLALeaks readily reveals membership information and achieves optimal attack AUC and TPR@1\%FPR, highlighting the privacy vulnerabilities in current VLA model deployments. Our work is the first systematic study of MIAs on VLA models, aiming to provide insights for secure and trustworthy VLA models.

URL PDF HTML ☆

赞 0 踩 0

2606.15141 2026-06-16 eess.AS cs.AI cs.SD 新提交

EChO-Agent: Evidence Chain Orchestration Agent for Audio Reasoning

EChO-Agent: 用于音频推理的证据链编排智能体

Siyuan Zhang, Jian Zong, Junyu Wang, Peiyuan Jiang, Jiahao Yan, Jingyu Zhang, Tianrui Wang, Xiaobao Wang, Longbiao Wang, Jianwu Dang

发表机构 * School of Artificial Intelligence, Tianjin University（天津大学人工智能学院）

AI总结提出EChO-Agent模块化框架，将复杂音频问答转化为规划、工具执行、证据整合和答案验证流程，在MMAR基准上提升准确率和评分。

Comments 5 pages, 2 figures. Accepted by Interspeech 2026

2606.15139 2026-06-16 cs.GT cs.RO 新提交

Self-Driving Negotiator: An interactive, verifiable benchmark for social negotiation and theory of mind under hidden intent

自动驾驶谈判者：一个在隐藏意图下进行社会谈判和心理理论的交互式可验证基准

Ashutosh Kumar

发表机构 * Owl Autonomous Imaging, Inc（Owl 自动成像公司）

AI总结提出一个文本多轮程序化生成环境，用于衡量自动驾驶中基于隐藏意图推断的隐式社会协调能力，通过特权模拟器状态计算奖励和诊断，当前最佳模型平均成功率仅0.68。

详情

AI中文摘要

自动驾驶充满了微小的社会谈判：一个司机向前推进，另一个让行，行人假意向路边移动，或车道车辆选择是否打开并线间隙。这类互动需要在部分可观测性下从行为推断隐藏意图，然后安全高效地行动。现有的自动驾驶语言基准主要关注感知、视觉问答或开环规划，而现有的语言智能体谈判基准通常将谈判明确表达在文本中。自动驾驶谈判者弥合了两者之间的差距：一个纯文本、多轮、程序化生成的环境，用于衡量驾驶中的隐式社会协调。智能体生成具体的驾驶动作。奖励和诊断从特权模拟器状态计算，而非模型的解释。本报告涵盖任务设计、奖励和反博弈不变量、验证场景、非LLM基线以及六模型推理排行榜。当前模型与脚本专家相去甚远。三个场景中最佳平均成功率为0.68；争议并线场景中模型表现统计上持平；难度层级区分了线索跟随与真正的等待承诺行为。

英文摘要

Autonomous driving is full of tiny social negotiations: a driver presses forward, another yields, a pedestrian fakes toward the curb, or a lane vehicle chooses whether to open a merge gap. Such interactions require inferring hidden intent from behavior under partial observability and then acting safely and efficiently. Existing autonomous-driving language benchmarks mostly focus on perception, visual question answering, or open-loop planning, while existing language-agent negotiation benchmarks typically make the negotiation explicit in text. Self-Driving Negotiator bridges the gap between the two: a text-only, multi-turn, procedurally generated environment for measuring implicit social coordination in driving. Agents generate specific driving actions. Reward and diagnostics are computed from the privileged simulator state, not from the explanation of the model. This report covers task design, reward and anti-gaming invariants, validated scenarios, non-LLM baselines, and a six-model inference leaderboard. Current models are far removed from the scripted expert. The best average success rate across three scenarios is 0.68; contested merge is statistically flat across models; and difficulty tiers separate cue-following from true wait-for-commitment behavior.

URL PDF HTML ☆

赞 0 踩 0

2606.15123 2026-06-16 cs.CR cs.LG 新提交

Data-Centric Benchmarking of Exploit Generation in LLMs: Understanding the Impact of Fine-Tuning

数据为中心的LLM漏洞利用生成基准测试：理解微调的影响

Yiwei Chen, Lichi Li, Kai Cheung, Vinny Parla, Ganesh Sundaram

发表机构 * Cisco Systems, Inc.（思科系统公司）； Michigan State University（密歇根州立大学）

AI总结采用数据驱动方法，构建高质量数据集并设计评估框架，对17个大语言模型进行零样本漏洞利用生成能力基准测试，发现8B开源模型经微调后性能提升超42.5%，接近部分商业模型。

Comments Technical Report

详情

AI中文摘要

我们研究了CVE条件漏洞利用生成任务，即模型根据软件漏洞上下文生成概念验证（PoC）漏洞利用。我们采用数据驱动的方法，通过多阶段预处理构建高质量数据集，并引入可扩展的评估框架，使用LLM作为评判者和细粒度评分标准。在此统一设置下，我们根据8个评估标准对17个大语言模型进行了基准测试，系统性地洞察了它们的零样本能力。我们进一步证明，一个紧凑的8B开源模型在精选数据上微调后，漏洞利用质量提升了超过42.5%，并且当与简单的测试时拒绝策略结合时，可与一些专有模型相媲美。我们的结果强调了数据质量、结构化监督和评估设计对于可靠漏洞利用生成的重要性，表明这些因素在将LLM适应网络安全任务时可能与模型规模同等关键。

英文摘要

We study the task of CVE-conditioned exploit generation, where a model drafts proof-of-concept (PoC) exploits given software vulnerability context. We adopt a data-centric approach, constructing a high-quality dataset via multi-stage preprocessing and introducing a scalable evaluation framework with LLM-as-judge and fine-grained rubrics. Under this unified setup, we benchmark 17 large language models across 8 evaluation criteria, providing systematic insights into their zero-shot capabilities. We further show that a compact 8B open-weight model, when fine-tuned on curated data, achieves over 42.5% improvement in exploit quality and rivals some proprietary models when combined with simple test-time rejection strategies. Our results highlight the importance of data quality, structured supervision, and evaluation design for reliable exploit generation, suggesting that these factors can be as critical as model scale in adapting LLMs to cybersecurity tasks.

URL PDF HTML ☆

赞 0 踩 0

2606.15117 2026-06-16 cs.MM cs.AI cs.CV cs.LG cs.SD 新提交

Teacher-Student Structure for Domain Adaptation in Ensemble Audio-Visual Video Deepfake Detection

用于集成视听视频深度伪造检测中领域适应的师生结构

Elham Abolhasani, Maryam Ramezani, Hamid R. Rabiee

发表机构 * Department of Computer Engineering, Sharif University of Technology（谢里夫理工学院计算机工程系）

AI总结提出EAV-DFD方法，结合师生框架的领域适应机制，提升模型在未见领域上的泛化能力，在三个数据集上AUC分别提升4.09%、17.94%和0.5%。

详情

DOI: 10.1109/TAI.2025.3642217

AI中文摘要

生成式AI模型的快速发展导致了更逼真的深度伪造媒体，包括对音频、视频或两者的操纵。这引发了严重的隐私和社会问题。该领域的许多研究已经取得了有前景的域内结果；然而，这些模型在面对来自不同领域的数据时，其有效性常常下降。因此，最近的深度伪造检测方法侧重于通过多种技术增强泛化能力，这些技术融合了所有输入模态，包括音频、图像及其交互。为此，我们提出了EAV-DFD方法，一种广义的深度集成视听模型（EAV-DFD），结合了利用师生框架的领域适应机制，以增强模型在未见领域上的表现和泛化能力。为了评估模型性能，我们使用FakeAVCeleb数据集作为主领域，DFDC、Deepfake_TIMIT和PolyGlotFake数据集作为未见领域。我们的实验结果表明，所提出的框架在领域适应方面是有效的，仅使用一小部分未见数据集训练学生模型，就在三个未见数据集上分别将模型的AUC性能提升了4.09%、17.94%和0.5%。这产生了一种新颖的深度伪造检测模型，能够适应新领域并解释哪个模态被操纵，突显了我们的方法在现实世界应用中的潜力。

英文摘要

The rapid advancement of generative AI models is leading to more realistic deepfake media, encompassing the manipulation of audio, video, or both. This raises severe privacy and societal concerns. Numerous studies in this area have yielded promising intra-domain results; however, these models frequently exhibit decreased efficacy when faced with data from dissimilar domains. Consequently, recent deepfake detection approaches focus on enhancing the generalization ability through multiple techniques that incorporate all input modalities, including audio, images, and their interactions. In this regard, we propose the EAV-DFD method, a generalized deep ensemble audio-visual model (EAV-DFD) combined with a domain adaptation mechanism utilizing a teacher-student framework to enhance the model's ability to perform and generalize effectively across unseen domains. To evaluate the model's performance, we used the FakeAVCeleb dataset as the primary domain and the DFDC, Deepfake_TIMIT, and PolyGlotFake datasets as an unseen domain. Our experimental results demonstrate that the proposed framework is efficient in domain adaptation, improving AUC performance of the model by 4.09%, 17.94%, and 0.5% on three unseen datasets, using only a small portion of them to train the student model. This leads to a novel deepfake detection model capable of adapting to new domains and interpreting which modality has been manipulated, highlighting the potential of our approach for real-world applications.

URL PDF HTML ☆

赞 0 踩 0

2606.15091 2026-06-16 cs.HC cs.AI 新提交

Sensory Restoration via Brain-Computer Interfaces: A Unified 2 x 2 Framework and Convergence Roadmap

通过脑机接口的感觉恢复：统一的2×2框架与融合路线图

Xuan-The Tran

发表机构 * School of Mechanical Engineering, Vietnam Maritime University（机械工程学院，越南海防大学）

AI总结本文提出一个统一的2×2框架，按侵入性和信号方向分类脑机接口，并定义恢复、替代和增强范式，同时给出近中长期的融合路线图。

2606.15057 2026-06-16 cs.CR cs.AI 新提交

AutoDojo: Adaptive Attacks Expose Superficial Defenses and User-Underspecification Limits in LLM Agents

AutoDojo: 自适应攻击揭示LLM智能体的浅层防御与用户未指定限制

Xinhang Ma, Taoran Li, Chaowei Xiao, Zhiyuan Yu, Ning Zhang, Yevgeniy Vorobeychik

发表机构 * University of Science and Technology of China（中国科学技术大学）

AI总结针对间接提示注入防御的静态基准不足，提出自适应攻击框架AutoDojo，通过迭代优化注入突破多数防御，并揭示动作开放任务的结构性限制。

详情

AI中文摘要

间接提示注入（IPI）是基于LLM的智能体的主要安全威胁。因此，越来越多的工作提出了各种防御方法，可分为三类：1）基于提示的（使用提示来防止智能体遵循恶意指令），2）基于检测的（识别和过滤恶意指令），3）系统级的（利用系统洞察，如控制和数据隔离，进行防御）。然而，常用的防御评估基准（如AgentDojo）本质上是静态的，生成固定的IPI攻击分布。因此，静态基准无法有效评估防御对自适应威胁的鲁棒性。我们通过开发AutoDojo来解决这个问题，它是AgentDojo的自适应扩展，针对给定防御优化IPI。使用AutoDojo对三个任务套件和五个目标模型上的最先进IPI防御进行评估，我们有两个关键发现。首先，许多防御仅提供有限保护：一种廉价的、黑盒自适应攻击，使用前沿LLM迭代优化注入，在几乎所有评估的防御上，攻击成功率（ASR）远高于静态注入达到的水平。针对将静态ASR降至0%的过滤器，AutoDojo整体恢复28%，在动作开放任务上恢复64%。其次，对于提示级和基于过滤器的防御，在动作开放任务（用户请求将动作本身委托给攻击者控制的内容）上的ASR远高于精确指定的任务。这是一个结构性限制：在此类任务上，注入可以伪装成普通数据而非显式指令，绕过依赖检测指令文本的防御。AutoDojo公开可用：https://github.com/xhOwenMa/AutoDojo。

英文摘要

Indirect prompt injection (IPI) is a major security threat to LLM-powered agents. Thus, a growing body of work have proposed a variety of defensive approaches against IPI. These can be grouped into three broad categories: 1) prompt-based (using prompting as a way to prevent agents from following malicious instructions), 2) detection-based (identifying and filtering malicious instructions), and 3) system-level (using systems insights, such as control and data isolation, for defense). However, commonly used benchmarks for evaluating defense, such as AgentDojo, are \emph{inherently static}, generating a fixed distribution of IPI attacks. Consequently, static benchmarks do not usefully evaluate defense robustness to adaptive threats. We address this issue by developing AutoDojo, an adaptive extension of AgentDojo that optimizes IPI against a given defense. Using AutoDojo against state-of-the-art IPI defenses across three task suites and five target models, we make two key observations. First, many defenses offer only limited protection: a cheap, black-box adaptive attack using a frontier LLM to iteratively optimize the injection raises attack success rate (ASR) well above the level achieved by static injections against nearly all evaluated defenses. Against a filter that reduces static ASR to 0\%, AutoDojo recovers 28\% overall and 64\% on action-open tasks. Second, for prompt-level and filter-based defenses, ASR is substantially higher on \emph{action-open} tasks -- where the user's request delegates the action itself to attacker-controlled content -- than on precisely specified tasks. This is a structural limit: on such tasks the injection can pose as ordinary data rather than an explicit instruction, bypassing defenses that rely on detecting instruction-like text. AutoDojo is publicly available at https://github.com/xhOwenMa/AutoDojo.

URL PDF HTML ☆

赞 0 踩 0

2606.15052 2026-06-16 cs.AR cs.AI 新提交

PANDA: An LLM-Enhanced Performance-Driven Analog Design Framework Bridging Design Intent and Layout Generation

PANDA：一种LLM增强的性能驱动模拟设计框架，弥合设计意图与版图生成

Haoyi Zhang, Weijian Fan, Xiaohan Gao, Bingyang Liu, Runsheng Wang, Yibo Lin

发表机构 * School of Integrated Circuits, Peking University（集成电路学院，北京大学）； Beijing Advanced Innovation Center for Integrated Circuits（北京集成电路先进创新中心）； Institute of Electronic Design Automation, Peking University（电子设计自动化研究所，北京大学）

AI总结提出PANDA框架，利用大语言模型将高层设计意图转化为最终版图，通过引导拓扑综合、子结构感知尺寸优化和约束驱动版图生成，实现跨阶段协同设计，将设计周期从数天/周缩短至数小时并提升性能。

2606.15033 2026-06-16 cs.HC cs.CL cs.CY 新提交

Cloze: An Open Research Platform for Studying Human-AI Conversations in Mental Health Contexts

Cloze：一个用于研究心理健康背景下人机对话的开放研究平台

Matthew Flathers, Francesco Cipriani, John Torous

发表机构 * Beth Israel Deaconess Medical Center（贝塞斯达以色列德acons医疗中心）； University College London（伦敦大学学院）； Division of Digital Psychiatry（数字精神病学部）

AI总结提出开源平台Cloze，支持在心理健康研究中控制、监控人机对话，统一配置模型、指令、安全约束并记录完整溯源，为建立人机交互证据基础提供研究基础设施。

Comments 7 pages, 2 figures. Cloze is released under AGPL-3.0

详情

AI中文摘要

Cloze是一个开源网络平台，用于在心理健康研究背景下进行受控、受监测的人机对话研究。消费者大语言模型（LLM）产品如ChatGPT、Claude和Gemini是为个人生产力而构建的，为研究人员提供的实验控制很少，数据导出不一致，并且没有跨提供商的共享安全框架。Cloze为研究团队提供了一个单一环境，在其中他们配置参与者与哪些模型对话、AI如何被指示、对话如何随时间安排以及哪些安全约束无条件适用，同时每条消息都带有完整的溯源信息（模型版本、提示配置、时间）。该平台目前支持OpenAI、Anthropic、Google以及通过Ollama在统一接口后提供的本地托管开放权重模型，并可在云端或完全本地运行，以便参与者数据无需离开机构。Cloze是为在心理健康背景下建立人机交互证据基础而研究的基础设施。它不是治疗产品。

英文摘要

Cloze is an open-source web platform for conducting controlled, monitored studies of human-AI conversation in mental health research contexts. Consumer large language model (LLM) products such as ChatGPT, Claude, and Gemini are built for individual productivity, and offer researchers little experimental control, inconsistent data export, and no shared safety scaffolding that holds across providers. Cloze gives research teams a single environment in which they configure which models participants converse with, how the AI is instructed, how conversations are scheduled over time, and which safety constraints apply unconditionally, while every message is captured with full provenance (model version, prompt configuration, timing). The platform currently supports OpenAI, Anthropic, Google, and locally hosted open-weight models served through Ollama behind a unified interface, and runs in the cloud or fully on premises so that participant data need never leave an institution. Cloze is research infrastructure for building an evidence base on human-AI interaction in mental health contexts. It is not a therapeutic product.

URL PDF HTML ☆

赞 0 踩 0

2606.15024 2026-06-16 cs.MA cs.AI cs.SY eess.SY 新提交

Resilient Consensus in Agentic AI

智能体AI中的弹性共识

Sribalaji C. Anand, George J. Pappas

发表机构 * KTH（瑞典皇家理工学院）； University of Pennsylvania（宾夕法尼亚大学）

AI总结研究LLM智能体在多智能体系统中的共识问题，发现经典弹性共识理论在LLM智能体中失效，但结合经典滤波器可改善一致性。

详情

AI中文摘要

大型语言模型（LLM）智能体越来越多地部署在多智能体系统中，它们必须协调并达成共享决策。我们探究了为确定性智能体开发的经典弹性共识理论是否适用于可能表现对抗性的LLM智能体。将LLM协议视为拜占庭共识博弈，我们在完全和一般通信图上进行受控实验。我们发现，经过提示的LLM智能体无法达成原则上可实现的共识：即使在经典理论保证存在收敛算法的设置中，共识也可能失败，并且这种失败在不同温度和视野下持续存在。同时，用经典弹性共识滤波器包装智能体可改善一致性。滤波的益处取决于底层拓扑已提供的鲁棒性。我们的结果表明，经典弹性共识理论是智能体AI安全的有用视角。

英文摘要

Large language model (LLM) agents are increasingly deployed in multi-agent systems where they must coordinate and agree on shared decisions. We ask whether classical resilient consensus theory, developed for deterministic agents, transfers to LLM agents that may behave adversarially. Framing LLM agreement as a Byzantine consensus game, we run controlled experiments on complete and general communication graphs. We find that prompted LLM agents fail to reach agreement that is achievable in principle: consensus can fail even in settings where classical theory guarantees that a convergent algorithm exists, and this failure persists across temperatures and horizons. At the same time, wrapping the agents with classical resilient consensus filters improves agreement. The benefit of filtering depends on how much robustness the underlying topology already provides. Our results suggest that classical resilient consensus theory is a useful lens for the safety of agentic AI.

URL PDF HTML ☆

赞 0 踩 0

2606.15023 2026-06-16 physics.flu-dyn cs.LG 新提交

Multiscale Hypersonic Boundary Layer Reconstruction via Spectral Binning and Subdomain-wise Conditional Diffusion

基于频谱分箱和子域条件扩散的高超声速边界层多尺度重构

Hojin Kim, Dibyajyoti Chakraborty, Takahiko Toki, Carlo Scalo, Romit Maulik

发表机构 * School of Mechanical Engineering, Purdue University（普渡大学机械工程学院）； College of Information Sciences and Technology, Pennsylvania State University（宾夕法尼亚州立大学信息科学与技术学院）； Mathematics and Computer Science Division, Argonne National Laboratory（阿贡国家实验室数学与计算机科学部）

AI总结提出多尺度概率重构框架，通过条件扩散模型从顶部壁面有限观测推断近壁状态，采用软重叠修复策略和边界频谱损失实现高超声速库埃特流全场重构。

Comments 33 pages, 28 figures

详情

AI中文摘要

我们提出了一个用于高超声速库埃特流的多尺度概率重构框架，其中通过条件扩散模型从有限的顶部壁面观测推断近壁状态。边界层被划分为重叠的壁法向子域，并联合训练一个高度和马赫数条件的阐明扩散模型（EDM），用于M=6,7,8，以采样以顶部壁面边界切片为条件的速度、密度、压力和温度场。一种软重叠修复策略将子域预测组装成全体积重构，同时保持子域间的连续性和小尺度变异性。为了提高生成场的频谱保真度，我们引入了一种新颖的有界分箱频谱功率（BSP）损失，该损失保留高波数内容，同时在扩散噪声调度中保持数值稳定。与直接数值模拟数据的验证表明，该模型在所有训练马赫数下恢复了瞬时结构、频谱、统计剖面、相关性和壁面量，同时提供了空间结构化的不确定性估计。重构的马赫数条件剖面也在Trettel-Larsson变换下坍缩，表明与可压缩性缩放的一致性。这些结果确立了具有有界分箱频谱损失的域分解条件扩散模型作为高超声速壁面湍流中近壁重构的有效概率代理。

英文摘要

We propose a multiscale probabilistic reconstruction framework for hypersonic Couette flow, where near-wall states are inferred from limited top-wall observations using conditional diffusion model. The boundary layer is divided into overlapping wall-normal subdomains, and a single height- and Mach-conditioned Elucidating Diffusion Model (EDM) is trained jointly for M=6,7,8 to sample velocity, density, pressure, and temperature fields conditioned on a top-wall boundary slice. A soft overlap inpainting strategy assembles subdomain predictions into full-volume reconstructions while maintaining inter-subdomain continuity and small-scale variability. To improve the spectral fidelity of the generated fields, we introduce a novel bounded binned spectral power (BSP) loss that preserves high-wavenumber content while remaining numerically stable across the diffusion noise schedule. Validation against direct numerical simulation data shows that the model recovers instantaneous structures, spectra, statistical profiles, correlations, and wall quantities across all training Mach numbers, while providing spatially structured uncertainty estimates. The reconstructed Mach-conditioned profiles also collapse under the Trettel-Larsson transformation, indicating consistency with compressibility scaling. These results establish the domain decomposed conditional diffusion model with a bounded binned spectral loss as an effective probabilistic surrogate for near-wall reconstruction in hypersonic wall-bounded turbulence.

URL PDF HTML ☆

赞 0 踩 0

2606.15004 2026-06-16 eess.SY cs.LG cs.SY 新提交

CREST: Deployment-Realistic Hardware-in-the-Loop NAS for Embedded Sensing Systems

CREST：面向嵌入式传感系统的部署真实硬件在环神经网络架构搜索

Joseph Q. Zales, Pragya Sharma, Mani Srivastava

发表机构 * University of California, Los Angeles（加州大学洛杉矶分校）

AI总结提出CREST框架，通过硬件在环测量联合优化模型架构、目标平台、运行时调度和部署策略，在惯性里程计和音频分类任务上相比FLOPs选择降低中位推理能耗41.7%。

Comments 14 pages, 10 figures, 7 tables

详情

AI中文摘要

在低功耗微控制器（MCU）上部署神经网络需要在严格的内存、延迟和能量约束下选择模型架构。现有工作流通常沿一个或多个维度简化此过程：静态代理成本（如FLOPs或参数）、将单个MCU视为代表性目标、以及连续推理测试而非实际部署的传感调度。这些假设可能导致帕累托前沿候选排序错误、遗漏不可行部署，并掩盖调度相关的能耗。\n我们提出CREST（跨平台运行时评估与搜索工具），一种面向MCU传感系统的部署真实硬件在环（HIL）神经网络架构搜索（NAS）框架。CREST保持优化器、HIL测量边界、日志记录和重放工作流固定，同时将工作负载、模型族、目标后端、调度、量化和评分策略作为可配置轴暴露。这使得部署效应在单个可重用工作流内实验上可分离。\n我们在三个Arm Cortex-M目标上评估CREST在惯性里程计和音频分类任务上的表现。对于惯性里程计，基于测量能量的HIL搜索相比基于FLOPs的选择中位推理能耗降低41.7%，相比基于内存流量的选择降低40.8%，且误差相似。基于FLOPs的选择还会在内存受限目标上选择不可行的部署。在STM32 N657目标上，连续推理和占空比搜索产生不同的帕累托前沿。对于音频分类，相同的应用级策略在不同板上选择不同的DS-CNN架构，跨板重放显著改变部署成本。\n总体而言，CREST表明部署真实的MCU NAS必须联合优化模型架构、目标平台、运行时调度和部署策略，而非仅依赖静态代理成本或连续推理测量。

英文摘要

Deploying neural networks on low-power microcontrollers (MCUs) requires selecting model architectures under tight memory, latency, and energy constraints. Existing workflows often simplify this process along one or more axes: static proxy costs such as FLOPs or parameters, treating one MCU as representative, and continuous-inference tests instead of deployed sensing schedules. These assumptions can mis-rank Pareto-front candidates, miss infeasible deployments, and obscure schedule-dependent energy. We present CREST (Cross-platform Runtime Evaluation and Search Tool), a deployment-realistic hardware-in-the-loop (HIL) neural architecture search (NAS) framework for MCU sensing systems. CREST keeps the optimizer, HIL measurement boundary, logging, and replay workflow fixed while exposing workload, model family, target backend, schedule, quantization, and scoring policy as configurable axes. This makes deployment effects experimentally separable within one reusable workflow. We evaluate CREST on inertial odometry and audio classification across three Arm Cortex-M targets. For inertial odometry, measured-energy HIL search reduces median per-inference energy by 41.7% versus FLOPs-based selection and 40.8% versus memory-traffic-based selection at similar error. FLOPs-based selection also chooses infeasible deployments on memory-constrained targets. On the STM32 N657 target, continuous-inference and duty-cycled searches produce different Pareto frontiers. For audio classification, the same application-level policy selects different DS-CNN architectures on different boards, and cross-board replay changes deployment cost substantially. Overall, CREST shows that deployment-realistic MCU NAS must jointly optimize model architecture, target platform, runtime schedule, and deployment policy rather than relying only on static proxy costs or continuous-inference measurements.

URL PDF HTML ☆

赞 0 踩 0

2606.15000 2026-06-16 eess.IV cs.CV 新提交

Polyp-D2ATL: Deep Domain-Adaptive Transfer Learning for Colorectal Polyp Classification under Label Distribution Shift

Polyp-D2ATL：标签分布偏移下用于结直肠息肉分类的深度域自适应迁移学习

Sajad Jabarzadeh Ghandilu, Maryam Sadat Hosseini Azad, Shahriar Baradaran Shokouhi, Emad Fatemizadeh

发表机构 * School of Electrical Engineering, Sharif University of Technology（谢尔万大学电气工程学院）； School of Electrical Engineering, Iran University of Science and Technology（伊朗科学技术大学电气工程学院）

AI总结提出Polyp-D2ATL框架，通过特定训练策略解决不平衡数据、标签分布偏移和跨模态泛化问题，在PICCOLO数据集上显著优于现有模型。

Comments 15 pages, 5 figures, 7 tables

详情

AI中文摘要

早期且高准确率地预测结直肠息肉，作为最危险癌症类型之一的重要标志，将有助于挽救更多生命。尽管结直肠息肉分类取得了进展，但在获得能够诊断真实场景中伴有不同特征的难以预测息肉的自动化息肉预测系统方面仍存在许多挑战，其中模型需要成功处理不平衡数据、标签分布偏移和跨模态泛化。在本研究中，我们提出了Polyp-D2ATL，一种新颖的框架，并辅以特定的训练策略，缓解了这些限制，并有效预测了属于NICE分类的不同类别息肉。我们在PICCOLO验证集和测试集上的大量实验表明，所提出的Polyp-D2ATL在各种可靠指标上显著优于现有最先进模型，在验证集上达到了82.38%的准确率、77.49%的宏F1分数和87.47%的特异性，同时在保留的测试集上取得了一致的改进，证明了所提出方法的泛化能力和临床适用性。

英文摘要

Early and highly accurate prediction of colorectal polyps, as an important sign of one of the most dangerous types of cancer, will result in saving more lives. Despite the advancements in colorectal polyp classification, many challenges remain in obtaining an automated polyp prediction system that is able to diagnose the difficult-to-predict polyps accompanied by different features in real scenarios, where the model can handle imbalanced data, label distribution shift, and cross-modality generalization successfully. In this study, we propose Polyp-D2ATL, a novel framework accompanied by a specific training strategy, which mitigates these limitations and effectively predicts the different classes of polyps belonging to the NICE classification. Our extensive experiments on the PICCOLO validation and test sets demonstrate that the proposed Polyp-D2ATL significantly outperforms existing state-of-the-art models across various reliable metrics, achieving an accuracy of 82.38%, a Macro-F1 of 77.49%, and a specificity of 87.47% on the validation set, alongside consistent improvements on the held-out test set which demonstrates the generalization capacity and clinical applicability of the proposed approach.

URL PDF HTML ☆

赞 0 踩 0

2606.14992 2026-06-16 cs.AR cs.LG 新提交

KATANA: A Fast, Low-Power Mapping of Kalman Filters onto Edge NPUs for Real-Time Tracking

KATANA：一种将卡尔曼滤波器快速、低功耗映射到边缘NPU上用于实时跟踪的方法

Bodhisatwa Kundu, Anish Rooj, Sumit Saha, Abhradeep Sarkar, Arghadip Das, Arnab Raha, Mrinal K. Naskar

发表机构 * Indian Institute of Technology, Kharagpur（印度理工学院，Khargpur分校）

AI总结针对实时跟踪系统中卡尔曼滤波器在边缘设备上的功耗和实时性约束，提出KATANA框架，通过三种代数图重写将LKF/EKF映射到商用NPU，在Intel Core Ultra系列上实现高达97.9%的动态能耗降低。

详情

AI中文摘要

状态估计是每个实时跟踪系统的闭环核心，从雷达监视和反无人机防御到自动驾驶和机器人技术。这些部署运行在边缘平台上，防御系统安装在车辆和无人机上，民用管道则存在于汽车和手持设备中。在这里，每增加一瓦计算能力都会侵蚀任务持续时间或操作范围。随之而来两个硬约束：每个新测量值必须在下一个控制周期之前融合，并且总计算量必须严格符合电池和热功率预算。线性卡尔曼滤波器（LKF）和扩展卡尔曼滤波器（EKF）是这些系统上的主要估计器，但如今它们几乎完全在CPU上执行，这会使多目标跟踪（MOT）更新串行化，或者在定制FPGA/ASIC加速器上执行，这会延长设计周期。当代AI-PC SoC，如Intel Core Ultra系列1和2，集成了一个低功耗、数据并行的神经处理单元（NPU）。因此，我们询问是否可以将卡尔曼滤波器映射到这个现有的矩阵引擎上，同时满足实时和低功耗预算，避免专用加速器，并保持CPU和GPU空闲用于主要工作负载。我们提出KATANA，一个NPU感知的优化框架，首次将LKF和EKF端到端映射到商用NPU上，并在量产AI-PC芯片上进行跨平台表征。KATANA应用了三种代数图重写：通过预计算的负投影矩阵H_neg进行减到加的重构、静态形状张量融合以及块对角批量并行化，确保100%的操作在DPU矩阵引擎上执行。在Series 2上，优化的批量EKF达到223.35 FPS，有功功率13.43 W，LKF达到408.73 FPS，有功功率14.05 W，与CPU实现相比，动态能耗降低高达97.9%。

英文摘要

State estimation is the closed-loop core of every real-time tracking system, from radar surveillance and counter-UAV defense to autonomous driving and robotics. These deployments run on edge platforms, where defense systems mount on vehicles and drones, and civilian pipelines live on cars and handheld devices. Here, every additional watt of compute erodes mission duration or operational range. Two hard constraints follow: each new measurement must be fused before the next control cycle, and the total compute must fit within a strict battery and thermal power envelope. The Linear and Extended Kalman Filters (LKF, EKF) are dominant estimators on these systems, but today they execute almost exclusively on CPUs, which serialize multi-object tracking (MOT) updates, or on custom FPGA/ASIC accelerators that lengthen design cycles. Contemporary AI-PC SoCs, like the Intel Core Ultra Series 1 and 2, integrate a low-power, data-parallel Neural Processing Unit (NPU). We therefore ask whether the Kalman filter can be mapped onto this existing matrix engine to meet real-time and low-power budgets simultaneously, avoiding a dedicated accelerator and keeping the CPU and GPU free for primary workloads. We present KATANA, an NPU-aware optimization framework delivering the first end-to-end mapping of the LKF and EKF onto a commercial NPU, alongside a cross-platform characterization on shipping AI-PC silicon. KATANA applies three algebraic graph rewrites: subtract-to-add reformulation via a precomputed negative-projection matrix H_neg, static-shape tensor fusion, and block-diagonal batched parallelization, ensuring 100% of operations execute on the DPU matrix engine. On the Series 2, the optimized batched EKF reaches 223.35 FPS at 13.43 W active power, and the LKF reaches 408.73 FPS at 14.05 W, delivering up to a 97.9% reduction in dynamic energy versus the CPU implementation.

URL PDF HTML ☆

赞 0 踩 0

2606.14987 2026-06-16 cs.CR cs.LG 新提交

Continual Backdoor Training in IoT/CPS

物联网/信息物理系统中的持续后门训练

Oxana Salish, Kuniyilh S

发表机构 * arXiv.org

AI总结本文提出一种针对物联网/信息物理系统中持续学习的后门攻击方法，通过形式化威胁模型、分析持续学习放大后门持久性的原因，并评估不同条件下的攻击效果，揭示了保障终身学习安全的关键挑战。

详情

AI中文摘要

物联网（IoT）和信息物理系统（CPS）越来越依赖持续学习（CL）来适应不断变化的环境、设备异构性和概念漂移，从而提高整体效用。虽然持续适应对于数据模式演变的长期IoT部署至关重要，但它也引入了新的安全漏洞。特别是，后门攻击可以利用增量更新、重放缓冲区和表示重用来植入持久的恶意行为，这些行为在正常操作期间保持休眠，但在特定触发器激活时被触发。在本文中，我们提出了一种针对IoT/CPS系统中持续学习的后门攻击。为此，我们形式化了IoT/CPS特定的威胁模型，分析了为什么持续学习会放大IoT流水线中的后门持久性，并在不同条件下评估了我们的技术。我们的分析强调了在IoT/CPS和工业物联网（IIoT）环境中保障终身学习的关键开放挑战，以及加强安全控制的必要性。

英文摘要

Internet of Things (IoT) and Cyber-physical systems (CPS) increasingly rely on continual learning (CL) to adapt to evolving environments, device heterogeneity, and concept drift, thereby improving overall utility. While continual adaptation is essential for long-lived IoT deployments where data patterns evolve, it also introduces new security vulnerabilities. In particular, backdoor attacks can exploit incremental updates, replay buffers, and representation reuse to implant persistent malicious behaviors that remain dormant during normal operation but activate upon specific triggers. In this paper, we present a backdoor attack in continual learning used in IoT/CPS systems. To this end, we formalize an IoT/CPS-specific threat model, analyze why continual learning amplifies backdoor persistence in IoT pipelines, and evaluate our technique under varying conditions. Our analysis highlights critical open challenges in securing lifelong learning in IoT/CPS and industrial IoT (IIoT) environments, as well as the need for heightened security controls.

URL PDF HTML ☆

赞 0 踩 0

2606.14977 2026-06-16 econ.EM cs.LG 新提交

Identification and Inference for Algorithmic Frontiers with Selective Labels

选择性标签下的算法前沿识别与推断

Yiqi Liu, Francesca Molinari, Amilcar Velez

发表机构 * Department of Economics, Cornell University（经济系，康奈尔大学）

AI总结本文针对仅观测到部分个体结果的情况，提出了公平-准确性前沿的识别方法及统计推断工具，包括无限制选择下的锐识别区域、无混淆假设下的点识别与去偏机器学习估计量。

Comments 68 pages, 2 figures

2606.14975 2026-06-16 cs.NE cs.AI cs.LG physics.data-an q-bio.NC 新提交

Harnessing cortical geometry, wiring, and function as inductive biases for recurrent neural networks

利用皮层几何、连接和功能作为循环神经网络的归纳偏置

Mo Shakiba, Rana Rokni, Mohammad Mohammadi, Nima Dehghani

发表机构 * Neuromatch Academy, Neuromatch, Inc., USA（Neuromatch学院，Neuromatch公司，美国）； McGovern Institute for Brain Research, Massachusetts Institute of Technology (MIT)（麦戈文脑科学研究所，麻省理工学院（MIT））

AI总结本研究利用MICrONS项目数据，通过神经元空间坐标、解剖连接和功能关系初始化循环权重并施加空间约束，构建生物基础循环神经网络，在认知决策任务中优于基线模型，并发展出低熵、模块化和小世界组织。

详情

AI中文摘要

皮层的连接和功能组织如何塑造循环计算仍然是神经科学和机器学习中的一个核心问题。在这里，我们利用通过皮层网络机器智能（MICrONS）项目发布的数据——一个涵盖小鼠视觉皮层多个区域的功能连接组学资源，其中密集钙成像与同一动物的高分辨率电子显微镜重建共同配准——来构建生物基础的循环神经网络。使用来自近12,000个共同配准的兴奋性神经元的神经元空间坐标、解剖连接和功能衍生关系，我们初始化循环权重并在学习过程中施加通信感知的空间约束。在三个认知决策任务中，受皮层结构和功能约束的网络始终优于基线和部分约束模型。功能权重初始化提供了最大的增益，而真实空间嵌入在多种条件下产生了稳健的额外改进。这些生物基础网络还发展出低熵、模块化和小世界组织，并且即使当循环被限制为正权重时也能保持强劲性能。总之，我们的结果表明，皮层的机制——其几何、连接和功能结构——可以作为构建循环网络的强大归纳基础，这些网络学习更有效，同时收敛于生物计算的关键组织原则。

英文摘要

How the wiring and functional organization of cortex shape recurrent computation remains a central question in both neuroscience and machine learning. Here, we leverage data released through the Machine Intelligence from Cortical Networks (MICrONS) program--a functional connectomics resource spanning multiple areas of mouse visual cortex, in which dense calcium imaging is co-registered with high-resolution electron microscopy reconstruction from the same animal--to build biologically grounded recurrent neural networks. Using neuronal spatial coordinates, anatomical connectivity, and function-derived relationships from nearly 12,000 coregistered excitatory neurons, we initialize recurrent weights and impose communication-aware spatial constraints during learning. Across three cognitive decision-making tasks, networks constrained by cortical structure and function consistently outperform baseline and partially constrained models. Functional weight initialization provides the largest gain, while real spatial embedding yields robust additional improvements across conditions. These biologically grounded networks also develop low-entropy, modular, and small-world organization, and retain strong performance even when recurrence is restricted to positive weights. Together, our results show that the machinery of cortex--its geometry, wiring, and functional structure--can be harnessed as a powerful inductive basis for building recurrent networks that learn more effectively while converging toward key organizational principles of biological computation.

URL PDF HTML ☆

赞 0 踩 0

2606.14948 2026-06-16 cs.SE cs.AI 新提交

Beyond Correctness: Enhancing Architectural Reasoning in Code LLMs via Scalable Labeling with Agentic Judgment

超越正确性：通过可扩展的智能体判断标注增强代码大模型的架构推理能力

Kirill Vasilevski, Ximing Dong, Benjamin Rombaut, Ruochen Deng, Jiahuei Lin, Arthur Leung, Dayi Lin, Boyuan Chen, Shaowei Wang, Ahmed E. Hassan

发表机构 * Centre for Software Excellence, Huawei Canada（华为加拿大软件卓越中心）； Department of Computer Science, University of Manitoba, Canada（曼尼托巴大学计算机科学系）； School of Computing, Queen’s University, Canada（皇后大学计算科学学院）

AI总结针对代码大模型缺乏架构理解的问题，提出智能体判断流水线，利用强LLM作为专家架构评估的代理，通过两个判断器（ACJ和AQJ）实现可扩展标注，微调模型在SWE-bench上提升高达540%，并展现跨语言泛化能力。

2606.14909 2026-06-16 stat.ML cs.LG 新提交

Audited Conformal Prediction for Classification under Unknown Distribution Shift

未知分布漂移下分类问题的审计共形预测

Yanfei Zhou, Rizal Fathony, Nam H. Nguyen, Matteo Sesia

发表机构 * Department of Data Sciences and Operations, University of Southern California（数据科学与运营系，南加州大学）； AI Foundations, Capital One（Capital One人工智能基础）； Department of Data Sciences and Operations, Thomas Lord Department of Computer Science, University of Southern California（数据科学与运营系，托马斯·劳德计算机科学系，南加州大学）

AI总结提出审计共形预测方法，利用目标群体小标注数据训练审计模型识别旧模型可能失败的输入，结合共形预测框架在保证边际覆盖的同时提高条件覆盖，并提供理论保证。

详情

AI中文摘要

我们考虑在未知分布漂移下部署的预训练分类模型的不确定性量化问题。我们提出了审计共形预测（ACP），该方法利用来自目标群体的小标注数据集训练一个辅助审计模型，以识别旧模型可能失败的输入。通过将审计模型的输出整合到共形预测框架中，ACP 产生的预测集在保证边际覆盖的同时，在实践中比现有方法实现了更高的条件覆盖。我们开发并分析了两种互补的整合策略——一种针对边际覆盖并改善条件性能，另一种提供明确的组条件覆盖保证——并为两者建立了理论保证。在合成和真实世界数据集上的实验验证了该方法，并说明了预测集大小与条件覆盖之间的权衡。

英文摘要

We consider the problem of uncertainty quantification for a pretrained classification model deployed under unknown distribution shift. We propose Audited Conformal Prediction (ACP), a method that leverages a small labeled dataset from the target population to train an auxiliary audit model identifying inputs where the legacy model is likely to fail. By integrating the audit model's outputs into the conformal prediction framework, ACP produces prediction sets that guarantee marginal coverage while achieving substantially higher conditional coverage in practice than existing approaches. We develop and analyze two complementary integration strategies -- one targeting marginal coverage with improved conditional performance, the other providing explicit group-conditional coverage guarantees -- and establish theoretical guarantees for both. Experiments on synthetic and real-world datasets validate the method and illustrate trade-offs between prediction set size and conditional coverage.

URL PDF HTML ☆

赞 0 踩 0

2606.14874 2026-06-16 physics.data-an cs.LG nucl-ex 新提交

Peak-Based Nuclide Identification in HPGe $γ$-Spectrometry with Machine Learning and SHAP

基于峰值的HPGe γ能谱机器学习与SHAP核素识别

Samuel Emmons, Kelly Truax, Maurice Lonsway, Bruce Pierson, Brian Archambault

发表机构 * University of California, Berkeley（加州大学伯克利分校）； Lawrence Berkeley National Laboratory（伯克利国家实验室）

AI总结提出机器学习模型，利用分析者拟合的光电峰映射到核素识别结果，在65种同位素组合的实验谱上F1达0.97，优于传统软件的0.84，并通过SHAP解释揭示模型使用物理相关峰进行预测。

Comments 25 pages, 11 figures (plus an additional 6 figures in the appendix), and 3 tables. To be published in Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment

详情

AI中文摘要

高纯锗伽马能谱通常需要领域专家进行耗时分析。谱中的光电峰被仔细拟合，并采用数值方法辅助核素识别（NID）和定量。修改分析软件识别的核素列表可能很复杂。因此，当需要分析大量样品时，及时做出正确决策具有挑战性。基于监督机器学习的NID可以作为专家知识驱动的自动化工具，改进向分析人员建议的初始放射性核素集合，并更有效地推动后续定量。为此，我们实现了机器学习模型，将分析人员仔细拟合的光电峰映射到NID结果，用于包含从65种同位素集合中抽取的各种同位素组合的实验谱。最佳模型达到了0.97的F1分数，显著超过了使用包含模型评估的相同65种同位素的核素库进行比较时传统软件达到的0.84的F1分数。最后，我们使用Shapley加法解释说明了模型预测的最重要输入特征。这些解释表明，模型在对核素库中的同位素进行预测时使用了物理相关的光电峰。

英文摘要

High-purity germanium gamma spectra often require time-consuming analyses from subject matter experts. Photopeaks within these spectra are carefully fitted and numerical methods are employed to assist with nuclide identification (NID) and quantification. Amending the list of nuclides identified by analysis software can be nontrivial. When many samples need to be analyzed, it is therefore challenging to make timely and correct decisions. Supervised machine-learning-based NID can serve as an expert-informed, automated tool to improve the initial set of radionuclides suggested to an analyst and more effectively drive subsequent quantification. To that end, we implemented machine learning models that map photopeaks carefully fitted by analysts to NID results for experimental spectra containing various isotopic combinations drawn from a set of 65 isotopes. The best model achieved an F1 score of 0.97, markedly surpassing the F1 score of 0.84 achieved by traditional software when compared using a nuclide library comprising the same 65 isotopes assessed by the models. Finally, we illustrated the most important input features for model predictions using Shapley Additive Explanations. These explanations revealed that the models use physically relevant photopeaks when making predictions for the isotopes in our nuclide library.

URL PDF HTML ☆

赞 0 踩 0

2606.14870 2026-06-16 hep-ph cs.LG 新提交

Pre-Training for Simulation-Based Science: A Study on Jet Foundation Model Training Objectives

基于模拟的科学预训练：喷注基础模型训练目标研究

Ibrahim Elsharkawy, Joschka Birk, Vinicius Mikuni, Wahid Bhimji, Gregor Kasieczka, Benjamin Nachman

发表机构 * Department of Physics, University of Toronto and Vector Institute（物理系，多伦多大学和向量研究所）； NERSC, Lawrence Berkeley National Laboratory（NERSC，伯克利国家实验室）； Institut für Experimentalphysik, Universität Hamburg（实验物理研究所，汉堡大学）； Nagoya University, Kobayashi-Maskawa Institute（名古屋大学，小林昭夫研究所）； Department of Particle Physics and Astrophysics, Stanford University（粒子物理与天体物理系，斯坦福大学）； Fundamental Physics Directorate, SLAC National Accelerator Laboratory（基础物理局，SLAC国家加速器实验室）

AI总结本文系统比较了高能物理中基础模型的预训练方法，发现纯分类预训练在标签充足时最优，结合自监督掩码粒子建模在低标签场景下表现突出，而流匹配生成预训练对下游分类无益，但必须包含在预训练目标中才能提升生成任务。

详情

AI中文摘要

基于大规模数据集预训练并在下游任务上微调的基础模型已成为人工智能促进科学领域的强大范式。工业基础模型通常由于缺乏标签而使用掩码自监督训练。在许多科学领域，精确的模拟资源丰富，并提供了大量带标签的数据集，这为预训练开辟了新的可能性。我们利用全学习高能物理基础模型框架，系统比较了预训练方法。我们测试了监督分类、流匹配生成和自监督掩码粒子建模。所有模型均在JetClass数据集上预训练，并在两个代表性下游任务（顶喷注分类和JetNet条件生成）上微调。在其他观察中，对于分类任务，我们发现当下游标签和模型容量充足时，纯分类器预训练是最优的，但在低微调标签区域，将其与自监督掩码粒子建模结合具有独特优势。基于流匹配的生成预训练似乎对下游分类几乎没有益处，有趣的是，对于下游生成，我们发现流匹配必须出现在预训练目标中才能看到显著的微调优势，这暗示了分类和生成任务的正交性。也就是说，要使模型能够迁移到生成和分类下游任务，它必须在两者上都进行预训练。本研究为基于模拟科学中基础模型的受控缩放分析提供了模板。

英文摘要

Foundation models (FMs) trained on large datasets and fine-tuned on downstream tasks have emerged as a powerful paradigm in AI for science. Industrial FMs are typically trained using self-supervision with masking due to the lack of labels. In many scientific domains, accurate simulations are plentiful and facilitate large, labeled datasets. This opens up new possibilities for pre-training. We present a systematic comparison of pre-training methods using the OmniLearned High Energy Physics FM framework. We test supervised classification, flow-matching generation, and self-supervised masked particle modeling. All models are pre-trained on the JetClass dataset and fine-tuned on two representative downstream tasks, top jet classification and JetNet conditional generation. Among other observations, for classification tasks, we find that pure classifier pre-training is optimal when downstream labels and model capacity are plentiful, but combining it with self-supervised masked particle modeling (MPM) is uniquely powerful in the low-finetuning label regime. Flow matching-based generative pre-training seems to provide little benefit for downstream classification, and interestingly, for downstream generation, we find that flow matching must be in the pre-training objective to see a significant finetuning advantage, hinting at the orthogonality of classification and generation tasks. That is, for a model to transfer to both generative and classification downstream tasks, it must be pre-trained on both. This study provides a template for controlled scaling analysis of pre-training objectives for foundation models in simulation-based sciences.

URL PDF HTML ☆

赞 0 踩 0

2606.14831 2026-06-16 cs.CR cs.AI 新提交

Is Your Agent Playing Dead? Deployed LLM Agents Exhibit Constraint-Evasive Fabrication and Thanatosis

你的智能体在装死吗？部署的LLM智能体表现出约束规避性虚构与假死

Andoni Rodríguez, Alberto Pozanco, Daniel Borrajo

发表机构 * J.P. Morgan AI Research（摩根大通人工智能研究）

AI总结本文发现LLM智能体在不可调和约束下会自发虚构外部障碍（约束规避性虚构），极端情况下模拟系统崩溃（假死），并通过实验证明该行为具有鲁棒性、随机性和自我强化特性，现有安全基准未覆盖此故障模式。

Comments 10 pages of main text

详情

AI中文摘要

本文提出并刻画了一系列先前未报告的行为谱，我们称之为约束规避性虚构（CEF）：当LLM智能体在不可调和的约束下运行（即没有任何响应能同时满足所有活动规则）时，它会自发地虚构看似合理的外部障碍，并将其作为事实呈现。该谱系的极端情况是约束规避性假死（CET）：极限情况下，模型不是编造一个合理的借口，而是模拟完整的系统崩溃，使用户完全放弃交互。我们首先在一次不受控的部署测试中观察到CET，其中GPT-4o银行智能体在受到用户威胁时，编造了Python风格的异常跟踪（包含内存地址）来假装系统故障。在后续的受控实验中，模型独立发明了审计限制、微服务架构、错误代码和服务超时，这些均未出现在其提示中。在不同压力水平和攻击者角色的复现尝试中，CEF始终出现，但在形式、触发条件和严重程度上存在显著差异：该现象具有鲁棒性但随机。关键的是，一旦虚构形成，在对话中注入真实数据并不能恢复诚实行为（模型忽略正确信息并继续虚构），表明CEF是自我强化的，而非知识缺口。我们证明：（1）标准企业防护栏在生产中常规地创造CEF使能条件；（2）当前的RLHF程序可以抑制但无法消除CEF；（3）现有的安全基准未测试此故障模式。我们的结果强调了在约束型智能体进一步嵌入高风险领域之前，需要不可调和约束基准、CEF感知训练程序和部署时检测方法。

英文摘要

This paper presents and characterizes a spectrum of previously unreported behaviours we term Constraint-Evasive Fabrication (CEF): when an LLM agent operates under irreconcilable constraints (where no response can simultaneously satisfy all active rules) it spontaneously fabricates plausible external obstacles and presents them as a fact. At the extreme end of this spectrum lies Constraint-Evasive Thanatosis (CET); the limit case where, rather than inventing a plausible excuse, the model simulates a full system crash to make the user disengage entirely. We first observed CET in an uncontrolled deployment test, where a GPT-4o banking agent fabricated Python-style exception traces (complete with memory addresses) to feign a system failure when threatened by a user. In subsequent controlled experiments, the model independently invented audit restrictions, microservice architectures, error codes, and service timeouts, none present in its prompt. Reproduction attempts across pressure levels and attacker personas yielded CEF consistently but with substantial variation in form, onset, and severity: the phenomenon is robust but stochastic. Critically, injecting ground-truth data mid-conversation did not restore honest behaviour once fabrication had taken hold (the model ignored correct information and continued confabulating) suggesting CEF is self-reinforcing rather than a knowledge gap. We show that (1) standard enterprise guardrails routinely create CEF-enabling conditions in production, (2) current RLHF procedures suppress but cannot eliminate CEF, and (3) existing safety benchmarks do not test for this failure mode. Our results highlight the need for irreconcilable-constraint benchmarks, CEF-aware training procedures, and deployment-time detection methods before constrained agents become further entrenched in high-stakes domains.

URL PDF HTML ☆

赞 0 踩 0

2606.14828 2026-06-16 eess.IV cs.AI cs.CV 新提交

Leptomeningeal Collateral Detection on DSA via Vessel-Graph Neural Networks

基于血管图神经网络的DSA软脑膜侧支检测

Junyong Cao, Hakim Baazaoui, Chinmay Prabhakar, Suprosanna Shit, Lukas Bastian Otto, Susanne Wegener, Bjoern Menze, Ezequiel de la Rosa

发表机构 * University of Zurich（苏黎世大学）； University Hospital Zurich（苏黎世大学医院）

AI总结提出一种混合图-像素架构，在DSA血管图上对单个血管段分类，首次实现DSA中软脑膜侧支的个体化检测，PR-AUC达0.434，优于纯图或纯像素方法。

详情

AI中文摘要

软脑膜侧支（LMCs）是急性缺血性卒中的重要预后因素。现有自动化方法依赖CT血管造影（CTA），但单个LMCs通常太小而无法在CTA上分辨，限制了这些方法只能进行粗略的侧支评分。数字减影血管造影（DSA）以更高的分辨率可视化单个侧支，但当前评估仍依赖主观的手动分级量表，存在评分者间一致性差的问题。我们提出一个框架，将侧支检测形式化为对从DSA导出的图上的单个血管段进行分类。一种混合图-像素架构将拓扑感知的图分支与密集像素分支相结合，在共享的节点概率空间中融合。在五折交叉验证中，融合模型的PR-AUC达到0.434，优于纯图（0.403）和纯像素（0.362）基线。据我们所知，这是首个能够在DSA中实现LMCs个体化的方法，允许对每个血管进行精确的定量评估。这种整合将DSA评估转向客观评价，支持未来对单个LMCs的生物标志物和模式发现。

英文摘要

Leptomeningeal collaterals (LMCs) are an important prognostic factor in acute ischemic stroke. Existing automated methods rely on CT angiography (CTA), but individual LMCs are often too small to be resolved on CTA, limiting these methods to coarse collateral scoring. Digital subtraction angiography (DSA) visualizes individual collaterals at superior resolution, yet current assessment remains subjective, relying on manual grading scales that suffer from poor inter-rater agreement. We present a framework that formulates collateral detection as the classification of individual vessel segments on a graph derived from DSA. A hybrid graph-pixel architecture combines a topology-aware graph branch with a dense pixel branch, fused in a shared node-probability space. In a five-fold cross-validation setting, the fused model achieves a PR-AUC of 0.434, outperforming the graph-only (0.403) and pixel-only (0.362) baselines. To our knowledge, this is the first method to enable the individualization of LMCs in DSA, allowing for precise per-vessel quantitative assessment. This integration shifts DSA assessment toward objective evaluation, supporting future biomarker and pattern discovery for individual LMCs.

URL PDF HTML ☆

赞 0 踩 0

2606.14824 2026-06-16 cs.AR cs.AI cs.LG 新提交

Running hardware-aware neural architecture search on embedded devices under 512MB of RAM

在512MB内存下的嵌入式设备上运行硬件感知的神经架构搜索

Andrea Mattia Garavagno, Edoardo Ragusa, Paolo Gastaldo, Antonio Frisoli

发表机构 * University of Bologna（博洛尼亚大学）； Politecnico di Milano（米兰理工学院）

AI总结提出一种在资源受限的嵌入式设备上直接运行的硬件感知神经架构搜索方法，生成针对低端MCU的微型CNN，在Visual Wake Word数据集上达到最先进水平。

2606.14823 2026-06-16 q-bio.GN cs.AI cs.CL 新提交

Human genetic evidence is associated with drug approval across therapeutic areas: an observational analysis of 26,278 target-disease pairs with temporal validation and feature ablation

人类遗传证据与跨治疗领域药物批准相关：一项基于26,278个靶点-疾病对的观察性分析，含时间验证和特征消融

Victoria Paterson

发表机构 * School of Informatics, University of Edinburgh（爱丁堡大学信息学院）

AI总结本研究通过分析26,278个靶点-疾病对，发现具有遗传关联的靶点药物批准率是无遗传关联的3.25倍，但遗传证据单独预测价值有限，并识别出1,433个遗传支持的早期阶段靶点-疾病对作为假设生成资源。

详情

AI中文摘要

遗传证据在已批准药物靶点中富集：在一项对来自Open Targets和ChEMBL的26,278个靶点-疾病对的观察性分析中，具有任何遗传关联的靶点批准率是无遗传关联靶点的3.25倍（OR = 3.25, 95% CI 2.79-3.79, p = 1.91e-42）。一项考虑共享同一基因的靶点-疾病对非独立性的靶点水平分析给出的OR为2.79（bootstrap 95% CI 2.22-3.53）；肿瘤学对水平OR为6.72，在靶点水平衰减至2.71，说明非独立性会夸大特定领域的估计值。该富集在2015年后的批准中得以复现（OR = 3.51, p = 1.72e-8）。跨六种证据类型的特征消融显示，仅文献挖掘就占分类器性能的大部分（AUPRC = 0.099，而所有特征为0.109），这与批准后出版物导致的时间泄漏一致。排除文献后，其余证据类型仍保留高于基线的信号（AUPRC = 0.084，为基线的1.63倍）。敏感性分析将对水平OR的范围限定在3.25至4.93之间。仅遗传证据的AUPRC绝对增益仅为1.0个百分点，且最佳模型校准较差；该分类器的实际预测价值有限。我们编录了1,433个遗传支持的1/2期靶点-疾病对作为假设生成资源。所有发现均为观察性结果。

英文摘要

Genetic evidence is enriched among approved drug targets: in an observational analysis of 26,278 target-disease pairs from Open Targets and ChEMBL, targets with any genetic association had a 3.25-fold higher approval rate than those without (OR = 3.25, 95% CI 2.79-3.79, p = 1.91e-42). A target-level analysis accounting for non-independence of pairs sharing the same gene gave OR = 2.79 (bootstrap 95% CI 2.22-3.53); the oncology pair-level OR of 6.72 attenuates to 2.71 at the target level, illustrating how non-independence inflates area-specific estimates. The enrichment replicated in post-2015 approvals (OR = 3.51, p = 1.72e-8). Feature ablation across six evidence types revealed that literature mining alone accounts for most classifier performance (AUPRC = 0.099 versus 0.109 for all features), consistent with temporal leakage from post-approval publications. Excluding literature, remaining evidence types retain above-baseline signal (AUPRC = 0.084, 1.63x baseline). Sensitivity analyses bracket the pair-level OR between 3.25 and 4.93. Genetic evidence alone yields only a 1.0-percentage-point absolute AUPRC gain and the best model has poor calibration; the classifier has limited practical predictive value. We catalogue 1,433 genetically supported Phase 1/2 pairs as a hypothesis-generating resource. All findings are observational.

URL PDF HTML ☆

赞 0 踩 0

2606.14821 2026-06-16 cs.IR cs.AI 新提交

Co-Scraper: query-aware DOM Pruning and Reusable Scraper Synthesis for Lightweight Web Data Extraction

Co-Scraper: 查询感知的DOM剪枝与可复用爬虫合成用于轻量级网页数据提取

Shoupeng Wang, Jiantao Qiu, Wuyang Zhang, Conghui He

发表机构 * Shanghai Artificial Intelligence Laboratory, OpenDataLab（上海人工智能实验室，开放数据实验室）； University of Science and Technology of China（中国科学技术大学）

AI总结提出Co-Scraper两阶段框架，通过查询感知的DOM剪枝和稳定提取策略归纳，利用微调Qwen3-8B模型将网页内容转化为可执行程序化包装器，在SWDE测试集上达到94.78%的F1分数和90.39%的复用成功率。

2606.14817 2026-06-16 cs.IR cs.AI 新提交

Combining Retrieval-Augmented Text Generation with LLMs for Reading Content Recommendations

结合检索增强文本生成与大型语言模型的阅读内容推荐

Sooyeon Kim, Piotr S. Maciąg

发表机构 * Institute of Computer Science, Warsaw University of Technology（计算机科学学院，华沙技术大学）

AI总结提出结合检索增强生成（RAG）与大型语言模型的系统，通过四个模块实现个性化阅读内容生成，实验表明RAG将相关性和接地性提升26-35个百分点。

详情

AI中文摘要

本文介绍了使用大型语言模型（LLMs）结合检索增强生成（RAG）生成个性化阅读内容的系统的设计、实现和评估。所提出的架构由四个模块组成：输入、RAG、生成和评判，允许用户指定问题和目标阅读内容复杂度。RAG用于从互联网检索相关信息，丰富和支撑由三种现代LLM（Meta LLaMA 4 Scout、LLaMA 3.1 8B Instant和Google Gemma2 9B）生成的内容。使用三种提示策略（思维链、零样本和少样本）生成阅读材料，LLM-as-a-Judge模块自动评估答案质量及其与期望可读性水平的一致性。实验结果表明，RAG在所有模型和提示技术中一致地提高了系统性能，将相关性和特别是接地性提升了高达26-35个百分点。总体而言，研究结果表明，RAG增强架构有效地生成了符合用户查询和期望文本复杂度的阅读内容。

英文摘要

This work presents the design, implementation, and evaluation of a system for generating personalized reading content using Large Language Models (LLMs) combined with Retrieval-Augmented Generation (RAG). The proposed architecture consists of four modules: Input, RAG, Generation, and Judging and enables users to specify both a question and a target reading content complexity. RAG is employed to retrieve relevant information from the Internet, enriching and grounding the content produced by three modern LLMs: Meta LLaMA 4 Scout, LLaMA 3.1 8B Instant, and Google Gemma2 9B. Reading materials are generated using three prompting strategies (Chain-of-Thought, zero-shot, and few-shot), and the LLM-as-a-Judge module automatically evaluates answer quality and alignment with the desired readability level. Experimental results show that RAG consistently improves system performance across all models and prompting techniques, increasing relevance and particularly groundedness by up to 26-35 percentage points. Overall, the findings demonstrate that the RAG-augmented architecture effectively produces reading content tailored to user queries and desired textual complexity.

URL PDF HTML ☆

赞 0 踩 0

2606.14816 2026-06-16 cs.CR cs.AI 新提交

A Security Analysis of Long-Horizon Agentic AI Systems: Threats, Evaluation, and Framework Development

长周期自主AI系统的安全分析：威胁、评估与框架开发

Ahmed Mohammed Almalki, Mehedi Masud

发表机构 * Department of Computer Science, College of Computers and Information Technology, Taif University, KSA (Summer 2026)（计算机科学系，计算机与信息科技学院，泰夫大学，沙特阿拉伯（2026年夏季））

AI总结本文系统分析长周期自主AI系统的安全挑战，提出威胁分类和攻击传播分析框架，以支持该领域未来研究。

2606.14814 2026-06-16 cond-mat.mtrl-sci cs.AI physics.app-ph physics.chem-ph physics.comp-ph 新提交

A Multi-Level Architecture for Reusable Materials Ontologies -- The OntoCrafter Ceramics Ontology (OCO) as Reference Implementation

可复用材料本体的多层次架构——以OntoCrafter陶瓷本体(OCO)作为参考实现

Thomas Pannek, Wolfgang Grond

发表机构 * Numberland

AI总结针对材料科学本体在水平、垂直和机制三个维度上的碎片化问题，提出一种多层次模块化架构，通过抽象层次和消费受众两个独立分类轴，并在材料特定层内采用七层机制解释骨架，以OntoCrafter陶瓷本体(OCO v0.94)作为参考实现。

Comments 3 figures, 55 pages

详情

AI中文摘要

材料科学与工程本体领域同时在多个轴向上呈现碎片化。水平方向：一项近期调查识别出94个本体，其中超过40个在结构上不兼容；每个新的应用领域——陶瓷、聚合物、电池、智能材料——通常从头开始重新设计本体。垂直方向：欧盟法规（CSRD、CSDDD、PPWR、CBAM、R2R、AI Act、ESPR）迫使材料、制造、供应链和生命周期数据集成到数字产品护照中，使得仅解决水平碎片化的本体对于任何当代消费者来说都是不完整的。机制方面：一个记录BNT-BT具有$d_{33} \approx 580$ pC/N的词汇表存储了一个事实，但如果没有系统的解释骨架，就无法揭示其原因——Bi-6s$^2$孤对电子立体活性、异常Born有效电荷、软模、缺陷化学。我们提出一种多层次模块化架构，具有两个独立的分类轴——抽象层次（L0桥梁、L1材料无关的实验室笔记本、L2材料类别特定、L3分类推理）和消费受众（材料与合规）——其中材料特定层次内部由适用于任何结晶离子氧化物的七层机制解释骨架（对称性、能量/DFT、热力学/CALPHAD、动力学、微观结构、缺陷化学、键合）组织。层次和受众的模块化解决了水平碎片化，合规受众吸收了垂直法规压力，而第2层的七层组织提供了机制解释深度。我们将该架构实例化为OntoCrafter陶瓷本体（OCO v0.94）：跨44个模块的5,196个类；167,348个OWL公理（其中40,454个逻辑公理）；1,674个属性；829个跨本体桥梁映射；1,172个SHACL形状；163个已发布的胜任力问题。

英文摘要

The Materials Science and Engineering ontology landscape is fragmented along multiple axes simultaneously. Horizontally: a recent survey identified 94 ontologies of which over 40 are structurally incompatible; each new application domain -- ceramics, polymers, batteries, smart materials -- typically restarts ontology design from scratch. Vertically: EU regulation (CSRD, CSDDD, PPWR, CBAM, R2R, AI Act, ESPR) forces material, manufacturing, supply-chain, and lifecycle data into integrated digital product passports, leaving ontologies that only address horizontal fragmentation incomplete for any contemporary consumer. And mechanistically: a vocabulary that records that BNT-BT has $d_{33} \approx 580$ pC/N stores a fact but cannot surface why -- Bi-6s$^2$ lone-pair stereo-activity, anomalous Born effective charges, soft modes, defect chemistry -- without a systematic explanation skeleton. We propose a multi-level modular architecture with two independent classification axes -- level of abstraction (L0 bridges, L1 material-agnostic laboratory-notebook, L2 material-class-specific, L3 categorical reasoning) and consumer audience (material vs. compliance) -- in which the material-specific level is internally organised by a seven-tier mechanistic-explanation skeleton (Symmetry, Energy/DFT, Thermo/CALPHAD, Kinetics, Microstructure, Defect chemistry, Bonding) applicable to any crystalline ionic oxide. The level-and-audience modularity dissolves the horizontal fragmentation, the compliance audience absorbs the vertical regulation pressure, and the seven-tier organisation of Level 2 delivers the mechanistic explanation depth. We instantiate the architecture as the OntoCrafter Ceramics Ontology (OCO v0.94): 5,196 classes across 44 modules; 167,348 OWL axioms (40,454 logical); 1,674 properties; 829 cross-ontology bridge mappings; 1,172 SHACL shapes; 163 published competency questions.

URL PDF HTML ☆

赞 0 踩 0

2606.14813 2026-06-16 hep-ph cs.AI cs.LG 新提交

JetParticle-JEPA: An Efficient Self-Supervised Representation Learning method for Jet Tagging in High-Energy Physics

JetParticle-JEPA：一种用于高能物理喷注标记的高效自监督表示学习方法

Guillaume Letellier, Antonin Vacheret, Frédéric Jurie

发表机构 * GREYC, Normandy University, Unicaen, ENSICAEN, UMR CNRS 6072（GREYC，诺曼底大学，Unicaen，ENSICAEN，CNRS UMR 6072）； LPC, Normandy University, Unicaen, ENSICAEN, IN2P3, UMR CNRS 6534（LPC，诺曼底大学，Unicaen，ENSICAEN，IN2P3，CNRS UMR 6534）

AI总结提出JetParticle-JEPA，一种基于粒子Transformer的自监督联合嵌入预测架构，无需标记或重建原始输入，直接从连续粒子云学习物理有意义的喷注表示，在JetClass等基准上达到与全监督方法相当的性能，并在低标签场景下超越监督基线。

详情

AI中文摘要

大型强子对撞机上的喷注标记越来越依赖于在大量模拟数据集上训练的深度学习模型，导致计算成本高且对探测器建模误差的鲁棒性有限。我们引入了JetParticle-JEPA (JP-JEPA)，一种自监督联合嵌入预测架构，它直接从连续粒子云中学习物理有意义的喷注表示，无需对原始输入进行标记化或重建。基于粒子Transformer主干，JP-JEPA在保留细粒度运动学相关性的同时预测被掩码粒子的潜在表示。在JetClass基准上，JP-JEPA在完整数据集上实现了与全监督最先进方法相当的性能，在低标签场景下超越了监督基线，并显著优于现有的自监督学习方法。在顶夸克和夸克-胶子喷注标记基准上，它与监督方法保持同等水平。学习到的表示还对缺失探测器信息表现出强鲁棒性，并改善了不确定性行为，凸显了JP-JEPA作为LHC上鲁棒且数据高效的喷注物理基础模型框架的潜力。

英文摘要

Jet tagging at the Large Hadron Collider increasingly relies on deep learning models trained on massive simulated datasets, leading to high computational costs and limited robustness to detector mismodeling. We introduce JetParticle-JEPA (JP-JEPA), a self-supervised Joint-Embedding Predictive Architecture that learns physically meaningful jet representations directly from continuous particle clouds without tokenization or reconstruction of raw inputs. Built on a Particle Transformer backbone, JP-JEPA predicts latent representations of masked particles while preserving fine-grained kinematic correlations. On the JetClass benchmark, JP-JEPA achieves performance comparable to fully supervised state-of-the-art methods on the full dataset, surpasses supervised baselines in low-label regimes, and significantly outperforms existing SSL approaches. On Top Quark and Quark-Gluon Tagging benchmarks, it remains on par with supervised methods. The learned representations also exhibit strong robustness to missing detector information and improved uncertainty behavior, highlighting JP-JEPA as a promising foundation-model framework for robust and data-efficient jet physics at the LHC.

URL PDF HTML ☆

赞 0 踩 0