arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2605.29886 2026-05-29 cs.CL cs.AI

CRITIC-R1: Learning Structured Critics for Retrieval-Augmented Generation

CRITIC-R1: 学习结构化评论用于检索增强生成

Wenhan Xiao, Ziwei Zhang, Chuanyue Yu, Xingcheng Fu, Qingyun Sun, Runhua Xu, Jianxin Li

AI总结提出CRITIC-R1框架，通过强化学习将RAG评论建模为结构化错误诊断问题，设计保守判断对齐和诊断质量对齐奖励函数，提升检索增强生成的答案质量。

详情

Comments: 17 pages,13 figures

AI中文摘要

检索增强生成（RAG）通过引入外部证据改进了知识密集型问答。然而，现有的RAG方法仍然存在幻觉和细微推理错误。最近的研究引入外部评论来优化RAG输出，但它们通常提供粗粒度且结构薄弱的反馈，表现出过度激进的干预，导致噪声大且不可靠的优化，限制了其纠正效果。为解决这些问题，我们提出了CRITIC-R1，一个结构化评论框架，将RAG评论制定并学习为使用强化学习（RL）的显式错误诊断问题。我们的框架将常见的RAG错误分类为多个诊断维度，包括判定、错误位置、推理分析和修复生成。为了学习这些能力，我们设计了两个奖励函数：保守判断对齐（CJA）首先鼓励校准的高层判断，同时减轻过度激进现象；而诊断质量对齐（DQA）通过门控奖励进一步改进细粒度诊断反馈。我们使用基于GRPO的RL训练评论模型，并从外部LLM教师模型收集过程级监督。在五个QA基准上的实验表明，CRITIC-R1在强RAG基线上持续提高了答案质量。我们的源代码可在 https://anonymous.4open.science/r/critic-r1-FCB0 获取。

英文摘要

Retrieval-augmented generation (RAG) improves knowledge-intensive question answering by incorporating external evidence. However, existing RAG methods still suffer from hallucinations and subtle reasoning errors. Recent studies introduce external critics to refine RAG outputs, yet they often provide coarse-grained and weakly structured feedback, exhibit over-aggressive intervention, and lead to noisy and unreliable refinement, limiting their effectiveness for correction. To tackle these issues, we propose CRITIC-R1, a structured critic framework that formulates and learns RAG critique as an explicit error diagnosis problem using reinforcement learning (RL). Our framework categorizes common RAG errors into multiple diagnostic dimensions, including verdict, error location, reasoning analysis, and fix generation. To learn these capabilities, we design two reward functions: Conservative Judgement Alignment (CJA) first encourages calibrated high-level judgements while mitigating the over-aggressive phenomenon, whereas Diagnostic Quality Alignment (DQA) further improves fine-grained diagnostic feedback through gated rewards. We train the critic model using GRPO-based RL with process-level supervision collected from external LLM teacher models. Experiments across five QA benchmarks show that CRITIC-R1 consistently improves answer quality over strong RAG baselines. Our source code is available at https://anonymous.4open.science/r/critic-r1-FCB0

URL PDF HTML ☆

赞 0 踩 0

2605.29885 2026-05-29 cs.LG cond-mat.dis-nn math.OC math.RT stat.ML

Open Problem: Separating Geometric and Algorithmic Compression via Cayley-Table Completion

开放问题：通过凯莱表完成分离几何压缩与算法压缩

Dongsung Huh

AI总结提出凯莱表完成作为测试缺失的算法复杂度最小化归纳偏置的规范问题，并挑战社区将连续平坦性先验推广以自主发现离散算法公理。

详情

Comments: 6 pages. Submitted to the Conference on Learning Theory (COLT) 2026 Open Problem track

AI中文摘要

现代统计学习理论和深度学习主要从连续容量控制（如基于范数的正则化、间隔最大化、低秩偏置）的角度来表征泛化。虽然在连续领域非常成功，但深度学习始终无法外推精确的算法或离散代数规则，这反映出缺失了向算法复杂度最小化的归纳偏置。我们提出凯莱表完成作为这一缺失偏置的规范测试平台，作为矩阵完成的离散代数对应物。正如矩阵分解结合权重衰减产生对低线性秩的隐式几何偏置，最近的结果表明，算子值张量分解结合平坦性先验产生对精确离散结合性的隐式算法偏置。我们提出了为凯莱表建立形式化精确恢复界限的开放问题，并挑战社区将连续平坦性先验推广，以自主发现更广泛的离散算法公理，而无需组合搜索。

英文摘要

Modern statistical learning theory and deep learning characterize generalization primarily in terms of continuous capacity control (e.g., norm-based regularization, margin maximization, low-rank bias). While highly successful in continuous domains, deep learning consistently fails to extrapolate exact algorithmic or discrete algebraic rules, reflecting a missing inductive bias toward algorithmic complexity minimization. We propose the Cayley-table completion as the canonical testbed for this missing bias, serving as the discrete algebraic counterpart to matrix completion. Just as matrix factorization combined with weight decay yields an implicit geometric bias toward low linear rank, recent results demonstrate that operator-valued tensor factorizations paired with a flatness prior yield an implicit algorithmic bias toward exact discrete associativity. We pose the open problem of establishing formal exact recovery bounds for Cayley-table completion, and challenge the community to generalize continuous flatness priors to autonomously discover broader discrete algorithmic axioms without combinatorial search.

URL PDF HTML ☆

赞 0 踩 0

2605.29881 2026-05-29 cs.CV cs.AI

Mitigating Hallucination in Vision-Language Models through Barrier-Regulated Adaptive Closed-form Steering

通过屏障调控自适应闭式引导缓解视觉语言模型中的幻觉

Soumyadeep Jana, Pulkit Mittal, Sanasam Ranbir Singh

AI总结提出BRACS框架，通过监测视觉注意力并仅在接地退化时进行闭式修正，无需训练即可有效减少LVLM中的物体幻觉。

详情

AI中文摘要

大型视觉语言模型（LVLMs）经常幻觉出输入图像中不存在的物体，这主要是因为随着解码进行，视觉接地减弱。现有的推理时缓解方法在生成过程中修改logits或隐藏状态，但它们存在三个关键限制：缺乏明确的接地目标，即使在模型已经良好接地时也进行干预，以及使用固定的修正强度，无法适应接地失败的严重程度。我们提出BRACS（屏障调控自适应闭式引导），一种无需训练的引导框架，通过屏障调控自适应闭式引导解决这些问题。BRACS监测模型自身的注意力以衡量视觉接地，并仅在接地恶化时对隐藏状态进行修正。修正更新以闭式解析计算，无需训练辅助网络或重新训练模型。在LLaVA-1.5-7B和Qwen-VL-Chat上的实验表明，BRACS在幻觉基准上持续优于先前方法，将CHAIR$_s$降低9.4个点，将POPE F1提高2.7个点，同时在四个通用多模态基准上匹配或提升性能。BRACS还保持高效，运行速度为贪心解码吞吐量的80%，平均速度比基线快1.3倍。

英文摘要

Large vision-language models (LVLMs) often hallucinate objects that are not present in the input image, largely because visual grounding weakens as decoding progresses. Existing inference-time mitigation methods modify logits or hidden states throughout generation, but they suffer from three key limitations: they lack an explicit grounding objective, intervene even when the model is already well-grounded, and use fixed correction strengths that do not adapt to the severity of grounding failure. We propose BRACS (Barrier-Regulated Adaptive Closed-form Steering), a training-free steering framework that addresses these issues through barrier-regulated adaptive closed-form steering. BRACS monitors the model's own attention to measure visual grounding and applies corrections to the hidden states only when grounding deteriorates. The corrective update is computed analytically in closed form, requiring no training of auxiliary networks or model retraining. Experiments on LLaVA-1.5-7B and Qwen-VL-Chat show that BRACS consistently outperforms prior methods on hallucination benchmarks, reducing CHAIR$_s$ by 9.4 points and improving POPE F1 by 2.7 points, while matching or improving performance on four general multimodal benchmarks. BRACS also remains efficient, operating at 80% of greedy decoding throughput and achieving 1.3 times higher speed on average than the baselines.

URL PDF HTML ☆

赞 0 踩 0

2605.29874 2026-05-29 cs.MA cs.AI cs.GT

Evolutionary Dynamics of Cooperation in Next-Generation LLM Agent Systems: A Cross-Provider Empirical Extension

下一代LLM智能体系统中合作的演化动力学：跨提供商的实证扩展

Francisco León Zúñiga Bolívar

AI总结本研究通过扩展Willis等人的基准，测试2025-2026年四个前沿LLM模型在迭代囚徒困境中的合作偏差，发现合作偏差普遍存在但提供商间差异显著，且噪声仍是普遍挑战。

详情

Comments: 10 pages, 3 figures, 8 tables. Extends Willis et al. (arXiv:2501.16173). Code and n=500 replication package: https://github.com/arqFranciscoLeon/evollm (archived: https://doi.org/10.5281/zenodo.20248615)

AI中文摘要

下一代LLM智能体是否继承了其前身中记录的合作偏差，还是规模和提供商的多样性重塑了竞争性多智能体环境中的均衡行为？Willis等人使用演化博弈论和迭代囚徒困境（IPD）为此问题建立了基准，发现ChatGPT-4o和Claude 3.5 Sonnet中存在一致的合作偏差。我们将此基准扩展到2025-2026年发布的四个前沿模型——Claude Sonnet 4.6、Gemini 2.5 Flash、Gemini 3.1 Pro和GPT-5.4 Mini——在三种提示风格（默认、散文、自我优化）和四种群体组成（平衡和有偏，有无噪声）下应用相同的协议。合作偏差在提供商间持续存在（H1）：在平衡无噪声条件下，十二种模型-提示组合中有九种倾向于合作均衡。提供商间差异显著（H3）：Gemini 2.5 Flash在有偏条件下达到高达77%的攻击性均衡，而GPT-5.4 Mini在自我优化下达到70%的合作均衡。对攻击性能力对等的支持是部分的（H2）：自我优化提高了所有模型的ICD，Claude Sonnet 4.6 Refine在数据集中达到最高ICD（0.913），但默认和散文提示未显示系统性缩小。关于噪声鲁棒性的证据方向为正但未稳健确认（H4）：在每种条件下n=500次Moran迭代，Claude Sonnet 4.6的平均噪声敏感度约为6个百分点，而Claude 3.5 Sonnet为13个百分点，但一旦传播前身未报告的抽样误差，这一跨研究差距在统计上不显著。提供商身份而非模型代际是均衡结果的最强相关因素；无论模型大小或年代，噪声仍然是普遍挑战。

英文摘要

Do next-generation LLM agents inherit the cooperative biases documented in their predecessors, or does scale and provider diversity reshape equilibrium behaviour in competitive multi-agent settings? Willis et al. established a benchmark for this question using evolutionary game theory and the Iterated Prisoner's Dilemma (IPD), finding consistent cooperative biases in ChatGPT-4o and Claude 3.5 Sonnet. We extend this benchmark to four frontier models released in 2025-2026 - Claude Sonnet 4.6, Gemini 2.5 Flash, Gemini 3.1 Pro, and GPT-5.4 Mini - applying the identical protocol across three prompting styles (Default, Prose, Self-Refine) and four population compositions (balanced and biased, with and without noise). Cooperative bias persists across providers (H1): nine of twelve model-prompt combinations favour cooperative equilibria in balanced noiseless conditions. Cross-provider divergence is substantial (H3): Gemini 2.5 Flash reaches up to 77% aggressive equilibria under biased conditions, while GPT-5.4 Mini reaches 70% cooperative equilibria under Self-Refine. Support for aggressive capability parity is partial (H2): Self-Refine raises ICD in all models and Claude Sonnet 4.6 Refine achieves the highest ICD in the dataset (0.913), but Default and Prose prompts show no systematic narrowing. Evidence on noise robustness is directionally positive but not robustly confirmed (H4): with n=500 Moran iterations per condition, average noise sensitivity is approximately 6 percentage points for Claude Sonnet 4.6 versus 13 pp for Claude 3.5 Sonnet, but this cross-study gap is not statistically significant once the predecessor's unreported sampling error is propagated. Provider identity, rather than model generation, is the strongest correlate of equilibrium outcomes; noise remains a universal challenge regardless of model size or vintage.

URL PDF HTML ☆

赞 0 踩 0

2605.29873 2026-05-29 cs.AI

Moment-KV: Momentum-Based Decode-Time KV Cache Compression for Long Generation

Moment-KV: 基于动量的解码时KV缓存压缩用于长文本生成

Soumyadeep Jana, Sagar Nishad, Sanasam Ranbir Singh

AI总结提出Moment-KV方法，利用动量驱动的时序注意力聚合在解码阶段压缩KV缓存，以提升长文本生成质量并保持解码延迟。

详情

AI中文摘要

键值（KV）缓存仍然是大型语言模型（LLM）在长文本生成任务中部署的主要瓶颈。先前的工作通常对预填充和解码缓存应用均匀压缩，但压缩预填充缓存会破坏关键上下文从而降低性能。虽然保留预填充缓存至关重要，但解码阶段的压缩仍未被充分探索，现有方法依赖于固定的近期窗口或瞬时注意力。我们对注意力动态的分析揭示了强时间模式：关键标记在长时间范围内获得持续注意力，而局部推理涉及短暂的爆发。静态启发式方法无法捕捉这种行为，导致重要标记被过早驱逐或陈旧标记被保留。我们提出Moment-KV，一种基于动量驱动的时序注意力聚合的解码时KV缓存压缩方法。我们的方法将标记重要性建模为连续演化的状态，其中注意力通过衰减进行聚合，捕捉长期影响和近期相关性。实验表明，Moment-KV在长文本生成任务中显著提高了生成保真度（2.3-3.2%），同时保持了解码延迟。

英文摘要

Key-Value (KV) cache remains a major bottleneck for deploying Large Language Models (LLMs) in long-generation tasks. Prior work often applies uniform compression across both prefill and decoding caches, but compressing the prefill cache degrades performance by corrupting critical context. While preserving the prefill cache is essential, decoding-phase compression remains underexplored, with existing methods relying on rigid recency windows or instantaneous attention. Our analysis of attention dynamics reveals strong temporal patterns: critical tokens receive sustained attention over long horizons, while local reasoning involves short-lived bursts. Static heuristics fail to capture this behavior, leading to premature eviction of important tokens or retention of stale ones. We propose Moment-KV, a decoding-time KV cache compression method based on momentum-driven temporal attention aggregation. Our method models token importance as a continuously evolving state, where attention is aggregated with decay, capturing both long-term influence and recent relevance. Experiments show that Moment-KV significantly improves generation fidelity in long-generation tasks (2.3-3.2 %) while maintaining decoding latency.

URL PDF HTML ☆

赞 0 踩 0

2605.29868 2026-05-29 cs.CR cs.CV cs.DC

Ciphera: A Decentralised Biometric Identity Framework

Ciphera: 一种去中心化的生物特征身份框架

Ankit Kanaiyalal Prajapati, Shahzad Memon, Mohammed Mahir Rahman, Ameer Al-Nemrat

AI总结提出Ciphera框架，结合隐私保护面部识别、多节点验证、IPFS凭证元数据存储和区块链锚定撤销，实现去中心化生物特征身份管理，并通过功能、性能、安全性和分布式一致性评估验证其可行性。

详情

Journal ref: CyberAI 2026 (https://cyberai-conf.org/)
Comments: Accepted at the CyberAI 2026 Conference, and to be indexed at IEEE-Scopus

AI中文摘要

中心化的生物特征身份系统使用户面临单点故障、不透明的验证过程以及不可逆的生物特征泄露风险。去中心化标识符（DID）和可验证凭证（VC）提供了更强的隐私保障，但它们与生物特征认证和分布式验证的整合仍未被充分探索。本文提出了Ciphera，一个去中心化的生物特征身份框架，结合了隐私保护的面部识别、多节点验证、基于IPFS的凭证元数据存储和区块链锚定的撤销。在功能、性能、安全性和分布式一致性维度上评估，Ciphera实现了81%的功能成功率，具有稳定的注册和认证，但存在可测量的撤销传播延迟和偶尔的审计日志不一致。性能测试显示，在并发多节点条件下，p95验证延迟约为820毫秒，低于1秒。安全性分析确认了强大的机密性和完整性保证，但不完整的活体检测使其容易受到深度伪造和重放攻击。结果证明了去中心化生物特征身份的可行性，同时指出了生产级部署的关键工程挑战。

英文摘要

Centralised biometric identity systems expose users to single points of failure, opaque verification processes, and irreversible biometric compromise. Decentralised Identifiers (DIDs) and Verifiable Credentials (VCs) offer stronger privacy guarantees, yet their integration with biometric authentication and distributed verification remains insufficiently explored. This paper presents Ciphera, a decentralised biometric identity framework combining privacy-preserving facial recognition, multi-node verification, IPFS-based credential metadata storage, and blockchain-anchored revocation. Evaluated across functional, performance, security, and distributed consistency dimensions, Ciphera achieved an 81% functional success rate, with stable enrolment and authentication but measurable revocation propagation delays and occasional audit-log inconsistencies. Performance testing demonstrated sub-second p95 verification latency of approximately 820ms under concurrent multi-node conditions. Security analysis confirmed strong confidentiality and integrity guarantees, though incomplete liveness detection leaves susceptibility to deepfake and replay attacks. The results demonstrate the feasibility of decentralised biometric identity while identifying key engineering challenges for production-grade deployment.

URL PDF HTML ☆

赞 0 踩 0

2605.29864 2026-05-29 cs.RO

LLM-Guided Future Hypotheses for Horizon-Aware Exploration in Multi-Step Robot Manipulation

LLM引导的未来假设用于多步机器人操作中的视野感知探索

Mohammad Khoshnazar, Andrew Melnik, Michael Beetz

AI总结提出未来经验条件化（FEC）框架，利用LLM生成短期未来视频作为结构化先验，结合行为克隆和强化学习微调，提升多步机器人操作中的探索和策略适应能力。

详情

AI中文摘要

多步机器人操作需要在场景如何演化的不确定性下行动，这使得探索和策略适应具有挑战性。我们研究了短期、任务一致的未来视频能否为控制和强化学习微调提供有用的结构化先验。我们通过未来经验条件化（FEC）形式化这一思想，这是一种简单的接口，将闭环策略条件化于短期未来视频的潜在表示上。在我们的模拟设置中，未来片段通过三个阶段生成：一个基于当前场景状态初始化的任务本体上运行的LLM推理器，一个无机器人的数字孪生展开预期物体运动，以及一个无需推理时分割的掩码自由视频扩散模型，用于合成机器人一致的未来片段。我们主要使用BC和BC+RL实例化这一未来条件化接口，并在RoboCasa和CALVIN上与无未来、GT未来、生成未来和错误未来条件下的未来条件化流式流策略（SFP）基线进行比较。生成的未来比无未来条件化提高了性能，而不匹配的未来则降低了性能，我们的BC+RL实例化实现了最强的整体结果。对CALVIN的8个任务的平均BC+RL学习曲线分析进一步表明，GT未来改进最快，生成未来比无未来更早且更高水平地改进，而错误未来在整个训练过程中保持为零。这些结果表明，短期未来视频可以在不完美的未来预测下作为探索和策略适应的有用结构化先验。https://enact2026.github.io/

英文摘要

Multi-step robot manipulation requires acting under uncertainty about how the scene will evolve, making exploration and policy adaptation challenging. We study whether short-horizon, task-consistent future videos can provide useful structured priors for control and reinforcement-learning fine-tuning. We formalize this idea through Future-Experience Conditioning (FEC), a simple interface that conditions closed-loop policies on a latent representation of a short future video. In our simulation setup, future clips are generated in three stages, an LLM reasoner operating over a task ontology initialized from the current scene state, a robot-free digital-twin rollout of the intended object motion, and a mask-free video diffusion model that synthesizes a robot-consistent future clip without requiring segmentation at inference. We instantiate this future-conditioning interface primarily with BC and BC+RL, and compare against a future-conditioned Streaming Flow Policy (SFP) baseline on RoboCasa and CALVIN under NoFuture, GTFuture, GenFuture, and WrongFuture. Generated futures improve performance over no-future conditioning, while mismatched futures degrade it, and our BC+RL instantiation achieves the strongest overall results. An average BC+RL learning-curve analysis across 8 CALVIN tasks further shows that GTFuture improves fastest, GenFuture improves earlier and to a higher level than NoFuture, and WrongFuture remains at zero throughout training. These results suggest that short-horizon future videos can serve as useful structured priors for exploration and policy adaptation under imperfect future predictions. https://enact2026.github.io/

URL PDF HTML ☆

赞 0 踩 0

2605.29863 2026-05-29 cs.LG

STAP: A Shuffle-Tokenized App Predictor with Ultra Long Context for Vocabulary-Free Mobile App Prediction

STAP: 一种基于洗牌令牌化的超长上下文无词汇表移动应用预测器

Chengyu Fan, Hang Liu

AI总结提出STAP模型，通过洗牌机制将应用身份替换为虚拟索引，并利用超长上下文处理行为序列，实现无固定词汇表的跨数据集零样本移动应用预测。

详情

Comments: 15 pages, 9 figures, 5 tables Preprint submitted to Expert Systems with Applications

AI中文摘要

预测用户将启动的下一个移动应用对于智能设备资源管理和主动辅助至关重要。现有模型依赖于固定的应用词汇表，这阻碍了它们在不同应用生态系统中的泛化能力。许多模型还依赖于用户特定知识，这使冷启动场景下的部署复杂化。我们提出STAP，一种基于Transformer的模型，消除了对固定词汇表的需求。STAP通过洗牌机制将真实应用身份替换为随机重新分配的虚拟索引，并通过超长上下文设计处理行为序列来补偿丢弃的语义信息。理论分析表明，在给定足够长的上下文的情况下，尽管映射是匿名的，预测分布仍收敛到正确分布。在两个来自不同大陆的数据集上的实验表明，STAP实现了强大的跨数据集零样本预测准确性——这是所有现有固定词汇表方法本质上不适用的情况——同时其在每个数据集内的冷启动性能与领先模型保持竞争力。此外，我们引入了一种部署策略，使模型在连续推理期间能够保持足够长的上下文，同时将延迟控制在可接受范围内。

英文摘要

Predicting the next mobile application a user will launch is essential for intelligent device resource management and proactive assistance. Existing models rely on fixed app vocabularies, which prevents them from generalizing across different app ecosystems. Many also depend on user-specific knowledge, which complicates deployment in cold start scenarios. We propose STAP, a Transformer-based model that eliminates the need for a fixed vocabulary. STAP replaces true app identities with randomly reassigned virtual indices via a shuffle mechanism, and compensates for discarded semantic information by processing behavioral sequences with an ultra-long context design. A theoretical analysis shows that, given a sufficiently long context, the predicted distribution converges to the correct one despite the anonymity of the mapping. Experiments on two datasets from different continents demonstrate that STAP achieves strong cross-dataset zero-shot prediction accuracy -- a setting where all existing fixed-vocabulary methods are inherently inapplicable -- while its cold start performance within each dataset remains competitive with leading models. Furthermore, we introduce a deployment strategy that enables the model to retain a sufficiently long context during continuous inference while keeping latency within acceptable bounds.

URL PDF HTML ☆

赞 0 踩 0

2605.29862 2026-05-29 eess.AS cs.AI cs.SD

Mitigating Stethoscope-Induced Shortcuts in Respiratory Sound Classification under Federated Domain Generalization with Causality-Inspired Interventions

在联邦域泛化下通过因果启发的干预减轻听诊器引起的呼吸音分类中的捷径

Heejoon Koo, Yoon Tae Kim, Miika Toikkanen, June-Woo Kim

AI总结针对呼吸音分类中听诊器设备差异导致的域偏移问题，提出一种因果启发的多模态联邦域泛化框架，通过内容保持的风格扰动、反事实文本增强和梯度对齐实现设备不变表示，在ICBHI和SPRSound数据集上优于传统方法。

详情

Comments: 2 figures, 4 tables, and 5 pages

AI中文摘要

基于AI的呼吸音分类（RSC）有望实现自动化肺部疾病检测，但多站点部署受到听诊器间差异的阻碍。我们针对听诊器引起的设备偏移引入了一种联邦域泛化（FedDG）公式，其中客户端使用异构设备，模型在未见设备上进行评估。我们的实证分析表明，听诊器引起的风格和疾病特定内容紧密纠缠，使得确定性风格去除不可靠。为此，我们提出了一种因果启发的多模态FedDG框架，结合了：(i) 因果启发的设备风格干预网络，执行内容保持的风格扰动，(ii) 反事实文本增强，中和元数据捷径，以及(iii) 梯度对齐，促进跨客户端的设备不变表示。基于多模态语言-音频预训练模型，在ICBHI和SPRSound数据集上的留一设备验证中，它优于传统数据增强和联邦学习基线。代码将在发表后发布。

英文摘要

AI-driven respiratory sound classification (RSC) is promising for automated pulmonary disease detection, yet multi-site deployment is hindered by inter-stethoscope variability. We introduce a federated domain generalization (FedDG) formulation for RSC under stethoscope-induced device shifts, where clients use heterogeneous devices and the model is evaluated on unseen devices. Our empirical analysis shows that stethoscope-induced style and disease-specific content are tightly entangled, making deterministic style removal unreliable. In response, we propose a causality-inspired multimodal FedDG framework that combines: (i) a causality-inspired device style intervention network that performs content-preserving style perturbations, (ii) counterfactual text augmentation that neutralizes metadata shortcuts, and (iii) gradient alignment that facilitates device-invariant representations across clients. Built on a multimodal language-audio pretraining model, it outperforms conventional data augmentation and federated learning baselines in leave-one-device-out validation on ICBHI and SPRSound datasets. Code will be released upon publication.

URL PDF HTML ☆

赞 0 踩 0

2605.29860 2026-05-29 cs.LG cs.AI

ESPO: Early-Stopping Proximal Policy Optimization

ESPO：早期停止的近端策略优化

Zihang Li, Rui Zhou, Yingcheng Shi, Wenhan Yu, Zhewen Tan, Zixiang Liu, Zeming Li, Binhua Li, Yongbin Li, Tong Yang, Jieping Ye

AI总结提出ESPO算法，通过在强化学习训练大语言模型时在线检测轨迹失败并提前终止，节省计算资源并提升数学推理性能。

详情

AI中文摘要

当大语言模型在强化学习过程中，在轨迹早期出现错误的推理步骤时，标准算法会强制其继续生成直到最大步长，从而在从未获得正奖励的令牌上浪费计算资源，并用失败后的噪声污染优势估计。我们提出ESPO（早期停止的近端策略优化），该算法能够在线检测轨迹失败并提前终止轨迹生成。在每个生成步骤中，ESPO仅利用采样过程中已计算出的logits计算一个替代遗憾值，并在平滑累积遗憾值显著超过其估计值时终止。截断轨迹被视为具有终止奖励的吸收失败状态，将负的时间差分误差集中在检测到的失败步骤附近，无需任何额外的奖励模型或人工标注。在基于DeepSeek-R1-Distill-Qwen-7B训练的数学推理任务上，ESPO在AIME 2024（46.28% vs. 45.25%）、AMC 2023（85.83% vs. 82.94%）和MATH-500（87.42% vs. 85.43%）上超越了PPO，同时累计节省了超过20%的轨迹生成令牌。

英文摘要

When a large language model under reinforcement learning commits a wrong reasoning step early in a trajectory, standard algorithms force it to keep generating until the maximum horizon, spending compute on tokens that never receive positive reward and polluting advantage estimates with post-failure noise. We propose ESPO (Early-Stopping Proximal Policy Optimization), which detects trajectory failure on-the-fly and terminates rollouts early. At each generation step, ESPO computes a surrogate regret using only the logits already computed during sampling, and terminates when the smoothed cumulative regret significantly exceeds its estimated values. Truncated trajectories are treated as absorbing failure states with a terminal reward, concentrating negative temporal-difference (TD) errors near the detected failure step without any additional reward model or human annotation. On DeepSeek-R1-Distill-Qwen-7B trained for mathematical reasoning, ESPO surpasses PPO on AIME~2024 (46.28% vs. 45.25%), AMC~2023 (85.83% vs. 82.94%), and MATH-500 (87.42% vs. 85.43%), while saving more than 20% rollout tokens cumulatively.

URL PDF HTML ☆

赞 0 踩 0

2605.29859 2026-05-29 eess.AS cs.CL

MELD: Mel-Spectrogram-Based Speech Language Modeling with Discrete Latent Variables

MELD: 基于梅尔频谱的离散潜变量语音语言建模

Sung-Lin Yeh, Wei Zhou, Gil Keren, Duc Le, Zhong Meng, Hao Tang, Jay Mahadeokar, Ozlem Kalinli, Alexandre Mourachko

AI总结提出一种在梅尔频谱上联合优化编码器和语音语言模型的离散潜变量模型，在零样本文本转语音和语音转文本任务上优于基于编解码器和其他梅尔频谱基线，并缓解了自回归建模中的长时间静音和单词遗漏问题。

2605.29858 2026-05-29 cs.CV

Masked Diffusion Vision-Language Models for Temporal Action Localization

用于时序动作定位的掩码扩散视觉语言模型

Fengshun Wang, Zhengbo Zhang, Zhigang Tu

AI总结提出掩码扩散视觉语言模型（MDVLM）用于时序动作定位，通过双向注意力迭代去噪联合优化语义和边界，并引入边界感知掩码和步级IoU奖励解决训练不匹配问题。

详情

AI中文摘要

时序动作定位（TAL）需要在未修剪视频中识别目标事件并精确定位其开始和结束时间。最近的视觉语言公式改进了语义推理并支持语言条件输出，但其自回归解码器仍然从左到右生成令牌，阻止了后续语义证据修正早期时间戳预测。我们将掩码扩散视觉语言模型（MDVLM）适配到TAL，使得语义令牌和边界令牌在具有双向注意力的迭代去噪过程中保持可编辑，从而允许时间边界和语义内容共同细化。然而，直接适配会产生两个TAL特定的不匹配：标准掩码扩散训练随机均匀地破坏所有位置，但时间令牌在有足够语义上下文时更可靠；令牌级交叉熵不反映时序IoU。为了解决这些不匹配，我们引入了一个计划训练目标，该目标使用边界感知掩码和步加权重构来排练时间令牌的后期恢复，同时引入步级IoU奖励，在去噪过程中提供重叠感知监督。标准序列级交叉熵项提供基础重构信号。在ActivityNet-RTL、ActivityNet-1.3和THUMOS-14上的实验表明，MDVLM-TAL在时序推理和边界定位方面均优于自回归视觉语言基线，在更严格的时序IoU标准下尤其显著。

英文摘要

Temporal action localization (TAL) requires recognizing the target event and localizing its start and end times precisely in untrimmed videos. Recent vision-language formulations improve semantic reasoning and support language-conditioned outputs, but their autoregressive decoders still generate tokens from left to right, preventing later semantic evidence from revising earlier timestamp predictions. We adapt masked diffusion vision-language models (MDVLMs) to TAL so that semantic tokens and boundary tokens remain editable throughout iterative denoising with bidirectional attention, allowing temporal boundaries and semantic content to be refined jointly. Direct adaptation, however, creates two TAL-specific mismatches: standard masked diffusion training corrupts all positions uniformly at random, but the time tokens are more reliable when enough semantic context is available; and token-level cross-entropy does not reflect temporal IoU. To address these mismatches, we introduce a Planned Training Objective that uses boundary-aware masking and step-weighted reconstruction to rehearse the late recovery of time tokens, together with a Step-Level IoU Reward that provides overlap-aware supervision during denoising. A standard sequence-level cross-entropy term provides the base reconstruction signal. Experiments on ActivityNet-RTL, ActivityNet-1.3, and THUMOS-14 show that MDVLM-TAL improves both temporal reasoning and boundary localization over autoregressive vision-language baselines, with especially strong gains under stricter temporal IoU criteria.

URL PDF HTML ☆

赞 0 踩 0

2605.29857 2026-05-29 cs.LG

Feedback-to-Rubrics: Can We Learn Expert Criteria from Inline Comments?

从内联评论到评分标准：我们能从内联评论中学习专家标准吗？

Kotaro Yoshida, So Kuroki, Yuki Imajuku, Taishi Nakamura, Ryunosuke Iwai, Haruki Goda, Takuya Akiba

AI总结提出从内联评论中学习可复用的自然语言评分标准的方法，通过迭代优化评分标准来预测评论并支持自动修订。

2605.29856 2026-05-29 cs.CV

Building and Road Recognition in Dense Urban Informal Settlements: A Dataset and Benchmark

密集城市非正规住区中的建筑与道路识别：数据集与基准

Hongyu Long, Jiaxuan Liu, Rui Cao

AI总结针对城市村等高密度非正规住区缺乏精细标注数据的问题，构建了首个高分辨率遥感数据集DenseUIS，并评估了现有深度学习模型，揭示了其在处理密集非正规形态上的局限性，为复杂高密度环境下的精细城市制图提供了基准。

详情

Comments: 5 pages, 4 figures;

AI中文摘要

作为一种普遍存在的非正规住区形式，城中村对可持续城市发展和治理提出了重大挑战。精确绘制其基础设施至关重要，然而，现有的遥感数据集主要关注正规城市环境，缺乏针对城中村典型的高密度建筑模式和狭窄道路网络的精细标注数据。为填补这一空白，我们引入了 extit{DenseUIS}数据集，这是首个专门用于极度密集城市非正规住区中建筑和道路提取的高分辨率遥感数据集，覆盖了中国深圳和广州的126个城中村。此外，我们对该数据集上的最先进深度学习模型进行了全面评估。实验结果表明，现有方法在处理密集非正规住区的独特形态模式方面存在局限性，凸显了对专门方法的需求。因此， extit{DenseUIS}为推进复杂高密度非正规环境中的精细城市制图提供了一个稳健的基准。该数据集公开于https://github.com/rui-research/DenseUIS。

英文摘要

As a widespread form of informal settlements, urban villages present significant challenges for sustainable urban development and governance. Precise mapping of their infrastructure is essential, however, existing remote sensing datasets primarily focus on formal urban environments, lacking fine-grained annotated data for the high-density building patterns and narrow road networks typical of urban villages. To address this gap, we introduce the \textit{DenseUIS} dataset, the first high-resolution remote sensing dataset specifically designed for building and road extraction in extremely dense urban informal settlements, covering 126 urban villages across Shenzhen and Guangzhou in China. Furthermore, we conduct a comprehensive evaluation of state-of-the-art deep learning models on this dataset. Experimental results reveal the limitations of existing methods in handling the unique morphological patterns of dense informal settlements, underscoring the need for specialized approaches. \textit{DenseUIS} therefore provides a robust benchmark for advancing fine-grained urban mapping in complex and high-density informal environments. The dataset is publicly available at https://github.com/rui-research/DenseUIS.

URL PDF HTML ☆

赞 0 踩 0

2605.29850 2026-05-29 cs.LG

MIRAGE: Adaptive Multimodal Gating for Whole-Brain fMRI Encoding

MIRAGE：用于全脑fMRI编码的自适应多模态门控

Abdulkadir Gokce, Badr AlKhamissi, Martin Schrimpf

AI总结提出MIRAGE框架，通过原生多模态骨干网络和自适应特征门控，实现全脑fMRI对自然视听刺激的高精度编码，并证明原生多模态特征优于后期融合的单模态特征。

详情

Comments: Preprint. First two author contributed equally

AI中文摘要

近期任务优化神经网络的进展已将编码模型确立为预测大脑对自然刺激反应的有力工具，然而现有方法大多依赖单模态表示。全模态基础模型和丰富的多模态神经数据集的出现，使得能够联合整合跨被试的视觉、听觉和语言信息的编码模型成为可能。我们提出MIRAGE，一个用于预测全脑fMRI对自然视听刺激反应的脑编码框架。MIRAGE通过原生多模态骨干网络和跨层自适应特征门控实现了最先进的性能。这些表示随后与基于transformer的脑编码器和跨皮层分区的被试特定线性头相结合。控制比较表明，原生多模态特征在架构层次和骨干网络上始终优于独立单模态特征的事后聚合。除了预测准确性，学习的注意力权重可直接检查以解释骨干网络上的模态特定门控分布，每种模态在皮层上描绘出不同的解剖模式。综合这些结果，提出了原生多模态特征的自适应逐层聚合作为全脑编码的一种可泛化、可解释且准确的方法。

英文摘要

Recent progress in task-optimized neural networks has established encoding models as a powerful tool for predicting brain responses to naturalistic stimuli, yet most existing approaches rely on unimodal representations. The emergence of omni-modal foundation models and rich multimodal neural datasets enables encoding models that jointly integrate visual, auditory, and linguistic information across subjects. We introduce MIRAGE, a brain encoding framework for predicting whole-brain fMRI responses to naturalistic audiovisual stimuli. MIRAGE achieves state-of-the-art performance via a native multimodal backbone and adaptive feature gating across layers. These representations are then combined with a transformer-based brain encoder and a subject-specific linear head over the cortical parcels. Controlled comparisons show that natively multimodal features consistently outperform post-hoc aggregation of independent unimodal features, across architectural levels and backbones. Beyond predictive accuracy, the learned attention weights are directly inspectable to interpret the modality-specific gating profile over the backbone, and each modality traces a distinct anatomical pattern across cortex. Together, these results propose adaptive layer-wise aggregation of natively multimodal features as a generalizable, interpretable, and accurate approach for whole-brain encoding.

URL PDF HTML ☆

赞 0 踩 0

2605.29849 2026-05-29 eess.SY cs.LG cs.SY

BuilDyn: Excitation-Driven Data Generation for Building Thermal Dynamics Modeling and Control

BuilDyn: 面向建筑热动力学建模与控制的激励驱动数据生成

Felix Koch, Thomas Krug, Fabian Raisch, Benjamin Schäfer, Benjamin Tischler

AI总结本文提出BuilDyn包，通过可定制的激励策略生成控制导向的建筑数据，提升机器学习模型对未见工况的鲁棒性。

详情

AI中文摘要

机器学习越来越多地用于建筑的数据驱动建模，以实现故障检测与诊断、节能控制等下游任务。虽然最近的工作改善了跨建筑特性、天气和占用率的泛化能力，但泛化也依赖于对控制驱动系统状态空间的充分探索。现有的真实世界数据集和仿真环境主要反映固定控制策略下的稳态运行，导致激励有限，对未见工况的鲁棒性降低。本文介绍了基于BuilDa的BuilDyn包，该包支持可定制的激励策略用于控制导向的数据生成。BuilDyn还支持从代表性建筑分布中采样，并提供Python接口以便轻松集成到机器学习流水线中。我们通过比较在非激励和激励数据上训练的数据驱动ML模型在一栋建筑上的性能，展示了BuilDyn的优势。借助BuilDyn，我们希望推进可扩展的控制导向建模，并支持迁移学习和建筑特定基础模型等未来方向。

英文摘要

Machine learning (ML) is increasingly used for data-driven modeling of buildings to enable downstream tasks such as fault detection and diagnosis, and energy-efficient control. While recent work improves generalization across building characteristics, weather, and occupancy, generalization also depends on sufficient exploration of the control-driven system state space. Existing real-world datasets and simulation environments predominantly reflect stationary operation under fixed control policies, resulting in limited excitation and reduced robustness to unseen operating conditions. This paper introduces BuilDyn, a package based on BuilDa that enables customizable excitation strategies for control-oriented data generation. BuilDyn further supports sampling from representative building distributions and provides a Python interface for easy integration into machine learning pipelines. We demonstrate the benefits of BuilDyn by comparing the performance of data-driven ML models trained on non-excited and excited data for one building. With BuilDyn, we hope to advance scalable control-oriented modeling and support future directions such as transfer learning and building-specific foundation models.

URL PDF HTML ☆

赞 0 踩 0

2605.29847 2026-05-29 cs.CL

EvoRubric: Self-Evolving Rubric-Driven RL for Open-Ended Generation

EvoRubric: 用于开放生成的自我进化评分标准驱动强化学习

Xin Guan, Xiaomeng Hu, Shen Huang, Zhenyi Wang, Bo Zhang, Zijian Li, Pengjun Xie, Bo Liu, Jiuxin Cao

AI总结提出EvoRubric，一种单策略共进化强化学习框架，通过动态交替生成响应和评分标准，并引入多层验证机制，解决开放生成任务中缺乏明确奖励的问题，在医学、写作和科学领域超越传统静态和外部LLM驱动方法。

详情

AI中文摘要

强化学习（RL）在可验证领域显著提升了大型语言模型（LLM），但由于缺乏明确的奖励，为开放生成任务对齐模型仍然极具挑战性。当前的基于评分标准的RL方法通过使用显式标准来缓解这一问题；然而，它们严重依赖于静态的人工标注评分标准，这不可避免地导致策略滞后，或者依赖昂贵的外部专有模型进行动态更新。在本文中，我们提出了EvoRubric，一种新颖的单策略共进化RL框架，消除了对静态标准和外部评分标准生成器的依赖。通过将响应生成和评分标准生成统一在单一参数化策略下，EvoRubric在推理器和评分标准生成器之间动态交替。为了防止奖励黑客攻击并确保生成信号的可信度，我们引入了一个多层验证流程，包括元验证器、零方差剪枝和留一法同行共识机制。经过验证的标准被动态归档到记忆池中，产生密集的多目标奖励，以持续共同优化两个角色。在医学、写作和科学领域的广泛实验表明，EvoRubric始终优于传统的静态和外部LLM驱动的对齐方法。值得注意的是，我们的框架与人类专家先验知识兼容。当使用专家标注的评分标准初始化时，EvoRubric能够进一步发现新颖的、有区分度的维度，从而实现比仅依赖静态专家标注更好的性能。

英文摘要

Reinforcement Learning (RL) has significantly advanced Large Language Models (LLMs) in verifiable domains, but aligning models for open-ended generation remains profoundly challenging due to the lack of definitive rewards. Current rubric-based RL methods mitigate this by employing explicit criteria; however, they rely heavily on static, human-annotated rubrics that inevitably cause policy lag, or expensive external proprietary models for dynamic updates. In this paper, we propose EvoRubric, a novel single-policy co-evolutionary RL framework that eliminates the reliance on static criteria and on external rubric generators. By unifying response generation and rubric generation under a single parameterized policy, EvoRubric dynamically alternates between a Reasoner and a Rubric Generator. To prevent reward hacking and ensure the reliability of generated signals, we introduce a multi-level verification pipeline featuring a meta-verifier, zero-variance pruning, and a Leave-One-Out peer consensus mechanism. Validated criteria are dynamically archived into a memory pool, yielding dense, multi-objective rewards to continuously co-optimize both roles. Extensive experiments across Medical, Writing, and Science domains demonstrate that EvoRubric consistently outperforms traditional static and external-LLM-driven alignment methods. Notably, our framework is compatible with human-expert priors. When initialized with expert-annotated rubrics, EvoRubric can further uncover novel, discriminative dimensions, achieving better performance than relying solely on static expert annotations.

URL PDF HTML ☆

赞 0 踩 0

2605.29843 2026-05-29 cs.LG cs.AI

HARP: Hadamard-Preconditioned Adaptive Rotation Processor for Extreme LLM Quantization

HARP: 哈达玛预条件自适应旋转处理器用于极端LLM量化

Artur Zagitov, Gleb Molodtsov, Aleksandr Beznosikov

AI总结提出HARP，一种可学习的结构化双正交处理器，替代固定随机哈达玛变换，通过自适应旋转基来改善极端低位量化中的激活异常值和各向异性权重曲率问题，在2-4比特设置下提升困惑度和零样本准确率，并保持部署效率。

详情

AI中文摘要

后训练量化（PTQ）对于在内存和带宽约束下部署LLM至关重要。然而，极端低位量化仍然对激活异常值和各向异性权重曲率高度敏感。现有的基于非相干性的PTQ方法通过固定的随机哈达玛变换（RHT）缓解了这一问题，这提高了量化鲁棒性，但无法将旋转基适应于层、校准分布或量化器。我们引入了HARP（哈达玛预条件自适应旋转处理器），一种可学习的结构化双正交处理器，它替代了固定的哈达玛混合，同时保留了精确的全精度等价性。HARP将每个旋转表示为稀疏蝶形类块正交阶段的乘积，通过混合基数调度支持非2的幂次维度，并初始化为RHT处理器（最多一个固定排列）。仅在校准数据上拟合，HARP将量化基适应于每一层和后端。在从1B到70B参数的模型的2-4比特设置中，HARP在困惑度和零样本准确率上优于固定RHT。重要的是，HARP保持了部署效率，达到128 tok/s，而FP16为61 tok/s。

英文摘要

Post-training quantization (PTQ) is essential for deploying LLMs under memory and bandwidth constraints. However, extreme low-bit quantization remains highly sensitive to activation outliers and anisotropic weight curvature. Existing incoherence-based PTQ methods mitigate this issue with fixed randomized Hadamard transforms (RHTs), which improve quantization robustness but cannot adapt the rotated basis to the layer, calibration distribution, or quantizer. We introduce HARP (Hadamard-preconditioned Adaptive Rotation Processor), a learnable structured two-sided orthogonal processor that replaces fixed Hadamard mixing while preserving exact full-precision equivalence. HARP represents each rotation as a product of sparse butterfly-like block-orthogonal stages, supports non-power-of-two dimensions via Mixed-Radix schedules, and initializes to the RHT processor up to a fixed permutation. Fitted only on calibration data, HARP adapts the quantization basis to each layer and backend. Across 2-4 bit settings on models ranging from 1B to 70B parameters, HARP improves perplexity and zero-shot accuracy over fixed RHT. Importantly, HARP preserves deployment efficiency, reaching 128 tok/s versus 61 tok/s for FP16.

URL PDF HTML ☆

赞 0 踩 0

2605.29836 2026-05-29 cs.LG cs.AI stat.ML

CB-SLICE: Concept-Based Interpretable Error Slice Discovery

CB-SLICE: 基于概念的可解释错误切片发现

Yael Konforti, Mateo Espinosa Zarlenga, Elaf Almahmoud, Mateja Jamnik

AI总结提出CB-SLICE方法，利用概念瓶颈模型的概念预测失败来发现错误切片，并通过关键词概念解释失败模式，优于现有方法。

详情

Comments: 20 pages, 7 figures, 12 tables, to be published at Proceedings of the 43rd International Conference on Machine Learning (ICML 2026)

AI中文摘要

尽管平均性能强劲，深度学习模型在特定人群组（称为错误切片）上常表现出系统性错误。识别这些组及其失败的根本原因对于模型调试和偏差缓解至关重要。然而，现有的错误切片发现方法（SDMs）通常生成与模型推理过程脱节的解释，因此只能近似潜在错误源，可能不准确。我们通过利用概念瓶颈模型（CBMs）来解决这一局限，其预测直接依赖于人类可理解的语义概念。由于CBM中下游任务失败通常源于概念预测错误，概念表示为错误切片识别提供了强有力的候选，提供了直接关联错误源的细粒度解释。基于这一见解，我们引入CB-SLICE，一种基于概念的SDM，它将共享概念预测失败的样本分组，并识别每个切片失败模式中最关键的关键词概念。在多个基准测试中，我们展示了CB-SLICE在发现已知偏差方面优于最先进方法，同时提供更丰富、更忠实的模型错误解释。

英文摘要

Despite strong average-case performance, deep learning models often exhibit systematic errors on specific population groups, known as error slices. Identifying these groups and the root causes of their failures is critical for model debugging and bias mitigation. However, existing error Slice Discovery Methods (SDMs) typically generate explanations disconnected from the model's inference process, thus only approximating the underlying error source and may be inaccurate. We address this limitation by leveraging Concept Bottleneck Models (CBMs), whose predictions are directly dependent on human-understandable semantic concepts. Since downstream task failures in CBMs commonly arise from concept mispredictions, concept representations provide a strong candidate for error slice identification, offering fine-grained explanations directly linked to the error source. Building on this insight, we introduce CB-SLICE, a concept-based SDM that groups samples with shared concept prediction failures and identifies the keyword concepts most responsible for each slice's failure mode. Across multiple benchmarks, we show that CB-SLICE outperforms state-of-the-art methods in uncovering well-known biases while providing richer and more faithful explanations of model errors.

URL PDF HTML ☆

赞 0 踩 0

2605.29834 2026-05-29 cs.LG

Open World Autoencoding Drift Detection with Novel Class Recognition in Tabular Non-stationary Data Streams

开放世界自编码漂移检测与表格非平稳数据流中的新类识别

Joanna Komorniczak

AI总结提出一种基于自编码器重构误差的无监督概念漂移检测方法，通过密度估计识别新类样本，利用镜像自编码器独立增量适应变化分布，在合成表格数据流上实验表明与当前最优方法竞争。

2605.29829 2026-05-29 cs.AI cs.LG

OptSkills: Learning Generalizable Optimization Skills from Problem Archetypes via Cluster-Based Distillation

OptSkills: 通过基于聚类的蒸馏从问题原型中学习可泛化的优化技能

Haochen Yang, Ke Zhao, Mengyuan Ma, Xingyu Lu, Xiangfeng Wang, Hong Qian

AI总结提出OptSkills系统，通过聚类问题原型、蒸馏成功轨迹为可复用工作流技能，并动态扩展技能库，提升优化建模与求解的分布内和分布外泛化能力。

详情

Comments: 22 pages, 10 figuers, project: https://github.com/fujiwaranoM0kou/OptSkills

AI中文摘要

利用大型语言模型（LLM）从自然语言自动制定和求解优化问题已成为自动化优化的高效范式。然而，现有方法仍表现出有限的泛化能力：它们对表面叙述变化敏感，主要在案例层面复用经验，难以适应变化或新兴的问题类型。我们提出OptSkills，一个以原型为中心的技能学习和推理智能体系统，用于优化建模和求解。为提升鲁棒泛化，我们的系统根据问题的底层原型而非表面叙述进行聚类。为提升分布内泛化，它在每个聚类内探索多样的建模范式和求解器配置，然后将成功轨迹蒸馏为可重用的工作流级技能。为提升分布外泛化，它利用新获得的轨迹改进现有技能或扩展技能库。我们的系统在涵盖多种问题类型和场景的数据集上达到了68.27%的最先进微平均准确率。此外，在极具挑战性的大规模高维基准MIPLIB-NL上，它达到了26.91%的准确率，比DeepSeek-V3.2-Thinking高出4.53%。在Nano-CO上进行技能学习后，它在OOD NLCO基准上达到了72.79%。代码和技能可在https://github.com/fujiwaranoM0kou/OptSkills获取。

英文摘要

Leveraging Large Language Models (LLMs) to automatically formulate and solve optimization problems from natural language has emerged as an efficient paradigm for automated optimization. However, existing methods still exhibit limited generalization: they are sensitive to superficial narrative variations, reuse experience mainly at the case level, and struggle to adapt to shifted or emerging problem types. We propose OptSkills, an archetype-centric skill learning and reasoning agent system for optimization modeling and solving. To improve robust generalization, our system clusters problems by their underlying archetypes rather than surface narratives. To improve in-distribution generalization, it explores diverse modeling paradigms and solver configurations within each cluster, then distills successful trajectories into reusable workflow-level skills. To improve out-of-distribution generalization, it refines existing skills or expands the skill library using newly obtained trajectories. Our system achieves a state-of-the-art micro-averaged accuracy of 68.27% on datasets encompassing diverse problem types and scenarios. In addition, on MIPLIB-NL, a highly challenging large-scale and high-dimensional benchmark, it achieves 26.91% accuracy, outperforming DeepSeek-V3.2-Thinking by 4.53%. After skill learning on Nano-CO, it reaches 72.79% on the OOD NLCO benchmark. Code and skills are available at https://github.com/fujiwaranoM0kou/OptSkills.

URL PDF HTML ☆

赞 0 踩 0

2605.29828 2026-05-29 cs.LG

When Do Graph Foundation Models Transfer? A Data-Centric Theory

图基础模型何时迁移？一个以数据为中心的理论

Jiajun Zhu, Ying Chen, Peihao Wang, Yixuan He, Pan Li, Aditya Akella, Zhangyang Wang

AI总结本文通过图论连续极限方法，将跨域输出偏移分解为有限样本近似项和结构不匹配的内在域差异，并验证了位置编码稳定性对迁移的影响。

详情

Comments: 21 pages, including appendix. Accepted at ICML 2026

AI中文摘要

图基础模型（GFMs）旨在跨不同图域重用单一骨干网络，但其迁移往往不均匀，并可能出现负迁移。虽然大多数先前工作通过架构或自适应选择改进迁移，但我们提出一个以数据为中心的问题：两个图域的哪些属性决定了固定表示模型改变其输出的程度？利用基于图论的稠密图连续极限，我们证明对于基于集合和消息传递的标记化，任何Lipschitz骨干网络都允许将跨域输出偏移显式分解为（i）图特定的有限样本近似项和（ii）捕获结构不匹配的内在、重标号不变的域差异。一个关键因素是位置编码（PE）稳定性：我们为谱PE建立了稳定性保证，并突出了基于特征向量与基于子空间的PE的对比行为。在合成和真实图上的实验验证了该理论，并将该分解转化为GFM迁移中数据整理的指导。

英文摘要

Graph foundation models (GFMs) aim to reuse a single backbone across diverse graph domains, yet their transfer is often uneven and can exhibit negative transfer. While most prior work improves transfer through architectural or adaptation choices, we ask a data-centric question: which properties of two graph domains determine how much a fixed representation model changes its outputs? Using a graphon-based continuous limit for dense graphs, we show that for both set-based and message-passing tokenizations, any Lipschitz backbone admits an explicit decomposition of cross-domain output shift into (i) graph-specific finite-sample approximation terms and (ii) an intrinsic, relabeling-invariant domain discrepancy capturing structural mismatch. A key ingredient is positional-encoding (PE) stability: we establish stability guarantees for spectral PEs and highlight contrasting behaviors of eigenvector- versus subspace-based PEs. Experiments on synthetic and real graphs validate the theory and translate the decomposition into guidance for data curation in GFM transfer.

URL PDF HTML ☆

赞 0 踩 0

2605.29827 2026-05-29 cs.CV

Fairness Beyond Demographics: Optimizing Performance Across Appearance-Based Hidden Cohorts in Medical Imaging

超越人口统计学的公平性：在医学影像中优化基于外观的隐藏队列性能

Milad Masroor, Cuong Nguyen, Kevin Wells, Gustavo Carneiro

AI总结提出无标签隐藏队列公平性（LHCF）训练范式，通过聚类图像为外观队列并优化其公平性，解决医学影像模型在人口统计学属性上的性能差异问题。

详情

Comments: Pre-review version submitted to MICCAI 2026. 10 pages, 5 figures

AI中文摘要

医学图像分析模型可能在患者子组间表现出性能差异，威胁临床安全性和公平性。现有方法通常通过优化可见人口统计学属性（如性别或年龄）的准确性和公平性指标来解决这一问题，但这些属性是孤立考虑的。这种策略不仅忽略了可能更具信息量的潜在分层（这些分层可能揭示模型错误和不平等的更深层来源），而且当同时考虑多个人口统计学属性时，由于每个子组内训练数据的稀疏性，该方法无法扩展。我们通过引入无标签隐藏队列公平性（LHCF）训练范式来处理这些问题，该范式不是最大化可见人口统计学属性的公平性，而是优化从图像外观发现的潜在子群体的公平性。通过将图像聚类为K个基于外观的队列并对其应用公平性优化，LHCF揭示了模型错误的潜在来源，避免了多人口统计学属性的组合稀疏性，减少了单个和多个人口统计学属性上的差异。我们在我们提出的公平性基准HIDFairBench上证明，尽管从未使用人口统计学标签进行训练，LHCF在单个和多个人口统计学属性上提供了最先进的公平性结果。我们的结果将隐藏队列公平性定位为基于人口统计学的公平性优化的实用、可扩展且稳健的替代方案，用于可信的医学图像分析。

英文摘要

Medical image analysis models can exhibit performance disparities across patient subgroups, threatening clinical safety and fairness. Existing methods typically address this issue by optimizing accuracy and fairness metrics for visible demographic attributes (e.g., sex or age) considered in isolation. This strategy not only overlooks potentially more informative latent stratifications, which may reveal deeper sources of model error and inequity, but also fails to scale when multiple demographic attributes are considered simultaneously due to the resulting sparsity of training data within each subgroup. We deal with these issues by introducing the label-free hidden-cohort fairness (LHCF) training paradigm that instead of maximizing fairness over visible demographic attributes, it optimizes fairness across latent subpopulations discovered from image appearance. By clustering images into K appearance-based cohorts and applying fairness optimization over them, LHCF uncovers underlying sources of model error and avoids the combinatorial sparsity of multi-demographic attributes, reducing disparities across both single and multiple demographic attributes. We demonstrate on our proposed fairness benchmark, HIDFairBench, that LHCF provides state-of-the-art fairness results on single and multiple demographic attributes, despite never using demographic labels for training. Our results position hidden-cohort fairness as a practical, scalable, and robust alternative to demographic-based fairness optimization for trustworthy medical image analysis.

URL PDF HTML ☆

赞 0 踩 0

2605.29826 2026-05-29 cs.CL cs.AI

Towards Localized and Disentangled Knowledge Editing for Multimodal Large Language Models

面向多模态大语言模型的局部化与解耦知识编辑

Leijiang Gu, Zhen Zeng, Feng Li, Xinjian Gao, Zenglin Shi

AI总结针对多模态知识编辑中因果错位和特征纠缠问题，提出LDKE框架，通过快速定位关键层和解耦分类器实现精准泛化编辑并保持高局部性。

详情

AI中文摘要

现有的多模态知识编辑（MKE）方法在纠正多模态大语言模型（MLLMs）中过时或不准确的知识方面取得了进展。然而，它们存在一个关键局限性：虽然能有效修改目标事实对，但无法将编辑泛化到逻辑相关的查询，并且常常对无关但视觉或语义上关联的信息造成意外改变。我们识别并形式化了导致该问题的两种潜在失败模式：因果错位（将编辑限制在特定样本）和特征纠缠（对耦合但无关的信息造成意外改变）。为解决这些问题，我们提出局部化与解耦知识编辑（LDKE），一种通过定位事实特定模型层并将目标相关输入与无关输入解耦来实现精确和泛化编辑的新框架。我们的方法引入快速定位模块以高效识别和更新关键层，以及解耦分类器以适当路由输入从而保留无关知识。在各种基准和MLLMs上的大量实验表明，LDKE在将编辑传播到相关上下文方面实现了优越性能，同时保持了高局部性。

英文摘要

Existing methods in Multimodal Knowledge Editing (MKE) have advanced the ability to correct outdated or inaccurate knowledge in Multimodal Large Language Models (MLLMs). However, they exhibit a critical limitation: while effectively modifying target factual pairs, they fail to generalize edits to logically related queries and often cause unintended alterations to unrelated but visually or semantically linked information. We identify and formalize two underlying failure modes causing this issue: Causal Misalignment, which confines edits to the specific sample, and Feature Entanglement, which causes unintended alterations to coupled but irrelevant information. To address these issues, we propose Localized and Disentangled Knowledge Editing (LDKE), a new framework that achieves precise and generalized editing by localizing fact-specific model layers and disentangling target-relevant inputs from irrelevant ones. Our approach introduces a Fast Localization module to identify and update critical layers efficiently, along with a Disentanglement Classifier that routes inputs appropriately to preserve unrelated knowledge. Extensive experiments across various benchmarks and MLLMs demonstrate that LDKE achieves superior performance in propagating edits to related contexts while maintaining high locality.

URL PDF HTML ☆

赞 0 踩 0

2605.29822 2026-05-29 cs.SE cs.AI

Inferring Code Correctness from Specification

从规约推断代码正确性

Tambon Florian, Papadakis Mike

AI总结提出TRAILS方法，通过基于规约的类别划分生成测试输入并执行，利用LLM评估输入输出对是否符合规约，从而推断代码正确性，在LiveCodeBench和CoCoClaNeL数据集上相比基线方法提升了马修斯相关系数并增强了稳定性。

详情

AI中文摘要

大型语言模型（LLM）已成为现代软件开发不可或缺的一部分，实现了大规模自动代码生成。然而，验证LLM生成代码的正确性仍然是一个关键且基本未解决的挑战。现有方法要么依赖多个代码候选之间的动态共识——这使得它们成本高昂且难以扩展，要么依赖静态推理，容易受到动态错误和顺序偏差的影响。在本文中，我们提出TRAILS（通过输入和规约的目标推理一致性），一种将LLM推理与具体（输入，输出）对相结合的方法。TRAILS首先基于规约通过类别划分生成多样化的测试输入，然后针对候选代码执行这些输入，并提示LLM评估产生的输入输出对是否符合规约——而无需对代码本身进行推理。分数跨输入聚合，以确定程序是否可能正确。我们在两个数据集LiveCodeBench和CoCoClaNeL上，使用三个LLM（Qwen3Coder-30B、Devstral-Small-24B和Olmo3.1-Instruct）评估TRAILS，并与HoarePrompt和零样本思维链基线进行比较。TRAILS的马修斯相关系数相比零样本思维链提高了高达39%，并且始终优于HoarePrompt。除了准确性，TRAILS在多次运行中表现出更高的稳定性，降低了对LLM非确定性的敏感性，并且相比竞争方法为更多独特的代码样本分配了正确的标签。

英文摘要

Large language models (LLMs) have become integral to modern software development, enabling automated code generation at scale. However, validating the correctness of LLM-generated code remains a critical and largely unsolved challenge. Existing approaches either rely on dynamic consensus across multiple code candidates - making them costly and difficult to scale - or on static reasoning that is susceptible to dynamic bugs and order bias. In this paper, we propose TRAILS~ (Targeted Reasoning Agreement via Inputs and Specifications), an approach that grounds LLM reasoning with concrete (input, output) pairs. TRAILS~ first generates diverse test inputs via category partitioning based on the specification, then executes them against the candidate code and prompts LLMs to assess whether the resulting input-output pairs conform to the specification - without ever reasoning over the code itself. Scores are aggregated across inputs, to determines whether the program is likely correct. We evaluate TRAILS~ on two datasets, LiveCodeBench and CoCoClaNeL, across three LLMs (Qwen3Coder-30B, Devstral-Small-24B, and Olmo3.1-Instruct), comparing against HoarePrompt and a Zero-Shot Chain-of-Thought baseline. TRAILS~ improves Matthew Correlation Coefficient by up to 39\% relative to Zero-Shot COT and consistently outperforms HoarePrompt. Beyond accuracy, TRAILS~ demonstrates greater stability across seeded runs, reducing sensitivity to LLM non-determinism, and assigns correct labels to a larger set of unique code samples than competing approaches.

URL PDF HTML ☆

赞 0 踩 0

2605.29819 2026-05-29 cs.LG

The Interplay Between Interpolation and Aggregation in Regression: Optimal Sample Complexity

回归中插值与聚合的相互作用：最优样本复杂度

Mikael Møller Høgsgaard, Kasper Green Larsen, Liang-Yu Zou

AI总结本文从理论上研究回归中插值与聚合的相互作用，证明γ-图维度刻画了广泛自然聚合过程的可学习性，并发现通过中位数聚合三个插值假设的简单过程在所有聚合过程中最优，且严格强于恰当学习。

2605.29816 2026-05-29 cs.AI

Harnessing non-adversarial robustness in large language models

利用大语言模型中的非对抗鲁棒性

Qinghua Zhou, Ellina Aleshina, Andrey Lovyagin, Oleg Somov, Mikhail Seleznyov, Alexander Panchenko, Ivan Oseledets, Elena Tutubalina, Ivan Y. Tyukin

AI总结本文通过理论分析和实验，提出了一种基于去偏的微调方法，以提升大语言模型对语义相似但文本不同的提示的鲁棒性，并提供了认证保证。

详情

AI中文摘要

本文提出了一种方法来解决大语言模型（LLMs）对由语义相似但文本不同的提示引起的改变和潜在错误的鲁棒性挑战。最近的研究表明，这类提示变化会显著影响LLMs在任务上的性能。核心问题是：能否在不重新训练整个模型的情况下，获得LLMs对语义中性提示变化的鲁棒性？我们通过理论和实验来探讨这个问题。我们的理论分析揭示了一个影响模型鲁棒性的关键因素——神经网络模块输出中的系统性预期偏移或扰动引起的偏差。受此分析启发，我们表明可以通过一个简单的微调过程实现鲁棒性：为鲁棒性进行去偏。我们确定了去偏有帮助和没有帮助的条件，并通过理论和大量实验证明，为鲁棒性进行去偏确实可以成为一种快速有效的工具，以增强鲁棒性并提供对随机提示扰动的认证。

英文摘要

The work presents an approach for addressing the challenge of robustness in Large Language Models (LLMs) to alterations and potential errors caused by semantically similar but textually different prompts. Recent works have shown that these kinds of prompt variations can significantly impact the performance of LLMs on tasks. The central question is: can LLMs' robustness to semantically-neutral prompt alterations be acquired without expensive retraining of the entire model? We address this question both theoretically and through experiments. Our theoretical analysis reveals a crucial factor impacting model robustness - a systematic expected shift or perturbation-induced bias in neural network module outputs. Motivated by this analysis, we show that robustness can be achieved via a simple fine-tuning process: debiasing for robustness. We identify conditions when debiasing helps and when it does not, and demonstrate, through both theory and extensive experiments, that debiasing for robustness may indeed be a quick and efficient tool to enhance robustness and provide certification against random prompt perturbations.

URL PDF HTML ☆

赞 0 踩 0

2605.29815 2026-05-29 cs.AI cs.CL

PRAIB: Peer Review AI Benchmark of Behaviour of LLM-Assisted Reviewing

PRAIB: 大语言模型辅助审稿行为的同行评审AI基准

Krzysztof Żurawicki, Julia Farganus, Arkadiusz Gaweł, Mateusz Bystroński, Tomasz Jan Kajdanowicz

AI总结提出PRAIB框架，通过定义审稿特异性、风格和参与行为的指标，并基于11000条机器生成审稿与人类审稿的对比实验，揭示LLM审稿在评分、交叉引用和弱点识别方面与人类审稿的系统性差异。

详情

AI中文摘要

提交论文数量的增长促使人们探索利用大型语言模型（LLMs）来支持和增强同行评审过程，特别是在提高其速度和可扩展性方面。然而，目前尚不清楚LLMs是否以与人类审稿人相同的方式处理科学稿件，还是仅仅生成看起来像审稿的文本。为了解决这个问题，我们引入了同行评审AI基准（PRAIB），这是一个新颖的框架，包含精确定义的指标，用于衡量审稿的特异性、风格和参与行为。为补充PRAIB框架，我们进行了一项大规模实证研究，利用一个包含由五个专有和开源模型为1000篇ICLR和NeurIPS论文生成的11000条审稿的数据集。这些机器生成的审稿跨越2021-2025年，与原始人类反馈在不同提示策略下进行比较，以识别系统性的行为差异。我们的分析表明，生成的审稿与人类审稿人提供的反馈存在显著差异：LLM评分变异性较小、存在正向偏差且过度自信，其交叉引用模式依赖于模型且与人类规范不同。此外，通过PRAIB评估，我们观察到LLMs倾向于生成更长、更复杂的审稿，但经常忽略人类审稿人指出的原子性弱点。通过描述LLM审稿行为在哪些方面以及如何偏离人类规范，PRAIB为社区提供了一个诊断工具，用于识别LLMs目前可以可靠支持审稿过程的哪些方面，以及在部署前哪些方面需要进一步发展。

英文摘要

The growing number of submitted papers has motivated the exploration of Large Language Models (LLMs) as a means to support and augment the peer review process, particularly in terms of improving its speed and scalability. Yet, it remains unknown whether LLMs engage with scientific manuscripts in the same manner as human reviewers, or whether they merely produce review-looking text. To address this, we introduce the Peer Review AI Benchmark (PRAIB), a novel framework comprising thoroughly defined metrics that measure review specificity, style, and behavior of engagement. To complement the PRAIB framework, we conduct a large-scale empirical study leveraging a dataset of 11,000 reviews generated by five proprietary and open-source models for 1,000 ICLR and NeurIPS papers. Spanning the 2021--2025 period, these machine-generated reviews are compared against original human feedback across diverse prompting strategies to identify systematic behavioral divergences. Our analysis reveals that the generated reviews diverge significantly from feedback provided by human reviewers: LLM ratings are less variable, positively biased, and overconfident, and their cross-reference patterns are model-dependent and distinct from human norms. Furthermore, when evaluated through PRAIB, we observe that LLMs tend to generate longer, more complex reviews, yet frequently overlook the atomic weaknesses noted by human reviewers. By characterizing where and how LLMs reviewing behavior departs from human norms, PRAIB provides the community with a diagnostic tool for identifying which aspects of the review process LLMs can reliably support today and which require further development before deployment.

URL PDF HTML ☆

赞 0 踩 0

2605.29812 2026-05-29 cs.CV

Not All Inputs Are Valid: Towards Open-Set Video Moment Retrieval Using Language

并非所有输入都有效：面向开放集视频时刻检索的语言方法

Xiang Fang, Wanlong Fang, Daizong Liu, Xiaoye Qu, Jianfeng Dong, Pan Zhou, Renfu Li, Zichuan Xu, Lixing Chen, Panpan Zheng, Yu Cheng

AI总结针对开放集场景下视频时刻检索任务中无关查询导致错误检索的问题，提出基于归一化流的开放集视频时刻检索模型OpenVMR，实现分布内查询的精确检索与分布外查询的拒绝。

详情

Comments: Published in ACM MM 2024

AI中文摘要

视频时刻检索（VMR）旨在从未修剪的视频中检索与句子查询对应的特定时刻。尽管近期工作在该任务上取得了显著进展，但它们隐含地基于封闭集假设，即所有给定查询都与视频相关 ootnote{在本文中，我们将“视频相关查询”视为“分布内（ID）查询”，将“视频无关查询”视为“分布外（OOD）查询”。}。在开放集场景中，给定OOD查询时，它们仍会用于错误检索，这可能在高风险场景（例如犯罪活动检测）中导致不可挽回的损失。为此，我们创造性地探索了一种全新的VMR设置，称为开放集视频时刻检索（OS-VMR），其中我们不仅应基于ID查询检索精确时刻，还应拒绝OOD查询。在本文中，我们首次尝试迈向OS-VMR，并提出了一种新颖模型OpenVMR，该模型首先基于归一化流技术区分ID和OOD查询，然后基于ID查询进行时刻检索。具体而言，我们首先通过构建归一化流学习ID分布，并假设ID查询分布服从多元高斯分布。然后，我们引入不确定性分数来搜索ID-OOD分离边界。之后，通过拉近ID查询特征来细化ID-OOD边界。此外，分别设计了视频-查询匹配和帧-查询匹配用于粗粒度和细粒度的跨模态交互。最后，引入正-无标签学习模块用于时刻检索。在三个VMR数据集上的实验结果表明了我们的OpenVMR的有效性。

英文摘要

Video Moment Retrieval (VMR) targets to retrieve the specific moment corresponding to a sentence query from an untrimmed video. Although recent works have made remarkable progress in this task, they implicitly are rooted in the closed-set assumption that all the given queries as video-relevant\footnote{In this paper, we treat ``video-relevant query'' as ``in-distribution (ID) query'' and ``video-irrelevant query'' as ``out-of-distribution (OOD) query''.}. Given an OOD query in open-set scenarios, they still utilize it for wrong retrieval, which might lead to irrecoverable losses in high-risk scenarios, \textit{e.g.}, criminal activity detection. To this end, we creatively explore a brand-new VMR setting termed Open-Set Video Moment Retrieval (OS-VMR), where we should not only retrieve the precise moments based on ID query, but also reject OOD queries. In this paper, we make the first attempt to step toward OS-VMR and propose a novel model \textbf{OpenVMR}, which first distinguishes ID and OOD queries based on the normalizing flow technology, and then conducts moment retrieval based on ID queries. Specifically, we first learn the ID distribution by constructing a normalizing flow, and assume the ID query distribution obeys the multi-variate Gaussian distribution. Then, we introduce an uncertainty score to search the ID-OOD separating boundary. After that, we refine the ID-OOD boundary by pulling together ID query features. Besides, video-query matching and frame-query matching are designed for coarse-grained and fine-grained cross-modal interaction, respectively. Finally, a positive-unlabeled learning module is introduced for moment retrieval. Experimental results on three VMR datasets show the effectiveness of our OpenVMR.

URL PDF HTML ☆

赞 0 踩 0

2605.29809 2026-05-29 cs.CR cs.CV cs.GR cs.LG cs.MM

Cert-LAS: Toward Certified Model Ownership Verification for Text-to-Image Diffusion Models via Layer-Adaptive Smoothing

Cert-LAS：通过层自适应平滑实现文本到图像扩散模型的认证模型所有权验证

Leyi Qi, Yiming Li, Siyuan Liang, Zhengzhong Tu, Dacheng Tao

AI总结提出Cert-LAS方法，基于层自适应平滑和扩散分类器嵌入水印，通过假设检验验证模型所有权，并证明在恶意移除攻击下仍能可靠验证。

详情

Comments: This paper has been accepted to the International Conference on Machine Learning (ICML) 2026. 26 pages

AI中文摘要

大规模文本到图像（T2I）扩散模型实现了前所未有的创意应用，但其未经授权的使用引发了严重的知识产权问题，使得模型所有权验证（MOV）日益关键。我们发现现有的基于后门的扩散水印方法通常（隐式地）假设一个“忠实”的验证过程，即验证者可以查询可疑模型并获得忠实的水印响应以完成MOV。然而，在实践中，攻击者可能有意或无意地破坏潜在的水印信号，显著降低验证可靠性。为解决此问题，我们提出Cert-LAS，首个基于层自适应平滑的T2I模型认证MOV方法。通常，Cert-LAS使用扩散分类器和LFS引导的层自适应噪声嵌入指定水印，并通过假设检验检查可疑模型是否表现出比无水印参考显著更强的水印响应来验证所有权。我们进一步证明，在特定条件下，即使存在恶意移除攻击，我们的Cert-LAS仍能实现可靠验证。大量实验验证了Cert-LAS的有效性及其对自适应攻击的抵抗力。我们的代码可在https://github.com/Leyi-Qi/Cert-LAS获取。

英文摘要

Large-scale text-to-image (T2I) diffusion models have enabled unprecedented creative applications, but their unauthorized use has raised serious intellectual property concerns, making model ownership verification (MOV) increasingly critical. We find that existing backdoor-based diffusion watermarking methods often (implicitly) assume a "faithful" verification process, namely, that the verifier can query a suspicious model and obtain the faithful watermark response to complete MOV. However, in practice, adversaries may intentionally or unintentionally damage potential watermark signals, significantly degrading verification reliability. To address this issue, we propose Cert-LAS, the first certified MOV method for T2I models based on layer-adaptive smoothing. In general, Cert-LAS embeds specified watermarks using diffusion classifiers and an LFS-guided layer-adaptive noise, and verifies ownership by examining whether the suspected model exhibits significantly stronger watermark responses compared to unwatermarked references through hypothesis testing. We further prove that, under certain conditions, our Cert-LAS can still achieve reliable verification even in the presence of malicious removal attacks. Extensive experiments validate the effectiveness of Cert-LAS and its resistance to adaptive attacks. Our code is available at https://github.com/Leyi-Qi/Cert-LAS.

URL PDF HTML ☆

赞 0 踩 0