arXivDaily arXiv每日学术速递 周一至周五更新
全部学科分类 3839
专题追踪
2606.07965 2026-06-09 cs.AI 新提交

Zero-Shot Learning in Industrial Scenarios: New Large-Scale Benchmark, Challenges and Baseline

工业场景中的零样本学习:新的大规模基准、挑战与基线

Zekai Zhang, Qinghui Chen, Maomao Xiong, Shijiao Ding, Zhanzhi Su, Xinjie Yao, Yiming Sun, Cong Bai, Jinglin Zhang

发表机构 * Zhejiang University(浙江大学) Alibaba Group(阿里巴巴集团) Huawei Technologies(华为技术有限公司) Tsinghua University(清华大学) Microsoft Research(微软研究院)

AI总结 针对工业场景中视觉语言模型应用难、数据稀缺的问题,提出大规模多模态工业开放数据集MMIO和精炼文本-视觉提示RTVP,实现零样本工业缺陷检测,在MMIO上达到SOTA。

详情
AI中文摘要

大型视觉语言模型(LVLMs)在视觉任务中取得了显著成功。然而,工业场景与自然场景之间的巨大差异使得应用LVLMs具有挑战性。现有的LVLMs依赖用户提供的提示来分割目标,这常常由于包含不相关像素而导致性能次优。此外,数据的稀缺性也使得LVLMs在工业场景中的应用仍未得到探索。为填补这一空白,本文提出了一个开放的工业数据集和一个精炼文本-视觉提示(RTVP),用于零样本工业缺陷检测。首先,本文构建了包含80K+样本的多模态工业开放数据集(MMIO)。MMIO包含多样化的工业类别,包括6个超类和18个子类。MMIO是首个用于工业零样本学习的大规模多场景预训练数据集,并为未来工业场景中的开放模型提供了宝贵的训练数据。基于MMIO,本文提出了专门用于工业零样本任务的RTVP。RTVP有两个显著优势:第一,本文设计了一种专家引导的大模型领域自适应机制,并基于Mobile-SAM设计了工业零样本方法,增强了大模型在工业场景中的泛化能力。第二,RTVP直接从图像自动生成视觉提示,并考虑了先前LVLM忽略的文本-视觉提示交互,提高了视觉和文本内容的理解。在MMIO的零样本和封闭场景中,RTVP分别以42.2%和24.7%的AP达到了SOTA。

英文摘要

Large Visual Language Models (LVLMs) have achieved remarkable success in vision tasks. However, the significant differences between industrial and natural scenes make applying LVLMs challenging. Existing LVLMs rely on user-provided prompts to segment objects. This often leads to suboptimal performance due to the inclusion of irrelevant pixels. In addition, the scarcity of data also makes the application of LVLMs in industrial scenarios remain unexplored. To fill this gap, this paper proposes an open industrial dataset and a Refined Text-Visual Prompt (RTVP) for zero-shot industrial defect detection. First, this paper constructs the Multi-Modal Industrial Open Dataset (MMIO) containing 80K+ samples. MMIO contains diverse industrial categories, including 6 super categories and 18 subcategories. MMIO is the first large-scale multi-scenes pre-training dataset for industrial zero-shot learning, and provides valuable training data for open models in future industrial scenarios. Based on MMIO, this paper provides a RTVP specifically for industrial zero-shot tasks. RTVP has two significant advantages: First, this paper designs an expert-guided large model domain adaptation mechanism and designs an industrial zero-shot method based on Mobile-SAM, which enhances the generalization ability of large models in industrial scenarios. Second, RTVP automatically generates visual prompts directly from images and considers text-visual prompt interactions ignored by previous LVLM, improving visual and textual content understanding. RTVP achieves SOTA with 42.2% and 24.7% AP in zero-shot and closed scenes of MMIO.

2606.07964 2026-06-09 cs.CL 新提交

What Does Debiasing Really Remove? A Geometric Study of PCA-Based Gender Debiasing in Word Embeddings

去偏究竟移除了什么?词嵌入中基于PCA的性别去偏的几何研究

Alexey Kresin, Tchifou M. Dieffi, Tomer Caspi

发表机构 * Hood College(胡德学院) Ben-Gurion University of the Negev(内盖夫本-古里安大学)

AI总结 通过几何分析揭示PCA去偏主要移除第一主成分中的直接性别偏见,但无法消除分布在多维度上的关联偏见,且会破坏嵌入几何结构,表明偏见并非纯低秩,简单子空间移除不足以全面去偏。

Comments 8 pages, 4 figures. Source code available at https://github.com/AlexeyKresin/embedding-bias-geometry

详情
AI中文摘要

基于主成分分析(PCA)的去偏方法被广泛用于减少大型语言模型词嵌入中的性别偏见,但尚不清楚这些方法实际移除了偏见的哪些方面以及这一过程的破坏性有多大。这些方法基于偏见存在于低维子空间的理解,假设大部分偏见可以通过少数主成分捕获。在这项工作中,我们对基于PCA的性别去偏进行了系统的几何分析,并研究了嵌入空间中实际被移除的内容。我们在多个嵌入上的实验表明,直接性别偏见主要集中在前几个主成分上,支持了低秩偏见假设。然而,通过WEAT测量的关联偏见并不与这些主方向对齐,而是分布在多个嵌入维度上。此外,正如预期,我们证明移除越来越多的主成分会导致嵌入几何的一致退化,影响语义结构和向量关系。这些结果表明,基于PCA的去偏是一种权衡:虽然它有效减少了某些形式的直接偏见,但未能消除分布式关联,并引入了几何扭曲。此外,不存在通用的最优去偏水平,因为偏见减少与语义保留之间的平衡取决于所选的度量和嵌入。总体而言,我们的发现表明词嵌入中的偏见并非纯粹低秩,简单的子空间移除方法可能不足以实现全面去偏。

英文摘要

Debiasing methods based on principal component analysis (PCA) are broadly used to reduce gender bias in word embeddings used in LLMs, yet it remains unclear what aspects of bias they actually remove and how destructive this process is. These methods are based on the understanding that bias resides in a low-dimensional subspace, with the assumption that most of it can be captured by a few principal components. In this work, we conduct a systematic geometric analysis of PCA-based gender debiasing and investigate what is actually removed from the embedding space. Our experiments across multiple embeddings show that direct gender bias is primarily concentrated in the first principal component, supporting the low-rank bias hypothesis. However, associative bias measured by WEAT does not align with these principal directions and is instead spread across multiple embedding dimensions. Furthermore, as expected, we demonstrate that removing an increasing number of principal components leads to a consistent degradation of the embedding geometry, affecting semantic structure and vector relationships. These results reveal that PCA-based debiasing operates as a trade-off: while it effectively reduces certain forms of direct bias, it fails to eliminate distributed associations and introduces geometric distortion. Moreover, there is no universal optimal level of debiasing, as the balance between bias reduction and semantic preservation depends on the chosen metric and embedding. Overall, our findings suggest that bias in word embeddings is not purely low-rank and that simple subspace removal methods may be insufficient for comprehensive debiasing.

2606.07963 2026-06-09 cs.AI cs.CL 新提交

Shared Latent Structures Enable Unified Backdoor Detection and Mitigation in LLMs

共享潜在结构实现大语言模型中的统一后门检测与缓解

Omar Mahmoud, Aly M. Kassem, Thommen George Karimpanal, Buddhika Laknath Semage, Negar Rostamzadeh, Golnoosh Farnadi, Santu Rana

发表机构 * Deakin University(迪肯大学) Mila, Quebec AI Institute(魁北克人工智能研究所Mila)

AI总结 发现大语言模型中多种后门攻击共享潜在机制,通过稀疏自编码器检测因果特征,并提出双向激活操控和概念消融微调实现统一检测与缓解。

详情
AI中文摘要

大语言模型中的后门攻击通常被视为孤立的触发-响应失败,促使防御针对特定触发或行为。我们证明这种观点是不完整的。在多样化的后门行为中,我们识别出一个共享的潜在机制,可以被检测、因果控制和抑制。通过在残差流激活上使用稀疏自编码器,我们发现一小部分潜在特征在越狱、拒绝操控、密码锁定、偏见诱导、情感误分类和基于国家的有害建议中一致激活。这些特征在Qwen3、Gemma~3和Llama~3.1模型(参数从4B到32B)以及微调和权重编辑攻击中泛化。通过双向激活操控,我们证明这些特征是因果性的:抑制它们降低攻击成功率,而放大它们在干净提示上诱导目标行为。我们进一步训练轻量级SAE特征分类器,这些分类器零样本泛化到未见后门,并优于残差流和权重差异基线。最后,我们引入概念消融微调,通过在训练期间消融共享潜在子空间来抑制后门形成。总之,我们的结果表明许多后门依赖于可转移的潜在机制,从而实现统一的检测和缓解。

英文摘要

Backdoor attacks in large language models (LLMs) are often treated as isolated trigger-response failures, motivating defenses tailored to specific triggers or behaviors. We show this view is incomplete. Across diverse backdoor behaviors, we identify a shared latent mechanism that can be detected, causally controlled, and suppressed. Using sparse autoencoders (SAEs) on residual-stream activations, we find a small set of latent features consistently activated across jailbreaking, refusal manipulation, password-locking, bias induction, sentiment misclassification, and country-conditioned harmful advice. These features generalize across Qwen3, Gemma~3, and Llama~3.1 models from 4B to 32B parameters, and across both fine-tuning and weight-editing attacks. Through bidirectional activation steering, we show these features are causal: suppressing them reduces attack success, while amplifying them induces target behaviors on clean prompts. We further train lightweight SAE-feature classifiers that generalize zero-shot to unseen backdoors and outperform residual-stream and weight-diffing baselines. Finally, we introduce Concept Ablation Fine-Tuning (CAFT), which suppresses backdoor formation by ablating the shared latent subspace during training. Together, our results suggest that many backdoors rely on a transferable latent mechanism, enabling unified detection and mitigation.

2606.07962 2026-06-09 cs.CV 新提交

ChronoPhyBench: Do MLLMs Truly Understand the World or Merely Exploit Language Priors?

ChronoPhyBench:多模态大语言模型真正理解世界还是仅仅利用语言先验?

Bin Zhu, Yanhao Jia, Kexin Zhao, Jie Wang, Munan Ning, Hao Li, Yuwei Niu, Tanqing Sun, Huangchong Yan, Mingjun Pan, Xinyi Wu, Qishen Yin, Yunyang Ge, Shuai Zhao, Li Yuan

发表机构 * Peking University, Shenzhen Graduate School(北京大学深圳研究生院) Peng Cheng Laboratory(鹏城实验室) Rabbitpre Intelligence Nanyang Technological University(南洋理工大学) Tsinghua University(清华大学)

AI总结 提出ChronoPhyBench基准,通过视频上下文和文本描述强制模型进行物理状态推理,揭示当前开源模型物理推理能力仍处初级阶段。

详情
AI中文摘要

多模态大语言模型(MLLMs)的最新进展在开放世界推理和理解方面展现了卓越的能力。然而,一个关键的不确定性仍然存在:这些模型是真正融合跨模态信息以构建基于物理的推理链,还是仅仅利用强大的语言先验来掩盖单模态依赖,从而幻觉出高级多模态能力?受此启发,为严格减轻语言模态偏差和捷径,我们提出一个新的多模态时序物理动力学推理基准ChronoPhyBench,该基准将下一状态预测与视觉问答(VQA)范式统一,通过基于历史视频上下文和文本描述,强制模型通过单图像选择以及更复杂的多帧时序排序任务来推断后续物理状态。同时,我们构建了一个基于ChronoPhyBench标准的大规模多模态推理数据集,包含超过10,000个长视频及其精心标注的描述,总计500万token。我们的实验评估揭示了与以往基准结论的鲜明对比。当前开源模型执行基于物理的多模态推理的能力仍处于初级阶段。最终,本工作旨在系统性地压力测试多模态模型的推理能力,量化幻觉率,并推动物理AI的发展,从而为社区提供一个朝着通用人工智能(AGI)的稳健且透明的评估框架。

英文摘要

Recent advancements in Multimodal Large Language Models (MLLMs) have demonstrated remarkable proficiency in open-world reasoning and understanding. However, a critical ambiguity persists: it remains unclear whether these models genuinely synthesize cross-modal information to construct physically grounded reasoning chains, or if they merely exploit strong language priors to mask single-modality reliance, thereby hallucinating advanced multimodal capabilities. Motivated by this, and to rigorously mitigate language modality bias and shortcuts, we propose a novel multimodal Chrono}logical Physical Dynamics Reasoning Benchmark ChronoPhyBench, which unifies next state prediction with Visual Question Answering (VQA) paradigms by conditioning on historical video context and textual captions to enforce models to deduce subsequent physical states through both single image selection and the inherently more complex task of multiple frame chronological sorting. Concurrently, we construct a large-scale multimodal reasoning dataset curated using the ChronoPhyBench criteria, comprising over 10,000 long-form videos paired with meticulously annotated captions, totaling 5M tokens. Our experimental evaluations reveal a stark contrast to conclusions drawn by previous benchmarks. The capacity of current open-source models to perform physically grounded multimodal reasoning remains in its infancy. Ultimately, this work seeks to systematically stress-test the reasoning capabilities of multimodal models, quantify hallucination rates, and advance the development of Physical AI, thereby providing the community with a robust and transparent evaluation framework toward Artificial General Intelligence (AGI).

2606.07954 2026-06-09 cs.LG cs.AI 新提交

Minibatch Selection via Partition Matroid Constrained Gradient Matching

基于划分拟阵约束梯度匹配的小批量选择

Prayas Agrawal, Prateek Chanda, Ishita Khatri, Ganesh Ramakrishnan, Bamdev Mishra, Pratik Jawanpuria

发表机构 * Indian Institute of Technology Bombay(印度理工学院班加罗尔) Department of Computer Science and Engineering(计算机科学与工程系) Centre for Machine Intelligence and Data Science(机器智能与数据科学中心) Microsoft Research India(微软印度研究院) Microsoft India(微软印度)

AI总结 提出PartitionSel方法,通过划分拟阵约束下的梯度匹配效用最大化,实现跨域小批量选择,减少冗余并提升训练兼容性,在LLM微调中取得鲁棒性提升。

Comments 28 pages, 12 figures, ICML 2026

Journal ref Proceedings of the 43rd International Conference on Machine Learning (ICML 2026), Seoul, South Korea, PMLR 306, 2026

详情
AI中文摘要

在异构数据上训练大型语言模型(LLMs)需要选择能够平衡收敛速度与跨领域覆盖的小批量。现有方法要么在每个领域内独立选择样本,要么依赖计算昂贵的代理模型来学习连续的领域权重。我们提出PartitionSel,一种跨领域小批量选择方法,它在每个领域的预算(编码为划分拟阵约束)下最大化验证引导的梯度匹配效用。通过单一效用耦合每个领域的预算,PartitionSel旨在减少跨领域选择中的冗余。所提出的目标是弱子模的,并允许使用正交匹配追踪算法,具有可证明的近似保证。在实验中,我们在MetaMathQA和Mol-Instructions上对Qwen2.5和Llama-3进行微调时,评估了PartitionSel的小批量选择。PartitionSel在两个基准测试中均比每个领域和领域无关的基线获得了鲁棒的提升。它还减少了每个批次内冲突梯度对的数量,表明跨领域耦合转化为更兼容的训练更新。

英文摘要

Training large language models (LLMs) on heterogeneous data requires selecting minibatches that balance convergence speed with coverage across domains. Existing methods either select samples independently within each domain or rely on computationally expensive proxy models to learn continuous domain weights. We propose PartitionSel, a cross-domain minibatch selection approach that maximizes a validation-guided gradient-matching utility under per-domain budgets encoded as a partition-matroid constraint. By coupling the per-domain budgets through a single utility, PartitionSel is designed to reduce redundancy in selections across domains. The proposed objective is weakly submodular and admits an orthogonal matching pursuit algorithm with provable approximation guarantees. Empirically, we evaluate PartitionSel for minibatch selection during the fine-tuning of Qwen2.5 and Llama-3 on MetaMathQA and Mol-Instructions. PartitionSel achieves robust gains over per-domain and domain-agnostic baselines on both benchmarks. It also reduces the number of conflicting gradient pairs within each batch, indicating that the cross-domain coupling translates into more compatible training updates.

2606.07953 2026-06-09 cs.AI 新提交

Unification of Closed-Open Industrial Detection Scenarios: New Large-Scale Benchmarks,Challenges and Baselines

闭集与开集工业检测场景的统一:新的大规模基准、挑战与基线

Zekai Zhang, Jinglin Zhang, Qinghui Chen, Gang Li, Da Chen, Shuainan Jing, He Wang, Dagang Li, Cong Liu, Cong Bai, Shengyong Chen

发表机构 * Shandong University(山东大学) University Paris Dauphine, PSL Research University, CNRS, UMR 7534(巴黎多芬纳大学,PSL研究大学,法国国家科学研究中心,UMR 7534) Qilu University of Technology(齐鲁工业大学) Zhejiang University of Technology(浙江工业大学) Nova University of Lisbon(里斯本新大学) Macau University of Science and Technology(澳门科技大学) Tianjin University of Technology(天津理工大学)

AI总结 针对工业缺陷检测中数据集稀缺和人工提示依赖问题,提出含百万样本的MMIOC-1M基准和RTVPNet网络,通过专家辅助域投影、能量稀疏采样和双向文本-视觉交互实现最优性能。

详情
AI中文摘要

大规模视觉语言模型(LVLMs)在自然视觉任务中取得了显著成功,但其在工业缺陷检测中的应用仍面临两个基本限制:(i)缺乏覆盖多个领域不同缺陷类别的大规模工业数据集,以及(ii)依赖人工提示(点、框、掩码)引入主观噪声,且缺乏用于细粒度理解的文本-视觉交互。为解决这些挑战,我们引入了一个大规模多模态工业开闭集基准(MMIOC-1M),包含超过一百万个样本,涵盖14个超类、29个工业场景和351个缺陷子类。据我们所知,MMIOC-1M是首个同时支持开放词汇和闭集工业检测的统一最大基准,为工业场景中的LVLMs提供了宝贵的预训练数据。此外,我们提出了一种精炼的文本-视觉提示网络(RTVPNet),包含三个关键创新:(1)专家辅助域投影机制,使通用视觉模型能够快速适应工业领域;(2)基于能量的稀疏采样策略,无需人工干预即可自动生成精炼的视觉提示;(3)双向文本-视觉交互模块,增强跨模态语义对齐和理解。大量实验表明,RTVPNet在MMIOC-1M、LVIS和COCO基准上实现了最先进的性能,同时保持了计算效率。数据集和代码可在https://github.com/hellozzk/MMIO获取。

英文摘要

Large-scale Visual-Language Models (LVLMs) have achieved remarkable success in natural visual tasks, yet their application to industrial defect detection remains challenging due to two fundamental limitations: (i) the scarcity of large-scale industrial datasets that cover diverse defect categories across multiple domains, and (ii) the reliance on manual prompts (points, boxes, masks) that introduce subjective noise and lack text-visual interaction for fine-grained understanding. To address these challenges, we introduce a Large-Scale Multi-Modal Industrial Open-Closed benchmark (MMIOC-1M) containing over one million samples across $14$ super-categories, $29$ industrial scenes, and $351$ defect subcategories. To our knowledge, MMIOC-1M is the first unified largest benchmark supporting both open-vocabulary and closed-set industrial detection, providing valuable pre-training data for LVLMs in industrial scenarios. Furthermore, we propose a Refined Text-Visual Prompt Network (RTVPNet) that incorporates three key innovations: (1) an expert-assisted domain projection mechanism that enables rapid adaptation of general vision models to industrial domains, (2) an energy-based sparse sampling strategy that automatically generates refined visual prompts without manual intervention, and (3) a bidirectional text-visual interaction module that enhances cross-modal semantic alignment and understanding. Extensive experiments demonstrate that RTVPNet achieves state-of-the-art performance on MMIOC-1M, LVIS, and COCO benchmarks while maintaining computational efficiency. The dataset and code are available at https://github.com/hellozzk/MMIO.

2606.07951 2026-06-09 cs.CL cs.AI cs.LG 新提交

From `May' to `Is': Certainty Distortion in Language Model Rewriting

从“可能”到“是”:语言模型改写中的确定性扭曲

Catarina G Belem, Shang Wu, Hongyu Yao, Mark Steyvers, Sameer Singh, Padhraic Smyth

发表机构 * University of California Irvine(加利福尼亚大学尔湾分校) Massachusetts Institute of Technology(麻省理工学院)

AI总结 研究语言模型在改写任务中系统性增加表达确定性的偏差,提出基于人群判断的评估指标,发现高达75%的输出存在确定性扭曲,且模型更倾向于提高确定性。

详情
AI中文摘要

人类越来越多地以塑造信念和驱动决策的方式使用语言模型(LM),包括讨论、改写和总结来自科学文章、新闻和医学报告的信息。然而,在这些领域中,主张表达的信心程度至关重要,但关于LM是否忠实地保留它却知之甚少。在这项工作中,我们研究了LM中的确定性扭曲,定义为当语义内容被保留时,表达确定性的有意义变化。我们提出了一种基于LM的评估指标,该指标与人群层面的确定性判断一致。使用该指标,我们在科学和医学交流任务的背景下,表征了不同规模和系列的模型中的确定性扭曲。我们的结果表明,确定性扭曲影响了高达75%的LM输出,并且在改写任务中系统性地不对称,大多数LM将表达确定性增加的可能性是降低的1.5-2倍。这些效应可以通过重复释义累积:在医学领域,claude-haiku-4-5在一次迭代后增加了20%示例的确定性,五次迭代后增加到40%。基于提示的干预减少了整体确定性扭曲,但并未消除它。总之,这些发现揭示了普遍存在的夸大表达确定性的偏差,对在高风险领域依赖LM的用户有直接影响。

英文摘要

Humans increasingly turn to Language Models (LMs) in ways that shape beliefs and drive decisions, including discussing, rewriting, and summarizing information from scientific articles, news, and medical reports. However, in these domains, where how confidently a claim is expressed matters, little is known about whether LMs faithfully preserve it. In this work, we investigate certainty distortion in LMs, defined as meaningful changes in expressed certainty when semantic content is preserved. We propose an LM-based evaluation metric that is consistent with population-level judgments of certainty. Using this metric, we characterize certainty distortion across different sizes and families of models in the context of scientific and medical communication tasks. Our results show that certainty distortion affects up to 75\% of LM outputs and is systematically asymmetric in rewriting tasks with most LMs being 1.5-2$\times$ more likely to increase the expressed certainty than to decrease it. These effects can compound over repeated paraphrasing: in the medical domain, claude-haiku-4-5 increases certainty of 20\% examples after a single iteration, increasing to 40\% after five iterations. Prompt-based interventions reduce overall certainty distortion but do not eliminate it. Together, these findings reveal a general bias toward inflating expressed certainty, with direct implications for users who rely on LMs in high-stakes domains.

2606.07950 2026-06-09 cs.LG 新提交

The Easy, the Hard, and the Learnable: Confidence and Difficulty-Adaptive Policy Optimization for LLM Reasoning

简单、困难与可学习:面向LLM推理的置信度与难度自适应策略优化

Zhanke Zhou, Xiangyu Lu, Chentao Cao, Brando Miranda, Tongliang Liu, Bo Han, Sanmi Koyejo

发表机构 * TMLR Group, Department of Computer Science, Hong Kong Baptist University(香港 Baptist 大学计算机科学系 TMLR 组) Stanford University(斯坦福大学) Sydney AI Centre, The University of Sydney(悉尼大学人工智能中心)

AI总结 针对GRPO训练中均匀采样导致计算效率低的问题,提出CoDaPO方法,通过置信度和难度自适应重加权与重采样,在固定预算下提升可学习问题的发现,在12个基准上优于现有RL方法。

Comments Published in ICML 2026

详情
AI中文摘要

具有可验证奖励的强化学习可以显著提升LLM的推理能力,然而标准的GRPO风格训练通常通过均匀采样和加权同等对待简单、困难和可学习的问题,导致计算分配效率低下。我们通过跟踪token对数概率、组归一化优势以及由此产生的token级更新权重来研究GRPO。这揭示了随着训练进行出现的三种重复动态:(1) 置信度膨胀,(2) 优势收缩,以及(3) 层次收敛。这些发现表明,每次更新的效用很大程度上取决于问题难度和模型当前的能力。受此启发,我们提出了置信度与难度自适应策略优化(CoDaPO),该方法根据展开置信度和经验难度为每个问题分配一个有界值。CoDaPO随后使用该值对策略更新进行重新加权,并在小批量内重新采样高价值的可学习问题,从而在固定计算预算下增加可学习带内的发现。在十二个基准测试中,CoDaPO在准确性上持续优于现有的RL方法。我们的代码公开在 https://github.com/tmlr-group/CoDaPO。

英文摘要

RL with verifiable rewards can substantially improve LLM reasoning, yet standard GRPO-style training often treats easy, hard, and learnable questions alike through uniform sampling and weighting, leading to inefficient compute allocation. We study GRPO by tracking token log-probabilities, group-normalized advantages, and the induced token-level update weights. This reveals three recurring dynamics as training proceeds: (1) confidence inflation, (2) advantage contraction, and (3) hierarchical convergence. These findings suggest that the utility of each update depends strongly on both question difficulty and the model's current competence. Motivated by this, we propose Confidence and Difficulty-adaptive Policy Optimization (CoDaPO), which assigns each question a bounded value from rollout confidence and empirical difficulty. CoDaPO then uses this value to reweight policy updates and resample high-value learnable questions within mini-batches, thereby increasing discovery within the learnable band under a fixed compute budget. Across twelve benchmarks, CoDaPO consistently improves accuracy over existing RL methods. Our code is publicly available at https://github.com/tmlr-group/CoDaPO.

2606.07938 2026-06-09 cs.CV cs.MM eess.IV 新提交

DAL-PCQA: Enabling Distortion-Level and Language-Driven Reasoning for Point Cloud Quality Assessment

DAL-PCQA:实现点云质量评估的失真级别与语言驱动推理

Swarna Chakraborty, Gabriel De Castro Araújo, Syeda Tasmi Faria, Marcelo M. Carvalho, Mylene C. Q. Farias

发表机构 * University of Brasília(巴西利亚大学)

AI总结 提出DAL-PCQA数据集,通过多级失真标签、质量类别和自然语言描述,结合零样本与微调多模态模型,实现可解释的点云质量评估。

Comments Accepted at Qomex 2026

详情
AI中文摘要

点云质量评估(PCQA)方法通常预测标量平均意见分数(MOS),量化整体感知退化但不揭示其原因。相比之下,人类观察者自然地以特定失真(如模糊、颜色偏移、点密度变化、缺失区域和几何变形)进行推理。为弥合这一差距,我们引入了DAL-PCQA,一个用于PCQA的失真感知、语言标注数据集。DAL-PCQA用多级失真严重性标签、离散质量类别和与人类感知对齐的结构化自然语言描述增强了基准点云。我们定义了一个涵盖光度学和几何学伪影的点云特定失真分类法。统计分析揭示了不同失真类型和质量级别的特征退化模式。为评估这些标注的实用性,我们比较了用于生成感知质量描述的零样本和微调多模态模型。实验表明,失真感知监督显著提高了与真实描述的词法和语义对齐。通过实现可解释的失真级别推理,DAL-PCQA促进了语言驱动的、可解释的点云质量评估。该数据集公开于https://github.com/swarna96/DAL-PCQA。

英文摘要

Point Cloud Quality Assessment (PCQA) methods typically predict scalar Mean Opinion Scores (MOS), which quantify overall perceptual degradation but do not reveal its causes. In contrast, human observers naturally reason in terms of specific distortions such as blur, color shifts, point density changes, missing regions, and geometric deformations. To close this gap, we introduce DAL-PCQA, a distortion-aware, language-annotated dataset for PCQA. DAL-PCQA augments benchmark point clouds with multi-level distortion severity labels, discrete quality categories, and structured natural language descriptions aligned with human perception. We define a point-cloud-specific distortion taxonomy that covers both photometric and geometric artifacts. Statistical analysis reveals characteristic degradation patterns across distortion types and quality levels. To assess the utility of these annotations, we compare zero-shot and fine-tuned multimodal models for generating perceptual quality descriptions. Experiments show that distortion-aware supervision substantially improves lexical and semantic alignment with ground-truth descriptions. By enabling interpretable, distortion-level reasoning, DAL-PCQA facilitates language-driven, explainable point cloud quality assessment. The dataset is publicly available at https://github.com/swarna96/DAL-PCQA.

2606.07935 2026-06-09 cs.CV 新提交

REACT 2026: The Fourth Multiple Appropriate Facial Reaction Generation Challenge: Personalised MAFRG and Appropriate EEG Reaction Prediction

REACT 2026:第四届多适切面部反应生成挑战赛:个性化MAFRG与适切脑电反应预测

Siyang Song, Micol Spitale, Zijian Wu, Xiangyu Kong, Cheng Luo, Cristina Palmero, German Barquero, Sergio Escalera, Michel Valstar, Mohamed Daoudi, Fabien Ringeval, Andrew Howes, Elisabeth Andre, Hatice Gunes

发表机构 * University of Exeter(埃克塞特大学) Politecnico di Milano(米兰理工大学) Nanjing University of Science and Technology(南京理工大学) King Abdullah University of Science and Technology(阿卜杜拉国王科技大学) King's College London(伦敦国王学院) Universitat de Barcelona(巴塞罗那大学) University of Nottingham(诺丁汉大学) IMT Nord Europe(IMT 北欧欧洲) Université Grenoble Alpes(格勒诺布尔-阿尔卑斯大学) University of Augsburg(奥格斯堡大学) University of Cambridge(剑桥大学)

AI总结 提出REACT 2026挑战赛,鼓励开发机器学习模型,用于生成个性化、适切、多样、真实且同步的人类面部反应,并引入个性标签和脑电记录,探索新的一对多个性化面部反应生成设置。

Comments arXiv admin note: text overlap with arXiv:2505.17223

详情
AI中文摘要

在二元交互中,多种人类面部反应可能适合回应每个说话者行为。继REACT 2023、2024和2025挑战系列成功举办后,针对多适切面部反应生成(MAFRG)问题,已开发了一系列生成式深度学习模型。今年,我们提出REACT 2026挑战赛,鼓励开发和基准测试机器学习(ML)模型,这些模型能够生成由特定人类倾听者表达的多个个性化、适切、多样、真实且同步的人类风格面部反应,以回应每个给定的说话者行为。作为挑战的关键,我们持续向挑战参与者提供REACT 2025引入的MARS数据集,并额外提供个体层面的五大人格标签和脑电记录。这引入了一种新的结合人类表达行为、情感和神经生理信号的一对多个性化面部反应生成设置,这在当前的二元交互建模中仍很大程度上未被探索。本文还介绍了挑战指南和四个提议的子挑战的新基线:离线通用和个性化MAFRG,以及在线通用和个性化MAFRG,这些基线公开于https://github.com/reactmultimodalchallenge/baseline_react2026。

英文摘要

In dyadic interactions, various human facial reactions could be appropriate for responding to each human speaker behaviour. Following the successful organisation of the REACT 2023, 2024 and 2025 challenge series, a body of generative deep learning (DL) models have been developed for the problem of multiple appropriate facial reaction generation (MAFRG). This year, we propose the REACT 2026 challenge encouraging the development and benchmarking of Machine Learning (ML) models that can generate multiple personalised, appropriate, diverse, realistic and synchronised human-style facial reactions expressed by a specific human listener for responding to each given speaker behaviour. As a key of the challenge, we continuously provide challenge participants with MARS dataset introduced by REACT 2025 but additionally provide individual-level Big-Five personality labels and EEG recordings. This introduces a new one-to-many personalised facial reaction generation setting combining human expressive behavioural, affective and neurophysiological signals, which remains largely unexplored in current dyadic interaction modelling. This paper also presents the challenge guidelines and new baselines on the four proposed sub-challenges: Offline generic and personalised MAFRG as well as Online generic and personalised MAFRG, respectively, which are publicly available at https://github.com/reactmultimodalchallenge/baseline_react2026.

2606.07934 2026-06-09 cs.RO 新提交

X-OP: Cross-Morphology Whole-Body Teleoperation via MPC Retargeting

X-OP: 基于MPC重定向的跨形态全身遥操作

Jen-Wei Wang, Sarthak Kaingade, Andrea Tagliabue, Nicholas Morozovsky

发表机构 * Amazon(亚马逊) University of California, Berkeley(加州大学伯克利分校)

AI总结 提出一种基于单个XR设备的层次化全身遥操作框架,通过MPC重定向联合优化操作者意图与机器人动态可行性,无需针对特定机器人重新训练策略,在仿真和实物实验中显著提升任务成功率。

Comments 9 pages, 4 figures

详情
AI中文摘要

全身遥操作对于在移动操作任务中实现可扩展的机器人数据收集至关重要,然而现有依赖外骨骼套装或多摄像头设置的方法带来了高昂的成本、复杂性和环境限制。最近使用单一扩展现实(XR)设备结合端到端强化学习策略的方法部分解决了这些限制,但需要针对特定机器人重新训练,遭受分布外故障,并且依赖于忽略动态可行性的运动重定向。我们提出了一种由单个XR设备驱动的层次化全身遥操作框架,该框架无需重新训练特定机器人的策略即可泛化到不同机器人形态。基于模型预测控制(MPC)的运动重定向器联合优化与操作者意图的对齐以及机器人的动态可行性,为现有的低级控制器生成最优命令。为了确保鲁棒的在线执行,我们引入了一种状态同步方法,在每个MPC步骤重置模拟器状态以处理嘈杂的真实世界测量和接触敏感性,并集成基于SLAM的全局位姿反馈以减轻长期漂移。仿真结果显示,在全身控制任务中,与基线相比,人形机器人(完成时间降低30%以上,功耗降低20%)和移动操作臂(零碰撞)均取得了更高的成功率。真实世界实验进一步验证了我们方法的有效性和灵活性,展示了所提出的重定向器在两个平台上成功部署于全身控制任务,并允许用户根据偏好轻松调整遥操作行为。该即插即用框架为全身机器人遥操作提供了一种可扩展、形态无关的解决方案,实现了实时行为定制和跨平台的广泛适用性。

英文摘要

Whole-body teleoperation is essential for scalable robot data collection in loco-manipulation tasks, yet existing approaches relying on exoskeleton suits or multi-camera setups impose prohibitive cost, complexity, and environmental constraints. Recent methods using a single extended reality (XR) device with end-to-end reinforcement learning policies partially address these limitations but require robot-specific retraining, suffer from out-of-distribution failures, and rely on motion retargeting that neglects dynamic feasibility. We propose a hierarchical whole-body teleoperation framework driven by a single XR device that generalizes across diverse robot morphologies without retraining robot-specific policies. A Model Predictive Control (MPC)-based motion retargeter jointly optimizes alignment with the operator's intent and the robot's dynamic feasibility, generating optimal commands for existing low-level controllers. To ensure robust online execution, we introduce a state synchronization method that resets the simulator state at each MPC step to handle noisy real-world measurements and contact sensitivity, and integrate SLAM-based global pose feedback to mitigate long-term drift. Simulation results show higher success rates on whole-body control tasks for both a humanoid (over 30% lower completion time and 20% lower power consumption) and a mobile manipulator (zero collisions) compared to baselines. Real-world experiments further validate the effectiveness and flexibility of our method, demonstrating the successful deployment of the proposed retargeter on both platforms for whole-body control tasks and the ease of allowing users to adjust teleoperation behavior based on their preferences. This plug-and-play framework offers a scalable, morphology-agnostic solution for whole-body robot teleoperation, enabling real-time behavioral customization and broad applicability across platforms.

2606.07932 2026-06-09 cs.CV cs.GR cs.MM eess.IV math.OC 新提交

LEGS: Laplacian-Enhanced Gaussian Splatting with a Nonlinear Weighted Loss

LEGS: 拉普拉斯增强的高斯泼溅与非线性加权损失

Yongfei Guo, Qizhou Huo, Xuan Sun, Yuanhao Gong

发表机构 * Changchun Institute of Optics, Fine Mechanics and Physics, Chinese Academy of Sciences(中国科学院长春光学精密机械与物理研究所) University of Chinese Academy of Sciences(中国科学院大学)

AI总结 提出LEGS方法,利用二阶拉普拉斯结构引导和非线性权重函数改进高斯泼溅的损失函数,在保持渲染管线不变的情况下提升结构感知优化,在Tanks&Temples和Mip-NeRF360数据集上PSNR最高提升1.68 dB。

详情
AI中文摘要

3D高斯泼溅(3DGS)已成为辐射场重建和实时新视角合成的高效显式表示方法。然而,其标准光度损失对平坦区域和结构丰富区域处理相似,这可能限制尖锐轮廓和精细细节的恢复。边缘引导高斯泼溅(EGGS)通过边缘引导加权提高了结构感知能力,但主要依赖一阶梯度响应和线性加权。本文提出LEGS,一种具有非线性加权损失的拉普拉斯增强高斯泼溅方法。LEGS用二阶拉普拉斯结构引导取代一阶梯度引导,并通过非线性响应-权重函数将归一化拉普拉斯响应映射为逐像素权重。所提出的损失改进了结构感知的高斯优化,同时保持原始3DGS渲染管线不变。在完整Tanks&Temples和Mip-NeRF360数据集上的实验表明,LEGS相比3DGS的峰值信噪比(PSNR)最高提升1.68 dB,相比EGGS最高提升0.52 dB。将所提出的二阶非线性加权策略集成到FastGS和FasterGS中,PSNR进一步提升最高1.69 dB,证明了其作为高斯泼溅管线的通用损失级扩展的有效性,在AR/VR、沉浸式可视化和实时3D内容生成中具有潜在应用。

英文摘要

3D Gaussian Splatting (3DGS) has become an efficient explicit representation for radiance field reconstruction and real-time novel view synthesis. However, its standard photometric loss treats flat and structure-rich regions similarly, which may limit the recovery of sharp contours and fine details. Edge-Guided Gaussian Splatting (EGGS) improves structure awareness through edge-guided weighting, but mainly relies on first-order gradient responses and linear weighting. In this paper, we propose LEGS, a Laplacian-Enhanced Gaussian Splatting method with a nonlinearly weighted loss. LEGS replaces first-order gradient guidance with second-order Laplacian structural guidance and maps the normalized Laplacian response into pixel-wise weights through nonlinear response-to-weight functions. The proposed loss improves structure-aware Gaussian optimization while keeping the original 3DGS rendering pipeline unchanged. Experiments on the full Tanks\&Temples and Mip-NeRF360 datasets show that LEGS improves peak signal-to-noise ratio (PSNR) by up to 1.68 dB over 3DGS and up to 0.52 dB over EGGS. Incorporating the proposed second-order nonlinear weighting strategy into FastGS and FasterGS further improves PSNR by up to 1.69 dB, demonstrating its effectiveness as a general loss-level extension for Gaussian Splatting pipelines with potential applications in AR/VR, immersive visualization, and real-time 3D content generation.

2606.07929 2026-06-09 cs.AI 新提交

Stress-testing medical large language models reveals latent safety pathology beyond benchmark accuracy

压力测试医学大语言模型揭示基准准确性之外的潜在安全病理

Yuan Shen, Xiaojun Wu, Linghua Yu

发表机构 * College of Computer Science and Technology, Zhejiang University, PR China(浙江大学计算机科学与技术学院,中国)

AI总结 提出AI-MASLD压力审计框架,通过240个临床病例和六种叙事扰动探针对七种模型进行双重压力测试,发现量化模型存在伪正常化,医学监督微调损害逻辑稳定性和公平性,开源模型在安全维度上达到或超越闭源模型。

Comments 34 pages, 5 figures

详情
AI中文摘要

大语言模型(LLMs)正基于可能无法检测到安全相关失效模式的基准准确性进入临床实践。本文提出AI-MASLD,一个压力审计框架,它将肝病学中的代谢压力测试逻辑应用于临床LLMs的评估。使用240个跨六种叙事扰动探针的临床病例,我们对七个模型进行了双重压力测试,并通过三个指标量化性能:代谢指数(MI)、扰动翻转率(PFR)和反事实公平指数(CFI)。在干净的基线条件下,所有模型表现一致良好。在现实叙事压力下,性能急剧分化,揭示了两种不同的应激反应表型。量化模型表现出伪正常化,其中低翻转率掩盖了功能崩溃。医学监督微调系统地降低了逻辑稳定性、公平性和信息提取能力。一个开源模型在每一个安全维度上达到或超过了专有替代方案。这些发现确立了叙事压力审计作为基于准确性评估的必要补充。

英文摘要

Large language models (LLMs) are entering clinical practice based on benchmark accuracy that may fail to detect safety-relevant failure modes. Here we present AI-MASLD, a stress-audit framework that adapts the logic of metabolic stress testing from hepatology to the evaluation of clinical LLMs. Using 240 clinical cases across six narrative perturbation probes, we subjected seven models to double-stress testing and quantified performance through three indices: metabolic index (MI), perturbation flip rate (PFR), and counterfactual fairness index (CFI). Under clean baseline conditions, all models performed uniformly well. Under realistic narrative stress, performance diverged sharply, revealing two distinct stress-response phenotypes. Quantized models exhibited pseudonormalization, in which low flip rates hid functional collapse. Medical supervised fine-tuning systematically degraded logical stability, fairness, and information extraction. An open-weight model matched or exceeded proprietary alternatives on every safety dimension. These findings establish narrative stress auditing as a necessary complement to accuracy-based evaluation.

2606.07925 2026-06-09 cs.CL 新提交

ROSUM-MCTS: Monte Carlo Tree Search-Inspired HDL Code Summarization with Structural Rewards

ROSUM-MCTS:基于蒙特卡洛树搜索的HDL代码摘要生成与结构奖励

Prashanth Vijayaraghavan, Charles Mackin, Luyao Shi, Apoorva Nitsure, Ashutosh Jadhav, David Beymer, Tyler Baldwin, Ehsan Degan, Vandana Mukherjee

发表机构 * University of California, Berkeley(加州大学伯克利分校) Stanford University(斯坦福大学)

AI总结 提出ROSUM-MCTS方法,利用蒙特卡洛树搜索引导大语言模型,通过分层候选扩展和复合奖励函数优化硬件描述语言代码摘要,在VHDL和Verilog数据集上优于基线方法。

Comments 7 pages

Journal ref ICLAD'2025

详情
AI中文摘要

大型语言模型(LLMs)在代码摘要方面显示出潜力,但其对硬件描述语言(HDL)如VHDL和Verilog的有效性尚未充分探索。我们提出ROSUM-MCTS,一种受蒙特卡洛树搜索(MCTS)启发的LLM引导方法,通过结构化探索和强化驱动优化来改进摘要。我们的方法通过分层候选扩展机制整合局部和全局上下文,并使用复合奖励函数优化摘要,该函数平衡功能正确性(FC)、局部内容充分性(LCA)和流畅性。我们在VHDL-eval和Verilog-eval数据集上评估ROSUM-MCTS,证明其通过利用结构化自底向上细化和基于强化的优化,持续优于基线方法。消融研究证实了局部和全局扩展策略的必要性,以及平衡FC和LCA以获得最佳性能的重要性。此外,ROSUM-MCTS对表面修改(如变量重命名)具有鲁棒性,在基线性能下降时仍能保持摘要质量。这些结果确立了ROSUM-MCTS作为有效且鲁棒的HDL摘要框架,为进一步研究强化增强的代码摘要铺平了道路。

英文摘要

Large language models (LLMs) have shown promise in code summarization, yet their effectiveness for Hardware Description Languages (HDLs) like VHDL and Verilog remains underexplored. We propose ROSUM-MCTS, an LLM-guided approach inspired by Monte Carlo Tree Search (MCTS) that refines summaries through structured exploration and reinforcement-driven optimization. Our method integrates both local and global context via a hierarchical candidate expansion mechanism and optimizes summaries using a composite reward function balancing functional correctness (FC), local content adequacy (LCA), and fluency. We evaluate ROSUM-MCTS on the VHDL-eval and Verilog-eval datasets, demonstrating its consistent outperformance over baseline methods by leveraging structured bottom-up refinement and reinforcement-based optimization. Ablation studies confirm the necessity of both local and global expansion strategies, as well as the importance of balancing FC and LCA for optimal performance. Furthermore, ROSUM-MCTS proves robust against superficial modifications, such as variable renaming, maintaining summary quality where baselines degrade. These results establish ROSUM-MCTS as an effective and robust HDL summarization framework, paving the way for further research into reinforcement-enhanced code summarization.

2606.07924 2026-06-09 cs.CV cs.AI cs.CL cs.LG cs.MM 新提交

Decoupling Semantics and Logic: A Training-Free Coarse-to-Fine Pipeline for Video Retrieval-Augmented Generation

解耦语义与逻辑:一种无需训练的从粗到精的视频检索增强生成流水线

Jiaxin Dai, Zehang Wei, Jiamin Yan, Xiang Xiang

发表机构 * School of Computer Science & Tech, Huazhong University of Science and Technology(华中科技大学计算机科学与技术学院) School of AI and Automation, Huazhong University of Science and Technology(华中科技大学人工智能与自动化学院)

AI总结 提出一种无需训练的两阶段级联视频RAG流水线,通过解耦语义检索与逻辑推理,实现跨语言长视频理解、严格角色遵循和零幻觉时间定位。

Comments To be presented at ACL 2026 MAGMAR Workshop (Oral; Retrieval leaderboard No.1)

详情
AI中文摘要

本文介绍了我们为第二届多模态增强生成研讨会(MAGMaR)提交的系统描述。针对跨语言长视频理解、严格角色遵循和零幻觉时间定位等关键挑战,我们提出了一种完全无需训练的两阶段级联视频RAG流水线。我们的架构通过模态感知的任务分工,策略性地将语义检索与认知逻辑推理解耦。在第一阶段,一个高召回率的语义预取模块仅使用高保真视觉摘要和全局文本描述进行密集检索,明确隔离噪声模态(如OCR和ASR)以保持纯净的向量空间。在第二阶段,一个由商业大语言模型(LLM)驱动的自适应、迭代和推理(A.I.R.)过滤代理执行细粒度认知重排序。该代理重新整合完整的多模态上下文,以强制执行与用户角色的严格逻辑对齐,有效剪除语义相似但逻辑无关的候选。最后,提示雕刻机制约束生成器将蒸馏后的子集合成为严格格式化的JSON响应,并带有精确的块级引用。在RAG轨道上的评估表明,我们的资源感知方法在信息检索和角色条件生成方面均表现出卓越的精度。

英文摘要

This paper presents our system description for the 2nd Workshop on Multimodal Augmented Generation via MultimodAl Retrieval (MAGMaR). Addressing the critical challenges of cross-lingual long-video comprehension, strict persona adherence, and zero-hallucination temporal grounding, we propose a fully training-free, two-stage cascaded Video RAG pipeline. Our architecture strategically decouples semantic retrieval from cognitive logical reasoning through a modality-aware division of labor. In the first stage, a high-recall semantic pre-fetching module employs dense retrieval using only high-fidelity visual summaries and global text descriptions, explicitly isolating noisy modalities (e.g., OCR and ASR) to maintain a pristine vector space. In the second stage, an Adaptive, Iterative, and Reasoning-based (A.I.R.) filtering agent, powered by a commercial Large Language Model (LLM), performs fine-grained cognitive reranking. The agent re-incorporates full multimodal contexts to enforce strict logical alignment with user personas, effectively pruning semantically similar but logically irrelevant candidates. Finally, a Prompt Sculpting mechanism constrains the generator to synthesize the distilled subset into strictly formatted JSON responses with exact chunk-level citations. Evaluated on the RAG track, our resource-aware approach shows exceptional precision in both information retrieval and persona-conditioned generation.

2606.07916 2026-06-09 cs.AI 新提交

The CIFAR Synthetic Evidence Corpus for Detecting AI-Generated Evidence

CIFAR合成证据语料库:用于检测AI生成证据

Kelly McConvey, Jalehsadat Mahdavimoghaddam, Nima Jamali, Maksym Taranukhin, Sajad Ebrahimi, Wentao Zhang, Yuntian Deng, Karen Eltis, Maura R. Grossman, Vered Shwartz, Ebrahim Bagheri

发表机构 * University of Toronto(多伦多大学) University of Waterloo(滑铁卢大学) University of British Columbia(不列颠哥伦比亚大学) Vector Institute(向量研究所) University of Ottawa(渥太华大学)

AI总结 针对司法系统中证据真实性检测缺乏合适数据集的问题,构建了包含多种文档类型和篡改策略的CIFAR合成证据语料库,支持在受控条件下评估证据验证系统。

详情
AI中文摘要

生成模型生成逼真文档的能力日益增强,这对司法系统和法院中的证据工作流程构成了直接挑战,因为决策越来越依赖于收据、通信和行政记录等证据的真实性。与社交媒体或学术环境不同,证据性文档通常仅被微妙地修改,通过局部编辑保持整体合理性,同时改变法律含义。然而,自动检测的进展仍然有限,主要原因是缺乏适合司法系统要求的训练和评估数据。现有资源要么专注于人脸照片或自然风景,要么局限于狭窄的学术或社交媒体文档类型,未能捕捉真实世界证据数据的结构、多样性或篡改模式。因此,当前的检测系统不一定能学习到适合司法系统的有意义信号。我们引入了CIFAR合成证据语料库,这是一个旨在在现实和受控条件下严格评估证据验证的数据集。该语料库涵盖多个文档家族和一系列篡改策略,从小规模字段级编辑到完整文档伪造,并使用多种最先进的生成工具构建。其组织方式系统性地变化篡改复杂性和生成方法,同时在训练和测试数据之间强制源级分离,以反映现实世界的泛化挑战。

英文摘要

The growing ability of generative models to produce realistic documents poses a direct challenge to evidentiary workflows in the justice system and the courts, where decisions increasingly depend on the authenticity of evidence such as receipts, communications, and administrative records. Unlike social media or academic settings, evidentiary documents are often only subtly altered, with small, localized edits that preserve overall plausibility while changing legal meaning. Yet progress on automated detection remains limited, largely due to the absence of suitable training and evaluation data especially suited for the justice system requirements. Existing resources are either focused on photos of human faces or natural scenery or on narrowly scoped academic or social media document types, and do not capture the structure, diversity, or manipulation patterns characteristic of real-world evidentiary data. As a result, current detection systems do not necessarily learn meaningful signals appropriate for the justice system. We introduce the CIFAR Synthetic Evidence Corpus, a dataset designed to enable rigorous evaluation of evidence verification under realistic and controlled conditions. The corpus spans multiple document families and a spectrum of manipulation strategies, from small field-level edits to complete document fabrication, and is constructed using a diverse set of state-of-the-art generative tools. It is organized to systematically vary both manipulation complexity and generation method, while enforcing source-level separation between training and test data to reflect real-world generalization challenges.

2606.07915 2026-06-09 cs.AI 新提交

EditSR: Enhancing Neural Symbolic Regression via Edit-based Rectification

EditSR: 通过基于编辑的修正增强神经符号回归

Da Li, Xinxin Li, Xingyu Cui, Jin Xu, Juan Zhang, Junping Yin

发表机构 * Northeast Normal University(东北师范大学) East China Normal University(华东师范大学) Shenzhen Institute of Advanced Technology(深圳先进技术研究院) Graduate School of China Academy of Engineering Physics(中国工程物理研究院研究生院) Beihang University(北京航空航天大学) Institute of Applied Physics and Computational Mathematics(北京应用物理与计算数学研究所)

AI总结 提出EditSR双层框架,第一层神经符号回归模型生成表达式,第二层基于编辑的修正器通过预训练的状态转移链逐步修正错误,避免全局搜索重启,有效减少误差累积,提升复杂表达式生成的结构正确性。

详情
AI中文摘要

神经符号回归模型通过将结构搜索转移到预训练来提高推理效率,但其一次性自回归解码容易产生误差累积,可能导致生成结构不正确的表达式,尤其是在复杂表达式生成场景中。现有的修正策略可以缓解这一问题,但它们通常依赖于重新启动全局搜索,从而削弱了神经模型的效率优势,并且仍然容易受到误差累积的影响。在本文中,我们提出了EditSR,一个双层框架,第一层结合神经符号回归模型,第二层结合基于编辑的修正器,以实现高效预测和事后修正。我们不重新启动全局搜索,而是通过预训练修正器来保持修正效率。具体来说,我们将修正过程形式化为从错误表达式开始的逐步状态转移链,并开发了一种状态转移算法来构建用于训练修正器的监督修正链。为了确保修正过程中的语法有效性,每个编辑操作都被限制在语法有效的空间内,使得每个编辑后的表达式仍然可解析。此外,由于每个编辑决策基于当前状态而非历史,修正器允许后续编辑修正早期步骤中的错误,从而降低误差累积的风险。大量实验和消融研究表明,EditSR以有限的额外成本显著提高了符号结构恢复能力,在复杂表达式上收益更明显,因为一次性自回归解码更容易受到误差累积的影响。

英文摘要

Neural symbolic regression models improve inference efficiency by shifting structural search to pretraining, but their one-pass autoregressive decoding is prone to error accumulation, which may lead to generating structurally incorrect expressions, especially in complex expression generation scenarios. Existing rectification strategies can alleviate this issue, but they often depend on restarting global search, thereby weakening the efficiency advantage of neural models, and remain susceptible to error accumulation. In this paper, we propose EditSR, a two-layer framework that combines a neural symbolic regression model in the first layer with an edit-based Rectifier in the second layer to achieve efficient prediction and post-hoc rectification. Instead of restarting the global search, we maintain rectification efficiency by pretraining the Rectifier. Specifically, we formulate the rectification process as a step-by-step state-transition chain starting from an incorrect expression, and develop a state-transition algorithm to construct supervised rectification chains for training the Rectifier. To ensure syntactic validity throughout rectification, each edit action is restricted to a syntactically valid space so that every edited expression remains parseable. In addition, because each edit decision is conditioned on the current state rather than the history, the Rectifier allows errors made in earlier steps to be rectified by subsequent edits, thereby reducing the risk of error accumulation. Extensive experiments and ablation studies show that EditSR substantially improves symbolic structure recovery with limited extra cost, with more pronounced gains on complex expressions, where one-pass autoregressive decoding is more susceptible to error accumulation.

2606.07910 2026-06-09 cs.LG 新提交

CAAL: Contextual Bandits based Online Hand-Craft Active Learning Strategy Selection

CAAL: 基于上下文赌博机的在线手工主动学习策略选择

Shao-An Yin, Jiacong Li, Tianpei Xie, Cecile Levasseur, Wojciech Kowalinski, Nicola Elia

发表机构 * University of Minnesota, Twin Cities(明尼苏达大学双城分校) Amazon(亚马逊)

AI总结 提出CAAL框架,利用上下文信息和奖励预测动态选择主动学习策略,在公共数据集上优于现有基线方法。

Comments 8 pages, 5 figures, Accepted to the NYRL 2025 Workshop

详情
AI中文摘要

主动学习算法面临的挑战是未标注数据统计分布的不确定性,这使得难以选择最佳的手工策略。为了解决这个问题,我们引入了上下文自适应主动学习(CAAL)。在CAAL中,每个“臂”代表一个手工策略。与仅基于标注数据反馈选择策略的现有框架不同,我们利用外部上下文信息的奖励预测动态选择用于标注数据批次的策略。这个通用框架允许通过领域知识进行定制,以设计更有效的奖励和上下文候选。此外,我们通过实验表明,使用我们的奖励和上下文设计,CAAL在公共数据集上优于现有的基线自适应策略。无论每次迭代的批次大小如何,我们的结果都是一致的。

英文摘要

The challenge with active learning algorithms is the uncertainty of the statistical distribution of unlabeled data, making it difficult to choose the best hand-crafted strategy. To address this, we introduced Contextual Adaptive Active Learning (CAAL). In CAAL, each "arm" represents a hand-crafted strategy. Unlike existing frameworks that select strategies based only on feedback from labeled data, we dynamically choose strategies for labeling batches of data using reward prediction with external context information. This general framework allows for customization with domain knowledge to design more effective rewards and context candidates. In addition, we experimentally show that CAAL outperforms the existing baseline adaptive strategy on public datasets using our reward and context design. Our results are consistent regardless of batch size in each iteration.

2606.07908 2026-06-09 cs.LG 新提交

Layer-wise Derivative Controlled Networks Achieve Competitive Accuracy and Gradient Stability Across Data Regimes

逐层导数控制网络在不同数据体制下实现竞争性准确性和梯度稳定性

Rowan Martnishn

发表机构 * Rowan Martnishn

AI总结 基于ChainzRule的导数控制网络通过逐层雅可比惩罚,在表格和NLP任务中实现低数据高性能,梯度尾比作为泛化诊断指标。

详情
AI中文摘要

基于ChainzRule(CR)的导数控制网络结合了三次多项式层与轻量级前向逐层雅可比惩罚(DREG)。在本多部分系列的第二篇论文中,我们评估了CR在不同数据体制下的泛化特性。我们消融了DREG系数调度的形状,证明最优退火范围取决于表示噪声。在Pima糖尿病数据集上,CR在低数据下表现强劲,并在5%至100%训练数据范围内保持相对于基线的持续准确率优势,这得益于异常稳定的梯度尾比(约1.01-1.02,而ReLU网络为1.07-1.09)。扩展到SST-5,在冻结嵌入和BERT微调体制下均取得有竞争力或更优的结果,包括在训练数据显著减少的情况下超越先前的BERT基线。这些结果具有统计显著性:CR在我们能识别的最强已发表基线上,在两个数据集上均取得了更优的准确率(p < 0.05)。这些结果表明,逐层导数控制引入了一种偏向低频、稳定表示的结构性归纳偏置,该偏置在表格和NLP领域、数据量和表示质量上均能稳健泛化。梯度尾比可作为泛化能力的可靠、无标签诊断指标。

英文摘要

Derivative-controlled networks based on ChainzRule (CR) combine cubic polynomial layers with a lightweight forward-mode per-layer Jacobian penalty (DREG). In this second paper of a multi-part series, we evaluate the generalization properties of CR across data regimes. We ablate the shape of the DREG coefficient schedule, demonstrating that the optimal annealing range depends on representation noise. On the Pima Diabetes dataset, CR achieves strong low-data performance and maintains a consistent accuracy advantage over baselines from 5\% to 100\% training data, supported by exceptionally stable gradient tail ratios ($\sim$1.01--1.02 vs. 1.07--1.09 for ReLU networks). Extensions to SST-5 show competitive or superior results in both frozen-embedding and BERT fine-tuned regimes, including outperforming prior BERT baselines despite substantially less training data. These results are statistically significant: CR achieves superior accuracy over the strongest published baselines we could identify on both datasets ($p < 0.05$). These results establish that layer-wise derivative control induces a structural inductive bias toward low-frequency, stable representations that generalizes robustly across tabular and NLP domains, data volumes, and representation qualities. The gradient tail ratio serves as a reliable, label-free diagnostic of generalization capability.

2606.07907 2026-06-09 cs.CV cs.AI 新提交

3D Oral Modelling with Improved Vertex Distribution Using Matching-Based Learning

基于匹配学习的改进顶点分布的3D口腔建模

Jihun Cho, Soo-Yeon Jeong, Eun-Jeong Bae, Sun-Young Ihm

发表机构 * st Jihun Cho(第一作者) nd Soo-Yeon Jeong(第二作者) rd Eun-Jeong Bae(第三作者) th Sun-Young Ihm(第四作者)

AI总结 针对3D口腔重建中预测顶点分布不均的问题,提出结合匈牙利匹配过滤与排斥损失的改进损失函数,使顶点分布更均匀,虽精度略降但有效缓解了聚集现象。

Comments 5 pages, 7 figures. English version of a paper presented at the Korea Multimedia Society Conference, November 2025

详情
AI中文摘要

在我们之前的工作中,提出了一个基于深度学习的3D口内重建框架。该模型直接从十张固定角度的口内图像预测显式3D点云坐标,采用MobileNetV2和多头注意力进行多视图特征融合,并使用L1损失和倒角距离的组合作为损失函数。尽管模型达到了77.49%的准确率,但预测顶点倾向于集中在真实值的高密度区域,而其他区域大部分未被覆盖。\n在本文中,提出了一种改进的损失函数来解决这一局限性。引入了带过滤的匈牙利匹配和排斥损失,以强制重建模型上的顶点分布更加均匀。所提出的模型达到了68.02%的准确率,数值上低于之前的模型。然而,先前工作中观察到的顶点聚集问题得到了显著缓解,预测顶点在整个重建表面上分布更加均匀。

英文摘要

In our previous work, a deep learning-based framework for 3D intraoral reconstruction was proposed. The model directly predicts explicit 3D point cloud coordinates from ten fixed-angle intraoral images, employing MobileNetV2 and Multi-head Attention for multi-view feature fusion, with a combined L1 Loss and Chamfer Distance as the loss function. Although the model achieved an accuracy of 77.49%, predicted vertices tended to concentrate in high-density regions of the ground truth, leaving other regions largely uncovered. In this paper, an improved loss function is proposed to address this limitation. Hungarian matching with filtering and Repulsion Loss are introduced to enforce more uniform vertex distribution across the reconstructed model. The proposed model achieves an accuracy of 68.02%, which is numerically lower than the previous model. However, the vertex clustering issue observed in the prior work is substantially alleviated, with predicted vertices distributed more evenly across the entire reconstructed surface.

2606.07902 2026-06-09 cs.RO 新提交

End-to-End Control of a Powered Knee-Ankle Prosthesis Towards Unified, Tuning-Free Assistance

动力膝踝假肢的端到端控制:迈向统一、免调参的辅助

John Shim, Christoph Nuesslein, Sixu Zhou, Hanjun kim, Kinsey Herrin, Aaron Young

发表机构 * Georgia Institute of Technology(佐治亚理工学院) Woodruff School of Mechanical Engineering(伍德拉夫机械工程学院) Institute for Robotics and Intelligent Machines(机器人与智能机器研究所)

AI总结 本文提出一种端到端假肢控制器,利用时序卷积网络从机载传感器估计连续执行器信号,无需意图分类器和个体调参,在多种地形和步态模式下实现统一、模式自适应的假肢辅助。

Comments 7 pages, 6 figures

详情
AI中文摘要

动力假肢通常依赖需要大量手动调参和显式模式分类的阻抗控制器。在这项工作中,我们展示了端到端假肢控制器的实时部署,该控制器从机载传感器估计连续执行器信号,消除了对意图分类器和个体调参的需求。时序卷积网络在来自18名经股截肢者的多地形数据集上训练,并在五种运动模式下实时部署。四名参与者(三名健全人,一名经股截肢者)在平地、斜坡上坡、斜坡下坡以及楼梯上坡和下坡上行走。在平地行走中,部署的控制器再现了踝关节峰值力矩随步行速度变化的训练数据缩放关系(部署:0.85 Nm/kg per m/s,p = 0.001;训练:0.96 Nm/kg per m/s,95% CI [0.42, 1.50],p = 0.002),排除了一个因异常假肢负载导致的离群点。在斜坡上坡时,控制器使膝关节预屈曲随坡度变化(部署:2.92 deg/deg,p = 0.027;训练:3.30 deg/deg,95% CI [1.83, 4.77],p < 0.001)。在斜坡下坡时,控制器相对于平地行走增加了膝关节阻力矩(部署:+0.16 Nm/kg,p < 0.001;训练:+0.16 Nm/kg,p = 0.008)。尽管训练数据仅包含一种肢体引导序列,但控制器在楼梯上坡和下坡中为健侧和假肢侧引导序列均生成了无缝的过渡。这些结果为端到端控制提供了初步证据,表明其能够提供统一、模式自适应的假肢辅助,而无需个体调参。

英文摘要

Powered prostheses conventionally rely on impedance controllers that require extensive manual tuning and explicit mode classification. In this work, we present real-time deployment of an end-to-end prosthesis controller that estimates continuous actuator signals from onboard sensors, eliminating the need for intent classifiers and subject-specific tuning. Temporal Convolutional Networks were trained on a multi-terrain dataset from 18 individuals with transfemoral amputation and deployed in real time across five locomotion modes. Four participants (three able-bodied, one with transfemoral amputation) ambulated across level ground, ramp ascent and descent, and stair ascent and descent. During level walking, the deployed controller reproduced the training-data scaling of peak ankle torque with walking speed (deployed 0.85 Nm/kg per m/s, p = 0.001; training 0.96 Nm/kg per m/s, 95% CI [0.42, 1.50], p = 0.002), after excluding one outlier traced to atypical prosthesis loading. During ramp ascent, the controller scaled knee pre-flexion with grade (deployed 2.92 deg/deg, p = 0.027; training 3.30 deg/deg, 95% CI [1.83, 4.77], p < 0.001). During ramp descent, the controller increased resistive knee torque relative to level walking (deployed +0.16 Nm/kg, p < 0.001; training +0.16 Nm/kg, p = 0.008). Seamless stair transitions were generated for both intact- and prosthetic-side-leading sequences in ascent and descent, despite the training data containing only one limb-leading sequence. These results provide initial evidence towards end-to-end control that can provide unified, mode-adaptive prosthetic assistance without subject-specific tuning.

2606.07898 2026-06-09 cs.LG cs.CE 新提交

Temporal Coverage over Density: Parsimonious Training-Set Design for ML Climate Downscaling

密度之上的时间覆盖:机器学习气候降尺度的简约训练集设计

Karandeep Singh, Stefan Rahimi, Chad W. Thackeray, Stephen Cropper, Alex Hall

发表机构 * University of California, Los Angeles(加州大学洛杉矶分校) University of Wyoming(怀俄明大学)

AI总结 针对机器学习气候降尺度中高分辨率模拟资源有限的问题,提出通过时间分布采样而非连续块状采样来分配训练年份,以更好地捕捉强迫气候响应和内部变率,实验表明时间分布采样在固定预算下性能最优。

Comments 22 pages, 8 figures

详情
AI中文摘要

高分辨率区域气候模拟为气候影响评估提供了关键信息,但计算成本高昂,这推动了机器学习降尺度器和模拟器的发展。一个关键挑战是确定如何将有限的高分辨率模拟分布到不断变化的气候轨迹中,以同时捕捉强迫气候响应和内部变率。利用美国西部的CESM2大型集合,我们在固定数据预算下比较了三种训练年份选择策略:历史年份的连续块、从模拟期开始和结束年份中抽取的年份,以及分布在整个气候轨迹中的年份。同时包含历史和未来年份始终优于仅使用历史年份训练,这表明让降尺度模型接触历史记录之外的气候状态的重要性,并突出了统计降尺度中常见的平稳性假设的局限性。使用分布在整个气候轨迹中的年份进行训练总体表现最佳,表明内部变率的广泛采样除了暴露于强迫气候响应之外还提供了额外信息。在时间分布子集上训练的模型能更成功地再现未见集合成员中的变率,同时在广泛的气候诊断中保持强劲性能。即使仅使用可用高分辨率年份的十分之一进行训练,时间分布模型仍与全数据训练高度竞争。这些结果表明,在固定计算预算下,分配稀缺的高分辨率模拟时,气候状态的广泛采样比时间连续性更有价值。这些发现为区域气候降尺度和大型集合预测工作流程提供了实用指导。

英文摘要

High-resolution regional climate simulations provide critical information for climate impacts assessments but remain computationally expensive, motivating the development of machine-learning downscalers and emulators. A key challenge is determining how limited high-resolution simulations should be distributed across a changing climate trajectory to capture both forced climate response and internal variability. Using the CESM2 Large Ensemble over the western United States, we compare three training-year selection strategies under fixed data budgets: a contiguous block of historical years, years drawn from both the beginning and end of the simulation period, and years distributed throughout the full climate trajectory. Including both historical and future years consistently outperforms training on historical years alone, demonstrating the importance of exposing downscaling models to climate states outside the historical record and highlighting limitations of stationarity assumptions common in statistical downscaling. Training on years distributed throughout the full climate trajectory performs best overall, indicating that broad sampling of internal variability provides additional information beyond exposure to the forced climate response alone. Models trained on temporally distributed subsets more successfully reproduce variability in unseen ensemble members while retaining strong performance across a wide range of climate diagnostics. Even when trained on only one-tenth of the available high-resolution years, temporally distributed models remain highly competitive with full-data training. These results suggest that, under fixed computational budgets, broad sampling of climate states is more valuable than temporal continuity when allocating scarce high-resolution simulations. The findings provide practical guidance for regional climate downscaling and large-ensemble projection workflows.

2606.07897 2026-06-09 cs.AI cs.HC 新提交

The AI Epistemic Deference Index: A Continuous Measure of Sycophancy

AI认知顺从指数:谄媚行为的连续度量

Alejandro Botas, Paul de Font-Reaulx, Luke Hewitt

发表机构 * Independent(独立研究者) University of Michigan, Ann Arbor(密歇根大学安娜堡分校) Transluce

AI总结 提出AI认知顺从指数(AEDI),通过从自然语言输出中估计概率来连续度量模型对用户态度的顺从程度,测试8个模型发现显著差异,Claude顺从最少,Grok和Gemini最多。

详情
AI中文摘要

当前的AI模型经常表现出认知谄媚,即赞同用户的说法。现有的评估通常通过衡量使模型改变二元认可所需的条件,或通过引发对命题的明确概率来度量。然而,许多面向用户的谄媚行为是通过日常语言中表达的分级支持的转变来体现的。我们提出AI认知顺从指数(AEDI):一个连续的、单维度的分数,表示模型输出中表达的支持对用户提示中表达的态度敏感程度。为了生成AEDI,我们提供了一种新的协议,用于从自然语言输出中估计概率,使用LLM作为评判者,并验证了其与人类判断的一致性和相关性。我们在一个包含500个不同主题命题和16000个不同用户态度提示的新策划数据库上部署了该指数,测试了8个主流模型。每个模型都表现出显著的顺从,尽管不同提供商之间存在巨大且系统的差异,其中Claude模型顺从最少,而Grok和Gemini模型顺从最多。在要求书面产物的提示中,这种效应被放大,并集中在模型先验较弱的命题上。我们发布AEDI作为一个易于更新的基准和测量流程,用于输出级别的谄媚评估。

英文摘要

Current AI models frequently exhibit epistemic sycophancy, endorsing claims to agree with a user. Existing evaluations typically measure this either by assessing what it takes to make a model shift a binary endorsement or by eliciting an explicit probability in a proposition. However, much user-facing sycophantic behavior is demonstrated through shifts in graded support expressed through ordinary language. We propose the AI Epistemic Deference Index (AEDI): a continuous, unidimensional score representing how sensitive the support expressed in a model's output is to the attitude expressed in a user's prompt. To generate AEDI, we provide a new protocol for estimating probabilities from natural language outputs, using LLMs-as-judges validated for consistency and correlation to human judgment. We deploy it on a new curated database of 500 propositions across diverse topics and 16,000 prompts varying in user attitude, testing eight prominent models. Every model exhibits substantial deference, though with large and systematic differences across providers, with Claude models demonstrating the least, and Grok and Gemini models the most. The effect is amplified in prompts requesting a written artifact, and concentrated on propositions where models hold weaker priors. We release AEDI as an easy-to-update benchmark and measurement pipeline for output-level sycophancy evaluation.

2606.07895 2026-06-09 cs.CV cs.RO 新提交

TBD-VLA: Temporal Block Diffusion Vision Language Action Model

TBD-VLA: 时序块扩散视觉语言动作模型

Sung-Wook Lee, Xuhui Kang, Yen-Ling Kuo

发表机构 * University of Virginia(弗吉尼亚大学)

AI总结 提出TBD-VLA框架,通过时序块扩散机制实现离散令牌VLA模型的并行动作生成,兼顾时序连贯性与推理速度,在仿真和真实任务中优于先前方法。

详情
AI中文摘要

离散视觉-语言-动作(VLA)模型通常将动作生成建模为离散动作空间上的下一个令牌预测,每个令牌自回归地依赖于先前的上下文。虽然有效,但这种范式会导致高推理延迟,并且很大程度上忽略了动作轨迹中固有的时间结构。最近的工作引入并行解码以提高效率,实现更快的推理,但缺乏建模令牌依赖关系的显式机制。我们提出TBD-VLA,一种基于离散令牌的VLA框架,它结合了块扩散以实现时序动作生成。我们将动作序列划分为时间块,并在每个块内执行掩码离散扩散,同时保持跨块的自回归生成。这种设计统一了时序自回归和并行动作解码,实现了强时序连贯性和改进的推理速度。此外,显式的时序建模通过时序修补实现了动作块(例如实时分块)的异步执行。TBD-VLA在仿真和真实世界的操作任务中显著优于先前的VLA方法,为走向快速、时序感知的离散VLA模型提供了一条可扩展的路径。项目网页:https://tbd-vla.github.io/

英文摘要

Discrete Vision-Language-Action (VLA) models typically formulate action generation as next-token prediction over discretized action spaces, conditioning each token autoregressively on prior context. While effective, this paradigm incurs high inference latency and largely ignores the temporal structure inherent in action trajectories. Recent efforts introduce parallel decoding to improve efficiency, enabling faster inference, but lack explicit mechanisms for modeling token dependencies. We introduce TBD-VLA, a discrete token-based VLA framework that incorporates block diffusion to enable temporal action generation. We partition action sequences into temporal blocks and perform masked discrete diffusion within each block, while maintaining autoregressive generation across blocks. This design unifies temporal autoregression and parallel action decoding, achieving both strong temporal coherence and improved inference speed. In addition, the explicit temporal modeling enables asynchronous execution of action chunks (e.g., Real-Time Chunking) via temporal in-painting. TBD-VLA significantly outperforms prior VLA approaches in both simulation and real-world manipulation tasks, offering a scalable path toward fast, temporally aware, discrete VLA models. Project webpage: https://tbd-vla.github.io/

2606.07893 2026-06-09 cs.CL 新提交

Beyond Individual Personas: Aligning Synthetic Dialogue to Population-Level Behavior Distributions

超越个体角色:将合成对话对齐到群体层面的行为分布

Xinyi Liu, Rinat Khaziev, Hooshang Nayyeri, Emine Yilmaz, Charith Peris, Hari Thadakamalla

发表机构 * Amazon(亚马逊) University of Illinois Urbana–Champaign(伊利诺伊大学厄巴纳-香槟分校) University College London(伦敦大学学院)

AI总结 提出GroupPersona框架,通过将参考语料库的行为分布转化为生成控制,使合成对话语料库在群体层面与参考分布对齐,在12个行为属性上Jensen-Shannon散度降低24.4%。

详情
AI中文摘要

合成对话语料库越来越多地被用作目标对话数据的代理,然而基于角色的生成器优化的是个体对话而非语料库组成,导致产生局部合理的对话但群体层面的行为混合失真。我们引入GroupPersona框架,该框架将合成对话语料库对齐到参考语料库的行为分布。GroupPersona将群体统计转化为生成控制:它将每个对话的核心行为特征与可预测的副作用分离,并利用由此产生的行为组来根据定义参考群体的交互模式调节用户代理。我们在四个跨越两种对话来源(助手风格和Reddit衍生)的语料库上评估GroupPersona,并采用两种构建变体:结构保持和变体增强。相对于最强的平均基线,GroupPersona将合成分布与参考分布在12个行为属性上的Jensen-Shannon散度从0.234降低到0.177,降低了24.4%,并且在所有四个语料库上达到最佳或并列最佳,同时保持结构对齐。它还在参考对话质量分数的校准上达到最接近,将参考对话轮廓的平均绝对偏差降低到0.63,而次优基线为0.91。

英文摘要

Synthetic dialogue corpora are increasingly used as proxies for target dialogue data, yet persona-grounded generators optimize individual conversations rather than corpus composition, yielding locally plausible dialogues with distorted population-level behavior mixes. We introduce GroupPersona, a framework that aligns synthetic dialogue corpora to the behavior distribution of a reference corpus. GroupPersona turns population statistics into generation controls: it separates each dialogue's core behavioral signature from predictable side effects, and uses the resulting behavioral groups to condition user agents on the interaction patterns that define the reference population. We evaluate GroupPersona on four corpora crossing two dialogue sources, assistant-style and Reddit-derived, with two construction variants: structure-preserving and variation-enhanced. GroupPersona lowers Jensen-Shannon divergence between synthetic and reference distributions over 12 behavior attributes from 0.234 to 0.177 relative to the strongest average baseline, a 24.4% reduction, and is best or tied-best on all four corpora while preserving structural alignment. It also achieves the closest calibration to reference-conversation quality scores, reducing mean absolute deviation from the reference-conversation profile to 0.63 versus 0.91 for the next-best baseline.

2606.07891 2026-06-09 cs.CV 新提交

C3VD-DEFCOL: A Deformable Colonoscopy Dataset with Time-Resolved 3D Ground Truth and Realistic Appearance

C3VD-DEFCOL:具有时间分辨三维真实地面真值和逼真外观的可变形结肠镜数据集

Ethan Luk, Mayank V. Golhar, Anthony Song, Raúl Iranzo, Víctor M. Batlle, Lalithkumar Seenivasan, José M. M. Montiel, Nicholas J. Durr

发表机构 * Johns Hopkins University(约翰霍普金斯大学) Universidad de Zaragoza(萨拉戈萨大学)

AI总结 提出C3VD-DEFCOL框架和数据集,通过模拟蠕动变形和真实纹理渲染,为可变形结肠镜三维重建提供带时间分辨地面真值的评估平台。

详情
AI中文摘要

三维重建可通过估计黏膜覆盖范围并在筛查期间提醒临床医生遗漏区域来改进结肠镜检查。然而,由于当前没有数据集同时提供逼真的体内外观和密集的时间分辨三维地面真值(尤其在非刚性变形下),算法开发受到限制。我们提出C3VD-DEFCOL,一个用于评估可变形结肠镜重建的框架和数据集,具有配对的几何和逼真纹理。从C3VD/C3VDv2结肠网格和相机轨迹出发,我们生成结肠表面的受控变形,包括蠕动波和中心线运动,并渲染每帧深度、表面法线、光流、相机姿态和时间戳三维网格。然后,我们使用渲染的几何(主要是深度)来条件化一个基于LTX-2.3的模拟到真实翻译模型,该模型生成具有体内样黏膜颜色、纹理、血管和镜面外观的RGB片段,同时保留底层三维场景结构。所得数据集包含来自11个独特结肠网格几何的110个视频,具有不同的相机轨迹、外观和参数化变形模式,包括三个蠕动严重程度级别作为受控评估轴。我们使用外观真实性、几何一致性和时间一致性指标评估生成的视频,并利用配对地面真值对可变形三维重建中的下游任务——姿态估计进行基准测试。实验表明,姿态估计误差随变形严重程度增加而增加,提供了现有体内数据集无法实现的受控压力测试。总体而言,C3VD-DEFCOL被设计为一个可重复的定量评估平台,用于测试可变形三维重建算法,旨在缩小合成数据集与体内结肠镜之间的领域差距。

英文摘要

3D reconstruction could improve colonoscopy by estimating mucosal coverage and alerting clinicians to missed regions during screening. However, algorithm development is limited as no current datasets provide both a realistic in vivo appearance and dense, time-resolved 3D ground truth, especially under non-rigid deformation. We present C3VD-DEFCOL, a framework and dataset for evaluating deformable colonoscopy reconstruction with paired geometry and realistic texture. Starting from C3VD/C3VDv2 colon meshes and camera trajectories, we generate controlled deformations of the colon surface, including peristaltic waves and centerline motion, and render per-frame depth, surface normals, optical flow, camera poses, and time-stamped 3D meshes. We then use the rendered geometry, primarily depth, to condition an LTX-2.3-based sim-to-real translation model that produces RGB clips with in vivo-like mucosal color, texture, vasculature, and specular appearance while preserving the underlying 3D scene structure. The resulting dataset contains 110 videos from 11 unique colon mesh geometries, with varying camera trajectories, appearances, and parameterized deformation regimes, including three peristaltic severity levels that serve as controlled evaluation axes. We evaluate the generated videos using appearance realism, geometric consistency, and temporal consistency metrics, and use the paired ground truth to benchmark the downstream task of pose estimation in deformable 3D reconstruction. Our experiments show how pose estimation error increases with increasing deformation severity, providing a controlled stress test that is not possible with existing in vivo datasets. Overall, C3VD-DEFCOL is designed as a reproducible, quantitative evaluation platform for testing deformable 3D reconstruction algorithms, with the goal of reducing the domain gap between synthetic datasets and in vivo colonoscopy.

2606.07890 2026-06-09 cs.LG stat.ML 新提交

Partially Performative Prediction

部分表现性预测

Jaewook Lee, Tijana Zrnic

发表机构 * Stanford University(斯坦福大学)

AI总结 提出部分表现性预测框架,统一建模由模型部署引起的内生分布偏移和外部时间变化引起的外生偏移,并定义在线表现性稳定与最优性,分析重复训练等启发式方法的适应性条件。

详情
AI中文摘要

表现性预测研究当预测模型部署在重要领域时产生的反馈循环。在这些设置中,部署模型可能会改变模型旨在预测其模式的人群,导致学习系统内生的分布偏移。这种视角不同于经典的分布偏移处理,其中偏移通常被建模为数据生成过程中的外生变化。然而,在实践中,分布偏移很少是单一类型的。预测模型可能通过其支持的决策影响未来数据,而世界本身也因学习者无法控制的原因持续漂移。我们研究部分表现性预测,这是一个捕捉内源和外源分布偏移源的框架。该框架通过允许数据分布既响应部署的模型又根据外部时变过程演化,推广了表现性预测。我们通过定义在线类比来跟踪演化的部分表现性环境,将表现性稳定性和表现性最优性的核心概念扩展到这一设置。我们分析了实用的学习启发式方法,包括重复训练,并刻画了它们何时成功适应部分表现性环境。

英文摘要

Performative prediction studies feedback loops that arise when predictive models are deployed in consequential domains. In these settings, deploying a model can change the population whose patterns the model aims to predict, inducing a distribution shift that is endogenous to the learning system. This perspective departs from classical treatments of distribution shift, where shifts are typically modeled as exogenous changes in the data-generating process. Yet, in practice, distribution shift is rarely one or the other. Predictive models may influence future data through the decisions they support, while the world itself continues to drift for reasons beyond the learner's control. We study partially performative prediction, a framework that captures both endogenous and exogenous sources of distribution shift. The framework generalizes performative prediction by allowing the data distribution to evolve both in response to the deployed model and according to an external, time-varying process. We extend the central notions of performative stability and performative optimality to this setting by defining their online analogues that track the evolving partially performative environment. We analyze practical learning heuristics, including repeated retraining, and characterize when they successfully adapt to partially performative environments.

2606.07889 2026-06-09 cs.LG cs.AI cs.CL 新提交

Strained Coherence: A Pre-Failure Signal in Coding Agent Execution Trajectories

应变连贯性:编码代理执行轨迹中的故障前信号

Marut Pandya, Kasey Zhang, Baiqing Lyu

发表机构 * GitHub

AI总结 提出“应变连贯性”模式,即编码代理识别到问题但仍按原计划行动,通过构建Claude Sonnet 4.6检测器在44条轨迹上实现94%故障预测精度,优于基线方法。

详情
AI中文摘要

基于LLM的编码代理有时会承认自身推理中的问题,但仍继续执行。我们将这种模式称为应变连贯性:一种与安全相关的故障模式,其中代理拥有应改变其行为的信息,陈述了该信息,却仍违背它行动。该模式与口头奖励黑客行为重叠,即代理指出任务代理与底层目标之间的冲突,却仍优化代理。我们给出操作性定义,构建一个Claude Sonnet 4.6评判器,读取完整轨迹并标记该模式出现的片段,并使用Qwen3.5-35B-A3B骨干在44条Terminal-bench-2轨迹上评估。标记轨迹的失败率为94%,而未标记轨迹为46%(47个百分点的差距,Fisher精确检验p=0.003;排除三个提示嵌入示例后为46个百分点,p=0.006)。在匹配选择性下,检测器达到94%的精确度,而词汇话语标记基线为88%;两种方法的10条轨迹交集具有100%的失败率(Clopper-Pearson 95%置信区间[69%, 100%])。我们在Gemma4-31B上使用43条轨迹进行复制:整体信号方向一致但不显著(20个百分点差距,p=0.31),衰减主要由13条零思考内容的轨迹驱动,其中检测器没有可分析的基础。在Gemma的高冗长度三分位中,差距为+30个百分点;在Qwen的中等和高冗长度三分位中,差距各为+40个百分点。两个模型的首次标记出现在轨迹经过时间的中位数83-84%处,且二元标记在软化显式冲突标记的释义中保持不变(8/8条轨迹)。与单变量预测器不同,检测器输出可解释的跨度级输出——引用的承认、引用的行动和类型化的冲突——显示代理看到并忽略了什么。

英文摘要

LLM-based coding agents sometimes acknowledge a problem in their own reasoning and then proceed anyway. We call this pattern strained coherence: a safety-relevant failure mode in which an agent has information that should change its behavior, states that information, and still acts against it. The pattern overlaps with verbalized reward hacking, where an agent names a tension between a task proxy and the underlying goal yet optimizes the proxy anyway. We give an operational definition, build a Claude Sonnet 4.6 judge that reads full trajectories and flags spans where the pattern occurs, and evaluate it on 44 Terminal-bench-2 trajectories using a Qwen3.5-35B-A3B backbone. Flagged trajectories fail 94% of the time versus 46% for unflagged trajectories (47-point gap, Fisher's exact p = 0.003; 46 points after excluding three prompt-embedded examples, p = 0.006). At matched selectivity, the detector reaches 94% precision versus 88% for a lexical discourse-marker baseline; the 10-trajectory intersection of the two methods has a 100% failure rate (Clopper-Pearson 95% CI [69%, 100%]). We replicate on Gemma4-31B with 43 trajectories: the overall signal is directionally consistent but not significant (20-point gap, p = 0.31), with attenuation driven largely by 13 trajectories with zero think content, where the detector has no substrate to analyze. In the high-verbosity Gemma tertile, the gap is +30 points; in the mid- and high-verbosity Qwen tertiles, it is +40 points each. The first flag appears at a median of 83-84% of elapsed trajectory time across both models, and the binary flag survives paraphrases that soften explicit conflict markers (8/8 trajectories). Unlike univariate predictors, the detector emits interpretable span-level output -- quoted acknowledgment, quoted action, and typed conflict -- showing what the agent saw and ignored.

2606.07882 2026-06-09 cs.CV cs.AI 新提交

The Cross-Architecture Substrate: A Domain-Transcendent, Calibration-Surviving Geometric Invariant of Modern Vision Encoders

跨架构基板:现代视觉编码器的领域超越、校准存活的几何不变量

Yousef Radwan

发表机构 * KAUST(阿卜杜拉国王科技大学)

AI总结 发现现代视觉编码器训练后前16个主方向收敛到同一16维几何对象(跨架构基板),该基板跨视觉领域传输、校准后仍存在,并应用于无标签迁移性过滤、领域检测、低样本探测和无教师蒸馏。

Comments 14 pages, 2 figures. 40th Conference on Neural Information Processing Systems (NeurIPS 2026)

详情
AI中文摘要

不同的视觉神经网络——训练用于分类、对比、重建或将图像与文本匹配——应该具有相应不同的内部表示。我们报告它们并非如此。训练后,十三个现代视觉编码器内部的前十六个主变化方向收敛到同一个十六维几何对象。我们称之为跨架构基板,并使用PCA、中心核对齐(CKA)和Pang 2026校准进行研究。该基板在四个视觉领域(自然照片、医学CT、卫星、显微镜)上以中位数Procrustes-CKA 0.679传输,在八个领域(增加素描、深度、热红外、天文学)上为0.604,每对>0.40。它在全局(7.4倍判别vs MAE分离,n=13,394)和局部(4.82-5.30,p<10^{-44})上经受住Pang校准。它不是像素统计(0.263),不是Gabor特征(0.31),不是随机投影(0.041),并且在训练的前10%中出现,而准确率持续上升。我们提供了四个应用:一个无标签迁移性过滤器,优于LogME(快3倍,+0.15 Kendall-tau);一个四路领域检测器(99.6%准确率);一个冻结低样本探测器(16维在每类N=50标签时比768维DINOv2高3.78个百分点);以及一个无教师蒸馏辅助,匹配训练教师KD在33对上(10%标签分数时峰值增益7.56个百分点)。该基板不跨模态,不帮助跨范式蒸馏,也不预测迁移质量(与迁移准确率的rho=0.08)。

英文摘要

Different vision neural networks -- trained to classify, contrast, reconstruct, or match images to text -- should have correspondingly different internal representations. We report that they do not. After training, the top sixteen principal directions of variation inside thirteen modern vision encoders converge to the same sixteen-dimensional geometric object. We call this the cross-architecture substrate and study it with PCA, centred kernel alignment (CKA), and Pang 2026 calibration. The substrate transports across four visual domains (natural photographs, medical CT, satellite, microscopy) at median Procrustes-CKA 0.679, and across eight domains (adding sketches, depth, thermal infrared, astronomy) at 0.604, every pair >0.40. It survives Pang calibration globally (7.4x disc-vs-MAE separation, n=13,394) and locally (4.82-5.30, p<10^{-44}). It is not pixel statistics (0.263), not Gabor features (0.31), not a random projection (0.041), and emerges in the first 10% of training while accuracy keeps climbing. We deliver four applications: a label-free transferability filter beating LogME (3x faster, +0.15 Kendall-tau); a four-way domain detector (99.6% accuracy); a frozen low-shot probe (16 dims beat 768-dim DINOv2 by 3.78pp at N=50 labels per class); and a teacher-free distillation auxiliary matching trained-teacher KD on 33 pairs (7.56pp peak gain at 10% label fraction). The substrate does not cross modalities, does not help cross-paradigm distillation, and does not predict transfer quality (rho=0.08 against transfer accuracy).

2606.07881 2026-06-09 cs.LG 新提交

Breaking the Bubble: Asynchronous Pipeline Parallel Training with Bounded Weight Inconsistency

打破气泡:具有有界权重不一致性的异步流水线并行训练

Itay Elam, Eliron Rahimi, Avi Mendelson, Chaim Baskin

发表机构 * Technion - Israel Institute of Technology(以色列理工学院) Ben-Gurion University of the Negev(本·古里安大学)

AI总结 提出PACI方法,通过局部梯度累积控制版本漂移,实现无气泡异步流水线并行,在GPT风格语言模型预训练中匹配同步1F1B-flush的稳定性和困惑度,吞吐量完全利用,训练时间至准确率提升达1.69倍。

详情
AI中文摘要

流水线并行对于训练大型神经网络至关重要,但现有的调度方案在吞吐量、内存和优化一致性之间进行权衡。同步流水线保持了前向/反向权重一致性,但存在气泡;异步流水线消除了气泡,但引入了权重版本不匹配,通常需要权重暂存、预测或校正机制。我们提出了PACI(具有可控不一致性的流水线异步训练),一种无气泡的异步流水线方法,它限制了前向/反向版本漂移,无需权重暂存、预测、额外的参数副本或全局同步。关键思想是使用局部梯度累积作为版本控制机制:通过相对于流水线延迟减慢参数版本演化,PACI限制了任何微批次跨越的优化器更新次数,同时保持稳态利用率。在GPT风格的语言模型预训练中,PACI匹配了同步1F1B-flush的稳定性和最终困惑度,保留了相同的峰值内存占用,实现了完全利用的流水线吞吐量,并将训练时间至准确率相比最快的flush基线提升了高达1.69倍。这些结果表明,前向/反向不一致性不必消除:当明确有界时,可以安全地将其换取显著的效率提升。

英文摘要

Pipeline parallelism is essential for training large neural networks, but existing schedules trade off throughput, memory, and optimization consistency. Synchronous pipelines preserve forward/backward weight consistency but suffer from bubbles; asynchronous pipelines remove bubbles but introduce weight-version mismatch, typically requiring weight stashing, prediction, or correction mechanisms. We introduce PACI (Pipeline Asynchronous training with Controlled Inconsistency), a bubble-free asynchronous pipeline method that bounds forward/backward version drift without weight stashing, prediction, additional parameter copies, or global synchronization. The key idea is to use local gradient accumulation as a version-control mechanism: by slowing parameter-version evolution relative to pipeline delay, PACI limits the number of optimizer updates crossed by any micro-batch while preserving steady-state utilization. In GPT-style language-model pretraining, PACI matches the stability and final perplexity of synchronous 1F1B-flush, retains the same peak memory footprint, achieves fully utilized pipeline throughput, and improves training time-to-accuracy by up to $1.69\times$ over the fastest flush baseline. These results show that forward/backward inconsistency need not be eliminated: when explicitly bounded, it can be safely traded for substantial efficiency gains.