arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部类别1778
2606.07280 2026-06-08 cs.CV 新提交

Geometric-Aware Hypergraph Reasoning for Novel Class Discovery in Point Cloud Segmentation

几何感知超图推理用于点云分割中的新类别发现

Zihao Zhang, Aming Wu, Yang Li, Yahong Han, Jialie Shen

发表机构 * School of Artificial Intelligence, College of Intelligence and Computing, Tianjin University School of Computer Science and Information Engineering, Hefei University of Technology Department of Computer Science City St George’s, University of London

AI总结 提出超图框架建模高阶关联,结合几何感知原型,实现点云分割中从已知到新类别的协同推理,提升分割精度。

详情
Comments
Accepted to the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026
AI中文摘要

点云分割中的新类别发现旨在从已知类别转移知识,自动识别和分割点云中未标注的新类别。现有方法主要依赖成对关联进行类别分配和新类别推理,这限制了其捕捉已知和新类别间复杂关系的能力,可能导致语义分割不准确。为解决此问题,我们引入基于超图的框架,建模类别间的高阶关联,并实现从已知类别到新类别的协同推理,超越传统的成对关系。此外,现有方法倾向于关注语义特征提取,而对点云中的几何信息关注不足。为了更好地利用空间结构,我们提出几何感知原型以增强类别级几何线索的表示。通过超边传播几何信息,所提方法改进了对类别间空间分布的理解,从而实现更准确的分割。在SemanticKITTI和SemanticPOSS数据集上的实验证明了我们方法的有效性和优越性。

英文摘要

Novel class discovery in point cloud segmentation aims to transfer knowledge from known classes to automatically identify and segment unlabeled novel classes in point clouds. Existing methods mainly rely on pairwise associations for class assignment and novel class reasoning, which limits their ability to capture complex relationships among known and novel classes and may lead to inaccurate semantic segmentation. To address this issue, we introduce a hypergraph-based framework that models high-order associations among classes and enables collaborative reasoning from known classes to novel classes beyond traditional pairwise relations. Moreover, existing methods tend to focus on semantic feature extraction while paying insufficient attention to geometric information in point clouds. To better exploit spatial structure, we propose Geometric-Aware Prototypes to enhance the representation of class-level geometric cues. By propagating geometric information through hyperedges, the proposed method improves the understanding of spatial distributions across classes and leads to more accurate segmentation. Experiments on the SemanticKITTI and SemanticPOSS datasets demonstrate the effectiveness and superiority of our method.

2606.07254 2026-06-08 cs.LG cs.FL 新提交

A Held-Out Transition-Pair Falsifier for Long-Horizon Non-Abelian State Tracking

长视野非阿贝尔状态跟踪的保留转移对验证器

Jeonghoon Lee

发表机构 * Attractor Dynamics

AI总结 针对序列模型在非交换状态跟踪中的局限,提出保留转移对验证协议,在投影循环状态模型上实现长达百万步的完美预测,揭示显式非交换状态组合作为有效归纳偏置。

详情
Comments
Technical preprint, 24 pages. 7 figures
AI中文摘要

状态跟踪揭示了序列模型的一个尖锐限制:相关信号通常不是观测令牌的摘要,而是通过非交换变换演化的有序潜在状态。我们引入了一个用于有限非阿贝尔群跟踪的保留转移对验证器。该协议在训练期间禁止选定的有序生成器对,并在评估期间要求相同的局部模式,从而阻断了一条直接的局部转移记忆路径。在一个受控的 $S_3 \ imes S_3$ 基准测试中,仅在长度为8的序列上训练的投影循环状态模型,在长达1,048,576个令牌的评估视野中,跨五个种子产生了无错误的最终状态预测(每个视野完美250/250)。匹配的原生读出基线,包括bag、GRU和单配置结构化状态空间模型,在相同协议下保持接近基线水平。投影匹配的GRU、结构化SSM和bag基线配备了类似的有限群原型读出,在相同分割下也保持接近随机水平。机制诊断显示,硬投影与低同态误差、低状态一致性漂移和非平凡交换子分离同时出现,而软投影则导致最终状态精度崩溃。干净分割审计验证了训练和评估分区之间零逐字缩减词重叠和零结构模板重叠。该证据限于这个受控的有限群验证器,而非通用架构排名。在该范围内,显式投影的非交换状态组合作为长视野隐藏状态跟踪的有用归纳偏置。

英文摘要

State tracking exposes a sharp limitation of sequence models: the relevant signal is often not a summary of observed tokens, but an ordered latent state that evolves through non-commutative transformations. We introduce a held-out transition-pair falsifier for finite non-Abelian group tracking. The protocol forbids selected ordered generator pairs during training and requires the same local patterns during evaluation, blocking one direct local-transition memorization pathway. In a controlled $S_3 \times S_3$ benchmark, a projected recurrent state model trained only on length-8 sequences produces error-free final-state predictions (perfect 250/250 per horizon) through evaluation horizons up to 1,048,576 tokens across five seeds. Matched native-readout baselines, including bag, GRU, and a single-configuration structured state-space model, remain near floor under the same protocol. Projection-matched GRU, structured SSM, and bag baselines equipped with analogous finite-group prototype readouts also remain near chance under the same split. Mechanism diagnostics show that hard projection coincides with low homomorphism error, low state-consistency drift, and non-trivial commutator separation, while softened projection collapses final-state accuracy. Clean-split audits verify zero verbatim reduced-word overlap and zero structural-template overlap between training and evaluation partitions. The evidence is scoped to this controlled finite-group falsifier rather than to a general architecture ranking. Within that regime, explicit projected non-commutative state composition acts as a useful inductive bias for long-horizon hidden-state tracking.

2606.07249 2026-06-08 cs.CV 新提交

Reconstructing Multi-Decadal Forest Disturbances: A Spatio-Temporal Transformer Approach

重建多年代森林干扰:一种时空Transformer方法

Linus Scheibenreif, Anton Raichuk, Maxim Neumann

发表机构 * Google DeepMind

AI总结 提出时空Transformer框架,同时建模时间轨迹和空间邻域,利用Landsat、Sentinel-1/2数据重建美国1984-2022年森林干扰图,在手动标注验证集上达到高精度并减少空间伪影。

详情
AI中文摘要

准确监测森林干扰对于理解碳动态和土地管理至关重要,但传统方法通常依赖卫星时间序列的逐像素分析,忽略了空间上下文。我们提出了一种深度学习框架,通过同时建模时间轨迹和空间邻域,绘制了美国本土38年(1984-2022)的森林干扰图。通过利用视觉Transformer架构,我们的方法有效过滤了弱监督信号中的噪声,生成了空间连贯的干扰图。我们在多个卫星(Landsat、Sentinel-1、Sentinel-2)和时间窗口(38年及最近6年)上进行了详尽评估,并使用新的人工标注验证数据集(n=300)和独立火周界数据集(n=706)验证了性能。结果凸显了任务的复杂性:我们的时空模型表现出高精度(在MTBS上±1年检测精度高达98.2%,在CONUS验证数据集上高达71.3%,F1分数分别高达75.8%和47.3%),并有效减少了空间伪影,但与逐像素基线相比,在不同干扰类型上存在性能权衡。我们的方法为一致的森林监测提供了有前景的基础。

英文摘要

Accurate monitoring of forest disturbances is essential for understanding carbon dynamics and land management, yet traditional approaches typically rely on pixel-wise analysis of satellite time-series, ignoring spatial context. We present a deep learning framework that maps 38 years (1984-2022) of forest disturbance across the contiguous United States by modeling temporal trajectories and spatial neighborhoods simultaneously. By leveraging a vision transformer architecture, our approach effectively filters noise from weak supervision signals to produce spatially coherent disturbance maps. We perform exhaustive evaluations across multiple satellites (Landsat, Sentinel-1, Sentinel-2) and temporal windows (38 years and the more recent 6 years), validating performance against a novel, manually annotated validation dataset (n=300) and independent fire perimeter dataset (n=706). The results highlight the complexity of the task: while our spatio-temporal model demonstrates high precision (up to 98.2% for +-1 year detection on MTBS and up to 71.3% on the CONUS validation datasets, with F1-scores up to 75.8% and 47.3%, respectively) and effectively reduces spatial artifacts, it exhibits performance trade-offs across different disturbance regimes compared to pixel-wise baselines. Our method offers a promising foundation for consistent forest monitoring.

2606.07244 2026-06-08 cs.RO cs.AI cs.CV 新提交

Beyond Waypoints: A Trajectory-Centric Waypointing Paradigm for Vision-Language Navigation

超越航点:面向视觉语言导航的轨迹中心航点范式

Haoxiang Shi, Xiang Deng, Haoyu Zhang, Qiaohui Chu, Yaowei Wang, Liqiang Nie

发表机构 * Harbin Institute of Technology (Shenzhen) Pengcheng Laboratory

AI总结 提出轨迹航点范式,通过TSDF引导的扩散策略预测可执行轨迹,解决VLN-CE中航点不可达与规划控制不一致问题,在基准上取得最优性能。

详情
AI中文摘要

连续环境中的视觉语言导航(VLN-CE)要求智能体在类似真实世界的环境中遵循自然语言指令进行导航。大多数VLN-CE方法采用三阶段框架:航点预测器提出可导航航点,导航器选择最佳航点,低层控制器执行移动。然而,这种解耦范式常导致航点不可达或规划与控制不一致。本文提出一种称为轨迹航点的新范式,将每个候选航点锚定到可执行轨迹上。为此,我们设计了TSDF引导的扩散策略作为轨迹航点预测器,引导轨迹生成避开障碍物,从本质上保证预测航点的可达性。进一步提出轨迹增强导航器,将关联轨迹作为额外信息注入规划,实现高层语义决策与低层执行的严格一致性。在VLN-CE基准上的大量实验表明,我们的轨迹航点范式优于基线方法。

英文摘要

Vision-Language Navigation in Continuous Environments (VLN-CE) requires agents to follow natural-language instructions while navigating in real-world-like environments. Most VLN-CE approach\-es adopt a three-stage framework: a waypoint predictor proposes navigable waypoints, and a navigator selects the best waypoint, with a low-level controller executing the movement to it. However, this decoupled paradigm often leads to unreachable waypoints or inconsistencies between planning and control. In this work, instead of predicting isolated waypoints, we introduce a novel paradigm called Trajectory Waypoint, which grounds each candidate waypoint in an executable trajectory. To realize this, we design a Trajectory Waypoint Predictor formulated as a TSDF-guided diffusion policy, which steers trajectory generation away from obstacles, inherently ensuring the reachability of the predicted waypoints. We further propose a trajectory-enhanced navigator that injects the associated trajectory as additional information for planning, enabling strict consistency between high-level semantic decisions and low-level execution. Extensive experiments on the VLN-CE benchmark show that our Trajectory Waypoint paradigm achieves superior performance over the baselines.

2606.07240 2026-06-08 cs.CL cs.SD 新提交

KIT's Submission to Cross-Lingual Voice Cloning in IWSLT 2026

KIT 提交至 IWSLT 2026 跨语言语音克隆任务

Seymanur Akti, Alexander Waibel

发表机构 * Karlsruhe Institute of Technology (KIT) Carnegie Mellon University (CMU) KIT Campus Transfer (KCT)

AI总结 针对跨语言语音克隆中的口音变化和领域词汇问题,基于FishAudio-S2-Pro多语言文本转语音模型,引入语言标签提示、强化学习微调和参考条件词汇匹配方法,提升可懂度和自然度。

详情
AI中文摘要

跨语言语音克隆旨在在保留源语言参考说话者身份的同时,生成目标语言的语音。该任务是语音翻译的核心,也是IWSLT 2026跨语言语音克隆轨道的焦点。一个关键挑战是在口音变化和领域特定词汇存在的情况下保持可懂度和自然度。我们基于多语言文本转语音模型FishAudio-S2-Pro,引入语言标签提示以改善语言控制并减少口音泄漏。我们进一步应用强化学习(RL)微调进行任务适应,并观察到可懂度的提升。最后,我们提出了一种参考条件词汇匹配方法,在词汇重叠时改善领域特定术语的发音。结果表明,语言提示带来了最大的增益,而词汇匹配在匹配子集上产生了一致的改进。

英文摘要

Cross-lingual voice cloning aims to generate speech in a target language while preserving speaker identity from a source-language reference. This task is central to speech translation and is the focus of the IWSLT 2026 Cross-Lingual Voice Cloning track. A key challenge is maintaining intelligibility and naturalness in the presence of accent variation and domain-specific vocabulary. We build on a multilingual text-to-speech model, FishAudio-S2-Pro, and introduce language tag prompting to improve language control and reduce accent leakage. We further apply reinforcement learning (RL) fine-tuning for task adaptation and observe improvements in intelligibility. Finally, we propose a reference-conditioned lexical matching method that improves pronunciation of domain-specific terms when lexical overlap is present. Results show that language prompting provides the largest gains, while lexical matching yields consistent improvements on matched subsets.

2606.07239 2026-06-08 cs.LG 新提交

Generative Molecular Morphing for Flexible-Size Design via Unbalanced Optimal Transport

基于非平衡最优传输的柔性尺寸分子生成变形设计

Malte Franke, Stefan P. Schmid, Zarko Ivkovic, Kjell Jorner, Andreas Krause

发表机构 * ETH Zürich NCCR Catalysis

AI总结 针对现有扩散和流模型固定原子数限制的问题,提出基于非平衡最优传输的柔性尺寸分子生成模型Morph,实现条件与无条件3D分子设计,在保持性能的同时提供采样灵活性,并支持分布外生成。

详情
AI中文摘要

生成分子设计的成功取决于模型向高奖励样本的可引导性。由于许多分子性质与分子大小内在相关,准确捕捉性质与原子数的联合分布至关重要。然而,当前的扩散和基于流的模型固定了原子数,这最终限制了它们驾驭这种复杂关系的能力。为解决这一问题,我们引入了Morph,一种基于几何图的柔性尺寸生成模型,用于条件和无条件的3D分子设计。通过动态调整尺寸,Morph可以无缝集成现有的结构先验(如骨架),并显著增强性质引导。我们证明Morph在提供无与伦比的采样灵活性的同时,与当前固定尺寸的最先进模型性能相当。我们展示了在先前模型失败的领域中的分布外生成,为分子设计的增强生成建模铺平了道路。

英文摘要

The success of generative molecular design hinges on a model's steerability toward high-reward samples. Because many molecular properties are intrinsically linked to molecular size, accurately capturing the joint distribution of properties and the number of atoms is essential. However, current diffusion and flow-based models fix the number of atoms, which ultimately limits their ability to navigate this complex relationship. To address this, we introduce Morph, a flexible-size generative model for conditional and unconditional 3D molecular design based on geometric graphs. By dynamically adapting size, Morph can seamlessly integrate existing structural priors, like scaffolds, and significantly enhances property steering. We show that Morph matches current fixed-size state-of-the-art models while offering the benefit of unparalleled sampling flexibility. We demonstrate out-of-distribution generation in regimes where previous models fail, paving the way for enhanced generative modeling for molecular design.

2606.07237 2026-06-08 cs.CL cs.AI cs.LG 新提交

When Large Language Models Fail in Healthcare: Evaluating Sensitivity to Prompt Variations

当大型语言模型在医疗保健中失败:评估对提示变化的敏感性

Mahdi Alkaeed

发表机构 * Department of Computer Science and Engineering, Doha, Qatar

AI总结 本研究系统分析了通用和医学专用LLM对提示扰动的敏感性,发现即使是微小的措辞变化也可能改变临床建议,对抗性提示可能引发有害输出,表明这些模型在临床应用中不可靠。

详情
Comments
12 pages
AI中文摘要

大型语言模型(LLM)越来越多地用于医疗保健任务,如临床问答、诊断支持和报告总结。尽管前景广阔,但这些模型对微小的提示扰动(包括词汇和句法)仍然高度敏感,在安全关键的临床应用中构成严重风险。在本研究中,我们使用MedMCQA基准进行了系统的敏感性分析,以评估通用(例如GPT-3.5、Llama3)和医学专用LLM(例如ClinicalBERT、BioLlama3、BioBERT)的鲁棒性。我们将扰动分为自然和对抗两种类型,并检查它们对临床推理任务中模型一致性、准确性和可靠性的影响。我们的发现表明,医学LLM并非本质安全。即使是措辞的微小变化也可能改变临床建议,而针对性的对抗性提示可能引发有害输出。在医疗保健等高风险环境中,这种不可预测性是不可接受的——模型因重新措辞的输入而改变诊断,或因轻微改写而幻觉药物,临床医生无法可靠地信任它们。虽然模型通常对简单的词汇替换或释义表现出韧性,但在句法重新排序或误导性上下文线索下往往会崩溃。这种脆弱性在通用和领域专用LLM中都很明显。值得注意的是,对抗性操作可能导致临床危险的输出,例如推荐不正确的剂量或遗漏关键发现。

英文摘要

Large Language Models (LLMs) are increasingly used in healthcare for tasks such as clinical question answering, diagnosis support, and report summarization. Despite their promise, these models remain highly sensitive to subtle prompt perturbations, both lexical and syntactic, posing serious risks in safety-critical clinical applications. In this study, we conduct a systematic sensitivity analysis to evaluate the robustness of both general-purpose (e.g., GPT-3.5, Llama3) and medical-specific LLMs (e.g., ClinicalBERT, BioLlama3, BioBERT) using the MedMCQA benchmark. We categorize perturbations into natural and adversarial types and examine their effect on model consistency, accuracy, and reliability in clinical reasoning tasks. Our findings reveal that medical LLMs are not intrinsically safe. Even minor variations in phrasing can alter clinical advice, and targeted adversarial prompts can provoke harmful outputs. In high-stakes settings like healthcare, such unpredictability is unacceptable-models that change diagnoses due to reworded inputs or hallucinate medications when slightly rephrased cannot be reliably trusted by clinicians. While models tend to show resilience to simple lexical substitutions or paraphrasing, they often break down under syntactic reordering or misleading contextual cues. This fragility is evident across both general-purpose and domain-specific LLMs. Notably, adversarial manipulations can lead to clinically dangerous outputs, such as recommending incorrect dosages or omitting critical findings.

2606.07233 2026-06-08 cs.CV cs.LG cs.RO 新提交

Does Appearance Help? A Systematic Study of Image-Based Re-Identification in Online 3D Multi-Pedestrian Tracking

外观有帮助吗?在线3D多行人追踪中基于图像的重识别系统研究

Eduardo Borges, Luís Garrote, Urbano J. Nunes

发表机构 * Institute of Systems and Robotics, Department of Electrical and Computer Engineering, University of Coimbra

AI总结 系统研究轻量级投影框架下图像重识别在在线3D多目标追踪中的作用,提出级联匹配策略以在低延迟下恢复遮挡轨迹并防止身份切换。

详情
Comments
Accepted for publication at the 35th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN 2026)
AI中文摘要

基于LiDAR的3D多目标追踪通常仅依赖几何信息,这在长时间遮挡或拥挤人群环境中往往不足以区分目标。虽然集成基于RGB的重识别提供了保持身份上下文的理论解决方案,但现有方法通常依赖计算昂贵的并行检测器,阻碍了机器人的实时响应。本文通过利用轻量级投影框架解耦移动机器人的几何和外观建模,对在线3D多目标追踪中的基于图像的重识别进行了系统研究。对特征提取架构进行了全面分析,采用轻量级CNN和视觉Transformer,并评估了多种多模态数据关联策略以平衡计算延迟和鲁棒追踪。在KITTI数据集的行人类别上的实验表明,外观和运动成本的朴素线性融合由于视觉噪声而降低了性能。相反,级联匹配策略成功恢复了被遮挡的轨迹而不损害整体精度,有效防止了身份切换以维持人机交互的连续性。我们表明,轻量级架构可以在安全导航所需的低延迟和社交意识所需的判别能力之间提供最优权衡。

英文摘要

LiDAR-based 3D Multi-Object Tracking (MOT) typically relies solely on geometric information, which is often insufficient to distinguish between targets during prolonged occlusions or in crowded human-populated environments. While integrating RGB-based Re-Identification (ReID) offers a theoretical solution for preserving identity context, existing approaches often rely on computationally expensive parallel detectors that hinder real-time robot responsiveness. This work presents a systematic study of image-based ReID in online 3D MOT, utilizing a lightweight projection-based framework to decouple geometric and appearance modeling for mobile robots. A comprehensive analysis of feature extraction architectures is conducted, employing lightweight CNNs and Vision Transformers, and evaluating various multi-modal data association strategies to balance computational latency with robust tracking. Experiments on the Pedestrian class of the KITTI dataset reveal that naive linear fusion, of appearance and motion costs, degrades performance due to visual noise. Conversely, a cascaded matching strategy successfully recovers occluded tracks without compromising overall precision, effectively preventing identity switches to maintain human-robot interaction continuity. We show that lightweight architectures can offer an optimal trade-off between the low latency required for safe navigation and the discriminative power needed for social awareness.

2606.07222 2026-06-08 cs.CV cs.AI 新提交

DualGate-Net: A Prior-Gated Dual-Encoder Framework for Histopathology Cell Detection

DualGate-Net: 用于组织病理学细胞检测的先验门控双编码器框架

Bahman Jafari Tabaghsar, Son Tran, K. Devaraja, Atul Sajjanhar

发表机构 * School of Information Technology, Deakin University Kasturba Medical College, Manipal Academy of Higher Education

AI总结 提出DualGate-Net,通过可学习的先验门控融合机制自适应调节组织先验影响,结合局部和全局编码器及辅助分支,在OCELOT基准上实现稳健的细胞检测。

详情
Comments
15 pages, 4 figures
AI中文摘要

组织病理学图像中的细胞检测强烈依赖于周围组织背景,其中视觉上相似的细胞在不同微环境下可能属于不同类别。最近的感知组织方法结合了上下文先验,但通常依赖于可能传播噪声信息的静态融合策略。在这项工作中,我们提出了DualGate-Net,一种先验感知的双编码器框架,通过可学习的先验门控融合机制结合了基于ConvNeXtV2的局部编码器和基于SegFormer的全局编码器。所提出的模块自适应地调节组织先验在空间位置上的影响,同时一个辅助的前景重建分支在训练过程中保留高频细胞结构。此外,还引入了辅助的细胞性引导线索以进一步提高定位鲁棒性。在OCELOT基准上的实验表明,该方法在验证集上取得了0.7722的宏F1分数,在测试集上取得了0.7345的宏F1分数,突显了自适应先验整合对于稳健的组织病理学细胞检测的有效性。

英文摘要

Cell detection in histopathology images strongly depends on surrounding tissue context, where visually similar cells may belong to different classes under different microenvironments. Recent tissue-aware methods incorporate contextual priors, but often rely on static fusion strategies that may propagate noisy information. In this work, we propose DualGate-Net, a prior-aware dual-encoder framework that combines a ConvNeXtV2-based local encoder and a SegFormer-based global encoder through a learnable prior-gated fusion mechanism. The proposed module adaptively regulates the influence of tissue priors across spatial locations, while an auxiliary foreground reconstruction branch preserves high-frequency cellular structures during training. In addition, auxiliary cellness-guided cues are incorporated to further improve localization robustness. Experiments on the OCELOT benchmark demonstrate consistent improvements, achieving macro F1-scores of 0.7722 on the validation set and 0.7345 on the test set, highlighting the effectiveness of adaptive prior integration for robust histopathology cell detection.

2606.07219 2026-06-08 cs.CL cs.SI 新提交

Adversarial Creation and Detection of AI-Generated Social Bot Content

AI生成的社交机器人内容的对抗性创建与检测

Mykola Trokhymovych, Ricardo Baeza-Yates, Alessandro Flammini, Diego Saez-Trumper, Filippo Menczer

发表机构 * Universitat Pompeu Fabra Observatory on Social Media, Indiana University KTH Royal Institute of Technology

AI总结 提出对抗性方法模拟恶意用户冒充真人,构建多语言跨平台配对数据集,训练检测模型显著优于现有方法。

详情
AI中文摘要

大型语言模型与社交机器人的结合使得恶意行为者能够通过大规模生成类人内容来操纵信息生态系统。现有的AI生成内容检测模型在真实场景中常常失效,主要原因是缺乏真实标注数据。我们通过一种对抗性方法弥补了这一空白,该方法模拟了恶意行为者对真实社交媒体用户的冒充。利用这种方法,我们整理了一个多语言、跨平台的人类与AI生成消息的配对数据集。在这样的对抗性数据上训练,能够实现对AI生成文本的准确检测。我们的方法在真实世界、分布外数据上显著优于现有的基于内容的机器人检测模型。

英文摘要

The convergence of large language models and social bots allows malicious actors to manipulate the information ecosystem by generating human-like content at scale. Existing models for detecting AI-generated content often fail in the wild, primarily due to the lack of ground-truth data. We address this gap through an adversarial methodology that models the impersonation of real social media users by malicious actors. Using this methodology, we curate a multilingual, cross-platform dataset of paired human and AI-generated messages. Training on such adversarial data yields accurate detection of AI-generated text. Our approach significantly outperforms existing models for content-based bot detection in real-world, out-of-distribution data.

2606.07217 2026-06-08 cs.RO cs.CV cs.LG 新提交

Robotic Policy Adaptation via Weight-Space Meta-Learning

通过权重空间元学习实现机器人策略自适应

Christian Bianchi, Siamak Yousefi, Alessio Sampieri, Andrea Roberti, Luca Rigazio, Fabio Galasso, Luca Franco

发表机构 * ItalAI University of Verona Sapeinza University of Rome

AI总结 提出WIZARD框架,通过权重空间元学习从语言指令和演示视频生成任务特定LoRA参数,无需微调即可适应新任务,在LIBERO上性能提升高达14倍。

详情
AI中文摘要

视觉-语言-动作(VLA)模型正成为机器人操作的一种有前景的范式,能够从大规模演示和动作标签语料库中训练通用策略。然而,将这些模型适应新任务通常仍需要任务特定的演示、动作注释和额外的微调,使得部署成本高昂且难以扩展。我们提出WIZARD,一种权重空间元学习框架,通过为冻结的VLA策略生成任务特定的LoRA参数来避免任务特定的微调。仅凭语言指令和简短的演示视频,WIZARD即可在单次前向传播中预测相应的自适应权重,无需目标任务动作标签或测试时优化。在元训练期间,WIZARD学习将任务证据直接映射到专家LoRA更新,在权重空间中捕获任务之间的关系。在LIBERO上的实验表明,WIZARD在未见过的数据集集合上性能提升高达约2倍,在未见过的任务上提升高达约14倍。在Franka Emika Panda机器人上,WIZARD持续优于真实域自适应基线,表明生成的适配器提供了超越仿真的任务级特化。

英文摘要

Vision-Language-Action (VLA) models are emerging as a promising paradigm for robotic manipulation, enabling general-purpose policies trained from large corpora of demonstrations and action labels. However, adapting these models to new tasks still typically requires task-specific demonstrations, action annotations, and additional fine-tuning, making deployment costly and difficult to scale. We propose WIZARD, a weight-space meta-learning framework that sidesteps task-specific fine-tuning by generating task-specific LoRA parameters for a frozen VLA policy. Given only a language instruction and a short demonstration video, WIZARD predicts the corresponding adaptation weights in a single forward pass, without target-task action labels or test-time optimization. During meta-training, WIZARD learns to map task evidence directly to expert LoRA updates, capturing relationships between tasks in weight space. Experiments on LIBERO show that WIZARD improves performance by up to ~2x on unseen dataset collections and up to ~14x on unseen tasks. On a Franka Emika Panda, WIZARD consistently improves over a real-domain adapted baseline, showing that generated adapters provide task-level specialization beyond simulation.

2606.07211 2026-06-08 cs.RO cs.AI 新提交

An Abstract Architecture for Explainable Autonomy in Hazardous Environments

危险环境中可解释自主性的抽象架构

Matt Luckcuck, Hazel M Taylor, Marie Farrell

发表机构 * Maynooth University University of Manchester

AI总结 提出一种支持自主系统解释其行为的抽象架构,旨在通过设计可解释性增强用户信任,并以民用核工业为例展示应用。

详情
Comments
Originally published 20th of October 2022 at the Second International Workshop on Requirements Engineering for Explainable Systems (RE4ES), which was hosted by the International Requirements Engineering Conference 2022
AI中文摘要

自主机器人系统被提议用于危险环境,通常是为了减少人类工人的风险。在不久的将来,人类工人可能会继续使用和指挥这些自主机器人,就像其他计算机化工具一样,但具有更复杂的决策能力。因此,工程努力的一个重要方向是确保这些用户信任系统。最近的文献表明,可解释性与系统的可信度密切相关。与安全性和保密性属性一样,可解释性应该被设计到系统中,而不是事后添加。本文提出了一种抽象架构,支持自主系统解释其行为(可解释自主性),为实施可解释自主系统提供了设计模板。我们给出了一个工作示例,说明我们的架构如何应用于民用核工业,其中工人和监管机构都需要信任系统的决策能力。

英文摘要

Autonomous robotic systems are being proposed for use in hazardous environments, often to reduce the risks to human workers. In the immediate future, it is likely that human workers will continue to use and direct these autonomous robots, much like other computerised tools but with more sophisticated decision-making. Therefore, one important area on which to focus engineering effort is ensuring that these users trust the system. Recent literature suggests that explainability is closely related to how trustworthy a system is. Like safety and security properties, explainability should be designed into a system, instead of being added afterwards. This paper presents an abstract architecture that supports an autonomous system explaining its behaviour (explainable autonomy), providing a design template for implementing explainable autonomous systems. We present a worked example of how our architecture could be applied in the civil nuclear industry, where both workers and regulators need to trust the system's decision-making capabilities.

2606.07210 2026-06-08 cs.SD cs.CR 新提交

A Large-Scale Per-Speaker Analysis of Re-identification Risk in Speech Anonymization

语音匿名化中重识别风险的大规模每说话人分析

Orane Dufour, Paul Magron, Mickael Rouvier, Emmanuel Vincent

发表机构 * Université de Lorraine, CNRS, Inria, LORIA LIA, Avignon University

AI总结 通过大规模每说话人分析,发现语音匿名化中重识别风险在个体间差异巨大,且风险由攻击者、匿名化器和可用语音量共同决定,挑战了固有说话人隐私风险的概念。

详情
Comments
Accepted to Interspeech
AI中文摘要

语音匿名化通常使用平均情况指标(如等错误率)进行评估,这可能会掩盖个体间重识别风险的巨大差异。在本文中,我们基于最坏情况下的可链接性度量,进行了大规模每说话人隐私分析。评估了近5000名说话人在多个匿名化系统、攻击者架构和对话长度下的表现。虽然可链接性分数在说话人层面上高度极化,但易于重识别和难以重识别的说话人集合在不同配置下差异显著。我们表明,没有单一因素可以解释说话人的脆弱性。相反,重识别风险源于攻击者、匿名化器和可用语音量之间的相互作用。这些结果挑战了固有说话人级隐私风险的概念,并强调需要明确以攻击者和匿名化器为条件的评估协议。

英文摘要

Speech anonymization is commonly evaluated using averagecase metrics such as the equal error rate, which can hide large disparities in re-identification risks across individuals. In this paper, we conduct a large-scale per-speaker privacy analysis using a linkability-based metric under a worst-case scenario. Nearly 5,000 speakers are evaluated across multiple anonymization systems, attacker architectures, and conversation lengths. While linkability scores are highly polarized at the speaker level, the sets of easy to re-identify and hard to re-identify speakers vary substantially across configurations. We show that no single factor explains speaker vulnerability. Instead, the re-identification risk emerges from the interaction between the attacker, the anonymizer, and the amount of available speech. These results challenge the notion of intrinsic speaker-level privacy risks and emphasize the need for evaluation protocols that are explicitly conditioned on the attacker and anonymizer.

2606.07196 2026-06-08 cs.LG 新提交

Structure-Preserving Correction Learning for Sparse Bayesian Inference in Brain Source Imaging

脑源成像中稀疏贝叶斯推断的结构保持校正学习

Marco Morik, Xiao Ruiting, Shinichi Nakajima, Stefan Haufe, Ismail Huseynov

发表机构 * Berlin Institute for the Foundations of Learning and Data (BIFOLD), Germany Technische Universität Berlin, Germany RIKEN Center for Advanced Intelligence Project (AIP), Japan Physikalisch-Technische Bundesanstalt, Germany Charité – Universitätsmedizin Berlin, Germany

AI总结 提出一种结构保持的校正学习方法,通过展开经典联合超参数求解器为可训练神经网络,在保留贝叶斯结构的同时学习更新机制,提升M/EEG脑源成像的重建性能和收敛性。

详情
Comments
preprint
AI中文摘要

经典的稀疏Type-II贝叶斯方法用于M/EEG脑成像支持源和噪声超参数的联合估计,但依赖于固定的迭代更新规则。尽管这些更新是有原则且可解释的,但其动态无法从数据中适应。我们提出学习更新机制本身,同时通过将经典联合超参数求解器展开为可训练的神经架构(其层镜像原始迭代)来保留底层贝叶斯结构。得到的框架初始化为在训练前精确恢复经典求解器,并通过逐渐更具表达力的校正学习机制(从可学习偏置到自适应MLP和基于注意力的上下文细化)得到丰富。这样,训练不会用黑箱预测器替代贝叶斯推断,而是学习结构化的校正项,同时保留原始更新动态的可解释性和基于模型的特性。因此,结构保持校正学习旨在改善经验重建性能,而不替代原始的基于模型的推断机制。实验结果表明,学习的校正变体在保留算法透明性的同时,改善了基线展开求解器的重建性能和收敛行为。

英文摘要

Classical sparse Type-II Bayesian methods for M/EEG brain imaging support joint estimation of source and noise hyperparameters, but rely on fixed iterative update rules. Although these updates are principled and interpretable, their dynamics cannot be adapted from data. We propose to learn the update mechanism itself while preserving the underlying Bayesian structure by unfolding a classical joint hyperparameter-learning solver into a trainable neural architecture whose layers mirror the original iterations. The resulting framework is initialized to recover the classical solver exactly before training and is enriched through progressively more expressive correction-learning mechanisms, ranging from learnable biases to adaptive MLP and attention-based contextual refinements. In this way, training does not replace Bayesian inference with a black-box predictor, but instead learns structured correction terms while retaining the interpretability and model-based character of the original update dynamics. Structured correction learning therefore aims to improve empirical reconstruction performance without replacing the original model-based inference mechanism. Experimental results show that the learned correction variants improve reconstruction performance and convergence behavior over the baseline unfolded solver while preserving its algorithmic transparency.

2606.07193 2026-06-08 cs.RO 新提交

Shield-Loco: Shielding Locomotion Policies with Predictive Safety Filtering

Shield-Loco:基于预测性安全过滤的防护运动策略

Aditya Shirwatkar, Sebastian Sanokowski, Shishir Kolathaya, Aaron Johnson, Majid Khadiv

发表机构 * Robert Bosch Center for Cyber Physical Systems, Indian Institute of Science, Bangalore, India Munich Institute of Robotics and Machine Intelligence (MIRMI), Technical University of Munich, Munich, Germany Department of Computer Science & Automation, Indian Institute of Science, Bangalore, India Department of Mechanical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA Institute for Advanced Study, Technical University of Munich, Garching, Germany

AI总结 提出一种预测性安全过滤器,通过全物理模型优化接触序列,减少四足机器人在密集杂乱环境中的安全违规,同时保持任务性能。

详情
AI中文摘要

强化学习(RL)策略能够实现动态腿部运动,但缺乏避免训练中未出现的约束违反的机制。大规模离线安全学习对于覆盖所有边缘情况是不切实际的。现有的安全框架要么依赖无法推理全身行为的降阶模型,要么需要保守的恢复控制器,这会降低任务性能。我们提出一种预测性安全过滤器,它对输入到RL策略的名义接触位置进行事后过滤。当预测到碰撞时,基于采样的优化器使用全物理模型异步搜索更安全的接触序列,而学习的价值函数则引导长期回报。我们的三个算法组件(采样接触的几何投影、动量增强更新和副本交换)使得在不连续的接触景观中优化变得可行。我们在密集杂乱环境中的四足机器人上验证了该过滤器,无论是在仿真还是真实世界中,都显示出在最小偏离名义输入的情况下大幅减少安全违规。

英文摘要

Reinforcement learning (RL) policies enable dynamic legged locomotion but lack mechanisms to avoid violations of safety constraints that are absent during training. Large-scale offline safe learning is impractical for covering all edge cases. Existing safety frameworks either rely on reduced-order models that cannot reason about whole-body behaviors or require conservative recovery controllers that degrade task performance. We propose a predictive safety filter that post-hoc filters the nominal contact locations fed to the RL policy. When a collision is predicted, a sampling-based optimizer asynchronously searches for safer contact sequences using a full-physics model, while a learned value function bootstraps long-horizon returns. Our three algorithmic components (geometric projection of sampled contacts, momentum-augmented updates, and replica-exchange) make the optimization tractable in a discontinuous contact landscape. We validate the filter on a quadruped robot in dense, cluttered environments, both in simulation and in the real world, showing substantial reductions in safety violations with minimal deviation from the nominal input.

2606.07185 2026-06-08 cs.CV 新提交

AdaTok: Self-Budgeting Image Tokenization with Quality-Preserving Dynamic Tokens

AdaTok: 具有质量保持动态令牌的自预算图像令牌化

Xiaocheng Lu, Yuxi Chen, Jie Zhang, Jian Liu, Jingcai Guo, Fangqi Zhu, Tao Han, Song Guo

发表机构 * The Hong Kong University of Science and Technology The Hong Kong Polytechnic University

AI总结 提出AdaTok,一种自预算离散一维令牌化器,通过表示-分配协同设计(优先表示学习和自适应令牌分配)实现图像自适应令牌数量,在保持重建质量的同时减少平均令牌数。

详情
Comments
Preprint; 11 pages, 4 figures
AI中文摘要

图像令牌化器,从二维网格到最近的一维序列,通常用相同固定数量的令牌编码每张图像。然而视觉复杂度高度异质,因此统一预算在简单输入上过度开销,在复杂输入上不足。现有的弹性令牌化器暴露了可变长度重建,但通常将令牌长度作为部署时的操作点、搜索目标或外部预测,而非令牌化器本身的输出。在这项工作中,我们询问离散视觉令牌化器能否一次性自我预算。我们的核心发现是,可操作的弹性需要表示-分配协同设计:前缀必须在不同预算下保持可解码,且令牌化器必须学习每个图像需要哪个前缀。我们提出AdaTok,一种自预算离散一维令牌化器。AdaTok结合了优先表示学习(通过嵌套尾部掩码对令牌排序,并通过多头LoRA解码器头解决预算依赖的语义偏移)和自适应令牌分配(在候选预算上训练轻量级确定性组GRPO策略)。动态帕累托加权在策略训练期间平衡保真度和效率,无需手动权衡扫描。在ImageNet-1K上,AdaTok-Full在256个令牌时达到rFID 1.31,而AdaTok-Adaptive平均仅使用约118个令牌达到rFID 1.50,在可比预算下优于离散一维基线。在自回归图像生成中,较短的适应性表示相比固定256令牌解码实现了约2.1倍的吞吐量,表明视觉令牌数量可以学习为内容条件输出,而非设置为固定超参数。

英文摘要

Image tokenizers, from 2D grids to recent 1D sequences, typically encode every image with the same fixed number of tokens. Yet visual complexity is highly heterogeneous, so a uniform budget overspends on simple inputs and underserves complex ones. Existing elastic tokenizers expose variable-length reconstructions, but often leave token length as a deployment-time operating point, a search target, or an external prediction rather than an output of the tokenizer itself. In this work, we ask whether a discrete visual tokenizer can budget itself in one pass. Our central finding is that actionable elasticity requires a representation--allocation co-design: prefixes must remain decodable across budgets, and the tokenizer must learn which prefix each image needs. We propose AdaTok, a self-budgeting discrete 1D tokenizer. AdaTok combines Prioritized Representation Learning, which orders tokens with nested tail masking and resolves budget-dependent semantic shift through Multi-Head LoRA decoder heads, with Adaptive Token Allocation, which trains a lightweight deterministic-group GRPO policy over candidate budgets. Dynamic Pareto Weighting balances fidelity and efficiency during policy training without manual trade-off sweeps. On ImageNet-1K, AdaTok-Full reaches rFID 1.31 at 256 tokens, while AdaTok-Adaptive attains rFID 1.50 using only ~118 tokens on average, outperforming discrete 1D baselines at comparable budgets. In autoregressive image generation, the shorter adaptive representation yields ~2.1x throughput over a fixed 256-token decode, suggesting that visual token count can be learned as a content-conditioned output rather than set as a fixed hyperparameter.

2606.07183 2026-06-08 cs.CL 新提交

Geometry of Semantic Space: Comparative Study of Discrete and Continuous Models

语义空间的几何:离散与连续模型的比较研究

Gabriel Bounias, Sabine Ploux

发表机构 * ISC-PIF (Institut des Systemes Complexes de Paris IdF), CNRS, France CAMS (Centre d’analyse et de mathématique sociales), CNRS & EHESS, Paris, France

AI总结 本研究比较了监督向量嵌入(如CamemBERT)与词汇共现图在语义几何上的差异,发现图模型结构更清晰可读,而Transformer嵌入的拓扑分布不理想。

详情
Comments
9 pages, 7 figures
AI中文摘要

这项工作考察了NLP模型背后的语义几何。我们比较了监督向量嵌入(如CamemBERT)与更直接编码语义关系的词汇共现图。虽然基于Transformer的嵌入取得了强劲性能,但它们诱导的几何结构往往显示出不令人满意的分布。相比之下,基于图的模型揭示了更清晰、更易读的意义组织。我们实现了一种方法,允许我们基于这两种方法诱导的图结构或嵌入拓扑进行比较分析。比较结果——应用于法国“大国家辩论”语料库(公众辩论中公民贡献的集合)——显示了相似的局部拓扑,但非常不同的整体结构和拓扑。这些发现表明深度监督模型与基于图的模型之间存在互补视角,为引导神经架构朝向更稳定和可解释的图结构收敛提供了新途径。

英文摘要

This work examines the semantic geometry underlying NLP models. We compare supervised vector embeddings, such as CamemBERT, with lexical co-occurrence graphs that encode semantic relations more directly. While transformer-based embeddings achieve strong performance, their induced geometries often display unsatisfactory distributions. In contrast, graph-based models reveal a clearer and more human-readable organization of meaning. We have implemented a methodology that allows us to perform a comparative analysis either based on the structure of the graphs or based on the topology of the embeddings induced by these two approaches. The results of the comparison -- applied to the French "Great National Debate" corpus a collection of citizen contributions to the public debate -- show a similar local topology but a very different overall structure and topology. Theses findings suggest complementary perspectives between deep supervised models and graph-based models, considering a new pathway to guide neural architectures toward more stable and interpretable convergence with graphs structures.

2606.07180 2026-06-08 cs.CV cs.LG 新提交

OPTIMUS-Prime: Minimal and Sufficient Concept Explanations for Deep Vision Models

OPTIMUS-Prime:深度视觉模型的最小且充分的概念解释

Arthur Hoarau, Chenrui Zhu, Vu Linh Nguyen

发表机构 * Université de Lorraine, CentraleSupélec Loria, CNRS, Metz, France Université de technologie de Compiègne UMR CNRS 7253 Heudiasyc, France

AI总结 提出OPTIMUS框架,基于主蕴含项理论生成视觉热图解释,满足充分性和最小性,提供形式化保证。

详情
AI中文摘要

自动化决策中日益增长的透明度需求已将可解释人工智能(XAI)推向机器学习研究的前沿。然而,在计算机视觉中,现有的解释方法通常优先考虑最终用户的可访问性,而牺牲了形式化保证,在实用性和理论严谨性之间留下了关键差距。在本文中,我们通过引入OPTIMUS(一种用于深度分类模型的基于概念的可视化解释的新框架)来弥补这一差距。OPTIMUS解释采用视觉热图的形式,不仅对最终用户保持可解释性,而且基于成熟的主蕴含项理论,提供了现有基于显著性方法所缺乏的形式化保证。具体来说,OPTIMUS解释满足两个理想性质:充分性,确保被强调的概念可证明地保证分类器的预测;以及最小性,确保这些概念的严格子集不再保留此保证。这两个性质共同产生了逻辑上紧凑且视觉上连贯的解释。我们在视觉分类基准上验证了我们的方法,证明OPTIMUS热图自然且忠实地呈现了模型预测背后的决策相关概念。

英文摘要

The growing demand for transparency in automated decision-making has propelled eXplainable Artificial Intelligence (XAI) to the forefront of machine learning research. In computer vision, however, existing explanation methods often prioritize end-user accessibility at the expense of formal guarantees, leaving a critical gap between practical utility and theoretical rigor. In this paper, we address this gap by introducing OPTIMUS, a novel framework for generating concept-based visual explanations for deep classification models. OPTIMUS explanations take the form of visual heatmaps that not only remain interpretable to end users, but are grounded in the well-established theory of prime implicants, providing formal guarantees that have been largely absent from existing saliency-based methods. Specifically, OPTIMUS explanations satisfy two desirable properties: sufficiency, ensuring that the highlighted concepts provably guarantee the classifier's prediction, and minimality, ensuring that no strict subset of those concepts retains this guarantee. Together, these properties yield explanations that are both logically tight and visually coherent. We validate our approach on a visual classification benchmark, demonstrating that OPTIMUS heatmaps naturally and faithfully surface the decision-relevant concepts underlying model predictions.

2606.07179 2026-06-08 cs.CV cs.MM eess.IV 新提交

EvoGS: Constructing Continuous-Layered Gaussian Splatting with Evolution Tree for Scalable 3D Streaming

EvoGS:基于进化树构建连续分层高斯泼溅以实现可扩展3D流式传输

Yuang Shi, Simone Gasparini, Géraldine Morin, Wei Tsang Ooi

发表机构 * National University of Singapore IRIT - University of Toulouse IPAL, IRL2955

AI总结 提出EvoGS,首个连续分层高斯泼溅表示,通过进化树结构实现父-子细化,消除冗余并支持可扩展3D流式传输,传输负载和显存占用分别降低2.4倍和5.5倍。

详情
Comments
Project page: https://yuang-ian.github.io/evogs/
AI中文摘要

流式传输3D高斯泼溅需要高度可扩展的渐进式表示。现有渐进式方法依赖\textit{离散分层},为每个细节层次累积独立的泼溅集。层间的结构独立性固有地导致误差累积、严重的泼溅冗余以及不受控的质量过渡。我们提出EvoGS,首个\textit{连续分层}表示。EvoGS组织为进化树,通过显式的、受小波启发的父-子细化生成更精细的细节。这使得子节点能够结构性地纠正祖先误差,产生固有稀疏且高度可压缩的层间信号。大量实验表明,EvoGS将泼溅冗余从超过65%降至低于25%。与最先进的基线相比,它分别将传输负载和GPU显存占用降低高达2.4倍和5.5倍,并实现了适用于实时自适应流式传输的平滑质量过渡。项目页面:此 https URL

英文摘要

Streaming 3D Gaussian Splatting requires highly scalable, progressive representations. Existing progressive methods rely on \textit{discrete layering}, accumulating separate splat sets for each level of detail. This structural independence between layers inherently leads to error accumulation, severe splat redundancy, and uncontrolled quality transitions. We propose EvoGS, the first \textit{continuous-layering} representation. Organized as an Evolution Tree, EvoGS generates finer details via an explicit, wavelet-inspired parent-child refinement. This empowers child nodes to structurally correct ancestral errors, yield inherently sparse and highly compressible inter-layer signals. Extensive experiments show EvoGS eliminates splat redundancy from over 65\% to under 25\%. Compared to state-of-the-art baselines, it reduces transmission payload and GPU VRAM footprint by up to 2.4$\times$ and 5.5$\times$, respectively, and achieves smooth quality transitions optimal for real-time adaptive streaming. Project page: https://yuang-ian.github.io/evogs/

2606.07175 2026-06-08 cs.CV 新提交

Seeing Without Exposing: Adaptive Privacy Control for Open-World, Context-Hungry MLLMs

看见而不暴露:面向开放世界、上下文饥渴型MLLM的自适应隐私控制

Siyuan Xu, Yibing Liu, Peilin Chen, Yung-Hui Li, Shiqi Wang, Sam Kwong

发表机构 * City University of Hong Kong Hon Hai Research Institute Lingnan University

AI总结 针对多模态大语言模型在开放世界中面临不可预测敏感信息泄露的隐私挑战,提出无训练方法APD,将隐私元素漂移至语义等价替代物并锚定上下文线索,结合新基准AdaptShield实现隐私保护与上下文保留的平衡提升。

详情
AI中文摘要

多模态大语言模型(MLLM)引发了新的隐私挑战。在数据方面,用户提供的输入通常包含不可预测的敏感信息;而在下游任务方面,模型推理依赖于丰富的视觉上下文,这些上下文本身可能涉及隐私敏感信息。然而,现有的隐私保护方法依赖于预定义的敏感类别和固定的混淆策略,难以应对MLLM中的此类挑战。为解决这一困境,我们提出了锚定隐私漂移(APD),一种无需训练的方法,它将隐私敏感元素漂移到语义等价的替代物,同时将上下文线索锚定到源图像。为了系统评估这种隐私保护和上下文保留的双重目标,我们引入了AdaptShield,一个涵盖22个隐私类别的综合基准,它将传统隐私度量与基于MLLM的上下文效用评估相结合。大量实验表明,我们的方法在隐私净化和内容保留方面实现了平衡改进,在四个MLLM系列(即Qwen2.5、Qwen3、InternVL3和InternVL3.5)上,文本类别的平均增益为10.4%,基于MLLM的评估平均增益为8.5%。

英文摘要

Multimodal large language models (MLLMs) have raised new privacy challenges. On the data side, user-provided inputs often include unpredictable sensitive information; while on the downstream task side, model reasoning depends on rich visual context that may itself be privacy-sensitive. Existing privacy protection methods, however, rely on predefined sensitive categories and fixed obfuscation strategies, struggling to tackle such challenges in MLLMs. To address this dilemma, we propose Anchored Privacy Drifting (APD), a training-free method that drifts privacy-sensitive elements toward semantically equivalent alternatives while anchoring contextual cues to the source image. To systematically evaluate this dual objective of privacy protection and contextual preservation, we introduce AdaptShield, a comprehensive benchmark covering 22 privacy categories, which combines conventional privacy metrics with MLLM-based assessments of contextual utility. Extensive experiments show that our method achieves balanced improvements in both privacy sanitization and content retention, with average gains of 10.4% on textual categories and 8.5% under MLLM-based evaluation across four MLLM series, i.e., Qwen2.5, Qwen3, InternVL3, and InternVL3.5.

2606.07171 2026-06-08 cs.CV 新提交

When Recovery Matters: The Blind Spot of Surrogate Privacy in MLLM Editing

当恢复至关重要时:MLLM编辑中替代隐私的盲点

Siyuan Xu, Yibing Liu, Peilin Chen, Yung-Hui LI, Shiqi Wang, Sam Kwong

发表机构 * City University of Hong Kong Hon Hai Research Institute Lingnan University

AI总结 针对多模态大模型编辑中的隐私风险,提出首个面向恢复的替代隐私保护编辑基准SPPE,涵盖36个细粒度隐私类别和65个编辑指令,并设计可编辑性评估与替代到源编辑恢复两个任务及对应方法。

详情
AI中文摘要

多模态大语言模型(MLLM)支持灵活的指令驱动图像编辑,但当用户图像暴露多样且用户特定的私有内容时,会产生隐私风险。典型的隐私保护策略通常在云端编辑前用替代内容替换敏感区域。然而,结果输出往往是编辑后的替代图像而非期望的编辑后源图像,在设计和评估范围中都忽略了局部恢复。为此,我们引入SPPE(基于替代的隐私保护编辑),这是首个面向恢复的基准,涵盖36个细粒度隐私类别和65个编辑指令。它定义了两个互补任务:1)可编辑性评估,在云端交互前估计替代图像是否能产生与原始图像一致的编辑;2)替代到源编辑恢复,评估编辑后的替代图像是否能转移回私有源图像并保留编辑效果。我们为每个任务提出了专用方法:ERMA通过指令感知的多模态关系建模预测替代可编辑性,而C2E-S2SER通过使用替代编辑对作为视觉编辑证据和源图像作为源保留锚点来执行循环一致性恢复。在SPPE和InstructPix2Pix上的实验表明,两个任务均有一致改进。对于可编辑性评估,ERMA在SRCC上比最佳基线提升13.9%,在PLCC上提升12.3%。对于替代到源编辑恢复,C2E-S2SER在SPPE的所有8个源完整性和编辑一致性指标上优于SOER。

英文摘要

Multimodal Large Language Models (MLLMs) enable flexible instruction-driven image editing, but privacy risks arise when user images expose diverse and user-specific private content. Canonical privacy protection strategies typically substitute sensitive regions with surrogate content before cloud editing. Yet, the resulting output is often an edited surrogate rather than the desired edited source image, neglecting the local recovery in both design and evaluation scope. To this end, we introduce SPPE (Surrogate-based Privacy-Preserving Editing), the first recovery-oriented benchmark covering 36 fine-grained privacy categories and 65 editing instructions. It defines two complementary tasks: 1) editability assessment, which estimates before cloud interaction whether a surrogate can induce an edit consistent with the original image; and 2) surrogate-to-source edit recovery, which evaluates whether the edited surrogate can be transferred back to the private source with the edit effect preserved. We address each task with a dedicated method: ERMA predicts surrogate editability through instruction-aware multimodal relation modeling, while \method performs cycle-consistent recovery by using the surrogate editing pair as visual edit evidence and the source image as a source-preserving anchor. Experiments on SPPE and InstructPix2Pix show consistent improvements on both tasks. For editability assessment, ERMA improves over the best-performing baselines by 13.9% in SRCC and 12.3% in PLCC. For surrogate-to-source edit recovery, C2E-S2SER outperforms SOER across all 8 source integrity and edit consistency metrics on SPPE.

2606.07170 2026-06-08 cs.RO 新提交

Test-Time Trajectory Optimization for Autonomous Driving

自动驾驶的测试时轨迹优化

Yihong Xu, Eloi Zablocki, Yuan Yin, Elias Ramzi, Ellington Kirby, Alexandre Boulch, Matthieu Cord

发表机构 * valeo.ai Sorbonne Université CNRS ISIR

AI总结 提出TOAD方法,在测试时使用交叉熵方法优化轨迹,无需重新训练即可提升多种规划器的性能。

详情
AI中文摘要

端到端的自动驾驶规划器通常生成一组候选轨迹,对每个轨迹评分,并返回得分最高的候选轨迹。然而,评分器仅在生成候选轨迹后应用,无法影响轨迹集合:无论评分器质量如何,候选轨迹集较弱会限制规划性能。我们转而将评分器视为学习到的轨迹级奖励函数,并搜索最大化该奖励的轨迹。我们的方法TOAD在测试时运行交叉熵方法,从规划器的候选轨迹进行热启动。它无需重新训练,可即插即用于现有规划器。在六个基础规划器上,TOAD在NAVSIM-v1(94.7 PDMS)、NAVSIM-v2(56.3 EPDMS)和闭环HUGSIM基准测试中提升了结果。代码将通过项目页面公开:this https URL。

英文摘要

End-to-end planners for autonomous driving typically generate a set of candidate trajectories, score each one, and return the highest-scoring candidate. However, the scorer is applied only after the proposals are generated and cannot influence the set of trajectories: a weak set of candidates limits planning performance regardless of the scorer's quality. We instead treat the scorer as a learned trajectory-level reward function and search for trajectories that maximize it. Our method, TOAD, runs the Cross-Entropy Method at test time, warm-started from the planner's proposals. It requires no retraining and is plug-and-play for existing planners. Across six base planners, TOAD improves results on NAVSIM-v1 (94.7 PDMS), NAVSIM-v2 (56.3 EPDMS), and the closed-loop HUGSIM benchmark. The code will be made publicly available via the project page: https://valeoai.github.io/TOAD/.

2606.07167 2026-06-08 cs.CL cs.AI 新提交

UrduMMLU: A Massive Multitask Benchmark for Urdu Language Understanding

UrduMMLU:乌尔都语理解的大规模多任务基准

Ahmer Tabassum, Sarfraz Ahmad, Hasan Iqbal, Owais Aijaz, Momina Ahsan, Preslav Nakov

发表机构 * MBZUAI

AI总结 针对乌尔都语缺乏本地教育来源的MMLU风格基准,提出包含26,431道多选题的UrduMMLU,覆盖26个学科,评估30个LLM发现Gemini-3.5-Flash最佳,多数模型在人文科目上表现差。

详情
Comments
27 pages, 18 figures, 17 tables, Submitted to ARR May 2026
AI中文摘要

有意义的 multilingual 评估必须在目标语言和教育背景下测试模型。乌尔都语有超过2.3亿人使用,但缺乏从本地教育来源构建的广泛MMLU风格基准。我们提出UrduMMLU,一个包含26,431道乌尔都语多选题的基准,涵盖26个学科和五个领域,数据来自本地乌尔都语题库和公开考试PDF。与基于翻译的资源不同,UrduMMLU既包括标准学术科目,也包括乌尔都语和地区特定内容。我们通过双重人工标注和严格共识过滤对考试部分进行标注。我们在英语和乌尔都语提示下评估了30个LLM,进行了60次零样本评估,并进一步在两种提示语言的多个少样本设置下评估了四个开源LLM。Gemini-3.5-Flash表现最佳,准确率达到90.20%和90.34%,而其他模型均未超过85%。最强的开源模型落后7.79和8.92个百分点,许多模型在乌尔都语人文科目上比STEM科目损失25到40个百分点。少样本提示仅带来微小提升。UrduMMLU表明,当前LLM中乌尔都语知识仍不均匀,尤其是地区性内容。

英文摘要

Meaningful multilingual evaluation must test models in the target language and educational context. Urdu, spoken by more than 230 million people, lacks a broad MMLU-style benchmark built from native educational sources. We introduce UrduMMLU, a benchmark of 26,431 Urdu MCQs across 26 subjects and five domains, collected from native Urdu MCQ banks and public examination PDFs. Unlike translation-based resources, UrduMMLU covers both standard academic subjects and Urdu- and region-specific content. We label the exam-derived portion through dual human annotation with strict consensus filtering. We evaluate 30 LLMs under English and Urdu prompts, yielding 60 zero-shot evaluations, and further evaluate four open-source LLMs under multiple few-shot settings across both prompt languages. Gemini-3.5-Flash performs best, reaching 90.20% and 90.34% accuracy, while no other model exceeds 85%. The strongest open-source model trails by 7.79 and 8.92 points, and many models lose 25 to 40 points on Urdu-centered Humanities subjects compared with STEM. Few-shot prompting yields only modest gains. UrduMMLU shows that Urdu knowledge remains uneven in current LLMs, especially for regionally grounded content.

2606.07161 2026-06-08 cs.CV 新提交

TraRA: Trajectory-level Recognition Aggregation for Video Text Spotting in Urban Surveillance

TraRA: 面向城市监控视频文本识别的轨迹级识别聚合方法

Duc Tri Tran, Trung Thanh Nguyen, Vijay John, Phi Le Nguyen, Yasutomo Kawanishi

发表机构 * RIKEN Hanoi University of Science and Technology Nagoya University Lawrence Technological University Ritsumeikan University

AI总结 提出TraRA方法,通过轨迹级文本识别聚合,利用时间与多模态一致性,解决监控视频中运动模糊、遮挡等导致的帧级识别不一致问题,在多个基准上提升跟踪与识别性能。

详情
Comments
22nd IEEE International Conference on Advanced Visual and Signal-Based Systems
AI中文摘要

视频文本识别(VTS)对于城市监控和智能交通系统至关重要,能够自动读取视频流中的街道标志、车辆标记和场景文本。然而,由于监控场景中常见的动态视频因素(包括运动模糊、遮挡和尺度变化)导致帧级识别退化,可靠识别仍然具有挑战性。现有的VTS方法通常对每一帧独立进行识别,导致跨序列的结果不一致且不准确。为了解决这些限制,我们提出了TraRA(面向VTS的轨迹级识别聚合),这是一种即插即用的方法,通过利用时间和多模态一致性执行轨迹级文本识别。TraRA集成了两个关键模块:(1)时间聚类和(2)视觉-语言聚合。前者通过分组时间和视觉上一致的文本实例来细化噪声轨迹,而后者采用低秩自适应增强的视觉-语言模型,融合跨帧的视觉线索与语言上下文。通过聚合整个文本轨迹的信息,TraRA即使在具有挑战性的监控条件下也能实现鲁棒的文本识别。在四个公共基准(包括道路和城市场景数据集RoadText、BOVText、ArTVideo和ICDAR15)上进行的大量实验表明,与最先进的VTS方法相比,TraRA持续提升了跟踪和识别性能。源代码可在该网址获取。

英文摘要

Video Text Spotting (VTS) is essential for urban surveillance and intelligent transportation systems, enabling automated reading of street signs, vehicle markings, and scene text in video streams. However, reliable recognition remains challenging due to dynamic video factors common in surveillance scenarios, including motion blur, occlusion, and scale variation, which degrade frame-level recognition. Existing VTS methods typically perform recognition independently on each frame, leading to inconsistent and inaccurate results across sequences. To address these limitations, we propose TraRA (Trajectory-level Recognition Aggregation for VTS), a plug-and-play method that performs trajectory-level text recognition by leveraging temporal and multimodal consistency. TraRA integrates two key modules: (1) the Temporal Clustering and (2) the Vision-Language Aggregation. The former refines noisy trajectories by grouping temporally and visually coherent text instances, while the latter employs a Low-Rank Adaptation-enhanced Vision-Language model to fuse visual cues with linguistic context across frames. By aggregating information over entire text trajectories, TraRA achieves robust text recognition even under challenging surveillance conditions. Extensive experiments on four public benchmarks, including road and urban scene datasets (RoadText, BOVText, ArTVideo, and ICDAR15), demonstrate that TraRA consistently improves tracking and recognition performance over state-of-the-art VTS methods. The source code is available at https://github.com/trid2912/TraRA.

2606.07146 2026-06-08 cs.LG cs.CE 新提交

Decision-Aware Evaluation of Physics-Informed Surrogates

决策感知的物理信息替代模型评估

Daniel Cieślak, Andrzej Czyżewski

发表机构 * Gdańsk University of Technology

AI总结 针对物理信息机器学习在工程决策中的评估,提出pinn-gym基准,通过曲线误差、物理可行性、top-k检索和遗憾值等多维度指标,揭示低nRMSE不足以识别有用设计,且物理信息损失改变权衡而非单调改进所有指标。

详情
Comments
12 pages, 5 figures, 9 tables. Code and data available at https://github.com/Dyniel/pinn-gym
AI中文摘要

物理信息机器学习通常通过曲线误差来评估,尽管工程应用取决于下游决策:对候选方案进行排序、避免不可行设计以及限制遗憾值。我们引入了pinn-gym,一个用于材料条件晶格设计的开放基准,它结合了一个透明的降阶碰撞冲击预言机、五种可打印聚合物卡片、无量纲力响应目标以及一个涵盖曲线保真度、物理可行性、top-k检索和质量遗憾值的协议。在逐材料、混合和跨材料设置中,低nRMSE通常不足以识别有用的设计选择。物理信息损失改变了权衡,而不是单调地改进所有指标,并且无量纲条件化提高了可比性,但并未使迁移对称。该基准不是经过认证的材料模型;在发布的预言机、候选生成器和材料卡片中,pinn-gym提供了一个可重复的测试平台,用于评估PIML替代模型作为决策系统,而不仅仅是曲线预测器。

英文摘要

Physics-informed machine learning is often assessed by curve error, although engineering use depends on downstream decisions: ranking candidates, avoiding infeasible designs and limiting regret. We introduce pinn-gym, an open benchmark for material-conditioned lattice design that couples a transparent reduced-order crush-and-impact oracle with five printable polymer cards, dimensionless force-response targets and a protocol spanning curve fidelity, physical admissibility, top-k retrieval and mass regret. Across per-material, pooled and cross-material settings, low nRMSE is frequently insufficient to identify useful design selections. Physics-informed losses alter trade-offs rather than monotonically improving all metrics, and dimensionless conditioning improves comparability without making transfer symmetric. The benchmark is not a certified material model; within the released oracle, candidate generator and material cards, pinn-gym provides a reproducible testbed for evaluating PIML surrogates as decision systems rather than curve predictors alone.

2606.07145 2026-06-08 cs.CV 新提交

Consistent-Inversion: Reverse Consistency Guidance for Structure-Preserving Visual Editing

Consistent-Inversion: 用于结构保持视觉编辑的反向一致性引导

Xiaocheng Lu, Jingcai Guo, Song Guo

发表机构 * Hong Kong University of Science and Technology The Hong Kong Polytechnic University

AI总结 提出Consistent-Inversion,一种无训练的反向一致性引导框架,通过检查中间目标轨迹能否在源提示下反向到源反转轨迹,并利用反向一致性差异校正早期去噪步骤,在保持结构的同时提升编辑效果。

详情
Comments
Submitted to IEEE Transactions on Multimedia; 10 pages, 4 figures
AI中文摘要

文本引导的扩散模型已成为真实图像视觉编辑的有效工具,其中编辑后的图像必须遵循目标指令,同时保持与编辑无关的结构。大多数无训练编辑器依赖于反转:源图像被映射到一个噪声潜变量轨迹,终端潜变量被重新用于目标提示去噪。这种重用有助于保持结构,但也耦合了源重建和目标编辑。由此产生的轨迹不匹配可能会损害背景/布局细节,或过度约束预期编辑。本文提出Consistent-Inversion,一种用于结构保持视觉编辑的无训练反向一致性引导框架。Consistent-Inversion不将反转后的源潜变量视为固定初始化,而是检查中间目标轨迹是否能在源提示下反向到源反转轨迹。为使这一检查明确,我们构建了一个辅助的目标侧噪声表示,执行源引导的反向去噪,并将得到的反向一致性差异作为校正信号,用于选定的早期目标去噪步骤。该方法不更新模型参数,与基于反转的编辑器兼容,且在稀疏应用时仅引入少量推理开销。在PIE-Bench上的实验表明,Consistent-Inversion在统一的SD3.5协议下提高了背景和结构保真度,同时保持目标提示对齐,兼容性实验进一步验证了相同校正原则在经典Stable-Diffusion反转流水线上的有效性。

英文摘要

Text-guided diffusion models have become effective tools for real-image visual editing, where the edited image must follow a target instruction while preserving editing-irrelevant structure. Most training-free editors rely on inversion: a source image is mapped to a noisy latent trajectory and the terminal latent is reused for target-prompt denoising. This reuse is useful for preservation, but it also couples source reconstruction and target editing. The resulting trajectory mismatch may either damage background/layout details or over-constrain the intended edit. This paper presents Consistent-Inversion, a training-free reverse consistency guidance framework for structure-preserving visual editing. Instead of treating the inverted source latent as a fixed initialization, Consistent-Inversion checks whether an intermediate target trajectory can be reversed toward the source inversion trajectory under the source prompt. To make this check well-defined, we construct an auxiliary target-side noise representation, perform source-guided reverse denoising, and use the resulting reverse consistency discrepancy as a correction signal for selected early target denoising steps. The method does not update model parameters, is compatible with inversion-based editors, and introduces only a small inference overhead when applied sparsely. Experiments on PIE-Bench show that Consistent-Inversion improves background and structural fidelity under a unified SD3.5 protocol while maintaining target-prompt alignment, and compatibility experiments further verify the same correction principle on classical Stable-Diffusion inversion pipelines.

2606.07141 2026-06-08 cs.LG cs.AI 新提交

REMEDI: A Benchmark for Retention and Unlearning Evaluation in Multi-label Clinical Disease Inference

REMEDI:多标签临床疾病推断中的保留与遗忘评估基准

Anurag Sharma, Sai Teja Chunchu, Prasenjit Mitra, Sandipan Sikdar, Koustav Rudra

发表机构 * IIT Kharagpur Carnegie Mellon University L3S Research Center, Leibniz University Hannover

AI总结 提出REMEDI基准,针对多标签临床疾病推断中的机器遗忘问题,利用MIMIC-III数据库评估现有方法在效用与遗忘性能间的权衡,并发现其不适用于多标签任务。

详情
Comments
Under review
AI中文摘要

用于临床疾病推断的语言模型在患者数据上进行训练,这些数据可能包含敏感和私人信息,数据所有者可能出于隐私或版权原因要求从训练模型中删除其数据。然而,精确遗忘患者特定数据是棘手的,而通过少量数据删除重新训练则资源密集。虽然存在几种可用的机器遗忘方法,但其效用通常局限于非医疗领域。此外,评估此类遗忘方法的现有基准主要使用合成数据集,这些数据集不能真正代表现实系统。因此,这些遗忘方法在医疗领域的有效性在很大程度上尚不清楚。为此,我们引入了REMEDI,一个针对多标签和多类别临床疾病推断的广泛机器遗忘基准,其中标签相关性、纵向结构和安全约束使得遗忘特别具有挑战性。与现有基准不同,REMEDI考虑:(1) 相关的应用领域(医疗),(2) 涉及多样遗忘实例集的全面遗忘设置,(3) 具有挑战性的遗忘场景,包括多标签和多类别分类任务,以及(4) 评估指标,涉及效用和遗忘程度的性能。REMEDI使用MIMIC-III临床数据库开发,该数据库包含患者的全面临床数据。现有遗忘方法的实验表明,效用和遗忘性能之间存在权衡。它们也大多不适合多标签分类任务。为促进可重复性,我们公开了我们的基准。

英文摘要

Language models trained for clinical disease inference are trained on patient data, which may include sensitive and private information, and data owners may request the removal of their data from a trained model due to privacy or copyright concerns. However, exactly unlearning patient-specific data is intractable, and retraining with minor data removal is resource-intensive. While there exists several machine unlearning methods that can be used, their utility is generally restricted to non-medical domains. Moreover, the existing benchmarks for evaluating such unlearning methods primarily utilize synthetically curated datasets, which are not truly representative of real-world systems. Hence, the effectiveness of these unlearning methods in the medical domain is largely unclear. To this end, we introduce REMEDI, an extensive benchmark for machine unlearning tailored to multi-label and multiclass clinical disease inference, where label correlations, longitudinal structure, and safety constraints make unlearning particularly challenging. Unlike the existing benchmarks, REMEDI considers: (1) a relevant application domain (medical), (2) comprehensive unlearning setups involving diverse sets of forget instances, (3) challenging unlearning scenarios including multi-label and multi-class classification tasks, and (4) evaluation metrics involving performance both in terms of utility and extent of unlearning achieved. REMEDI is developed using the MIMIC-III clinical database that contains comprehensive clinical data of patients. Experiments with existing unlearning methods indicate that there exists a trade-off between utility and unlearning performance. They are also largely unsuited to multi-label classification tasks. To facilitate reproducibility, we make our benchmark publicly available.

2606.07134 2026-06-08 cs.LG 新提交

$α$-PFN: Fast Entropy Search via In-Context Learning

$\alpha$-PFN:通过上下文学习实现快速熵搜索

Herilalaina Rakotoarison, Steven Adriaensen, Tom Viering, Carl Hvarfner, Samuel Müller, Frank Hutter, Eytan Bakshy

发表机构 * University of Freiburg University of Tübingen University of Amsterdam Lund University Meta

AI总结 提出一种两阶段摊销策略,利用先验数据拟合网络(PFN)在单次前向传播中近似熵搜索采集函数,实现超过50倍加速,在合成和真实基准上性能与最先进方法相当。

详情
Comments
Published at ICML 2026
AI中文摘要

信息论采集函数如熵搜索(ES)为贝叶斯优化(BO)提供了原则性的探索-利用框架。然而,它们的实际实现依赖于复杂且缓慢的近似,即信息增益的蒙特卡洛估计。这种复杂性可能引入数值误差,并需要专门的、手工定制的实现。我们提出了一种两阶段摊销策略,该策略学习使用先验数据拟合网络(PFN)在单次前向传播中近似基于熵搜索的采集函数。第一个PFN被训练为以最优值的信息为条件;第二个$\alpha$-PFN通过训练来预测期望信息增益,该训练基于使用第一个PFN测量的信息增益。$\alpha$-PFN提供了一种灵活的学习近似,用每个候选点的单次前向传播取代了复杂的启发式近似,实现了快速且可扩展的采集评估。实验上,我们的方法在合成和真实世界基准上与最先进的熵搜索实现具有竞争力,同时在我们所有实验中加速了不同的熵搜索变体,加速比超过50倍。源代码:此https URL。

英文摘要

Information-theoretic acquisition functions such as Entropy Search (ES) offer a principled exploration-exploitation framework for Bayesian optimization (BO). However, their practical implementation relies on complicated and slow approximations, i.e., a Monte Carlo estimation of the information gain. This complexity can introduce numerical errors and requires specialized, hand-crafted implementations. We propose a two-stage amortization strategy that learns to approximate entropy search-based acquisition functions using Prior-data Fitted Networks (PFNs) in a single forward pass. A first PFN is trained to be conditioned on information about the optima; second, the $α$-PFN is trained to predict the expected information gain by training on information gains measured with the first PFN. The $α$-PFN offers a flexible learned approximation, which replaces the complex heuristic approximations with a single forward pass per candidate, enabling rapid and extensible acquisition evaluation. Empirically, our approach is competitive with state-of-the-art entropy search implementations on synthetic and real-world benchmarks, while accelerating the different entropy search variants across all our experiments, with speed ups over 50x. Source code: https://github.com/automl/AlphaPFN.

2606.07130 2026-06-08 cs.CL 新提交

Explicit Evidence Grounding via Structured Inline Citation Generation

通过结构化内联引文生成实现显式证据基础

Anar Yeginbergen, Amelie Wührl, Anna Rogers, Rodrigo Agerri

发表机构 * University of the Basque Country (UPV/EHU) IT University of Copenhagen

AI总结 提出FullCite框架,通过提示生成、约束解码和后处理跨度对齐三种策略生成结构化内联引文,在三个QA基准上评估引文质量和忠实性,发现LLMs虽能识别相关文档但难以精确定位支持性证据跨度。

详情
AI中文摘要

随着AI系统被更广泛采用,对事实性和忠实性生成的需求日益增长。因此,通过引文适当归因信息变得至关重要。本文介绍了FullCite,一个与大多数先前工作不同,生成结构化内联引文的框架,将每个主张链接到其源文档和支持证据。FullCite提出了三种内联引文生成策略:基于提示的生成、在引文语法上的约束解码以及事后跨度对齐。使用三个问答基准,即ASQA、BioASQ和ExpertQA,我们从三个维度评估引文质量和忠实性:文档级正确性、证据跨度识别以及主张-引文忠实性。我们的评估表明,虽然LLMs通常能有效识别相关文档,但它们在识别文档内精确的支持性跨度方面存在困难。这一差距表明,实现忠实的归因问答需要研究更加重视精确的证据跨度识别。

英文摘要

As AI systems become more widely adopted, the demand for factual and faithful generation grows. Properly attributing information through citations becomes, therefore, crucial. This work introduces FullCite, a framework that, in contrast to most previous works, generates structured inline citations linking each claim to both its source document and supporting evidence. FullCite proposes three strategies to inline citation generation: prompt-based generation, constrained decoding over a citation grammar, and posthoc span alignment. Using three question answering benchmarks, namely, ASQA, BioASQ, and ExpertQA, we assess citation quality and faithfulness along three dimensions: document-level correctness, evidence span identification, and claim-citation faithfulness. Our evaluation shows that while LLMs are generally effective at identifying relevant documents, they struggle to identify the precise supporting spans within them. This gap suggests that achieving faithful attributed QA will require research to place greater emphasis on precise evidence span identification.

2606.07127 2026-06-08 cs.LG 新提交

Learning Explicit Behavioral Models with Adaptive Questions and World-Model Probes

通过自适应问题和世界模型探针学习显式行为模型

Hikaru Shindo, Yu Deng, Teng Cao, Quentin Delfosse, Christopher Tauchmann, Jannis Blüml, Gopika Sudhakaran, Kristian Kersting

发表机构 * Technical University of Darmstadt Hessian Center for Artificial Intelligence (hessian.AI) German Research Center for Artificial Intelligence (DFKI)

AI总结 提出显式符号行为模型(ESBM),通过自适应问题和世界模型探针将任务性能与可解释机制结合,在Atari任务中学习高分策略并生成显式答案和机制预测。

详情
AI中文摘要

仅针对任务回报训练的交互式智能体可以获得高分,但无法表示其动作成功的机制。这导致行为脆弱且难以诊断,并在环境动态变化时限制适应性。现有的LLM反思和策略代码修复可以从失败轨迹中修正行为,但问题和世界理解测试通常仅在训练后使用。我们引入了显式符号行为模型(ESBM),一种可训练的行为模型,将任务性能与基于证据的问答和可执行机制预测相结合。ESBM通过类型化谓词、加权规则、有界选项和机制记忆表示行为;机制层在动作干预下预测符号事件、对象变化、奖励和终止后果。每次滚动后,自适应问题和主动世界模型探针将得分失败、问答错误和转换预测错误转化为局部ESBM编辑的约束。候选模型通过多准则规则选择,该规则联合评估任务得分、可回答性和主动世界模型一致性。在测试的Atari风格协议下,ESBM学习高分策略,同时产生显式答案和可执行机制预测,表明自适应问题可以作为训练压力和可重用基准,用于该设置下的机制策略学习。

英文摘要

Interactive agents trained only against task return can achieve high scores while failing to represent the mechanisms that make their actions succeed. This makes brittle behavior difficult to diagnose and limits adaptation when environment dynamics change. Existing LLM reflection and policy-code repair can revise behavior from failed trajectories, but questions and world-understanding tests are usually used only after training. We introduce an Explicit Symbolic Behavioral Model (ESBM), a trainable behavioral model that couples task performance with evidence-grounded question answering and executable mechanism prediction. An ESBM represents behavior through typed predicates, weighted rules, bounded options and mechanism memory; the mechanism layer predicts symbolic events, object changes, rewards and terminal consequences under action interventions. After each rollout, adaptive questions and active world-model probes convert score failures, QA errors and transition-prediction errors into constraints for local ESBM edits. Candidate models are selected by a multi-criterion rule that jointly evaluates task score, answerability and active world-model consistency. Under the tested Atari-style protocols, ESBM learns high-scoring policies while producing explicit answers and executable mechanism predictions, indicating that adaptive questions can serve as both training pressure and reusable benchmarks for mechanistic policy learning in this setting.