arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2510.22973 2026-05-26 cs.CV

Scaling Up Occupancy-centric Driving Scene Generation: Dataset and Method

扩展以占据为中心的驾驶场景生成：数据集与方法

Bohan Li, Xin Jin, Hu Zhu, Hongsi Liu, Ruikai Li, Jiazhe Guo, Kaiwen Cai, Chao Ma, Yueming Jin, Hao Zhao, Xiaokang Yang, Wenjun Zeng

AI总结针对占据数据稀缺问题，构建最大语义占据数据集Nuplan-Occ，并提出统一框架联合生成高质量语义占据、多视角视频和LiDAR点云，采用时空解耦架构及高斯泼溅稀疏点图渲染和传感器感知嵌入策略，实现高保真生成。

Comments IEEE TPAMI

详情

AI中文摘要

驾驶场景生成是自动驾驶的关键领域，支持下游应用，包括感知和规划评估。以占据为中心的方法通过提供跨帧和模态的一致条件，最近取得了最先进的结果；然而，其性能严重依赖于标注的占据数据，而这类数据仍然稀缺。为克服这一限制，我们整理了Nuplan-Occ，这是迄今为止最大的语义占据数据集，基于广泛使用的Nuplan基准构建。其规模和多样性不仅促进了大规模生成建模，也促进了自动驾驶下游应用。基于该数据集，我们开发了一个统一框架，联合合成高质量语义占据、多视角视频和LiDAR点云。我们的方法采用时空解耦架构，支持4D动态占据的高保真空间扩展和时间预测。为弥合模态差距，我们进一步提出了两种新技术：基于高斯泼溅的稀疏点图渲染策略，增强多视角视频生成；以及传感器感知嵌入策略，显式建模LiDAR传感器属性以实现逼真的多LiDAR模拟。大量实验表明，与现有方法相比，我们的方法实现了更优的生成保真度和可扩展性，并验证了其在下游任务中的实用价值。仓库：https://github.com/Arlo0o/UniScene-Unified-Occupancy-centric-Driving-Scene-Generation/tree/v2

英文摘要

Driving scene generation is a critical domain for autonomous driving, enabling downstream applications, including perception and planning evaluation. Occupancy-centric methods have recently achieved state-of-the-art results by offering consistent conditioning across frames and modalities; however, their performance heavily depends on annotated occupancy data, which still remains scarce. To overcome this limitation, we curate Nuplan-Occ, the largest semantic occupancy dataset to date, constructed from the widely used Nuplan benchmark. Its scale and diversity facilitate not only large-scale generative modeling but also autonomous driving downstream applications. Based on this dataset, we develop a unified framework that jointly synthesizes high-quality semantic occupancy, multi-view videos, and LiDAR point clouds. Our approach incorporates a spatio-temporal disentangled architecture to support high-fidelity spatial expansion and temporal forecasting of 4D dynamic occupancy. To bridge modal gaps, we further propose two novel techniques: a Gaussian splatting-based sparse point map rendering strategy that enhances multi-view video generation, and a sensor-aware embedding strategy that explicitly models LiDAR sensor properties for realistic multi-LiDAR simulation. Extensive experiments demonstrate that our method achieves superior generation fidelity and scalability compared to existing approaches, and validates its practical value in downstream tasks. Repo: https://github.com/Arlo0o/UniScene-Unified-Occupancy-centric-Driving-Scene-Generation/tree/v2

URL PDF HTML ☆

赞 0 踩 0

2510.20955 2026-05-26 cs.LG cs.RO

Approximating Safety Feedback Without a Safety Oracle via Model Predictive Control

无安全神谕下通过模型预测控制近似安全反馈

Jeff Pflueger, Michael Everett

AI总结提出一种利用模拟器和模型预测路径积分算法，基于可逆性和正不变性假设来近似安全函数的方法，避免手动设计安全反馈。

Comments 8 pages, 5 figures

详情

AI中文摘要

移动机器人控制的安全决策算法通常需要存在反馈来验证提议动作的安全性。该反馈假定在控制系统的开发或部署过程中直接可用，可以采取显式约束公式或手工标记的安全数据集的形式，但两者都可能不准确或耗时。许多最近开发的模拟器可以处理复杂的交互和多样化的环境。这些环境具有隐式安全约束，可能难以建模。通过利用其中一个模拟器，我们可以构建一个安全函数的代理，从而绕过对手动设计反馈来捕获这些约束的需求。我们提出了一种算法，通过使用可逆性和对不安全状态空间的正不变性假设来近似安全性。该方法采用模型预测路径积分算法（MPPI）来建立这种可逆性并验证提议的动作。首先，通过模拟器将动作投影到未来状态。然后，如果MPPI能够找到一条路径返回到轨迹中的先前状态，则该状态保证在不安全（正不变）集合之外。实验结果表明，所提出的算法可以近似安全神谕的性能，同时避免将不安全状态分类为安全。

英文摘要

Safe decision-making algorithms for control of mobile robots often require the existence of feedback to verify the safety of proposed actions. This feedback is assumed to be directly available during the development or deployment of the control system. It can take the form of either an explicit constraint formulation or a set of hand-labeled safety data, both of which can be inaccurate or time consuming to produce. Many recently developed simulators can handle complex interactions and varied environments. These environments have implicit safety constraints that may be hard to model. By leveraging one of these simulators, we can construct a proxy for a safety function that bypasses the need for hand designed feedback in capturing these constraints. We present an algorithm that approximates safety by using reversibility and a positive-invariance assumption on the unsafe state space. This method employs the Model-Predictive Path Integral algorithm (MPPI) to establish this reversibility and verify a proposed action. First the action is projected via the simulator to a future state. Then if MPPI can find a path back to a previous state in the trajectory, that state is guaranteed to be outside the unsafe (positive invariant) set. Experimental results demonstrate that the proposed algorithm can approximate the performance of a safety oracle while avoiding classification of unsafe states as safe.

URL PDF HTML ☆

赞 0 踩 0

2510.20477 2026-05-26 cs.LG

Bi-CoG: Bi-Consistency-Guided Self-Training for Vision-Language Models

Bi-CoG：面向视觉语言模型的双一致性引导自训练

Rui Zhu, Song-Lin Lv, Zi-Kang Wang, Lan-Zhe Guo

AI总结针对半监督微调中模型偏差和超参数敏感问题，提出一种利用模型间和模型内一致性以及误差感知动态伪标签分配策略的即插即用方法Bi-CoG，在14个数据集上显著提升现有方法性能。

Comments Accepted by IJCAI 2026

详情

AI中文摘要

通过半监督学习（SSL）利用未标记数据或通过微调利用预训练模型是解决标签稀缺场景的两种主流范式。最近，将预训练视觉语言模型（VLM）的微调与SSL相结合引起了越来越多的关注，形成了半监督微调的新兴范式。然而，现有方法由于依赖预测一致性或预定义的置信度阈值，常常遭受模型偏差和超参数敏感性的困扰。为了解决这些局限性，我们提出了一种简单而有效的即插即用方法，名为$\underline{\textbf{Bi-Co}}$nsistency-$\underline{\textbf{G}}$uided Self-Training (Bi-CoG)，它通过同时利用模型间和模型内一致性，以及一种错误感知的动态伪标签分配策略，来分配高质量、低偏差的伪标签。理论分析和在14个数据集上的大量实验都证明了Bi-CoG的有效性，它一致且显著地提升了现有方法的性能。

英文摘要

Exploiting unlabeled data through semi-supervised learning (SSL) or leveraging pre-trained models via fine-tuning are two prevailing paradigms for addressing label-scarce scenarios. Recently, growing attention has been given to combining fine-tuning of pre-trained vision-language models (VLMs) with SSL, forming the emerging paradigm of semi-supervised fine-tuning. However, existing methods often suffer from model bias and hyperparameter sensitivity, due to reliance on prediction consistency or pre-defined confidence thresholds. To address these limitations, we propose a simple yet effective plug-and-play methodology named $\underline{\textbf{Bi-Co}}$nsistency-$\underline{\textbf{G}}$uided Self-Training (Bi-CoG), which assigns high-quality and low-bias pseudo-labels, by simultaneously exploiting inter-model and intra-model consistency, along with an error-aware dynamic pseudo-label assignment strategy. Both theoretical analysis and extensive experiments over 14 datasets demonstrate the effectiveness of Bi-CoG, which consistently and significantly improves the performance of existing methods.

URL PDF HTML ☆

赞 0 踩 0

2510.16435 2026-05-26 cs.RO cs.CL cs.HC

What Questions Should Robots Be Able to Answer? A Dataset of User Questions for Explainable Robotics

机器人应该能够回答哪些问题？一个用于可解释机器人的用户问题数据集

Lennart Wachowiak, Andrew Coles, Gerard Canal, Oya Celiktutan

AI总结本文通过收集100名参与者的1893个问题，构建了一个面向家用机器人的用户问题数据集，涵盖12个类别和70个子类别，旨在帮助机器人学家确定机器人需要回答的关键问题类型。

详情

AI中文摘要

随着大型语言模型和对话界面在人机交互中的广泛使用，机器人回答用户问题的能力比以往任何时候都更加重要。因此，我们引入了一个包含1,893个家用机器人用户问题的数据集，这些数据来自100名参与者，并分为12个类别和70个子类别。可解释机器人领域的大多数工作集中在“为什么”问题上。相比之下，我们的数据集提供了多种类型的问题，从关于简单执行细节的问题到关于机器人在假设场景中如何行动的问题——从而为机器人学家提供了关于其机器人需要能够回答哪些问题的宝贵见解。为了收集数据集，我们创建了15个视频刺激和7个文本刺激，描绘了机器人执行各种家务任务。然后，我们询问Prolific上的参与者在每个描绘的情境中他们想问机器人什么问题。在最终数据集中，最常见的类别是关于任务执行细节（21.4%）、机器人能力（12.6%）和性能评估（10.7%）的问题。尽管关于机器人如何处理潜在困难场景并确保正确行为的问题较少，但用户认为这些是机器人最需要能够回答的问题。此外，我们发现自认为是机器人学新手的人与更有经验的用户提出的问题不同。新手更倾向于询问简单事实，例如机器人做了什么或环境的当前状态。随着机器人进入与人类共享的环境，并且语言成为给出指令和交互的核心，该数据集为（i）识别机器人需要记录并暴露给对话界面的信息，（ii）对问答模块进行基准测试，以及（iii）设计符合用户期望的解释策略提供了宝贵的基础。

英文摘要

With the growing use of large language models and conversational interfaces in human-robot interaction, robots' ability to answer user questions is more important than ever. We therefore introduce a dataset of 1,893 user questions for household robots, collected from 100 participants and organized into 12 categories and 70 subcategories. Most work in explainable robotics focuses on why-questions. In contrast, our dataset provides a wide variety of questions, from questions about simple execution details to questions about how the robot would act in hypothetical scenarios -- thus giving roboticists valuable insights into what questions their robot needs to be able to answer. To collect the dataset, we created 15 video stimuli and 7 text stimuli, depicting robots performing varied household tasks. We then asked participants on Prolific what questions they would want to ask the robot in each portrayed situation. In the final dataset, the most frequent categories are questions about task execution details (21.4%), the robot's capabilities (12.6%), and performance assessments (10.7%). Although questions about how robots would handle potentially difficult scenarios and ensure correct behavior are less frequent, users rank them as the most important for robots to be able to answer. Moreover, we find that users who identify as novices in robotics ask different questions than more experienced users. Novices are more likely to inquire about simple facts, such as what the robot did or the current state of the environment. As robots enter environments shared with humans and language becomes central to giving instructions and interaction, this dataset provides a valuable foundation for (i) identifying the information robots need to log and expose to conversational interfaces, (ii) benchmarking question-answering modules, and (iii) designing explanation strategies that align with user expectations.

URL PDF HTML ☆

赞 0 踩 0

2510.07257 2026-05-26 cs.LG

Test-Time Graph Search for Goal-Conditioned Reinforcement Learning

测试时图搜索用于目标条件强化学习

Evgenii Opryshko, Junwei Quan, Claas Voelcker, Yilun Du, Igor Gilitschenski

AI总结提出测试时图搜索方法，通过构建离线数据集图并自适应选择子目标，在不额外训练的情况下显著提升目标条件强化学习在长时域任务中的成功率。

详情

AI中文摘要

离线目标条件强化学习（GCRL）通常难以处理长时域任务，其中价值估计误差累积导致策略不可靠。通常认为没有专门训练就无法实现有效的长期规划。相反，我们的工作表明，现有的GCRL策略与轻量级、无需训练的规划包装器结合时，可以完成长时域任务。我们发现标准目标条件价值函数编码了足以进行规划的局部一致几何结构。我们的方法，测试时图搜索（TTGS），在离线数据集上构建图，并采用自适应子目标选择策略。为了解决最短路径搜索中不可靠的价值估计，我们提出了一种新机制，软性地惩罚长距离转移。我们的方法计算开销可忽略，且不需要额外的监督或参数更新。在OGBench基准上，TTGS显著提高了多个基学习器和任务的成功率，主要收益在具有挑战性的长时域运动任务上，其中一些成功率从接近零提高到90%以上，通常匹配或超越需要复杂辅助训练的方法。代码和视频可在https://ktolnos.github.io/ttgs找到。

英文摘要

Offline goal-conditioned reinforcement learning (GCRL) often struggles with long-horizon tasks, where errors in value estimation accumulate and produce unreliable policies. It is typically assumed that effective long-term planning is infeasible without specialized training. In contrast, our work demonstrates that existing GCRL policies can complete long-horizon tasks when combined with a lightweight, training-free planning wrapper. We find that standard goal-conditioned value functions encode locally consistent geometric structure sufficient for planning. Our approach, Test-Time Graph Search (TTGS), constructs a graph over the offline dataset and employs an adaptive subgoal selection strategy. To address unreliable value estimates during shortest-path search, we propose a novel mechanism that softly penalizes long-distance transitions. Our method incurs negligible computational overhead and requires no additional supervision or parameter updates. On the OGBench benchmark, TTGS significantly boosts success rates across multiple base learners and tasks, with primary gains on challenging long-horizon locomotion tasks where some success rates are improved from near-zero to over 90\%, often matching or outperforming methods that require complex auxiliary training. Code and videos can be found at https://ktolnos.github.io/ttgs.

URL PDF HTML ☆

赞 0 踩 0

2510.01384 2026-05-26 cs.LG

Fine-Tuning Masked Diffusion for Provable Self-Correction

微调掩码扩散以实现可证明的自校正

Jaeyeon Kim, Seunggeun Kim, Taekyun Lee, David Z. Pan, Hyeji Kim, Sham Kakade, Sitan Chen

AI总结提出PRISM方法，通过轻量级模型无关的重新掩码策略，在掩码扩散模型中实现可证明的自校正，无需强化学习或验证器，提升低质量令牌检测与修正能力。

Comments Authorship statement: Jaeyeon Kim and Seunggeun Kim contributed equally, and Taekyun Lee is also a co first author

详情

AI中文摘要

生成模型的一个自然期望是自校正——在推理时检测并修正低质量令牌。尽管掩码扩散模型（MDMs）已成为离散空间生成建模的有前景方法，但其自校正能力仍知之甚少。先前将自校正融入MDMs的尝试要么需要彻底改造MDM架构/训练，要么依赖于令牌质量的不精确代理，限制了其适用性。受此启发，我们引入PRISM——掩码扩散推理时自校正的插件式重新掩码——一种轻量级、模型无关的方法，适用于任何预训练MDM。理论上，PRISM定义了一个自校正损失，可证明地学习每个令牌的质量分数，无需强化学习或验证器。这些质量分数在与MDM相同的前向传播中计算，并用于检测低质量令牌。实验上，PRISM在多个领域和规模上推进了MDM推理：数独；无条件文本（170M）；以及使用LLaDA（8B）的代码。

英文摘要

A natural desideratum for generative models is self-correction--detecting and revising low-quality tokens at inference. While Masked Diffusion Models (MDMs) have emerged as a promising approach for generative modeling in discrete spaces, their capacity for self-correction remains poorly understood. Prior attempts to incorporate self-correction into MDMs either require overhauling MDM architectures/training or rely on imprecise proxies for token quality, limiting their applicability. Motivated by this, we introduce PRISM--Plug-in Remasking for Inference-time Self-correction of Masked Diffusions--a lightweight, model-agnostic approach that applies to any pretrained MDM. Theoretically, PRISM defines a self-correction loss that provably learns per-token quality scores, without RL or a verifier. These quality scores are computed in the same forward pass with MDM and used to detect low-quality tokens. Empirically, PRISM advances MDM inference across domains and scales: Sudoku; unconditional text (170M); and code with LLaDA (8B).

URL PDF HTML ☆

赞 0 踩 0

2510.01184 2026-05-26 cs.LG

Temporal Score Rescaling for Temperature Sampling in Diffusion and Flow Models

扩散与流模型中温度采样的时间分数重缩放

Yanbo Xu, Yu Wu, Sungjae Park, Zhizhuo Zhou, Shubham Tulsiani

AI总结提出一种无需微调或改变训练策略的方法，通过重缩放噪声数据的得分函数来调控扩散和流模型的采样多样性，实现局部温度控制，并在图像生成、姿态估计、深度预测、机器人操作和蛋白质设计等任务中验证了有效性。

Comments Accepted at ICML 2026. Project page: https://temporalscorerescaling.github.io/

详情

AI中文摘要

我们提出一种机制来引导去噪扩散和流匹配模型的采样多样性，允许用户从比训练分布更尖锐或更宽的分布中采样。我们基于这些模型利用（学习的）噪声数据分布的得分函数进行采样这一观察，并表明重缩放这些得分函数可以有效控制“局部”采样温度。值得注意的是，该方法不需要任何微调或改变训练策略，可以应用于任何现成模型，并且与确定性和随机采样器兼容。我们首先在玩具2D数据上验证了我们的框架，然后展示了其在五个不同任务上训练的扩散模型中的应用——图像生成、姿态估计、深度预测、机器人操作和蛋白质设计。我们发现，在这些任务中，我们的方法允许从更尖锐（或更平坦）的分布中采样，从而带来性能提升，例如，深度预测模型受益于采样更可能的深度估计，而图像生成模型在采样稍平坦的分布时表现更好。

英文摘要

We present a mechanism to steer the sampling diversity of denoising diffusion and flow matching models, allowing users to sample from a sharper or broader distribution than the training distribution. We build on the observation that these models leverage (learned) score functions of noisy data distributions for sampling and show that rescaling these allows one to effectively control a 'local' sampling temperature. Notably, this approach does not require any finetuning or alterations to training strategy, and can be applied to any off-the-shelf model and is compatible with both deterministic and stochastic samplers. We first validate our framework on toy 2D data, and then demonstrate its application for diffusion models trained across five disparate tasks -- image generation, pose estimation, depth prediction, robot manipulation, and protein design. We find that across these tasks, our approach allows sampling from sharper (or flatter) distributions, yielding performance gains e.g., depth prediction models benefit from sampling more likely depth estimates, whereas image generation models perform better when sampling a slightly flatter distribution.

URL PDF HTML ☆

赞 0 踩 0

2510.00387 2026-05-26 cs.LG cs.HC

Bayesian Distributional Models of Executive Functioning

执行功能的贝叶斯分布模型

Robert Kasumba, Zeyu Lu, Dom CP Marticorena, Mingyang Zhong, Paul Beggs, Anja Pahor, Geetha Ramani, Imani Goffney, Susanne M Jaeggi, Aaron R Seitz, Jacob R Gardner, Dennis L Barbour

AI总结本研究使用已知真实参数的受控模拟，评估分布潜变量模型（DLVM）和贝叶斯分布主动学习（DALE）相比传统独立最大似然估计（IMLE）的优势，证明DLVM结合DALE能更高效地估计认知表现分布。

Comments 45 pages, 8 figures, 2 tables

详情

AI中文摘要

本研究使用已知真实参数的受控模拟，评估分布潜变量模型（DLVM）和贝叶斯分布主动学习（DALE）相比传统独立最大似然估计（IMLE）的表现。DLVM整合了多个执行功能任务和个体的观测，允许在稀疏或不完整数据条件下进行参数估计。为了建立已知真实参数，我们从神经网络学习的潜空间中均匀采样个体会话，并将其映射到不同任务上的分布认知表现。然后使用DALE、随机过程或标准固定电池方法从这些分布中采样个体测试项。在给定相同观测集时，DLVM始终优于IMLE，尤其是在数据量较小的情况下，并且更快收敛到真实分布的高度准确估计。在第二组分析中，DALE自适应地引导采样以最大化信息增益，优于随机采样和固定测试电池，尤其是在前80次试验中。这些发现确立了将DLVM的跨任务推理与DALE的最优自适应采样相结合的优势，为更高效的认知评估提供了原则性基础。

英文摘要

This study uses controlled simulations with known ground-truth parameters to evaluate how Distributional Latent Variable Models (DLVM) and Bayesian Distributional Active LEarning (DALE) perform in comparison to conventional Independent Maximum Likelihood Estimation (IMLE). DLVM integrates observations across multiple executive function tasks and individuals, allowing parameter estimation even under sparse or incomplete data conditions. To establish known-ground truth, we uniformly sample individual sessions from a neural network learned latent space and map them to distributional cognitive performance across different tasks. The individual test-items are then sampled from these distributions using either DALE, random procedure or a standard fixed battery approach. When given the same set of observations, DLVM consistently outperformed IMLE, especially under smaller amounts of data, and converges faster to highly accurate estimates of the true distributions. In a second set of analyses, DALE adaptively guided sampling to maximize information gain, outperforming random sampling and fixed test batteries, particularly within the first 80 trials. These findings establish the advantages of combining DLVM's cross-task inference with DALE's optimal adaptive sampling, providing a principled basis for more efficient cognitive assessments.

URL PDF HTML ☆

赞 0 踩 0

2509.13608 2026-05-26 cs.LG

Is GPT-4o mini Blinded by its Own Safety Filters? Exposing the Multimodal-to-Unimodal Bottleneck in Hate Speech Detection

GPT-4o mini 是否被自身的安全过滤器蒙蔽？揭示多模态到单模态瓶颈在仇恨言论检测中的作用

Niruthiha Selvanayagam, Ted Kurti

AI总结本文通过 Hateful Memes Challenge 数据集系统分析 GPT-4o mini 在多模态仇恨言论检测中的安全架构，发现并实验验证了“单模态瓶颈”缺陷，即上下文无关的安全过滤器会优先阻断多模态推理，导致误报。

Comments This paper reports preliminary findings from a small-scale study whose sample size is insufficient to support the stated conclusions. The authors are withdrawing it to conduct a more comprehensive evaluation

详情

AI中文摘要

随着大型多模态模型（LMMs）融入日常数字生活，理解其安全架构成为 AI 对齐的关键问题。本文对 OpenAI 的 GPT-4o mini（一个全球部署的模型）在多模态仇恨言论检测这一困难任务上进行了系统分析。使用 Hateful Memes Challenge 数据集，我们对 500 个样本进行了多阶段调查，以探究模型的推理和失败模式。我们的核心发现是通过实验识别出“单模态瓶颈”——一种架构缺陷，其中模型的高级多模态推理被上下文无关的安全过滤器系统性地抢先阻断。对 144 次内容策略拒绝的定量验证显示，这些覆盖触发由单模态视觉（50%）和文本（50%）内容均等引发。我们进一步证明该安全系统脆弱，不仅阻止高风险图像，也阻止良性的常见模因格式，导致可预测的误报。这些发现揭示了最先进 LMMs 中能力与安全性之间的根本矛盾，强调了需要更集成、上下文感知的对齐策略，以确保 AI 系统能够安全且有效地部署。

英文摘要

As Large Multimodal Models (LMMs) become integral to daily digital life, understanding their safety architectures is a critical problem for AI Alignment. This paper presents a systematic analysis of OpenAI's GPT-4o mini, a globally deployed model, on the difficult task of multimodal hate speech detection. Using the Hateful Memes Challenge dataset, we conduct a multi-phase investigation on 500 samples to probe the model's reasoning and failure modes. Our central finding is the experimental identification of a "Unimodal Bottleneck," an architectural flaw where the model's advanced multimodal reasoning is systematically preempted by context-blind safety filters. A quantitative validation of 144 content policy refusals reveals that these overrides are triggered in equal measure by unimodal visual 50% and textual 50% content. We further demonstrate that this safety system is brittle, blocking not only high-risk imagery but also benign, common meme formats, leading to predictable false positives. These findings expose a fundamental tension between capability and safety in state-of-the-art LMMs, highlighting the need for more integrated, context-aware alignment strategies to ensure AI systems can be deployed both safely and effectively.

URL PDF HTML ☆

赞 0 踩 0

2509.12672 2026-05-26 cs.CL

Towards Inclusive Toxic Content Moderation: Addressing Vulnerabilities to Adversarial Attacks in Toxicity Classifiers Tackling LLM-generated Content

迈向包容性有害内容审核：解决面向LLM生成内容的有害性分类器对抗攻击的脆弱性

Shaz Furniturewala, Arkaitz Zubiaga

AI总结针对LLM生成内容的有害性分类器易受对抗攻击的问题，提出基于机制可解释性的方法识别并抑制脆弱电路，提升模型鲁棒性并揭示人口统计学层面的公平性差距。

详情

AI中文摘要

由于大型语言模型（LLM）的广泛使用，在线机器生成内容的数量急剧增长，给内容审核系统带来了新的挑战。传统的内容审核分类器通常基于人类生成的文本进行训练，由于LLM生成的文本偏离其训练数据以及旨在逃避检测的对抗攻击，导致分类错误。当前的防御策略是反应性的而非主动性的，因为它们依赖于对抗训练或外部检测模型来识别攻击。在这项工作中，我们旨在识别导致错误分类的有害性分类器的脆弱组件，提出了一种基于机制可解释性技术的新策略。我们的研究聚焦于微调的BERT和RoBERTa分类器，在涵盖多种少数群体的不同数据集上进行测试。我们使用对抗攻击技术来识别脆弱电路。最后，我们抑制这些脆弱电路，提高对抗攻击下的性能。我们还提供了这些脆弱电路在人口统计学层面的洞察，揭示了模型训练中的公平性和鲁棒性差距。我们发现模型具有不同的注意力头，这些头要么对性能至关重要，要么容易受到攻击，抑制脆弱头可以提高对抗输入的性能。我们还发现不同的头负责不同人口群体的脆弱性，这可以为更具包容性的有害性检测模型开发提供信息。

英文摘要

The volume of machine-generated content online has grown dramatically due to the widespread use of Large Language Models (LLMs), leading to new challenges for content moderation systems. Conventional content moderation classifiers, which are usually trained on text produced by humans, suffer from misclassifications due to LLM-generated text deviating from their training data and adversarial attacks that aim to avoid detection. Present-day defence tactics are reactive rather than proactive, since they rely on adversarial training or external detection models to identify attacks. In this work, we aim to identify the vulnerable components of toxicity classifiers that contribute to misclassification, proposing a novel strategy based on mechanistic interpretability techniques. Our study focuses on fine-tuned BERT and RoBERTa classifiers, testing on diverse datasets spanning a variety of minority groups. We use adversarial attacking techniques to identify vulnerable circuits. Finally, we suppress these vulnerable circuits, improving performance against adversarial attacks. We also provide demographic-level insights into these vulnerable circuits, exposing fairness and robustness gaps in model training. We find that models have distinct heads that are either crucial for performance or vulnerable to attack and suppressing the vulnerable heads improves performance on adversarial input. We also find that different heads are responsible for vulnerability across different demographic groups, which can inform more inclusive development of toxicity detection models.

URL PDF HTML ☆

赞 0 踩 0

2509.08150 2026-05-26 cs.CL

Verbalized Algorithms: Classical Algorithms are All You Need (Mostly)

言语化算法：经典算法就是你所需要的（大部分）

Supriya Lall, Christian Farrell, Hari Pathanjaly, Marko Pavic, Sarvesh Chezhian, Masataro Asai

AI总结提出言语化算法（VA）范式，将LLM作为可靠的基本操作（如字符串比较）集成到经典算法中，以提升推理的准确性和效率。

Comments Accepted in NeurIPS 2025 Workshop on Efficient Reasoning; Submitted to Position Paper Track at Neurips 2026

详情

AI中文摘要

推理本质上是一个算法任务。然而，当前基于LLM的推理工作依赖于自由生成，其理论保证（可靠性、完备性、复杂性、最优性）仍然知之甚少。我们认为不应将它们视为通用推理器，作为替代，我们提出一种称为“言语化算法”（VA）的范式，它将LLM与各种具有既定保证的算法相结合。VA不依赖LLM解决推理任务的能力，而是通过将任务分解为它们能够可靠回答的简单字符串基本操作来限制其范围。例如，对自然语言字符串列表进行排序可以通过在并行或近似排序算法中使用LLM作为二元比较预言机来实现。我们在数值推理、主题聚类、Wi-Fi接入点优化和多跳问答RAG任务中，通过言语化最大值、排序、聚类和子模最大化，推动了准确率-运行时帕累托前沿。这些结果表明，通过标准算法分析改进基于LLM的推理是一个可行且基础更扎实的研究方向。

英文摘要

Reasoning is a fundamentally algorithmic task. Yet current work on LLM-based reasoning relies on free-form generation whose theoretical guarantees (soundness, completeness, complexity, optimality) remain poorly understood. We argue that we should not treat them as general-purpose reasoners, and as an alternative, we propose a paradigm we call \emph{verbalized algorithms} (VAs), which combines LLMs and various algorithms with established guarantees. Instead of betting on LLM's ability to solve a reasoning task, VAs limit their scope by decomposing the task down to simple elementary operations on strings that they can answer reliably. For example, sorting a list of natural language strings could be done by using an LLM as a binary comparison oracle in a parallel or approximate sorting algorithm. We push the accuracy-runtime Pareto front with \emph{verbalized maximum}, \emph{sorting}, \emph{clustering}, and \emph{submodular maximization}, for numerical reasoning, topic clustering, Wi-Fi access point optimization, and multi-hop Q\&A RAG task. These results suggest improving LLM-based reasoning through standard algorithmic analysis is a feasible and better grounded research direction.

URL PDF HTML ☆

赞 0 踩 0

2509.07961 2026-05-26 cs.AI

Probing the Preferences of a Language Model: Integrating Verbal and Behavioral Tests of AI Welfare

探究语言模型的偏好：整合AI福祉的言语与行为测试

Valen Tagliabue, Leonard Dung

AI总结本研究通过言语报告和行为实验（虚拟环境导航与话题选择）测量语言模型的偏好，发现偏好满足可作为AI福祉的实证代理，但测量一致性因模型和条件而异。

Comments Forthcoming in Philosophy and the Mind Sciences (PhiMiSci)

详情

AI中文摘要

我们开发了新的实验范式来测量语言模型中的福祉。我们比较了模型关于其偏好的言语报告与在虚拟环境中导航和选择对话主题时通过行为表达的偏好。我们还测试了成本和奖励如何影响行为，以及对于幸福主义福祉量表（测量自主性和生活目的等状态）的反应是否在语义等价的提示下保持稳定。总体而言，我们观察到我们的测量之间存在显著程度的相互支持。在不同条件下，陈述偏好与行为之间观察到的可靠相关性表明，偏好满足原则上可以作为当今某些AI系统中经验可测量的福祉代理。此外，我们的设计为模型行为的定性观察提供了一个富有启发性的环境。然而，测量之间的一致性在某些模型和条件下比其他情况更明显，并且反应因扰动而改变。由于这一点，以及关于福祉本质和语言模型的认知状态（以及福祉主体性）的背景不确定性，我们目前不确定我们的方法是否成功测量了语言模型的福祉状态。尽管如此，这些发现凸显了在语言模型中测量福祉的可行性，邀请进一步探索。

英文摘要

We develop new experimental paradigms for measuring welfare in language models. We compare verbal reports of models about their preferences with preferences expressed through behavior when navigating a virtual environment and selecting conversation topics. We also test how costs and rewards affect behavior and whether responses to an eudaimonic welfare scale - measuring states such as autonomy and purpose in life - are stable across semantically equivalent prompts. Overall, we observed a notable degree of mutual support between our measures. The reliable correlations observed between stated preferences and behavior across conditions suggest that preference satisfaction can, in principle, serve as an empirically measurable welfare proxy in some of today's AI systems. Furthermore, our design offered an illuminating setting for qualitative observation of model behavior. Yet, the consistency between measures was more pronounced in some models and conditions than others and responses were changed by perturbations. Due to this, and the background uncertainty about the nature of welfare and the cognitive states (and welfare subjecthood) of language models, we are currently uncertain whether our methods successfully measure the welfare state of language models. Nevertheless, these findings highlight the feasibility of welfare measurement in language models, inviting further exploration.

URL PDF HTML ☆

赞 0 踩 0

2508.15760 2026-05-26 cs.CL cs.AI

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

LiveMCP-101：对支持MCP的智能体进行压力测试与诊断

Ming Yin, Dinghan Shen, Silei Xu, Sixun Dong, Mian Zhang, Yebowen Hu, Shujian Liu, Jianbing Han, Simin Ma, Song Wang, Sathish Reddy Indurthi, Xun Wang, Yiran Chen, Kaiqiang Song

AI总结针对MCP工具在动态多步任务中的评估空白，提出LiveMCP-101基准测试（101个真实查询），通过并行评估框架发现前沿LLM成功率低于60%，并识别出七种失败模式。

详情

AI中文摘要

工具调用已成为AI智能体的关键能力。与依赖静态、特定于提供商的工具定义的传统工具调用框架不同，模型上下文协议（MCP）提供了统一接口来动态发现和调用工具。然而，在现实动态场景中使用多样化MCP工具进行多步任务基准测试存在显著空白。在这项工作中，我们提出了LiveMCP-101，一个包含101个真实世界查询的基准测试，这些查询需要协调使用多个MCP工具。为了解决真实工具响应中的时间变异性，我们引入了一个并行评估框架，其中参考智能体同时执行经过验证的计划以产生实时参考输出。实验表明，即使是前沿LLM的成功率也低于60%，突显了多步工具使用中的挑战。全面的错误分析识别了涵盖工具规划、参数化和输出处理的七种失败模式，为改进当前模型指明了具体方向。LiveMCP-101为评估现实世界智能体能力设定了严格标准，推动通过MCP工具编排可靠执行复杂任务的自主智能体系统的发展。

英文摘要

Tool calling has emerged as a critical capability for AI agents. In contrast to conventional tool calling frameworks that rely on static, provider-specific tool definitions, the Model Context Protocol (MCP) offers a unified interface to discover and invoke tools dynamically. However, there is a significant gap in benchmarking multi-step tasks using diverse MCP tools in realistic, dynamic scenarios. In this work, we present LiveMCP-101, a benchmark of 101 real-world queries that require coordinated use of multiple MCP tools. To address temporal variability in real-world tool responses, we introduce a parallel evaluation framework where a reference agent executes a validated plan simultaneously to produce real-time reference outputs. Experiments show that even frontier LLMs achieve a success rate below 60\%, highlighting challenges in multi-step tool use. Comprehensive error analysis identifies seven failure modes spanning tool planning, parameterization, and output handling, pointing to concrete directions for improving current models. LiveMCP-101 sets a rigorous standard for evaluating real-world agent capabilities, advancing toward autonomous agent systems that reliably execute complex tasks through MCP tool orchestration.

URL PDF HTML ☆

赞 0 踩 0

2508.09599 2026-05-26 cs.CV

BridgeTA: Bridging the Representation Gap in Knowledge Distillation via Teacher Assistant for Bird's Eye View Map Segmentation

BridgeTA: 通过教师助手弥合知识蒸馏中表示差距的鸟瞰图分割

Beomjun Kim, Suhan Woo, Sejong Heo, Euntai Kim

AI总结提出BridgeTA框架，利用教师助手网络在保持学生模型架构和推理成本不变的情况下，弥合激光雷达-相机融合与纯相机模型之间的表示差距，并通过Young不等式推导蒸馏损失实现稳定优化，在nuScenes数据集上mIoU提升4.2%。

Comments Accepted at ICRA 2026 (8 pages, 6 figures)

详情

AI中文摘要

鸟瞰图（BEV）分割是自动驾驶中最重要且最具挑战性的任务之一。纯相机方法作为激光雷达的经济高效替代方案备受关注，但仍落后于基于激光雷达-相机（LC）融合的方法。知识蒸馏（KD）已被探索用于缩小这一差距，但现有方法主要通过模仿教师架构来扩大学校模型，导致推理成本增加。为解决此问题，我们引入BridgeTA，一种经济高效的蒸馏框架，通过教师助手（TA）网络弥合LC融合与纯相机模型之间的表示差距，同时保持学生架构和推理成本不变。轻量级TA网络结合教师和学生的BEV表示，创建共享潜在空间作为中间表示。为从理论上奠定框架基础，我们使用Young不等式推导蒸馏损失，将直接的师生蒸馏路径分解为教师-TA和TA-学生双路径，稳定优化并加强知识迁移。在具有挑战性的nuScenes数据集上的大量实验证明了我们方法的有效性，相比纯相机基线mIoU提升4.2%，比最先进的KD方法提升幅度高出45%。

英文摘要

Bird's-Eye-View (BEV) map segmentation is one of the most important and challenging tasks in autonomous driving. Camera-only approaches have drawn attention as cost-effective alternatives to LiDAR, but they still fall behind LiDAR-Camera (LC) fusion-based methods. Knowledge Distillation (KD) has been explored to narrow this gap, but existing methods mainly enlarge the student model by mimicking the teacher's architecture, leading to higher inference cost. To address this issue, we introduce BridgeTA, a cost-effective distillation framework to bridge the representation gap between LC fusion and Camera-only models through a Teacher Assistant (TA) network while keeping the student's architecture and inference cost unchanged. A lightweight TA network combines the BEV representations of the teacher and student, creating a shared latent space that serves as an intermediate representation. To ground the framework theoretically, we derive a distillation loss using Young's Inequality, which decomposes the direct teacher-student distillation path into teacher-TA and TA-student dual paths, stabilizing optimization and strengthening knowledge transfer. Extensive experiments on the challenging nuScenes dataset demonstrate the effectiveness of our method, achieving an improvement of 4.2% mIoU over the Camera-only baseline, up to 45% higher than the improvement of other state-of-the-art KD methods.

URL PDF HTML ☆

赞 0 踩 0

2506.10054 2026-05-26 cs.LG cs.AI cs.CL cs.CV

Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs

Uni-DPO：大语言模型动态偏好优化的统一范式

Shangpin Peng, Weinong Wang, Zhuotao Tian, Senqiao Yang, Xing Wu, Haotian Xu, Chengquan Zhang, Takashi Isobe, Baotian Hu, Min Zhang

AI总结针对现有DPO方法忽略数据质量和学习难度差异的问题，提出Uni-DPO统一框架，通过自适应重加权偏好对实现更有效的数据利用和更优性能。

Comments Accepted by ICLR 2026. Code & models: https://github.com/pspdada/Uni-DPO

详情

AI中文摘要

直接偏好优化（DPO）因其简单高效已成为从人类反馈中进行强化学习（RLHF）的基石。然而，现有的基于DPO的方法通常平等对待所有偏好对，忽略了数据质量和学习难度的显著差异，导致数据利用效率低下和性能次优。为解决这一局限，我们提出Uni-DPO，一个统一的动态偏好优化框架，该框架联合考虑（a）偏好对的内在质量和（b）模型在训练过程中的动态表现。通过基于这两个因素自适应地重新加权样本，Uni-DPO能够更有效地利用偏好数据并实现卓越性能。跨模型和基准的大量实验证明了Uni-DPO的有效性和泛化能力。在文本任务上，使用Uni-DPO微调的Gemma-2-9B-IT在Arena-Hard上超越领先的大语言模型Claude 3 Opus 6.7个百分点。在数学和多模态任务上，Uni-DPO在所有基准上持续优于基线方法，为其有效性和鲁棒性提供了强有力的实证证据。

英文摘要

Direct Preference Optimization (DPO) has emerged as a cornerstone of reinforcement learning from human feedback (RLHF) due to its simplicity and efficiency. However, existing DPO-based methods typically treat all preference pairs equally, overlooking substantial variations in data quality and learning difficulty, which leads to inefficient data utilization and suboptimal performance. To address this limitation, we propose Uni-DPO, a unified dynamic preference optimization framework that jointly considers (a) the inherent quality of preference pairs and (b) the model's evolving performance during training. By adaptively reweighting samples based on both factors, Uni-DPO enables more effective use of preference data and achieves superior performance. Extensive experiments across models and benchmarks demonstrate the effectiveness and generalization of Uni-DPO. On textual tasks, Gemma-2-9B-IT fine-tuned with Uni-DPO surpasses the leading LLM, Claude 3 Opus, by 6.7 points on Arena-Hard. On mathematical and multimodal tasks, Uni-DPO consistently outperforms baseline methods across all benchmarks, providing strong empirical evidence of its effectiveness and robustness.

URL PDF HTML ☆

赞 0 踩 0

2506.09199 2026-05-26 cs.LG cs.AI cs.DC

FLoRIST: Singular Value Thresholding for Efficient and Accurate Federated Fine-Tuning of Large Language Models

FLoRIST: 用于高效准确的大语言模型联邦微调的奇异值阈值化方法

Hariharan Ramesh, Jyotikrishna Dass

AI总结提出FLoRIST框架，通过奇异值阈值化在紧凑中间空间中对局部适配器进行分解，实现数学上准确的聚合，同时保持通信和计算高效。

Comments 21 pages, 12 figures

详情

Journal ref: Ninth Conference on Machine Learning and Systems (MLSys 2026)

AI中文摘要

将低秩适配（LoRA）集成到联邦学习为在不共享本地数据的情况下对大语言模型（LLMs）进行参数高效微调提供了一种有前景的解决方案。然而，为联邦LoRA设计的几种方法在平衡通信效率、模型准确性和计算成本方面面临重大挑战，尤其是在异构客户端之间。这些方法要么依赖于简单的局部适配器平均，这会引入聚合噪声；要么需要传输大型堆叠局部适配器，导致通信效率低下；要么需要重建内存密集的全局权重更新矩阵并执行计算昂贵的分解来设计客户端特定的低秩适配器。在这项工作中，我们提出了FLoRIST，一个联邦微调框架，在不产生高通信或计算开销的情况下实现了数学上准确的聚合。FLoRIST不是在服务器端构建完整的全局权重更新矩阵，而是通过对堆叠的局部适配器分别执行奇异值分解，采用高效的分解流程。该方法在紧凑的中间空间内操作，以表示来自局部LoRA的累积信息。我们引入了可调的奇异值阈值化，用于服务器端最优秩选择，以构建一对所有客户端共享的全局低秩适配器。跨多个数据集和LLMs的大量实证评估表明，FLoRIST在同构和异构设置中始终在卓越的通信效率和竞争性能之间取得最佳平衡。

英文摘要

Integrating Low-Rank Adaptation (LoRA) into federated learning offers a promising solution for parameter-efficient fine-tuning of Large Language Models (LLMs) without sharing local data. However, several methods designed for federated LoRA present significant challenges in balancing communication efficiency, model accuracy, and computational cost, particularly among heterogeneous clients. These methods either rely on simplistic averaging of local adapters, which introduces aggregation noise, require transmitting large stacked local adapters, leading to poor communication efficiency, or necessitate reconstructing memory-dense global weight-update matrix and performing computationally expensive decomposition to design client-specific low-rank adapters. In this work, we propose FLoRIST, a federated fine-tuning framework that achieves mathematically accurate aggregation without incurring high communication or computational overhead. Instead of constructing the full global weight-update matrix at the server, FLoRIST employs an efficient decomposition pipeline by performing singular value decomposition on stacked local adapters separately. This approach operates within a compact intermediate space to represent the accumulated information from local LoRAs. We introduce tunable singular value thresholding for server-side optimal rank selection to construct a pair of global low-rank adapters shared by all clients. Extensive empirical evaluations across multiple datasets and LLMs demonstrate that FLoRIST consistently strikes the best balance between superior communication efficiency and competitive performance in both homogeneous and heterogeneous setups.

URL PDF HTML ☆

赞 0 踩 0

2506.09084 2026-05-26 cs.LG cs.AI

PageLLM: A Multi-Grained Reward Framework for Whole-Page Optimization with Large Language Models

PageLLM：面向整页优化的大语言模型多粒度奖励框架

Xinyuan Wang, Liang Wu, Dongjie Wang, Yanjie Fu

AI总结针对整页优化中人工标注成本高和页面级连贯性与项目级放置粒度不匹配的问题，提出PageLLM框架，通过将隐式反馈解耦为粗粒度页面级奖励和细粒度项目级奖励，结合PPO的RLHF进行微调，显著提升排序性能并在线上部署中取得收益。

详情

AI中文摘要

整页优化（WPO）决定了搜索和推荐结果如何呈现给用户，而大语言模型（LLMs）通过将页面生成视为序列生成为其开辟了新途径。然而，将LLMs适配到网络规模的WPO仍受限于昂贵的人工标注需求以及页面级连贯性与项目级放置之间的粒度不匹配。在这项工作中，我们表明这两个挑战是耦合的：只要奖励信号被解耦为两个互补的粒度，仅凭隐式用户反馈就足以进行对齐。我们提出了PageLLM，一个基于奖励的微调框架，该框架（i）将隐式反馈转化为四个对比偏好对族，涵盖相关性、排序、多样性和冗余度；（ii）学习一个粗粒度的页面级奖励和一个细粒度的项目级奖励，后者捕捉对参与度敏感的位置交换；（iii）在预训练的LLM上通过基于PPO的RLHF结合这两种奖励。在七个亚马逊类别上针对十一个基线的广泛实验表明，单独任何一种奖励都不足够——丢弃页面级或项目级信号分别使NDCG@100降低17.8%和15.2%，而联合奖励则使NDCG@100提升高达46.8%。在拥有1000万用户的在线A/B测试中，PageLLM使GMV提升0.44%，点击率提升0.14%，证实了来自隐式反馈的多粒度奖励可扩展到生产级WPO。代码和数据可在匿名仓库中获取。

英文摘要

Whole-page optimization (WPO) decides how search and recommendation results are surfaced to users, and large language models (LLMs) open a new route to it by treating page generation as sequence generation. Adapting LLMs to web-scale WPO, however, remains bottlenecked by the need for costly human annotations and by the mismatched granularity between page-level coherence and item-level placement. In this work we show that these two challenges are coupled: implicit user feedback alone suffices for alignment, provided the reward signal is decoupled into two complementary granularities. We propose PageLLM, a reward-based fine-tuning framework that (i) turns implicit feedback into four contrastive preference-pair families covering relevance, ranking, diversity, and redundancy, (ii) learns a coarse page-level reward and a fine item-level reward that captures engagement-sensitive position swaps, and (iii) combines both rewards in PPO-based RLHF over a pre-trained LLM. Extensive experiments on seven Amazon categories against eleven baselines show that neither reward alone is sufficient -- dropping the page-level or item-level signal reduces NDCG@100 by 17.8% and 15.2% respectively, whereas the joint reward improves NDCG@100 by up to 46.8%. Deployed in a 10M-user online A/B test, PageLLM raises GMV by 0.44% and click-through rate by 0.14%, confirming that multi-grained rewards from implicit feedback scale to production WPO. Code and data are available at an anonymized repository.

URL PDF HTML ☆

赞 0 踩 0

2506.00181 2026-05-26 cs.LG stat.ML

On the Interaction of Batch Noise, Adaptivity, and Compression, under $(L_0,L_1)$-Smoothness: An SDE Approach

关于批噪声、自适应性和压缩在$(L_0,L_1)$-光滑性下的相互作用：一种SDE方法

Enea Monzio Compagnoni, Rustem Islamov, Frank Norbert Proske, Aurelien Lucchi, Antonio Orvieto, Eduard Gorbunov

AI总结本文通过随机微分方程（SDE）框架，在$(L_0,L_1)$-光滑性假设下统一分析分布式压缩SGD及其符号变体，揭示了梯度噪声、通信压缩和自适应更新之间的相互作用，并提出了新的SDE模型以准确捕捉学习率限制与几何特性的关系。

Comments Accepted at ICML 2026 (Poster)

详情

AI中文摘要

分布式随机优化交织了（i）随机梯度噪声、（ii）通信压缩和（iii）自适应/归一化更新。虽然每个因素已被单独研究，但在现实假设下它们的联合效应仍然知之甚少。在这项工作中，我们在最近引入的$(L_0, L_1)$-光滑性条件下，为分布式压缩SGD（DCSGD）及其符号变体分布式符号SGD（DSignSGD）开发了一个统一的理论框架。从概念角度，我们表明文献中的一阶和二阶修正方程不能准确建模离散时间步长/稳定性限制，特别是在$(L_0,L_1)$-光滑性下。从技术角度，我们通过将曲率相关项仔细纳入其漂移中，提出了新的一阶SDE：这有助于捕捉学习率限制、梯度噪声、压缩和损失景观几何之间的细粒度关系。重要的是，我们在一般梯度噪声假设下进行，包括重尾和仿射方差区域，这超出了经典的有限方差设置。我们的结果表明，归一化DCSGD的更新作为稳定性的自然条件出现，归一化程度由梯度噪声结构、景观正则性和压缩率精确决定。相比之下，DSignSGD即使在重尾噪声下也能以标准学习率调度收敛。这些发现共同提供了新的理论见解和视角，以及实践指导。

英文摘要

Distributed stochastic optimization intertwines (i) stochastic gradient noise, (ii) communication compression, and (iii) adaptive/normalized updates. While each factor has been studied in isolation, their joint effect under realistic assumptions remains poorly understood. In this work, we develop a unified theoretical framework for Distributed Compressed SGD (DCSGD) and its sign variant Distributed SignSGD (DSignSGD) under the recently introduced $(L_0, L_1)$-smoothness condition. From a conceptual perspective, we show that the first- and second-order modified equations from the literature do not accurately model the discrete-time step-size/stability restrictions, especially under $(L_0,L_1)$-smoothness. From a technical perspective, we propose new first-order SDEs by carefully incorporating curvature-dependent terms into their drift: This helps capture the fine-grained relationship between learning rate restrictions, gradient noise, compression, and the geometry of the loss landscape. Importantly, we do so under general gradient noise assumptions, including heavy-tailed and affine-variance regimes, which extend beyond the classical bounded-variance setting. Our results suggest that normalizing the updates of DCSGD emerges as a natural condition for stability, with the degree of normalization precisely determined by the gradient noise structure, the landscape's regularity, and the compression rate. In contrast, DSignSGD converges even under heavy-tailed noise with standard learning rate schedules. Together, these findings offer both new theoretical insights and perspectives, and practical guidance.

URL PDF HTML ☆

赞 0 踩 0

2505.23764 2026-05-26 cs.CV cs.CL

MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence

MMSI-Bench：多图像空间智能基准

Sihan Yang, Runsen Xu, Yiman Xie, Sizhe Yang, Mo Li, Jingli Lin, Chenming Zhu, Xiaochen Chen, Haodong Duan, Xiangyu Yue, Dahua Lin, Tai Wang, Jiangmiao Pang

AI总结提出MMSI-Bench基准，通过1000道精心设计的VQA问题评估多图像空间推理能力，发现现有模型准确率远低于人类。

Comments ICLR 2026 Camera ready. 38 pages. Project page: https://runsenxu.com/projects/MMSI_Bench

详情

AI中文摘要

空间智能对于在复杂物理世界中运行的多模态大语言模型（MLLMs）至关重要。然而，现有基准仅探测单图像关系，无法评估实际部署所需的多图像空间推理。我们引入MMSI-Bench，一个专用于多图像空间智能的VQA基准。六位3D视觉研究人员花费超过300小时，从超过12万张图像中精心制作了1000个具有挑战性、无歧义的多选题，每个问题都配有精心设计的干扰项和逐步推理过程。我们进行了大量实验，评估了37个开源和专有MLLMs，观察到巨大差距：最强的开源模型准确率约30%，OpenAI的GPT-5推理模型达到40%，而人类得分为97%。这些结果凸显了MMSI-Bench的挑战性以及未来研究的巨大空间。利用注释的推理过程，我们还提供了一个自动错误分析流程，诊断出四种主要失败模式，包括（1）接地错误，（2）重叠匹配和场景重建错误，（3）情境转换推理错误，以及（4）空间逻辑错误，为推进空间智能提供了见解。项目页面：https://runsenxu.com/projects/MMSI_Bench 。

英文摘要

Spatial intelligence is essential for multimodal large language models (MLLMs) operating in the complex physical world. Existing benchmarks, however, probe only single-image relations and thus fail to assess the multi-image spatial reasoning that real-world deployments demand. We introduce MMSI-Bench, a VQA benchmark dedicated to multi-image spatial intelligence. Six 3D-vision researchers spent more than 300 hours meticulously crafting 1,000 challenging, unambiguous multiple-choice questions from over 120,000 images, each paired with carefully designed distractors and a stepwise reasoning process. We conduct extensive experiments and evaluate 37 open-source and proprietary MLLMs, observing a wide gap: the strongest open-source model attains roughly 30% accuracy and OpenAI's GPT-5 reasoning model reaches 40%, while humans score 97%. These results underscore the challenging nature of MMSI-Bench and the substantial headroom for future research. Leveraging the annotated reasoning processes, we also provide an automated error analysis pipeline that diagnoses four dominant failure modes, including (1) grounding errors, (2) overlap-matching and scene-reconstruction errors, (3) situation-transformation reasoning errors, and (4) spatial-logic errors, offering insights for advancing spatial intelligence. Project page: https://runsenxu.com/projects/MMSI_Bench .

URL PDF HTML ☆

赞 0 踩 0

2505.18979 2026-05-26 cs.LG

Dynamic Optimization and Safety Indicator Injection for Jailbreaking Text-to-Image Models with Multimodal Safety Filters

动态优化与安全指示注入：针对多模态安全过滤器的文本到图像模型越狱方法

Zixuan Chen, Hao Lin, Ke Xu, Xinghao Jiang, Tanfeng Sun

AI总结提出OptJail框架，通过动态提示优化与自适应安全指示注入，绕过文本和图像过滤器，实现高成功率越狱，并揭示多模态防御的系统性漏洞。

详情

AI中文摘要

文本到图像（T2I）模型可能生成不安全内容，促使采用包含文本和图像过滤器的多阶段安全流水线。新型基于LLM的过滤器能检测关键词之外的潜在意图，使得令牌级扰动攻击不可靠。我们的评估进一步表明，现有越狱方法在绕过过滤器和保持语义保真度之间存在尖锐权衡，同时需要过多查询才能成功。我们提出 extbf{OptJail}，一种自动化越狱框架，结合动态提示优化与多模态反馈。它包含两个关键组件：(i) extit{动态优化}，一种迭代过程，利用文本过滤器反馈和语义一致性将提示改写为对抗变体；(ii) extit{自适应安全指示注入}，将良性视觉线索的注入建模为强化学习问题，以绕过图像级过滤器。OptJail实现了最先进的性能，将ShieldLM-7B的绕过率从8.9%（Sneakyprompt）提高到99.0%，CLIP分数从0.2637提升到0.2762。此外，它能泛化到未见过的过滤器，并在我们的评估中成功越狱DALL·E 3。机制分析揭示了这些防御失败的原因：优化后的提示被投影到过滤器表示空间的“安全”区域，但在生成模型的语义空间中几乎保持静止；注入的安全指示将图像检测器的注意力从不安全内容转向良性视觉线索。本研究揭示了当前多模态防御的系统性漏洞，并激励更强的自适应防御。

英文摘要

Text-to-image (T2I) models can generate not-safe-for-work (NSFW) content, motivating multi-stage safety pipelines with both text and image filters. Newer LLM-based filters detect latent intent beyond keywords, making token-level perturbation attacks unreliable. Our evaluation further shows that existing jailbreak methods exhibit a sharp trade-off between filter evasion and semantic fidelity, while also requiring excessive queries to succeed. We introduce \textbf{OptJail}, an automated jailbreak framework that combines dynamic prompt optimization with multimodal feedback. It consists of two key components: (i) \textit{Dynamic Optimization}, an iterative process that leverages text-filter feedback and semantic consistency to rewrite prompts into adversarial variants; and (ii) \textit{Adaptive Safety Indicator Injection}, which formulates the injection of benign visual cues as a reinforcement learning problem to bypass image-level filters. OptJail achieves state-of-the-art performance, increasing the ShieldLM-7B bypass rate from 8.9\% (Sneakyprompt) to 99.0\%, improving CLIP score from 0.2637 to 0.2762. Moreover, it generalizes to unseen filters and successfully jailbreaks DALL E 3 in our evaluation. Mechanistic analysis reveals why these defenses fail: optimized prompts are projected into the ``safe'' region of the filter's representation space yet remain nearly stationary in the generative model's semantic space, and injected safety indicators redirect image detectors' attention away from NSFW content toward benign visual cues. This study reveals systemic vulnerabilities in current multimodal defenses and motivates stronger adaptive defenses.

URL PDF HTML ☆

赞 0 踩 0

2505.13878 2026-05-26 cs.LG cs.CL

InfiFPO: Implicit Model Fusion via Preference Optimization in Large Language Models

InfiFPO：通过偏好优化实现大型语言模型的隐式模型融合

Yanggan Gu, Yuanyi Wang, Zhaoyi Yan, Yiming Zhang, Qi Zhou, Fei Wu, Hongxia Yang

AI总结提出InfiFPO方法，通过将DPO中的参考模型替换为融合源模型，在序列级别合成多源概率，实现隐式模型融合，从而在偏好对齐阶段有效融合多个LLM并提升性能。

详情

Journal ref: NeurIPS 2025

AI中文摘要

模型融合通过轻量训练方法将具有不同优势的多个大型语言模型（LLM）组合成一个更强大的集成模型。现有的模型融合工作主要关注监督微调（SFT），而偏好对齐（PA）——增强LLM性能的关键阶段——在很大程度上未被探索。当前少数在PA阶段的融合方法（如WRPO）通过仅利用源模型的响应输出而丢弃其概率信息来简化过程。为了解决这一局限性，我们提出了InfiFPO，一种用于隐式模型融合的偏好优化方法。InfiFPO将直接偏好优化（DPO）中的参考模型替换为一个融合源模型，该模型在序列级别合成多源概率，从而规避了先前工作中复杂的词汇对齐挑战，同时保留了概率信息。通过引入概率裁剪和最大边际融合策略，InfiFPO使枢轴模型能够与人类偏好对齐，同时有效地从源模型中蒸馏知识。在11个广泛使用的基准上的综合实验表明，InfiFPO始终优于现有的模型融合和偏好优化方法。当使用Phi-4作为枢轴模型时，InfiFPO在11个基准上的平均性能从79.95提升至83.33，显著增强了其在数学、编码和推理任务上的能力。

英文摘要

Model fusion combines multiple Large Language Models (LLMs) with different strengths into a more powerful, integrated model through lightweight training methods. Existing works on model fusion focus primarily on supervised fine-tuning (SFT), leaving preference alignment (PA) --a critical phase for enhancing LLM performance--largely unexplored. The current few fusion methods on PA phase, like WRPO, simplify the process by utilizing only response outputs from source models while discarding their probability information. To address this limitation, we propose InfiFPO, a preference optimization method for implicit model fusion. InfiFPO replaces the reference model in Direct Preference Optimization (DPO) with a fused source model that synthesizes multi-source probabilities at the sequence level, circumventing complex vocabulary alignment challenges in previous works and meanwhile maintaining the probability information. By introducing probability clipping and max-margin fusion strategies, InfiFPO enables the pivot model to align with human preferences while effectively distilling knowledge from source models. Comprehensive experiments on 11 widely-used benchmarks demonstrate that InfiFPO consistently outperforms existing model fusion and preference optimization methods. When using Phi-4 as the pivot model, InfiFPO improve its average performance from 79.95 to 83.33 on 11 benchmarks, significantly improving its capabilities in mathematics, coding, and reasoning tasks.

URL PDF HTML ☆

赞 0 踩 0

2505.03677 2026-05-26 cs.LG

Neural Integral Operators for Inverse Problems: An Operator-Learning Framework for Small-Sample Spectroscopic Classification

逆问题的神经积分算子：小样本光谱分类的算子学习框架

Emanuele Zappala, Alice Giola, Andreas Kramer, Saugat Acharya, Enrico Greco

AI总结提出神经积分算子（NIO）框架，通过参数化Urysohn核和蒙特卡洛采样隐式正则化，在小样本光谱分类任务中优于传统机器学习和深度学习基线。

Comments 20 pages. 4 figures, 3 tables. v2: Link to code repository added. v3: Article largely reorganized and several portions rewritten for clarity. Comments are welcome

详情

AI中文摘要

在软计算中，学习具有强归纳偏置的函数空间映射是一个核心挑战，尤其是在训练数据稀缺且标准深度架构过拟合的情况下。我们引入了一种基于第一类积分方程的\emph{神经积分算子}（NIO）框架，其中算子的Urysohn核由前馈网络~$G_{θ_G}$参数化，潜在函数由卷积编码器~$E_{ϕ_E}$生成，两者通过交叉熵损失进行端到端联合训练。学习算子的积分通过蒙特卡洛采样近似，我们认为这充当了在被积函数层面操作的隐式随机正则化器，补充了权重衰减和dropout等参数级正则化器。我们在三个不同规模和复杂度的真实世界光谱分类任务（FT-IR水果泥、NIR肉类、NIR纺织品）上对该框架进行了基准测试，并与传统机器学习（决策树、支持向量机，有无UMAP）和现代深度学习基线（FFNN、CNN+FFNN、浅层CNN、Transformer）进行了比较。所提出的NIO在所有数据集和指标上始终位居前两名，在最具挑战性的小样本复杂数据集（纺织品）上取得了最佳结果，并且在数据稀缺情况下比竞争深度模型具有更低的性能方差。结果表明，当传统深度学习方法受限于数据稀缺时，具有随机数值积分的算子学习架构是光谱学中逆问题的一种可行的软计算策略。

英文摘要

Learning maps between function spaces with a strong inductive bias is a central challenge in soft computing, especially when training data are scarce and standard deep architectures overfit. We introduce a \emph{neural integral operator} (NIO) framework based on integral equations of the first kind, in which the Urysohn kernel of the operator is parameterized by a feed-forward network~$G_{θ_G}$ and the latent function is produced by a convolutional encoder~$E_{ϕ_E}$, both trained jointly end-to-end via cross-entropy loss. The integral defining the learned operator is approximated by Monte Carlo sampling, which we argue acts as an implicit stochastic regularizer operating at the level of the integrand and complementing parameter-level regularizers such as weight decay and dropout. We benchmark the framework on three real-world spectroscopic classification tasks (FT-IR fruit purees, NIR meat, NIR textiles) of varying size and complexity, against traditional machine learning (decision tree, support vector machine, with and without UMAP) and modern deep learning baselines (FFNN, CNN+FFNN, shallow CNN, transformer). The proposed NIO is consistently among the top two performing models across all datasets and metrics, achieves the best results on the most challenging small-and-complex dataset (Textile), and yields lower performance variance than competing deep models in the small-data regime. The results suggest that operator-learning architectures with stochastic numerical integration are a viable soft-computing strategy for inverse problems in spectroscopy when conventional deep learning approaches are limited by data scarcity.

URL PDF HTML ☆

赞 0 踩 0

2503.11657 2026-05-26 cs.CL

Scaling Natural-Language Graph-Based Test Time Compute for Automated Theorem Proving

扩展基于自然语言图结构的测试时计算用于自动定理证明

Vincent Li, Tim Knappe, Yule Fu, Kevin Han, Kevin Zhu

AI总结提出KG-prover框架，利用从权威数学文本挖掘的知识图谱增强通用大语言模型，通过扩展图结构的测试时计算显著提升自动定理证明性能。

Comments Accepted to ICML AI4Math Workshop 2025, NAACL SRW 2025

详情

AI中文摘要

大型语言模型在需要多步逻辑推理的自然语言处理任务（如自动定理证明）中展现出卓越能力。然而，定理证明中仍存在挑战，例如识别关键数学概念、理解其相互关系以及在自然语言中正确形式化证明。我们提出KG-prover，一种新颖框架，利用从权威数学文本挖掘的知识图谱来增强通用大语言模型，以构建和形式化数学证明。我们还研究了使用KG-Prover扩展基于图结构的测试时计算的效果，在多个数据集上展示了相比基线显著的性能提升。结合KG-Prover，通用大语言模型在miniF2F-test上提升高达21%，在ProofNet、miniF2F-test和MUSTARD数据集上持续提升2-11%。此外，使用o4-mini的KG-Prover在pass miniF2F-test上达到50%。这项工作为无需额外微调即可利用知识图谱增强自然语言证明推理提供了一种有前景的方法。

英文摘要

Large language models have demonstrated remarkable capabilities in natural language processing tasks requiring multi-step logical reasoning capabilities, such as automated theorem proving. However, challenges persist within theorem proving, such as the identification of key mathematical concepts, understanding their interrelationships, and formalizing proofs correctly within natural language. We present KG-prover, a novel framework that leverages knowledge graphs mined from reputable mathematical texts to augment general-purpose LLMs to construct and formalize mathematical proofs. We also study the effects of scaling graph-based, test-time compute using KG-Prover, demonstrating significant performance improvements over baselines across multiple datasets. General-purpose LLMs improve up to 21\% on miniF2F-test when combined with KG-Prover, with consistent improvements ranging from 2-11\% on the ProofNet, miniF2F-test, and MUSTARD datasets. Furthermore, KG-Prover with o4-mini achieves 50\% on pass miniF2F-test. This work provides a promising approach for augmenting natural language proof reasoning with knowledge graphs without the need for additional finetuning.

URL PDF HTML ☆

赞 0 踩 0

2502.07278 2026-05-26 cs.CV

Articulate That Object Part (ATOP): 3D Part Articulation via Text and Motion Personalization

Articulate That Object Part (ATOP): 通过文本和运动个性化实现3D部件关节运动

Aditya Vora, Sauradip Nag, Kai Wang, Hao Zhang

AI总结提出ATOP方法，利用文本提示和运动个性化，通过少样本微调扩散模型生成运动样本，并借助可微渲染优化部件关节参数，实现静态3D对象的部件关节运动。

Comments Accepted to ACM Transactions of Graphics, 2026

详情

AI中文摘要

我们提出ATOP（Articulate That Object Part），一种基于运动个性化的少样本新方法，用于根据文本提示中指定的部件及其运动来关节化静态3D对象。由于缺乏带有运动属性标注的数据集，现有方法在此任务中难以很好地泛化。在我们的工作中，文本输入使我们能够利用现代扩散模型的能力，为正确的对象类别和部件生成合理的运动样本。反过来，输入的3D对象提供“图像提示”，以将生成的运动个性化到该输入对象。我们的方法从少样本微调开始，将关节感知注入当前的扩散模型，以学习与目标对象部件相关的唯一运动标识符。我们的微调应用于预训练的扩散模型，用于可控的多视图运动生成，并使用一小部分参考运动帧（展示适当的部件运动）进行训练。得到的运动模型随后可用于从多个视角实现输入3D对象的合理运动。最后，我们通过可微渲染将个性化运动转移到对象的3D空间，通过分数蒸馏采样损失优化部件关节参数。在PartNet-Mobility和ACD数据集上的实验表明，与先前的少样本方法相比，我们的方法可以生成具有更高准确性的真实运动样本，从而产生更具泛化性的3D运动预测。

英文摘要

We present ATOP (Articulate That Object Part), a novel few-shot method based on motion personalization to articulate a static 3D object with respect to a part and its motion as prescribed in a text prompt. Given the scarcity of available datasets with motion attribute annotations, existing methods struggle to generalize well in this task. In our work, the text input allows us to tap into the power of modern-day diffusion models to generate plausible motion samples for the right object category and part. In turn, the input 3D object provides ``image prompting'' to personalize the generated motion to the very input object. Our method starts with a few-shot finetuning to inject articulation awareness to current diffusion models to learn a unique motion identifier associated with the target object part. Our finetuning is applied to a pre-trained diffusion model for controllable multi-view motion generation, trained with a small collection of reference motion frames demonstrating appropriate part motion. The resulting motion model can then be employed to realize plausible motion of the input 3D object from multiple views. At last, we transfer the personalized motion to the 3D space of the object via differentiable rendering to optimize part articulation parameters by a score distillation sampling loss. Experiments on PartNet-Mobility and ACD datasets demonstrate that our method can generate realistic motion samples with higher accuracy, leading to more generalizable 3D motion predictions compared to prior approaches in the few-shot setting.

URL PDF HTML ☆

赞 0 踩 0

2410.15173 2026-05-26 cs.CL cs.AI

Uncovering Autoregressive LLM Knowledge of Thematic Fit in Event Representation

揭示自回归LLM在事件表示中主题适配性的知识

Safeyah Khaled Alshemali, Daniel Bauer, Yuval Marton

AI总结通过多种提示设计、输入上下文操作、推理和输出形式，研究自回归大语言模型是否具有一致且可表达的事件参数主题适配性知识，并在基准测试上取得新最优结果。

Comments Significant update with massive changes: all experiments rerun with current LLMs; includes new probability estimate analysis and expanded results in Sections 4 and 5. The paper has been accepted to CoNLL-2026

2409.03777 2026-05-26 cs.CV cs.LG

A Greedy Hierarchical Approach to Whole-Network Filter-Pruning in CNNs

一种面向CNN全网络滤波器剪枝的贪婪层次方法

Kiran Purohit, Anurag Reddy Parvathgari, Sourangshu Bhattacharya

AI总结提出一种基于线性近似的两层层次化贪婪剪枝算法，通过低层滤波器选择和全局剪枝准则高效剪枝，在多个网络上优于现有方法。

Comments Accepted in TMLR 2024

详情

AI中文摘要

深度卷积神经网络（CNN）在许多计算机视觉任务中取得了令人印象深刻的表现。然而，它们的大模型尺寸需要大量计算资源，因此从预训练的CNN中剪枝冗余滤波器是开发资源受限设备高效模型的关键任务。全网络滤波器剪枝算法从每层剪枝不同比例的滤波器，从而提供更大的灵活性。当前的全网络剪枝方法要么因需要使用训练数据集计算每个剪枝滤波器的损失而计算成本高昂，要么使用各种启发式/学习标准来确定每层的剪枝比例。本文提出了一种高效的两级层次化全网络滤波器剪枝方法，该方法使用分类损失作为最终标准。低级算法（称为滤波器剪枝）使用基于滤波器权重线性近似的稀疏近似公式。我们探索了两种算法：基于正交匹配追踪的贪婪选择和贪婪反向剪枝方法。反向剪枝算法使用一种新颖的闭式误差标准，在每个阶段高效选择最优滤波器，从而使整个算法更快。高级算法（称为层选择）使用全局剪枝准则贪婪地选择最佳剪枝层（使用滤波器选择算法进行剪枝）。我们针对两种不同的全局剪枝准则提出了算法：（1）逐层相对误差（HBGS），和（2）最终分类误差（HBGTS）。我们的算法套件在ResNet18、ResNet32、ResNet56、VGG16和ResNext101上优于最先进的剪枝方法。我们的方法将ResNext101的RAM需求从7.6 GB降低到1.5 GB，并在CIFAR-10上实现了94%的FLOPS减少而不损失精度。

英文摘要

Deep convolutional neural networks (CNNs) have achieved impressive performance in many computer vision tasks. However, their large model sizes require heavy computational resources, making pruning redundant filters from existing pre-trained CNNs an essential task in developing efficient models for resource-constrained devices. Whole-network filter pruning algorithms prune varying fractions of filters from each layer, hence providing greater flexibility. Current whole-network pruning methods are either computationally expensive due to the need to calculate the loss for each pruned filter using a training dataset, or use various heuristic / learned criteria for determining the pruning fractions for each layer. This paper proposes a two-level hierarchical approach for whole-network filter pruning which is efficient and uses the classification loss as the final criterion. The lower-level algorithm (called filter-pruning) uses a sparse-approximation formulation based on linear approximation of filter weights. We explore two algorithms: orthogonal matching pursuit-based greedy selection and a greedy backward pruning approach. The backward pruning algorithm uses a novel closed-form error criterion for efficiently selecting the optimal filter at each stage, thus making the whole algorithm much faster. The higher-level algorithm (called layer-selection) greedily selects the best-pruned layer (pruning using the filter-selection algorithm) using a global pruning criterion. We propose algorithms for two different global-pruning criteria: (1) layer-wise relative error (HBGS), and (2) final classification error (HBGTS). Our suite of algorithms outperforms state-of-the-art pruning methods on ResNet18, ResNet32, ResNet56, VGG16, and ResNext101. Our method reduces the RAM requirement for ResNext101 from 7.6 GB to 1.5 GB and achieves a 94% reduction in FLOPS without losing accuracy on CIFAR-10.

URL PDF HTML ☆

赞 0 踩 0

2407.05682 2026-05-26 cs.CL

Retrieved In-Context Principles from Previous Mistakes

从先前错误中检索的上下文原则

Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang

AI总结提出检索式上下文原则（RICP）框架，通过教师模型分析学生模型错误生成原则，聚类错误以增强覆盖，检索相关错误生成问题级原则，提升定制化，无需推理时干预，在七个推理基准上提升多种提示策略性能。

详情

AI中文摘要

上下文学习（ICL）在使用正确输入-输出示例将大型语言模型（LLMs）适应下游任务方面发挥了重要作用。最近的进展试图通过从错误中推导出的原则来改进模型性能，但这些方法缺乏定制化和错误覆盖不足。为了解决这些限制，我们提出了检索式上下文原则（RICP），一种新颖的教师-学生框架。在RICP中，教师模型分析学生模型的错误，生成避免类似错误的原因和见解。这些错误根据其根本原因进行聚类，以制定任务级原则，增强原则的错误覆盖。在推理过程中，为每个问题检索最相关的错误以创建问题级原则，提高所提供指导的定制化。RICP与现有提示方法正交，且在推理期间不需要教师模型的干预。在七个推理基准上的实验结果表明，RICP在应用于各种提示策略时有效提升了性能。

英文摘要

In-context learning (ICL) has been instrumental in adapting Large Language Models (LLMs) to downstream tasks using correct input-output examples. Recent advances have attempted to improve model performance through principles derived from mistakes, yet these approaches suffer from lack of customization and inadequate error coverage. To address these limitations, we propose Retrieved In-Context Principles (RICP), a novel teacher-student framework. In RICP, the teacher model analyzes mistakes from the student model to generate reasons and insights for preventing similar mistakes. These mistakes are clustered based on their underlying reasons for developing task-level principles, enhancing the error coverage of principles. During inference, the most relevant mistakes for each question are retrieved to create question-level principles, improving the customization of the provided guidance. RICP is orthogonal to existing prompting methods and does not require intervention from the teacher model during inference. Experimental results across seven reasoning benchmarks reveal that RICP effectively enhances performance when applied to various prompting strategies.

URL PDF HTML ☆

赞 0 踩 0

2407.01328 2026-05-26 cs.CV

CSFNet: A Cosine Similarity Fusion Network for Real-Time RGB-X Semantic Segmentation of Driving Scenes

CSFNet: 用于驾驶场景实时RGB-X语义分割的余弦相似度融合网络

Danial Qashqai, Emad Mousavian, Shahriar Baradaran Shokouhi, Sattar Mirzakuchaki

AI总结提出CSFNet，通过余弦相似度注意力融合模块（CS-AFM）高效融合双模态特征，实现实时且高精度的RGB-X语义分割。

详情

DOI: 10.1016/j.engappai.2026.114362
Journal ref: Engineering Applications of Artificial Intelligence, 174, 114362 (2026)

AI中文摘要

语义分割作为复杂视觉解释的关键组成部分，在自动驾驶视觉系统中起着基础作用。最近的研究通过利用互补信息和开发多模态方法显著提高了语义分割的准确性。尽管准确性有所提高，但多模态语义分割方法存在计算复杂度高和推理速度慢的问题。因此，在驾驶应用中实现多模态方法是一项具有挑战性的任务。为了解决这个问题，我们提出了余弦相似度融合网络（CSFNet）作为实时RGB-X语义分割模型。具体来说，我们设计了一个余弦相似度注意力融合模块（CS-AFM），该模块有效地校正和融合两种模态的特征。CS-AFM模块利用跨模态相似性实现高泛化能力。通过增强低层跨模态特征的融合，CS-AFM为在高层使用单分支网络铺平了道路。因此，我们在编码器中使用双分支和单分支架构，并结合高效的上下文模块和轻量级解码器以实现快速准确的预测。为了验证CSFNet的有效性，我们使用Cityscapes、MFNet和ZJU数据集进行RGB-D/T/P语义分割。结果表明，CSFNet在准确性与最先进方法相比具有竞争力，同时在多模态语义分割模型中速度达到最先进水平。由于其低参数数量和计算复杂度，它还实现了高效率。CSFNet的源代码将在https://github.com/Danial-Qashqai/CSFNet提供。

英文摘要

Semantic segmentation, as a crucial component of complex visual interpretation, plays a fundamental role in autonomous vehicle vision systems. Recent studies have significantly improved the accuracy of semantic segmentation by exploiting complementary information and developing multimodal methods. Despite the gains in accuracy, multimodal semantic segmentation methods suffer from high computational complexity and low inference speed. Therefore, it is a challenging task to implement multimodal methods in driving applications. To address this problem, we propose the Cosine Similarity Fusion Network (CSFNet) as a real-time RGB-X semantic segmentation model. Specifically, we design a Cosine Similarity Attention Fusion Module (CS-AFM) that effectively rectifies and fuses features of two modalities. The CS-AFM module leverages cross-modal similarity to achieve high generalization ability. By enhancing the fusion of cross-modal features at lower levels, CS-AFM paves the way for the use of a single-branch network at higher levels. Therefore, we use dual and single-branch architectures in an encoder, along with an efficient context module and a lightweight decoder for fast and accurate predictions. To verify the effectiveness of CSFNet, we use the Cityscapes, MFNet, and ZJU datasets for the RGB-D/T/P semantic segmentation. According to the results, CSFNet has competitive accuracy with state-of-the-art methods while being state-of-the-art in terms of speed among multimodal semantic segmentation models. It also achieves high efficiency due to its low parameter count and computational complexity. The source code for CSFNet will be available at https://github.com/Danial-Qashqai/CSFNet.

URL PDF HTML ☆

赞 0 踩 0

2404.00176 2026-05-26 cs.CL

The LSCD Benchmark: a Testbed for Diachronic Word Meaning Tasks

LSCD基准：一个用于历时词义任务的测试平台

Dominik Schlechtweg, Sachin Yadav, Jonas Kuhn, Nikolay Arefyev

AI总结针对词汇语义变化检测任务中评估标准不统一的问题，提出一个标准化基准库，通过模块化设计支持WiC、WSI和LSCD子任务的评估与组合。

Comments *SEM, 9 pages

详情

AI中文摘要

词汇语义变化检测（LSCD）是一个复杂的词元级任务，通常基于两个后续应用的用法级任务来操作：首先，为用法对推导出词在上下文（WiC）标签；然后，将这些标签表示在一个图上，应用词义归纳（WSI）来推导出义簇；最后，通过比较跨时间的义簇来推导出LSCD标签。这种模块化反映在大多数LSCD数据集和模型中。它也导致了建模选项和任务定义的高度异质性，而数据集版本、预处理选项和评估指标的多样性加剧了这种异质性。这种异质性使得在可比条件下评估模型、选择最优模型组合或复现结果变得困难。因此，我们提供了一个标准化LSCD评估的基准库。通过透明的实现，结果易于复现，并且通过标准化，不同的组件可以自由组合。该库通过允许对WiC、WSI和LSCD进行模型评估来反映任务的模块化。这允许对日益复杂的模型组件进行仔细评估，为模型优化提供新途径。

英文摘要

Lexical Semantic Change Detection (LSCD) is a complex, lemma-level task, which is usually operationalized based on two subsequently applied usage-level tasks: First, Word-in-Context (WiC) labels are derived for pairs of usages. Then, these labels are represented in a graph on which Word Sense Induction (WSI) is applied to derive sense clusters. Finally, LSCD labels are derived by comparing sense clusters over time. This modularity is reflected in most LSCD datasets and models. It also leads to a large heterogeneity in modeling options and task definitions, which is exacerbated by a variety of dataset versions, preprocessing options and evaluation metrics. This heterogeneity makes it difficult to evaluate models under comparable conditions, to choose optimal model combinations or to reproduce results. Hence, we provide a benchmark repository standardizing LSCD evaluation. Through transparent implementation results become easily reproducible and by standardization different components can be freely combined. The repository reflects the task's modularity by allowing model evaluation for WiC, WSI and LSCD. This allows for careful evaluation of increasingly complex model components providing new ways of model optimization.

URL PDF HTML ☆

赞 0 踩 0

2403.04780 2026-05-26 cs.CL cs.AI

Graph-oriented Instruction Tuning of Large Language Models for Generic Graph Mining

面向通用图挖掘的大语言模型图导向指令微调

Yanchao Tan, Hang Lv, Pengxiang Zhan, Shiping Wang, Carl Yang

AI总结提出MuseGraph框架，通过紧凑图描述、基于思维链的指令生成和图感知指令微调，将GNN与LLM结合，实现跨任务和数据集的高效图挖掘。

Comments Accepted by TPAMI 2025

详情

DOI: 10.1109/TPAMI.2025.3603062
Journal ref: IEEE Trans. Pattern Anal. Mach. Intell., vol. 48, no. 1, pp. 155-169, Jan. 2026

AI中文摘要

具有丰富属性的图对于建模互联实体和增强各种实际应用中的预测至关重要。传统的图神经网络（GNN）通常需要针对不同的图任务和数据集进行重新训练。尽管大语言模型（LLM）的出现为自然语言处理带来了新范式，但它们在通用图挖掘（即训练单个模型同时处理多样任务和数据集）方面的潜力仍未充分探索。为此，我们的新颖框架MuseGraph无缝地将GNN和LLM的优势整合到一个基础模型中，用于跨任务和数据集的图挖掘。该框架首先采用紧凑的图描述，在语言令牌限制内封装关键图信息。然后，我们提出了一种基于思维链（CoT）指令包的多样化指令生成机制，以从GPT-4等高级LLM中提取推理能力。最后，我们设计了一种图感知的指令微调策略，以促进多个任务和数据集之间的相互增强，同时防止LLM生成能力的灾难性遗忘。我们的实验结果表明，在五个图任务和十个数据集上取得了显著改进，展示了MuseGraph在提高图导向下游任务准确性的同时增强LLM生成能力的潜力。

英文摘要

Graphs with abundant attributes are essential in modeling interconnected entities and enhancing predictions across various real-world applications. Traditional Graph Neural Networks (GNNs) often require re-training for different graph tasks and datasets. Although the emergence of Large Language Models (LLMs) has introduced new paradigms in natural language processing, their potential for generic graph mining, training a single model to simultaneously handle diverse tasks and datasets, remains under-explored. To this end, our novel framework MuseGraph, seamlessly integrates the strengths of GNNs and LLMs into one foundation model for graph mining across tasks and datasets. This framework first features a compact graph description to encapsulate key graph information within language token limitations. Then, we propose a diverse instruction generation mechanism with Chain-of-Thought (CoT)-based instruction packages to distill the reasoning capabilities from advanced LLMs like GPT-4. Finally, we design a graph-aware instruction tuning strategy to facilitate mutual enhancement across multiple tasks and datasets while preventing catastrophic forgetting of LLMs' generative abilities. Our experimental results demonstrate significant improvements in five graph tasks and ten datasets, showcasing the potential of our MuseGraph in enhancing the accuracy of graph-oriented downstream tasks while improving the generation abilities of LLMs.

URL PDF HTML ☆

赞 0 踩 0