arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2163
2603.17305 2026-05-20 cs.AI cs.CL cs.LG

Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations

对比推理对齐:从隐藏表示中学习强化学习

Haozheng Luo, Yimin Wang, Jiahao Yu, Binghui Wang, Yan Chen

AI总结 本文提出了一种基于对比学习和强化学习的框架CRAFT,通过优化隐藏状态空间中的目标来提升对抗攻击的鲁棒性,核心贡献是通过隐藏空间的几何结构实现推理层面的安全对齐。

Comments International Conference on Machine Learning (ICML) 2026

详情
AI中文摘要

我们提出CRAFT,一种红队对齐框架,利用模型推理能力和隐藏表示来提高对jailbreak攻击的鲁棒性。与以往主要在输出层面操作的防御方法不同,CRAFT将大型推理模型对齐以生成安全意识的推理轨迹,通过显式优化定义在隐藏状态空间上的目标。方法上,CRAFT将对比表示学习与强化学习相结合,分离安全和不安全的推理轨迹,得到支持鲁棒、推理层面安全对齐的潜在空间几何。理论上,我们证明将潜在文本一致性纳入GRPO可以消除表面上对齐的策略,将其排除在局部最优之外。实验上,我们在多个安全基准上评估CRAFT,使用两个强大的推理模型Qwen3-4B-Thinking和R1-Distill-Llama-8B,其中它在多个安全基准上均优于IPO和SafeKey等最先进的防御方法。值得注意的是,CRAFT在基础模型上实现了平均79.0%的推理安全性和87.7%的最终响应安全性提升,证明了隐藏空间推理对齐的有效性。

英文摘要

We propose CRAFT, a red-teaming alignment framework that leverages model reasoning capabilities and hidden representations to improve robustness against jailbreak attacks. Unlike prior defenses that operate primarily at the output level, CRAFT aligns large reasoning models to generate safety-aware reasoning traces by explicitly optimizing objectives defined over the hidden state space. Methodologically, CRAFT integrates contrastive representation learning with reinforcement learning to separate safe and unsafe reasoning trajectories, yielding a latent-space geometry that supports robust, reasoning-level safety alignment. Theoretically, we show that incorporating latent-textual consistency into GRPO eliminates superficially aligned policies by ruling them out as local optima. Empirically, we evaluate CRAFT on multiple safety benchmarks using two strong reasoning models, Qwen3-4B-Thinking and R1-Distill-Llama-8B, where it consistently outperforms state-of-the-art defenses such as IPO and SafeKey. Notably, CRAFT delivers an average 79.0% improvement in reasoning safety and 87.7% improvement in final-response safety over the base models, demonstrating the effectiveness of hidden-space reasoning alignment.

2603.11768 2026-05-20 cs.AI

Governing Evolving Memory in LLM Agents: Risks, Mechanisms, and the Stability and Safety Governed Memory (SSGM) Framework

在LLM代理中治理演化的记忆:风险、机制以及稳定性与安全性的治理记忆(SSGM)框架

Chingkwun Lam, Jiaxin Li, Lingfei Zhang, Kuo Zhao

AI总结 本文研究了LLM代理中记忆治理的问题,提出了一种新的SSGM框架,通过一致性验证、时间衰减建模和动态访问控制来缓解记忆腐蚀风险,提高记忆系统的稳定性与安全性。

详情
AI中文摘要

长期记忆已成为自主大型语言模型(LLM)代理的基础组件,使连续适应、终身多模态学习和复杂推理成为可能。然而,随着记忆系统从静态检索数据库转变为动态、代理机制,关于记忆治理、语义漂移和隐私漏洞的关键问题变得突出。尽管最近的调查主要集中在记忆检索效率上,但它们大多忽略了在高度动态环境中记忆腐蚀的新兴风险。为了解决这些新兴挑战,我们提出了稳定性与安全性治理记忆(SSGM)框架,一种概念治理架构。SSGM通过在任何记忆巩固之前执行一致性验证、时间衰减建模和动态访问控制,将记忆演进与执行分离。通过形式分析和架构分解,我们展示了SSGM如何缓解拓扑诱导的知识泄漏,其中敏感上下文被固化到长期存储中,并帮助防止语义漂移,其中知识通过迭代总结退化。最终,这项工作提供了一个全面的记忆腐蚀风险分类法,并建立了部署安全、持久和可靠的代理记忆系统稳健治理范式。

英文摘要

Long-term memory has emerged as a foundational component of autonomous Large Language Model (LLM) agents, enabling continuous adaptation, lifelong multimodal learning, and sophisticated reasoning. However, as memory systems transition from static retrieval databases to dynamic, agentic mechanisms, critical concerns regarding memory governance, semantic drift, and privacy vulnerabilities have surfaced. While recent surveys have focused extensively on memory retrieval efficiency, they largely overlook the emergent risks of memory corruption in highly dynamic environments. To address these emerging challenges, we propose the Stability and Safety-Governed Memory (SSGM) framework, a conceptual governance architecture. SSGM decouples memory evolution from execution by enforcing consistency verification, temporal decay modeling, and dynamic access control prior to any memory consolidation. Through formal analysis and architectural decomposition, we show how SSGM can mitigate topology-induced knowledge leakage where sensitive contexts are solidified into long-term storage, and help prevent semantic drift where knowledge degrades through iterative summarization. Ultimately, this work provides a comprehensive taxonomy of memory corruption risks and establishes a robust governance paradigm for deploying safe, persistent, and reliable agentic memory systems.

2603.09473 2026-05-20 cs.RO cond-mat.mtrl-sci

Receptogenesis in a Vascularized Robotic Embodiment

血管化机器人躯体中的生成

Kadri-Ann Pankratov, Leonid Zinatullin, Hans Priks, Adele Metsniit, Urmas Johanson, Tarmo Tamm, Alvo Aabloo, Edoardo Sinibaldi, Indrek Must

AI总结 本文提出了一种通过动态材料重构实现机器人身体内在功能适应和发展的方法,通过利用流体ics重构材料界面,实现基于环境提示的传感器生成,展示了在血管化机器人复合材料中通过光聚合物化实现材料级功能重构的可行性。

Comments Supplementary Files currently unavailable online. Please contact the First Author to request any Supplementary Files Version 2 - revision

详情
AI中文摘要

为机器人系统配备在操作过程中生成新硬件的能力扩展了对物理适应性的控制。不同于依赖于预或后部署离散组件集成的模块化系统,我们设想物理适应和发育可能来自动态材料重构以塑造身体内在功能。受生物体循环系统重新分配质量和功能的启发,我们利用流体ics重构材料界面,这是一种目前在机器人学中尚未具备的能力。在此,我们通过一种用于编程材料合成的血管化机器人复合材料实现了这一合成生长能力,通过生成物的生成——即基于环境提示从内部液体储备中按需构造传感器来演示。通过协调前体的流体运输与外部局部紫外线照射,我们驱动了原位光聚合,将前体转化为具有光敏感聚吡咯的固态分散体,从而在PETG中化学重构血管。该反应将具有光迟滞引发剂的前体转化为具有光敏感聚吡咯的固态分散体,在PETG中化学重构血管,从而建立了一种通过特征电阻降低验证的传感模式。新合成的传感器关闭了局部控制回路以调节受启发于飞蛾的机器人演示器的翅拍。这种物理更新在实时中增加了机器人的能力。血管化机器人躯体的材料级功能重构为在处于环境中机器人系统中生成新硬件提供了概念验证的材料基础——迈向能够自主产生硬件更新以匹配新环境需求的处于环境中机器人。

英文摘要

Equipping robotic systems with the capacity to generate $\textit{ex novo}$ hardware during operation extends control of physical adaptability. Unlike modular systems that rely on discrete component integration pre- or post-deployment, we envision the possibility that physical adaptation and development emerge from dynamic material restructuring to shape the body's intrinsic functions. Drawing inspiration from circulatory systems that redistribute mass and function in biological organisms, we utilize fluidics to restructure the material interface, a capability currently unpaired in robotics. Here, we realize this synthetic growth capability through a vascularized robotic composite designed for programmable material synthesis, demonstrated via receptogenesis - the on-demand construction of sensors from internal fluid reserves based on environmental cues. By coordinating the fluidic transport of precursors with external localized UV irradiation, we drive an $\textit{in situ}$ photopolymerization that chemically reconstructs the vasculature from the inside out. This reaction converts precursors with photolatent initiator into a solid dispersion of UV-sensitive polypyrrole in PETG, establishing a sensing modality validated by a characteristic decrease in electrical impedance. The newly synthesized sensor closed a local control loop to regulate wing flapping in a moth-inspired robotic demonstrator. This physical update increased the robot's capability in real time. Material-level functional restructuring of the vascularized robot body provides a proof-of-concept materials basis for $\textit{ex novo}$ hardware generation in situated robotic systems - a step toward situated robots in which a reaction to environmental stimuli autonomously produces hardware updates to match new environmental demands.

2603.05933 2026-05-20 cs.CL cs.LG

Structured Style-Rewrite with Chain-of-Thought Planning for Low-Resource Character Dialogue

结构化风格重写与思维链规划用于低资源字符对话

Chanhui Zhu

AI总结 本文提出了一种结构化风格重写框架,结合思维链规划,以解决低资源条件下中文字符驱动生成中的风格分离问题,通过分解角色风格为可解释的格式签名、语法和语用维度,并利用思维链监督进行显式风格规划,实验表明该方法在保持语义忠实性的同时提升了风格质量。

Comments 30 pages, 5 figures. Preprint

详情
AI中文摘要

将小型语言模型(SLMs)应用于中文字符驱动生成仍然具有挑战性,因为数据稀缺和分离角色风格困难。标准监督微调(SFT)通常捕捉到表层语义,但会产生频繁的越界输出(OOC)。我们将此问题框架为受控的句子级风格重写任务,该任务将风格质量与对话情境管理分离。我们提出了一种结构化风格重写框架,将角色风格分解为可解释的格式签名、语法和语用维度,并结合思维链(CoT)监督进行显式风格规划。一个CoT共享直接偏好优化(DPO)阶段进一步通过确保偏好学习目标输出层面的风格执行而非推理轨迹差异来对齐风格规划与表层实现。在八个角色四个不同源领域的实验中,我们的方法使Qwen3-1.7B模型在有效风格得分上达到0.632,同时保持强语义忠实性(0.878),在评估系统中处于帕累托前沿,并在消费级硬件上显著优于更大的基线(如GLM-4.7)

英文摘要

Applying Small Language Models (SLMs) to Chinese character-driven generation remains challenging due to data scarcity and the difficulty of disentangling character style. Standard Supervised Fine-Tuning (SFT) often captures surface-level semantics but produces frequent Out-Of-Character (OOC) outputs. We frame this as a controlled sentence-level style rewriting task, which isolates stylistic quality from dialogue context management. We propose a Structured Style-Rewrite Framework that decomposes character style into interpretable format signature, syntactic, and pragmatic dimensions, combined with Chain-of-Thought (CoT) supervision for explicit style planning. A CoT-Shared Direct Preference Optimization (DPO) stage further aligns style planning with surface realization by ensuring preference learning targets output-level style execution rather than reasoning trace differences. Experiments across eight characters from four diverse source domains demonstrate that our method enables a Qwen3-1.7B model to achieve a Valid Style Score of $0.632$ while maintaining strong semantic fidelity (0.878), placing on the Pareto frontier among the evaluated systems and outperforming significantly larger baselines (e.g., GLM-4.7) on consumer hardware.

2603.05910 2026-05-20 cs.AI

The World Won't Stay Still: Programmable Evolution for Agent Benchmarks

世界不会静止:为智能体基准测试的可编程进化

Guangrui Li, Yaochen Xie, Yi Liu, Ziwei Dong, Xingyuan Pan, Tianqi Zheng, Jason Choi, Michael J. Morais, Binit Jha, Shaunak Mishra, Bingrou Zhou, Chen Luo, Monica Xiao Cheng, Dawn Song

AI总结 本文研究了结构化环境进化作为智能体基准测试构建问题,提出了一种基于图的框架ProEvolve,使环境进化可编程,并在电商和航班预订领域验证了其在质量、实现有效性及失败模式方面的表现。

详情
AI中文摘要

LLM驱动的工具调用智能体通过与环境交互、查询数据和调用工具进行多轮过程来满足用户请求。然而,大多数现有基准测试在静态环境接口下评估这些系统,具有固定架构和工具集,难以评估智能体在环境演变时的行为——当能力被添加、重新组织或废弃时。在本文中,我们研究了结构化环境演变作为工具调用智能体的基准构建问题。我们提出了ProEvolve,一种基于图的框架,使环境演变可编程。其核心是一个类型关系图,为环境提供统一的显式表示——数据、工具和架构。在此形式化下,添加、删除或修改能力被表达为图变换,这些变换能一致地在工具、架构和数据访问之间传播更新。基于此,ProEvolve支持(1)通过显式图变换自动生成演变的可执行环境,以及(2)通过子图采样和实例化进行图引导的任务沙盒构建。我们通过两个工具调用领域——电商和航班预订——在质量、实现有效性和失败模式方面验证了ProEvolve。最后,我们使用生成的基准作为下游诊断,研究智能体在结构化环境演变下的代表性行为。

英文摘要

LLM-powered tool-calling agents fulfill user requests by interacting with environments, querying data, and invoking tools in a multi-turn process. Yet, most existing benchmarks evaluate these systems under static environment interfaces, with fixed schemas and toolsets, making it difficult to assess how agents behave as environments evolves -- when capabilities are added, reorganized, or deprecated across successive environment versions. In this paper, we study structured environment evolution as a benchmark-construction problem for tool-calling agents. We propose ProEvolve, a graph-based framework that makes environment evolution programmable. At its core, a typed relational graph provides a unified, explicit representation of the environment - data, tools, and schema. Under this formalism, adding, removing, or modifying capabilities are expressed as graph transformations that coherently propagate updates across tools, schemas, and data access. Building on this, ProEvolve supports (1) automatic generation of evolved executable environments through explicit graph transformations, and (2) graph-grounded construction of task sandboxes via subgraph sampling and instantiation. We validate ProEvolve in two tool-calling domains, e-commerce and airline booking, in terms of quality, implementation validity, and failure modes. Finally, we use the generated benchmark as a downstream diagnostic to study how representative agents behave under structured environment evolution.

2603.05066 2026-05-20 cs.LG

Reward-Conditioned Reinforcement Learning

基于奖励的强化学习

Michal Nauman, Marek Cygan, Pieter Abbeel

AI总结 本文提出基于奖励的强化学习(RCRL),通过在收集经验时使用单一名义目标,使智能体在不额外交互的情况下暴露于多种奖励目标,从而提高样本效率并支持零样本行为调整。

Comments preprint

详情
AI中文摘要

单任务强化学习代理通常在固定奖励函数下训练,这限制了它们对奖励误指定的鲁棒性和适应变化偏好的能力。我们引入基于奖励的强化学习(RCRL),一种离策略方法,该方法在收集经验时将智能体条件于奖励参数化,同时通过重新计算共享回放数据中的反事实奖励,使智能体暴露于多种奖励目标,而无需额外的环境交互。在单任务、多任务和基于视觉的基准测试中,RCRL在名义奖励参数化下提高了样本效率,能够高效适应新参数化,并在部署时支持零样本行为调整。我们的结果表明,RCRL提供了一种可扩展的机制,用于学习鲁棒且可操控的策略,而无需牺牲单任务训练的简洁性。

英文摘要

Single-task RL agents are typically trained under a fixed reward function, which limits their robustness to reward misspecification and their ability to adapt to changing preferences. We introduce Reward-Conditioned Reinforcement Learning (RCRL), an off-policy method that conditions agents on reward parameterizations while collecting experience under a single nominal objective. By recomputing counterfactual rewards from shared replay data, RCRL exposes the agent to multiple reward objectives without additional environment interaction, connecting single-task RL with ideas from multi-objective and multi-task learning. Across single-task, multi-task, and vision-based benchmarks, RCRL improves sample efficiency under the nominal reward parameterization, enables efficient adaptation to new parameterizations, and supports zero-shot behavioral adjustment at deployment. Our results show that RCRL provides a scalable mechanism for learning robust, steerable policies without sacrificing the simplicity of single-task training.

2602.20700 2026-05-20 cs.CV

NGL: Natural Garment Language for Training-Free Sewing Pattern Estimation

NGL:自然服装语言用于无训练缝纫图案估计

Anna Badalyan, Pratheba Selvaraju, Giorgio Becherini, Omid Taheri, Victoria Fernandez Abrevaya, Michael Black

AI总结 本研究提出NGL自然服装语言,通过利用视觉语言模型的自然描述能力,实现无训练的缝纫图案估计,解决了传统方法在泛化能力、真实世界关联性和多层服装处理上的不足。

Comments 12 pages, 7 figures

详情
AI中文摘要

从图像估计缝纫图案是创建高质量3D服装的实用方法,但受限于真实世界图像和缝纫图案配对数据的稀缺性而具有挑战性。现有方法通过训练视觉语言模型(VLMs)从参数化服装模型采样的合成服装中学习低级缝纫图案表示来解决这一限制。然而,这些方法往往难以泛化到野外图像,无法捕捉真实世界服装部件之间的关联,并且局限于单层服装。相比之下,我们发现VLMs在描述服装时表现良好,但将这些描述映射到有效的缝纫图案仍然困难。为此,我们提出了NGL(自然服装语言),一种针对VLMs的领域特定语言,能够以与VLMs的自然描述能力对齐的方式表示服装。利用NGL,我们引入了一条完全无训练的流程,通过查询大型VLMs提取结构化的服装规格,并确定性地将其转换为有效的缝纫图案。我们在Dress4D、CloSe以及一个包含253张野外时尚图像的新数据集上评估了我们的方法。我们的方法在标准几何度量上实现了最先进的性能,并在人类和基于GPT的感知评估中优于现有基线。此外,NGL能够恢复多层服装,而竞争方法主要集中在单层服装上,突显了其在处理有遮挡部分的真实世界图像时的强大泛化能力。这些结果表明,高效的服装表示对于使用VLMs进行缝纫图案估计至关重要。我们的代码和数据将供研究使用。

英文摘要

Estimating sewing patterns from images is a practical approach for creating high-quality 3D garments, but it remains challenging due to the scarcity of paired real-world image and sewing-pattern data. Existing methods address this limitation by training vision-language models (VLMs) to learn low-level sewing-pattern representations from synthetic garments sampled from parametric garment models. However, they often struggle to generalize to in-the-wild images, fail to capture real-world correlations between garment parts, and are restricted to single-layer outfits. In contrast, we observe that VLMs are effective at describing garments in natural language, but mapping these descriptions into valid sewing patterns remains difficult. To bridge this gap, we propose NGL (Natural Garment Language), a novel domain-specific language that represents garments in terms aligned with VLMs' natural descriptive abilities. Leveraging NGL, we introduce a fully training-free pipeline that queries large VLMs to extract structured garment specifications and deterministically converts them into valid sewing patterns. We evaluate our method on the Dress4D, CloSe and a newly collected dataset of 253 in-the-wild fashion images. Our approach achieves state-of-the-art performance on standard geometry metrics and is preferred in both human and GPT-based perceptual evaluations compared to existing baselines. Furthermore, NGL recovers multi-layer outfits whereas competing methods focus mostly on single-layer garments, highlighting its strong generalization to real-world images even with occluded parts. These results demonstrate that an efficient garment representation is critical for sewing pattern estimation with VLMs. Our code and data will be released for research use.

2602.14778 2026-05-20 cs.CL cs.AI cs.CY

A Geometric Analysis of Small-sized Language Model Hallucinations

小尺寸语言模型幻觉的几何分析

Emanuele Ricco, Elia Onofri, Lorenzo Cima, Stefano Cresci, Roberto Di Pietro

AI总结 本文从几何角度分析小尺寸语言模型幻觉问题,提出APORIA框架,通过句子嵌入空间研究重复提示下的响应,发现真实响应比幻觉响应更紧密聚类,并通过APORIA-LP方法实现高效分类,同时发布SOCRATES-300K数据集以支持进一步研究。

Comments 30 pages, 12 figures, 14 tables, accepted as regular paper at ICML'26

详情
AI中文摘要

幻觉--合理但事实错误的响应--对大型语言模型(LLMs)的可靠性构成重大挑战,尤其是在多步骤或代理设置中。现有研究大多将幻觉视为知识缺失的结果;我们显示,即使相关事实知识存在,模型仍会产生幻觉响应,这指向检索不稳定而非知识缺口。基于这一观察,我们引入APORIA(Aggregate Prompt-wise Observation Retrieving Instability via Asymmetry--幻觉所体现的困惑-矛盾状态),一种几何框架,研究相同提示的重复响应在句子嵌入空间中的情况。我们的核心假设是真实响应比幻觉响应更紧密聚类;我们实证验证了这一点,并显示经过Fisher投影后,两种响应类别变得一致可分离。我们通过APORIA-LP方法利用几何中的不对称性,这是一种高效的标签传播方法,通过最少30-50次注释即可对大量响应进行分类,在十种小尺寸LLM上实现超过90%的F1分数。为支持进一步研究,我们发布了SOCRATES-300K数据集,包含300,000个完全标注的响应,以及数据集生成和结果复现的代码。我们的关键发现--从嵌入空间的几何角度分析幻觉--补充了传统知识中心和单响应评估范式,为进一步研究铺平道路。

英文摘要

Hallucinations -- plausible but factually incorrect responses -- pose a major challenge to the reliability of Large Language Models (LLMs), especially in multi-step or agentic settings. Existing work largely frames hallucinations as a consequence of missing knowledge; we show instead that, even when the relevant factual knowledge is present, models still produce hallucinated answers, pointing to retrieval instability rather than knowledge gaps. Building on this observation, we introduce APORIA (Aggregate Prompt-wise Observation Retrieving Instability via Asymmetry -- the state of puzzlement-in-contradiction that hallucinations embody), a geometric framework that studies repeated responses to the same prompt in sentence-embedding space. Our central hypothesis is that genuine responses cluster more tightly than hallucinated ones; we empirically validate this and show that, after Fisher projection, the two response classes become consistently separable. We leverage this asymmetry in geometry via APORIA-LP, an efficient label-propagation method that classifies large collections of responses from as few as 30--50 annotations, achieving F1 scores above 90% across ten small-sized LLMs. To support further research, we release SOCRATES-300K, a fully labelled dataset of 300,000 responses, together with the code for both dataset generation and result reproduction. Our key finding -- framing hallucinations from a geometric perspective in the embedding space -- complements traditional knowledge-centric and single-response evaluation paradigms, paving the way for further research.

2602.14594 2026-05-20 cs.CL

The Wikidata Query Logs Dataset

Wikidata查询日志数据集

Sebastian Walter, Hannah Bast

AI总结 本文提出了一个包含335,000个问题-查询对的Wikidata查询日志(WDQL)数据集,该数据集通过真实世界中的SPARQL查询构建,无需模板生成查询,且规模是现有类似数据集的11倍多。通过代理方法去匿名化、清理和验证查询,生成对应的自然语言问题,用于训练问答方法。

Comments Accepted for publication at SIGIR 2026

详情
AI中文摘要

我们介绍了Wikidata查询日志(WDQL)数据集,该数据集包含335,000个关于Wikidata知识图谱的问题-查询对。它比现有类似格式的数据集大11倍以上,而无需依赖模板生成的查询。相反,我们通过发送到Wikidata查询服务的真实SPARQL查询构建该数据集,并为这些查询生成问题。由于这些基于日志的查询被匿名化,因此往往不产生结果,因此需要大量努力将它们转换回有意义的SPARQL查询。为此,我们提出了一种基于代理的方法,该方法迭代地去匿名化、清理和验证查询,并与Wikidata进行比对,同时生成相应的自然语言问题。我们展示了该数据集在训练问答方法方面的益处。所有WDQL资产以及代理代码均通过https://github.com/ad-freiburg/wikidata-query-logs公开发布,并采用宽松的许可证。

英文摘要

We present the Wikidata Query Logs (WDQL) dataset, a dataset consisting of 335k question-query pairs over the Wikidata knowledge graph. It is over 11x larger than the largest existing Wikidata datasets of similar format without relying on template-generated queries. Instead, we construct it using real-world SPARQL queries sent to the Wikidata Query Service and generate questions for them. Since these log-based queries are anonymized, and therefore often do not produce results, a significant amount of effort is needed to convert them back into meaningful SPARQL queries. To achieve this, we present an agent-based method that iteratively de-anonymizes, cleans, and verifies queries against Wikidata while also generating corresponding natural-language questions. We demonstrate the benefit of this dataset for training question-answering methods. All WDQL assets, as well as the agent code, are publicly available via https://github.com/ad-freiburg/wikidata-query-logs under a permissive license.

2602.10933 2026-05-20 cs.LG

CMAD: Cooperative Multi-Agent Diffusion via Stochastic Optimal Control

CMAD:通过随机最优控制的协作多智能体扩散

Riccardo Barbano, Alexander Denker, Zeljko Kereta, Runchang Li, Francisco Vargas

AI总结 本文提出了一种新的框架,将多模型组合生成问题转化为协作随机最优控制问题,通过联合优化扩散轨迹来实现更有效的生成效果。

详情
AI中文摘要

连续时间生成模型在图像恢复和合成中取得了显著成功。然而,控制多个预训练模型的组合仍是一个开放性挑战。当前方法大多将组合视为概率密度的代数组合,如通过概率密度的产品或专家混合。这种观点假设目标分布已知,这几乎从未发生。在本文中,我们提出了一种不同的范式,将组合生成视为协作随机最优控制问题。与其结合概率密度,我们把预训练的扩散模型视为相互作用的智能体,其扩散轨迹通过最优控制共同引导,朝着其聚合输出上定义的共享目标前进。我们在条件MNIST生成上验证了我们的框架,并将其与一个简单的基线进行比较,该基线在推理时间用每步梯度引导替代了学习的协作控制。

英文摘要

Continuous-time generative models have achieved remarkable success in image restoration and synthesis. However, controlling the composition of multiple pre-trained models remains an open challenge. Current approaches largely treat composition as an algebraic composition of probability densities, such as via products or mixtures of experts. This perspective assumes the target distribution is known explicitly, which is almost never the case. In this work, we propose a different paradigm that formulates compositional generation as a cooperative Stochastic Optimal Control problem. Rather than combining probability densities, we treat pre-trained diffusion models as interacting agents whose diffusion trajectories are jointly steered, via optimal control, toward a shared objective defined on their aggregated output. We validate our framework on conditional MNIST generation and compare it against a naïve inference-time DPS-style baseline replacing learned cooperative control with per-step gradient guidance.

2602.03924 2026-05-20 cs.LG cs.AI physics.ao-ph

WIND: Weather Inverse Diffusion for Zero-Shot Atmospheric Modeling

WIND:用于零样本大气建模的天气反向扩散

Michael Aich, Andreas Fürst, Florian Sestak, Carlos Ruiz-Gonzalez, Niklas Boers, Johannes Brandstetter

AI总结 本文提出WIND,一种统一的预训练基础模型,能够无需任务特定微调即可替代各种任务的专用基线,通过自监督视频重建目标预训练,实现了对大气的鲁棒、任务无关的先验学习,从而解决天气和气候问题,如概率预报、空间时间降尺度、从稀疏观测重建空间场以及强制全球干空气质量守恒。

Comments Published at the 43rd International Conference on Machine Learning (ICML 2026)

详情
AI中文摘要

深度学习已革新了天气预报,但仍有诸多挑战,包括气候建模。此外,当前领域仍然碎片化:高度专门化的模型通常为不同任务单独训练。为统一这一领域,我们引入WIND,一种单一预训练的基础模型,能够替代各种任务的专用基线。关键在于,与之前的气象基础模型不同,我们无需任何任务特定的微调。为了学习大气的鲁棒、任务无关的先验,我们使用无条件视频扩散模型预训练WIND,通过自监督视频重建目标迭代地从噪声状态重建大气动态。在推理时,我们将各种领域特定的问题严格视为反问题,并通过后验采样解决。这种统一的方法使我们能够解决高度相关的天气和气候问题,包括概率预报、空间和时间降尺度、从稀疏观测重建空间场以及强制全球干空气质量守恒。我们进一步展示了WIND如何在给定的非分布热力学扰动下用于探索极端天气事件。通过结合生成视频建模与反问题求解,WIND为基于AI的大气建模提供了一种计算高效的替代方案。

英文摘要

Deep learning has revolutionized weather forecasting, but many challenges remain, including climate modeling. Moreover, the current landscape remains fragmented: highly specialized models are typically trained individually for distinct tasks. To unify this landscape, we introduce WIND, a single pre-trained foundation model capable of replacing specialized baselines across a vast array of tasks. Crucially, in contrast to previous atmospheric foundation models, we achieve this without any task-specific fine-tuning. To learn a robust, task-agnostic prior of the atmosphere, we pre-train WIND with a self-supervised video reconstruction objective, utilizing an unconditional video diffusion model to iteratively reconstruct atmospheric dynamics from a noisy state. At inference, we frame diverse domain-specific problems strictly as inverse problems and solve them via posterior sampling. This unified approach allows us to tackle highly relevant weather and climate problems, including probabilistic forecasting, spatial and temporal downscaling, reconstruction of spatial fields from sparse observations and enforcing global dry air mass conservation. We further demonstrate how WIND can be applied to explore extreme weather events under prescribed out-of-distribution thermodynamic perturbations. By combining generative video modeling with inverse problem solving, WIND offers a computationally efficient alternative for AI-based atmospheric modeling.

2602.03839 2026-05-20 cs.LG

Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL

理解并利用权重更新稀疏性以实现通信高效的分布式强化学习

Erfan Miahi, Eugene Belilovsky

AI总结 本文研究了在带宽受限的分布式强化学习中,通过利用权重更新的稀疏性来减少通信开销,提出了一种名为PULSE的算法,通过计算可见稀疏化原则,实现了高效的权重同步和伪梯度同步。

Comments 40 pages, 19 figures, 14 tables

详情
AI中文摘要

带宽受限的分布式强化学习(RL)在大规模语言模型训练后受到两个通道的限制:从训练器到推理工人的权重同步,以及训练器之间的梯度或伪梯度同步。我们发现,在标准训练和推理前向传递中使用的BF16转换后,大约99%的每步权重更新在视觉上是不可见的。我们通过展示,在典型的RL训练后学习率下,Adam更新通常低于本地BF16舍入阈值,解释了这种稀疏性。我们将这一观察转化为一种名为计算可见稀疏化的算法原则:仅传输会改变下一个前向传递的更新。PULSE(Precision-gated Updates for Low-precision Sparse Exchange)将这一原则转化为两种通信算法:PULSESync从训练器向推理工发送无损稀疏BF16权重补丁,PULSELoCo通过误差反馈稀疏化DiLoCo风格的FP32伪梯度同步。在带宽受限的商用网络上,PULSESync在重建训练器权重位相同的情况下,将权重同步通信减少了超过100倍。PULSELoCo在四个模型上与DiLoCo相当,同时在训练器之间的通信减少了超过17倍,与DiLoCo相比,超过100倍,与DDP相比。

英文摘要

Bandwidth-constrained distributed reinforcement learning (RL) post-training of large language models is bottlenecked by two channels: weight synchronization from trainers to inference workers, and gradient or pseudo-gradient synchronization across trainers. We find that approximately 99% of per-step weight updates are invisible after the BF16 cast used by standard training and inference forward passes. We explain this sparsity by showing that, at typical RL post-training learning rates, Adam updates often fall below the local BF16 rounding threshold. We turn this observation into an algorithmic principle called compute-visible sparsification: transmit only updates that would change the next forward pass. PULSE (Precision-gated Updates for Low-precision Sparse Exchange) turns this principle into two communication algorithms: PULSESync sends lossless sparse BF16 weight patches from trainers to inference workers, and PULSELoCo sparsifies DiLoCo-style FP32 pseudo-gradient synchronization with error feedback. Over bandwidth-constrained commodity networks, PULSESync cuts weight-synchronization communication by over 100x while reconstructing trainer weights bit-identically. PULSELoCo matches DiLoCo across four models while reducing trainer-to-trainer communication by over 17x versus DiLoCo and over 100x versus DDP in the largest evaluated setting.

2601.20529 2026-05-20 cs.RO cs.MA

A Practical Framework of Key Performance Indicators for Multi-Robot Lunar and Planetary Field Tests

多机器人月球和行星实地测试的关键绩效指标实用框架

Julia Richter, David Oberacker, Gabriela Ligeza, Valentin T. Bickel, Philip Arm, William Talbot, Marvin Grosse Besselmann, Florian Kehl, Tristan Schnell, Hendrik Kolvenbach, Rüdiger Dillmann, Arne Roennau, Marco Hutter

AI总结 本文提出了一种基于多机器人月球场景的KPI框架,用于评估和比较不同实地测试的性能,强调效率、鲁棒性和精度的场景依赖性优先级,并通过实地测试验证了其在实际应用中的有效性。

Comments Presented at ICRA 2026 Workshop on Multi-Agent Robotic Systems: Real-World Collaboration and Interaction

详情
AI中文摘要

在月球上寻找关键资源(如钛铁矿、稀有地球元素和水冰)需要稳健的探索方法,鉴于多变的地形和恶劣的环境条件。尽管有许多类比实地测试旨在实现这些目标,但比较其结果仍然具有挑战性,因为机器人平台和实验设置存在差异。这些任务通常使用选定的、场景特定的工程度量来评估性能,但无法建立场性能与科学驱动目标之间的明确联系。在本文中,我们通过从三个现实的多机器人月球场景中推导出一个结构化的KPI框架来填补这一空白,这些场景反映了科学目标和操作约束。我们的框架强调效率、鲁棒性和精度的场景依赖性优先级,并且专门设计用于实际部署。我们通过多机器人实地测试验证了该框架,并发现其在效率和鲁棒性相关的KPI方面具有实用性和易用性,而精度导向的KPI则需要可靠的地面真实数据,这在户外类比环境中并不总是可行。总体而言,我们提出这个框架作为通用的评估标准,能够实现一致、目标导向的多机器人实地测试比较,并支持未来行星探索机器人系统的系统性发展。

英文摘要

Robotic prospecting for critical resources on the Moon, such as ilmenite, rare earth elements, and water ice, requires robust exploration methods given the diverse terrain and harsh environmental conditions. Although numerous analog field trials address these goals, comparing their results remains challenging because of differences in robot platforms and experimental setups. These missions typically assess performance using selected, scenario-specific engineering metrics that fail to establish a clear link between field performance and science-driven objectives. In this paper, we address this gap by deriving a structured framework of KPI from three realistic multi-robot lunar scenarios reflecting scientific objectives and operational constraints. Our framework emphasizes scenario-dependent priorities in efficiency, robustness, and precision, and is explicitly designed for practical applicability in field deployments. We validated the framework in a multi-robot field test and found it practical and easy to apply for efficiency- and robustness-related KPI, whereas precision-oriented KPI require reliable ground-truth data that is not always feasible to obtain in outdoor analog environments. Overall, we propose this framework as a common evaluation standard enabling consistent, goal-oriented comparison of multi-robot field trials and supporting systematic development of robotic systems for future planetary exploration.

2601.16200 2026-05-20 cs.LG cs.CV

Feature-Space Smoothing: Certified Robustness of Deep Representations

特征空间平滑:深度表示的认证鲁棒性

Song Xia, Meiwen Ding, Chenqi Kong, Wenhan Yang, Xudong Jiang

AI总结 本文提出了一种特征空间平滑(FS)框架,通过在特征表示层面提供认证鲁棒性,以解决深度学习模型对恶意输入的脆弱性问题,核心方法是通过特征平滑保证清洁和对抗特征之间的余弦相似度下界,并引入高斯平滑增强器(GSB)提升编码器的高斯鲁棒性得分,从而提升模型的鲁棒性并保持下游任务性能。

Comments Under review

详情
AI中文摘要

现代深度学习模型在多种应用中表现出强大的能力,但仍然容易受到通过特征空间扭曲诱导错误预测的恶意输入的攻击。为了解决这一脆弱性,我们提出了特征空间平滑(FS),一种通用的防御框架,该框架能够在特征表示层面提供认证鲁棒性。我们证明,FS将给定的特征编码器转换为一个平滑版本,该版本在l_2有界扰动下保证清洁和对抗特征之间的余弦相似度的认证下界。然后我们建立该特征余弦相似度下界(FCSB)可以扩展到预测层面的认证,其值由编码器内在的高斯鲁棒性得分决定。基于这些见解,我们引入了高斯平滑增强器(GSB),一个即插即用的模块,用于提升编码器的高斯鲁棒性得分。具体来说,GSB模块被插入以增强特征空间的一致性,并在高斯扰动下保持特征的实用性,以供下游任务使用。这种设计使FS能够无缝集成到受保护的模型上,例如多模态大语言模型(MLLMs),而无需额外的模型重新训练或对齐,从而在提升鲁棒性的同时保持下游任务的性能。广泛的实验表明,整合FS一致地提供了非平凡的认证鲁棒性,并在多种模型和应用中显著提高了面向任务的性能,即使在强白盒对抗攻击下也如此。

英文摘要

Modern deep learning models exhibit strong capabilities across diverse applications, yet remain vulnerable to malicious inputs that induce erroneous predictions via feature-space distortion. To address this vulnerability, we propose Feature-space Smoothing (FS), a general defense framework that provides certified robustness at the feature representation level. We show that FS converts a given feature encoder into a smoothed variant that is guaranteed to maintain a certified lower bound on the cosine similarity between clean and adversarial features under l_2-bounded perturbations. We then establish that this Feature Cosine Similarity Bound (FCSB) can be extended to the prediction-wise certification under the cosine similarity measure, and the value of FCSB is determined by the encoder intrinsic Gaussian robustness score. Building on those insights, we introduce the Gaussian Smoothness Booster (GSB), a plug-and-play module to improve the encoder Gaussian robustness score. Specifically, the GSB module is plugged to enhance the feature-space consistency and maintain the feature utility for downstream tasks under Gaussian perturbations. This design enables seamless integration of FS on the protected model, e.g., Multimodal Large Language Models (MLLMs), without additional model retraining or alignment, improving its robustness while preserving the performance for downstream task-oriented decoding. Extensive experiments demonstrate that integrating FS consistently provides non-trivial certified robustness and significantly improves task-oriented performance under strong white-box adversarial attacks across diverse models and applications.

2601.14848 2026-05-20 cs.LG cs.AI cs.NE cs.RO

From Observation to Prediction: LSTM for Vehicle Lane Change Forecasting on Highway On/Off-Ramps

从观测到预测:LSTM用于高速公路进出匝道的车辆车道变更预测

Mohamed Abouras, Catherine M. Elias

AI总结 本文研究了高速公路进出匝道区域与直线路段的区别,利用多层LSTM架构和ExiD无人机数据集训练模型,测试了不同预测时间范围和不同模型的工作流程,结果表明在4秒内预测准确率可达76%(匝道区域)和94%(一般高速公路场景).

详情
AI中文摘要

进出匝道是尽管引入了更高的高速公路交互变异水平但仍然被低估的道路部分。预测这些区域车辆的行为可以减少不确定性的影响并提高道路安全性。在本文中,研究了该感兴趣区域(AoI)与直线路段之间的差异。利用多层LSTM架构和ExiD无人机数据集训练AoI模型。在过程中测试了不同的预测时间范围和不同模型的工作流程。结果表明,在最大预测时间范围内,预测准确率在4秒内显示出巨大潜力,匝道区域的预测准确率从约76%开始,而一般高速公路场景的预测准确率在最大预测时间范围内达到94%。

英文摘要

On and off-ramps are understudied road sections even though they introduce a higher level of variation in highway interactions. Predicting vehicles' behavior in these areas can decrease the impact of uncertainty and increase road safety. In this paper, the difference between this Area of Interest (AoI) and a straight highway section is studied. Multi-layered LSTM architecture to train the AoI model with ExiD drone dataset is utilized. In the process, different prediction horizons and different models' workflow are tested. The results show great promise on horizons up to 4 seconds with prediction accuracy starting from about 76% for the AoI and 94% for the general highway scenarios on the maximum horizon.

2601.12696 2026-05-20 cs.CL

UbuntuGuard: A Culturally-Grounded Policy Benchmark for Equitable AI Safety in African Languages

UbuntuGuard:一种基于文化的政策基准,用于非洲语言中的公平AI安全

Tassallah Abdullahi, Macton Mgonzo, Mardiyyah Oduwole, Paul Okewunmi, Abraham Owodunni, Ritambhara Singh, Carsten Eickhoff

AI总结 本文提出UbuntuGuard,一种针对非洲语言的公平AI安全政策基准,通过来自155名领域专家的对抗性查询,构建了基于本地规范和文化期望的政策和参考响应,评估守护模型的性能。

Comments 15 pages

详情
AI中文摘要

当前的守护模型主要偏向西方,优化于高资源语言,使低资源非洲语言面临不断演变的危害、跨语言失败和文化不匹配。此外,大多数守护模型依赖于刚性的、预定义的安全类别,无法在多样化的语言和社会文化背景下泛化。实现稳健的安全需要灵活的、在运行时可执行的政策和基准,这些基准能反映本地规范、危害场景和文化期望。我们引入UbuntuGuard,这是首个基于政策的安全基准,用于非洲语言,由来自敏感领域的155名领域专家撰写的对抗性查询构建。从这些专家编写的查询中,我们推导出上下文特定的安全政策和参考响应,捕捉基于文化的危险信号,使守护模型的评估与政策对齐。我们评估了15个模型,包括7个通用大型语言模型和8个守护模型,涵盖三种不同的变体:静态、动态和多语言。我们的发现表明,现有的以英语为中心的基准高估了现实世界多语言安全,跨语言迁移提供了部分但不充分的覆盖,而动态模型虽然在推理时更能利用政策,但仍难以完全本地化非洲语言的环境。这些发现突显了对多语言、基于文化的安全基准的紧迫需求,以促进低资源语言中可靠和公平的守护模型的开发。

英文摘要

Current guardian models are predominantly Western-centric and optimized for high-resource languages, leaving low-resource African languages vulnerable to evolving harms, cross-lingual failures, and cultural misalignment. Moreover, most guardian models rely on rigid, predefined safety categories that fail to generalize across diverse linguistic and sociocultural contexts. Achieving robust safety requires flexible, runtime-enforceable policies and benchmarks that reflect local norms, harm scenarios, and cultural expectations. We introduce UbuntuGuard, the first policy-based safety benchmark for African languages built from adversarial queries authored by 155 domain experts across sensitive fields, including healthcare. From these expert-crafted queries, we derive context-specific safety policies and reference responses that capture culturally grounded risk signals, enabling policy-aligned evaluation of guardian models. We evaluate 15 models, comprising seven general-purpose LLMs and eight guardian models across three distinct variants: static, dynamic, and multilingual. Our findings reveal that existing English-centric benchmarks overestimate real-world multilingual safety, cross-lingual transfer provides partial but insufficient coverage, and dynamic models, while better equipped to leverage policies at inference time, still struggle to fully localize African-language contexts. These findings highlight the urgent need for multilingual, culturally grounded safety benchmarks to enable the development of reliable and equitable guardian models for low-resource languages.

2601.12373 2026-05-20 cs.CV cs.HC cs.RO

CD-TWINSAFE: A ROS-enabled Digital Twin for Scene Understanding and Safety Emerging V2I Technology

CD-TWINSAFE:一种基于ROS的数字孪生用于场景理解和安全新兴V2I技术

Amro Khaled, Farah Khaled, Omar Riad, Catherine M. Elias

AI总结 本文提出了一种基于V2I的数字孪生系统CD-TWINSAFE,用于自动驾驶车辆的场景理解和安全监控,通过同时运行的两个栈结构实现车辆侧的驾驶模块和数字孪生模块,利用立体相机和Unreal Engine 5构建场景复现,并通过ROS架构实现V2I通信。

详情
AI中文摘要

本文介绍了CD-TWINSAFE,一种基于V2I的自动驾驶车辆数字孪生系统。所提出的架构由两个同时运行的栈组成,一个是车载驾驶栈,包含立体相机用于场景理解,另一个是数字孪生栈,运行Unreal Engine 5的场景复制品并返回安全警报至驾驶舱。车载栈在车辆侧实现,包括两个主要自主模块:定位和感知。通过车载传感器获取车辆的位置和方向。此外,感知模块负责处理立体相机的20fps图像,并通过两个互补的管道理解场景,包括物体检测和特征提取,包括物体速度、偏转角以及安全指标时间到碰撞和时间头道。收集的数据通过ROS架构以自定义ROS2消息的形式发送到基础设施侧,并通过UDP链接在4G调制解调器上进行V2I通信。通过数字孪生监控环境,共享消息更新生成的ego车辆和检测到的对象的信息,基于实时的定位和感知数据。通过不同驾驶场景的测试来验证所提出架构的有效性和实时响应能力。

英文摘要

In this paper, the CD-TWINSAFE is introduced, a V2I-based digital twin for Autonomous Vehicles. The proposed architecture is composed of two stacks running simultaneously, an on-board driving stack that includes a stereo camera for scene understanding, and a digital twin stack that runs an Unreal Engine 5 replica of the scene viewed by the camera as well as returning safety alerts to the cockpit. The on-board stack is implemented on the vehicle side including 2 main autonomous modules; localization and perception. The position and orientation of the ego vehicle are obtained using on-board sensors. Furthermore, the perception module is responsible for processing 20-fps images from stereo camera and understands the scene through two complementary pipelines. The pipeline are working on object detection and feature extraction including object velocity, yaw and the safety metrics time-to-collision and time-headway. The collected data form the driving stack are sent to the infrastructure side through the ROS-enabled architecture in the form of custom ROS2 messages and sent over UDP links that ride a 4G modem for V2I communication. The environment is monitored via the digital twin through the shared messages which update the information of the spawned ego vehicle and detected objects based on the real-time localization and perception data. Several tests with different driving scenarios to confirm the validity and real-time response of the proposed architecture.

2601.12358 2026-05-20 cs.CV cs.AI cs.RO

From Prompts to Pavement: LMMs-based Agentic Behavior-Tree Generation Framework for Autonomous Vehicles

从提示到道路:基于大语言模型的代理行为树生成框架用于自动驾驶车辆

Omar Y. Goba, Ahmed Y. Gado, Catherine M. Elias, Ahmed Hussein

AI总结 本文提出了一种基于大语言模型和多模态视觉模型的代理行为树生成框架,用于自动驾驶车辆在复杂环境中自适应导航。该框架通过链式符号提示评估场景关键性,通过上下文学习构建高层子目标,并通过生成器合成可执行的BT子树,实现了在CARLA+Nav2模拟中对突发障碍物(如道路堵塞)的成功绕行。

详情
AI中文摘要

自动驾驶车辆(AVs)需要适应性行为规划器来安全地导航不可预测的现实环境。传统的行为树(BTs)提供结构化决策逻辑,但本质上是静态的,并且需要大量人工调优,限制了其在SAE Level 5自主性中的应用。本文提出了一种代理框架,利用大语言模型(LLMs)和多模态视觉模型(LVMs)来实时生成和适应BTs。一个专门的Descriptor代理使用链式符号提示来评估场景关键性,一个Planner代理通过上下文学习构建高层子目标,一个Generator代理合成可执行的BT子树。该系统集成到CARLA+Nav2模拟中,仅在基线BT失败时触发,展示了成功绕过突发障碍物(例如道路堵塞)的能力,无需人工干预。与静态BT基线相比,该方法是一种概念验证,能够扩展到多样的驾驶场景。

英文摘要

Autonomous vehicles (AVs) require adaptive behavior planners to navigate unpredictable, real-world environments safely. Traditional behavior trees (BTs) offer structured decision logic but are inherently static and demand labor-intensive manual tuning, limiting their applicability at SAE Level 5 autonomy. This paper presents an agentic framework that leverages large language models (LLMs) and multi-modal vision models (LVMs) to generate and adapt BTs on the fly. A specialized Descriptor agent applies chain-of-symbols prompting to assess scene criticality, a Planner agent constructs high-level sub-goals via in-context learning, and a Generator agent synthesizes executable BT sub-trees in XML format. Integrated into a CARLA+Nav2 simulation, our system triggers only upon baseline BT failure, demonstrating successful navigation around unexpected obstacles (e.g., street blockage) with no human intervention. Compared to a static BT baseline, this approach is a proof-of-concept that extends to diverse driving scenarios.

2601.03645 2026-05-20 cs.CL cs.CY

LLM-MC-Affect: LLM-Based Monte Carlo Modeling of Affective Trajectories and Latent Ambiguity for Interpersonal Dynamic Insight

LLM-MC-Affect: 基于大语言模型的蒙特卡洛建模:情感轨迹与潜在模糊性的人际动态洞察

Yu-Zheng Lin, Bono Po-Jen Shih, John Paul Martin Encinas, Elizabeth Victoria Abraham Achom, Karan Himanshu Patel, Jesus Horacio Pacheco, Sicong Shao, Jyotikrishna Dass, Soheil Salehi, Pratik Satam

AI总结 本文提出LLM-MC-Affect框架,通过概率建模方法,将情感视为连续的潜在概率分布,从而捕捉人际互动中的情感轨迹和潜在模糊性,为动态分析提供新的视角和方法。

Comments Accepted to the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)

详情
AI中文摘要

情感协调是人类互动的核心属性,决定了关系意义在实时中如何构建。尽管基于文本的情感推断已变得越来越可行,但先前的方法通常将情感视为个体说话者的确定性点估计,未能捕捉到相互交流中固有的主观性、潜在模糊性和序列耦合。我们引入LLM-MC-Affect,一种概率框架,将情感不视为静态标签,而是在情感空间上定义的连续潜在概率分布。通过利用随机LLM解码和蒙特卡洛估计,该方法近似这些分布,以推导出高保真的情感轨迹,明确量化中央情感倾向和感知模糊性。这些轨迹通过序列交叉相关和基于斜率的指标,实现了对人际耦合的结构化分析,识别说话者之间的领先或滞后影响。为了验证该方法的解释能力,我们利用教师-学生教学对话作为代表性案例研究,其中我们的定量指标成功地提炼出高水平的互动洞察,如有效的支架作用。这项工作建立了一种可扩展且可部署的路径,用于理解人际动态,提供了一种可推广的解决方案,其应用不仅限于教育,还扩展到更广泛的社会和行为研究。

英文摘要

Emotional coordination is a core property of human interaction that shapes how relational meaning is constructed in real time. While text-based affect inference has become increasingly feasible, prior approaches often treat sentiment as a deterministic point estimate for individual speakers, failing to capture the inherent subjectivity, latent ambiguity, and sequential coupling found in mutual exchanges. We introduce LLM-MC-Affect, a probabilistic framework that characterizes emotion not as a static label, but as a continuous latent probability distribution defined over an affective space. By leveraging stochastic LLM decoding and Monte Carlo estimation, the methodology approximates these distributions to derive high-fidelity sentiment trajectories that explicitly quantify both central affective tendencies and perceptual ambiguity. These trajectories enable a structured analysis of interpersonal coupling through sequential cross-correlation and slope-based indicators, identifying leading or lagging influences between interlocutors. To validate the interpretive capacity of this approach, we utilize teacher-student instructional dialogues as a representative case study, where our quantitative indicators successfully distill high-level interaction insights such as effective scaffolding. This work establishes a scalable and deployable pathway for understanding interpersonal dynamics, offering a generalizable solution that extends beyond education to broader social and behavioral research.

2512.24139 2026-05-20 cs.LG stat.ME

Colorful Pinball: Density-Weighted Quantile Regression for Conditional Guarantee of Conformal Prediction

Colorful Pinball:基于密度加权分位数回归的条件保证置信预测

Qianyi Chen, Bo Li

AI总结 本文提出了一种基于密度加权分位数回归的条件保证置信预测方法,通过改进标准置信预测的条件覆盖性能,提供更精确的非渐近保证。

Comments ICML 2026

详情
AI中文摘要

尽管置信预测提供了稳健的边缘覆盖保证,但实现特定输入的可靠条件覆盖仍然具有挑战性。虽然有限样本下无法获得精确的分布无关条件覆盖,但近期研究集中在改进标准置信程序的条件覆盖性能上。与针对放宽条件覆盖概念的方法不同,我们直接针对条件覆盖的均方误差,通过优化支撑许多置信方法的分位数回归组件来改进。利用泰勒展开,我们推导出一种尖锐的替代目标函数:密度加权pinball损失,其中权重由非置信分数的条件密度在真实分位数处的值给出。我们提出了一种三头分位数网络,通过使用辅助分位数水平$1-α\pm δ$的有限差分估计这些权重,随后通过优化加权损失微调中心分位数。我们提供了具有精确非渐近保证的理论分析,刻画了由此产生的超额风险。在多样化的高维真实世界数据集上的广泛实验展示了在条件覆盖性能上的显著改进。

英文摘要

Although conformal prediction provides robust marginal coverage guarantees, achieving reliable conditional coverage for specific inputs remains challenging. While exact distribution-free conditional coverage is impossible with finite samples, recent work has focused on improving the conditional coverage of standard conformal procedures. Distinct from approaches that target relaxed notions of conditional coverage, we directly target the mean squared error of conditional coverage by refining the quantile regression components that underpin many conformal methods. Leveraging a Taylor expansion, we derive a sharp surrogate objective for quantile regression: a density-weighted pinball loss, where the weights are given by the conditional density of the nonconformity score evaluated at the true quantile. We propose a three-headed quantile network that estimates these weights via finite differences using auxiliary quantile levels at $1-α\pm δ$, subsequently fine-tuning the central quantile by optimizing the weighted loss. We provide a theoretical analysis with exact non-asymptotic guarantees characterizing the resulting excess risk. Extensive experiments on diverse high-dimensional real-world datasets demonstrate remarkable improvements in conditional coverage performance.

2512.20931 2026-05-20 cs.RO

Certifiable Alignment of GNSS and Local Frames via Lagrangian Duality

通过拉格朗日对偶实现GNSS与局部框架的可验证对齐

Baoshan Song, Matthew Giamou, Penggao Yan, Chunxi Xia, Li-Ta Hsu

AI总结 本文提出了一种全局最优求解器,通过将原始伪距或多普勒测量转换为凸松弛问题,实现了GNSS与局部框架的可验证对齐,解决了传统方法在GNSS退化环境下的局限性。

Comments Final version in RA-L

详情
AI中文摘要

估计局部系统相对于全球导航卫星系统(GNSS)参考的绝对对齐 often 遭遇局部极小值和对卫星可用性高度依赖的问题。现有方法对于此对齐任务依赖于大量卫星,无法在GNSS退化环境中使用,或使用局部优化方法无法保证解的最优性。本文介绍了一种全局最优求解器,将原始伪距或多普勒测量转换为凸松弛问题。所提出的方法是可验证的,意味着可以数值验证结果的正确性,填补了现有局部优化器无法保证最优性的空白。我们首先将原始框架对齐问题公式化为一个非凸二次约束二次规划(QCQP)问题,并将QCQP问题松弛为一个凹的拉格朗日对偶问题,为原问题提供一个下界成本。然后我们进行松弛紧密性和可观测性分析,推导出解的可验证最优性的标准。最后进行仿真和实际世界实验来评估所提出的方法。实验表明,即使只有2颗卫星的多普勒测量和2D车辆运动,我们的方法也能提供可验证的最优解,而传统基于速度的VOBA方法和先进的GVINS对齐技术可能会失败或收敛到局部极小值。为了支持机器人中的GNSS导航技术发展,所有代码和数据均在https://github.com/Baoshan-Song/Certifiable-Doppler-alignment上开源。

英文摘要

Estimating the absolute orientation of a local system relative to a global navigation satellite system (GNSS) reference often suffers from local minima and high dependency on satellite availability. Existing methods for this alignment task rely on abundant satellites unavailable in GNSS-degraded environments, or use local optimization methods which cannot guarantee the optimality of a solution. This work introduces a globally optimal solver that transforms raw pseudo-range or Doppler measurements into a convexly relaxed problem. The proposed method is certifiable, meaning it can numerically verify the correctness of the result, filling a gap where existing local optimizers fail. We first formulate the original frame alignment problem as a nonconvex quadratically constrained quadratic program (QCQP) problem and relax the QCQP problem to a concave Lagrangian dual problem that provides a lower cost bound for the original problem. Then we perform relaxation tightness and observability analysis to derive criteria for certifiable optimality of the solution. Finally, simulation and real world experiments are conducted to evaluate the proposed method. The experiments show that our method provides certifiably optimal solutions even with only 2 satellites with Doppler measurements and 2D vehicle motion, while the traditional velocity-based VOBA method and the advanced GVINS alignment technique may fail or converge to local optima without notice. To support the development of GNSS-based navigation techniques in robotics, all code and data are open-sourced at https://github.com/Baoshan-Song/Certifiable-Doppler-alignment.

2512.03869 2026-05-20 cs.CV cs.CY

An Automated Framework for Large-Scale Graph-Based Cerebrovascular Analysis

一种用于大规模基于图的脑血管分析的自动化框架

Daniele Falcetta, Liane S. Canas, Lorenzo Suppa, Matteo Pentassuglia, Jon Cleary, Marc Modat, Sébastien Ourselin, Maria A. Zuluaga

AI总结 本文提出了一种自动化脑血管分析框架,通过骨架化生成的图表示建模血管形态,并通过区域划分、中心线提取和图构建计算15种形态学、拓扑学、分形和几何特征,以多尺度方式表征脑血管组织。

Comments Accepted at IEEE ISBI 2026

详情
AI中文摘要

我们提出了CaravelMetrics,一种用于自动化脑血管分析的计算框架,通过骨架化生成的图表示建模血管形态。该框架整合了基于图谱的区域划分、中心线提取和图构建,以计算15种形态学、拓扑学、分形和几何特征。这些特征可以全局从完整的血管网络或区域内动脉territories估计,从而实现脑血管组织的多尺度表征。应用于IXI数据集中的570个3D TOF-MRA扫描(年龄20-86岁),CaravelMetrics产生可重复的血管图,捕捉年龄和性别相关变化以及教育程度相关的血管复杂性增加,与文献中的发现一致。该框架提供了一种可扩展且完全自动的定量脑血管特征提取方法,支持规范建模和群体水平的血管健康和衰老研究。

英文摘要

We present CaravelMetrics, a computational framework for automated cerebrovascular analysis that models vessel morphology through skeletonization-derived graph representations. The framework integrates atlas-based regional parcellation, centerline extraction, and graph construction to compute fifteen morphometric, topological, fractal, and geometric features. The features can be estimated globally from the complete vascular network or regionally within arterial territories, enabling multiscale characterization of cerebrovascular organization. Applied to 570 3D TOF-MRA scans from the IXI dataset (ages 20-86), CaravelMetrics yields reproducible vessel graphs capturing age- and sex-related variations and education-associated increases in vascular complexity, consistent with findings reported in the literature. The framework provides a scalable and fully automated approach for quantitative cerebrovascular feature extraction, supporting normative modeling and population-level studies of vascular health and aging.

2511.22940 2026-05-20 cs.CV

One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer

一对一动画:无对齐角色动画和图像姿态转换

Shijun Shi, Jing Xu, Zhihang Li, Chunli Peng, Xiaoda Yang, Lijing Lu, Kai Hu, Jiangning Zhang

AI总结 本文提出了一种统一框架,用于高保真角色动画和图像姿态转换,解决了参考姿态错位问题,通过自监督补全任务和混合参考融合注意力机制提升生成质量。

Comments Project Page:https://ssj9596.github.io/one-to-all-animation-project/

详情
AI中文摘要

最近扩散模型的进步极大地提高了基于姿态的角色动画效果。然而,现有方法受限于空间对齐的参考姿态对和匹配的骨骼结构。处理参考姿态错位仍是一个未解决的问题。为此,我们提出了One-to-All Animation,一种统一框架,用于高保真的角色动画和图像姿态转换,适用于任意布局的参考。首先,为了处理空间错位的参考,我们将训练重新公式化为自监督的补全任务,将多样布局的参考转换为统一的遮挡输入格式。其次,为了处理部分可见的参考,我们设计了一个参考提取器用于全面的身份特征提取。进一步,我们整合了混合参考融合注意力机制以处理不同分辨率和动态序列长度。最后,从生成质量的角度,我们引入了身份鲁棒姿态控制,将外观与骨骼结构解耦以缓解姿态过拟合,并引入了一个令牌替换策略以实现连贯的长视频生成。大量实验表明,我们的方法优于现有方法。代码和模型可在https://github.com/ssj9596/One-to-All-Animation上获得。

英文摘要

Recent advances in diffusion models have greatly improved pose-driven character animation. However, existing methods are limited to spatially aligned reference-pose pairs with matched skeletal structures. Handling reference-pose misalignment remains unsolved. To address this, we present One-to-All Animation, a unified framework for high-fidelity character animation and image pose transfer for references with arbitrary layouts. First, to handle spatially misaligned reference, we reformulate training as a self-supervised outpainting task that transforms diverse-layout reference into a unified occluded-input format. Second, to process partially visible reference, we design a reference extractor for comprehensive identity feature extraction. Further, we integrate hybrid reference fusion attention to handle varying resolutions and dynamic sequence lengths. Finally, from the perspective of generation quality, we introduce identity-robust pose control that decouples appearance from skeletal structure to mitigate pose overfitting, and a token replace strategy for coherent long-video generation. Extensive experiments show that our method outperforms existing approaches. The code and model are available at https://github.com/ssj9596/One-to-All-Animation.

2511.21577 2026-05-20 cs.SD cs.AI

HarmonicAttack: An Adaptive Cross-Domain Audio Watermark Removal

HarmonicAttack: 一种自适应跨领域音频水印移除方法

Kexin Li, Xiao Hu, Ilya Grishchenko, David Lie

AI总结 本文提出HarmonicAttack,一种无需访问目标水印检测器的新型音频水印移除方法,通过训练通用模型来移除音频水印,同时在不同分布数据集上保持高感知质量。

Comments Under Review

详情
AI中文摘要

高质量的AI生成音频的可用性引发了诸如虚假信息活动和语音克隆欺诈等安全挑战。对抗AI生成音频的滥用的关键防御措施是通过水印标记,以便能够轻易区分真实音频。那些试图滥用AI生成音频的人可能会尝试移除音频水印,因此研究有效的水印移除技术对于客观评估音频水印的鲁棒性至关重要。先前的水印移除方案通常假设在移除过程中可以访问目标水印检测器。这种假设往往不切实际,可能导致对当前水印方案的过度自信。我们引入了HarmonicAttack,一种新的音频水印移除方法,它不需要访问目标水印算法。它只需要一组原始和水印样本来训练一个能够从音频样本中移除水印的通用模型。我们还发现,训练样本不需要与目标样本具有相同的分布,因为我们的攻击在面对非分布样本时具有最小的退化。与现有水印移除攻击相比,HarmonicAttack在移除最新方案(包括AudioSeal、WavMark、SilentCipher和AudioMarkNet)的水印方面更加有效,同时保持高感知质量。尽管HarmonicAttack是在LibriSpeech数据集上针对AudioSeal训练的,但它能够泛化到未见过的数据集和水印方案。例如,在VCTK上,HarmonicAttack对AudioMarkNet的识别准确率达到了92%,明显优于最佳基线的38%。在FMA上,HarmonicAttack对所有水印达到了100%的识别准确率,而最佳基线在AudioSeal上仅达到2%,在WavMark上达到44%。

英文摘要

The availability of high-quality, AI-generated audio raises security challenges such as misinformation campaigns and voice-cloning fraud. A key defense against the misuse of AI-generated audio is by watermarking it, so that it can be easily distinguished from genuine audio. Those seeking to misuse AI-generated audio may attempt to remove audio watermarks, so studying effective watermark removal techniques is critical to objectively evaluate the robustness of audio watermarks. Previous watermark removal schemes typically assume access to the target watermark detector during the removal process. This assumption is often impractical, which may lead to a false sense of confidence in current watermark schemes. We introduce HarmonicAttack, a novel audio watermark removal method that requires no access to the target watermark algorithm. It only needs a number of original and watermarked samples to train a general model capable of removing watermarks from audio samples. We also find that training samples do not need to share the same distribution as target samples, as our attack generalizes to out-of-distribution samples with minimal degradation. Compared with existing watermark removal attacks, HarmonicAttack is more effective at removing watermarks from state-of-the-art schemes, including AudioSeal, WavMark, SilentCipher, and AudioMarkNet, while maintaining high perceptual quality. Although HarmonicAttack is trained on the LibriSpeech dataset against AudioSeal, it generalizes across unseen datasets and watermarking schemes. For instance, on VCTK, HarmonicAttack achieves a 92% ASR against AudioMarkNet, substantially outperforming the best baseline at 38%. On FMA, HarmonicAttack reaches 100% ASR against all watermarks, whereas the best baseline achieves only 2% against AudioSeal and 44% against WavMark.

2511.18236 2026-05-20 cs.RO cs.SY eess.SY

APULSE: A Scalable Hybrid Algorithm for the RCSPP on Large-Scale Dense Graphs

APULSE:一种用于大规模密集图上RCSPP的可扩展混合算法

Nuno Soares, António Grilo

AI总结 本文提出APULSE算法,通过结合A*启发式搜索、Pulse式剪枝机制和时间桶策略,高效解决大规模密集图上的资源受限最短路径问题,展现出显著的可扩展性和鲁棒性。

Comments This version corrects keywords and reference [9]. 9 pages

详情
Journal ref
in IEEE Access, vol. 14, pp. 40690-40706, 2026
AI中文摘要

资源受限最短路径问题(RCSPP)是一个基础的NP难优化挑战,广泛应用于网络路由和自主导航等领域。该问题涉及在受预算限制的二次资源下寻找最小主成本路径。尽管存在各种RCSPP求解器,但它们在应用于复杂现实场景中常见的大型密集图时往往面临严重的可扩展性限制,使其在时间敏感的规划中不切实际。在无人地面车辆(UGVs)的任务规划等领域,这种挑战尤为突出。本文介绍APULSE,一种混合标签设置算法,旨在高效解决此类挑战性图中的RCSPP。APULSE结合了由A*启发式引导的最佳优先搜索、激进的Pulse式剪枝机制以及时间桶策略,以有效减少状态空间。通过使用大规模UGV规划场景的计算研究,APULSE与最先进的算法进行了基准测试。结果表明,APULSE在大型问题实例上能够以数量级更快的速度和更高的鲁棒性找到近最优解,特别是在竞争方法失败的情况下。这种优越的可扩展性使APULSE成为复杂大规模环境中的RCSPP有效解决方案,使其能够实现交互式决策支持和动态重新规划能力。

英文摘要

The resource-constrained shortest path problem (RCSPP) is a fundamental NP-hard optimization challenge with broad applications, from network routing to autonomous navigation. This problem involves finding a path that minimizes a primary cost subject to a budget on a secondary resource. While various RCSPP solvers exist, they often face critical scalability limitations when applied to the large, dense graphs characteristic of complex, real-world scenarios, making them impractical for time-critical planning. This challenge is particularly acute in domains like mission planning for unmanned ground vehicles (UGVs), which demand solutions on large-scale terrain graphs. This paper introduces APULSE, a hybrid label-setting algorithm designed to efficiently solve the RCSPP on such challenging graphs. APULSE integrates a best-first search guided by an A* heuristic with aggressive, Pulse-style pruning mechanisms and a time-bucketing strategy for effective state-space reduction. A computational study, using a large-scale UGV planning scenario, benchmarks APULSE against state-of-the-art algorithms. The results demonstrate that APULSE consistently finds near-optimal solutions while being orders of magnitude faster and more robust, particularly on large problem instances where competing methods fail. This superior scalability establishes APULSE as an effective solution for RCSPP in complex, large-scale environments, enabling capabilities such as interactive decision support and dynamic replanning.

2511.16062 2026-05-20 cs.LG

Gauge-Equivariant Graph Networks via Self-Interference Cancellation

通过自干扰消除的 gauge-等变图网络

Yoonhyuk Choi, Jiho Choi, Jiwoo Kang

AI总结 本文提出了一种通过自干扰消除的 gauge-等变图网络(GESC),该网络通过投影机制替代传统的加法聚合,有效处理异质图中的自干扰问题,从而提升模型在异质图上的表现。

详情
AI中文摘要

图神经网络(GNNs)在同质图上表现优异,但在异质图上常常因自我强化和相位不一致的信号而失效。我们提出了一种通过自干扰消除的 gauge-等变图网络(GESC),该网络通过投影机制替代传统的加法聚合。与以往依赖加法信息混合的磁性或 gauge-等变 GNN 不同,GESC 显式建模由于冗余低频成分产生的自干扰。我们表明现有 gauge 基础 GNN 中缺乏干扰处理是导致 gauge 传输下过平滑现象的主要原因。我们引入一个 U(1) 相位连接,随后进行秩-1 投影以在注意力之前抑制自平行成分,并引入一个考虑符号的门控来调节负向对齐的邻居。在多样化的图基准测试中,GESC 一致优于最近最先进的模型,同时提供了一个统一的、具有干扰意识的信息传递视角。我们的代码可在 https://github.com/ChoiYoonHyuk/GESC 上获得。

英文摘要

Graph Neural Networks (GNNs) excel on homophilous graphs but often fail under heterophily due to self-reinforcing and phase-inconsistent signals. We propose a \textbf{G}auge-\textbf{E}quivariant Graph Network with \textbf{S}elf-Interference \textbf{C}ancellation (GESC), which replaces additive aggregation with a projection-based interference mechanism. Unlike prior magnetic or gauge-equivariant GNNs that rely on additive message mixing, GESC explicitly models self-interference arising from redundant low-frequency components. We show that the absence of interference handling in existing gauge-based GNNs is a primary driver of oversmoothing under gauge transport. We introduce a $\mathrm{U}(1)$ phase connection followed by a rank-1 projection that suppresses self-parallel components before attention, and a sign-aware gate that regulates negatively aligned neighbors. Across diverse graph benchmarks, GESC consistently outperforms recent state-of-the-art models while offering a unified, interference-aware view of message passing. Our code is available at https://github.com/ChoiYoonHyuk/GESC.

2511.13174 2026-05-20 cs.LG

Warm-starting active-set solvers using graph neural networks

利用图神经网络进行主动集求解器的预热启动

Ella J. Schmidtobreick, Daniel Arnström, Paul Häusner, Jens Sjölund

AI总结 本文提出利用图神经网络预测双主动集求解器DAQP中的活跃约束,通过将二次规划问题表示为二分图来利用其结构特性,从而有效预热启动求解器,减少迭代次数,并在不同问题规模下展示出良好的泛化能力和可扩展性。

Comments Accepted at Learning for Dynamics and Control Conference (L4DC)

详情
AI中文摘要

二次规划(QP)求解器在实时控制和优化中被广泛使用,但其计算成本通常限制了在时间敏感设置中的应用。为了解决这一问题,我们提出了一种学习优化方法,利用图神经网络(GNN)来预测双主动集求解器DAQP中的活跃约束。我们的方法通过将QP表示为二分图,利用其结构特性,学习近似最优活跃集以有效预热启动求解器。在不同问题规模下,GNN始终比冷启动减少求解器迭代次数,同时性能与多层感知机基线相当。与基线相比,我们的基于GNN的方法在不同问题规模上训练后,能够泛化到未见过的维度,展示了灵活性和可扩展性。这些结果突显了结构感知学习在实时应用如模型预测控制中加速优化的潜力。

英文摘要

Quadratic programming (QP) solvers are widely used in real-time control and optimization, but their computational cost often limits applicability in time-critical settings. To resolve this, we propose a learning-to-optimize approach using graph neural networks (GNNs) to predict active constraints in the dual active-set solver DAQP. Our method exploits the structural properties of QPs by representing them as bipartite graphs and learns to approximate the optimal active set for effectively warm-starting the solver. Across varying problem sizes, the GNN consistently reduces the number of solver iterations compared to cold-starting, while performance is comparable to a multilayer perceptron baseline. In contrast to the baseline, our GNN-based approach trained on varying problem sizes generalizes to unseen dimensions, demonstrating flexibility and scalability. These results highlight the potential of structure-aware learning to accelerate optimization in real-time applications such as model predictive control.

2510.25897 2026-05-20 cs.CV cs.LG

MIRO: MultI-Reward cOnditioned pretraining improves T2I quality and efficiency

MIRO:多奖励条件预训练提升T2I质量和效率

Nicolas Dufour, Lucas Degeorge, Arijit Ghosh, Vicky Kalogeiton, David Picard

AI总结 MIRO通过在训练过程中对模型施加多个奖励,直接学习用户偏好,从而提升文本到图像生成的质量和效率,同时在GenEval组合基准和用户偏好评分上取得最佳成绩。

Comments Accepted at ICML 2026. Project page: https://nicolas-dufour.github.io/miro

详情
AI中文摘要

后训练文本到图像生成器的默认范式包括事后选择生成的图像,随后使用一个奖励模型进行训练以对齐生成器与奖励,通常为用户偏好。这会丢弃信息性数据,并且仅优化单一奖励,从而损害多样性、语义保真度和效率。相反,我们提出MIRO,一种在训练过程中对模型施加多个奖励的方法,从而让模型直接学习用户偏好。MIRO预训练不仅提高了生成图像的视觉质量,还加快了训练速度,在GenEval组合基准和用户偏好评分(PickAScore、ImageReward、HPSv2)上实现了最先进的性能。

英文摘要

The default paradigm of post-training text-to-image generators includes post-hoc selection of generated images, and subsequent training with one reward model to align the generator to the reward, typically user preference. This discards informative data as well as optimizes only for a single reward, hence harming diversity, semantic fidelity and efficiency. Instead, we propose MIRO, a method that conditions the model on multiple rewards during training, thus letting the model learn user preferences directly. MIRO pre-training both improves the visual quality of the generated images and speeds up the training, achieving state of the art on the GenEval compositional benchmark and user-preference scores (PickAScore, ImageReward, HPSv2).

2510.25348 2026-05-20 cs.LG cs.SI

Beyond Leakage and Complexity: Towards Realistic and Efficient Information Cascade Prediction

超越泄露和复杂性:迈向现实和高效的流行信息级联预测

Jie Peng, Rui Wang, Qiang Wang, Zhewei Wei, Bin Tong, Guan Wang, Bo Zheng

AI总结 本文针对信息级联流行度预测中的三个关键问题:时间泄露、数据集特征贫乏和计算效率低下,提出了一种时间有序分割策略、大规模电商级联数据集Taoke以及轻量级框架CasTemp,实现了在四个数据集上的最先进的性能和数量级的速度提升。

详情
AI中文摘要

信息级联流行度预测是分析社交网络内容扩散的关键问题。然而,当前的相关工作存在三个关键限制:(1)当前评估中的时间泄露——基于随机级联的分割允许模型访问未来信息,导致不现实的结果;(2)特征贫乏的数据集缺乏下游转换信号(例如点赞、评论或购买),这限制了更多实际应用;(3)复杂图方法的计算效率低下,需要数天的训练才能获得微小的改进。我们从任务设置、数据集构建和模型设计三个角度系统地解决这些挑战。首先,我们提出了一种时间有序的分割策略,将数据按时间顺序划分为连续的窗口,确保模型在没有未来信息泄露的情况下进行真正的预测任务。其次,我们引入了Taoke,一个大规模电商级联数据集,具有丰富的推广者/产品属性和真实的购买转换信号——捕捉从推广到货币化的完整扩散生命周期。第三,我们开发了CasTemp,一个轻量级框架,通过时间行走、基于Jaccard的邻居选择用于跨级联依赖性以及具有时间意识的注意力的GRU编码来高效建模级联动态。在无泄露评估下,CasTemp在四个数据集上实现了最先进的性能,具有数量级的速度提升。值得注意的是,它在预测第二阶段流行度转换方面表现优异——这是实际应用中至关重要的任务。

英文摘要

Information cascade popularity prediction is a key problem in analyzing content diffusion in social networks. However, current related works suffer from three critical limitations: (1) temporal leakage in current evaluation--random cascade-based splits allow models to access future information, yielding unrealistic results; (2) feature-poor datasets that lack downstream conversion signals (e.g., likes, comments, or purchases), which limits more practical applications; (3) computational inefficiency of complex graph-based methods that require days of training for marginal gains. We systematically address these challenges from three perspectives: task setup, dataset construction, and model design. First, we propose a time-ordered splitting strategy that chronologically partitions data into consecutive windows, ensuring models are evaluated on genuine forecasting tasks without future information leakage. Second, we introduce Taoke, a large-scale e-commerce cascade dataset featuring rich promoter/product attributes and ground-truth purchase conversions--capturing the complete diffusion lifecycle from promotion to monetization. Third, we develop CasTemp, a lightweight framework that efficiently models cascade dynamics through temporal walks, Jaccard-based neighbor selection for inter-cascade dependencies, and GRU-based encoding with time-aware attention. Under leak-free evaluation, CasTemp achieves state-of-the-art performance across four datasets with orders-of-magnitude speedup. Notably, it excels at predicting second-stage popularity conversions--a practical task critical for real-world applications.

2510.18924 2026-05-20 cs.LG cs.AI

Noise-corrected GRPO: From Noisy Rewards to Unbiased Gradients

噪声校正的GRPO:从噪声奖励到无偏梯度

Omar El Mansouri, Fathinah Asma Izzati, Mohamed El Amine Seddik, Salem Lahlou

AI总结 本文提出了一种噪声鲁棒的GRPO框架,通过校正奖励中的噪声来获得无偏梯度估计,从而提升强化学习在噪声环境中的性能。

详情
AI中文摘要

人类反馈的强化学习(RLHF)或可验证奖励(RLVR)是对大语言模型进行对齐或构建最新SOTA推理模型的标准范式,但其对不一致或错误奖励产生的噪声非常敏感。然而,此类噪声与广泛使用的基于组的策略优化方法之间的相互作用仍不为人知。我们引入了一种噪声鲁棒的组相对策略优化(GRPO)和正确执行GRPO(Dr.GRPO)框架,该框架明确将奖励损坏建模为伯努利噪声。我们的方法在估计奖励翻转概率后应用噪声校正,以消除学习信号的偏差,从而获得可证明无偏的梯度估计。理论分析表明,基于组的方法本质上可以缓解个体层面的噪声,而我们的校正策略增强了这种鲁棒性。实验表明,在应用我们的噪声校正到标准奖励模型使用时,数学和代码任务中均观察到一致的改进,特别是在现实奖励模型条件下,数学任务的准确性提高了高达6.7个百分点,代码任务提高了1.5个百分点。这项工作将监督学习中的标签噪声校正与现代RLHF相结合,提供了理论洞察和实用算法,以应对噪声现实世界部署。

英文摘要

Reinforcement learning from human feedback (RLHF) or verifiable rewards (RLVR), the standard paradigm for aligning LLMs or building recent SOTA reasoning models, is highly sensitive to noise from inconsistent or erroneous rewards. Yet, the interaction between such noise and widely used group-based policy optimization methods remains underexplored. We introduce a noise-robust Group Relative Policy Optimization (GRPO) and Done Right GRPO (Dr.GRPO) framework that explicitly models reward corruption as Bernoulli noise. Our method applies noise correction after estimating reward flip probabilities to debias the learning signal, yielding provably unbiased gradient estimates. Theoretical analysis shows that group-based methods inherently mitigate individual-level noise, and our correction strategy amplifies this robustness. Empirically, we observe consistent improvements across math and code tasks when applying our noise correction to standard reward model usage, with particular gains of up to 6.7 percentage points in accuracy on math tasks and 1.5 on code tasks under realistic reward model conditions. This work bridges label-noise correction from supervised learning with modern RLHF, offering both theoretical insights and a practical algorithm for noisy real-world deployment.