arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 4178
2606.00686 2026-06-02 cs.LG

Dialectics of Alignment: Harnessing Unsafe Knowledge for Dynamic Safety Routing

对齐的辩证法:利用不安全知识实现动态安全路由

Maryam Hashemzadeh, Jerry Huang, Minseon Kim, Marc-Alexandre Côté, Sarath Chandar

发表机构 * Chandar Research Lab(Chandar研究实验室) Mila – Quebec AI Institute(魁北克AI研究所) Université de Montréal(蒙特利尔大学) Microsoft Research(微软研究院) Polytechnique Montréal(蒙特利尔理工学院) Canada CIFAR AI Chair(加拿大CIFAR人工智能主席)

AI总结 提出SafeMoE框架,通过混合专家模型将不安全知识隔离到领域特定的低秩适配器中,并训练轻量级门控网络动态路由这些专家,在保持安全性的同时生成信息丰富的响应。

详情
AI中文摘要

大语言模型(LLM)对齐的主流范式通过擦除、过滤不安全数据或训练模型严格拒绝有害提示来运作。虽然这种方法能有效降低即时毒性,但根本上限制了模型的认识论范围,导致系统过度谨慎,对敏感但良性的查询输出无信息量的全面拒绝。在这项工作中,我们挑战了不安全数据必须丢弃的正统观念。我们提出了一种对齐的辩证方法,认为不安全数据编码了丰富的、领域特定的知识,对于细致、安全且信息丰富的生成至关重要。为实现这一点,我们引入了SafeMoE,一个混合专家(MoE)框架,将不安全知识隔离到仅在有害语料上训练的领域特定低秩适配器(LoRA专家)中。为了从这些不安全基元中综合安全性,我们使用最小、高度精选的安全信息响应集训练一个轻量级门控网络。在推理时,该路由器动态编排不安全专家,有效引导生成轨迹以利用其深层领域知识,同时严格执行安全约束。在严格的安全基准上的广泛实证评估表明,SafeMoE不仅更安全,安全响应率相对提高了20%以上(绝对增益超过15%),而且在安全性和危害性至关重要时能生成更具信息量的响应。此外,路由机制在未见领域和更广泛的安全任务上表现出强大的零样本泛化能力,无需领域特定监督。我们的发现表明对齐的范式转变:真正的安全不需要掩盖不安全知识,而是需要其受控整合。

英文摘要

The prevailing paradigm in large language model (LLM) alignment operates via erasure, filtering unsafe data or training models to strictly refuse harmful prompts. While effective at reducing immediate toxicity, this approach fundamentally constricts the model's epistemological scope, resulting in over-cautious systems that output uninformative blanket refusals to sensitive yet benign queries. In this work, we challenge the orthodoxy that unsafe data must be discarded. We propose a dialectical approach to alignment, positing that unsafe data encodes rich, domain specific knowledge critical for nuanced, safe, and informative generation. To operationalize this, we introduce SafeMoE, a Mixture-of-Experts (MoE) framework that isolates unsafe knowledge into domain-specific Low-Rank Adapters (LoRA experts) trained exclusively on harmful corpora. To synthesize safety from these unsafe primitives, we train a lightweight gating network using a minimal, highly curated set of safe-informative responses. During inference, this router dynamically orchestrates the unsafe experts, effectively steering the generation trajectory to harness their deep domain knowledge while strictly enforcing safety constraints. Extensive empirical evaluations across stringent safety benchmarks demonstrate that SafeMoE is not only safer, achieving over a 20% relative improvement in safe response rate (more than a 15% absolute gain), but also produces more informative responses when safety and harmfulness are of paramount concern. Furthermore, the routing mechanism exhibits strong zero-shot generalization to unseen domains and broader safety tasks without domain-specific supervision. Our findings suggest a paradigm shift in alignment: true safety requires not the masking of unsafe knowledge, but its controlled integration.

2606.00685 2026-06-02 cs.LG

Prior-Guided Multi-Omic Transformers for Single-Cell Gene Regulatory Network Inference

先验引导的多组学Transformer用于单细胞基因调控网络推断

Tianyang Xu, Tianci Liu, Niraj Rayamajhi, Ryan Patrick, Kranthi Varala, Ying Li, Jing Gao

发表机构 * Elmore Family School of Electrical and Computer Engineering(埃尔莫夫家庭电气与计算机工程学院) Purdue University(普渡大学) Department of Horticulture and Landscape Architecture(园艺与景观建筑系) School of Biological Sciences(生物科学学院)

AI总结 提出EpiAwareNet框架,通过先验引导的多组学Transformer,结合基因-峰值交叉注意力模块和批量数据先验,从配对单细胞数据中重建基因调控网络。

详情
Comments
12 pages, 6 figures. Accepted to the KDD 2026 AI4Sciences Track
AI中文摘要

基因调控网络(GRN)捕捉转录因子-靶标相互作用,是理解细胞状态调控和疾病的核心。从配对的单细胞转录组和染色质可及性数据重建GRN具有前景但充满挑战:scATAC极其稀疏,且大多数方法依赖于固定的峰值-基因链接和弱监督。我们提出EpiAwareNet,一个先验引导的多组学Transformer框架,仅使用轻量级生物学先验从配对单细胞数据重建GRN。在第一阶段,EpiAwareNet通过基因-峰值交叉注意力模块学习联合基因-峰值表示,实现数据驱动的、基因特异性的可及性信号聚合,而非硬编码的峰值-基因分配。在第二阶段,EpiAwareNet引入批量数据衍生的GRN先验作为噪声正边,在标签稀缺情况下提供弱监督,同时保持对先验噪声的鲁棒性,细化调控分数。在我们的实验中,EpiAwareNet在GRN重建上优于代表性的单组学和多组学基线,并产生更具生物学合理性的GRN,例如改善已知调控相互作用的恢复,这表明当与自适应跨模态表示学习结合时,来自批量数据的轻量级生物学先验可以有效指导单细胞GRN推断。代码和数据将在https://github.com/tianyang-x/EpiAwareNet_pub提供。

英文摘要

Gene regulatory networks (GRNs) capture transcription factor-target interactions and are central to understanding cell-state regulation and disease. Reconstructing GRNs from paired single-cell transcriptomic and chromatin accessibility data is promising but challenging: scATAC is extremely sparse, and most methods rely on fixed peak-to-gene links and weak supervision. We present EpiAwareNet, a prior-guided multi-omic Transformer framework that reconstructs GRNs from paired single-cell data using only lightweight biological priors. In Stage 1, EpiAwareNet learns joint gene-peak representations with a gene-peak cross-attention module, enabling data-driven, gene-specific aggregation of accessibility signals rather than hard-coded peak-to-gene assignments. In Stage 2, EpiAwareNet incorporates a bulk-derived GRN prior as noisy positive edges to provide weak supervision under label scarcity, refining regulatory scores while remaining robust to prior noise. In our experiments, EpiAwareNet improves GRN reconstruction over representative single- and multi-omic baselines and yields GRNs with greater biological plausibility, such as improved recovery of known regulatory interactions, suggesting that lightweight biological priors from bulk data can effectively guide single-cell GRN inference when combined with adaptive cross-modal representation learning. Code and data will be available at https://github.com/tianyang-x/EpiAwareNet_pub.

2606.00683 2026-06-02 cs.CL

OCC-RAG: Optimal Cognitive Core for Faithful Question Answering

OCC-RAG:面向忠实问答的最优认知核心

Maksim Savkin, Mikhail Goncharov, Alexander Gambashidze, Alla Chepurova, Dmitrii Tarasov, Nikita Andriianov, Daria Pugacheva, Vasily Konovalov, Andrey Galichin, Ivan Oseledets

发表机构 * OCC Team(OCC团队)

AI总结 提出OCC-RAG,一种通过多上下文多跳合成数据训练的小语言模型,在忠实问答任务中匹配或超越2-6倍规模通用模型。

详情
AI中文摘要

近年来,语言模型的发展由规模定义,每一代模型都将更多世界知识吸收进其参数中。然而,许多实际应用更受益于稳健推理而非广泛的参数化知识。在此背景下,任务专用的小语言模型(SLM)提供了一种原则性的设计选择。我们提出最优认知核心(OCC),一个基于此前提构建的SLM家族。作为OCC的变体,我们提出OCC-RAG,针对基于给定上下文的忠实问答(QA)进行了优化。该任务与OCC设计方法直接对齐,需要在提供的段落上进行多跳推理,同时忽略记忆的知识。为训练OCC-RAG,我们实现了一种新颖的流水线,用于大规模合成多上下文、多跳QA数据,生成了一个包含超过三百万个样本的语料库,针对多跳推理、严格上下文忠实性和校准的弃权进行了优化。我们发布了OCC-RAG-0.6B和OCC-RAG-1.7B,两者均在此语料库上进行了中期训练。这些模型生成带有源引用的结构化推理轨迹,这些引用基于上下文中的逐字引用。通过OCC-RAG,我们证明了紧凑的任务专用SLM可以在多跳推理(HotpotQA、MuSiQue、TAT-QA)、忠实性(ConFiQA)和拒绝(MuSiQue-Un)基准测试中匹配或超越规模为其2-6倍的通用模型。

英文摘要

Recent progress in the development of language models has been defined by scale, with each generation absorbing more of the world's knowledge into its weights. However, many practical applications benefit more from robust reasoning than from extensive parametric knowledge. In this setting, task-specialized small language models (SLMs) offer a principled design choice. We introduce Optimal Cognitive Core (OCC), a family of SLMs built around this premise. As a variant of OCC, we present OCC-RAG, optimized for faithful question answering (QA) grounded in the provided context. This task directly aligns with the OCC design approach, requiring multi-hop reasoning over supplied passages while ignoring memorized knowledge. To train OCC-RAG, we implement a novel pipeline for synthesizing multi-context, multi-hop QA data at scale, producing a corpus of over three million examples targeting multi-hop reasoning, strict context faithfulness, and calibrated abstention. We release OCC-RAG-0.6B and OCC-RAG-1.7B, both mid-trained on this corpus. The models produce structured reasoning traces with source citations grounded in literal quotes from the context. Through OCC-RAG, we demonstrate that compact, task-specialized SLMs can match or exceed general-purpose models 2 -- 6x their size across multi-hop reasoning (HotpotQA, MuSiQue, TAT-QA), faithfulness (ConFiQA), and refusal (MuSiQue-Un) benchmarks.

2606.00677 2026-06-02 cs.LG

Limits of Resolution Equivariance in Fourier Neural Operators

傅里叶神经算子中的分辨率等变性极限

Alex Colagrande, Paul Caillon, Eva Feillet, Alexandre Allauzen

发表机构 * Miles Team, LAMSADE, Université Paris Dauphine-PSL(巴黎萨克雷大学巴黎-达菲学院LAMSADE团队) Université Paris-Saclay, CNRS, LISN(巴黎-萨克雷大学CNRS LISN) ESPCI PSL, Paris(巴黎ESPCI PSL)

AI总结 本文通过对比直接细网格推理与低网格加傅里叶零填充上采样两种策略,发现傅里叶神经算子并不总是能泛化到不同分辨率,并分析了其层间频谱特性,指出非线性混叠是零样本分辨率等变性的主要障碍。

详情
Comments
Published as a paper at AI&PDE: ICLR 2026 Workshop on AI and Partial Differential Equations. 6 pages, 2 figures
AI中文摘要

傅里叶神经算子通常被认为能够跨空间分辨率泛化,从而可以在粗网格上训练并在细网格上部署。我们通过对比从训练分辨率 $s$ 到测试分辨率 $S>s$ 时的两种推理选择来检验这一假设:直接在 $S$ 上运行 FNO,或者在 $s$ 上运行并通过傅里叶零填充将预测上采样到 $S$。在达西流问题上,我们观察到直接细网格推理并非总是有益的,甚至可能比低网格加上采样基线更差。我们进一步分析了层间频谱,发现在傅里叶截断下,中间表示的能量越来越集中在低频,而高频输出主要由后期的非线性/解码器阶段产生。这为 FNO 在保留少量模式时仍能表现良好,但对分辨率变化敏感的现象提供了机制性解释。我们的发现强调了一个简单但强大的跨分辨率评估基线,并指出非线性混叠是零样本分辨率等变性的关键障碍。

英文摘要

Fourier Neural Operators are often assumed to generalize across spatial resolutions, enabling training on a coarse grid and deployment on a finer grid. We test this assumption by contrasting two inference-time choices when moving from training resolution $s$ to test resolution $S>s$: running FNO directly at $S$, or running at $s$ and upsampling the prediction to $S$ via Fourier zero-padding. On Darcy flow, we observe that direct fine-grid inference is not reliably beneficial and can be worse than the low-grid-plus-upsampling baseline. We further analyze layerwise spectra and find that, under Fourier truncation, intermediate representations increasingly concentrate energy in low frequencies, with high-frequency output produced mainly by late nonlinear/decoder stages. This offers a mechanistic explanation for why FNO can perform well while retaining few modes, yet remain sensitive under resolution shifts. Our findings highlight a simple but strong baseline for cross-resolution evaluation and point to nonlinear aliasing as a key obstacle to zero-shot resolution equivariance.

2606.00676 2026-06-02 cs.CV

A Modelling and Evaluation Framework for EuroCrops-Driven Sentinel-2 Crop Segmentation

基于EuroCrops驱动的Sentinel-2作物分割的建模与评估框架

Alexandra Nicoleta Scarlat, Ioana Cristina Plajer, Alexandra Baicoianu

发表机构 * Transilvania University of Braşov(布拉索夫瓦拉米亚大学)

AI总结 提出一个可配置的流水线,利用EuroCrops标注和Sentinel-2影像生成语义分割数据集,并训练U-Net模型评估其在域内和域外数据集上的性能。

详情
AI中文摘要

本工作提出了一个可配置的流水线,用于从Sentinel-2影像和EuroCrops地块级标注生成适用于语义分割的农业数据集。该流程通过标签统一、Sentinel-2产品选择、空间对齐、栅格化、图块提取、质量过滤和类别感知样本选择,将异质的矢量作物标注转化为对齐的多光谱图像-掩码对。生成的数据集包含来自五个欧洲国家的67,337个图块,并使用简化的十种作物类别加上背景的分类法。 使用10个Sentinel-2光谱波段和组合损失(类别加权交叉熵和Dice损失)训练了一个带有组归一化的四层U-Net。在基于EuroCrops的内部测试集上,模型实现了平均交并比(mIoU)0.7665、像素准确率0.8693和平均类别准确率0.9072。与光谱和空间上下文随机森林基线相比,U-Net显示了学习多尺度空间表示对于作物分割的重要性。 在未见过的比利时EuroCrops子集、DACIA5和PASTIS上进行了外部评估。结果显示,在外部和跨数据集评估下存在明显的性能差距,尤其是对于具有不同分类法、标注协议、空间覆盖或时间组织的基准。模型更可靠地转移到分类法对齐的优势类别(如玉米和小麦),而对于几个少数类别以及适应后的单日期PASTIS设置,性能仍然有限。这些发现突出了在现实域偏移下使用EuroCrops衍生监督进行Sentinel-2作物分割的潜力和局限性。

英文摘要

This work presents a configurable pipeline for generating semantic-segmentation-ready agricultural datasets from Sentinel-2 imagery and EuroCrops parcel-level annotations. The workflow transforms heterogeneous vector crop annotations into aligned multispectral image--mask pairs through label harmonization, Sentinel-2 product selection, spatial alignment, rasterization, patch extraction, quality filtering, and class-aware sample selection. The generated dataset contains 67,337 patches from five European countries and uses a reduced taxonomy of ten crop classes plus background. A four-level U-Net with Group Normalization was trained using 10 Sentinel-2 spectral bands and a composite loss combining class-weighted cross-entropy and Dice loss. On the internal EuroCrops-based test split, the model achieved a mean Intersection over Union (mIoU) of 0.7665, a pixel accuracy of 0.8693, and a mean class accuracy of 0.9072. Compared with spectral and spatial-context Random Forest baselines, the U-Net showed the importance of learned multi-scale spatial representations for crop segmentation. External evaluation was performed on unseen Belgian EuroCrops subsets, DACIA5, and PASTIS. The results show a clear performance gap under external and cross-dataset evaluation, especially for benchmarks with different taxonomies, annotation protocols, spatial coverage, or temporal organization. The model transfers more reliably to dominant and taxonomically aligned classes such as maize and wheat, while performance remains limited for several minority classes and for the adapted single-date PASTIS setting. These findings highlight both the potential and the limitations of using EuroCrops-derived supervision for Sentinel-2 crop segmentation under realistic domain shifts.

2606.00674 2026-06-02 cs.LG cs.AI

The Paradox of Outcome Optimization: A Causal Information-Theoretic Bound on Reasoning Shortcuts in LLMs

结果优化的悖论:LLM中推理捷径的因果信息论界限

Zihan Chen, Yiming Zhang, Wenxiang Geng, Zenghui Ding, Yining Sun

发表机构 * HFIPS, Chinese Academy of Sciences(中国科学院HFIPS) University of Science and Technology of China(中国科学技术大学)

AI总结 针对基于结果强化学习的LLM在分布外任务中推理脆弱的问题,提出因果信息论框架解释奖励诱导的流形坍缩,并证明过程奖励模型作为拓扑滤波器可消除低复杂度捷径。

详情
AI中文摘要

通过基于结果的强化学习(RL)对齐的大型语言模型(LLM)经常表现出一种关键失败模式:它们在分布内基准测试上取得高性能,但在分布外(OOD)任务上推理能力脆弱。我们将这种现象称为奖励诱导的流形坍缩。我们建立了一个理论框架,将结构因果模型(SCM)和信息瓶颈(IB)原理联系起来,以解释这一悖论。我们将推理定义为高复杂度的因果过程,将捷径学习定义为利用低复杂度的虚假相关性。在随机梯度下降(SGD)的隐式归纳偏置下,只要训练分布允许对真实因果机制进行“马尔可夫筛选”,优化结果奖励的模型就会偏向于捷径解。我们基于语义覆盖度量($\eta$)而非样本量推导了一个新的泛化界限,说明了为什么在同质分布上扩展数据可能无法纠正推理缺陷。我们还表明,过程奖励模型(PRM)作为拓扑滤波器,通过强制执行逐步互信息约束,使得低复杂度的捷径流形不可行。这些结果为过程监督在简单信用分配之外的作用提供了数学基础。

英文摘要

Large Language Models (LLMs) aligned via outcome-based Reinforcement Learning (RL) frequently exhibit a critical failure mode: they achieve high performance on in-distribution benchmarks while demonstrating brittle reasoning capabilities on out-of-distribution (OOD) tasks. We term this phenomenon Reward-Induced Manifold Collapse. We establish a theoretical framework bridging Structural Causal Models (SCM) and the Information Bottleneck (IB) principle to explain this paradox. We define reasoning as a high-complexity causal process and shortcut learning as the exploitation of low-complexity spurious correlations. Under the implicit inductive bias of Stochastic Gradient Descent (SGD), models optimized for outcome rewards are biased toward shortcut solutions whenever the training distribution allows for a ``Markovian Screening'' of the true causal mechanism. We derive a new generalization bound based on Semantic Coverage Measure ($η$) rather than sample size, showing why data scaling on homogeneous distributions may fail to correct reasoning flaws. We also show that Process Reward Models (PRMs) function as Topological Filters, enforcing step-wise mutual information constraints that render the low-complexity shortcut manifold inadmissible. These results provide a mathematical grounding for the role of process supervision beyond simple credit assignment.

2606.00673 2026-06-02 cs.CV

T-CLIP: Enabling Thermal Perception for Contrastive Language-Image Pretraining

T-CLIP:面向对比语言-图像预训练的热感知

Tayeba Qazi, Ayush Maheshwari, Prerana Mukherjee, Brejesh Lall

发表机构 * Indian Institute of Technology Delhi, India(印度理工学院德里分校) NVIDIA AI Technology Center, India(NVIDIA AI技术中心) Jawaharlal Nehru University, India(贾瓦哈拉尔·尼赫鲁大学)

AI总结 针对CLIP无法对齐热图像与文本描述的问题,提出物理感知的热描述数据集IR-Cap和解耦双LoRA框架T-CLIP,实现场景级和对象级热理解,在跨模态检索任务上超越所有基线。

详情
Comments
34pages (including references and appendix), 13 figures
AI中文摘要

热成像在低光照和恶劣天气等挑战性条件下提供了可见光谱视觉的强大替代方案,然而像CLIP这样的基础视觉-语言模型由于根本性的热感知差距,无法将热图像与文本描述对齐。我们识别出三个主要挑战:缺乏带标题的热数据集、标准LLM无法推理热现象,以及热成像中的一个关键表示挑战——全局场景上下文和对象级热信号在单个嵌入空间中同时学习时会产生冲突。为了解决这些问题,我们引入了IR-Cap,这是第一个物理感知的热标题生成管道和数据集,在三个公开基准上提供互补的全局和细粒度热描述;以及T-CLIP,一个解耦的双LoRA框架,独立地适配CLIP用于场景级和对象级热理解。T-CLIP在三个热基准的跨模态检索中相对于所有基线取得了一致的改进,并且我们初步展示了其在文本条件热图像生成中的适用性。

英文摘要

Thermal imaging offers a powerful alternative to visible-spectrum vision under challenging conditions such as low illumination and adverse weather, yet foundational vision-language models like CLIP fail to align thermal images with textual descriptions due to a fundamental thermal perception gap. We identify three major challenges: the lack of captioned thermal datasets, the inability of standard LLMs to reason about thermal phenomena, and a key representational challenge in thermal imaging where global scene context and object-level heat signatures conflict when learned together in a single embedding space. To address these, we introduce IR-Cap, the first physics-aware thermal captioning pipeline and dataset providing complementary global and fine-grained thermal descriptions across three public benchmarks, and T-CLIP, a decoupled dual-LoRA framework that independently adapts CLIP for scene-level and object-level thermal understanding. T-CLIP achieves consistent improvements over all baselines across three thermal benchmarks in cross-modal retrieval, and we provide an exploratory demonstration of its applicability to text-conditioned thermal image generation.

2606.00672 2026-06-02 cs.AI cs.LG

Medication-Aware Financial Exploitation Detection for Alzheimer's Patients Using Edge-Aware Interaction Risk Modeling

基于边缘感知交互风险建模的阿尔茨海默病患者药物感知金融剥削检测

Farzana Akter, Lisan Al Amin, Rakib Hossain, Chaitanya Gunupudi, Faisal Quader

发表机构 * Cognitive Links LLC University of Maryland, College Park(马里兰大学学院公园分校)

AI总结 提出一种药物感知框架,通过同步药物依从性与交易监控,利用交互感知逻辑模型提升对认知风险金融事件的检测,尤其在药物脆弱窗口期召回率从0.7442提升至0.9070。

详情
AI中文摘要

金融剥削对阿尔茨海默病患者日益构成威胁,尤其是在认知稳定性下降期间。传统欺诈检测系统通常仅依赖金融行为,忽略可能改变脆弱性的临床相关因素。本文提出一种药物感知框架,将药物依从性与交易级监控同步,以改进对认知风险金融事件的检测。构建了180名患者45天的混合模拟数据集,产生8,100条药物记录和30,855笔交易。该框架通过纯金融、加性药物感知和交互感知逻辑模型评估金额异常、商家新颖性、交易频率、时间偏差和药物依从性。结果表明,纯金融基线获得了最高的全局F1分数0.5000,但交互感知模型在药物诱导脆弱窗口期内将召回率从0.7442提升至0.9070,并在排名高风险案例中实现了最高平均精度。研究结果表明,药物依从性作为金融风险的上下文修饰因子比作为孤立预测因子更有用。

英文摘要

Financial exploitation is a growing concern for people with Alzheimer's disease, especially during periods of reduced cognitive stability. Conventional fraud detection systems usually rely on financial behavior alone and ignore clinically relevant factors that may alter vulnerability. This paper proposes a medication-aware framework that synchronizes medication adherence with transaction-level monitoring to improve detection of cognitively risky financial events. A hybrid simulation dataset was constructed for 180 patients across 45 days, producing 8,100 medication records and 30,855 transactions. The framework evaluates amount anomaly, vendor novelty, transaction frequency, time deviation, and medication adherence through financial-only, additive medication-aware, and interaction-aware logistic models. Results show that the financial-only baseline obtained the highest global F1-score of 0.5000, but the interaction-aware model improved recall during medication-induced vulnerability windows from 0.7442 to 0.9070 and achieved the highest average precision for ranked high-risk cases. The findings suggest that medication adherence is most useful as a contextual modifier of financial risk rather than as an isolated predictor.

2606.00671 2026-06-02 cs.AI cs.CL cs.LG

AXIOM: A Trust-First Neuro-Symbolic Execution Architecture for Verifiable Mathematical Reasoning

AXIOM: 一种用于可验证数学推理的信任优先神经符号执行架构

Alessio Bruno

发表机构 * Independent researcher(独立研究者)

AI总结 提出AXIOM架构,将语言模型限制为规范化器,通过确定性计算机代数系统管道实现可验证的数学推理,在4个MATH类别上达到94.36%的正确率和100%的信任度。

详情
Comments
Preprint. 12 pages, 2 figures. Live interactive demo: https://huggingface.co/spaces/Squagghy/axiom-solver. Paper artifact and dataset on Zenodo (concept-DOI): 10.5281/zenodo.20440225
AI中文摘要

我们提出AXIOM,一种用于自然语言数学推理的信任优先神经符号执行架构。在AXIOM中,语言模型严格作为规范化器:它将非正式问题文本重写为狭窄的模式,由确定性计算机代数系统(CAS)管道消费,该管道推导并验证答案,或作为第一类输出弃权。路由遵循问题形状正则表达式、特定模式提示和封闭形式CAS处理器之间的1:1:1对齐,已交付3100多条这样的路由,并在250多个连续提交中零LOST_CORRECT回归。我们在4个MATH类别上报告了实证结果,累积正确率为94.36%(2,592/2,747),可解析问题的信任度为100.00%(在整个2,747条记录基准测试中零自信错误答案),所有四个领域均高于每个领域70/90/70的阈值,每个领域信任度为100.0%,仅规则处理器的中位延迟为1毫秒(在lm-eval算术20,000条记录基准测试中占88%的记录)。该架构通过公共部署已服务约30,000次生产查询。我们强调的贡献不是最终的准确率数字,而是该架构建立的向前动态:生产中的每个记录弃权在一次发布周期后都是候选正确,因为新任务在不回归注册表的情况下组合。支撑这一特性的操作纪律——数学模板分桶、LOST_CORRECT扫描作为回归预言机、可解析优先接入以及弃权作为第一类输出——构成了一个可迁移的框架,适用于数学之外的值得信赖的神经符号系统。

英文摘要

We present AXIOM, a trust-first neuro-symbolic execution architecture for natural-language mathematical reasoning. In AXIOM, the language model functions strictly as a canonicalizer: it rewrites informal problem text into a narrow schema consumed by a deterministic Computer-Algebra-System (CAS) pipeline, which derives and verifies the answer or abstains as a first-class output. Routing follows a 1:1:1 alignment between problem-shape regex, schema-specific prompt, and closed-form CAS handler, with 3,100+ such routes shipped and zero LOST_CORRECT regressions across 250+ consecutive ship commits. We report empirical results on 4 MATH categories with a cumulative correctness of 94.36% (2,592/2,747) at 100.00% trust on parseable (zero confident-wrong answers across the full 2,747-record benchmark), all four domains above the per-domain 70/90/70 floor with per-domain trust at 100.0%, and median latency of 1 ms on rule-only handlers (88% of records on the lm-eval arithmetic 20,000-record benchmark). The architecture has served ~30,000 production queries through a public deployment. The contribution we emphasize is not a final accuracy figure but the forward dynamic the architecture establishes: every logged abstain in production is a candidate correct after one ship cycle, since new tasks compose without regressing the registry. The operational discipline behind this property -- math-template bucketing, LOST_CORRECT scan as regression oracle, parseable-first onboarding, and abstain as first-class output -- constitutes a transferable framework for trustworthy neuro-symbolic systems beyond mathematics.

2606.00670 2026-06-02 cs.SD cs.AI

Beyond the Mouth: Upper-Face Affective Cues in Audiovisual Sentence Recognition under Acoustic Uncertainty

超越口部:声学不确定性下视听句子识别中的上半脸情感线索

Zhou Yang, Yueyi Yang

发表机构 * Faculty of Education and Psychology, University of Oulu, Finland(奥卢大学教育与心理学学院,芬兰) Center for Machine Vision and Signal Analysis, University of Oulu, Finland(奥卢大学机器视觉与信号分析中心,芬兰)

AI总结 本研究利用CREMA-D语料库,通过特征分类器探究在声学退化条件下,上半脸情感信息是否有助于视听句子识别,发现上半脸情感线索能提升模型校准和鲁棒性。

详情
AI中文摘要

面对面言语理解本质上是多模态的,整合了声学信号与可见的发音、面部表情、头部运动及其他社交相关线索。虽然视听言语系统通常将口部区域作为语言信息的主要视觉来源,但情感面部表情常被单独视为情感识别目标。本文研究在声学退化条件下,上半脸情感信息是否有助于视听句子识别,超越音频和口部区域线索。使用CREMA-D视听情感言语语料库,我们在四种线索条件下训练基于特征的句子分类器:仅音频(A)、音频加口部/下半脸特征(A+M)、音频加上半脸特征(A+U)以及音频加口部和上半脸特征(A+M+U)。模型在干净音频和粉红噪声条件下(+10 dB、+5 dB和0 dB SNR)进行评估,采用演员独立划分。结果表明,在退化音频下,口部/下半脸特征提供了显著的鲁棒性优势。在0 dB SNR下,A+M相比A准确率提升0.0794,演员自举95%置信区间为[0.0296, 0.1298]。上半脸情感线索表现出更微妙的效果。尽管A+M+U相比A+M的直接准确率增益很小,但全脸模型在不同SNR水平上持续改善校准,并且在噪声条件下优于打乱的上半脸对照。这些发现表明,情感面部信息可能支持声学不确定性下的多模态鲁棒性和置信度估计,而不直接编码词汇内容。更广泛地说,该研究强调了社交表达性面部线索在以人为中心的视听交互系统中的潜在作用。

英文摘要

Face-to-face speech comprehension is inherently multimodal, integrating acoustic signals with visible articulation, facial expression, head motion, and other socially relevant cues. While audiovisual speech systems typically focus on the mouth region as the primary visual source of linguistic information, affective facial expressions are often treated separately as emotion-recognition targets. This paper investigates whether upper-face affective information contributes to audiovisual sentence recognition beyond audio and mouth-region cues, particularly under acoustic degradation. Using the CREMA-D audiovisual emotional speech corpus, we train feature-based sentence classifiers under four cue conditions: audio only (A), audio plus mouth/lower-face features (A+M), audio plus upper-face features (A+U), and audio plus both mouth and upper-face features (A+M+U). Models are evaluated on clean audio and pink-noise conditions at +10 dB, +5 dB, and 0 dB SNR using actor-independent splits. Results show that mouth/lower-face features provide substantial robustness benefits under degraded audio. At 0 dB SNR, A+M improves accuracy over A by 0.0794, with an actor-bootstrap 95% confidence interval of [0.0296, 0.1298]. Upper-face affective cues exhibit a more nuanced effect. Although the direct accuracy gain of A+M+U over A+M is small, full-face models consistently improve calibration across SNR levels and outperform shuffled upper-face controls under noisy conditions. These findings suggest that affective facial information may support multimodal robustness and confidence estimation under acoustic uncertainty without directly encoding lexical content. More broadly, the study highlights the potential role of socially expressive facial cues in human-centered audiovisual interaction systems.

2606.00664 2026-06-02 cs.RO cs.CV

SKIP: Sparse Keyframe Interpolation Paradigm for Efficient Embodied World Models

SKIP: 用于高效具身世界模型的稀疏关键帧插值范式

Ziheng He, Yixiang Chen, Ning Yang, Zhanqian Wu, Qisen Ma, Yuan Xu, Jiabing Yang, Peiyan Li, Xiangnan Wu, Xiaofeng Wang, Zheng Zhu, Jing Liu, Nianfeng Liu, Yan Huang

发表机构 * UCAS(中国科学院自动化研究所) CASIA(中国科学院自动化研究所) NJU(南京大学) GigaAI THU(清华大学) FiveAges

AI总结 提出稀疏关键帧插值范式(SKIP),通过识别任务相关关键帧并仅生成这些帧,再基于机器人动作插值缺失帧,实现高效视频生成,在LIBERO上速度提升4.16倍,FVD降低89%,且生成视频作为训练数据时策略性能下降极小。

详情
Comments
25 pages, 10 figures
AI中文摘要

具身世界模型通过预测机器人动作如何影响周围场景,已成为机器人学中一种有前景的范式。然而,在像素空间中进行 rollout 推理在计算上仍然昂贵,因为长时程操作视频通常必须逐帧生成。这种成本不能通过不加区分地丢弃帧来轻易降低,因为下游策略依赖于对稀疏任务相关事件(如接近、接触、抓取和释放)的完整保留。为了解决这一挑战,我们提出了稀疏关键帧插值范式(SKIP),这是一种事件保留的稀疏到密集框架,避免了密集的逐帧生成。SKIP 首先通过利用机器人感知的多模态特征来识别任务相关的关键帧。然后,它仅用稀疏视频扩散模型合成这些关键帧。一个学习到的间隙预测器和一个动作条件插值器随后根据机器人动作重建缺失的间隔。在 LIBERO 上,SKIP 生成密集 rollouts 的速度比密集基线快 4.16 倍,同时提高了视觉保真度并将聚合 FVD 降低了 89.0%。重要的是,SKIP 生成的视频是有效的策略训练数据。即使它们完全替代真实演示,π_{0.5} 的成功率在 LIBERO 模拟中仅下降 1.3 个百分点,在真实机器人上下降 6.7 个百分点,而完全密集的逐帧生成则下降 48 到 58 个百分点。

英文摘要

Embodied world models have emerged as a promising paradigm in robotics by predicting how robot actions affect the surrounding scene. However, the rollout inference remains computationally expensive in pixel space, as long-horizon manipulation videos typically have to be generated frame by frame. This cost cannot be easily reduced by indiscriminately dropping frames, since downstream policies rely on complete preservation of sparse task-relevant events such as approach, contact, grasp, and release. To address this challenge, we propose Sparse Keyframe Interpolation Paradigm (SKIP), an event-preserving sparse-to-dense framework that avoids dense frame-by-frame generation. SKIP first identifies task-relevant keyframes by leveraging robot-aware multimodal features. It then synthesizes only these keyframes with a sparse video diffusion model. A learned gap predictor and an action-conditioned interpolator subsequently reconstruct the missing intervals according to the robot actions. On LIBERO, SKIP generates dense rollouts $4.16\times$ faster than a dense baseline while improving visual fidelity and reducing aggregate FVD by $89.0\%$. Importantly, SKIP-generated videos are effective policy-training data. Even when they fully replace real demonstrations, $π_{0.5}$ success drops only $1.3$ pp in LIBERO simulation and $6.7$ pp on the real robot, whereas fully dense frame-by-frame generation collapses by $48$ to $58$ pp.

2606.00662 2026-06-02 cs.CV

TAP-JEPA: Frozen Future-Latent Probing and Two-Stage Score Fusion for EPIC-KITCHENS-100 Action Anticipation

TAP-JEPA:冻结的未来潜在探测与两阶段分数融合用于EPIC-KITCHENS-100动作预测

Chaoyang Wang, Lexuan Xu

发表机构 * Beihang University(北航大学)

AI总结 提出TAP-JEPA方法,利用冻结的V-JEPA 2.1特征和两阶段分数融合,在EPIC-KITCHENS-100动作预测挑战中获得第二名。

详情
Comments
The runner-up solution for the Action Anticipation Challenge, EPIC-KITCHENS-100 at the CVPR EgoVis Workshop 2026
AI中文摘要

本报告介绍了TAP-JEPA,我们在EgoVis 2026的EPIC-KITCHENS-100(EK-100)动作预测挑战中获得亚军的提交方案。该任务是从目标动作开始前结束的自我中心视频片段中预测下一个动词、名词以及动词-名词动作。TAP-JEPA没有微调大型视频骨干网络,而是在冻结的V-JEPA 2.1特征上构建了一个紧凑的预测模型:ViT-G/384编码器提取可见的动作前令牌,预训练的潜在预测器从观察到的上下文估计近未来的令牌,两组令牌通过带有动词、名词和动作对特定查询的注意力探针进行融合。在最终提交中,我们使用官方训练集和大部分验证集扩展了监督训练,保留了一小部分用于合理性检查和定性观察,并采用了两阶段分数融合:首先在每个epoch内平均八个独立初始化的探针副本,然后合并epoch 12-20的候选结果,并应用依赖于类别的权重。在官方开放测试排行榜上,我们的sunshinesky条目达到了27.91%的整体动作平均Top-5召回率(MT5R),排名第二,仅比最高分低0.04个百分点。

英文摘要

This report presents TAP-JEPA, our runner-up submission to the EPIC-KITCHENS-100 (EK-100) Action Anticipation Challenge at EgoVis 2026. The task is to anticipate the next verb, noun, and verb-noun action from an egocentric clip that ends before the target action begins. Instead of fine-tuning a large video backbone, TAP-JEPA builds a compact anticipation model on frozen V-JEPA 2.1 features: a ViT-G/384 encoder extracts visible pre-action tokens, the pre-trained latent predictor estimates near-future tokens from the observed context, and both token groups are fused by attentive probes with task-specific queries for verbs, nouns, and action pairs. For the final submission, we expand supervised training with the official training split and most of the validation split, reserving a small subset for sanity checks and qualitative inspection, and adopt a two-stage score fusion that first averages eight independently initialized probe replicas within each epoch and then merges candidates from epochs 12-20 with field-dependent weights. On the official open-testing leaderboard, our sunshinesky entry achieves 27.91 percent overall action Mean Top-5 Recall (MT5R), ranking second and only 0.04 percentage points behind the top score.

2606.00658 2026-06-02 cs.CV cs.AI

Collaborative Few-Step Distillation and Low-Bit Quantization for Wan2.2 Dual-Expert Video Diffusion Models

Wan2.2双专家视频扩散模型的协同少步蒸馏与低位量化

Jinyang Du, Shenghao Jin, Ziqian Xu, Ruihao Gong, Shiqiao Gu, Yang Yong, Jinyang Guo, Xianglong Liu

发表机构 * IEEE ICME 2026 GCC Low-Bit-width Large Model Quantization Challenge(GCC 低精度大模型量化挑战)

AI总结 针对Wan2.2-T2V-A14B视频扩散模型,提出结合少步分布匹配蒸馏与低位量化的部署压缩流程,通过双专家去噪分支校准、敏感层保护及HiF4低位表示,在保持质量的同时降低计算开销。

详情
AI中文摘要

大型视频扩散模型实现了强大的视觉质量,但由于每个样本需要大量去噪步骤和较大的驻留参数足迹,部署成本仍然很高。本文研究了一种面向部署的压缩流程,针对Wan2.2-T2V-A14B模型,结合少步分布匹配蒸馏与低位量化。该流程遵循模型的双专家去噪路线,分别校准高噪声和低噪声分支,保护敏感入口层,并使用HiF4风格的低位表示以改善动态范围覆盖。量化是在蒸馏后的少步学生模型上校准,而非原始的长步轨迹上,从而减少推理过程中的激活分布不匹配。所提出的协同设计使量化模型保持接近同步全精度模型,并在平均8步和20步时超越原始全精度基线。在测试配置中,20步设置提供了最佳的质量-效率权衡。

英文摘要

Large video diffusion models achieve strong visual quality but remain expensive to deploy because each sample requires many denoising steps and a large resident parameter footprint. This paper studies a deployment-oriented compression pipeline for Wan2.2-T2V-A14B by combining few-step distribution-matching distillation with low-bit quantization. The pipeline follows the model's dual-expert denoising route, calibrates the high-noise and low-noise branches separately, protects sensitive entrance layers, and uses HiF4-style low-bit representation to improve dynamic-range coverage. Quantization is calibrated on the distilled few-step student rather than on the original long-step trajectory, reducing activation-distribution mismatch during inference. The proposed co-design keeps the quantized model close to the same-step full-precision model and surpasses the original full-precision baseline at 8 and 20 steps on average. The 20-step setting gives the best quality-efficiency trade-off in the tested configurations.

2606.00651 2026-06-02 cs.LG cs.AI cs.CL

MESA: Improving MoE Safety Alignment via Decentralized Expertise

MESA: 通过去中心化专家提升MoE安全对齐

Yitong Sun, Yao Huang, Teng Li, Ranjie Duan, Yichi Zhang, Xingjun Ma, Hui Xue, Xingxing Wei

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 针对MoE架构中安全能力集中于少数专家导致的脆弱性,提出MESA框架,通过最优传输理论实现专家安全职责去中心化分配与路由细化,在保持实用性的同时提升防御性能。

详情
Comments
18 pages, 8 figures, accepted by ICML 2026
AI中文摘要

混合专家(MoE)架构高效扩展大型语言模型(LLM),通过动态路由将输入分配给相关专家,以降低计算成本的同时增强容量,但引入了一个关键漏洞:安全稀疏性,即安全能力集中在少数专家中,使其容易受到对抗性绕过。同时,传统的对齐方法统一调整所有参数,忽略了它们的功能差异,并无意中降低了性能。为了解决这些挑战,我们提出了MESA(MoE安全对齐),一个针对基于MoE的LLM的定向对齐框架,策略性地去中心化安全责任以最大化覆盖范围,同时最小化对实用性的干扰。基于最优传输(OT)理论,MESA通过两种机制运作:(1)专家容量重新分配使用传输成本矩阵将安全职责分配给最具成本效益的专家,以及(2)动态路由细化约束路由器精确激活这些去中心化模块。实验表明,MESA在保持有用性的同时,对各种有害基准实现了稳健的防御性能。代码可在https://github.com/lorraine021/MESA获取。

英文摘要

Mixture-of-Experts (MoE) architectures scale Large Language Models (LLMs) efficiently, enabling greater capacity with reduced computational cost by dynamically routing inputs to relevant experts, yet introduce a critical vulnerability: Safety Sparsity, where safety capabilities concentrate in few experts, making them susceptible to adversarial bypassing. Meanwhile, conventional alignment methods uniformly adapt all parameters, ignoring their functional differences and inadvertently degrading performances. To address these challenges, we propose MESA (MoE Safety Alignment), a targeted alignment framework for MoE-based LLMs that strategically decentralizes safety responsibility to maximize coverage while minimizing interference with utility. Based on Optimal Transport (OT) theory, MESA operates through two mechanisms: (1) Expert Capacity Reallocation uses a transport cost matrix to distribute safety duties to the most cost-effective experts, and (2) Dynamic Routing Refinement constrains the router to precisely activate these decentralized modules. Experiments show that MESA achieves robust defensive performance against varied harmful benchmarks while preserving helpfulness. Code is available at https://github.com/lorraine021/MESA.

2606.00647 2026-06-02 cs.CL cs.AI

LinguIUTics at PsyDefDetect: Iterative Imbalance-Aware Fine-tuning of Qwen3-8B for Psychological Defense Mechanism Classification

LinguIUTics 在 PsyDefDetect 中的研究:用于心理防御机制分类的迭代不平衡感知微调 Qwen3-8B

Shefayat E Shams Adib, Ahmed Alfey Sani, Md Hasibur Rahman Alif, Ajwad Abrar

发表机构 * Department of Computer Science and Engineering, Islamic University of Technology, Dhaka, Bangladesh(计算机科学与工程系,伊斯兰技术大学,达卡,孟加拉国)

AI总结 针对对话文本中心理防御机制检测的类别不平衡问题,提出基于 QLoRA 微调 Qwen3-8B 的迭代不平衡感知方法,通过分组分层交叉验证、少数类轮询词汇增强和后处理流水线,在 PsyDefDetect 2026 共享任务中达到宏 F1 0.3917,排名第4。

详情
Comments
Accepted at PsyDefDetect, a shared task at the 25th BioNLP Workshop (BioNLP 2026), co-located with ACL 2026 in San Diego, CA, USA
AI中文摘要

检测对话文本中的心理防御机制仍然是一个具有挑战性的临床自然语言处理问题。针对 PsyDefDetect 2026 共享任务(九类话语分类,通过宏 F1 评估),我们的团队 LinguIUTics 在官方正类排行榜上取得了 0.3917 的宏 F1 分数,在 21 个注册团队中排名第 4,比 Ministral-8B 任务基线(宏 F1 0.3148)提高了 7.7 个绝对点(相对提升 24.4%)。由于严重的类别不平衡,BERT 系列编码器和零样本 LLM 在稀有类别上被证明无效,因此我们转向对 Qwen3-8B 进行 QLoRA 微调。我们利用三个关键策略:分组分层交叉验证(防止泄漏)、少数类轮询词汇增强,以及包含 logit 偏置调整和集成混合的后处理流水线。这些组件共同缩小了验证集与排行榜之间的差距,并显著提高了少数类的召回率,将关键的“Unclear”类别(第8级)从接近零的性能提升到 F1 分数 0.797。

英文摘要

Detecting psychological defense mechanisms in conversational text remains a challenging clinical NLP problem. For the PsyDefDetect 2026 shared task (nine-class utterance classification evaluated via macro F1), our team LinguIUTics achieves a macro F1-score of 0.3917 on the official positive-class leaderboard, ranking 4th out of 21 registered teams and improving over the Ministral-8B task baseline (31.48 macro F1) by 7.7 absolute points (24.4 percent relative). BERT-family encoders and zero-shot LLMs proved ineffective on rare classes due to severe class imbalance, leading us to QLoRA fine-tuning of Qwen3-8B. We leverage three key strategies: grouped stratified cross-validation (preventing leakage), minority-class round-robin lexical augmentation, and a post-processing pipeline with logit bias tuning and ensemble blending. Together, these components close much of the validation-to-leaderboard gap and substantially improve minority-class recall, driving the critical "Unclear" class (Level 8) from near-zero performance to an F1 score of 0.797.

2606.00642 2026-06-02 cs.AI cs.CR

Hidden Thoughts Are Not Secret: Reasoning Trace Exposure in LLMs

隐藏的思考并非秘密:大型语言模型中的推理痕迹暴露

Yu-An Lu, Ci-Yang Tsai, Yu-Lin Tsai, Raluca Ada Popa, Chia-Mu Yu

发表机构 * National Yang Ming Chiao Tung University(国家阳明交通大学) UC Berkeley(伯克利大学)

AI总结 本文提出推理暴露提示(REP)方法,通过影子模型生成的示范以辅助代码格式包装,从受害者模型中引出用户可见的推理痕迹,显著提高暴露痕迹与内部痕迹的相似性并保留有用推理信号。

详情
AI中文摘要

推理痕迹已成为改进和转移大型语言模型能力的有价值学习信号。特别是,详细痕迹有助于将推理行为从更强的教师模型蒸馏到较弱的学生模型。能力转移的价值促使许多部署了推理模型的系统隐藏原始内部痕迹,最多向用户暴露摘要和答案。因此,我们提出这样的问题:这种接口级别的痕迹隐藏是否能防止用户通过提示获得有用的推理监督?我们通过推理暴露提示(REP)研究这个问题,这是一种轻量级的上下文引出方法,使用影子模型生成的示范以辅助代码格式包装,从受害者模型中引出用户可见的推理痕迹。在常见的推理数据集、不同的受害者模型和不同的学生模型蒸馏中,REP显著提高了暴露痕迹与REP条件内部痕迹之间的相似性,同时保留了有用的推理信号。

英文摘要

Reasoning traces have become a valuable form of learning signals for improving and transferring the capabilities of large language models. In particular, detailed traces can help distill reasoning behavior from stronger teacher models into weaker student models. The value of capability transfer has motivated many deployed systems with reasoning models to hide raw internal traces and expose at most summaries and answers to users. As a result, we ask whether such interface-level trace hiding prevents users from obtaining useful reasoning supervision through prompting. We study this question with Reasoning Exposure Prompting (REP), a lightweight in-context elicitation method that uses shadow-model-generated demonstrations wrapped in auxiliary code-like formats to raise user-visible reasoning traces from a victim model. Across the common reasoning dataset, different victim models, and different student model distillation, REP substantially increases similarity between exposed and REP-conditioned internal traces while preserving useful reasoning signals.

2606.00640 2026-06-02 cs.CV

An Attribute-Based Measure of Video Complexity

基于属性的视频复杂度度量

Aditya Sarkar, Yi Li, Zihao Wang, Jiacheng Cheng, Sai Vidyaranya Nuthalapati, Aashu Singh, Shlok Kumar Mishra, David Jacobs, Nuno Vasconcelos

发表机构 * UMIACS-University of Maryland College Park(马里兰大学College Park分校UMIACS) University of California San Diego(加州大学圣地亚哥分校) Yale University(耶鲁大学) Meta AI

AI总结 提出VideoABC框架,通过属性空间量化估计视频-问题对在视频大语言模型上的失败概率,实现非参数复杂度度量。

详情
AI中文摘要

提出了一种新的框架,用于估计视频-问题对给视频大语言模型带来的复杂度,即基于属性的视频复杂度(VideoABC)。视频复杂度定义为视频大语言模型在给定视频-问题对上的失败概率。VideoABC是一种非参数复杂度度量,使用参考视频数据集和预定义的视频属性词汇表(这些属性对复杂度有信息量,例如场景复杂度或与问题相关的视频事件速度)。在训练阶段,参考视频被投影到这些属性空间中,然后进行量化。计算每个量化单元的期望ABC。给定一个新视频及其在属性空间中的投影,通过关联量化单元的期望ABC来估计复杂度。为了能够使用小规模参考视频数据集,结合了两种量化器:k-means量化器(能对参考数据集分布内的样本进行准确复杂度估计)和通用格点量化器(保证对分布外样本的泛化)。受心理物理学研究中目标-干扰物操纵的启发,提出了一种合成视频生成程序,用于在训练期间填充格点量化器的单元,从而计算其期望ABC。实验结果表明,即使使用非常低维的属性表示,VideoABC也有效,其性能大大优于“视频大语言模型作为评判者”等方法,且复杂度更低。最后,VideoABC分数在定义良好的属性方面的可解释性,揭示了基准测试的属性组成如何影响其复杂度。

英文摘要

A new framework for the estimation of the complexity posed by video-question pairs to video-LLMs, Video Attribute-Based Complexity (VideoABC), is proposed. Video complexity is defined as the probability of failure of a video-LLM for a given video-question pair. VideoABC is a non-parametric complexity measure, using a reference video dataset and a pre-defined vocabulary of video attributes informative of complexity, \eg the scene complexity or the speed of the video event informative of the question. In a training phase, reference videos are projected into the space of these attributes, which is then quantized. The expected ABC of each quantization cell is then computed. Given a new video and its projection into the attribute space, complexity is estimated by the expected ABC of the associated quantization cell. To enable the use of VideoABC with small reference video datasets, two quantizers are combined: a k-means quantizer that enables accurate complexity estimates for samples in the distribution of the reference dataset and a universal lattice quantizer that guarantees generalization to out-of-distribution samples. A synthetic video generation procedure, inspired by target-distractor manipulations of psychophysics studies, is proposed to populate the cells of the lattice quantizer during training, enabling the computation of their expected ABCs. Experimental results show that VideoABCis effective even with very low-dimensional attribute representations, substantially outperforming approaches like `video-LLM as judge' with much less complexity. Finally, the explainable nature of the VideoABC score, in terms of well-defined attributes, is shown to provide insights on how the attribute composition of benchmarks affects their complexity.

2606.00637 2026-06-02 cs.RO

Global-Local Attention Decomposition for Terrain Encoding in Humanoid Perceptive Locomotion

全局-局部注意力分解用于人形感知运动中的地形编码

Shengcheng Fu, Yang Zhang, Zhanxiang Cao, Liyun Yan, Yizhi Chen, Yunpeng Yin, Yue Gao

发表机构 * Tongji University(同济大学) Shanghai Innovation Institute(上海创新研究院) Shanghai Jiao Tong University(上海交通大学) Humanoid Robot (Shanghai) Co., Ltd.(人形机器人(上海)有限公司)

AI总结 提出全局-局部注意力分解(GLAD)方法,通过粗到细编码器分离全局地形感知和局部立足点选择,实现人形机器人在稀疏立足点和受限环境中的鲁棒运动。

详情
AI中文摘要

尽管强化学习显著推进了人形运动,感知策略在稀疏立足点地形和受限环境中仍然存在困难。在这些场景中成功需要广泛的地形感知和精确的立足点选择,而传统编码器常常纠缠这两种感知角色。为了解决这一挑战,我们提出了用于人形运动地形编码的全局-局部注意力分解(GLAD)。通过基于机器人中心高程图的粗到细编码器实现,GLAD明确分离了这些目标:全局注意力分支利用注意力池化总结周围地形上下文,而状态条件局部注意力分支稀疏化并编码精确的立足点相关几何。这种显式注意力分解防止了细粒度空间线索的稀释,同时减少了训练开销。实验表明,GLAD能够在具有挑战性的间隙、踏脚石和楼梯上实现可靠运动。此外,学习到的策略表现出涌现的地形响应行为,在简单速度指令下自主跟随狭窄路径并避开障碍物,无需显式导航规划器。在搭载机载LiDAR的Unitree G1人形机器人上的实际部署中,所提方法在多种稀疏立足点和障碍物丰富领域实现了鲁棒的零样本仿真到现实迁移。

英文摘要

Although reinforcement learning has significantly advanced humanoid locomotion, perceptive policies still struggle on sparse-foothold terrain and constrained environments. Success in these scenarios requires both broad terrain awareness and precise foothold selection, two perceptual roles that conventional encoders often entangle. To address this challenge, we propose Global-Local Attention Decomposition (GLAD) for terrain encoding in humanoid locomotion. Realized by a coarse-to-fine encoder over a robot-centric elevation map, GLAD explicitly separates these objectives: a global attention branch utilizes attention pooling to summarize the surrounding terrain context, while a state-conditioned local attention branch sparsifies and encodes precise foothold-relevant geometry. This explicit attention decomposition prevents the dilution of fine-grained spatial cues while reducing training overhead. Experiments demonstrate that GLAD enables reliable locomotion over challenging gaps, stepping stones, and stairs. Furthermore, the learned policy exhibits emergent terrain-responsive behaviors, autonomously following narrow paths and avoiding obstacles under simple velocity commands without explicit navigation planners. In real-world deployment on a Unitree G1 humanoid robot using onboard LiDAR, the proposed method achieves robust zero-shot sim-to-real transfer across diverse sparse-foothold and obstacle-rich domains.

2606.00635 2026-06-02 cs.LG

How Neural Losses Shape VAE Latents

神经损失如何塑造VAE潜在变量

Giorgio Strano, Luca Cerovaz, Michele Mancusi, Tommaso Mencattini, Emanuele Rodolà

发表机构 * Sapienza University of Rome(罗马大学萨皮恩扎分校) Paradigma, Inc.(Paradigma公司) Moises Systems, Inc.(Moises系统公司) EPFL(苏黎世联邦理工学院)

AI总结 本文研究感知损失和对抗损失等神经重建损失如何改变VAE的率失真问题,证明其减少潜在表示信息量并改变潜在空间几何结构,使表示更各向同性且不确定性分布更均匀。

详情
AI中文摘要

现代VAE很少使用标准$β$-VAE目标隐含的点态似然进行训练。在实践中,尽管缺乏对如何改变模型潜在动态的理解,点态重建常与感知损失和对抗损失结合。我们表明,重建损失的选择重塑了率失真问题本身,改变了潜在表示的信息内容和几何结构,这些变化可能仅从重建中无法察觉。首先,我们证明并实证验证,用神经项(如感知和对抗目标)增强点态重建会减少存储在潜在表示中的信息量。其次,我们展示神经重建损失系统地改变了潜在空间的几何结构:它们使表示更各向同性,并更均匀地将不确定性分布在潜在维度上,产生不同的后验方差分布。这些发现强调了率失真权衡并非理解VAE行为的全面视角,我们提出一种更机械的方法来研究失真度量的选择如何重塑优化问题。

英文摘要

Modern VAEs are rarely trained with the pointwise likelihood implied by the standard $β$-VAE objective. In practice, pointwise reconstruction is often combined with perceptual and adversarial losses, despite a lack of understanding of how this changes the latent dynamics of the model. We show that the choice of reconstruction loss reshapes the rate-distortion problem itself, altering both the information content and the geometry of the learned latent space in ways that may be invisible from reconstructions alone. First, we prove and verify empirically that augmenting pointwise reconstruction with neural terms, such as perceptual and adversarial objectives, reduces the amount of information stored in the latent representations. Second, we show that neural reconstruction losses systematically change the geometry of the latent space: they make representations more isotropic and distribute uncertainty more evenly across latent dimensions, producing different posterior variance profiles. These findings highlight how the rate-distortion tradeoff is not a comprehensive lens to understand the behavior of VAEs, and we propose a more mechanistic approach to investigate how the choice of a distortion metric reshapes the optimization problem.

2606.00634 2026-06-02 cs.CL cs.LG

French parsing enhanced with a word clustering method based on a syntactic lexicon

基于句法词典的词聚类方法增强的法语解析

Anthony Sigogne, Matthieu Constant, Eric Laporte

发表机构 * Université Paris-Est(巴黎-est大学) LIGM(语言与信息学实验室)

AI总结 本文通过将法语句法词典(Lexicon-Grammar)的数据整合到概率解析器中,并应用聚类方法于法语树库的动词,提高了基于概率上下文无关文法的解析性能。

详情
Journal ref
Second Workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL), 2011, Dublin, Ireland, pp.22-27
AI中文摘要

本文评估了从法语句法词典(Lexicon-Grammar, Gross, 1994)中提取的数据整合到概率解析器中的效果。我们表明,通过对法语树库(Abeillé et al., 2003)中的动词应用聚类方法,基于概率上下文无关文法(Petrov et al., 2006)的解析器在法语上获得了准确的性能。

英文摘要

This article evaluates the integration of data extracted from a French syntactic lexicon, the Lexicon-Grammar (Gross, 1994), into a probabilistic parser. We show that by applying clustering methods on verbs of the French Treebank (Abeillé et al., 2003), we obtain accurate performances on French with a parser based on a Probabilistic Context-Free Grammar (Petrov et al., 2006).

2606.00630 2026-06-02 cs.CV stat.ML

A Systematic Benchmark of Intraoperative Ultrasound-to-MR Synthesis for Brain Tumour Surgery

脑肿瘤手术中术中超声到MR合成的系统基准测试

Olga Esteban-Sinovas, Santiago Cepeda, Ignacio Arrese, Rosario Sarabia

发表机构 * Department of Neurosurgery, Neurovascular Unit Río Hortega University Hospital(里奥霍尔特ega大学医院神经外科部门,神经血管单元) Specialized Group in Biomedical Imaging and Computational Analysis (GEIBAC)(生物医学成像与计算分析专项组(GEIBAC)) Instituto de Investigación Biosanitaria de Valladolid (IBioVALL)(瓦尔拉多利德生物医学研究 institute(IBioVALL))

AI总结 针对脑肿瘤手术中术中超声(ioUS)到MR图像合成问题,本研究在公共ReMIND数据集上系统比较了6种生成器、4种推理模式和2种目标,结合图像保真度指标和下游分割评估,发现感知质量(LPIPS)与下游效用最相关,而SSIM与效用负相关,SynDiff-2.5D在下游分割中表现最佳。

详情
AI中文摘要

术中超声(ioUS)在脑肿瘤手术中是一种多功能、成本效益高的模态,但其解释困难:采集平面非标准,伪影具有模态特异性,且其外观与术前MRI(手术规划工具、分割模型和外科医生经验所依赖的)显著不同。从ioUS合成类似MRI的图像可以使基于MRI的基础设施在术中无需额外扫描即可重复使用。大多数先前的工作孤立地评估单一架构;据我们所知,没有基准测试在共同协议下涵盖架构范式、推理机制和下游任务端点。我们在公共ReMIND数据集(76名患者;153对ioUS/T2w和104对ioUS/FLAIR研究;60/16患者级训练/保留测试集划分)上填补了这一空白。六个生成器(四个GAN基线:Pix2Pix、SwinPix2Pix、CycleGAN、CUT;Transformer增强的ResViT;以及少步扩散模型SynDiff)分别在四种推理机制(2D、2.5D、2D+3D细化、全3D)和两种目标(仅T2w;T2w+FLAIR多任务)下训练,共产生48个实验。图像保真度指标(SSIM、PSNR、MAE、LPIPS)辅以nnU-Net v2下游分割评估(肿瘤和切除腔)以及按组织学分级和再次手术的亚组分析。没有一种架构在所有轴上占优,而且关键的是,感知质量与下游效用最密切相关(LPIPS,r=-0.66,p<0.001),而更高的SSIM与更差的效用相关(r=-0.64,p<0.001);SynDiff-2.5D最好地保留了下游分割(U_Dice=0.55)。因此,应报告或优先考虑感知和下游任务指标而非全局SSIM,并且架构选择应取决于手术阶段、患者病史和临床目标。

英文摘要

Intraoperative ultrasound (ioUS) is a versatile, cost-effective modality in brain tumour surgery, but its interpretation is difficult: acquisition planes are non-standard, artefacts are modality-specific, and its appearance differs markedly from the preoperative MRI on which surgical-planning tools, segmentation models and the surgeon's experience rely. Synthesising MRI-like images from ioUS could let this MRI-based infrastructure be reused intraoperatively without an extra scan. Most prior work evaluates a single architecture in isolation; to our knowledge, no benchmark has spanned architectural paradigms, inference regimes and downstream-task endpoints under a common protocol. We address this gap on the public ReMIND data set (76 patients; 153 paired ioUS/T2w and 104 paired ioUS/FLAIR studies; 60/16 patient-level train/held-out split). Six generators (four GAN baselines: Pix2Pix, SwinPix2Pix, CycleGAN, CUT; the transformer-augmented ResViT; and the few-step diffusion model SynDiff) were each trained under four inference regimes (2D, 2.5D, 2D + 3D-refinement, full-3D) and two targets (T2w only; T2w + FLAIR multi-task), yielding 48 experiments. Image-fidelity metrics (SSIM, PSNR, MAE, LPIPS) were complemented by an nnU-Net v2 downstream segmentation evaluation (tumour and resection cavity) and by subgroup analyses by histological grade and reoperation. No architecture dominated every axis, and, critically, perceptual quality tracked downstream utility most closely (LPIPS, r=-0.66, p<0.001), whereas higher SSIM was associated with worse utility (r=-0.64, p<0.001); SynDiff-2.5D best preserved downstream segmentation (U_Dice=0.55). Perceptual and downstream-task metrics should therefore be reported alongside or in preference to global SSIM, and architecture choice conditioned on surgical phase, patient history and clinical objective.

2606.00629 2026-06-02 cs.SD cs.HC cs.LG eess.AS

Quality Audio Prototyping: a prototype system for unified sound retrieval and procedural generation

质量音频原型:统一声音检索与程序化生成的系统原型

Nelly Garcia, Aditya Bhattacharjee, Gabryel Mason-Williams, Israel Mason-Williams, Emmanouil Benetos, Joshua Reiss

发表机构 * GitHub

AI总结 提出QuAP系统,通过统一基于内容的音频检索和实时程序化生成,并集成规则辅助参数指导,降低声音设计中的操作距离,经主观评估和用户测试验证了其有效性和实用性。

详情
Comments
DaFx 2026
AI中文摘要

声音设计工作流经常在耗时的库搜索和复杂的程序化合成之间摇摆,从业者通常依赖独立的工具分别应对每个挑战。本文介绍了质量音频原型(QuAP),一个工作原型,它在单一界面中统一了基于内容的音频检索和程序化声音生成,减少了叙事概念与其声音实现之间的操作距离。QuAP集成了基于相似性的检索引擎与实时程序化音频模型,并辅以基于规则的助手,提供基于感知的参数指导,给出源自经验优化的定义和建议,而不需要先验的合成知识。初步评估证实了这种方法的可行性:主观评估显示六个嵌入合成模型中有五个在质量上具有统计显著性的提升,编码器消融研究在音效数据集上确立了首选的检索架构。与16名从业者的用户评估证实了该工具的工作流实用性,所有参与者一致认为参数助手在保持创作自主性的同时降低了程序化交互的门槛。

英文摘要

Sound design workflows frequently oscillate between time-consuming library searches and the complexity of procedural synthesis, with practitioners typically relying on disconnected tools to address each challenge separately. This paper introduces Quality Audio Prototyping (QuAP), a working prototype that unifies content-based audio retrieval and procedural sound generation within a single interface, reducing the procedural distance between a narrative concept and its sonic realisation. QuAP integrates a similarity-based retrieval engine with real-time procedural audio models, complemented by a rule-based assistant that provides perceptually informed parameter guidance, offering definitions and recommendations derived from empirical optimisation rather than requiring prior synthesis knowledge. Preliminary evaluation confirms the viability of this approach: subjective assessment demonstrated statistically significant quality improvements in five of six embedded synthesis models, and an encoder ablation study established the preferred retrieval architecture on a sound effect dataset. A user evaluation with 16 practitioners confirmed the tool's workflow utility, with all participants agreeing that the parameter assistant preserved creative agency while lowering the barrier to procedural interaction.

2606.00628 2026-06-02 cs.CL

Robust Reasoning via Dynamic Token Selection for Distribution-Aligned Self-Distillation

通过动态令牌选择实现分布对齐的自蒸馏的鲁棒推理

Ruiqi Zhang, Lingxiang Wang, Hainan Zhang Zhiming Zheng

发表机构 * Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, Beihang University(北京未来区块链与隐私计算先进创新中心,北京航空航天大学) School of Artificial Intelligence, Beihang University(北京航空航天大学人工智能学院)

AI总结 针对自蒸馏中参考答案引入风格偏差导致模型模仿表面形式而非学习推理模式的问题,提出分布对齐自蒸馏(DASD),通过动态过滤高困惑度令牌来保留逻辑修正并抑制风格噪声,在数学、代码和常识推理任务上提升鲁棒性。

详情
Comments
12 pages, 13 figures
AI中文摘要

自蒸馏通过将参考答案重写为更符合模型自身分布的训练数据来提高学习效率。然而,参考答案也引入了强烈的风格偏差,导致生成模型模仿表面形式而非学习有用的推理模式。我们观察到重写数据包含大量高困惑度(PPL)令牌,这些令牌来自两个不同的来源:有益的知识增强逻辑修正,以及由参考模仿引起的有害风格漂移。平等对待所有此类令牌会破坏基础模型的原始分布并降低性能,尤其是在困难推理任务上。为了解决这个问题,我们提出了分布对齐自蒸馏(DASD),它使用答案感知的参考模型生成候选令牌,并根据基础模型的置信度动态过滤它们。DASD 保留编码有用逻辑知识的令牌,同时抑制分布不对齐的风格噪声。在数学、代码和常识推理基准上的实验表明,DASD 始终优于竞争基线,减少了高 PPL 令牌,并提高了不同难度任务的鲁棒性。

英文摘要

Self-distillation improves learning efficiency by rewriting reference answers as training data that better matches the model's own distribution. However, reference answers also introduce strong stylistic biases, causing the generative model to imitate surface forms rather than learn useful reasoning patterns. We observe that the rewriting data contains a large number of high-perplexity (PPL) tokens, coming from two distinct sources: beneficial knowledge-enhancing logical corrections, and harmful stylistic drift induced by reference imitation. Treating all such tokens equally can disrupt the base model's original distribution and degrade performance, especially on difficult reasoning tasks. To address this, we propose Distribution-Aligned Self-Distillation (DASD), which uses an answer-aware reference model to generate candidate tokens and dynamically filters them according to the base model's confidence. DASD preserves tokens that encode useful logical knowledge while suppressing distributionally misaligned style noise. Experiments on math, code, and commonsense reasoning benchmarks show that DASD consistently outperforms competitive baselines, reduces high-PPL tokens, and improves robustness across tasks of varying difficulty.

2606.00622 2026-06-02 cs.CV

MM-Snowball: Evaluating and Mitigating Hallucination Snowballing in Multimodal Multi-Turn Dialogue

MM-Snowball:多模态多轮对话中的幻觉雪崩评估与缓解

Yue Jiang, Xue Jiang, Lihua Zhang, Zhiqiang Wang, Yuhang Lu, Peng Wang, Bo Han, Feng Zheng, Dingkang Yang

发表机构 * arXiv.org cs.CV(计算机视觉)

AI总结 针对多模态大模型在多轮对话中因初始错误累积导致幻觉雪崩的问题,提出首个细粒度诊断基准MM-Snowball,并设计无训练的冲突感知视觉校正方法CAVR,通过表示级刷新视觉锚定和logit级修正输出分布来缓解雪崩效应。

详情
Comments
Accepted by The International Conference on Machine Learning (ICML 2026)
AI中文摘要

多模态大语言模型(MLLMs)展现出显著的视觉理解能力,但在交互环境中的可靠性受到幻觉雪崩的严重破坏:一种初始错误在对话轮次间放大,导致连贯性崩溃的现象。这种失败揭示了一个根本性的脆弱性,即模型逐渐忽视视觉锚定,转而过度依赖受污染的文本历史。现有基准主要局限于单轮VQA,无法捕捉长程交互中错误传播的复杂动态。为解决这一问题,我们引入了MM-Snowball,这是首个用于细粒度诊断对话中幻觉雪崩的基准。广泛评估表明,我们的基准对即使是先进的MLLMs也构成了重大挑战,并揭示了现有为单轮VQA设计的缓解方法的无效性。为对抗这种退化,我们提出了冲突感知视觉校正(CAVR)。这种无训练方法通过协同双机制缓解雪崩:在表示级刷新视觉锚定,并在logit级修正输出分布,有效地将模型重新锚定到视觉事实。实验表明,CAVR达到了最先进的性能,为更可靠的交互式AI提供了一条有希望的路径。数据和代码可在 https://frenkie-chiang.github.io/MM-Snowball 获取。

英文摘要

Multimodal large language models (MLLMs) demonstrate remarkable visual understanding, yet their reliability in interactive settings is severely undermined by hallucination snowballing: a phenomenon where initial errors amplify across conversational turns, leading to a collapse in coherence. This failure reveals a fundamental vulnerability where models progressively neglect visual grounding in favor of over-relying on polluted textual history. Existing benchmarks are predominantly confined to single-turn VQA, which fail to capture the complex dynamics of error propagation in long-horizon interactions. To address this, we introduce MM-Snowball, the first benchmark for fine-grained diagnosis of hallucination snowballing within dialogues. Extensive evaluation shows that our benchmark poses a significant challenge even to advanced MLLMs and reveals the inefficacy of existing mitigation methods designed for single-turn VQA. To counteract this degradation, we propose Conflict-Aware Visual Rectification (CAVR). This training-free method mitigates snowballing through a synergistic dual-mechanism that refreshes visual grounding at the representation level and rectifies output distributions at the logit level, effectively re-anchoring the model to visual facts. Experiments demonstrate that CAVR achieves state-of-the-art performance, offering a promising path toward more reliable interactive AI. Data and code are available at: https://frenkie-chiang.github.io/MM-Snowball

2606.00620 2026-06-02 cs.CV

FlowNar: Scalable Streaming Narration for Long-Form Videos

FlowNar: 面向长视频的可扩展流式叙述

Zeyun Zhong, Manuel Martin, Chengzhi Wu, David Schneider, Frederik Diederichs, Juergen Gall, Juergen Beyerer

发表机构 * arXiv.org [cs.CV] 30 May 2026([cs.CV] 30 2026年5月)

AI总结 提出FlowNar框架,通过动态上下文管理和CLAM模块实现有界视觉记忆与计算复杂度,在流式视频叙述中兼顾高质量与高效率。

详情
Comments
Accepted to ICML 2026
AI中文摘要

近期的大型多模态模型(LMMs)主要针对离线场景设计,难以适应流式视频的动态需求。虽然最近的在线适配改进了实时处理,但仍面临关键的可扩展性挑战,资源需求通常随视频时长至少线性增长。为突破这一瓶颈,我们提出FlowNar,一种用于可扩展流式视频叙述的新型框架。FlowNar的核心是一种用于历史视觉上下文移除的动态上下文管理策略,结合我们的CLAM(跨线性注意力记忆)模块用于流式视觉历史保留,确保有界的视觉内存使用和计算复杂度,这对高效流式处理至关重要。我们还引入了一个现实的自条件评估协议和补充评估指标,以在类似部署的条件下评估流式叙述模型。在Ego4D、EgoExo4D和EpicKitchens100数据集上的实验表明,FlowNar在强基线上显著提高了叙述质量,同时保持高效,支持处理10倍长的视频,并实现3倍更高的吞吐量(FPS)。代码可在https://github.com/zeyun-zhong/FlowNar获取。

英文摘要

Recent Large Multimodal Models (LMMs), primarily designed for offline settings, are ill-suited for the dynamic requirements of streaming video. While recent online adaptations improve real-time processing, they still face critical scalability challenges, with resource demands typically growing at least linearly with video duration. To overcome this bottleneck, we propose FlowNar, a novel framework for scalable streaming video narration. The core of FlowNar is a dynamic context management strategy for historical visual context removal, combined with our CLAM (Cross Linear Attentive Memory) module for streaming visual history retention, ensuring bounded visual memory usage and computational complexity, crucial for efficient streaming. We also introduce a realistic self-conditioned evaluation protocol and complementary evaluation metrics to assess streaming narration models under deployment-like conditions. Experiments on the Ego4D, EgoExo4D, and EpicKitchens100 datasets demonstrate that FlowNar substantially improves narration quality over strong baselines while being highly efficient, supporting processing of 10$\times$ longer videos and achieving 3$\times$ higher throughput (FPS). The code is available at https://github.com/zeyun-zhong/FlowNar.

2606.00619 2026-06-02 cs.CL cs.AI

MemPro: Agentic Memory Systems as Evolvable Programs

MemPro:作为可进化程序的智能体记忆系统

Qingshan Liu, Guoqing Wang, Wen Wu, Jingqi Huang, Xinqi Tao, Dejia Song, Jie Zhou, Liang He

发表机构 * East China Normal University(东华师范大学) Xiaohongshu Inc.(小红书公司)

AI总结 提出MemPro框架,将整个记忆构建-检索管道视为可进化程序,通过故障模式引导的编辑-调试迭代优化,在多个长时任务数据集上超越静态和提示级进化基线。

详情
Comments
20 pages, 14 figures
AI中文摘要

长时程自主智能体需要记忆系统来保留历史信息、跟踪演化状态并在有限上下文窗口之外重用相关知识。现有的智能体记忆系统通常遵循记忆构建-检索(MCR)管道,但往往主要适应记忆库,而在部署后保持周围管道固定。这种固定管道设计难以处理异构的任务特定故障模式,并且可能随着时间推移与规模和结构演化的记忆库产生错位。为解决这些限制,我们提出MemPro,一种系统级进化框架,将整个MCR管道视为可进化程序,而不仅仅是适应记忆库或提示文本。MemPro维护一个可运行记忆系统实现的版本树,其中进化智能体迭代选择有前途的版本,诊断重复出现的故障,并通过故障模式引导的编辑-调试改进创建改进的子版本。在LongMemEval、LoCoMo、HotpotQA和NarrativeQA上的实验表明,MemPro在几次迭代内持续优于强静态和提示级进化基线,随着进化持续改进,并实现了良好的性能-成本权衡。代码可在https://github.com/wanghai673/MemPro获取。

英文摘要

Long-horizon autonomous agents require memory systems to retain historical information, track evolving states, and reuse relevant knowledge beyond finite context windows. Existing agentic memory systems typically follow a memory construction-retrieval (MCR) pipeline, but often adapt mainly the memory bank while keeping the surrounding pipeline fixed after deployment. This fixed-pipeline design struggles to handle heterogeneous task-specific failure modes and can become misaligned with memory banks that evolve in scale and structure over time. To address these limitations, we propose MemPro, a system-level evolution framework that treats the entire MCR pipeline as an evolvable program rather than adapting only the memory bank or prompt text. MemPro maintains a version tree of runnable memory-system implementations, where an Evolving Agent iteratively selects promising versions, diagnoses recurring failures, and creates improved child versions through failure-mode-guided edit-debug refinement. Experiments on LongMemEval, LoCoMo, HotpotQA, and NarrativeQA show that MemPro consistently outperforms strong static and prompt-level evolving baselines within a few iterations, continues to improve with evolution, and achieves a favorable performance-cost trade-off. Code is available at https://github.com/wanghai673/MemPro.

2606.00618 2026-06-02 cs.AI

Efficient Test-time Inference for Generative Planning Models

生成式规划模型的高效测试时推理

Robert Gieselmann, Mihai Samson, Federico Pecora, Jeremy L. Wyatt

发表机构 * University of California, Berkeley(加州大学伯克利分校) ETH Zurich(苏黎世联邦理工学院)

AI总结 本文提出一种改进的开放-封闭列表搜索算法,结合生成模型和启发式模型,在测试时高效推理,提升生成式规划模型的解质量和计算效率。

详情
AI中文摘要

生成式模型已成为人工智能规划的强大范式,但其性能仍受训练数据分布的限制。一种方法是通过扩展测试时计算来改进推理过程中生成的解决方案。更高效的替代方案是优化推理过程本身。在本文中,我们展示了经典开放-封闭列表(OCL)搜索的修改版本提供了这样一种高效的推理过程。我们的算法协同了两个学习组件:一个从中间状态执行快速推演的生成模型,以及一个在候选推理路径中优先排序的启发式模型。关键贡献包括新颖的探索控制机制以及将学习模型集成到OCL框架中。在多个组合规划领域中,我们的方法在计算效率和解质量上均优于神经符号搜索基线和经典求解器。

英文摘要

Generative models have emerged as a powerful paradigm for AI planning, yet their performance remains constrained by the training data distribution. One approach is to improve generated solutions during inference by scaling test-time compute. A more efficient alternative is to optimize the inference process itself. In this paper, we show that a modified version of a classical Open-Closed List (OCL) search provides just such an efficient inference procedure. Our algorithm synergizes two learned components: a generative model that performs fast rollouts from intermediate states and a heuristic model that prioritizes among candidate reasoning paths. Key contributions include novel exploration control mechanisms and integration of learned models within the OCL framework. Across multiple combinatorial planning domains, our approach outperforms both neurosymbolic search baselines and classical solvers in computational efficiency and solution quality.

2606.00613 2026-06-02 cs.CL cs.AI

Linguistics-Aware Non-Distortionary LLM Watermarking

语言学感知的无失真LLM水印

Shinwoo Park, Hyejin Park, Hyeseon An, Yo-Sub Han

发表机构 * Yonsei University(延世大学) Rensselaer Polytechnic Institute(罗切斯特理工学院)

AI总结 提出LUNA水印方法,通过语言自适应非失真二元锦标赛采样器,在保持文本质量的同时实现高检测性能,在12种设置中AUROC达0.9959且中位困惑度偏移仅0.045。

详情
AI中文摘要

水印应能识别语言模型输出而不降低质量或限制验证仅由模型提供者进行。多语言部署使这更加困难,因为形态、分词和书写系统的变化会改变水印证据自然进入的位置。我们引入LUNA,一种语言自适应水印,结合了无模型检测和标准随机密钥模型下的单令牌无失真。LUNA从外部语料库中的词性上下文估计归一化下一标记熵,并用其设置无失真二元锦标赛采样器的深度;检测器从文本、分词器、词性标注器和密钥重建相同的调度。我们在六种类型多样的语言和两个领域上评估了八种主要基线。LUNA在十二种设置中达到了0.9959的AUROC和最低的平均绝对中位困惑度偏移0.045;其95%自助法区间[0.022, 0.073]低于所有基线区间。LUNA还记录了最低的平均Self-BLEU、Distinct-1、surprisal和熵偏移。它是唯一同时在大多数设置中实现AUROC > 0.99和绝对中位困惑度偏移低于0.1的方法,在12种设置中的9种达到该状态,而没有任何基线在超过2种设置中达到。我们的代码可在https://github.com/Shinwoo-Park/luna_watermark获取。

英文摘要

Watermarking should identify language-model output without degrading quality or limiting verification to the model provider. Multilingual deployment makes this harder because morphology, segmentation, and script change where watermark evidence can enter naturally. We introduce LUNA, a linguistically adaptive watermark that combines model-free detection with single-token non-distortion under the standard random-key model. LUNA estimates normalized next-tag entropy from part-of-speech contexts in an external corpus and uses it to set the depth of a non-distortionary binary tournament sampler; the detector reconstructs the same schedule from text, a tokenizer, a tagger, and a secret key. We evaluate six typologically diverse languages and two domains against eight primary baselines. LUNA attains an AUROC of 0.9959 and the lowest mean absolute median perplexity shift of 0.045 across the twelve settings; its 95% bootstrap interval [0.022, 0.073] lies below all baseline intervals. LUNA also records the lowest mean Self-BLEU, Distinct-1, surprisal, and entropy shifts. It is the only method that simultaneously achieves AUROC > 0.99 and an absolute median perplexity shift below 0.1 in a majority of settings, reaching this regime in 9 of the 12 settings while no baseline reaches it in more than 2. Our code is available at: https://github.com/Shinwoo-Park/luna_watermark

2606.00611 2026-06-02 cs.AI

TRACE: Trajectory Risk-Aware Compression for Long-Horizon Agent Safety

TRACE: 面向长程智能体安全的轨迹风险感知压缩

Zhepei Hong, Lin Wang, Liting Li, Haokai Ma, Junfeng Fang, Fei Shen, Dan Zhang, Xiang Wang

发表机构 * University of Science and Technology of China(中国科学技术大学) National University of Singapore(新加坡国立大学) South China Normal University(华南师范大学)

AI总结 提出轨迹风险感知压缩方法TRACE,通过压缩器-阅读器架构将长轨迹压缩为潜在证据状态,以聚合稀疏风险信号并提升长程安全检测准确率。

详情
AI中文摘要

长程LLM智能体在长轨迹中产生安全证据,其中稀疏、延迟和组合的风险信号常常逃脱局部审核。现有的轮次级或短上下文检测器难以在长时间跨度内可靠地保留和聚合此类证据。我们将长程智能体安全检测重新定义为轨迹级证据压缩,并提出面向长程智能体安全的轨迹风险感知压缩(TRACE)。TRACE采用压缩器-阅读器设计:压缩器在轨迹级监督下将完整轨迹编码为紧凑的潜在证据状态,阅读器以该潜在证据状态作为安全参考来判断原始轨迹。该设计有助于聚合分散的风险线索并减少过早的证据丢失。在ASSEBench、Pre-Ex-Bench和R-Judge上,TRACE在所有评估基线上取得了最佳准确率,相比强基线最高提升12.6个百分点。在LongSafety上,TRACE随着上下文长度增加表现出更小的性能下降。注意力可视化和案例研究表明,压缩后的参考有助于阅读器聚焦于风险关键片段并恢复跨步证据。代码可在https://github.com/Peregrine123/TRACE_official获取。

英文摘要

Long-horizon LLM agents produce safety evidence across long trajectories, where sparse, delayed, and compositional risk signals often escape local moderation. Existing turn-level or short-context detectors struggle to reliably retain and aggregate such evidence over extended horizons. We reframe long-horizon agent safety detection as trajectory-level evidence compression and propose Trajectory Risk-Aware Compression for Long-Horizon Agent Safety (TRACE). TRACE uses a Compressor-Reader design: the Compressor encodes the full trajectory into a compact latent evidence state under trajectory-level supervision, and the Reader judges the raw trajectory with this latent evidence state as a safety reference. This design helps aggregate dispersed risk cues and reduce premature evidence loss. Across ASSEBench, Pre-Ex-Bench, and R-Judge, TRACE achieves the best accuracy on all evaluated backbones, improving over strong baselines by up to 12.6 percentage points. On LongSafety, TRACE shows smaller performance degradation as context length grows. Attention visualizations and case studies suggest that the compressed reference helps the Reader focus on risk-critical segments and recover cross-step evidence. Code is available at https://github.com/Peregrine123/TRACE_official.

2606.00609 2026-06-02 cs.LG cs.AI

CARE-RL: Capability-Aware Reinforcement Learning for Mitigating Cross-Domain Conflicts

CARE-RL:用于缓解跨领域冲突的能力感知强化学习

Rui Zhang, Xinle Wu, Yao Lu

发表机构 * National University of Singapore(新加坡国立大学)

AI总结 提出CARE-RL框架,结合协议感知奖励生成与能力感知优化,通过PA-GRM和DACSP方法缓解多领域强化学习中的奖励不可靠与能力干扰问题。

详情
AI中文摘要

具有可验证奖励的强化学习在面向推理的大语言模型中取得了显著进展,但由于非可验证任务中奖励不可靠以及跨领域能力干扰,将其扩展到多领域强化学习仍具挑战性。我们提出CARE-RL,将协议感知奖励生成与能力感知优化相结合,以缓解跨领域冲突。对于非可验证任务,协议感知生成式奖励模型(PA-GRM)在生成轨迹条件奖励之前构建提示级别的评估协议和模式,从而实现对开放式响应的任务自适应且可比较的评估。对于多领域优化,方向感知能力子空间投影(DACSP)从先前的强化学习阶段提取历史能力方向,并通过放大对齐分量、抑制冲突分量以及保留正交更新来调节后续更新。在数学、聊天和指令遵循基准上的实验表明,CARE-RL始终优于标准的多领域强化学习基线,在Qwen2.5-7B和Qwen3-4B上分别达到47.9和50.7的总平均分。

英文摘要

Reinforcement learning (RL) with verifiable rewards has achieved strong progress in reasoning-oriented LLMs, but extending it to multi-domain RL remains challenging due to reward unreliability in non-verifiable tasks and capability interference across domains. We propose CARE-RL to combine protocol-aware reward generation with capability-aware optimization for mitigating cross-domain conflicts. For non-verifiable tasks, the Protocol-Aware Generative Reward Model (PA-GRM) constructs prompt-level evaluation protocols and schemas before producing trace-conditioned rewards, enabling task-adaptive yet comparable evaluation of open-ended responses. For multi-domain optimization, Direction-Aware Capability Subspace Projection (DACSP) extracts historical capability directions from previous RL stages and modulates later updates by amplifying aligned components, suppressing conflicting components, and preserving orthogonal updates. Experiments across math, chat, and instruction-following benchmarks show that CARE-RL consistently outperforms standard multi-domain RL baselines, achieving Total Avg scores of 47.9 and 50.7 on Qwen2.5-7B and Qwen3-4B, respectively.