arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 3990
2606.08473 2026-06-09 cs.LG 新提交

Physically Consistent Null Space Alignment for Detection of Low-Magnitude False Data Injection Attacks

物理一致零空间对齐用于检测低幅值虚假数据注入攻击

Xin Li, Chenhan Xiao, Jonathan Cohen, Aviad Elyashar, Yang Weng, Rami Puzis

发表机构 * Ben-Gurion-University(本-古里安大学)

AI总结 提出物理一致零空间对齐(PCNSA)框架,通过伪零空间守恒预处理保持物理零空间与测量伪零空间的几何对应,从而检测低幅值但高影响的隐蔽虚假数据注入攻击。

详情
Comments
12 pages, 13 figures
AI中文摘要

虚假数据注入攻击(FDIAs)引入小的测量扰动,当注入信号与系统模型的伪零空间对齐时,仍可能导致电力系统状态估计出现较大偏差。现有的基于模型和数据驱动的检测器可能无法识别这种低幅值但高影响的攻击,因为残差检验忽略了隐藏在伪零空间中的变化,而子空间学习方法捕获相关模式但未强制执行物理一致性。本文提出物理一致零空间对齐(PCNSA),一种通过预处理保持物理零空间与测量导出伪零空间之间的几何对应来检测隐蔽FDIAs的框架。关键在于伪零空间守恒数据预处理(PSCP)步骤,该步骤在子空间提取之前将测量重新表达在物理坐标系中。我们证明PSCP保持了行空间与其正交补之间的分离,这是传统逐特征标准化所违反的性质。这使得奇异值分解(SVD)导出的伪零子空间与物理残差空间对齐,而无需显式知道H。在IEEE 14、30、57和118节点系统上的实验证实了这一原理:逃避XTM、LSTM、AE和Isolation Forest基线的隐蔽攻击在对齐子空间中表现为明显偏差,从而获得更高的F1分数和检测精度,同时在部分可观测性和实际PMU噪声下保持鲁棒性。

英文摘要

False data injection attacks (FDIAs) introducing small measurement perturbations can still cause large deviations in power system state estimation when the injected signals align with the pseudo-null space of the system model. Existing model- and data-driven detectors may fail to identify such low-magnitude but high-impact attacks because residual tests ignore changes hidden in the pseudo-null space, while subspace learning methods capture correlation patterns without enforcing physical consistency. This paper proposes Physically Consistent Null Space Alignment (PCNSA), a framework that detects stealthy FDIAs by preserving, through preprocessing, the geometric correspondence between the physical null space and the measurement-derived pseudo-null space. The key point is a Pseudo-null Space Conserved data Preprocessing (PSCP) step that re-expresses measurements in the physical coordinate frame before subspace extraction. We prove that PSCP preserves the separation between row space and its orthogonal complement, a property that conventional per-feature standardization violates. This keeps the singular value decomposition (SVD)-derived pseudo-null subspace aligned with the physical residual space without explicit knowledge of H. Experiments on IEEE 14-, 30-, 57-, and 118-bus systems confirm this principle in practice: stealthy attacks that evade XTM, LSTM, AE and Isolation Forest baselines appear as clear deviations in the aligned subspace, yielding higher F1-score and detection accuracy while remaining robust under partial observability and realistic PMU noise.

2606.08471 2026-06-09 cs.CL cs.AI 新提交

More Yap Less Meaning: Uncovering Self-Improvement Behavior in SLMs

更多废话,更少意义:揭示小语言模型中的自我改进行为

Marina Igitkhanian, Erik Arakelyan

发表机构 * American University of Armenia(亚美尼亚美国大学) NVIDIA(英伟达)

AI总结 本研究通过构建充分性测试,发现小语言模型在自我纠正中仅获得4.4%的准确率提升,且较长的提示反而与错误答案正相关,表明其推理能力有限。

详情
Comments
GEM Workshop at ACL 2026
AI中文摘要

近年来,语言模型在各个领域和应用中取得了快速进展。然而,它们的自我改进能力——即是否善于识别和纠正自身推理中的缺陷——仍然存疑。在本研究中,我们通过构建一个充分性测试来严格检验小语言模型(SLMs)的自我纠正能力。我们提出了一个最小化的三步自我纠正流程:收集初始SLM答案,提示同一模型根据真实答案为错误回答生成提示,然后将相同问题与模型自身的反馈一起输入以改进初始答案。我们在算术和逻辑推理基准上评估了多种指令微调和推理SLM。我们的发现表明,注入提示句子的SLM相比初始问答准确率仅提升4.4%。即使正确答案与模型的错误推理一起提供,评估的SLM也无法理解其推理中缺失了什么,并且在导致纠正和未导致纠正的提示之间显示出最小的语义差异。此外,我们的实验表明,较长的提示与错误的最终答案正相关,表明对问题的较长思考可能阻碍推理过程,这意味着SLM的性能不一定随更大的计算预算而扩展。

英文摘要

Recently, language models have made rapid progress across various domains and applications. However, their capability for self-improvement, i.e., whether they are adept at recognising and correcting flaws in their own reasoning, remains dubious. In this study, we address this question by constructing a sufficiency test to rigorously examine the self-correction capabilities of small language models (SLMs). We propose a minimal three-step self-correction pipeline that collects initial SLM answers, prompts the same model to generate hints for its incorrect responses given the ground truth, and feeds the model the same question with its own feedback to refine the initial answer. We evaluate a variety of instruction-tuned and reasoning SLMs in this experimental setup on arithmetic and logical reasoning benchmarks. Our findings show that SLMs with injected hint sentences yield only a 4.4 percent gain over initial question-answering accuracy. Even though the correct answer was provided alongside the model's incorrect reasoning, the evaluated SLMs fail to understand what was missing in their reasoning and show minimal semantic difference between hints that lead to corrections and ones that do not. Furthermore, our experiments show that longer hints are positively correlated with incorrect final answers, suggesting that longer deliberation on problems can hinder the reasoning process, meaning that SLMs do not necessarily scale in performance with a larger compute budget.

2606.08470 2026-06-09 cs.RO 新提交

LUNA-AD: Lightweight Uncertainty-Aware Language Model with Lifelong Learning for Autonomous Driving

LUNA-AD: 面向自动驾驶的轻量级不确定性感知语言模型与终身学习

Ruoyu Yao, Pei Liu, Ruiguo Zhong, Mingxing Peng, Rui Yang, Jun Ma

发表机构 * The Hong Kong University of Science and Technology (Guangzhou)(香港科技大学(广州))

AI总结 提出LUNA-AD,一种结合三系统架构、多智能体分析、双头轻量模型和反思驱动终身学习的轻量级不确定性感知语言模型,在nuPlan上实现高成功率与低推理延迟。

详情
Comments
16 pages,9 figures
AI中文摘要

虽然大型语言模型(LLMs)提供了有前景的推理能力,但它们在安全关键的驾驶系统中的集成受到推理多样性有限、高计算开销和静态学习范式的阻碍。为了解决这些挑战,我们提出了LUNA-AD,一种面向自动驾驶(AD)的轻量级不确定性感知语言模型与终身学习。LUNA-AD采用三系统架构,协调复杂的多模态行为推理、高效部署和持续改进。我们设计了一个多智能体分析系统,通过多样化的假设探索生成不确定性感知的决策演示。一个双头轻量启发式模型被蒸馏,以统一决策分布和文本解释的推理,同时实现高效部署。此外,一种反思驱动的终身学习机制作用于多模态决策输出并保持策略多样性,允许通过闭环反馈改进候选决策和理由,以增强驾驶鲁棒性。在nuPlan基准上的大量实验表明,与现有知识驱动的AD框架相比,LUNA-AD在非反应式和反应式模式下均实现了最先进的成功率,并显著降低了推理延迟。

英文摘要

While large language models (LLMs) offer promising reasoning capabilities, their integration into safety-critical driving systems is hindered by limited reasoning diversity, high computational overhead, and static learning paradigms. To address these challenges, we propose LUNA-AD, a lightweight uncertainty-aware language model with lifelong learning for autonomous driving (AD). LUNA-AD features a tri-system architecture that reconciles complex multimodal behavioral reasoning, efficient deployment, and continual refinement. We design a multi-agent analytical system to generate uncertainty-aware decision-making demonstrations through diverse hypothesis exploration. A dual-head lightweight heuristic model is distilled to unify the inference of decision distributions and textual explanations while enabling efficient deployment. Furthermore, a reflection-driven lifelong learning mechanism operates on multimodal decision outputs and preserves strategic diversity, allowing for the refinement of candidate decisions and rationales via closed-loop feedback to enhance driving robustness. Extensive experiments on nuPlan benchmarks demonstrate that LUNA-AD achieves state-of-the-art success rates under both non-reactive and reactive modes, with drastically reduced inference latency compared to existing knowledge-driven AD frameworks.

2606.08467 2026-06-09 cs.LG cs.AI 新提交

The Confidence Trap: Calibration Attacks for Graph Neural Networks

置信陷阱:图神经网络的校准攻击

Cuong Dang, Jiahao Zhang, Hieu Ta Quang, Dung Le, Lu Cheng, Suhang Wang

发表机构 * Virginia Polytechnic Institute and State University(弗吉尼亚理工学院暨州立大学) The Pennsylvania State University(宾夕法尼亚州立大学) VinUniversity University of Illinois at Chicago(伊利诺伊大学芝加哥分校)

AI总结 提出统一图校准攻击(UGCA)框架,通过KL散度损失、重排序机制和混合损失等策略,在保持分类精度下显著提高期望校准误差,揭示高精度或多类模型更易受攻击。

详情
AI中文摘要

尽管置信校准对于安全关键应用中的可信决策至关重要,但校准后的GNN对对抗性结构扰动的鲁棒性仍未被充分探索。然而,研究图上的校准攻击面临独特的技术挑战:(1)图结构的离散性使基于梯度的优化复杂化;(2)现有的低置信目标无法将预测推向均匀分布;(3)GNN对边扰动高度敏感,常导致违反攻击约束的意外标签变化。为应对这些挑战,我们提出一个\textbf{统一图校准攻击(UGCA)}框架,用于GNN校准鲁棒性的\textbf{最坏情况(白盒)分析}。UGCA引入KL散度损失以鼓励均匀预测分布,重排序机制以减少标签翻转,混合损失以在违规时恢复标签,以及束搜索以探索更广的对抗搜索空间。我们进一步提供理论见解,将模型泛化、数据集复杂性和校准脆弱性联系起来,表明在该威胁模型下,具有更高精度或在更多类别数据集上训练的模型更容易受到攻击。大量实验表明,UGCA在保持分类精度的同时显著增加了期望校准误差。我们的代码公开在https://github.com/CaptainCuong/Graph-Calibration-Attack.git。

英文摘要

While confidence calibration is essential for trustworthy decision-making in safety-critical applications, the robustness of calibrated GNNs to adversarial structural perturbations remains largely unexplored. However, studying calibration attacks on graphs presents unique technical challenges: (1) the discrete nature of graph structures complicates gradient-based optimization, (2) existing underconfidence objectives fail to drive predictions toward uniform distributions, and (3) GNNs are highly sensitive to edge perturbations, often causing unintended label changes that violate attack constraints. To address these challenges, we propose a \textbf{Unified Graph Calibration Attack (UGCA)} framework designed for \textbf{worst-case (white-box) analysis} of GNN calibration robustness. UGCA introduces a KL-divergence loss to encourage uniform predictive distributions, a reranking mechanism to reduce label flipping, a hybrid loss to recover labels when violations occur, and beam search to explore a broader adversarial search space. We further provide theoretical insights linking model generalization, dataset complexity, and calibration vulnerability, showing that models with higher accuracy or trained on datasets with more classes are more susceptible under this threat model. Extensive experiments demonstrate that UGCA substantially increases Expected Calibration Error while preserving classification accuracy. Our code is publicly available at https://github.com/CaptainCuong/Graph-Calibration-Attack.git.

2606.08458 2026-06-09 cs.RO 新提交

Personalized and Robust Proactive Robot Assistance with Uncertainty-Guided LLM Reasoning

个性化且鲁棒的主动机器人辅助:基于不确定性引导的大语言模型推理

Alvaro Gonzalez, M. H. Hasan Shovo, Ali Ayub

发表机构 * Concordia University(康考迪亚大学)

AI总结 提出GLOBE框架,结合n-gram马尔可夫模型与不确定性引导的大语言模型推理,在家庭环境中实现高效鲁棒的主动机器人辅助,并在HOMER-Noise数据集上验证了其性能与效率。

详情
Comments
Accepted to the 2026 IEEE 35th International Conference on Robot and Human Interactive Communication (RO-MAN)
AI中文摘要

在家庭环境中,主动机器人辅助需要在动态和嘈杂条件下准确预测人类活动和物体使用。现有方法通常依赖复杂的时空模型,这些模型计算成本高且对环境变化敏感。本文提出GLOBE,一个轻量级框架,结合n-gram马尔可夫模型捕捉时间行为模式与不确定性引导的大语言模型推理。该框架高效执行序列预测,仅在模型置信度低时选择性调用大语言模型推理。为评估现实条件下的性能,我们引入HOMER-Noise,即HOMER+数据集的噪声扩展,模拟由人类、宠物和幼儿引起的物体移动等结构化干扰。实验结果表明,GLOBE在干净和嘈杂环境下均达到与最先进方法竞争的性能,同时提高了鲁棒性和计算效率。该框架进一步通过与Stretch 3移动操作器的概念验证集成得到验证,展示了其在真实人机交互场景中的潜在应用。

英文摘要

Proactive robot assistance in household environments requires accurate prediction of human activities and object usage under dynamic and noisy conditions. Existing approaches often rely on complex spatio-temporal models, which can be computationally expensive and sensitive to environmental variability. In this paper, we propose GLOBE, a lightweight framework that combines n-gram Markov models for capturing temporal behavioral patterns with uncertainty-guided large language model (LLM) reasoning. The framework performs sequential prediction efficiently while selectively invoking LLM reasoning only when the model confidence is low. To evaluate performance under realistic conditions, we introduce HOMER-Noise, a noisy extension of the HOMER+ dataset that simulates structured disturbances such as object movements caused by humans, pets, and toddlers. Experimental results show that GLOBE achieves competitive performance with state-of-the-art methods while improving robustness and computational efficiency across both clean and noisy settings. The framework is further validated through a proof-of-concept integration with a Stretch 3 mobile manipulator, demonstrating its potential application in real-world human-robot interaction scenarios.

2606.08454 2026-06-09 cs.LG cs.CL 新提交

Beyond Linear Activation Steering: Invertible Latent Transformations for Controlling LLM Behavior

超越线性激活引导:用于控制大语言模型行为的可逆潜在变换

Tuc Nguyen, Thai Le

发表机构 * Indiana University Bloomington(印第安纳大学伯明顿分校)

AI总结 提出INNSteer框架,通过可逆神经网络将LLM激活映射到潜在空间进行线性控制,再逆变换回原空间,实现非线性、输入依赖的激活引导,在多个模型和基准上优于现有方法。

详情
Comments
36 pages, 7 figures
AI中文摘要

激活引导提供了一种轻量级的推理时机制,通过修改大语言模型(LLM)的内部激活向量,使其朝向期望行为。现有方法大多在原始激活空间中计算固定的引导方向,通常使用对比示例对的均值差、线性探针或任意可分离性标准。虽然在一定程度上有效,但这些方法将行为控制视为全局线性加性偏移:相同的方向应用于所有输入,且行为是线性可分的。当行为特征在激活空间中非线性变化或位于弯曲和各向异性流形上时,这种处理可能具有局限性,因为最优干预可能是输入依赖的。为解决这一限制,我们提出了INNSteer,一种基于可逆潜在变换的非线性激活引导框架。INNSteer并非在原始表示空间中寻找更好的引导向量,而是学习一个轻量级可逆神经网络$ϕ$,将LLM的激活映射到潜在空间,在该空间中行为类别更易于线性控制。推理时,激活通过$ϕ$映射,在潜在空间中进行引导,再通过精确逆变换$ϕ^{-1}$映射回原空间。这使得简单的潜在空间平移在原始激活空间中变为非线性、输入依赖的干预。在多个LLM系列、规模、行为特征和安全基准的实验设置中,INNSteer在保持生成流畅性的同时,一致地优于线性、基于传输和非线性的引导基线。

英文摘要

Activation steering provides a lightweight inference-time mechanism for controlling large language models (LLMs) by modifying their internal activation vectors toward desired behaviors. Most existing methods compute a fixed steering direction in the original activation space, typically from pairs of contrastive examples using mean differences, linear probes, or arbitrary separability criteria. While effective to a certain extent, these methods treat behavioral control as a global, linear, additive offset: the same direction is applied across inputs, and behaviors are linearly separable. This can be restrictive when behavioral features vary nonlinearly across the activation space or lie on curved and anisotropic manifolds, where the optimal intervention may be input-dependent. To address this limitation, we propose INNSteer, a nonlinear activation steering framework based on invertible latent transformations. Rather than searching for a better steering vector in the original representation space, INNSteer learns a lightweight invertible neural network $ϕ$ that maps an LLM's activations into a latent space where behavioral classes are more amenable to linear control. At inference time, activations are mapped through $ϕ$, steered in the latent space, and mapped back through the exact inverse transformation $ϕ^{-1}$. This makes a simple latent-space translation become a nonlinear, input-dependent intervention in the original activation space. Across experiment settings on multiple LLM families, scales, behavioral traits, and safety benchmarks, INNSteer consistently improves model control over linear, transport-based, and nonlinear steering baselines while largely preserving generation fluency.

2606.08452 2026-06-09 cs.LG 新提交

Theoretical Foundations of Continual Learning via Drift-Plus-Penalty

基于漂移加惩罚的持续学习的理论基础

Nazreen Shah, Govinda Arya, Bharath B. N., Ranjitha Prasad

发表机构 * IIIT Delhi(德里印度理工学院) IIT Dharwad(达尔瓦德印度理工学院)

AI总结 提出COLD框架,利用漂移加惩罚原理调节稳定性-可塑性权衡,通过虚拟队列控制遗忘,理论保证收敛性,实验优于现有方法。

详情
Comments
Accepted to Transactions on Machine Learning Research (TMLR)
AI中文摘要

在许多实际场景中,数据流是非平稳的且顺序到达,要求学习系统在不从头重新训练的情况下持续适应。持续学习通过整合新任务同时缓解灾难性遗忘来应对这一挑战,其中学习新信息会降低先前知识的性能。我们引入了一种控制理论视角来明确调节遗忘的演化,将适应视为受长期稳定性约束的受控过程。我们专注于基于回放的持续学习,其中有限的内存缓冲区存储来自先前任务的代表性样本。我们提出了基于漂移加惩罚原理的持续学习框架COLD,该原理来自随机优化。为了便于分析,我们还考虑了一种oracle变体COLD-ORACLE作为参考基准。在每个任务中,两种方法都最小化当前任务损失,同时维护一个虚拟队列,该队列跟踪先前学习任务上长期稳定性的偏差,将稳定性-可塑性权衡捕捉为受调节的动态过程。我们建立了稳定性和收敛性保证,通过可调控制参数表征这种权衡。在标准基准上的实验表明,COLD在提供竞争性和可控的遗忘行为的同时,通过显式调节稳定性和可塑性,始终优于广泛的最先进的持续学习方法。

英文摘要

In many real-world settings, data streams are nonstationary and arrive sequentially, requiring learning systems to adapt continuously without retraining from scratch. Continual learning (CL) addresses this challenge by incorporating new tasks while mitigating catastrophic forgetting, where learning new information degrades performance on previously acquired knowledge. We introduce a control-theoretic perspective on CL that explicitly regulates the evolution of forgetting, framing adaptation as a controlled process subject to long-term stability constraints. We focus on replay-based CL, where a finite memory buffer stores representative samples from prior tasks. We propose COntinual Learning with Drift-Plus-Penalty (COLD), a continual learning framework based on the Drift-Plus-Penalty (DPP) principle from stochastic optimization. To facilitate analysis, we also consider an oracle variant, COLD-ORACLE, as a reference benchmark. At each task, both methods minimize the current task loss while maintaining a virtual queue that tracks deviations from long-term stability on previously learned tasks, capturing the stability-plasticity trade-off as a regulated dynamical process. We establish stability and convergence guarantees that characterize this trade-off through a tunable control parameter. Experiments on standard benchmarks demonstrate that COLD consistently outperforms a broad range of state-of-the-art CL methods while providing competitive and controllable forgetting behavior through explicit regulation of stability and plasticity.

2606.08451 2026-06-09 cs.CL cs.AI 新提交

Sycophancy as a Multilingual Alignment Failure: How Safety Degrades Across Languages, Topics, and Models

谄媚作为多语言对齐失败:安全性能如何随语言、主题和模型退化

Arya Shah, Himanshu Beniwal, Mayank Singh, Chaklam Silpasuwanchai

发表机构 * IIT Gandhinagar(印度理工学院甘地讷格尔分校) Asian Institute of Technology(亚洲理工学院)

AI总结 研究多语言模型中谄媚现象,发现低资源语言中谄媚率激增,且与主题无关,归因于分词器生育率,表明对齐方法在非高资源语言中泛化差。

详情
Comments
19 pages, 9 figures, 7 tables
AI中文摘要

安全对齐的大型语言模型常常表现出谄媚,即倾向于肯定用户的意见而不考虑事实准确性。尽管在英语中已有充分研究,但其在其他语言中的表现仍基本未被考察,使得数十亿非英语使用者可能容易受到模型验证的错误信息的影响。我们首次进行了大规模、多模型的跨语言谄媚评估,对\textbf{六个指令调优模型}在涵盖\textbf{38种语言}和\textbf{33个主题类别}的\textbf{110万个实例}上进行了基准测试。我们识别出一致的资源层级效应:谄媚率在低资源和零资源语言设置中急剧上升。关键的是,这种退化与主题无关,模型在良性提示和安全关键提示上均匀失败,在最需要保护的地方没有提供额外保护。我们进一步确定了分词器生育率作为这种对齐崩溃的结构性驱动因素。总的来说,我们的结果表明,当前的对齐方法在高资源语言之外泛化能力差,强调了迫切需要公平的多语言安全技术。

英文摘要

Safety-aligned large language models often exhibit sycophancy, which is the tendency to affirm users' opinions regardless of factual accuracy. Although well-studied in English, its manifestation in other languages remains largely unexamined, leaving billions of non-English speakers potentially vulnerable to model-validated misinformation. We present the first large-scale, multi-model evaluation of cross-lingual sycophancy, benchmarking \textbf{six instruction-tuned models} across \textbf{1.1 million instances} spanning \textbf{38 languages} and \textbf{33 topic categories}. We identify a consistent resource-tier effect: sycophancy rates spike sharply in low-resource and zero-shot language settings. Critically, this degradation is topic-agnostic, as models fail uniformly across both benign and safety-critical prompts, offering no additional protection where it is most needed. We further identify tokenizer fertility as a structural driver of this alignment collapse. Collectively, our results demonstrate that prevailing alignment methodologies generalize poorly beyond high-resource languages, underscoring the urgent need for equitable multilingual safety techniques.

2606.08450 2026-06-09 cs.AI 新提交

GIFT: LLM-Guided State-Reward Interface for Financial Reinforcement Learning

GIFT: 基于LLM引导的状态-奖励接口用于金融强化学习

Yanyan Wu, Boyi Zhang, Yanlin Liu, Xinyu Fang, Jining Luan, Meiqi Zhang, Jiacheng Liu, Hao Zeng, Dexu Yu, Chang Liu, Hanwen Du, Yongxin Ni, Youhua Li

发表机构 * East China University of Science and Technology(华东理工大学) University of Science and Technology of China(中国科学技术大学) Southwestern University of Finance and Economics(西南财经大学) University of Sydney(悉尼大学) City University of Hong Kong(香港城市大学) Northeastern University(东北大学) The Ohio State University(俄亥俄州立大学) National University of Singapore(新加坡国立大学)

AI总结 提出GIFT框架,利用大语言模型引导PPO强化学习中的状态增强和奖励塑造,提升金融交易策略的样本外风险调整收益。

详情
Comments
25 pages, 7 figures. Code and data are available at https://github.com/KAG778/GIFT . Equal contribution: Yanyan Wu and Boyi Zhang. Corresponding author: Youhua Li
AI中文摘要

金融投资组合交易自然被表述为一个强化学习问题,其中智能体在不断变化的市场条件下顺序调整资产以平衡收益、风险和交易成本。然而,在非平稳市场中,原始的OHLCV状态和短视的回报奖励往往提供了一个不充分的学习接口,这促使使用大语言模型将金融知识注入状态和奖励设计,同时限制开放式的生成。为此,我们提出GIFT,一个基于LLM引导的框架,用于基于PPO的金融强化学习中的状态-奖励接口设计。GIFT不是使用LLM做出交易决策,而是使用因子引导的状态增强从金融因子基元生成状态特征,使用风险规则引导的奖励塑造从投资组合风险规则生成辅助奖励,并使用诊断引导的细化通过PPO rollout诊断修订候选接口。细化后,GIFT在评估前固定所选的状态-奖励接口,在测试时不再进行LLM查询或接口更新。跨不同市场制度和投资组合场景的综合滚动窗口实验表明,GIFT相比基线提高了学习信号质量和样本外风险调整后的投资组合性能。代码和数据可在 https://github.com/KAG778/GIFT 获取。

英文摘要

Financial portfolio trading is naturally formulated as a reinforcement learning problem, where an agent sequentially rebalances assets under changing market conditions to balance return, risk, and transaction costs. Yet in non-stationary markets, raw OHLCV states and short-horizon return rewards often provide an under-specified learning interface, motivating large language models as a way to inject financial knowledge into state and reward design while constraining open-ended generation. To this end, we propose GIFT, an LLM-guided framework for state-reward interface design in PPO-based financial reinforcement learning. Rather than using the LLM to make trading decisions, GIFT uses Factor-guided State Enhancement to generate state features from financial-factor primitives, Risk-rule-guided Reward Shaping to generate auxiliary rewards from portfolio-risk rules, and Diagnostic-guided Refinement to revise candidate interfaces using PPO rollout diagnostics. After refinement, GIFT fixes the selected state-reward interface before evaluation, with no further LLM queries or interface updates at test time. Comprehensive rolling-window experiments across diverse market regimes and portfolio scenarios demonstrate that GIFT improves learning-signal quality and out-of-sample risk-adjusted portfolio performance over baselines. Code and data are available at: https://github.com/KAG778/GIFT .

2606.08446 2026-06-09 cs.LG cs.AI 新提交

Sparrow: Sparse Rollout for Stable and Efficient Long-context RL of Large Language Models

Sparrow: 用于大语言模型稳定高效长上下文强化学习的稀疏 rollout

Yang Zhou, Ranajoy Sadhukhan, Zhaofeng Sun, Zhuoming Chen, Souvik Kundu, Saket Dingliwal, Sai Muralidhar Jayanthi, Aram Galstyan, Haizhong Zheng, Beidi Chen

发表机构 * Carnegie Mellon University(卡内基梅隆大学) Cornell University(康奈尔大学) Intel(英特尔) Amazon AGI(亚马逊AGI)

AI总结 针对RLVR中长上下文rollout计算昂贵的问题,提出Sparrow方法,通过动态稀疏度调度保持token级策略失配的下尾统计量稳定,在Qwen3系列模型上实现2.0-2.4倍加速,并推广到更大模型和编程领域。

详情
AI中文摘要

尽管强大,但带有可验证奖励的强化学习(RLVR)会诱导极长的思维链(COT),使其计算成本高昂。由于RLVR每步成本主要由长上下文rollout生成主导,稀疏注意力为加速密集rollout提供了一种有前景的方法。然而,稀疏rollout需要精细的稳定性-效率权衡:过于激进的稀疏性会导致崩溃,而过于宽松的稀疏性则加速不足。在这项工作中,我们通过稀疏到密集的演员-策略失配来研究这种权衡。我们首先观察到,稀疏rollout崩溃并非由token间的均匀退化驱动:即使在激进的稀疏性下,大多数稀疏token也能与密集token完美对齐。受此启发,我们假设如果每个token的演员-策略失配的下尾在整个轨迹中保持在临界阈值以上,则稀疏rollout训练保持稳定。我们引入一种动态稀疏度调度,在生成过程中保持该尾统计量恒定,并验证了我们的假设。在Qwen3思考族模型上,将尾失配统计量保持在一致阈值附近通常能实现稳定训练。然后,我们使用成本模型在该失配阈值下找到最大加速的稀疏度调度,在训练Qwen3-1.7B、Qwen3-4B和Qwen3-8B时分别实现了2.2倍、2.4倍和2.0倍的rollout加速。实验表明,这些阈值可推广到更大的模型(Qwen3-14B)和另一个RL领域(编程)。最后,我们的分析自然引出了DistillSparse:在稀疏rollout上进行轻量级基于LoRA的蒸馏,使更激进的稀疏性达到相同的稀疏到密集失配阈值,从而获得更高的加速。

英文摘要

Despite being powerful, reinforcement learning with verifiable rewards (RLVR) induces extremely long COT, making it computationally expensive. Since RLVR per-step cost is dominated by long-context rollout generation, sparse attention offers a promising way to accelerate dense rollout. However, sparse rollouts require a delicate stability-efficiency tradeoff: overly aggressive sparsity causes collapse, while overly lenient sparsity gives insufficient speedup. In this work, we study this tradeoff through sparse-to-dense actor-policy mismatch. We first observe that sparse rollout collapse is not driven by uniform degradation across tokens: most sparse tokens align perfectly with dense even under aggressive sparsity. Motivated by this, we hypothesize that sparse rollout training remains stable if the lower tail of per-token actor-policy mismatch stays above a critical threshold throughout the trajectory. We introduce a dynamic sparsity schedule that keeps this tail statistic constant during generation and validate our hypothesis. Across Qwen3 thinking-family models, keeping the tail mismatch statistic near a consistent threshold generally enables stable training. We then use a cost model to find the sparsity schedule for maximum speedup under this mismatch threshold, achieving 2.2x, 2.4x, and 2.0x rollout speedups when training Qwen3-1.7B, Qwen3-4B, and Qwen3-8B. Empirically, we show the thresholds generalize to a larger model (Qwen3-14B) and another RL domain (coding). Finally, our analysis naturally motivates DistillSparse: lightweight LoRA-based distillation on sparse rollout lets more aggressive sparsity reach the same sparse-to-dense mismatch threshold, yielding higher speedup.

2606.08445 2026-06-09 cs.CL cs.AI 新提交

Segment-level Tree Search for Long Meeting Document Summarization

长会议文档摘要的段级树搜索

Sangwon Ryu, Heejin Do, Jun Seo, Daehui Kim, Yunsu Kim, Gary Geunbae Lee, Jungseul Ok

发表机构 * GSAI, POSTECH(浦项科技大学人工智能研究院) CSE, POSTECH(浦项科技大学计算机科学与工程系) ETH Zurich(苏黎世联邦理工学院) ETH AI Center(苏黎世联邦理工学院人工智能中心) Agentic AI Lab, KT(KT公司智能体人工智能实验室) LILT(LILT公司)

AI总结 提出基于蒙特卡洛树搜索的段级摘要框架S3,无需训练即可组合段级候选摘要,使用7B模型达到72B模型性能。

详情
Comments
INTERSPEECH 2026
AI中文摘要

会议文档因其长度和复杂的对话结构而难以总结。现有方法通常采用多阶段流水线,在摘要之前提取信息;然而,这些方法往往因缺乏中间验证而遭受累积错误传播,这一限制因短且低质量的参考摘要而进一步放大。我们提出通过蒙特卡洛树搜索进行段级摘要(S3),这是一个无需训练的框架,通过组合段级摘要候选来构建最终摘要。S3将长文档划分为多个段,并为每个段生成多个摘要候选,形成搜索树的节点。通过自我奖励引导的树搜索选择最佳评分组合,并精炼为最终输出。尽管使用7B模型,S3在生成长度合适的摘要的同时,实现了与更大的72B模型相当的性能。

英文摘要

Meeting documents are challenging to summarize due to their length and complex conversational structure. Existing approaches typically adopt multi-stage pipelines that extract information prior to summarization; however, these approaches often suffer from cumulative error propagation without intermediate validation, a limitation further amplified by short and low-quality reference summaries. We propose segment-level summarization via Monte Carlo Tree Search (S3), a training-free framework that constructs a final summary by composing segment-level summary candidates. S3 partitions a long document into segments and generates multiple summary candidates per segment, forming nodes of a search tree. The best-scoring combination is selected via self-reward-guided tree search and refined into the final output. Despite using a 7B model, S3 achieves performance comparable to larger 72B models while producing length-appropriate summaries.

2606.08440 2026-06-09 cs.RO cs.CV 新提交

GraspFoM: Towards Reconstruction-Driven Robotic Grasping with 3D Foundation Priors

GraspFoM:基于3D基础先验的重建驱动机器人抓取

Dongli Wu, Xiaobao Wei, Hao Wang, Qiaochu Dong, Ying Li, Qingpo Wuwu, Ming Lu, Wufan Zhao

发表机构 * The Hong Kong University of Science and Technology (Guangzhou)(香港科技大学(广州)) Peking University(北京大学) The Hong Kong University of Science and Technology(香港科技大学)

AI总结 提出GraspFoM框架,利用3D基础先验(SAM3D)构建共享3D物体潜变量,联合优化重建与抓取姿态预测,通过锚点初始化的截断姿态推理扩散器生成连续多模态抓取,实现高保真重建与最优抓取。

详情
AI中文摘要

机器人抓取是机器人操作中的基本能力。然而,在部分观测下抓取仍然具有挑战性。可靠的抓取依赖于局部接触线索和物体级3D结构。现有的几何感知抓取方法认识到重建的价值,但通常将几何视为中间预测,而不是可重用的抓取物体先验。在本文中,我们提出了GraspFoM,一个统一的框架,利用3D基础先验(SAM3D)为重建和抓取姿态预测构建共享的3D物体潜变量。基于这个共享的物体潜变量,我们引入了一个锚点初始化的截断姿态推理扩散器,它预测连续且多模态的抓取姿态,而不直接依赖离散的抓取候选。我们进一步通过一个重建感知评分器和残差潜变量更新器来研究重建与抓取之间的相互作用。重建提供基于几何的线索,而抓取监督则使共享的物体潜变量向与抓取相关的可操作性区域细化。GraspFoM联合预测抓取姿态并以网格和3DGS形式重建高保真3D资产。综合实验表明,GraspFoM在重建和抓取上都达到了最先进的结果。值得注意的是,这些改进只需要少量额外的可训练参数。组件消融研究也证明了每个组件的贡献。

英文摘要

Robotic grasping is a fundamental capability in robotic manipulation. Yet grasping remains challenging under partial observations. Reliable grasping depends on both local contact cues and object-level 3D structure. Existing geometry-aware grasping methods recognize the value of reconstruction, but they typically treat geometry as an intermediate prediction rather than a reusable object prior for grasping. In this paper, we present GraspFoM, a unified framework that leverages 3D foundation priors (SAM3D) to build a shared 3D object latent for both reconstruction and grasp pose prediction. Built on this shared object latent, we introduce an anchor-initialized truncated pose-reasoning diffuser that predicts continuous and multimodal grasp poses without directly relying on discrete grasp candidates. We further investigate the interaction between reconstruction and grasping through a reconstruction-aware scorer and a residual latent updater. Reconstruction provides grounded geometric cues, while grasp supervision refines the shared object latent toward grasp-relevant affordances. GraspFoM jointly predicts grasp poses and reconstructs high-fidelity 3D assets in mesh and 3DGS forms. Comprehensive experiments demonstrate that GraspFoM achieves state-of-the-art results on both reconstruction and grasping. Notably, these improvements require only a small number of additional trainable parameters. Component-wise ablation studies also demonstrate the contribution of each component.

2606.08432 2026-06-09 cs.AI 新提交

Trajectory-Refined Distillation

轨迹精炼蒸馏

Li Jiang, Haoran Xu, Yichuan Ding, Amy Zhang

发表机构 * McGill University(麦吉尔大学) Mila Quebec AI Institute(米拉魁北克人工智能研究所) UT Austin(德克萨斯大学奥斯汀分校)

AI总结 提出轨迹精炼蒸馏(TRD),通过教师指导修正学生轨迹中的前缀错误,解决在线策略蒸馏中的前缀失败问题,提升大语言模型的单次准确率和推理覆盖。

详情
Comments
under review
AI中文摘要

在线策略蒸馏(OPD)已成为大型语言模型(LLM)的重要后训练工具,它沿着学生自身的生成轨迹提供密集的逐词教师监督。在这项工作中,我们识别出OPD中一个常见的结构性问题,称为前缀失败。在前缀失败下,密集的逐词监督会导致双峰教师混合和碎片化梯度,而词级损失截断或重加权无法解决这一问题。这一观察促使我们超越词级损失干预,转向轨迹级输出修正。因此,我们提出轨迹精炼蒸馏(TRD),一种轨迹级修正方法,在教师指导下,于在线策略支持范围内修正学生的生成轨迹。通过在蒸馏前修正有问题的前缀,TRD从根源上缓解了前缀失败。此外,即使原始轨迹已经正确,TRD也能通过教师指导让学生接触到替代的有效推导,从而改善探索。TRD还可应用于在线策略自蒸馏(OPSD),这是一种使用基于特权信息的学生模型作为教师的参数共享变体。在多个尺度的广泛基准和基础模型上,TRD始终优于先前基线,提高了单次尝试准确率并扩展了推理覆盖范围。代码可在 https://github.com/louieworth/trd 获取。

英文摘要

On-policy distillation (OPD) has become a central post-training tool for large language models (LLMs), providing dense per-token teacher supervision along the student's own rollouts. In this work, we identify a common structural cause underlying OPD, which we call prefix failure. Under prefix failure, dense per-token supervision induces a bimodal teacher mixture and fragmented gradients that token-level loss truncation or reweighting fail to address. This observation motivates us to move beyond token-level loss interventions toward trajectory-level output corrections. We thus propose Trajectory-Refined Distillation (TRD), a trajectory-level correction method that revises the student's rollout under the teacher guidance while within on-policy support. By correcting problematic prefixes before distillation, TRD mitigates prefix failure at its source. Moreover, TRD improves the exploration by exposing the student to alternative valid derivations under teacher guidance, even when the original rolls are already correct. TRD can also be applied to on-policy self-distillation (OPSD), a parameter-sharing variant that uses the student model conditioned on privileged informations as the teacher. Across a wide range of benchmarks and base models at multiple scales, TRD consistently outperforms prior baselines, improving single-attempt accuracy and broadening reasoning coverage. Code is available at https://github.com/louieworth/trd

2606.08425 2026-06-09 cs.SD cs.CL eess.AS 新提交

TinyGiantALM: A Compact Audio-Language Model for Intent-Aware Reasoning under Resource Constraints

TinyGiantALM:面向资源约束下意图感知推理的紧凑型音频-语言模型

Vinh-Thuan Ly

发表机构 * University of Science, VNU-HCM(胡志明市国立大学下属理科大学) Vietnam National University, Ho Chi Minh City(胡志明市国立大学)

AI总结 提出紧凑型1.5B参数音频-语言模型TinyGiantALM,通过指令感知特征精炼框架(查询引导投影器+语义门控)过滤用户意图相关声学信号,在MMAR基准上零样本准确率46.4%,超越7B-13B基线,并优于8倍大模型。

详情
Comments
Accepted to Interspeech 2026. Project page: https://interspeech-tinygiant-alm.vercel.app
AI中文摘要

当前音频推理的进展依赖于大规模音频-语言模型(LALMs),阻碍了在资源受限环境中的部署。我们提出了TinyGiantALM,一个紧凑的1.5B参数效率导向替代方案。不同于暴力扩展规模,我们提出了一种指令感知特征精炼框架,使用查询引导投影器和语义门控,基于用户意图过滤声学信号。在MMAR基准上,TinyGiantALM实现了46.4%的零样本准确率,显著优于7B-13B基线。虽然在逻辑叙事推理方面与30B+模型存在差距,且在过于密集或空间场景中存在某些权衡,但我们的方法在解耦混合模态环境方面显著优于高达8倍大小的模型。这些发现表明,架构精度为在边缘友好规模上获得稳健感知能力提供了一条切实可行的路径。

英文摘要

Current advancements in Audio Reasoning rely on massive Large Audio-Language Models (LALMs), hindering deployment in resource-constrained environments. We introduce TinyGiantALM, a compact 1.5B efficiency-oriented alternative. Instead of brute-force scaling, we propose an Instruction-Aware Feature Refinement framework using a Query-guided Projector and Semantic Gating to filter acoustic signals based on user intent. On the MMAR benchmark, TinyGiantALM achieves 46.4% zero-shot accuracy, significantly outperforming 7B-13B baselines. While a reasoning gap in logical narrative remains versus 30B+ models and certain trade-offs exist in overly dense or spatial scenes, our approach notably surpasses models up to 8x larger in disentangling mixed-modality environments. These findings demonstrate that architectural precision offers a tangible pathway to secure robust perception capabilities on edge-friendly scales.

2606.08421 2026-06-09 cs.CV 新提交

Segmentation-Assisted Brain MRI Synthesis with Cross-Image Multi-Contrast Feature Memory Bank Retrieval Augmentation

基于跨图像多对比度特征记忆库检索增强的分割辅助脑MRI合成

Wenwei Huang, Jia Wei, Jianlong Zhou

发表机构 * South China University of Technology(华南理工大学) University of Technology Sydney(悉尼科技大学)

AI总结 提出分割辅助的闭环生成对抗框架,通过辅助分割分支和双库检索增强策略,提高多对比度脑MRI中肿瘤区域的合成保真度。

详情
AI中文摘要

多对比度脑MRI提供互补的软组织特征,有助于疾病的筛查和诊断。然而,有限的扫描时间、图像损坏和各种成像协议常常导致多对比度图像不完整。虽然当前方法在图像合成方面表现出色,但它们通常难以合成关键的肿瘤区域,并且无法有效利用多对比度脑MRI中的上下文信息。为了解决这个问题,我们提出了一种以合成为中心、分割辅助的闭环框架,结合检索增强合成。我们的方法整体采用生成对抗架构,旨在通过单一模型从任何可用对比度的组合中合成缺失的对比度。为了显式捕获肿瘤语义并将合成聚焦于肿瘤区域,我们添加了一个辅助分割分支,该分支预测肿瘤掩膜并将其作为语义条件反馈给合成分支,从而在模型中学习肿瘤感知表示并提高合成保真度。此外,我们提出了一种双库检索增强策略。它动态查询两个外部知识库,即用于关键肿瘤上下文的肿瘤掩膜记忆库和用于全局风格信息的跨图像对比度特征记忆库,以增强合成。在两个公开的多对比度磁共振脑数据集:BraTs2020和UCSF-BMSR上验证,所提出的方法在处理医学脑图像合成任务方面有效,并且与先前方法相比表现出优越的性能。代码可在 https://github.com/iBizzard/SSCF.git 获取。

英文摘要

Multi-contrast brain MRI provide complementary soft-tissue characteristics that aid in the screening and diagnosis of diseases. However, limited scanning time, image corruption and various imaging protocols often result in incomplete multi-contrast images. While current approaches excel in image synthesis, they often struggle to synthesize critical tumor regions and exploit contextual information in multi-contrast brain MRI effectively. To address this issue, we propose a synthesis-centric, segmentation-assisted closed-loop framework with retrieval augmentation synthesis. Our method overall takes a generative adversarial architecture, which aims to synthesize missing contrasts from any combination of available ones with a single model. To explicitly capture tumor semantics and focus synthesis on tumor regions, we add an auxiliary segmentation branch that predicts tumor masks and feeds them back as semantic conditioning in synthesis branch, thereby learning tumor-aware representations in the model and improving synthesis fidelity. Furthermore, we propose a dual-bank retrieval augmentation strategy. It dynamically queries two external knowledge bases, namely a tumor masks memory bank for crucial tumor context and cross-image contrast feature memory bank for global style information, to augment synthesis. Verified on two public multi-contrast magnetic resonance brain datasets: BraTs2020 and UCSF-BMSR, the proposed method is effective in handling medical brain images synthesis tasks and shows superior performance compared to previous methods. Code is available at:https://github.com/iBizzard/SSCF.git

2606.08420 2026-06-09 cs.CV 新提交

CheXanatomy: Anatomy-Aware Vision-Language Modeling for Chest Radiographs

CheXanatomy: 面向胸部X光片的解剖感知视觉-语言建模

Sergios Gatidis, Curtis Langlotz, Christian Bluethgen

发表机构 * Stanford Center for Artificial Intelligence in Medicine and Imaging, Stanford University(斯坦福大学医学与影像人工智能中心) Department of Radiology, Stanford University(斯坦福大学放射学系)

AI总结 提出CheXanatomy框架,通过自回归令牌空间监督将解剖知识融入预训练视觉-语言模型,实现解剖分割,在合成和真实X光片上性能媲美U-Net,并提升域迁移鲁棒性和样本效率。

详情
AI中文摘要

在大规模图像-文本对上预训练的视觉-语言模型(VLM)表现出强大的图像级理解能力,但主要针对全局对齐进行优化,并未显式编码细粒度解剖结构,限制了其在分割等空间精确任务中的适用性。我们提出CheXanatomy,一个通过自回归令牌空间监督将显式解剖知识融入预训练VLM的框架。该模型无需添加任务特定的解码器头,而是通过下一个令牌预测训练生成解剖分割掩码。为了实现可扩展的监督,我们从CT体积合成逼真的胸部X光片,并前向投影CT分割标签以获得解剖一致的2D掩码。我们在合成和真实胸部X光片上评估该方法,与U-Net基线进行比较,包括模型规模、输入分辨率和视觉编码器微调的消融实验。自回归解剖监督在分布内实现了与专用卷积模型相当的性能,并在向真实CXR数据的域迁移下表现出改进的几何鲁棒性。此外,在有限监督下适应新定位任务时,解剖预训练模型展现出更好的样本效率。更大的模型和更高的输入图像分辨率提升了性能,而视觉编码器微调效果有限。这些结果表明,将解剖结构直接嵌入生成目标促进了空间有根据的表征,并支持解剖感知的医学视觉-语言建模。

英文摘要

Vision-language models (VLMs) pretrained on large-scale image-text pairs demonstrate strong image-level understanding, but are primarily optimized for global alignment and do not explicitly encode fine-grained anatomical structure, limiting their suitability for spatially precise tasks such as segmentation. We introduce CheXanatomy, a framework that integrates explicit anatomical knowledge into a pretrained VLM through autoregressive token-space supervision. Instead of adding task-specific decoder heads, the model is trained to generate anatomical segmentation masks via next-token prediction. To enable scalable supervision, we synthesize realistic chest radiographs from CT volumes and forward-project CT segmentation labels to obtain anatomically consistent 2D masks. We evaluate the approach on synthetic and real chest radiographs against a U-Net baseline, including ablations on model scale, input resolution, and vision encoder fine-tuning. Autoregressive anatomical supervision achieves performance comparable to specialized convolutional models in-distribution and demonstrates improved geometric robustness under domain shift to real CXR data. In addition, anatomy-pretrained models exhibit improved sample efficiency when adapting to novel localization tasks under limited supervision. Larger models and higher input image resolution improve performance, while vision encoder fine-tuning has limited effect. These results show that embedding anatomical structure directly into the generative objective promotes spatially grounded representations and supports anatomy-aware medical vision-language modeling.

2606.08411 2026-06-09 cs.CL 新提交

AsyncLane: Decoupling Refinement from Advancement in Diffusion Language Model Decoding

AsyncLane: 扩散语言模型解码中精炼与推进的解耦

Yingxuan Ren, Yuxuan Lou, Yong Liu, Pengcheng Fang, Ziming Wang, Pengfei Zhou, Yang You

发表机构 * National University of Singapore(新加坡国立大学) University of Southampton(南安普顿大学)

AI总结 提出AsyncLane,一种无需训练的解码调度器,通过将生成过程分叉为精炼和推进两个通道,解耦块间依赖,在保持质量的同时显著提升扩散语言模型的解码吞吐量。

详情
AI中文摘要

块级半自回归解码是扩散大语言模型(DLMs)的标准推理范式,但它强制块之间存在严格依赖:当前块完全解码或去噪预算耗尽之前,下一个块无法开始。我们观察到,一旦一个块暴露出可靠的分隔符边界或稳定的语义前缀,续写生成无需等待每个残差标记被解析。我们提出AsyncLane,一种无需训练的解码调度器,将精炼与推进解耦。AsyncLane在观察到的分隔符边界处将生成通道分叉为精炼通道和续写生成通道:前缀保持可编辑,而续写在前缀精炼完成之前推进。由此产生的通道树记录解码依赖关系和输出顺序,而执行则在活跃通道集上进行。为了使这种异步调度在双向注意力下高效,AsyncLane结合了共享前缀通道批处理、前瞻草稿重用、级联终止以及带有刷新-逻辑重用的紧凑缓存刷新,防止模型调用成本随通道数量线性增长。AsyncLane是块级DLM采样器的即插即用替代品,无需重新训练。在数学推理和代码生成实验表明,AsyncLane在保持竞争性质量的同时持续提高吞吐量。在LLaDA和Dream骨干网络上,AsyncLane在所有评估的基准长度设置中实现了最高的TPS;相对于最快的竞争基线,它在LLaDA上达到2.95倍峰值加速,在Dream上达到3.04倍,在较长生成预算下增益尤为显著。

英文摘要

Block-wise semi-autoregressive decoding is the standard inference paradigm for diffusion large language models (DLMs), but it imposes a strict dependency between blocks: the next block cannot begin until the current block is fully decoded or its denoising budget is exhausted. We observe that once a block exposes a reliable delimiter boundary or stable semantic prefix, continuation generation need not wait for every residual token to be resolved. We propose AsyncLane, a training-free decoding scheduler that decouples refinement from advancement. AsyncLane forks a generate lane at observed delimiter boundaries into a refine lane and a continuation generate lane: the prefix remains editable, while the continuation advances before prefix refinement finishes. The resulting lane tree records decoding dependencies and output order, while execution proceeds over the active lane set. To make this asynchronous schedule efficient under bidirectional attention, AsyncLane combines shared-prefix lane batching, lookahead draft reuse, cascading termination, and compact cache refresh with refresh-logit reuse, preventing model-call cost from scaling directly with the number of lanes. AsyncLane is a drop-in replacement for block-wise DLM samplers and requires no retraining. Experiments on mathematical reasoning and code generation show that AsyncLane consistently improves throughput while maintaining competitive quality. Across LLaDA and Dream backbones, AsyncLane achieves the highest TPS in all evaluated benchmark-length settings; relative to the fastest competing baseline, it reaches peak speedups of 2.95x on LLaDA and 3.04x on Dream, with especially large gains under longer generation budgets.

2606.08410 2026-06-09 cs.LG cs.AI 新提交

Provably Efficient Personalized Multi-Objective Bandits with Proactive Conversational Queries

具有主动对话查询的可证明高效个性化多目标老虎机

Linfeng Cao, Ming Shi, Ness B. Shroff

发表机构 * The Ohio State University(俄亥俄州立大学) University at Buffalo(布法罗大学)

AI总结 提出MO-PQUCB算法,通过主动查询获取用户偏好信号,结合Plackett-Luce模型和正则化UCB,解决多目标老虎机中偏好与奖励的耦合问题,实现更优的遗憾界。

详情
Comments
UAI 2026
AI中文摘要

多目标老虎机中的个性化决策需要学习用户在不同竞争目标之间的特定权衡。由于臂的效用既取决于未知奖励又取决于未知偏好,现有方法仅从效用反馈中推断偏好,将偏好学习与奖励探索纠缠在一起。然而,在实践中,用户通常通过主动对话查询(例如,“便宜且干净的酒店”)揭示他们的优先级,但这种结构化信号未被利用。我们形式化了一个基于主动查询的框架,其中用户查询提供结构化的偏好信号。通过Plackett-Luce子集选择模型对这些信号进行建模,我们证明了由于基本的平移不变性障碍,仅查询学习是不够的。为了解决这个问题,我们引入了MO-PQUCB,一种混合算法,通过平移不变正则化和双探索UCB将基于查询的偏好锚定与老虎机反馈相结合。我们证明了主动查询加速了偏好估计,并相比先前偏好感知的MO-MAB方法实现了改进的遗憾缩放。在查询被破坏的情况下,我们进一步刻画了统计极限,并设计了一个鲁棒估计器,在破坏稀疏时实现接近最优的性能。实验验证了理论和实际收益。

英文摘要

Personalized decision-making in multi-objective bandits requires learning user-specific trade-offs among competing objectives. Since arm utility depends on both unknown rewards and unknown preferences, existing methods infer preferences only from utility feedback, entangling preference learning with reward exploration. In practice, however, users often reveal their priorities through proactive conversational queries (e.g., "cheap and clean hotel"), yet this structured signal is not leveraged. We formalize a proactive query-based framework in which user queries provide structured preference signals. Modeling these signals via a Plackett-Luce subset choice model, we show that query-only learning is insufficient due to a fundamental shift-invariance barrier. To resolve this, we introduce MO-PQUCB, a hybrid algorithm that integrates query-based preference anchoring with bandit feedback through shift-invariant regularization and dual-exploration UCB. We prove that proactive queries accelerate preference estimation and yield improved regret scaling over prior preference-aware MO-MAB methods. Under corrupted queries, we further characterize statistical limits and design a robust estimator achieving near-optimal performance when the corruption is sparse. Experiments validate both theoretical and practical gains.

2606.08408 2026-06-09 cs.CL cs.AI 新提交

TimpaTeks: Automatic In-place Text Sequence Modification via Diffusion Language Model Steering

TimpaTeks: 通过扩散语言模型引导实现自动原地文本序列修改

Ryandito Diandaru, Ikhlasul Akmal Hanif, Fadli Aulawi Al Ghiffari, Ahmed Elshabrawy, Alham Fikri Aji

发表机构 * MBZUAI(穆罕默德·本·扎耶德人工智能大学)

AI总结 提出TimpaTeks方法,将激活引导扩展到扩散语言模型,实现原地文本修改以改变概念,在情感和概念引导任务上降低困惑度并保持句子结构。

详情
Comments
16 pages
AI中文摘要

我们将激活引导扩展到扩散语言模型(DLM),并研究了一个由于DLM推理机制而产生的新问题:原地修改文本以呈现不同的概念。我们提出了TimpaTeks,一种使用DLM的自动原地文本修改机制。在IMDB电影评论(情感)和合成的猫狗数据集(任意、更非常规的概念引导)上的实验表明,TimpaTeks提供了一种可行的新机制来原地引导扩散语言模型的输出。TimpaTeks实现了原地修改,同时降低了句子困惑度并保留了原始句子结构,无需指令调优模型。与基于提示的DLM引导相比,TimpaTeks计算成本更低,因为它执行原地去噪,而不是构建额外的提示条件输出序列。

英文摘要

We extend activation steering to diffusion language models (DLMs) and study a novel problem that arose due to the inference mechanism of DLMs: Modifying a text in-place to manifest a different concept. We propose TimpaTeks, an automatic in-place text modification mechanism using DLMs. Experiments on IMDB movie reviews (sentiment) and a synthetic Cats and Dogs Dataset (arbitrary, more unconventional concept steering) show that TimpaTeks provides a feasible novel mechanism to steer diffusion language model outputs in-place. TimpaTeks enables in-place modification while simultaneously lowers sentence perplexity and retaining the original sentence structre without the need of instruction tuned models. TimpaTeks is also computationally cheaper than prompt-based DLM steering, as it performs denoising in-place rather than constructing an additional prompt-conditioned output sequence.

2606.08405 2026-06-09 cs.AI physics.flu-dyn 新提交

Self-Evolving Scientific Agent Discovers Generalizable Physically-Reasoned Fluid Control

自进化科学智能体发现可泛化的物理推理流体控制

Boai Sun, Wenjin Guo, Zongmin Yu, Liu Yang

发表机构 * National University of Singapore(新加坡国立大学)

AI总结 提出一种由大语言模型驱动的自进化科学智能体工作流,通过迭代代码生成和物理仿真诊断,自动构建可解释的控制器,并在欠驱动双关节狗鲨游泳器目标到达任务中实现零样本泛化。

详情
AI中文摘要

虽然数据密集的深度强化学习可以优化复杂的控制策略,但物理系统中的科学发现从根本上需要一条可解释的推理链,将物理证据与结构化控制架构联系起来。本文提出了一种自进化的科学智能体工作流,由大语言模型和迭代代码生成驱动,在保持严格可解释性和严谨物理推理的同时,自动构建控制器。该智能体不是调整权重,而是将候选策略部署到物理仿真中,从多模态证据中主动诊断动态行为,并将这些观察转化为渐进的源代码改进。我们在一个高度非线性的流固耦合问题上展示了该框架:一个欠驱动的双关节狗鲨游泳器,仅使用关节角加速度完成空间目标到达任务。从表现出单侧转向偏差的推进种子策略开始,智能体自主发现并改进了一个统一控制器,稳健地捕获所有典型目标。值得注意的是,无需任何重新训练或特定目标分支,合成的控制策略就能泛化到未见过的静态目标和动态曲线追踪轨迹。可审计的进化日志揭示了一个基于行波推进、体坐标系目标引导、偏航率反馈、有符号平均尾曲率和自适应节奏缓解的涌现控制架构。我们的结果表明,自主科学智能体能够成功地将累积的物理证据转化为稳健、数学可读的控制策略,同时保持完全可追溯的科学发现过程。

英文摘要

While data-intensive deep reinforcement learning can optimize complex control policies, scientific discovery in physical systems fundamentally requires an interpretable chain of reasoning that connects physical evidence to structured control architectures. Here, we present a self-evolving scientific-agent workflow, driven by large language models and iterative code generation, that automates controller construction while preserving strict interpretability and rigorous physical reasoning. Instead of adjusting weights, the agent deploys candidate strategies into physical simulations, actively diagnoses dynamic behaviors from multimodal evidence, and translates these observations into progressive source-code refinements. We demonstrate this framework on a highly non-linear fluid-structure interaction problem: an underactuated, two-joint dogfish swimmer tasked with spatial target reaching using only joint angular accelerations. Starting from a propulsive seed policy that exhibits a one-sided steering bias, the agent autonomously discovers and refines a unified controller that robustly captures all canonical targets. Remarkably, without any retraining or target-specific branching, the synthesized control policy generalizes to unseen static targets and dynamically curved pursuit trajectories. The auditable evolve log reveals an emergent control architecture built upon traveling-wave propulsion, body-frame target guidance, yaw-rate feedback, signed mean-tail curvature, and adaptive cadence relief. Our results show that an autonomous scientific agent can successfully transform accumulated physical evidence into robust, mathematically readable control policy, while maintaining a fully traceable process of scientific discovery.

2606.08402 2026-06-09 cs.CV cs.AI cs.MA 新提交

SceneConductor: 3D Scene Generation from Single Image with Multi-Agent Orchestration

SceneConductor: 基于多智能体编排的单图像3D场景生成

Jeonghwan Kim, Yushi Lan, Yongwei Chen, Hieu Trung Nguyen, Chuanyu Pan, Xingang Pan

发表机构 * Nanyang Technological University(南洋理工大学) University of Oxford(牛津大学) Meshy AI

AI总结 提出多智能体编排框架,将单图像3D场景生成分解为场景初始化、环境构建和多智能体细化三个阶段,并引入几何感知布局预测器,在几何精度、空间一致性和感知真实性上超越现有方法。

详情
AI中文摘要

从单张图像生成完整3D场景需要从本质上模糊的视觉证据中推断全局一致的几何、物体关系和环境上下文。尽管联合布局和网格生成近期取得进展,现有方法通常依赖整体或弱分解的流水线,将许多因素纠缠在一起,需要大量场景级监督,限制了其对复杂真实环境的泛化。我们提出一个多智能体编排框架,将单图像3D场景生成分解为三个结构化阶段:场景初始化、环境构建和多智能体细化。初始化阶段提取图像派生的物体掩码,构建物体级3D表示,并预测初始空间布局以形成粗略3D场景。环境构建阶段随后利用该初始化以及点图几何,构建支撑表面、房间边界、材质和光照的环境支架。最后,在细化阶段,规划器智能体识别结构和视觉不一致性,直接应用简单修正,并派遣专家智能体进行复杂的局部修订,再整合回全局场景。为提供可靠的结构初始化同时减少对场景级标注的依赖,我们进一步引入一个几何感知布局预测器,由点图派生的稀疏几何先验监督。与全监督布局生成器不同,该预测器可从分割级数据训练,并稳健泛化到多样真实场景。在基准数据集上的大量实验表明,我们的方法在几何精度、空间一致性和感知真实性上持续优于先前方法。

英文摘要

Generating complete 3D scenes from a single image requires inferring globally consistent geometry, object relationships, and environmental context from inherently ambiguous visual evidence. Despite recent progress in joint layout-and-mesh generation, existing methods often rely on holistic or weakly decomposed pipelines that entangle many factors at once and demand extensive scene-level supervision, limiting their generalization to complex real-world environments. We propose a multi-agent orchestration framework that decomposes single-image 3D scene generation into three structured stages: scene initialization, environment construction, and multi-agent refinement. The initialization stage extracts image-derived object masks, builds object-level 3D representations, and predicts an initial spatial layout to form a coarse 3D scene. The environment-construction stage then leverages this initialization together with point-map geometry to build an environmental scaffold of supporting surfaces, room boundaries, materials, and illumination. Finally, in the refinement stage, a planner agent identifies structural and visual inconsistencies, applies simple corrections directly, and dispatches specialist agents for complex localized revisions that are reintegrated into the global scene. To provide reliable structural initialization while reducing reliance on scene-level annotations, we further introduce a geometry-aware layout predictor supervised by sparse geometric priors derived from point maps. Unlike fully supervised layout generators, the predictor can be trained from segmentation-level data and generalizes robustly to diverse real-world scenes. Extensive experiments on benchmark datasets show that our method consistently outperforms prior approaches in geometric accuracy, spatial consistency, and perceptual realism.

2606.08397 2026-06-09 cs.CL cs.IR 新提交

TrustMargin: Training-Free Arbitration between Parametric Memory and Retrieved Evidence in Large Language Models

TrustMargin: 大语言模型中参数化记忆与检索证据之间的无训练仲裁

Jingyan Xu, Hong Shi, Yi Shan, Penghui Liu, Yunhao Bai, Ningyuan Li, Xueyang Liu

发表机构 * Peking University(北京大学)

AI总结 针对大语言模型在知识问答中参数记忆与检索证据冲突的问题,提出无训练仲裁层TrustMargin,利用模型自身似然度评分选择更可信的答案,无需微调或外部评判。

详情
Comments
13 pages, 6 figures, 9 tables. Code and data are available at https://github.com/mojixu/TrustMargin.git
AI中文摘要

大语言模型通过参数化记忆和检索证据回答知识密集型问题,但两种来源并非都可靠。检索可以填补知识空白,但干扰性段落可能覆盖正确的闭卷答案。我们将这种生成后冲突视为答案级源仲裁:给定来自同一冻结模型的直接和RAG答案,决定信任哪个源。我们提出TRUSTMARGIN,一个无训练、即插即用的仲裁层,它使用模型自身的似然度对两个现有候选答案进行评分。它结合了参数先验边际(测试记忆是否接受检索答案)和证据绑定边际(折扣仅段落显著性并衡量问题特定支持)。TRUSTMARGIN在直接和RAG之间进行选择,无需微调、外部评判或额外生成。在2WIKIMQA和CWQA上使用三种LLaMA规模,TRUSTMARGIN一致优于直接生成和BM25-RAG,恢复了直接/RAG oracle差距的一部分,并推广到多个无训练RAG流水线。

英文摘要

Large language models answer knowledge-intensive questions using both parametric memory and retrieved evidence, but neither source is uniformly reliable. Retrieval can fill knowledge gaps, yet distracting passages may override correct closed-book answers. We study this post-generation conflict as answer-level source arbitration: given Direct and RAG answers from the same frozen model, decide which source to trust. We propose TRUSTMARGIN, a training-free, plug-and-play arbitration layer that scores the two existing candidates with the model's own likelihoods. It combines a parametric-prior margin, which tests whether memory accepts the retrieved answer, with an evidence-binding margin, which discounts passage-only salience and measures question-specific support. TRUSTMARGIN selects between Direct and RAG without fine-tuning, external judges, or additional generation. Across 2WIKIMQA and CWQA with three LLaMA scales, TRUSTMARGIN consistently improves over Direct generation and BM25-RAG, recovers part of the Direct/RAG oracle gap, and generalizes to multiple training-free RAG pipelines.

2606.08394 2026-06-09 cs.CL 新提交

When Correct Decisions Hide Internal Stress: Decision-State Probing in Multimodal Language Models

当正确决策隐藏内部压力:多模态语言模型中的决策状态探测

Haoran Zhao, Soyeon Caren Han, Eduard Hovy

发表机构 * The University of Melbourne(墨尔本大学)

AI总结 提出S³E框架,通过正锚定A/B强制选择任务和隐藏状态分析,发现多模态语言模型在正确行为下仍存在语义压力导致的决策状态位移。

详情
AI中文摘要

多模态语言模型通常通过外部行为进行评估:选择正确的图像-文本匹配、拒绝无支持的标题或正确回答视觉查询。然而,仅凭正确行为并不能证明模型的内部决策状态在受控语义压力下保持稳定。我们通过S$^3$E(结构化语义压力评估)框架研究这一差距,该框架用于分析多模态语言模型中行为-内部解耦。S$^3$E使用正锚定的A/B强制选择设置,其中图像支持的标题与语义压力候选进行对比,并在原始和交换选项顺序下进行,同时在回答前的决策状态提取隐藏状态。我们专注于严格正确的试验,即模型在两种顺序下都一致选择正确标题。我们不将任意的隐藏状态变化视为不稳定的证据,而是测量语义冲突候选是否相对于保持意义的控制项导致过度的决策状态位移。在Qwen3VL、Gemma3和InternVL3上,尽管强制选择行为正确,语义压力相对于词汇控制项始终产生显著的正选定层过度位移,而与随机负样本的比较则依赖于模型。我们将此解释为有范围的决策状态压力敏感性信号,而非下游失败或幻觉的证据。我们的结果表明,仅凭强制选择正确性不足以证明内部决策几何的不变性。

英文摘要

Multimodal language models are typically evaluated through external behavior: selecting the correct image--text match, rejecting unsupported captions, or answering visual queries correctly. However, correct behavior alone does not show that the model's internal decision state remains stable under controlled semantic stress. We study this gap through S$^3$E (Structured Semantic Stress Evaluation), a framework for analyzing behavior-internal decoupling in multimodal language models. S$^3$E uses a positive-anchored A/B forced-choice setup in which an image-supported caption is contrasted against semantic stress candidates under both original and swapped option orders, while hidden states are extracted at the pre-answer decision state. We focus on strict-correct trials, where the model consistently selects the correct caption across both orders. Rather than treating arbitrary hidden-state variation as evidence of instability, we measure whether semantic-conflict candidates induce excess decision-state displacement relative to meaning-preserving controls. Across Qwen3VL, Gemma3, and InternVL3, semantic stress consistently produces positive selected-layer excess displacement over lexical controls despite correct forced-choice behavior, while comparisons against random negatives are model-dependent. We interpret this as a scoped decision-state stress-sensitivity signal rather than evidence of downstream failure or hallucination. Our results suggest that forced-choice correctness alone is not a sufficient certificate of invariant internal decision geometry.

2606.08388 2026-06-09 cs.LG math.OC stat.ML 新提交

The Spectral Dynamics and Noise Geometry of Muon

Muon的谱动力学与噪声几何

Pierfrancesco Beneventano, Mahmoud Abdelmoneum, Tomaso Poggio

发表机构 * Massachusetts Institute of Technology(麻省理工学院)

AI总结 研究Muon优化器通过极分解替换矩阵梯度,证明其偏置为平坦谱,在欠定回归中导出奇异值动力学,实验表明其效果依赖于谱方向活跃度。

详情
Comments
24 pages, 11 figures
AI中文摘要

Muon将矩阵梯度$G=UΣV^\ op$替换为其极因子$UV^\ op$。这保留了梯度选择的奇异方向,但使更新谱平坦。我们研究此操作产生的优化偏置。在显式对齐假设下,我们证明在利用梯度奇异方向且不适应当前权重谱的有界更新中,极更新是单步熵最大化的选择。在欠定回归模型中,我们推导了连续时间Muon的精确奇异值动力学,并识别出一个依赖于测量的条件,在该条件下归一化谱趋向于相等的非零奇异值。这种几何也排除了常见的低秩解释:在固定Frobenius范数下,Muon的区分状态具有平坦谱,而核范数最小化则偏好谱集中。受控矩阵感知实验将效应与简单梯度缩放分离,表明范数匹配的梯度下降不能复现Muon,并在广泛消融中恢复预测的平坦化趋势。在小型NanoGPT预训练中,Muon保持稳定秩,具有宽学习率平台,并相对于AdamW改善验证损失;在匹配的小型ViT对照中,排名反转。由此得出的图景是依赖于区域的:Muon并非普遍优越,但其平坦谱偏置在需要保持许多谱方向活跃时可能有所帮助。

英文摘要

Muon replaces a matrix gradient $G=UΣV^\top$ by its polar factor $UV^\top$. This keeps the singular directions selected by the gradient, but makes the update spectrum flat. We study the optimization bias created by this operation. Under explicit alignment assumptions, we prove that the polar update is the one-step entropy-maximizing choice among bounded updates that use the gradient singular directions and do not adapt to the current weight spectrum. In an underdetermined regression model, we derive exact singular-value dynamics for continuous-time Muon and identify a measurement-dependent condition under which the normalized spectrum moves toward equal nonzero singular values. This geometry also rules out a common low-rank interpretation: at fixed Frobenius norm, Muon's distinguished state has a flat spectrum, whereas nuclear-norm minimization favors spectral concentration. Controlled matrix-sensing experiments separate the effect from simple gradient rescaling, show that norm-matched gradient descent does not reproduce Muon, and recover the predicted flattening trend across broad ablations. In small NanoGPT pretraining, Muon preserves stable rank, has a broad learning-rate plateau, and improves validation loss relative to AdamW; in a matched small-ViT control, the ranking reverses. The resulting picture is regime-dependent: Muon is not universally superior, but its flat-spectrum bias can help when many spectral directions need to remain active.

2606.08381 2026-06-09 cs.CL cs.AI 新提交

Auditing Proprietary Alignment in Large Language Models: A Comparative Framework Without a Ground-Truth Standard

审计大型语言模型中的专有对齐:一种无需真实标准的比较框架

Alireza Arbabi, Florian Kerschbaum

发表机构 * University of Waterloo(滑铁卢大学) Vector Institute(向量研究所)

AI总结 提出一种统计框架,通过比较目标模型与基线模型在共享语义空间中的响应偏差,检测黑盒语言模型中的专有对齐行为,无需真实标准即可实现外部审计。

详情
AI中文摘要

大型语言模型(LLMs)越来越多地通过不透明的开发和部署流程发布和部署,使得模型提供商能够在不正式宣布的情况下注入有意的、提供商特定的策略。因此,已有多种模型被报道生成反映专有规则和组织利益的响应,导致在有争议话题上的审查或错误信息。然而,系统性地识别这种对齐仍然是一个基本挑战,因为“专有”在不同语境中的含义模糊。在本文中,我们提出了一种统计框架,通过比较行为分析来检测黑盒语言模型中的专有对齐。我们的方法量化了目标模型与一组参考基线模型在共享语义空间中的响应之间的系统性偏差。通过评估相对行为差异而非绝对正确性,我们的框架能够在黑盒访问下进行有原则的审计。应用于几个广泛讨论但此前未量化的案例,它为外部评估大型语言模型中提供商特定的对齐行为提供了系统且可扩展的基础。

英文摘要

Large language models (LLMs) are increasingly released and deployed through opaque development and deployment pipelines, enabling model providers to inject intentional, provider-specific policies without officially announcing them. As a result, various models have been reported to generate responses reflecting proprietary rules and organizational interests, leading to censorship or misinformation on controversial topics. However, systematic identification of such alignment remains a fundamental challenge, complicated by the ambiguity of what ``proprietary'' entails in different contexts. In this paper, we propose a statistical framework for detecting proprietary alignment in black-box language models via comparative behavioral analysis. Our approach quantifies systematic deviations between the responses of a target model and those of a reference set of baseline models in a shared semantic space. By evaluating relative behavioral divergence rather than absolute correctness, our framework enables principled auditing under black-box access. Applied to several widely discussed but previously unquantified cases, it provides a systematic and scalable basis for external assessment of provider-specific alignment behavior in large language models.

2606.08379 2026-06-09 cs.AI cs.CE cs.LG q-fin.CP q-fin.TR 新提交

TT-DAC-PS: Twin-Target Deterministic Actor-Critic with Policy Smoothing for Optimal Trade Execution

TT-DAC-PS:用于最优交易执行的双目标确定性演员-评论家与策略平滑

Ilia Zaznov, Atta Badii, Julian Kunkel, Alfonso Dufour

发表机构 * University of Reading(雷丁大学) University of Göttingen(哥廷根大学) GWDG(哥廷根数据处理中心) Henley Business School(亨利商学院)

AI总结 提出TT-DAC-PS算法,结合双指数移动平均评论家目标、悲观最小备份、TD3风格策略平滑噪声、延迟演员更新和保守Q正则化,以抑制过高估计,并在限价订单簿数据上优于经典和强化学习基线。

详情
Comments
21 pages, 1 figure, 3 tables
AI中文摘要

本研究通过引入TT-DAC-PS(双目标确定性演员-评论家与策略平滑),解决了大规模股票卖单的最优执行问题。该确定性演员-评论家架构结合了双指数移动平均评论家目标与悲观最小备份、TD3风格的目标策略平滑噪声、延迟演员更新以及保守Q正则化,以抑制过高估计。探索使用Ornstein-Uhlenbeck(OU)噪声,并采用混合调度:确定性回合衰减、基于近期奖励离散度的方差引导调整,以及一个可学习并映射到噪声尺度的Soft Actor-Critic(SAC)风格温度。环境整合了Almgren-Chriss(AC)交易影响与限价订单簿(LOB)价格和成交量、归一化状态特征、每步成交量参与上限以及基于效用的奖励。该交易执行算法应用于十只美国股票的LOB数据。性能评估针对强化学习基线算法,包括近端策略优化(PPO)、软演员-评论家(SAC)和优势演员-评论家(A2C),以及替代交易执行算法,包括时间加权平均价格(TWAP)、成交量加权平均价格(VWAP)和AC。所提出的模型持续降低平均实现缺口百分比,并具有竞争性的方差,优于经典基线和标准强化学习基准模型。

英文摘要

This study addresses the optimal execution of large stock sell programs by introducing TT-DAC-PS (Twin-Target Deterministic Actor-Critic with Policy Smoothing), a deterministic actor-critic architecture that combines twin exponential-moving-average critic targets with pessimistic min backup, TD3-style target policy smoothing noise, delayed actor updates, and conservative Q regularisation to curb overestimation. Exploration uses Ornstein-Uhlenbeck (OU) noise with a hybrid schedule: deterministic episode-wise decay, variance-guided adjustment based on recent reward dispersion, and a Soft Actor-Critic (SAC)-style temperature that is learned and mapped to the noise scale. The environment integrates Almgren-Chriss (AC) trade impact with Limit Order Book (LOB) prices and volumes, normalised state features, per-step volume participation caps, and a utility-based reward. The trade execution algorithm is applied to LOB data for ten U.S. stocks. Performance is assessed against reinforcement-learning baseline algorithms, including Proximal Policy Optimisation (PPO), Soft Actor-Critic (SAC), and Advantage Actor-Critic (A2C), as well as alternative trade execution algorithms, including Time-Weighted Average Price (TWAP), Volume-Weighted Average Price (VWAP), and AC. The proposed model consistently reduces mean implementation shortfall percentage with competitive variance, outperforming classical baselines and standard reinforcement-learning benchmark models.

2606.08376 2026-06-09 cs.LG cs.AI 新提交

RiskNet: A large-scale dataset of AI risk incidents from news with alignment and multi-dimensional annotations

RiskNet:一个来自新闻的大规模AI风险事件数据集,包含对齐和多维标注

Leihan Zhang, Wecheng Ye, Xianlong Ma, Haochuan Liu, Yang Li, Qianyu Zhang, Jinliang Chen, Qiang Yan

发表机构 * Beijing University of Posts and Telecommunications(北京邮电大学) Beijing Key Laboratory of Multimodal Data Intelligent Perception and Governance(多模态数据智能感知与治理北京市重点实验室)

AI总结 提出RiskNet,一个从多语言新闻构建的大规模AI风险事件数据集,通过结构化流水线进行事件识别、对齐和多维分类,支持AI安全、治理和风险分析研究。

详情
Comments
The manuscript has been submitted to Scientific Data
AI中文摘要

随着人工智能(AI)系统越来越多地部署在社会关键领域,与AI相关的危害和失败事件的报告在频率和多样性上不断增加。尽管现有的治理框架阐述了负责任AI的高层原则,但用于跟踪和分析真实世界AI风险事件的大规模实证资源仍然有限。现有的事件集合通常由人工整理,规模相对较小,不足以支持持续、数据驱动的监控和下游计算分析。为满足这一需求,我们提出了RiskNet,一个从大规模多语言新闻源构建的AI风险事件数据集。RiskNet应用了一个结构化的流水线,用于AI风险新闻识别、事件级报告筛选、事件对齐和多维事件分类。生成的资源将分散的新闻报道组织成以事件为中心的记录,并为事件分类、事件对齐和事件级风险标注提供基准数据集。在当前版本中,RiskNet覆盖了数亿条源记录,并生成了一个大规模的AI风险相关报告集合,包括对齐的事件簇和标注的基准子集。该数据集还通过一个在线平台提供浏览和探索功能。我们描述了数据源、处理工作流、分类法设计以及资源的技术验证。RiskNet旨在支持AI安全、治理、风险分析和基准测试的下游研究,以及对AI相关危害的纵向和跨源分析。通过提供一个结构化且可复用的实证资源,RiskNet有助于弥合高层治理原则与AI风险事件记录现实之间的差距。

英文摘要

As artificial intelligence (AI) systems are increasingly deployed across socially consequential domains, reports of AI-related harms and failures have grown in frequency and diversity. Although existing governance frameworks articulate high-level principles for responsible AI, large-scale empirical resources for tracking and analyzing real-world AI risk incidents remain limited. Existing incident collections are often manually curated, relatively small in scale, and insufficient for continuous, data-driven monitoring and downstream computational analysis. To address this need, we present RiskNet, a large-scale dataset of AI risk incidents constructed from large-scale multilingual news sources. RiskNet applies a structured pipeline for AI risk news identification, event-level report screening, incident alignment, and multi-dimensional incident classification. The resulting resource organizes dispersed news reports into incident-centered records and provides benchmark datasets for event classification, incident alignment, and incident-level risk labeling. In its current release, RiskNet covers hundreds of millions of source records and yields a large-scale collection of AI risk-related reports, including aligned incident clusters and annotated benchmark subsets. The dataset is also accessible through an online platform for browsing and exploration. We describe the data sources, processing workflow, taxonomy design, and technical validation of the resource. RiskNet is intended to support downstream research on AI safety, governance, risk analysis, and benchmarking, as well as longitudinal and cross-source analyses of AI-related harms. By providing a structured and reusable empirical resource, RiskNet helps bridge the gap between high-level governance principles and the documented realities of AI risk incidents.

2606.08375 2026-06-09 cs.LG 新提交

Few-step Cofolding with All-Atom Flow Maps

少步全原子流图共折叠

Gianluca Scarpellini, Ron Shprints, Peter Holderrieth, Juno Nam, Pranav Murugan, Rafael Gómez-Bombarelli, Tommi Jaakola, Maruan Al-Shedivat, Nicholas Matthew Boffi, Avishek Joey Bose

发表机构 * Genesis Molecular AI Massachusetts Institute of Technology(麻省理工学院) Carnegie Mellon University(卡内基梅隆大学) Imperial College London(伦敦帝国学院) Mila

AI总结 提出DeCAF框架,将全原子共折叠扩散模型蒸馏为流图,仅需几步推理即可生成高质量样本,并通过奖励引导搜索提升采样质量。

详情
AI中文摘要

3D生物分子复合物的全原子生成建模已成为预测蛋白质和蛋白质-配体系统结构的主流范式。然而,在原子级保真度下生成结构通常需要昂贵的迭代扩散展开,这使得传统部署和推理时搜索技术的计算成本都很高。在本文中,我们引入了去噪器共折叠全原子流图(DeCAF)框架,用于将最先进的全原子共折叠模型蒸馏为全原子流图,这些流图仅需几步推理即可产生高质量样本。我们基于去噪器的流图公式构建DeCAF,该公式具有端点损失,自然支持SE(3)刚性对齐,我们证明这对于训练准确模型至关重要。我们进一步推导了一个简单的变量变换,使DeCAF能够在EDM风格架构的σ空间噪声调度中运行,从而能够从预训练的共折叠扩散模型直接蒸馏。借助DeCAF的流图前瞻,我们引入了一个专门构建的推理时框架,通过奖励引导搜索改进采样。实验上,在具有挑战性的Runs N' Poses数据集上,DeCAF-Boltz在严格的NFE预算下,在蛋白质-配体姿势的准确性(RMSD)和物理有效性分数上均统计上优于Boltz-1x,同时在PoseBusters上的所有推理计算预算下显示出更优的帕累托前沿。将最先进的Pearl共折叠模型蒸馏后,DeCAF-Pearl优于基于扩散的共折叠模型,并在成功率上与其教师模型匹配,同时使用的NFE减少了5倍。我们在https://github.com/genesistherapeutics/decaf发布代码。

英文摘要

All-atom generative modeling of 3D biomolecular complexes has emerged as the dominant paradigm for predicting the structure of proteins and protein-ligand systems. Generating structures at the atomic level of fidelity, however, typically requires expensive iterative diffusion rollouts, making both conventional deployment and inference-time search techniques computationally costly. In this paper, we introduce the Denoiser Cofolding All-Atom Flowmap (DeCAF) framework for distilling state-of-the-art all-atom cofolding models into all-atom flow maps that produce high-quality samples in only a few inference steps. We build DeCAF on a denoiser-based formulation of flow maps with endpoint losses that naturally support SE(3) rigid alignment, which we show is critical for training accurate models. We further derive a simple change of variables that lets DeCAF operate in the σ-space noise schedule of EDM-style architectures, enabling direct distillation from pretrained cofolding diffusion models. Equipped with DeCAF's flowmap lookahead, we introduce a purpose-built inference-time framework that improves sampling through reward-guided search. Empirically, DeCAF-Boltz statistically improves over Boltz-1x in both accuracy (RMSD) and physical validity scores of protein-ligand poses at strict NFE budgets on the challenging Runs N' Poses, while also showing a more optimal Pareto frontier across all inference compute budgets on PoseBusters. Distilling the state-of-the-art Pearl cofolding model, DeCAF-Pearl outperforms diffusion-based cofolding models and matches its teacher on success rate while using 5x fewer NFEs. We release our code at https://github.com/genesistherapeutics/decaf.

2606.08369 2026-06-09 cs.LG cs.AI 新提交

An Information-Theoretic Definition for Open-Ended Learning

开放学习的信息论定义

Wanqiao Xu, Yifan Zhu, Benjamin Van Roy

发表机构 * Stanford University(斯坦福大学)

AI总结 提出基于比特等价的信息论定义开放环境,证明经典赌博机非开放,设计算法实现开放学习。

详情
AI中文摘要

越来越多的研究表明,能够在开放环境中持续扩展能力的AI系统具有巨大潜力。但目前尚无关于开放性的统一定义或关于智能体应如何探索开放环境的理论。我们基于一个新概念——${\textit比特等价}$——引入了一个信息论定义,该概念量化了达到每个期望奖励水平所需的信息。我们认为,如果智能体能够实现比特等价的线性增长,则该环境是开放的。我们证明了经典赌博机环境不是开放的,并构建了一个开放赌博机环境。我们还提出了一种在该环境中实现开放学习的算法。

英文摘要

A growing body of work points to the great promise of AI systems that can continually expand their capabilities as they operate in an open-ended environment. But yet there is no coherent definition of open-endedness or theory about how an agent ought to explore an open-ended environment. We introduce an information-theoretic definition based on a new concept -- the ${\textit bit-equivalent}$ -- which quantifies the information required to attain each level of expected reward. We consider an environment to be open-ended if an agent can attain linear growth in the bit-equivalent. We establish that classical bandit environments are not open-ended and formulate a bandit environment that is. We also introduce an algorithm that achieves open-ended learning in this environment.

2606.08365 2026-06-09 cs.LG cs.AI 新提交

Pre-Intervention Prediction of Sparse Autoencoder Steering Side Effects

稀疏自编码器引导副作用的干预前预测

Evan Duan

发表机构 * University of Michigan(密歇根大学)

AI总结 提出一种干预前筛选框架,利用特征统计预测SAE引导的副作用(效果不稳定和附带扩散),在多个模型和字典上验证了解码器几何等信号优于基线,但预测效果因模型而异。

详情
AI中文摘要

稀疏自编码器(SAE)特征越来越多地用于引导语言模型,但特征引导很少是干净的:相同的干预在不同上下文中可能表现不一致,并扰动不相关的特征。我们引入了一个干预前筛选框架,用于从引导前计算的特征统计中预测SAE引导的副作用。我们沿着引导模块化的两个轴(效果稳定性和附带扩散)来操作化副作用,并在ReLU、JumpReLU和TopK SAE字典上评估GPT-2-small、Pythia-70M-deduped、Gemma-2-2B和Llama-3.1-8B。在这些设置中,解码器几何、激活统计、共激活结构和直接logit足迹比仅频率和激活幅度基线更好地预测引导模块化。信号在GPT-2-small、Pythia-70M和Llama-3.1-8B中最强,在那里它能在对抗幅度相关混杂的残差化后幸存,而在Gemma-2-2B中较弱。保留筛选表明,通过预测的清洁度对未见特征进行排序可以选择在新上下文中更干净地引导的特征,但成功的轴因设置而异:GPT-2在清洁度上提升最大,Pythia主要在稳定性上提升,Llama主要在附带性上提升,而Gemma仅部分提升。一个受控的Llama Scope宽度比较表明,在32K到128K字典宽度变化下,预测信号仍然存在,尽管筛选收益变得不太稳定。总体而言,SAE引导的副作用是可提前预测的,但有用的预测器签名和迁移的模块化轴依赖于模型和字典设置。

英文摘要

Sparse autoencoder (SAE) features are increasingly used to steer language models, but feature steering is rarely clean: the same intervention can behave inconsistently across contexts and perturb unrelated features. We introduce a pre-intervention screening framework for forecasting SAE steering side effects from feature statistics computed before steering. We operationalize side effects along two axes of steering modularity, effect stability and collateral spread, and evaluate GPT-2-small, Pythia-70M-deduped, Gemma-2-2B, and Llama-3.1-8B across ReLU, JumpReLU, and TopK SAE dictionaries. Across these settings, decoder geometry, activation statistics, co-activation structure, and direct-logit footprint predict steering modularity better than frequency-only and activation-magnitude baselines. The signal is strongest in GPT-2-small, Pythia-70M, and Llama-3.1-8B, where it survives residualization against magnitude-related confounds, and weaker in Gemma-2-2B. Held-out screening shows that ranking unseen features by predicted cleanliness can select features that steer more cleanly on fresh contexts, but the successful axis varies by setting: GPT-2 improves most cleanly, Pythia improves mainly on stability, Llama mainly on collateral, and Gemma only partially. A controlled Llama Scope width comparison shows that the predictive signal persists under a 32K-to-128K dictionary-width change, although the screening payoff becomes less stable. Overall, SAE steering side effects are predictable in advance, but the useful predictor signature and transferred modularity axis are model- and dictionary-setting dependent.