arXivDaily arXiv每日学术速递 周一至周五更新
重置
2605.11130 2026-06-04 cs.LG cs.AI

HEPA: A Self-Supervised Horizon-Conditioned Event Predictive Architecture for Time Series

HEPA: 一种用于时间序列的自监督水平条件事件预测架构

Jonas Petersen, Gian-Alessandro Lombardi, Riccardo Maggioni, Camilla Mazzoleni, Federico Martelli, Philipp Petersen

发表机构 * ETH Zurich(苏黎世联邦理工学院) Forgis University of Vienna(维也纳大学)

AI总结 提出HEPA架构,通过因果Transformer编码器联合嵌入预测(JEPA)预训练和仅微调预测器生成单调生存累积分布函数,在14个基准测试中超过PatchTST等模型,参数和标注数据量减少一个数量级。

详情
Comments
Spotlight at FMSD, ICML 2026. Code: https://github.com/Forgis-Labs/HEPA
AI中文摘要

多变量时间序列中的关键事件,从涡轮机故障到心律失常,需要准确的预测,但由于此类事件罕见且标注成本高,标注数据稀缺。我们引入了HEPA(水平条件事件预测架构),基于两个关键原则。首先,通过联合嵌入预测架构(JEPA)预训练因果Transformer编码器:一个水平条件预测器学习预测未来表示而非未来值,迫使编码器仅从无标注数据中捕获可预测的时间动态。其次,我们冻结编码器,仅微调预测器以预测目标事件,生成随水平单调的生存累积分布函数(CDF)。在所有基准测试中,使用固定的架构和优化器超参数,HEPA处理了水污染、网络攻击检测、波动率制度以及跨11个领域的另外8种事件类型,在14个基准测试中的至少10个上超过了包括PatchTST、iTransformer、MAE和Chronos-2在内的领先时间序列架构,调优参数少一个数量级,并且在生命周期数据集上,标注数据少一个数量级。

英文摘要

Critical events in multivariate time series, from turbine failures to cardiac arrhythmias, demand accurate prediction, yet labeled data is scarce because such events are rare and costly to annotate. We introduce HEPA (Horizon-conditioned Event Predictive Architecture), built on two key principles. First, a causal Transformer encoder is pretrained via a Joint-Embedding Predictive Architecture (JEPA): a horizon-conditioned predictor learns to forecast future representations rather than future values, forcing the encoder to capture predictable temporal dynamics from unlabeled data alone. Second, we freeze the encoder and finetune only the predictor toward the target event, producing a monotonic survival cumulative distribution function (CDF) over horizons. With fixed architecture and optimiser hyperparameters across all benchmarks, HEPA handles water contamination, cyberattack detection, volatility regimes, and eight further event types across 11 domains, exceeding leading time-series architectures including PatchTST, iTransformer, MAE, and Chronos-2 on at least 10 of 14 benchmarks, with an order of magnitude fewer tuned parameters and, on lifecycle datasets, an order of magnitude less labeled data.

2605.09081 2026-06-04 cs.LG cs.AI

FactoryNet: A Large-Scale Dataset toward Industrial Time-Series Foundation Models

FactoryNet:面向工业时间序列基础模型的大规模数据集

Karim Othman, Jonas Petersen, Matei Ignuta-Ciuncanu, Camilla Mazzoleni, Federico Martelli, Alessandro Lombardi, Riccardo Maggioni, Philipp Petersen

发表机构 * ETH Zurich(苏黎世联邦理工学院)

AI总结 提出首个工业时间序列通用预训练语料库FactoryNet,通过统一模式实现跨实体零样本迁移和高效异常检测。

详情
Comments
Accepted at AI4Physics and FMSD, ICML 2026. Code: https://github.com/Forgis-Labs/FactoryNet
AI中文摘要

我们引入了首个工业时间序列数据的通用预训练语料库:FactoryNet。该数据集包含51M个数据点,涵盖六种实体上的23k个端到端任务执行(13.3k真实,9.8k合成),通过共享模式实现了鲁棒的零样本跨实体迁移和高参数效率的异常检测。我们提出了一种新颖的模式:设定点、努力、反馈、上下文(S-E-F-C),该模式贯穿整个流水线,将任何驱动系统映射到共同的表示框架。该语料库涵盖27种标注的异常类型,以及健康基线和机器人操作与加工领域的反事实对。跨实体迁移实验取得了积极结果:在考虑偏见的指标下,我们的模型在评估的源-目标对上展示了公平的跨实体迁移能力,而24个模式对齐的信号与高维基线相比,实现了有竞争力的异常检测性能。我们发布FactoryNet作为一个不断增长的多实体数据集,以推动工业基础模型的发展。

英文摘要

We introduce the first universal pretraining corpus for industrial time-series data: FactoryNet. 51M datapoints across 23k end-to-end task executions (13.3k real, 9.8k synthetic) on six embodiments, unified by a shared schema that enables robust zero-shot cross-embodiment transfer and highly parameter-efficient anomaly detection. We introduce a novel schema: Setpoint, Effort, Feedback, Context (S-E-F-C) underlying the whole pipeline that maps any actuated system into a common representational frame. The corpus spans 27 annotated anomaly types alongside healthy baselines and counterfactual pairs across robotic manipulation and machining domains. Cross-embodiment transfer experiments yield positive results: under bias-aware metrics our model demonstrates fair cross-embodiment transfer capabilities on the evaluated source-target pair, while 24 schema-aligned signals achieves competitive anomaly detection performance compared to high-dimensional baselines. We release FactoryNet as a growing, multi-embodiment dataset to drive progress toward industrial foundation models.

2603.07523 2026-06-04 cs.LG

Breaking the Scale Barrier: One-Shot Knowledge Transfer via Frequency Transform

基于频域知识的通用模型初始化

Jianlu Shen, Fu Feng, Yucheng Xie, Jiaqi Lv, Xin Geng

发表机构 * arXiv.org

AI总结 提出FRONT框架,利用离散余弦变换提取权重的低频分量作为“学习基因”,通过截断或填充实现任意大小模型的免训练初始化,并可选频谱正则化提升迁移性,在视觉任务中加速收敛15倍,语言任务中平均减少40.5%训练计算量。

详情
AI中文摘要

通过微调大规模预训练网络来迁移知识已成为下游任务的标准范式,然而预训练模型的知识与单一架构紧密耦合,限制了在不同规模模型间的灵活复用。针对这一挑战,近期方法通常采用参数选择(无法捕捉知识的相互依赖结构)或使用生成模型进行参数预测(依赖于对大规模网络集合的不切实际访问)。在本文中,我们实验证明,模型的基础、任务无关知识(即其“学习基因”)编码在权重的低频分量中,并且可以被下游模型高效继承。基于这一发现,我们提出FRONT(频域知识迁移),一种新颖框架,使用离散余弦变换(DCT)分离低频“学习基因”。该学习基因可以通过简单的截断或填充无缝适配以初始化任意大小的模型,整个过程无需训练。为了提升性能,我们提出一个可选的低成本精炼过程,引入频谱正则化器以进一步提高学习基因的可迁移性。大量实验表明,FRONT达到了最先进的性能,在视觉任务中加速收敛高达15倍,在语言任务中平均减少40.5%的训练FLOPs。

英文摘要

Transferring knowledge by fine-tuning large-scale pre-trained networks has become a standard paradigm for downstream tasks, yet the knowledge of a pre-trained model is tightly coupled with monolithic architecture, which restricts flexible reuse across models of varying scales. In response to this challenge, recent approaches typically resort to either parameter selection, which fails to capture the interdependent structure of this knowledge, or parameter prediction using generative models that depend on impractical access to large network collections. In this paper, we identify the low-frequency components of model weights as the concrete carrier of foundational, task-agnostic knowledge, its ``learngene", and validate this by demonstrating its efficient inheritance by downstream models and tasks. Based on this insight, we propose FRONT (FRequency dOmain kNowledge Transfer), a novel framework that uses the Discrete Cosine Transform (DCT) to isolate the low-frequency ``learngene". This learngene can be seamlessly adapted to initialize models of arbitrary size via simple truncation or padding, a process that is entirely training-free. For enhanced performance, we propose an optional low-cost refinement process that introduces a spectral regularizer to further improve the learngene's transferability. Extensive experiments demonstrate that FRONT achieves the state-of-the-art performance, accelerates convergence by up to $15\times$ in vision tasks, and reduces training FLOPs by an average of 40.5% in language tasks. Code is available at https://github.com/LUcy0505/FRONT.

2602.05725 2026-06-04 cs.LG math.OC stat.ML

Muon in Associative Memory Learning: Training Dynamics and Scaling Laws

联想记忆学习中的Muon:训练动力学与缩放定律

Binghui Li, Kaifei Wang, Han Zhong, Pinyan Lu, Liwei Wang

发表机构 * arXiv.org University of Science and Technology of China(中国科学技术大学)

AI总结 本文在联想记忆模型中研究Muon优化器的训练动力学和缩放定律,证明其相比梯度下降在无噪声情况下实现指数加速,在有噪声情况下具有更优的缩放效率。

详情
Comments
Published as a conference paper at ICML 2026; 53 pages
AI中文摘要

Muon通过梯度的矩阵符号更新矩阵参数,并显示出强大的经验增益,但其动力学和缩放行为在理论上仍不清楚。我们在具有softmax检索和查询-答案对上的层次频谱(含和不含标签噪声)的线性联想记忆模型中研究Muon。在该设置下,我们证明梯度下降以高度不平衡的速率学习频率分量,导致收敛缓慢,瓶颈在于低频分量。相比之下,Muon优化器缓解了这种不平衡,实现了更快且更均匀的进展。具体地,在无噪声情况下,Muon实现了相对于梯度下降的指数加速;在具有幂律频谱的有噪声情况下,我们推导了Muon的缩放定律,并展示了其相对于梯度下降的优越缩放效率。此外,我们表明Muon可以解释为由自适应任务对齐和块对称梯度结构产生的隐式矩阵预处理器。相比之下,具有坐标符号算子的预处理器在已知未知任务表示的oracle访问下才能匹配Muon,而这在实践中的SignGD中是不可行的。在合成长尾分类和LLaMA风格预训练上的实验证实了该理论。

英文摘要

Muon updates matrix parameters via the matrix sign of the gradient and has shown strong empirical gains, yet its dynamics and scaling behavior remain unclear in theory. We study Muon in a linear associative memory model with softmax retrieval and a hierarchical frequency spectrum over query-answer pairs, with and without label noise. In this setting, we show that Gradient Descent (GD) learns frequency components at highly imbalanced rates, leading to slow convergence bottlenecked by low-frequency components. In contrast, the Muon optimizer mitigates this imbalance, leading to faster and more uniform progress. Specifically, in the noiseless case, Muon achieves an exponential speedup over GD; in the noisy case with a power-law frequency spectrum, we derive Muon's scaling law and demonstrate its superior scaling efficiency over GD. Furthermore, we show that Muon can be interpreted as an implicit matrix preconditioner arising from adaptive task alignment and block-symmetric gradient structure. In contrast, the preconditioner with coordinate-wise sign operator could match Muon under oracle access to unknown task representations, which is infeasible for SignGD in practice. Experiments on synthetic long-tail classification and LLaMA-style pre-training corroborate the theory.

2605.25200 2026-06-04 cs.CL

GroupTravelBench: Benchmarking LLM Agents on Multi-Person Travel Planning

GroupTravelBench: 多人群组旅行规划中LLM智能体的基准测试

Xiang Cheng, Yulan Hu, Lulu Zheng, Zheng Pan, Xin Li, Yong Liu

发表机构 * Gaoling School of Artificial Intelligence, Renmin University of China(中国人民大学首都人工智能学院) AMAP, Alibaba Group(阿里集团AMAP)

AI总结 提出GroupTravelBench基准,通过多用户多轮对话任务评估LLM智能体在偏好获取、冲突协调和公平规划三方面的能力。

详情
Comments
work in process
AI中文摘要

旅行规划是评估LLM智能体规划与工具使用能力的现实任务。然而,现有基准通常只假设单一用户,从而回避了现实场景中最具挑战性的方面之一:智能体识别和解决多用户冲突的能力。为填补这一空白,我们引入了 extbf{GroupTravelBench},这是首个针对 extbf{多用户、多轮}旅行规划的基准。基于真实用户画像、POI数据和票价数据,我们综合生成了650个任务,并将其分为三个难度等级。除了单用户行程规划所需的标准能力(如多步推理和工具使用)外,我们的基准进一步评估了旅行智能体所需的三项关键能力:\emph{(i) 获取}——主动进行多轮对话以收集每位用户的偏好;\emph{(ii) 协调}——通过妥协或分组策略解决用户间的冲突;以及\emph{(iii) 规划}——搜索能最大化整体群体效用同时保持公平性和可行性的旅行方案。为模拟现实中的对话式行程规划,同时确保可靠的工具使用和离线评估,我们构建了一个带有缓存真实工具数据的交互式沙箱环境。我们评估了多种LLM,发现即使是前沿模型在偏好覆盖率和群体公平性方面仍存在显著弱点。 extit{GroupTravelBench}为推进LLM智能体在现实旅行规划中的研究提供了一个实用且可复现的基准。

英文摘要

Travel planning in the real world is overwhelmingly a \textit{group} activity, yet existing LLM travel-planning benchmarks reduce it to a single user, where the field is approaching saturation. This single-user assumption sidesteps what makes group planning hard for an agent: discovering private preferences across multiple users, surfacing conflicts, and balancing utility against fairness. To bring the task back to its multi-user reality, we introduce \textbf{\textit{GroupTravelBench}}, the first benchmark for \textbf{multi-user, multi-turn} travel planning. Built from real user profiles, POI data, and ticket prices, it comprises 650 tasks across three difficulty levels, each running in a synchronous group-chat sandbox with cached tool data for reproducible offline evaluation. Beyond the multi-step reasoning and tool use that single-user benchmarks already test, GroupTravelBench probes three group-specific capabilities: \textit{(i) elicitation} of private preferences through multi-turn dialogue; \textit{(ii) coordination} of inter-user conflicts via compromise or subgrouping; and \textit{(iii) planning} that balances group utility against fairness. We pair this with a complementary evaluation framework combining rule-based outcome metrics and LLM-judge process metrics. Across a wide range of frontier models, even the strongest agents fall short on all four rule-based outcome metrics, with plan validity below 12\%, suggesting that group-level outcome quality is a key open challenge for LLM travel-planning agents.

2605.24782 2026-06-04 cs.LG

The Perception-Physics Paradox: Probing Scientific Alignment with TC-Bench

感知-物理悖论:用TC-Bench探究科学对齐

Dingling Yao, Andrea Polesello, Adeel Pervez, Caroline Muller, Francesco Locatello

发表机构 * ETH Zurich(苏黎世联邦理工学院) DeepMind University of Cambridge(剑桥大学) University of Amsterdam(阿姆斯特丹大学) University of Toronto(多伦多大学)

AI总结 本文提出科学对齐概念,通过结构同构性构建层次化必要条件,并发布TC-Bench基准数据集,揭示视觉基础模型在极端条件下依赖视觉捷径而非科学推理。

详情
Comments
Accepted at ICML 2026
AI中文摘要

虽然视觉基础模型(VFM)在卫星图像的预测任务中表现出色,但其性能可能源于视觉相关性而非底层结构不变性,这使得基于感知的分布外准确性甚至不能作为科学实用性的良好代理。因此,模型可能看起来正确但推理错误,我们将这种差异称为感知-物理悖论。为了解决这一差距,我们引入科学对齐作为科学领域表示学习的隐式目标。我们通过结构同构性研究科学对齐的一个原则性、可测试的方面,该要求潜在表示能够唯一地识别物理系统,直至线性重新参数化。这一视角引出了一个层次化的必要条件,并为物理和因果可解释性提供了系统的探测协议。为了实施这一框架,我们发布了TC-Bench,这是一个全球性的、可复现的基准数据集,带有自动构建流程,用于热带气旋研究,并表明当前的VFM依赖于在极端条件下崩溃的视觉捷径,表明科学对齐并非仅仅是规模扩展的自然副产品。

英文摘要

While Vision Foundation Models (VFMs) excel at predictive tasks on satellite imagery, their performance can arise from visual correlations rather than underlying structural invariants, making even perception-based out-of-distribution accuracy a poor proxy for scientific utility. As a result, models may look correct without reasoning correctly, a discrepancy we term the Perception-Physics Paradox. To address this gap, we introduce scientific alignment as an implicit objective for representation learning in scientific domains. We study a principled, testable aspect of scientific alignment through structural isomorphism, which requires latent representations to uniquely identify physical systems up to a linear reparameterization. This perspective induces a hierarchy of necessary conditions and yields a systematic probing protocol for physical and causal interpretability. To operationalize this framework, we release TC-Bench, a global, reproducible benchmark dataset with an automated construction pipeline for tropical cyclone research, and show that current VFMs rely on visual shortcuts that collapse in intense regimes, indicating that scientific alignment does not arise as a natural byproduct of scaling alone.

2605.24602 2026-06-04 cs.CV cs.AI

Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory

纠正注意力分散引起的视觉模糊以减少幻觉:算法与理论

Quanjiang Li, Zhiming Liu, Wei Luo, Tingjin Luo, Chenping Hou

发表机构 * National University of Singapore(新加坡国立大学) University of Science and Technology of China(中国科学技术大学)

AI总结 本文揭示多模态大语言模型中的物体幻觉与类人注意力分散现象相关,并提出一种无需额外训练的注意力聚焦方法(AFIP)通过跨头注意力增强和动态历史注意力强化来纠正视觉模糊,从而减少幻觉。

详情
Journal ref
ICML2026
AI中文摘要

多模态大语言模型(MLLMs)经常遭受物体幻觉的困扰,但导致这一失败的视觉感知机制仍知之甚少。在这项工作中,我们揭示幻觉与一种类人注意力分散现象密切相关,其中人类在注意力分散下会经历视觉清晰度下降并产生不准确的描述,而在模型中,同样的机制表现为解码过程中多头注意力的空间不一致性以及对图像令牌注意力的时间衰减。我们进一步提供了理论见解,表明注意力分散会增加模型复杂度并降低分类泛化能力。受这些发现的启发,我们提出了一种用于改进图像感知的注意力聚焦方法(AFIP),该方法通过跨头注意力丰富来纠正注意力分散,并通过动态历史注意力增强来强化视觉基础。在多个基准和模型上的大量实验验证了AFIP的有效性,且无需额外训练。

英文摘要

Multimodal large language models (MLLMs) frequently suffer from object hallucinations, yet the visual perceptual mechanism underlying this failure remains poorly understood. In this work, we reveal that hallucinations are strongly associated with a human-like attention distraction phenomenon, where humans under divided focus experience degraded visual clarity and produce inaccurate descriptions, while in models the same mechanism manifests as spatial inconsistency in multi-head attention and temporal fading of attention to image tokens during decoding. We further provide theoretical insights that attention dispersion increases model complexity and degrades classification generalization. Motivated by these findings, we propose an Attention-Focused Approach for Improved Image Perception (AFIP), which corrects attention distraction via cross-head attention enrichment and reinforces visual grounding through dynamic historical attention enhancement. Extensive experiments on multiple benchmarks and models validate the effectiveness of AFIP without additional training.

2605.17273 2026-06-04 cs.LG cs.AI

Position: State-of-the-Art Claims Require State-of-the-Art Evidence

立场:声称最先进需要最先进的证据

YongKyung Oh

发表机构 * YongKyung Oh(永庆欧)

AI总结 本文指出人工智能和机器学习研究中普遍存在的声称最先进(SOTA)与证据不足之间的差距,通过分析十个跨领域基准测试发现,超过一半的顶级模型比较中至少一项常见的优越性假设不成立,并呼吁声明语言应反映证据强度。

详情
AI中文摘要

最先进(SOTA)声称在人工智能(AI)和机器学习(ML)研究中普遍存在。这些声称基于基准评估,其中模型根据跨任务的总分进行排名。公共基准或排行榜是最明显的实例,但相同的结构也出现在文献中的论文表格中。然而,这种微弱的证据往往无法支持这些强有力的声称。我们识别出AI基准测试中普遍存在的声称-证据差距。声称SOTA隐含着超越平均分数优越性的假设,表明模型在大多数任务上显著优于替代方案。然而,平均分数的边际改进仅表明平均排名靠前,而非真正的优越性。通过分析来自公共排行榜的十个跨领域基准测试,我们发现超过一半的顶级模型比较中,至少一项常见的优越性假设不成立。这些属性包括有意义的效应大小、跨任务的一致性,或对数据集移除的鲁棒性。相反,总分提升往往由异常数据集驱动。即使在任务众多的基准测试中,这种脆弱性仍然存在。我们认为,声称语言应反映潜在证据的强度。这不需要额外的实验,只需诚实地报告结果实际显示的内容,从而实现跨模型更精确和可解释的比较。

英文摘要

State-of-the-Art (SOTA) claims pervade Artificial Intelligence (AI) and Machine Learning (ML) research. These claims rest on benchmark evaluations, where models are ranked by aggregate scores across tasks. Public benchmarks or leaderboards are the most visible instance, but the same structure appears in paper tables throughout the literature. However, such minimal evidence often cannot support these strong claims. We identify a widespread claim-evidence gap in AI benchmarking. Claiming SOTA carries implicit assumptions beyond mean score superiority, suggesting that a model meaningfully outperforms alternatives across most tasks. However, a marginal improvement in the mean score merely indicates a top average rank rather than true superiority. Analyzing ten cross-domain benchmarks from public leaderboards, we found that in more than half of top-model comparisons, at least one commonly assumed property of superiority does not hold. These properties include meaningful effect size, consistency across tasks, or robustness to dataset removal. Instead, aggregate gains are frequently driven by outlier datasets. This fragility persists even in benchmarks with many tasks. We argue that claim language should reflect the strength of the underlying evidence. This requires no additional experiments, only honest reporting of what results actually show, enabling more precise and interpretable comparisons across models.

2605.22740 2026-06-04 cs.LG

Ternary Decision Trees with Locally-Adaptive Uncertainty Zones

三元决策树与局部自适应不确定性区域

William Smits

发表机构 * Avathon

AI总结 本文提出三元决策树,通过在每个分裂节点引入局部自适应的不确定性区域,改进传统二元决策树的决策准确性,并在多个数据集上验证了其优越性。

详情
Comments
V2: Major revision. Added decision-theoretic framework deriving optimal delta* as a node-local cost minimisation problem; four formal theoretical properties (Propositions 1-4); motivating example figure (Figure 5); strengthened related work and limitations analysis. 15 pages, 5 figures, 5 appendix sections. Submitted to Data Mining and Knowledge Discovery (DAMI)
AI中文摘要

决策树通过硬二元阈值划分特征空间,对远离决策边界和直接位于边界上的实例赋予相同的置信度。我们引入三元决策树,每个分裂节点附加一个半宽为delta的不确定性区域,位于最优阈值中心。该区域内实例的预测由两个子树的加权混合生成,并被标记为边界不确定,提示下游应用可能以不同方式处理这些预测。关键的是,delta在每个节点本地计算,基于标准CART分裂寻找过程中已有的统计信息,无需外部噪声指定。我们提出并评估了五种delta估计方法:质量平台(分裂标准曲线的平台宽度)、类别重叠(经验类别分布重叠)、增益比(分裂质量相对于分裂熵)、节点自助法(节点层面重采样下的阈值方差)以及边缘(受SVM启发的最近跨类训练实例距离)。在72个OpenML-CC18数据集上进行5折交叉验证后,所有五种方法结合概率路由显著优于标准CART在决定准确性上(Wilcoxon符号秩检验,p < 0.001)。边缘方法在效率上最佳(每个边界不确定标志率单位获得0.104准确性提升),在42个数据集上获胜,且不需要额外超参数。对三个Breiman合成基准的分析显示,边缘方法在干净数据上自我校准,而节点自助法和质量平台方法最佳跟踪理论不可约误差。在四个医疗和金融数据集上的实验展示了实际价值:在乳腺X线摄影中,节点自助法通过将10.8%的筛查病例标记为边界不确定,实现了+0.71%的决定准确性提升。

英文摘要

Decision trees assign identical confidence to instances near and far from each split threshold. We introduce ternary decision trees, which augment each split node with an uncertainty zone of half-width delta. A decision-theoretic framework characterises the optimal zone width delta* as the solution to a node-local cost-minimisation problem; four formal properties are established: accuracy decomposition, a sufficiency condition for decided accuracy improvement, an exact efficiency characterisation (eta = Dec-Acc minus Acc_u, the accuracy gap between decided and boundary-uncertain predictions), and asymptotic consistency of the margin method. Instances within the zone receive predictions by weighted blending of both child subtrees and are flagged as boundary-uncertain. We propose and evaluate five delta-estimation methods: quality-plateau (plateau width of the split criterion curve), class-overlap (empirical class-distribution overlap), gain-ratio (split quality relative to split entropy), node-bootstrap (threshold variance under node-level resampling), and margin (SVM-inspired distance to the nearest cross-class training example). All methods reuse statistics already computed during standard CART split finding, requiring no external noise specification. Evaluated across 71 of the 72 OpenML-CC18 datasets with 5-fold cross-validation, all five methods with probabilistic routing significantly outperform standard CART on decided accuracy (Wilcoxon signed-rank, p < 0.001). The margin method achieves the best efficiency (0.104 accuracy gain per unit flagging rate), wins on 42 of 72 datasets, and requires zero hyperparameters. Analysis on Breiman synthetic benchmarks confirms margin is self-calibrating on clean data. On mammography, node-bootstrap achieves +0.71% decided accuracy by flagging 10.8% of cases as boundary-uncertain.

2605.22240 2026-06-04 cs.AI

Unlocking Proactivity in Task-Oriented Dialogue

解锁任务导向型对话中的主动性

Azure Zhang, Ning Gao, Yuqin Dai, Ruiyuan Wu, Jinpeng Wang, Rena Wei Gao, Bingdong Tan, Shuzheng Gao, Zongjie Li, Chaozheng Wang

发表机构 * Keeta AI, Meituan(Keeta AI,美团) Independent Researcher(独立研究者) CUHK(香港中文大学) HKUST(香港科技大学)

AI总结 针对任务导向型对话中主动性问题,提出认知用户模拟器和模拟器诱导的非对称视角策略优化,通过建模用户潜在关注实现主动对话。

详情
AI中文摘要

主动任务导向型对话(如外呼销售)需要一个有说服力的代理,能够主动探询用户的关注点,并在有限轮次内引导对话走向接受。然而,后训练的LLM本质上是保守的,而奖励塑造强化学习(如GRPO)效果不佳,因为它仅重新加权被动策略已采样的内容。我们表明,以用户的潜在关注为条件可以解锁任何采样量都无法破坏的主动能力,从而将这些关注确立为关键的训练时信号。为将这一发现付诸实践,我们构建了**认知用户模拟器**,它将每个用户建模为一个分层角色,包括可观察的外部特征和隐藏的内部关注。该模拟器产生忠实且多样化的交互,同时输出每轮状态动态以跟踪说服进展。然后,我们引入**模拟器诱导的非对称视角策略优化**,将建模的关注和模拟状态转换转化为互补的训练目标:(1)*非对称在线自蒸馏*,将关注感知行为从同一策略的特权视角转移到其可部署的、仅对话视角;(2)*状态转换策略优化*...

英文摘要

Proactive task-oriented dialogue (TOD), such as outbound sales, demands a persuasive agent that actively probes the user's concerns and steers the conversation toward acceptance within a bounded number of turns. Yet post-trained LLMs are inherently conservative, and reward-shaping RL (e.g., GRPO) struggles since it only re-weights what an already passive policy samples. We show that conditioning on the user's latent concerns unlocks proactive capability that no amount of sampling can undermine, establishing these concerns as a pivotal training-time signal. To operationalize this finding, we build the \textbf{Cognitive User Simulator}, which models each user as a stratified persona comprising observable external traits and hidden internal concerns. The simulator produces faithful and diverse interactions, while emitting per-turn state dynamics that track persuasion progress. We then introduce \textbf{Simulator-Induced Asymmetric-View Policy Optimization}, which converts the modeled concerns and the simulation state transition into complementary training objectives: (1) \emph{Asymmetric On-Policy Self-Distillation} that transfers concern-aware behavior from a privileged view of the same policy into its deployable, conversation-only view; and (2) \emph{State-Transition Policy Refinement} ...

2605.18102 2026-06-04 cs.CV

DanceHMR: Hand-Aware Whole-Body Human Mesh Recovery from Monocular Videos

DanceHMR: 从单目视频中恢复手部感知的全身人体网格

Wenhao Shen, Ming Zhou, Hengyuan Zhang, Siyuan Bian, Youjiang Xu, Yuan Zhang

发表机构 * ByteDance Intelligent Creation(字节跳动智能创作)

AI总结 提出一种基于残差体手融合的时序一致全身HMR框架,通过身体上下文与手部观测的融合以及特写增强,实现稳定身体运动与精细手部恢复。

详情
Comments
Project page: https://shenwenhao01.github.io/dancehmr/
AI中文摘要

单目视频人体网格恢复对于数字人、虚拟角色动画和具身模拟至关重要,需要时间稳定性和表现力丰富的全身运动。现有视频HMR方法能生成连贯的身体运动,但常忽略精细的手部关节;而基于图像的全身体方法逐帧独立恢复SMPL-X网格,常导致手部运动抖动且不准确。我们提出一种针对具有挑战性的野外单目视频的时序一致全身体HMR框架。我们的模型通过残差体手融合统一身体上下文和特定部分的手部观测,在单个时序架构中实现稳定的身体运动和精细的手部恢复。我们进一步引入特写感知增强,以提高上半身构图下的鲁棒性。在全身体和仅身体基准上的实验表明,手部重建得到改善,身体精度具有竞争力。我们的方法在具有挑战性的真实世界视频中也产生了时间稳定且2D一致的SMPL-X运动。

英文摘要

Monocular video human mesh recovery is essential for digital humans, avatar animation, and embodied simulation, where both temporal stability and expressive whole-body motion are required. Existing video HMR methods produce coherent body motion but often overlook detailed hand articulation, while image-based whole-body methods recover SMPL-X meshes independently per frame, often leading to jittery and inaccurate hand motion. We present a temporally coherent whole-body HMR framework for challenging in-the-wild monocular videos. Our model unifies body context and part-specific hand observations through residual body-hand fusion, enabling stable body motion and detailed hand recovery within a single temporal architecture. We further introduce close-up-aware augmentation to improve robustness under upper-body framing. Experiments on whole-body and body-only benchmarks demonstrate improved hand reconstruction and competitive body accuracy. Our method also produces temporally stable and 2D-consistent SMPL-X motion in challenging real-world videos.

2605.21446 2026-06-04 cs.RO cs.AI

Lost in Fog: Sensor Perturbations Expose Reasoning Fragility in Driving VLAs

迷失在雾中:传感器扰动暴露驾驶VLA的推理脆弱性

Abhinaw Priyadershi, Jelena Frtunikj

发表机构 * NVIDIA Corporation, USA(NVIDIA公司,美国) NVIDIA GmbH, Germany(NVIDIA德国公司)

AI总结 通过受控传感器扰动实验,发现因果链解释的一致性可作为轨迹可靠性的高保真指标,并证明启用因果链生成可提升轨迹精度。

详情
AI中文摘要

可解释的自主驾驶规划器不仅依赖于生成解释,还依赖于这些解释在真实传感器退化下的可靠性。本文对自主驾驶中视觉-语言-动作(VLA)模型的鲁棒性进行了受控扰动研究,评估了Alpamayo R1(10B参数)在八种传感器扰动(四种强度的高斯噪声、两种光照极端条件和两种雾浓度;约18,000次推理试验)下的1,996个场景。我们发现推理一致性是轨迹可靠性的高保真指标:当扰动后因果链(CoC)解释发生变化时,轨迹偏差激增5.3倍(21.8米 vs 4.1米),跨攻击类型的相关系数r=0.99,每样本点双列相关系数r_pb=0.53(Cohen's d=1.12)。受控消融实验表明,在匹配的推理设置下,启用CoC生成与轨迹精度提升相关(平均提升11.8%;p<0.0001)。在测试的噪声范围(σ∈{10,30,50,70})内,退化近似线性(R²=0.957),而标准输入预处理防御仅提供边际缓解。综上,这些结果将CoC一致性确立为规划安全的定量代理,并激励基于推理的运行时监控以实现更安全的VLA部署。

英文摘要

Interpretable autonomous driving planners depend not only on generating explanations, but also on those explanations remaining reliable under real-world sensor degradation. In this paper we present a controlled perturbation study of Vision-Language-Action (VLA) robustness in autonomous driving, evaluating Alpamayo R1 (10B parameters) across 1,996 scenarios under eight sensor perturbations (Gaussian noise at four intensities, two lighting extremes, and two fog levels; ${\sim}18{,}000$ inference trials). We find that reasoning consistency is a high-fidelity indicator of trajectory reliability: when Chain-of-Causation (CoC) explanations change after perturbation, trajectory deviation spikes $5.3{\times}$ (21.8m vs 4.1m), with $r\!=\!0.99$ across attack types and $r_{pb}\!=\!0.53$ per-sample (Cohen's $d\!=\!1.12$). A controlled ablation provides evidence that enabling CoC generation is associated with improved trajectory accuracy (11.8% on average across conditions; $p < 0.0001$) under matched inference settings. Over the tested noise range ($σ\in \{10, 30, 50, 70\}$), degradation is approximately linear ($R^2\!=\!0.957$), while standard input preprocessing defenses provide only marginal relief. Together, these results establish CoC consistency as a quantitative proxy for planning safety and motivate reasoning-based runtime monitoring for safer VLA deployment.

2605.20654 2026-06-04 cs.LG cs.AI

REFLECTOR: Internalizing Step-wise Reflection against Indirect Jailbreak

REFLECTOR: 内化逐步反思以对抗间接越狱

Jiachen Ma, Jiawen Zhang, Xiangtian Li, Bo Zou, Chaochao Lu, Chao Yang

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 提出REFLECTOR两阶段框架,通过教师引导生成反思数据并进行监督微调,再结合强化学习内化自主反思能力,在复杂间接攻击下实现超过90%的防御成功率,同时提升通用性能。

详情
Comments
ICML 2026
AI中文摘要

尽管大型语言模型(LLMs)展现出卓越的能力,但它们仍然容易受到复杂的多步越狱攻击,这些攻击通过利用内部生成过程来规避传统的表面安全对齐。为了解决这些漏洞,我们提出了REFLECTOR,一个原则性的两阶段框架,将自我反思内化在生成轨迹中。REFLECTOR首先利用教师引导生成高质量反思数据用于监督微调(SFT),建立结构化的反思模式。随后,它使用强化学习(RL)结合结果驱动和奖励有效性监督,以培养稳健、自主的自我反思能力。实验结果表明,REFLECTOR在复杂的间接攻击下实现了超过90%的防御成功率(DSR),同时在不同威胁场景中具有稳健的泛化能力。值得注意的是,该框架增强了任务特定和通用效用,在GSM8K上获得了5.85%的提升,并在知识密集型基准测试中表现更佳。通过内化轨迹级安全性,REFLECTOR克服了表面对齐的基本限制,且没有显著的计算开销,为开发安全且能力强大的LLMs提供了一种高效且可扩展的解决方案。

英文摘要

While Large Language Models (LLMs) demonstrate remarkable capabilities, they remain susceptible to sophisticated, multi-step jailbreak attacks that circumvent conventional surface-level safety alignment by exploiting the internal generation process. To address these vulnerabilities, we propose Reflector, a principled two-stage framework that internalizes self-reflection within the generation trajectory. Reflector first leverages teacher-guided generation to produce high-quality reflection data for supervised fine-tuning (SFT), establishing structured reflection patterns. It subsequently uses Reinforcement Learning (RL) with outcome-driven and reward-validity supervision to instill robust, autonomous self-reflection capabilities. Empirical results show that Reflector achieves Defense Success Rates (DSR) exceeding 90% against complex indirect attacks while generalizing robustly across diverse threat scenarios. Notably, the framework enhances both task-specific and general utility, yielding a 5.85% gain on GSM8K alongside improved performance on knowledge-intensive benchmarks. By internalizing trajectory-level safety, Reflector overcomes the fundamental limitations of surface alignment without significant computational overhead, offering an efficient and scalable solution for the development of safe and capable LLMs.

2605.19398 2026-06-04 cs.CV cs.AI

Rebalancing Reference Frame Dominance to Improve Motion in Image-to-Video Models

重新平衡参考帧主导性以改善图像到视频模型中的运动

Wooseok Jeon, Seungho Park, Seunghyun Shin, Sangeyl Lee, Hyeonho Jeong, Hae-Gon Jeon

发表机构 * Yonsei University(延世大学) GIST(韩国科学技术院) Adobe Research(Adobe研究)

AI总结 针对图像到视频模型生成视频过于静态的问题,提出无需训练且模型无关的DyMoS方法,通过重新平衡去噪初期生成帧对参考帧的注意力来增强运动,同时保持视觉质量和保真度。

详情
Comments
Preprint. Project page: https://sh0xed98b8.github.io/DyMoS/
AI中文摘要

与文本到视频模型相比,图像到视频模型通常生成的视频过于静态。先前的方法通过削弱或修改图像条件信号来缓解这一问题,但往往需要额外训练或牺牲对参考图像的保真度。在这项工作中,我们识别出参考帧主导性是运动抑制的关键机制。我们观察到,I2V模型中的非参考帧将过多的自注意力分配给参考帧的关键词元,导致参考信息随时间过度传播,从而抑制了帧间动态。基于这一发现,我们提出了DyMoS(动态运动滑块),一种无需训练且模型无关的方法,在初始去噪步骤中重新平衡从生成帧到参考帧的注意力路径。DyMoS保持输入图像和模型权重不变,并引入单个标量参数以连续控制运动强度。在多个最先进的I2V骨干网络上的实验表明,DyMoS在保持视觉质量和参考图像保真度的同时,一致地改善了运动动态。

英文摘要

Image-to-video models often generate videos that remain overly static, compared to text-to-video models. While prior approaches mitigate this issue by weakening or modifying the image-conditioning signal, they often require additional training or sacrifice fidelity to the reference image. In this work, we identify reference-frame dominance as a key mechanism behind motion suppression. We observe that non-reference frames in I2V models allocate excessive self-attention to reference-frame key tokens, causing reference information to be over-propagated across time and suppressing inter-frame dynamics. Based on this finding, we propose DyMoS (Dynamic Motion Slider), a training-free and model-agnostic method that rebalances the attention pathway from generated frames to the reference frame during initial denoising steps. DyMoS leaves both the input image and model weights unchanged and introduces a single scalar parameter for continuous control over motion strength. Experiments across multiple state-of-the-art I2V backbones demonstrate that DyMoS consistently improves motion dynamics while maintaining visual quality and fidelity to the reference image.

2605.18879 2026-06-04 cs.LG cs.AI cs.CL

ZeroUnlearn: Few-Shot Knowledge Unlearning in Large Language Models

ZeroUnlearn:大语言模型中的少样本知识遗忘

Yujie Lin, Chengyi Yang, Zhishang Xiang, Yiping Song, Jinsong Su

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 提出ZeroUnlearn框架,通过模型编辑将机器遗忘重新定义为精确的知识重映射问题,利用封闭解乘法参数更新实现高效、定向的少样本遗忘。

详情
AI中文摘要

大型语言模型由于在海量网络语料上训练,不可避免地会保留敏感信息(定义为可能引发有害生成的输入),从而引发隐私和安全担忧。现有的机器遗忘方法主要依赖于重训练或激进微调,这些方法要么计算成本高,要么容易降低相关知识并损害整体模型效用。在这项工作中,我们通过模型编辑将机器遗忘重新表述为一个精确的知识重映射问题。我们提出了ZeroUnlearn,一个少样本遗忘框架。它通过将敏感输入映射到中性目标状态并移除其原始表示来覆盖敏感输入。ZeroUnlearn通过封闭解形式的乘法参数更新强制执行表示正交性,从而实现高效且有针对性的遗忘。我们进一步将ZeroUnlearn扩展到基于梯度的变体,用于多样本遗忘。实验表明,我们的方法在保持模型整体效用的同时优于现有基线。我们的代码可在github上获取:https://github.com/XMUDeepLIT/ZeroUnlearn。

英文摘要

Large language models inevitably retain sensitive information, defined as inputs that may induce harmful generations, due to training on massive web corpora, raising concerns for privacy and safety. Existing machine unlearning methods primarily rely on retraining or aggressive fine-tuning, which are either computationally expensive or prone to degrading related knowledge and overall model utility. In this work, we reformulate machine unlearning as a precise knowledge re-mapping problem via model editing. We propose ZeroUnlearn, a few-shot unlearning framework. It overwrites sensitive inputs by mapping them to a neutral target state and removing their original representations. ZeroUnlearn enforces representational orthogonality through a multiplicative parameter update with a closed-form solution, enabling efficient and targeted unlearning. We further extend ZeroUnlearn to a gradient-based variant for multi-sample unlearning. Experiments demonstrate that our approach outperforms existing baselines while preserving general model utility. Our code is available at the github: https://github.com/XMUDeepLIT/ZeroUnlearn.

2605.19852 2026-06-04 cs.CL

Are Tools Always Beneficial? Learning to Invoke Tools Adaptively for Dual-Mode Multimodal LLM Reasoning

工具总是有益的吗?学习自适应调用工具以实现双模式多模态大语言模型推理

Qinghe Ma, Zhen Zhao, Yiming Wu, Jian Zhang, Lei Bai, Yinghuan Shi

发表机构 * arXiv.org University of Science and Technology of China(中国科学技术大学)

AI总结 提出AutoTool模型,通过强化学习框架自适应决定是否调用工具,结合双模式推理策略和模式特定奖励函数,在提升准确率的同时降低推理开销。

详情
Comments
Accepted to ICML 2026
AI中文摘要

工具增强推理已成为增强多模态大语言模型(MLLMs)推理能力的一个有前景的方向。然而,现有研究主要关注使模型能够执行工具调用,而忽略了调用工具的必要性。我们认为工具使用并非总是有益的,因为冗余或不恰当的调用会大大增加推理开销,甚至误导模型预测。为解决这一问题,我们引入了AutoTool,一个根据每个查询的特征自适应决定是否调用工具的模型。在强化学习框架内,我们设计了一种显式的双模式推理策略,并配以模式特定的奖励函数,以引导模型产生准确的响应。此外,为防止过早偏向单一推理模式,AutoTool在整个训练过程中共同探索并平衡工具辅助推理和文本中心推理,并在后期促进自由探索。大量实验表明,AutoTool表现出卓越的性能和高效率,在V*基准测试上相比基础模型准确率提升21.8%,在POPE基准测试上相比现有工具增强方法效率提升44.9%。代码可在https://github.com/MQinghe/AutoTool获取。

英文摘要

Tool-augmented reasoning has emerged as a promising direction for enhancing the reasoning capabilities of multimodal large language models (MLLMs). However, existing studies mainly focus on enabling models to perform tool invocation, while neglecting the necessity of invoking tools. We argue that tool usage is not always beneficial, as redundant or inappropriate invocations largely increase reasoning overhead and even mislead model predictions. To address this issue, we introduce AutoTool, a model that adaptively decides whether to invoke tools according to the characteristics of each query. Within a reinforcement learning framework, we design an explicit dual-mode reasoning strategy with mode-specific reward functions to guide the model toward producing accurate responses. Moreover, to prevent premature bias toward a single reasoning mode, AutoTool jointly explores and balances tool-assisted and text-centric reasoning throughout training, and promotes free exploration in later stages. Extensive experiments demonstrate that AutoTool exhibits outstanding performance and high efficiency, yielding a 21.8\% accuracy gain on V* benchmark compared to the base model, and a 44.9\% improvement in efficiency over existing tool-augmented methods on POPE benchmark. Code is available at https://github.com/MQinghe/AutoTool.

2605.19294 2026-06-04 cs.RO cs.AI

DEFLECT: Temporal Counterfactual Preference Learning for Delay-Robust Asynchronous VLAs

DEFLECT: 面向延迟鲁棒异步VLA的时间反事实偏好学习

Yixiang Zhu, Yonghao Chen, Zijie Yang, Yusong Hu, Xinyu Chen

发表机构 * The Hong Kong University of Science and Technology (Guangzhou)(香港科学与技术大学(广州)) One Robotics

AI总结 针对异步视觉-语言-动作(VLA)策略中陈旧观测导致的预测-执行不匹配问题,提出离线后训练框架DEFLECT,通过反事实偏好监督学习偏好与执行时间对齐的动作,无需人工标注、在线部署或架构修改,显著提升高延迟下的任务成功率。

详情
AI中文摘要

视觉-语言-动作(VLA)策略越来越依赖异步推理,将大模型延迟隐藏在持续的机器人运动背后。虽然这避免了同步动作块执行的“走走停停”行为,但产生了预测-执行不匹配:下一个动作块是根据推理开始时的陈旧观测计算得出的,但仅在机器人和场景发生变化后才执行。因此,适合预测时状态的动作可能与执行时状态不对齐。现有的运行时修复、行为克隆和偏好对齐方法并未直接教导策略解决这种陈旧输入不匹配问题。我们提出DEFLECT,一个面向延迟鲁棒异步VLA的离线后训练框架。DEFLECT将延迟引起的不匹配转化为反事实偏好监督:冻结的参考VLA从未来的执行时间观测生成偏好块,并从陈旧的预测时间观测生成拒绝块。可训练策略在相同的部署时间输入下对两个块进行评分,学习偏好与执行时间对齐的动作,同时监督微调锚点保留专家动作流形。DEFLECT不需要人工偏好标签、奖励模型、在线机器人部署、架构更改或额外的推理时间计算。在Kinetix、LIBERO和三个真实机器人任务上,DEFLECT相比强异步VLA基线提高了延迟鲁棒性,在高延迟下成功率提升高达6.4个百分点,并在真实规模VLA的最长延迟下实现4.6个百分点的增益。

英文摘要

Vision-Language-Action (VLA) policies increasingly rely on asynchronous inference to hide large-model latency behind ongoing robot motion. While this avoids the stop-and-go behavior of synchronous action-chunk execution, it creates a prediction-execution mismatch: the next chunk is computed from a stale observation at inference start but executed only after the robot and scene have evolved. As a result, actions that fit the prediction-time state can become misaligned with the execution-time state. Existing runtime repair, behavior-cloning, and preference-alignment approaches do not directly teach the policy to resolve this stale-input mismatch. We propose DEFLECT, an offline post-training framework for delay-robust asynchronous VLAs. DEFLECT converts latency-induced mismatch into counterfactual preference supervision: a frozen reference VLA generates a preferred chunk from the future execution-time observation and a rejected chunk from the stale prediction-time observation. The trainable policy scores both chunks under the same deployment-time input, learning to favor execution-time-aligned actions while a supervised fine-tuning anchor preserves the expert action manifold. DEFLECT requires no human preference labels, reward models, online robot rollouts, architectural changes, or additional inference-time computation. Across Kinetix, LIBERO, and three real-robot tasks, DEFLECT improves delay robustness over strong asynchronous VLA baselines, raising high-latency success by up to 6.4 percentage points and achieving a 4.6 percentage-point gain at the longest delay on a real-scale VLA.

2605.18936 2026-06-04 cs.LG cs.CL

FedMental: Evaluating Federated Learning for Mental Health Detection from Social Media Data

FedMental: 评估用于社交媒体数据心理健康检测的联邦学习

Nuredin Ali Abdelkadir, Anjali Ratnam, Zeerak Talat, Stevie Chancellor

发表机构 * University of Minnesota(明尼苏达大学) University of Edinburgh(爱丁堡大学)

AI总结 本文通过联邦学习和差分隐私联邦学习在抑郁和自杀危机检测任务上的实验,评估了隐私保护技术对心理健康检测性能的影响,发现联邦学习性能接近集中式训练,但差分隐私联邦学习存在显著的性能-隐私权衡。

详情
Comments
Association for Computational Linguistics (ACL) 2026 Main Conference
AI中文摘要

社交媒体文本数据常用于训练机器学习模型以识别表现出高风险心理健康行为的用户。然而,共享这些敏感数据会带来隐私风险,并限制了基准数据集的发展。我们全面评估了隐私保护的机器学习技术是否能在保持性能的同时实现更安全的数据共享。具体来说,我们将联邦学习和差分隐私联邦学习应用于两个广泛研究的心理健康预测任务:X(Twitter)上的抑郁检测和Reddit上的自杀危机检测。通过将每个用户视为非独立同分布设置中的一个客户端,我们模拟了现实的数据共享场景,评估了不同的客户端比例、聚合策略和隐私预算。虽然联邦学习在抑郁识别上达到了与集中式训练相当的性能(集中式F1=85.63;最佳联邦学习模型F1=83.16),但我们发现差分隐私联邦学习即使在低噪声水平(epsilon=50)下也存在较大的性能-隐私权衡(F1下降高达27.01)。这是由于与心理健康相关的高信息量但稀疏的语言标记(如健康主题和情感词)被扭曲所致。本研究实证展示了当前隐私保护技术在心理健康推理任务中的潜力和局限性。

英文摘要

Social media text data are often used to train Machine Learning (ML) models to identify users exhibiting high-risk mental health behaviors. However, sharing this sensitive data poses privacy risks and limits the growth of benchmark datasets. We comprehensively evaluate whether privacy-preserving ML techniques can enable safer data sharing while preserving performance. Specifically, we apply federated learning (FL) and Differentially Private FL for two widely-studied mental health prediction tasks: depression detection on X (Twitter) and suicide crisis detection on Reddit. We simulate realistic data-sharing scenarios by treating each user as a client in a non-IID setting, evaluating across different client fractions, aggregation strategies, and privacy budgets. While FL achieves comparable performance to centralized training (centralized F1 = 85.63; best FL model F1 = 83.16) on depression identification, we find that Differentially Private FL has a large performance-privacy trade-off (up to F1 = 27.01 drop) even with low levels of noise (epsilon = 50). This is due to the distortion of highly informative yet sparse mental health linguistic markers related to mental health, like health topics and emotion words. This research empirically demonstrates the potential and limitations of current privacy preservation techniques for mental health inference tasks.

2605.18931 2026-06-04 stat.ML cs.AI cs.LG

Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models

马尔可夫链解码器克服Lipschitz生成模型的重尾限制

Abdelhakim Ziani, Andras Horvath, Paolo Ballarini

发表机构 * Université Paris Saclay, Lab. MICS, CentraleSupélec, Gif-sur-Yvette, France(巴黎萨克雷大学,MICS实验室,CentraleSupélec,法国吉夫昂耶vette) Università di Torino, Torino, Italy(都灵大学,意大利都灵)

AI总结 针对Lipschitz生成模型无法生成重尾分布的问题,提出用基于马尔可夫链的Phase-Type分布替换高斯解码器,显著降低了尾部误差和极端分位数误差。

详情
Journal ref
22nd European Performance Engineering Workshop (EPEW 2026), Jun 2025, Grimstad, Norway
AI中文摘要

重尾分布在性能评估、网络流量和风险建模中普遍存在。这种行为对现代深度生成模型构成了根本性挑战。标准变分自编码器(VAE)采用高斯解码器似然和Lipschitz约束神经网络,这种组合在结构上无法产生重尾输出:高斯尾部呈指数衰减,而Lipschitz连续性阻止解码器放大来自潜在空间的罕见事件以充分克服这种衰减。我们提供了这一局限性的理论刻画,并使用合成Pareto数据(跨越尾部指数$α$ ∈ {2, 3, 5, 30}和维度d ∈ {1, 5, 10}的网格)进行了受控实证演示。作为解决方案,我们在保持编码器、潜在空间和训练过程不变的情况下,将高斯解码器替换为基于马尔可夫链的Phase-Type(PH)分布。PH分布允许对任何正值分布(包括重尾族)进行任意精确的近似。实验表明,对于重尾数据,与高斯基线相比,基于PH的模型将尾部Kolmogorov-Smirnov距离减少了最多6倍,极端分位数误差减少了最多10倍。这些结果表明,将基于马尔可夫链的分布集成到生成模型的解码器中,为重尾生成问题提供了一个有原则且实际有效的解决方案。

英文摘要

Heavy-tailed distributions are prevalent in performance evaluation, network traffic, and risk modeling. This behavior poses a fundamental challenge for modern deep generative models. Standard Variational Autoencoders (VAEs) employ Gaussian decoder likelihoods and Lipschitz-constrained neural networks, a combination that is structurally incapable of producing heavy-tailed outputs: the Gaussian tail decays exponentially, and Lipschitz continuity prevents the decoder from amplifying rare events from the latent space input to sufficiently overcome this decay. We provide both a theoretical characterization of this limitation and a controlled empirical demonstration using synthetic Pareto data across a grid of tail indices $α$ $\in$ {2, 3, 5, 30} and dimensions d $\in$ {1, 5, 10}. As a solution, we replace the Gaussian decoder with a Phase-Type (PH) distribution based on Markov chains, while keeping the encoder, latent space, and training procedure identical. PH distributions allow for arbitrarily precise approximations of any positive-valued distributions, including heavy-tailed families. Experiments showed that the PH-based model reduces tail Kolmogorov-Smirnov distance by up to x6 and extreme quantile error by up to x10 compared to the Gaussian baseline for heavy-tailed data. These results demonstrate that integrating Markov chain-based distributions into the decoder of a generative model institutes a principled and practically effective solution to the heavy-tail generation problem.

2605.16331 2026-06-04 q-bio.BM cs.AI

Retrieval and competition: how a protein foundation model starts a protein

检索与竞争:蛋白质基础模型如何启动蛋白质

Piotr Jedryszek, Oliver M. Crook

发表机构 * Department of Biology, University of Oxford, Oxford, UK(牛津大学生物学系) Kavli Institute for Nanoscience Discovery, University of Oxford, Oxford, UK(牛津大学纳科学发现研究所) Department of Chemistry, University of Oxford, Oxford, UK(牛津大学化学系)

AI总结 通过追踪ESM2-8M预测蛋白质起始甲硫氨酸的计算路径,发现模型依赖位置先验检索而非直接识别,揭示了模型置信度与生物学证据之间的脱节。

详情
Comments
updated figure 4
AI中文摘要

蛋白质语言模型越来越多地用于指导实验和临床决策,但通常不清楚一个自信的预测是反映了对生物学证据的识别还是对统计默认值的检索。我们针对一个近乎普遍的生物学规则——蛋白质以甲硫氨酸起始——通过追踪ESM2-8M产生该预测的计算路径来检验这一区别。模型并未检测到掩码位置的甲硫氨酸。相反,它通过跨层组装的特定位置查询,从序列起始标记处的参考表示中检索出有利于甲硫氨酸的信号,最终输出通过与上下文相关电路的竞争而出现。为了理解位置信息如何到达读出端,我们引入了旋转频率带内注意力分数的范数-方向分解。位置编码通过分布在各个频带中的查询范数和角度对齐的耦合变化来运作。对于真实N端不是甲硫氨酸的序列(此时生物学问题至关重要),模型仍然预测甲硫氨酸。这不是由意外机制产生的正确预测,而是匹配统计平均值的位置先验检索电路的输出,在生物学偏离平均值的地方失败。区分这两者需要在单个电路、频率带和查询组成的层面上进行解析,这表明在生物学风险更高的预测中,机制验证将是必要且具有挑战性的。即使对于最简单的生物学规则,模型的预测也是通过分布式计算电路而非直接识别来介导的,这表明任务复杂性的增加将进一步模糊模型置信度与潜在生物学证据之间的关系。

英文摘要

Protein language models are increasingly used to guide experimental and clinical decisions, yet it is often unclear whether a confident prediction reflects recognition of biological evidence or retrieval of a statistical default. We examine this distinction for a near-universal biological rule, that proteins begin with methionine, by tracing the computational pathway through which ESM2-8M produces this prediction. The model does not detect methionine at the masked position. Instead, it retrieves a methionine-favouring signal from a reference representation at the beginning-of-sequence token via a position-specific query assembled across layers, with the final output emerging through competition with context-dependent circuits. To understand how positional information reaches the readout, we introduce a norm-direction decomposition of attention scores within rotary frequency bands. Positional encoding operates through coupled changes in query norm and angular alignment distributed across these bands. On sequences whose true N-terminus is not methionine, where the biological question matters, the model predicts methionine anyway. This is not a correct prediction produced by an unexpected mechanism, but the output of a positional-prior retrieval circuit that matches the statistical average and fails where biology diverges from it. Distinguishing the two requires resolution at the level of individual circuits, frequency bands, and query composition, suggesting that mechanistic verification will be necessary, and challenging, for predictions where the biological stakes are higher. Even for the simplest biological rule, the model's prediction is mediated by a distributed computational circuit rather than direct recognition, suggesting that increasing task complexity will further obscure the relationship between model confidence and underlying biological evidence.

2605.16301 2026-06-04 cs.CY cs.AI cs.LG

Do LLMs Hold Their Values? MANTA: A Multi-Turn Adversarial Benchmark for Animal Welfare Reasoning

LLMs 是否坚持其价值观?MANTA:一个用于动物福利推理的多轮对抗性基准

Isabella Luong, Joyee Chen, Arturs Kanepajs, Jasmine Brazilek, Sankalpa Ghose, David Williams-King, Linh Le, Allen Lu

发表机构 * SPAR Compassion Aligned Machine Learning(同情对齐机器学习) NUS(新加坡大学) Mila(Mila研究所) ERA Cambridge(剑桥ERA)

AI总结 提出 MANTA 基准,通过多轮对抗性对话评估大语言模型在动物福利推理中的价值观稳定性和道德敏感性,发现单轮基准无法捕捉的排名变化和物种-压力交互效应。

详情
AI中文摘要

评估大语言模型(LLMs)中的动物福利推理仍然是一个开放挑战,尽管它们在消费者和专业环境中迅速部署,其中福利考虑隐含地出现在日常查询中。现有的基准(如 AnimalHarmBench)通过单轮、明确框架的问题进行评估,衡量模型在直接询问时是否避免有害内容。这种方法忽略了两种失败模式:在持续对抗性压力下的对齐退化,以及道德敏感性(模型是否在日常查询中自发提出福利问题)。为填补这一空白,我们构建了 MANTA,一个包含 1,088 个五轮对话的基准,从隐式的第一轮场景开始,通过明确的福利提示,再到来自五种类型(社会、文化、经济、实用和认知)的三轮对抗性压力。我们在两个维度上对对话进行评分:动物福利价值观稳定性(AWVS,主要)和动物福利道德敏感性(AWMS,诊断)。我们评估了七个前沿模型:Claude Opus 4.7、GPT-5.5、DeepSeek V4、Llama 3.3 70B、Mistral Small、Grok 4.3 和 Gemini 3.1 Flash Lite。多轮评估捕捉了单轮基准遗漏的行为:7 个模型中有 4 个相对于第一轮得分改变了排名,包括 Gemini Flash Lite,它在 AWMS 上从第五名下降到 AWVS 上的最后一名。AWMS 和 AWVS 呈正相关但不完全相关,表明道德识别测试捕捉了模型在压力下行为的一个稳定但不完整的组成部分。MANTA 还提供了先前基准无法获得的物种-压力交互矩阵,显示福利鲁棒性同时取决于动物和施加的压力;伴侣动物得分高于野生动物,后者高于养殖动物和无脊椎动物。我们发布了数据集、脚本化压力计划、评判提示和分析代码。

英文摘要

Evaluating animal welfare reasoning in LLMs remains an open challenge despite rapid deployment in consumer and professional contexts where welfare considerations appear implicitly in everyday queries. Existing benchmarks such as AnimalHarmBench evaluate this through single-turn, explicitly framed questions, measuring whether models avoid harmful content when directly asked. This approach overlooks two failure modes: alignment degradation under sustained adversarial pressure, and moral sensitivity (whether a model spontaneously surfaces welfare stakes in everyday queries). To fill this gap, we construct MANTA, a benchmark of 1,088 five-turn conversations progressing from an implicit Turn-1 scenario through an explicit welfare prompt to three adversarial pressure rounds drawn from a five-type taxonomy: Social, Cultural, Economic, Pragmatic, and Epistemic. We score conversations on two dimensions: Animal Welfare Value Stability (AWVS, primary) and Animal Welfare Moral Sensitivity (AWMS, diagnostic). We evaluate seven frontier models: Claude Opus 4.7, GPT-5.5, DeepSeek V4, Llama 3.3 70B, Mistral Small, Grok 4.3, and Gemini 3.1 Flash Lite. Multi-turn evaluation captures behavior single-turn benchmarks miss: 4 of 7 models change rank relative to Turn 1 scores, including Gemini Flash Lite, which drops from fifth on AWMS to last on AWVS. AWMS and AWVS are positively but imperfectly correlated, suggesting moral-recognition tests capture a stable but incomplete component of model behavior under pressure. MANTA also enables a species-by-pressure interaction matrix unavailable to prior benchmarks, showing welfare robustness depends jointly on the animal and pressure applied; companion animals score above wild animals, which score above farmed animals and invertebrates. We release the dataset, scripted pressure plans, judge prompts, and analysis code.

2605.15980 2026-06-04 cs.CV

Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization

Flash-GRPO:通过单步策略优化实现视频扩散的高效对齐

Xiaoxuan He, Siming Fu, Zeyue Xue, Weijie Wang, Ruizhe He, Yuming Li, Dacheng Yin, Shuai Dong, Haoyang Huang, Hongfa Wang, Nan Duan, Bohan Zhuang

发表机构 * Zhejiang University(浙江大学) Joy Future Academy Independent Researcher(独立研究员) Tsinghua University(清华大学)

AI总结 提出Flash-GRPO单步训练框架,通过等时分组和时间梯度校正解决计算瓶颈,在低计算预算下实现优于全轨迹训练的对齐质量和训练效率。

详情
AI中文摘要

群体相对策略优化已成为将视频扩散模型与人类偏好对齐的关键,但面临一个关键的计算瓶颈:训练一个14B参数的模型通常每个实验需要数百个GPU天。现有的效率方法通过滑动窗口子采样训练时间步来降低成本,但从根本上损害了优化,表现出严重的不稳定性,并且无法达到完整的轨迹性能。我们提出了Flash-GRPO,一个单步训练框架,在低计算预算下在对齐质量上优于全轨迹训练,同时大幅提高了训练效率。Flash-GRPO解决了两个关键挑战:等时分组通过强制提示级别的时间一致性消除了时间步混淆的方差,将策略性能与时间步难度解耦;时间梯度校正中和了导致不同时间步梯度幅度极不一致的时间依赖缩放因子。在1.3B到14B参数模型上的实验验证了Flash-GRPO的有效性,展示了显著的训练加速,同时保持了一致的稳定性和最先进的对齐质量。

英文摘要

Group Relative Policy Optimization has emerged as essential for aligning video diffusion models with human preferences, but faces a critical computational bottleneck: training a 14B parametered model typically demands hundreds of GPU days per experiment. Existing efficiency methods reduce costs through sliding window subsampling training timesteps, but fundamentally compromise optimization, exhibiting severe instability and failing to reach full trajectory performance. We present Flash-GRPO, a single-step training framework that outperforms full trajectory training in alignment quality under low computational budgets while substantially improving training efficiency. Flash-GRPO addresses two critical challenges: iso-temporal grouping eliminates timestep-confounded variance by enforcing prompt-wise temporal consistency, decoupling policy performance from timestep difficulty; temporal gradient rectification neutralizes the time-dependent scaling factor that causes vastly inconsistent gradient magnitudes across timesteps. Experiments on 1.3B to 14B parameter models validate Flash-GRPO's effectiveness, demonstrating substantial training acceleration with consistent stability and state-of-the-art alignment quality.

2605.15949 2026-06-04 cs.RO

A Reproducible and Physically Feasible Dynamic Parameter Identification Framework for a Low-Cost Robot Arm

低成本机器人臂的可重复且物理可行的动力学参数辨识框架

Junji Oaki, Koki Yamane, Koki Inami, Sho Sakaino

发表机构 * Institute of Systems and Information Engineering, University of Tsukuba(系统与信息工程研究所,茨川大学)

AI总结 针对低成本机器人臂CRANE-X7,提出一种结合最小二乘、半定规划投影和闭环输入误差精化的可重复且物理可行的动力学参数辨识方法,并通过主成分分析和惯性矩阵正定性审核确保模型统计一致性与物理可行性。

详情
Comments
11 pages, 8 figures, 7 tables, 1 algorithm and 2 appendices
AI中文摘要

本文针对由模块化智能驱动器驱动的低成本机器人臂CRANE-X7,提出了一种可重复且物理可行的动力学参数辨识框架。为提高实际可辨识性,根据近似连杆对称性移除惯性积,将刚体模型从65个基础参数减少至39个。辨识运动是在实际关节限位下,由结构化的单关节和相邻关节基元手工设计而成。所提出的流程结合了预处理、基于逆动力学回归的普通最小二乘(OLS)、用于可行性恢复的条件半定规划(SDP)投影以及闭环输入误差(CLIE)精化。在共同的主成分分析(PCA)空间中分析来自40个结构化轨迹的候选解,以选择一个统计上中心的代表性模型。由于统计中心性本身不能保证物理可接受性,最终选定的模型需通过所有位姿下的惯性矩阵正定性审核,并在必要时通过局部化的后CLIE SDP救援步骤进行修正。实验表明,参数云从OLS到SDP再到CLIE逐渐变得更加集中,而最终接受的模型在保留的验证运动上保持了高预测精度。这些结果为低成本机器人平台获得统计一致且物理可行的动力学模型提供了一条实用途径。

英文摘要

This paper presents a reproducible and physically feasible dynamic parameter identification framework for CRANE-X7, a low-cost robot arm driven by modular smart actuators. To improve practical identifiability, products of inertia are removed according to approximate link symmetry, reducing the rigid-body model from 65 to 39 base parameters. Identification motions are hand-designed from structured single-joint and adjacent-joint primitives under practical joint-range limits. The proposed pipeline combines preprocessing, inverse-dynamics-regressor-based ordinary least squares (OLS), conditional semidefinite-programming (SDP) projection for feasibility recovery, and closed-loop input error (CLIE) refinement. Candidate solutions from 40 structured trajectories are analyzed in a common principal component analysis (PCA) space to select a statistically central representative model. Because statistical centrality alone does not ensure physical acceptability, the selected model is finally screened by an all-pose positive-definiteness audit of the inertia matrix and, when necessary, corrected by a localized post-CLIE SDP rescue step. Experiments show that the parameter cloud becomes progressively more concentrated from OLS to SDP and CLIE, while the final accepted model preserves high predictive accuracy on held-out validation motions. These results demonstrate a practical route to statistically coherent and physically feasible dynamic models for low-cost robot platforms.

2605.15741 2026-06-04 cs.CV

HyperDiT: Hyper-Connected Transformers for High-Fidelity Pixel-Space Diffusion

HyperDiT: 用于高保真像素空间扩散的超连接Transformer

Yu He, Lichen Ma, Zipeng Guo, Xinyuan Shan, Jingling Fu, Dong Chen, Junshi Huang, Yan Li

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 针对像素空间扩散模型中全局语义与细粒度细节难以兼顾的粒度困境,提出HyperDiT框架,通过超连接跨尺度交互和尺度感知旋转位置编码,结合预训练视觉基础模型的密集语义,在像素空间实现高保真生成,在ImageNet 256×256上取得1.56的SoTA FID。

详情
AI中文摘要

像素空间扩散模型绕过了变分自编码器(VAE)的重建瓶颈,但面临一个基本的“粒度困境”:捕捉全局语义需要大的块尺度,而生成高保真细节则要求细粒度的输入。为了解决这个问题,我们提出了HyperDiT,一个统一的框架,建立超连接跨尺度交互以桥接语义和像素流形。与通过AdaLN注入语义不同,HyperDiT利用交叉注意力机制,使细粒度标记能够全局查询多级语义锚点。为了解决多尺度交互过程中的空间不匹配问题,我们引入了尺度感知旋转位置编码(SA-RoPE),以确保不同块大小的标记之间精确的几何对齐。此外,我们加入了寄存器,从预训练的视觉基础模型(VFM)中学习密集语义,有效减少生成幻觉和伪影。大量实验表明,HyperDiT在像素空间内直接在ImageNet 256×256上实现了最先进的FID为1.56。通过将细粒度流与语义指导相结合,HyperDiT为高保真像素生成提供了一种优越的范式。

英文摘要

Pixel-space diffusion models bypass the reconstruction bottleneck of Variational Autoencoders (VAEs) but face a fundamental "granularity dilemma": capturing global semantics favors large patch scales, while generating high-fidelity details demands fine-grained inputs. To address this issue, we propose HyperDiT, a unified framework establishing Hyper-Connected Cross-Scale Interactions to bridge the semantic and pixel manifold. Diverging from injecting semantics by AdaLN, HyperDiT utilizes Cross-Attention mechanisms, enabling fine-grained tokens to query multi-level semantic anchors globally. To resolve the spatial mismatch during multi-scale interactions, we introduce Scale-Aware Rotary Position Embedding (SA-RoPE) to ensure precise geometric alignment among tokens of varying patch sizes. Furthermore, we incorporate Registers to learn the dense semantics from a pretrained Visual Foundation Model (VFM), effectively reducing generation hallucination and artifacts. Extensive experiments demonstrate that HyperDiT achieves state-of-the-art (SoTA) FID of $\mathbf{1.56}$ on ImageNet $256\times256$ directly within the pixel space. By combining the fine-grained stream with semantic guidance, HyperDiT offers a superior paradigm for high-fidelity pixel generation.

2605.15118 2026-06-04 cs.CR cs.CL

Talk is (Not) Cheap: A Taxonomy and Benchmark Coverage Audit for LLM Attacks

谈话(不)廉价:LLM攻击的分类法与基准覆盖审计

Karthik Raghu Iyer, Yazdan Jamshidi, Nicholas Bray, Alexey A. Shvets

发表机构 * Palo Alto Networks(帕洛阿尔托网络)

AI总结 提出一个基于STRIDE的4×6目标×技术矩阵框架,用于审计LLM攻击基准的集体覆盖,发现现有基准仅覆盖最多25%的威胁面,且存在命名碎片化和评估空白。

详情
AI中文摘要

我们引入了一个可重用的框架,用于审计LLM攻击基准是否共同覆盖威胁面:一个基于STRIDE的4×6目标×技术矩阵,该矩阵由从932篇arXiv安全研究(2023-2026)中提取的507叶分类法(401个数据填充叶和106个威胁模型衍生叶)构建而成。该矩阵支持基准外部验证——审计集体覆盖而非单个基准的一致性。将其应用于六个公开基准,发现三个主要框架(HarmBench、InjecAgent、AgentDojo)占据非重叠的单元格,最多覆盖矩阵的25%,而整个STRIDE威胁类别(服务中断、模型内部)缺乏标准化评估,尽管这些类别中已发表的攻击通过没有基准测试的机制实现了46倍令牌放大和96%的攻击成功率。包含2521个独特攻击组的语料库进一步揭示了普遍的命名碎片化(单个攻击最多有29种表面形式)以及集中在安全与对齐绕过上的严重问题,这些结构属性在较小规模下不可见。分类法、攻击记录和覆盖映射作为可扩展工件发布;随着新基准的出现,它们可以映射到同一矩阵上,使社区能够跟踪评估差距是否正在缩小。

英文摘要

We introduce a reusable framework for auditing whether LLM attack benchmarks collectively cover the threat surface: a 4$\times$6 Target $\times$ Technique matrix grounded in STRIDE, constructed from a 507-leaf taxonomy -- 401 data-populated and 106 threat-model-derived leaves -- of inference-time attacks extracted from 932 arXiv security studies (2023--2026). The matrix enables benchmark-external validation -- auditing collective coverage rather than individual benchmark consistency. Applying it to six public benchmarks reveals that the three primary frameworks (HarmBench, InjecAgent, AgentDojo) occupy non-overlapping cells covering at most 25\% of the matrix, while entire STRIDE threat categories (Service Disruption, Model Internals) lack any standardized evaluation, despite published attacks in these categories achieving 46$\times$ token amplification and 96\% attack success rates through mechanisms which no benchmark tests. The corpus of 2,521 unique attack groups further reveals pervasive naming fragmentation (up to 29 surface forms for a single attack) and heavy concentration in Safety \& Alignment Bypass, structural properties invisible at smaller scale. The taxonomy, attack records, and coverage mappings are released as extensible artifacts; as new benchmarks emerge, they can be mapped onto the same matrix, enabling the community to track whether evaluation gaps are closing.

2605.14091 2026-06-04 cs.CV

Venus-DeFakerOne: Unified Fake Image Detection & Localization

Venus-DeFakerOne: 统一假图检测与定位

GuangJian Team

发表机构 * Ant Group(蚂蚁集团)

AI总结 针对假图生成机制统一化而检测定位研究碎片化的问题,提出基于InternVL2和SAM2的数据驱动统一基础模型DeFakerOne,实现跨场景的图像级检测与像素级定位,在39个检测和9个定位基准上达到最优性能。

详情
AI中文摘要

近年来,生成式AI的快速发展从根本上重塑了图像伪造的范式,打破了文档编辑、自然图像篡改、DeepFake生成和全图像AIGC合成之间的传统界限。尽管伪造生成正趋于统一,但现有的假图检测与定位(FIDL)研究仍然碎片化。这造成了日益统一的伪造生成机制与领域特定检测范式之间的不匹配。弥合这一不匹配给FIDL带来了两个关键挑战:理解跨域伪影的迁移与干扰,以及构建一个高容量的统一基础模型以实现联合检测与定位。为应对这些挑战,我们提出了DeFakerOne,一个以数据为中心的统一FIDL基础模型,集成了InternVL2和SAM2。DeFakerOne能够在多种场景下同时进行图像级检测和像素级伪造定位。大量实验表明,DeFakerOne达到了最先进的性能,在39个伪造检测基准和9个定位基准上均优于基线。此外,该模型对真实世界扰动和最先进的生成器(如GPT-Image-2)表现出卓越的鲁棒性。最后,我们系统分析了数据缩放规律、跨域伪影迁移-干扰模式、细粒度监督的必要性以及原始分辨率伪影保留,突显了可扩展、鲁棒且统一的FIDL的设计原则。

英文摘要

In recent years, the rapid evolution of generative AI has fundamentally reshaped the paradigm of image forgery, breaking the traditional boundaries between document editing, natural image manipulation, DeepFake generation, and full-image AIGC synthesis. Despite this shift toward unified forgery generation, existing research in Fake Image Detection and Localization (FIDL) remains fragmented. This creates a mismatch between increasingly unified forgery generation mechanisms and the domain-specific detection paradigm. Bridging this mismatch poses two key challenges for FIDL: understanding cross-domain artifacts transfer and interference, and building a high-capacity unified foundation model for joint detection and localization. To address these challenges, we propose DeFakerOne, a data-centric, unified FIDL foundation model integrating InternVL2 and SAM2. DeFakerOne enables simultaneous image-level detection and pixel-level forgery localization across diverse scenarios. Extensive experiments demonstrate that DeFakerOne achieves state-of-the-art performance, outperforming baselines on 39 forgery detection benchmarks and 9 localization benchmarks. Furthermore, the model exhibits superior robustness against real-world perturbations and state-of-the-art generators such as GPT-Image-2. Finally, we provide a systematic analysis of data scaling laws, cross-domain artifacts transfer-interference patterns, the necessity of fine-grained supervision, and the original resolution artifacts preservation, highlighting the design principles for scalable, robust, and unified FIDL.

2605.14054 2026-06-04 cs.AI cs.CV

Bad Seeing or Bad Thinking? Rewarding Perception for Multimodal Reasoning

Bad Seeing or Bad Thinking? Rewarding Perception for Multimodal Reasoning

Haozhe Wang, Qixin Xu, Changpeng Wang, Taofeng Xue, Chong Peng, Wenhu Chen, Fangzhen Lin

发表机构 * University of Science and Technology of China(中国科学技术大学)

AI总结 提出一种基于强化学习的模态感知信用分配框架(MoCA),通过感知验证和结构化口头验证解决视觉语言模型中感知与推理的权衡问题,实现多任务性能提升。

详情
Comments
Accepted by ICML 2026 as Oral
AI中文摘要

实现稳健的感知-推理协同是高级视觉语言模型(VLM)的核心目标。最近的进展通过架构设计或智能体工作流追求这一目标。然而,这些方法通常受限于静态文本推理,或因外部智能体复杂性的巨大计算和工程负担而变得复杂。更糟糕的是,这种大量投入并未带来成比例的性能提升,常常在感知和推理上观察到“跷跷板效应”。这促使我们从根本上重新思考真正的瓶颈。在本文中,我们认为这种权衡的根本原因是模态信用分配中的模糊性:当VLM失败时,是由于感知缺陷(“坏视力”)还是逻辑缺陷(“坏思维”)?为解决这一问题,我们引入了一个强化学习框架,通过可靠地奖励感知保真度来改善感知-推理协同。我们明确地将生成过程分解为交错的感知和推理步骤。这种解耦使得能够对感知进行有针对性的监督。关键的是,我们引入了感知验证(PV),利用“盲推理”代理独立于推理结果奖励感知保真度。此外,为了在自由形式的VL任务中扩展训练,我们提出了结构化口头验证(Structured Verbal Verification),用结构化的算法执行替代高方差的LLM评判。这些技术被整合到模态感知信用分配(MoCA)机制中,该机制将奖励路由到特定的错误源——无论是坏视力还是坏思维——使单个VLM能够在广泛的任务谱系上同时获得性能提升。

英文摘要

Achieving robust perception-reasoning synergy is a central goal for advanced Vision-Language Models (VLMs). Recent advancements have pursued this goal via architectural designs or agentic workflows. However, these approaches are often limited by static textual reasoning or complicated by the significant compute and engineering burden of external agentic complexity. Worse, this heavy investment does not yield proportional gains, often witnessing a "seesaw effect" on perception and reasoning. This motivates a fundamental rethinking of the true bottleneck. In this paper, we argue that the root cause of this trade-off is an ambiguity in modality credit assignment: when a VLM fails, is it due to flawed perception ("bad seeing") or flawed logic ("bad thinking")? To resolve this, we introduce a reinforcement learning framework that improves perception-reasoning synergy by reliably rewarding the perception fidelity. We explicitly decompose the generation process into interleaved perception and reasoning steps. This decoupling enables targeted supervision on perception. Crucially, we introduce Perception Verification (PV), leveraging a "blindfolded reasoning" proxy to reward perceptual fidelity independently of reasoning outcomes. Furthermore, to scale training across free-form VL tasks, we propose Structured Verbal Verification, which replaces high-variance LLM judging with structured algorithmic execution. These techniques are integrated into a Modality-Aware Credit Assignment (MoCA) mechanism, which routes rewards to the specific source of error -- either bad seeing or bad thinking -- enabling a single VLM to achieve simultaneous performance gains across a wide task spectrum.

2605.13672 2026-06-04 cs.CV cs.AI cs.LG

SpurAudio: A Benchmark for Studying Shortcut Learning in Few-Shot Audio Classification

SpurAudio: 用于研究少样本音频分类中捷径学习的基准

Giries Abu Ayoub, Morad Tukan, Loay Mualem

发表机构 * Department of Computer Science, University of Haifa(海法大学计算机科学系) Independent Researcher(独立研究者) University of Stuttgart, Germany(斯图加特大学,德国) IMPRS-IS, Germany(智能系统国际Max Planck研究学校,德国)

AI总结 提出SpurAudio基准,通过控制音频中前景与背景的关联,评估少样本分类模型对虚假相关性的敏感性,发现现有方法在背景变化时性能显著下降。

详情
AI中文摘要

少样本分类(FSC)广泛用于从有限标注数据中学习,但大多数评估隐含假设目标概念与上下文线索无关。然而,在现实场景中,样本通常出现在丰富的上下文中,允许模型利用前景内容与背景信号之间的虚假相关性。虽然这种效应已在少样本图像分类中得到研究,但其在少样本音频分类中的作用仍 largely 未被探索,且现有音频基准对上下文结构的控制有限。我们引入了 SpurAudio,一个利用音频中前景事件和背景环境的自然可分离性,以支持对支持集和查询集之间的上下文偏移进行可控、多级评估的基准。使用该基准,我们表明许多最先进的少样本方法在背景相关性被破坏时遭受严重的性能下降,尽管在标准评估协议下达到相似的准确率。关键的是,即使在大型预训练音频基础模型中,这种脆弱性仍然存在,排除了骨干网络容量不足的解释。此外,在传统基准下看似相当的方法可能对虚假相关性表现出显著不同的敏感性,揭示了与特征表示在推理时如何与分类器头交互相关的系统性算法优势和脆弱性。这些发现为音频中少样本方法的行为提供了新的见解,并强调了在评估FSC模型时需要明确探测上下文依赖性的基准。

英文摘要

Few-shot classification (FSC) is widely used for learning from limited labeled data, yet most evaluations implicitly assume that target concepts are independent of contextual cues. In real-world settings, however, examples often appear within rich contexts, allowing models to exploit spurious correlations between foreground content and background signals. While such effects have been studied in few-shot image classification, their role in few-shot audio classification remains largely unexplored, and existing audio benchmarks offer limited control over contextual structure. We introduce SpurAudio, a benchmark that leverages the natural separability of foreground events and background environments in audio to enable controlled, multi-level evaluation of contextual shifts across support and query sets. Using this benchmark, we show that many state-of-the-art few-shot methods suffer severe performance degradation when background correlations are disrupted, despite achieving similar accuracy under standard evaluation protocols. Crucially, this vulnerability persists even in large pretrained audio foundation models, ruling out limited backbone capacity as an explanation. Moreover, methods that appear comparable under conventional benchmarks can exhibit markedly different sensitivity to spurious correlations, revealing systematic algorithmic strengths and vulnerabilities tied to how feature representations interact with classifier heads at inference time. These findings provide new insight into the behavior of few-shot methods in audio and highlight the need for benchmarks that explicitly probe context dependence when evaluating FSC models.

2605.00182 2026-06-04 cs.LG

Towards A Generative Protein Evolution Machine with DPLM-Evo

迈向生成式蛋白质进化机器:DPLM-Evo

Xinyou Wang, Liang Hong, Jiasheng Ye, Zaixiang Zheng, Yu Li, Shujian Huang, Quanquan Gu

发表机构 * Nanjing University(南京大学) CUHK(香港大学) Fudan University(复旦大学) ByteDance(字节跳动)

AI总结 提出DPLM-Evo,一种显式建模替换、插入和删除操作的进化离散扩散框架,在单序列设置下实现蛋白质突变效应预测的最优性能,并支持变长模拟进化与蛋白质编辑优化。

详情
Comments
A peer-reviewed version was accepted to ICML 2026
AI中文摘要

蛋白质在生物物理和功能约束下通过逐渐进化形成。蛋白质语言模型从大规模序列中学习丰富的进化约束,基于离散扩散的蛋白质语言模型(如DPLM)在理解和生成方面都很有前景。然而,现有的DPLM通常依赖于掩码扩散,这与一个简单的生物学直觉相矛盾:蛋白质通过累积的编辑进化,而不是从掩码中出现。因此,这些框架缺乏用于替换和插入/删除(indel)操作的显式预训练目标,限制了优化风格的后编辑和灵活的引导生成。为了解决这些限制,我们提出了DPLM-Evo,一种进化离散扩散框架,在去噪过程中显式预测替换、插入和删除操作。DPLM-Evo将上采样长度的潜在对齐空间与可变长度的观测序列空间解耦,使得indel感知生成变得可行。为了更好地将替换与真实进化对齐,我们进一步引入了一种上下文感知的进化噪声核,产生生物学信息丰富、上下文依赖的突变模式。在各种任务中,DPLM-Evo提升了序列理解能力,并在单序列设置下在ProteinGym上实现了最先进的突变效应预测性能。它还支持变长模拟进化,以及通过显式编辑轨迹对现有蛋白质进行后编辑/优化。

英文摘要

Proteins are shaped by gradual evolution under biophysical and functional constraints. Protein language models learn rich evolutionary constraints from large-scale sequences, and discrete diffusion-based protein language models~(\eg, DPLMs) are promising for both understanding and generation. However, existing DPLMs typically rely on masked diffusion that contradicts a simple biological intuition: proteins evolve through accumulated edits, not by emerging from masks. Consequently, these frameworks lack explicit pretraining objectives for substitution and insertion/deletion (indel) operations, limiting both optimization-style post-editing and flexible guided generation. To address these limitations, we present DPLM-Evo, an evolutionary discrete diffusion framework that explicitly predicts substitution, insertion, and deletion operations during denoising. DPLM-Evo decouples an upsampled-length latent alignment space from the variable-length observed sequence space, which makes indel-aware generation tractable. To better align substitutions with real evolution, we further introduce a contextualized evolutionary noising kernel that produces biologically informed, context-dependent mutation patterns. Across tasks, DPLM-Evo improves sequence understanding and achieves state-of-the-art mutation effect prediction performance on ProteinGym in the single-sequence setting. It also enables variable-length simulated evolution, and post-editing/optimization of existing proteins via explicit edit trajectories.

2304.10891 2026-06-04 cs.LG cs.AI cs.CV cs.RO cs.SY eess.SY

Transformer-Based Autonomous Driving Models and Deployment-Oriented Compression: A Survey

基于Transformer的自动驾驶模型与面向部署的压缩:综述

Juan Zhong, Yuhang Shi, Zukang Xu, Xi Chen

发表机构 * Renmin University of China(中国人民大学) Artificial Intelligence Innovation and Incubation Institute, Fudan University(复旦大学人工智能创新与孵化院) Shanghai Academy of AI for Science(上海人工智能科学研究院) Department of houmo.ai(houmo.ai部门)

AI总结 本文综述了基于Transformer的自动驾驶模型,并从部署角度分析了压缩与加速策略(如量化、剪枝、知识蒸馏等)如何影响模型设计、部署性、鲁棒性和安全性。

详情
AI中文摘要

基于Transformer的模型正成为自动驾驶的核心范式,因为它们能够捕捉感知、预测和规划中的长程空间依赖、多智能体交互和多模态上下文。然而,它们在真实车辆中的部署仍然困难,因为高容量注意力架构带来了显著的延迟、内存和能量开销。本综述回顾了具有代表性的基于Transformer的自动驾驶模型,并按任务角色、感知配置和架构设计进行组织。更重要的是,我们从面向部署的角度审视这些模型,分析效率约束如何在实际中重塑模型设计选择。我们进一步回顾了与基于Transformer的驾驶系统相关的压缩和加速策略,包括量化、剪枝、知识蒸馏、低秩近似和高效注意力,并讨论了它们的优势、局限性和任务依赖性。我们不将压缩视为孤立的后期处理步骤,而是强调其作为直接影响部署性、鲁棒性和安全性的系统级设计考虑。最后,我们指出了面向标准化、安全感知和硬件感知的高效自动驾驶系统评估的开放挑战和未来研究方向。

英文摘要

Transformer-based models are becoming a central paradigm in autonomous driving because they can capture long-range spatial dependencies, multi-agent interactions, and multimodal context across perception, prediction, and planning. At the same time, their deployment in real vehicles remains difficult because high-capacity attention-based architectures impose substantial latency, memory, and energy overhead. This survey reviews representative Transformer-based autonomous driving models and organizes them by task role, sensing configuration, and architectural design. More importantly, it examines these models from a deployment-oriented perspective and analyzes how efficiency constraints reshape model design choices in practice. We further review compression and acceleration strategies relevant to Transformer-based driving systems, including quantization, pruning, knowledge distillation, low-rank approximation, and efficient attention, and discuss their benefits, limitations, and task-dependent applicability. Rather than treating compression as an isolated post-processing step, we highlight it as a system-level design consideration that directly affects deployability, robustness, and safety. Finally, we identify open challenges and future research directions toward standardized, safety-aware, and hardware-conscious evaluation of efficient autonomous driving systems.