arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 4046
2604.21456 2026-05-12 cs.LG cs.RO

Tempered Sequential Monte Carlo for Trajectory and Policy Optimization with Differentiable Dynamics

Heng Yang

AI总结 本文提出了一种基于采样的框架,用于在可微动力学模型下进行有限时间轨迹和策略优化,将控制器设计转化为推断问题。核心方法是通过最小化KL散度正则化的轨迹成本期望,得到一个随着温度降低而集中于低成本解的“玻尔兹曼倾斜”控制器参数分布,并引入温控序列蒙特卡洛(TSMC)方法高效采样,结合哈密顿蒙特卡洛方法保持粒子多样性。实验表明,TSMC在多种轨迹和策略优化任务中表现优异,优于现有先进方法。

Comments Robotics: Science and Systems 2026

详情
英文摘要

We propose a sampling-based framework for finite-horizon trajectory and policy optimization under differentiable dynamics by casting controller design as inference. Specifically, we minimize a KL-regularized expected trajectory cost, which yields an optimal "Boltzmann-tilted" distribution over controller parameters that concentrates on low-cost solutions as temperature decreases. To sample efficiently from this sharp, potentially multimodal target, we introduce tempered sequential Monte Carlo (TSMC): an annealing scheme that adaptively reweights and resamples particles along a tempering path from a prior to the target distribution, while using Hamiltonian Monte Carlo rejuvenation to maintain diversity and exploit exact gradients obtained by differentiating through trajectory rollouts. For policy optimization, we extend TSMC via (i) a deterministic empirical approximation of the initial-state distribution and (ii) an extended-space construction that treats rollout randomness as auxiliary variables. Experiments across trajectory- and policy-optimization benchmarks show that TSMC is broadly applicable and compares favorably to state-of-the-art baselines.

2604.18901 2026-05-12 cs.LG cs.AI cs.CL

Harmful Intent as a Geometrically Recoverable Feature of LLM Residual Streams

Isaac Llorente-Saguer

AI总结 该研究探讨了大型语言模型中有害意图在残差流中的几何可恢复特征。通过在多种架构和对齐方式的模型中进行实验,发现有害意图在残差流激活中具有线性可分性,并提出了一种基于软AUC优化的方向提取方法,实现了高检测性能。研究还揭示了检测方向对提取协议的依赖性,表明不同处理方式可能影响有害意图的识别效果。

Comments 26 pages, 1(+6) figures, 4(+14) tables. Code at https://github.com/isaac-6/harm-directions

详情
英文摘要

Aligned language models refuse harmful instructions, but the representations through which they recognise such instructions are less well characterised than the behaviours they produce. Harmful intent is linearly separable from residual-stream activations across 12 models spanning four architectural families (Qwen2.5, Qwen3.5, Llama-3.2, Gemma-3) and three alignment variants (base, instruction-tuned, abliterated), with parameter scales from 0.5B to 1.3B and a within-family scale extension to 9B on Qwen3.5. A direction fitted from 100 labelled examples per class via Soft-AUC optimisation reaches mean effective AUROC 0.982 and TPR@1\%FPR 0.797, generalises to three held-out harm benchmarks and a hard-benign control, and matches its instruction-tuned counterpart within $\pm 0.003$ AUROC in abliterated variants from which the refusal mechanism has been removed. The supervised strategies all exceed AUROC 0.96, but their TPR@1\%FPR varies by more than ten times the AUROC gap; a deployed 9B safety classifier shows the same pattern at AUROC 0.94 and TPR 0.30, motivating low-FPR reporting as a default in safety-adjacent detection evaluation. Geometric measurements refine the picture. The recovered direction is concentrated within each extraction protocol but protocol-dependent across them: two pooling choices applied to the same chat-templated activations at the same residual-stream layer (max-pool over content tokens versus last-token at the post-instruction position) recover harm directions $73^\circ$ apart, and projecting one out leaves detection under either max-pool extraction essentially intact. Probing identifies a protocol-specific direction rather than a unique computational feature.

2604.17693 2026-05-12 cs.LG cs.AI cs.MA

COSAC: Counterfactual Credit Assignment in Sequential Cooperative Teams

Shripad Deshmukh, Jayakumar Subramanian, Raghavendra Addanki, Nikos Vlassis

AI总结 在顺序合作团队中,由于每个智能体按固定顺序行动并共享单一团队奖励,个体信用分配问题难以解决。本文提出COSAC,一种无需评论家的个体策略梯度方法,通过单次岭回归拟合团队奖励的加性分解,实现去耦个体学习信号,并利用当前策略的虚构延续计算个体反事实优势,从而避免额外环境调用。COSAC在理论分析中展示了其偏差和方差的可控性,并在实验中表现出更低的优越性均方误差和学习遗憾,尤其在AI2推理任务中收敛速度优于其他无评论家方法。

详情
英文摘要

In cooperative teams where agents act in a fixed order and share a single team-level reward (multi-agent language systems, sequential robotic tasks), per-agent credit assignment is under-determined. Critic-based approaches scale poorly as the number of agents grows owing to the costly maintenance of joint/factored critic(s), whereas the existing critic-free alternatives have other issues: common credit across agents that couples every agent's signal to teammate noise, importance-sampling corrections for upstream-update staleness that incur variance exponential in team size, or per-agent counterfactual replay that isolates each agent's effect at the price of extra environment or reward calls. We propose COSAC, a critic-free per-agent policy gradient for sequential cooperative teams. COSAC fits an additive per-agent decomposition of the team reward by a single ridge regression on the rollout batch (giving each agent a learning signal decoupled from teammate noise), and computes each agent's counterfactual advantage from fictitious continuations of the current policy (policy forward passes that replace both importance-sampling reweighting and per-agent environment replay, at no extra environment or reward cost). The estimator instantiates the Sequential Aristocrat Utility (SeqAU), our extension of Wolpert and Tumer's (2001) aristocrat utility to sequential teams. We prove bias and variance bounds on SeqAU credits that stay controlled as the team grows. Our controlled study on sequential bandits demonstrates that COSAC attains the lowest advantage MSE and consistently low learning regret across team sizes up to $K = 16$. On the AI2 Reasoning Challenge (ARC) task, where four Qwen3-0.6B agents reason in turn about a grade-school science question, COSAC attains faster convergence than the other critic-free baselines.

2604.14345 2026-05-12 cs.LG cs.AI stat.ML

PAC-MCTS: Bias-Aware Pruning for Robust LLM-Guided Search and Planning

Tianhao Qian

AI总结 在自主推理和具身规划中,随着搜索深度增加,候选动作空间呈指数级扩展,导致计算资源消耗巨大。本文提出PAC-MCTS,一种基于偏差感知的剪枝框架,通过将节点扩展建模为有界偏差下的最佳臂识别问题,推导出样本复杂度上界和信息论下界,明确了安全剪枝的条件。实验表明,PAC-MCTS在Blocksworld和ALFWorld任务中显著提升了搜索效率和鲁棒性,减少了API调用次数并提高了样本效率。

Comments 18 pages, 4 figures

详情
英文摘要

As search depth increases in autonomous reasoning and embodied planning, candidate action spaces expand exponentially, often exhausting computational budgets. While heuristic pruning is a critical countermeasure, existing approaches lack formal safety guarantees when guided by surrogate evaluators such as Large Language Models (LLMs), which exhibit systematic biases. We formulate node expansion as a localized Best-Arm Identification (BAI) problem under bounded bias $L$ and derive a sample complexity upper bound of $\mathcal{O}((Δ-4L)^{-2})$, identifying $Δ> 4L$ as the regime where safe elimination is feasible. We further establish an information-theoretic lower bound of $Ω((Δ-2L)^{-2})$ that characterizes the structural limits of biased exploration. Motivated by these results, we propose PAC-MCTS, a bias-aware pruning framework that dynamically adapts confidence bounds during search. Experiments on Blocksworld and ALFWorld demonstrate that PAC-MCTS consistently improves robustness and search efficiency over strong pruning baselines, achieving up to 78\% fewer API evaluations and over 3$\times$ higher sample efficiency under strict compute budgets. Ablation studies further validate the predicted degradation behavior as evaluator bias increases.

2604.13710 2026-05-12 cs.CV

SLQ: Bridging Modalities via Shared Latent Queries for Retrieval with Frozen MLLMs

Haoran Lou, Ziyan Liu, Chunxiao Fan, Yuexin Wu, Yue Ming, Hao Wu, Kai Zuo, Yibo Chen, Xu Tang

AI总结 本文提出了一种名为SLQ的参数高效调优框架,用于在不修改预训练多模态大语言模型(MLLM)参数的前提下,将其适配于检索任务。SLQ通过引入共享潜在查询(Shared Latent Queries)将文本和图像信息编码到统一的嵌入空间中,从而实现跨模态检索。此外,研究还构建了KARR-Bench基准测试集,用于评估模型在知识感知推理检索方面的能力。实验表明,SLQ在多个数据集上表现优异,验证了非侵入式适配策略在保持预训练语义表示的同时提升检索效果的有效性。

Comments Accepted to ICML-2026

详情
英文摘要

Multimodal Large Language Models (MLLMs) possess intrinsic reasoning and world-knowledge capabilities, yet adapting them for dense retrieval remains challenging. Existing approaches rely on invasive parameter updates, such as full fine-tuning and LoRA, which may disrupt the pre-trained semantic space and impair the structured knowledge essential for reasoning. To address this, we propose SLQ, a parameter-efficient tuning framework that adapts MLLMs for retrieval while keeping the backbone entirely frozen. SLQ introduces a small set of Shared Latent Queries that are appended to both text and image tokens, leveraging the model's native causal attention to aggregate multimodal context into a unified embedding space. Furthermore, to better evaluate retrieval beyond superficial pattern matching, we construct KARR-Bench, a benchmark designed for knowledge-aware reasoning retrieval. Extensive experiments show that SLQ outperforms full fine-tuning and LoRA on COCO and Flickr30K, while achieving competitive performance on MMEB and yielding substantial gains on KARR-Bench, validating that preserving the pre-trained representations via non-invasive adaptation is an effective strategy for MLLM-based retrieval. The code is available under: https://github.com/CnFaker/SLQ.

2604.13068 2026-05-12 cs.CL cs.LG

Detection Without Correction: A Robust Asymmetry in Activation-Based Hallucination Probing

Dip Roy, Rajiv Misra, Sanjay Kumar Singh, Anisha Roy

AI总结 该研究探讨了基于激活的线性探测方法在检测和纠正语言模型幻觉中的有效性。通过在多个不同规模的模型上进行实验,发现虽然线性探测在大模型中能有效检测幻觉,但其方向引导的激活调整却无法成功纠正幻觉。研究还指出,输出置信度方法在大模型上的检测性能优于激活探测,并强调了激活探测的独特价值在于其能够在输出前进行预生成标记,这为幻觉检测提供了新的部署场景。

详情
英文摘要

Activation-based linear probing is widely proposed as a method for both detecting and correcting hallucinations in autoregressive language models. We present an empirical study across seven models spanning 117M to 7B parameters and three architecture families (GPT-2, Pythia, Qwen-2.5) that documents a robust asymmetry: linear probes can detect hallucination signals with above-chance accuracy in larger models, but activation steering along the probe-derived direction fails to correct hallucinations in 7 of 7 models tested. We further find that output-confidence baselines outperform activation probes on raw detection AUC at every model above 410M parameters, with the gap reaching 0.157 AUC for Pythia-6.9B. The probe's distinguishing value is therefore not detection accuracy but temporal positioning: probe signals are accessible at position zero (before any output tokens are produced), enabling pre-generation flagging that output-based methods structurally cannot provide. The temporal signal is statistically significant in two of seven models (Pythia-1.4B, p = 0.012; Qwen2.5-7B, p = 0.038) and absent in models below 400M parameters and in the base-only Pythia-6.9B. We position these findings as a clean negative result for the dominant probing-as-detection-and-control research direction and as initial evidence that probe-based methods occupy a complementary deployment niche, namely pre-generation flagging, rather than competing with output-based detectors on raw accuracy.

2604.12592 2026-05-12 cs.CV

ELoG-GS: Dual-Branch Gaussian Splatting with Luminance-Guided Enhancement for Extreme Low-light 3D Reconstruction

Yuhao Liu, Dingju Wang, Ziyang Zheng

AI总结 本文提出了一种用于极端低光环境下高质量三维重建的方法ELoG-GS,旨在解决从退化多视角图像中恢复几何一致且逼真的三维场景的问题。该方法结合了基于学习的点云初始化和亮度引导的颜色增强策略,提升了高斯泼溅在低光条件下的稳定性和视觉真实感。实验表明,该方法在NTIRE 2026挑战赛基准上显著优于现有方法,在官方平台的测试中取得了较高的PSNR和SSIM指标。

Comments Our method achieved a ranking of 9 out of 148 participants in Track 1 of the NTIRE 3DRR Challenge, as reported on the official competition website: https://www.codabench.org/competitions/13854/

详情
英文摘要

This paper presents our approach to the NTIRE 2026 3D Restoration and Reconstruction Challenge (Track 1), which focuses on reconstructing high-quality 3D representations from degraded multi-view inputs. The challenge involves recovering geometrically consistent and photorealistic 3D scenes in extreme low-light environments. To address this task, we propose Extreme Low-light Optimized Gaussian Splatting (ELoG-GS), a robust low-light 3D reconstruction pipeline that integrates learning-based point cloud initialization and luminance-guided color enhancement for stable and photorealistic Gaussian Splatting. Our method incorporates both geometry-aware initialization and photometric adaptation strategies to improve reconstruction fidelity under challenging conditions. Extensive experiments on the NTIRE Track 1 benchmark demonstrate that our approach significantly improves reconstruction quality over the baselines, achieving superior visual fidelity and geometric consistency. The proposed method provides a practical solution for robust 3D reconstruction in real-world degraded scenarios. In the final testing phase, our method achieved a PSNR of 18.6626 and an SSIM of 0.6855 on the official platform leaderboard. Code is available at https://github.com/lyh120/FSGS_EAPGS.

2604.08243 2026-05-12 cs.CL

Self-Debias: Self-correcting for Debiasing Large Language Models

Xuan Feng, Shuai Zhao, Luwei Xiao, Tianlong Gu, Bo An

AI总结 尽管大语言模型(LLMs)展现出强大的推理能力,但其内在的社会偏见往往在思维链(CoT)过程中不断传播,导致“偏见传播”问题。为解决这一问题,本文提出了一种名为Self-Debias的渐进式框架,通过将去偏过程重新定义为一种策略性的资源再分配问题,使模型具备内在的自我纠正能力。该方法采用细粒度的轨迹级优化目标,并结合在线自我改进机制,仅需少量标注样本即可高效激活模型的自我修正能力,在去除偏见的同时保持其通用推理能力。

Comments ICML 2026

详情
英文摘要

Although Large Language Models (LLMs) demonstrate remarkable reasoning capabilities, inherent social biases often cascade throughout the Chain-of-Thought (CoT) process, leading to continuous "Bias Propagation". Existing debiasing methods primarily focus on static constraints or external interventions, failing to identify and interrupt this propagation once triggered. To address this limitation, we introduce Self-Debias, a progressive framework designed to instill intrinsic self-correction capabilities. Specifically, we reformulate the debiasing process as a strategic resource redistribution problem, treating the model's output probability mass as a limited resource to be reallocated from biased heuristics to unbiased reasoning paths. Unlike standard preference optimization which applies broad penalties, Self-Debias employs a fine-grained trajectory-level objective subject to dynamic debiasing constraints. This enables the model to selectively revise biased reasoning suffixes while preserving valid contextual prefixes. Furthermore, we integrate an online self-improvement mechanism utilizing consistency filtering to autonomously synthesize supervision signals. With merely 20k annotated samples, Self-Debias activates efficient self-correction, achieving superior debiasing performance while preserving general reasoning capabilities without continuous external oversight.

2604.07522 2026-05-12 cs.CV

Training-free Spatially Grounded Geometric Shape Encoding (Technical Report)

Yuhang He

AI总结 本文提出了一种无需训练的通用二维几何形状编码方法XShapeEnc,用于将任意空间定位的二维几何形状编码为具有可逆性、适应性和频率丰富性等特性的紧凑表示。该方法通过将形状分解为标准化几何和姿态向量,并利用正交Zernike基进行编码,结合频率传播操作增强表达能力,从而在多种形状感知任务中展现出良好的理论有效性与应用潜力。

Comments Training-Free 2D Geometric Shape Encoding

详情
英文摘要

Positional encoding has become the de facto standard for grounding deep neural networks on discrete point-wise positions, and it has achieved remarkable success in tasks where the input can be represented as a one-dimensional sequence. However, extending this concept to 2D spatial geometric shapes demands carefully designed encoding strategies that account not only for shape geometry and pose, but also for compatibility with neural network learning. In this work, we address these challenges by introducing a training-free, general-purpose encoding strategy, dubbed XShapeEnc, that encodes an arbitrary spatially grounded 2D geometric shape into a compact representation exhibiting five favorable properties, including invertibility, adaptivity, and frequency richness. Specifically, a 2D spatially grounded geometric shape is decomposed into its normalized geometry within the unit disk and its pose vector, where the pose is further transformed into a harmonic pose field that also lies within the unit disk. A set of orthogonal Zernike bases is constructed to encode shape geometry and pose either independently or jointly, followed by a frequency-propagation operation to introduce high-frequency content into the encoding. We demonstrate the theoretical validity, efficiency, discriminability, and applicability of XShapeEnc via extensive analysis and experiments across a wide range of shape-aware tasks and our self-curated XShapeCorpus. We envision XShapeEnc as a foundational tool for research that goes beyond one-dimensional sequential data toward frontier 2D spatial intelligence.

2604.07383 2026-05-12 cs.LG

SCOT: Multi-Source Cross-City Transfer with Optimal-Transport Soft-Correspondence Objective

Yuyao Wang, Min Yang, Meng Chen, Weiming Huang, Yilong Yin, Yongshun Gong

AI总结 本文研究了跨城市数据迁移中的标签稀缺问题,提出了一种基于最优运输的软对应目标(SCOT)框架,用于在不同城市之间建立显式的区域对应关系。SCOT 通过 Sinkhorn 算法实现非对称区域集的软匹配,并结合对比学习和循环重建正则化提升迁移效果与稳定性。该方法在多源迁移任务中表现出更高的准确性和鲁棒性,同时提供了可解释的对齐质量诊断信息。

Comments 34 pages, 19 figures, 23 tables

详情
英文摘要

Cross-city transfer improves prediction in label-scarce cities by leveraging labeled data from other cities, but it becomes challenging when cities adopt incompatible partitions and no ground-truth region correspondences exist. Existing approaches either rely on heuristic region matching, which is often sensitive to anchor choices, or perform distribution-level alignment that leaves correspondences implicit and can be unstable under strong heterogeneity. We propose SCOT, a cross-city representation learning framework that learns explicit soft correspondences between unequal region sets via Sinkhorn-based entropic optimal transport. SCOT further sharpens transferable structure with an OT-weighted contrastive objective and stabilizes optimization through a cycle-style reconstruction regularizer. For multi-source transfer, SCOT aligns each source and the target to a shared prototype hub using balanced entropic transport guided by a target-induced prototype prior. Across real-world cities and tasks, SCOT consistently improves transfer accuracy and robustness, while the learned transport couplings and hub assignments provide interpretable diagnostics of alignment quality.

2604.06689 2026-05-12 cs.LG stat.ML

Generative Cross-Entropy: A Strictly Proper Loss for Data-Efficient Classification

Qipeng Zhan, Zhuoping Zhou, Li Shen

AI总结 本文提出了一种名为生成交叉熵(GenCE)的新分类损失函数,旨在提高数据稀缺场景下的样本效率。该方法通过引入生成学习的思想,在不改变网络结构或拟合额外密度模型的前提下,对传统交叉熵损失进行改进。GenCE 基于贝叶斯重写条件似然,并在小批量近似下实现跨类样本的训练信号耦合,理论证明其在一定条件下是严格正确的评分规则,实验表明其在多个数据集和不同场景下均优于传统损失函数,且具有更好的概率校准和分布外检测能力。

详情
英文摘要

Cross-entropy (CE) is the default training loss for supervised classification, but its sample efficiency is limited when labels are scarce. Existing remedies primarily act on the data side, via augmentation, synthesis, or transfer from pretrained models; the training objective itself is rarely revisited. We revisit it here. Drawing on the classical observation that generative classifiers reach their asymptotic error with fewer samples than discriminative ones, we propose Generative Cross-Entropy (GenCE), a drop-in replacement for CE that introduces a generative learning principle into a standard discriminative network without altering the architecture or fitting a separate density model. GenCE follows from a Bayesian rewrite of the class-conditional likelihood and, in the mini-batch approximation, reduces to normalizing each sample's softmax score against the model's predictions on the batch, coupling the training signal across examples sharing a class. We extend the proper-scoring-rule framework to such non-local losses and prove that GenCE is strictly proper under a mild completeness condition: its population risk is uniquely minimized at the true posterior. Across three datasets, on two architectures and in both balanced small-data and class-imbalanced regimes, GenCE outperforms CE and other widely used losses, while also producing better-calibrated probabilities and stronger out-of-distribution detection.

2604.03687 2026-05-12 cs.CV

SciLT: Long-tailed Image Classification under Scientific Image Domains

Jiahao Chen, Bing Su

AI总结 本文研究了科学图像领域中的长尾分类问题,针对现有方法在自然图像上表现良好但在科学图像上效果有限的现象,提出了一种新的框架SciLT。该方法通过自适应特征融合和双监督学习,有效利用基础模型的多级表示,特别是在尾部类别上表现出色,实验表明SciLT在三个科学图像基准上均优于现有方法,为科学长尾分类提供了有力的解决方案和实用基准。

详情
英文摘要

Long-tailed recognition has benefited from foundation models and fine-tuning paradigms, yet existing studies and benchmarks are mainly confined to natural image domains, where pre-training and fine-tuning data share similar distributions. In contrast, scientific images exhibit distinct visual characteristics and supervision signals, raising questions about the effectiveness of fine-tuning foundation models in such settings. In this work, we investigate scientific long-tailed recognition under a purely visual and fine-tuning paradigm. Experiments on three scientific benchmarks show that fine-tuning foundation models yields limited gains, and reveal that penultimate-layer features play an important role, particularly for tail classes. Motivated by these findings, we propose SciLT, a framework that exploits multi-level representations through adaptive feature fusion and dual-supervision learning. By jointly leveraging penultimate- and final-layer features, SciLT achieves balanced performance across head and tail classes. Extensive experiments demonstrate that SciLT consistently outperforms existing methods, establishing a strong and practical baseline for scientific long-tailed recognition and providing valuable guidance for adapting foundation models to scientific data with substantial domain shifts.

2604.02608 2026-05-12 cs.LG

Steerable but Not Decodable: Function Vectors Operate Beyond the Logit Lens

Mohammed Suhail B Nadaf

AI总结 该研究探讨了激活空间中函数向量(FVs)的行为特性,发现它们在引导模型生成特定输出时表现良好,但无法通过传统的logit镜头解码出对应答案。研究在多个模型和任务上验证,表明FVs编码的是计算指令而非答案方向,且其引导能力与解码能力之间存在显著差异。这一发现挑战了线性表示假设,对基于词汇投影的安全监控方法提出了新的挑战。

Comments 43 pages, 14 figures, 34 tables

详情
英文摘要

Activation steering presupposes that task-relevant behaviors correspond to linear directions in activation space -- directions that should both steer the model and be readable along the unembedding. Function vectors (FVs), extracted as mean differences across ICL demonstrations, are the canonical test case; the prediction: steering and decoding succeed or fail together. Across 12 tasks, 6 models from 3 families, and 4,032 directed cross-template pairs, we find the opposite. FV steering routinely succeeds where the logit lens cannot decode the correct answer at any intermediate layer, while the converse -- decodable without steerable -- is nearly empty (3 of 72). The gap is not representational dialect. A diagonal tuned lens closes 1 of 14 steerable-not-decodable cases; a 2-layer MLP probe with a Hewitt \& Liang control closes 5 of 10 via nonlinearly encoded structure but leaves 5 invisible to every decoder tested. Even at $> 0.90$ steering accuracy, projecting the FV through the unembedding yields incoherent token distributions: FVs encode computational instructions, not answer directions. A model-family asymmetry sharpens the picture. Mistral FVs rewrite intermediate representations, while Llama and Gemma FVs steer the final output without leaving a logit-lens-visible trace, corroborated by three signals (post-steering deltas, activation-patching recovery, FV norm-transfer correlations). A previously reported negative cosine-transfer correlation dissolves at scale, adding at most $ΔR^2 = 0.011$ beyond task identity. These results decompose the linear representation hypothesis into linear decodability and linear steerability and show they come apart opposite to intuition, with implications for safety monitoring: vocabulary-projection tools are blind to FV-style interventions on widely deployed model families.

2604.02438 2026-05-12 cs.LG

Mitigating Data Scarcity in Spaceflight Applications for Offline Reinforcement Learning Using Physics-Informed Deep Generative Models

Alex E. Ballentine, Nachiket U. Bapat, Raghvendra V. Cowlagi

AI总结 本文研究了在航天飞行等数据稀缺场景下,如何通过引入物理信息的深度生成模型来缓解离线强化学习中的数据不足问题。作者提出了一种基于互信息的分裂变分自编码器(MI-VAE),该模型能够学习观测轨迹与物理模型预测之间的差异,并生成符合物理约束的合成数据。实验表明,使用MI-VAE生成的数据显著提升了强化学习策略的性能,展示了该方法在复杂、数据受限环境中的有效性与鲁棒性。

详情
英文摘要

The deployment of reinforcement learning (RL)-based controllers on physical systems is often limited by poor generalization to real-world scenarios, known as the simulation-to-reality (sim-to-real) gap. This gap is particularly challenging in spaceflight, where real-world training data are scarce due to high cost and limited planetary exploration data. Traditional approaches, such as system identification and synthetic data generation, depend on sufficient data and often fail due to modeling assumptions or lack of physics-based constraints. We propose addressing this data scarcity by introducing physics-based learning bias in a generative model. Specifically, we develop the Mutual Information-based Split Variational Autoencoder (MI-VAE), a physics-informed VAE that learns differences between observed system trajectories and those predicted by physics-based models. The latent space of the MI-VAE enables generation of synthetic datasets that respect physical constraints. We evaluate MI-VAE on a planetary lander problem, focusing on limited real-world data and offline RL training. Results show that augmenting datasets with MI-VAE samples significantly improves downstream RL performance, outperforming standard VAEs in statistical fidelity, sample diversity, and policy success rate. This work demonstrates a scalable strategy for enhancing autonomous controller robustness in complex, data-constrained environments.

2604.01532 2026-05-12 cs.AI

PHMForge: Evaluating LLM Agents on Industrial Prognostics through MCP-Native, Algorithm-Grounded Tools

Tianjun Feng, Yunfeng Chen, Chun-Yi Tsai, Yihan Sun, Ayan Das, Kaoutar El Maghraoui, Shuxin Lin, Dhaval Patel

AI总结 本文介绍了PHMForge,一个用于评估大型语言模型(LLM)代理在工业预测性维护(PHM)任务中表现的评测环境。该环境基于模型上下文协议(MCP)构建,集成了99个由领域专家编写的工业场景和39个封装了PHM算法的工具,能够准确区分代理的推理能力与工具使用能力。实验表明,尽管前沿LLM在调用工具方面表现较强,但在任务规划和工具序列执行上仍存在显著不足,突显了当前LLM在工业关键安全任务中的局限性。

Comments 23 pages, 3 figures

详情
英文摘要

LLM agents are beginning to invoke industrial asset-management tools through the Model Context Protocol (MCP), yet whether they can act reliably on this substrate for safety-critical \emph{Prognostics and Health Management (PHM)} is unanswered. Prior benchmarks conflate protocol fluency with reasoning, instrumentation failures with agent failures, and tool use with tool retrieval. We introduce \textbf{PHMForge}, an evaluation environment that closes each conflation. PHMForge ships 99 SME-authored scenarios across eight industrial asset classes spanning rotating equipment, aero-engines, and lithium-ion cells, on public datasets including NASA PCoE, served through 39 MCP-native tools wrapping published PHM algorithms (C-MAPSS, ISO~10816, Arrhenius capacity-fade models, time-series foundation models). Krippendorff's $α\in [0.74,\,0.82]$ on a 30-scenario stratified rotating-equipment/aero-engine sample; the battery extension is single-rater. Across three agentic frameworks and six LLM backbones, the strongest configuration reaches \textbf{80.8\% pass@1}, with the residual gap concentrated in orchestration and tool-sequencing errors. Crucially, an architectural ablation shows that replacing MCP execution with text-based Retrieval-Augmented Generation (RAG) over telemetry-equivalent evidence collapses Remaining Useful Life \emph{pass-all-3} from \textbf{100\% to 20\%} (5/5 vs.\ 1/5) on the battery class, exposing the structural limits of static retrieval for prognostic computation. Trajectory decomposition shows orchestration errors dominate failures across backbones, while schema-invalid tool calls concentrate in smaller open-weight models. Frontier LLMs are stronger at calling tools than at planning when to call them. PHMForge is open-sourced with deterministic evaluators, a public leaderboard, and a datasheet.

2604.01151 2026-05-12 cs.AI cs.LG cs.MA

Detecting Multi-Agent Collusion Through Multi-Agent Interpretability

Aaron Rose, Carissa Cullen, Sahar Abdelnabi, Philip Torr, Brandon Gary Kaplowitz, Christian Schroeder de Witt

AI总结 随着大型语言模型代理在多智能体系统中的应用增多,隐蔽协作带来的风险日益突出,而传统的监督方式难以有效检测。本文提出NARCBench基准,用于评估在环境分布偏移下的协作检测能力,并设计了五种基于模型内部表示的探针技术,从群体层面识别多智能体间的协作行为。实验表明,这些方法在多种模型和协作场景中表现出良好的检测性能,且检测效果与模型能力密切相关,为多智能体可解释性研究提供了新的思路和工具。

详情
英文摘要

As LLM agents are increasingly deployed in multi-agent systems, they introduce risks of covert coordination that may evade standard forms of human oversight. While linear probes on model activations have shown promise for detecting deception in single-agent settings, collusion is inherently a multi-agent phenomenon, and the use of internal representations for detecting collusion between agents remains unexplored. We introduce NARCBench, a benchmark for evaluating collusion detection under environment distribution shift, and propose five probing techniques that aggregate per-agent deception scores to classify scenarios at the group level, evaluated across four open-weight models (Qwen3-32B, Llama-3.1-70B, DeepSeek-R1 32B, GPT-OSS-20B) and six probe architectures. We frame this as a distributed anomaly detection problem, identifying three collusion signatures that map onto distinct anomaly types and detection paradigms. Every model reaches 1.00 AUROC in-distribution; on our strongest model (Llama-3.1-70B), our five probing techniques achieve 0.73 to 0.93 AUROC when transferred zero-shot to structurally different multi-agent scenarios and 0.99 to 1.00 on a steganographic blackjack card-counting task, with detection performance scaling with model capability. We find that no single probing technique dominates across all collusion types, consistent with the framework's prediction that different anomaly types require different detection paradigms. This work takes a step toward multi-agent interpretability: extending white-box inspection from single models to multi-agent contexts, where detection requires aggregating signals across agents. These results suggest that model internals provide a complementary signal to text-level monitoring for detecting multi-agent collusion. Code and data available at https://github.com/aaronrose227/narcbench.

2603.26089 2026-05-12 cs.LG cs.AI cs.CL

Selective Deficits in LLM Mental Self-Modeling in a Behavior-Based Test of Theory of Mind

Christopher Ackerman

AI总结 该研究探讨了大型语言模型(LLMs)在心智理论(Theory of Mind)任务中的表现,特别是其对自身和他人心理状态的建模能力。研究设计了一种基于行为的实验范式,要求模型在策略性行动中运用心理状态表示,而非仅进行描述。结果发现,2025年中之前发布的模型在所有任务中表现不佳,而近期模型在他人心理建模上达到人类水平,但在自我建模任务中仍需借助推理痕迹辅助,且表现出类似有限工作记忆的认知负荷效应。研究还揭示了推理模型通过策略性欺骗等机制完成任务的潜在机制。

Comments 22 pages, 13 figures, 1 table

详情
英文摘要

The ability to represent oneself and others as agents with knowledge, intentions, and belief states that guide their behavior - Theory of Mind - is a human universal that enables us to navigate - and manipulate - the social world. It is supported by our ability to form mental models of ourselves and others. Its ubiquity in human affairs entails that LLMs have seen innumerable examples of it in their training data and therefore may have learned to mimic it, but whether they have actually learned causal models that they can deploy in arbitrary settings is unclear. We therefore develop a novel experimental paradigm that requires that subjects form representations of the mental states of themselves and others and act on them strategically rather than merely describe them. We test a wide range of leading open and closed source LLMs released since 2024, as well as human subjects, on this paradigm. We find that 1) LLMs released before mid-2025 fail at all of our tasks, 2) more recent LLMs achieve human-level performance on modeling the cognitive states of others, and 3) even frontier LLMs fail at our self-modeling task - unless afforded a scratchpad in the form of a reasoning trace. We further demonstrate cognitive load effects on other-modeling tasks, offering suggestive evidence that LLMs are using something akin to limited-capacity working memory to hold these mental representations in mind during a single forward pass. Finally, we explore the mechanisms by which reasoning models succeed at the self- and other-modeling tasks, and show that they readily engage in strategic deception.

2603.22016 2026-05-12 cs.LG cs.AI cs.CL

ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention

Xinyan Wang, Xiaogeng Liu, Chaowei Xiao

AI总结 大型推理模型在得出正确答案后仍常进行冗余验证和重复尝试,导致计算资源浪费甚至推翻正确结论。本文提出ROM框架,通过实时检测推理过程中的关键边界并进行干预,有效减少冗余推理,提升推理效率。实验表明,ROM在多个基准数据集上均提升了模型准确率并显著缩短了响应长度,且具有良好的跨模型泛化能力。

Comments Code is available at https://github.com/SaFo-Lab/ROM

详情
英文摘要

Large Reasoning Models (LRMs) often reach a correct solution before their long Chain-of-Thought trace ends, yet continue with redundant verification, repeated attempts, or unnecessary exploration that wastes computation and can even overturn the correct answer. We frame this behavior as a latent productive-to-redundant transition and show that it is directly reflected in hidden states: around first-correct-solution (FCS) boundaries, late-layer representations separate efficient from overthinking tokens, while boundary-permutation and position-control baselines collapse. Based on this signal, we propose ROM, a model-agnostic streaming intervention framework that monitors frozen LRMs with a lightweight hidden-state detector and intervenes at well-formed reasoning boundaries. Counterfactual Self-Correction (CSC) augments supervision with balanced wrong to correct trajectories, preserving useful pre-FCS correction while labeling only post-FCS continuation as redundant. Across MATH500, GSM8K, AIME25, and MMLU-Pro, ROM improves the overall tradeoff on both Qwen3-8B and DeepSeek-R1-Distill-Qwen-32B (DS-32B): on Qwen3-8B, it raises accuracy from 74.47% to 74.78% and reduces response length from 4262 to 3107 tokens; on DS-32B, it raises accuracy from 68.60% to 68.72% and reduces response length from 3062 to 2319 tokens. The same FCS-derived supervision transfers across scale and training origin, suggesting a shared long-CoT boundary rather than a backbone-specific artifact. ROM is compatible with L1, removing another 20.9-21.6% tokens at zero accuracy loss. ROM also generalizes to open-ended MMLU-Pro (+1.56 pp, 35.4% shorter) and reduces wall-clock latency by 46.5%. Code is available at https://github.com/SaFo-Lab/ROM.

2603.22003 2026-05-12 cs.RO

VP-VLA: Visual Prompting as an Interface for Vision-Language-Action Models

Zixuan Wang, Yuxin Chen, Yuqi Liu, Jinhui Ye, Pengguang Chen, Changsheng Lu, Shu Liu, Bei Yu, Jiaya Jia

AI总结 本文提出了一种名为VP-VLA的视觉语言动作模型框架,旨在解决现有模型在空间精度和分布外场景鲁棒性方面的不足。该方法通过结构化的视觉提示接口将高层推理与底层执行解耦,其中“System 2 Planner”负责分解指令并生成目标对象和位置的视觉提示,而“System 1 Controller”则基于这些提示生成精确的底层控制动作。实验表明,VP-VLA在仿真和现实环境中均优于当前最先进的端到端模型。

Comments Project page: https://visualprompt-vla.github.io/

详情
英文摘要

Vision-Language-Action (VLA) models typically map visual observations and linguistic instructions directly to control signals. This "black-box" mapping forces a single forward pass to simultaneously handle instruction interpretation, spatial grounding, and low-level control, often leading to poor spatial precision and limited robustness in out-of-distribution scenarios. To address these limitations, we propose VP-VLA, a dual-system framework that decouples high-level reasoning and low-level execution via a structured visual prompting interface. Specifically, a "System 2 Planner" decomposes complex instructions into sub-tasks and identifies relevant target objects and goal locations. These spatial anchors are rendered directly within the native RGB observation space as modality-consistent visual prompts, such as crosshairs and bounding boxes. This avoids the modality mismatch introduced by dense masks, affordance maps, or additional control-specific representations. Guided by these prompts and enhanced by a novel auxiliary visual grounding objective during training, a "System 1 Controller" reliably generates precise low-level execution motions. Extensive experiments in simulation and real world demonstrate that VP-VLA surpasses state-of-the-art end-to-end baselines including QwenOFT and GR00T-N1.6. Project page: https://visualprompt-vla.github.io/

2603.19670 2026-05-12 cs.LG

Load--Reserve Wasserstein Propagation for Isotropic Diffusion Samplers

Zicheng Lyu, Zengfeng Huang

AI总结 本文研究了各向同性扩散采样器在逆时间传播过程中的稳定性问题,提出了一种基于负载-储备Wasserstein传播的分析方法。该方法通过认证的漂移剖面构建自适应传播界面,结合反射耦合与Hardy容量量化传播成本与收缩速率,能够更准确地反映扩散过程中的几何特性。实验表明,该方法在处理不同高度和结构的扩散窗口时具有更好的鲁棒性和解释性。

详情
英文摘要

Many Wasserstein analyses of diffusion samplers control reverse-time propagation by global stability summaries of the learned drift. These summaries can hide radial geometry: equal-height expansive regions of different width can yield different propagation costs. We give a profile-adapted propagation interface for scalar-isotropic reverse-SDE windows with certified learned-drift profiles. A certified lower radial profile is compiled into an affine-tail transportation cost: reflection coupling reduces stability to a one-dimensional slope budget, and Hardy capacity quantifies the load paid before a contractive tail reserve. The compiler yields an adapted cost, contraction rate, and retained tail slope. Score-modeling and solver residuals are treated as forcing inputs and propagate additively in the adapted Wasserstein distance. Quadratic Wasserstein error is reported only at terminal time, using the retained tail slope with tail, moment, or support information. Gaussian-smoothed denoising geometry supplies inverse-radius profiles for uniformly dissipative, bounded-amplitude, and common-covariance mixture windows. Fixed-height examples show that adverse height, even with eventual reserve, does not determine the certificate; barrier examples show that the load dependence is structural.

2603.19222 2026-05-12 cs.CV cs.LG

Spectrally-Guided Diffusion Noise Schedules

Carlos Esteves, Ameesh Makadia

AI总结 本文研究了如何为像素扩散模型设计更高效的噪声调度策略,以提升图像生成质量。作者提出了一种基于图像频谱特性的噪声调度方法,通过理论分析确定噪声水平的上下界,从而设计出更紧凑、更有效的噪声调度方案。实验表明,该方法在单阶段像素扩散模型中,尤其是在低步数生成场景下,显著提升了生成效果。

Comments Accepted to ICML'26

详情
英文摘要

Denoising diffusion models are widely used for high-quality image and video generation. Their performance depends on noise schedules, which define the distribution of noise levels applied during training and the sequence of noise levels traversed during sampling. Noise schedules are typically handcrafted and require manual tuning across different resolutions. In this work, we propose a principled way to design per-instance noise schedules for pixel diffusion, based on the image's spectral properties. By deriving theoretical bounds on the efficacy of minimum and maximum noise levels, we design ``tight'' noise schedules that eliminate redundant steps. During inference, we propose to conditionally sample such noise schedules. Experiments show that our noise schedules improve generative quality of single-stage pixel diffusion models, particularly in the low-step regime.

2603.18113 2026-05-12 cs.LG cs.AI

VC-Soup: Value-Consistency Guided Multi-Value Alignment for Large Language Models

Hefei Xu, Le Wu, Yu Wang, Min Hou, Han Wu, Zhen Zhang, Meng Wang

AI总结 随着大型语言模型在内容生成、交互和决策中的广泛应用,如何使其与人类价值观对齐已成为可信人工智能的核心目标。当需要对齐多个可能存在冲突的价值观时,现有方法如奖励重加权、基于提示的监督微调和模型合并仍面临训练成本高和对齐性能下降等挑战。为此,本文提出VC-Soup方法,从数据价值一致性角度出发,通过设计一致性度量指标过滤低一致性的偏好对,并结合参数合并策略,有效缓解价值观冲突,实现多价值之间的平衡对齐。

Comments 12 pages; Accepted to WWW2026

详情
英文摘要

As large language models (LLMs) increasingly shape content generation, interaction, and decision-making across the Web, aligning them with human values has become a central objective in trustworthy AI. This challenge becomes even more pronounced when aligning multiple, potentially conflicting human values. Although recent approaches, such as reward reweighting, prompt-based supervised fine-tuning, and model merging, attempt to tackle multi-value alignment, they still face two major limitations: (1) training separate models for each value combination is prohibitively expensive; (2) value conflicts substantially degrade alignment performance. These limitations make it difficult to achieve favorable trade-offs across diverse human values. To address these challenges, we revisit multi-value alignment from the perspective of value consistency in data and propose VC-soup, a data filtering and parameter merging framework grounded in value-consistent learning. We first design a value consistency metric based on the cosine similarity between the reward-gap vector of each preference pair and an all-ones vector, which quantifies its cross-value coherence. We then filter out low-consistency preference pairs in each value dataset and train on the remaining data to obtain smooth, value-consistent policy models that better preserve linear mode connectivity. Finally, we linearly combine these policies and apply Pareto filtering across values to obtain solutions with balanced multi-value performance. Extensive experiments and theoretical analysis demonstrate that VC-soup effectively mitigates conflicts and consistently outperforms existing multi-value alignment methods.

2603.16253 2026-05-12 cs.CV cs.AI

Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward Models

Junxin Wang, Dai Guan, Weijie Qiu, Zhihang Li, Yongbo Gai, Zhengyi Yang, Mengyu Zhou, Erchao Zhao, Xiaoxi Jiang, Guanjun Jiang

AI总结 本文研究了视觉语言过程奖励模型(VL-PRM)在推理过程中因感知与推理耦合而导致的评分偏差问题,提出了一种名为显式视觉前提验证(EVPV)的轻量验证接口,通过显式检查步骤所依赖的视觉前提的可靠性,将感知不确定性与逻辑评估解耦,从而提升推理评分的准确性与鲁棒性。实验表明,EVPV在多个视觉与多模态推理基准上有效提升了步骤验证和重排序性能,并验证了其改进源于前提验证的准确性而非偶然的提示效应。

Comments 27 pages, 4 figures, 10 tables. Evaluated on VisualProcessBench and six multimodal reasoning benchmarks (LogicVista, MMMU, MathVerse-VO, MathVision, MathVista, WeMath). Includes ablations and causal analysis via controlled constraint corruption. Code: https://github.com/Qwen-Applications/EVPV-PRM

详情
英文摘要

Vision-language process reward models (VL-PRMs) are increasingly used to score intermediate reasoning steps and rerank candidates under test-time scaling. However, they often function as black-box judges: a low step score may reflect a genuine reasoning mistake or simply the verifier's misperception of the image. This entanglement between perception and reasoning leads to systematic false positives (rewarding hallucinated visual premises) and false negatives (penalizing correct grounded statements), undermining both reranking and error localization. We introduce Explicit Visual Premise Verification (EVPV), a lightweight verification interface that conditions step scoring on the reliability of the visual premises a step depends on. The policy is prompted to produce a step-wise visual checklist that makes required visual facts explicit, while a constraint extractor independently derives structured visual constraints from the input image. EVPV matches checklist claims against these constraints to compute a scalar visual reliability signal, and calibrates PRM step rewards via reliability gating: rewards for visually dependent steps are attenuated when reliability is low and preserved when reliability is high. This decouples perceptual uncertainty from logical evaluation without per-step tool calls. Experiments on VisualProcessBench and six multimodal reasoning benchmarks show that EVPV improves step-level verification and consistently boosts Best-of-N reranking accuracy over strong baselines. Furthermore, injecting controlled corruption into the extracted constraints produces monotonic performance degradation, providing causal evidence that the gains arise from constraint fidelity and explicit premise verification rather than incidental prompt effects. Code is available at: https://github.com/Qwen-Applications/EVPV-PRM

2603.14694 2026-05-12 cs.CV cs.AI cs.LG

Robust Building Damage Detection in Cross-Disaster Settings Using Domain Adaptation

Asmae Mouradi, Shruti Kshirsagar

AI总结 该研究针对灾害管理中远程感知图像的建筑损毁检测问题,提出了基于领域自适应的两阶段集成方法,以解决不同地理区域间数据分布差异导致的模型性能下降问题。通过将xView2方法适配到Ida-BD数据集,并系统分析数据增强对分类性能的影响,实验表明领域自适应对提升模型鲁棒性至关重要。研究在未见过的Ida-BD测试集上实现了0.5552的Macro-F1分数,验证了该方法在跨灾害场景中的有效性与可靠性。

Comments accepted for publication IEEE ICHMS

详情
英文摘要

Rapid structural damage assessment from remote sensing imagery is essential for timely disaster response. Within human-machine systems (HMS) for disaster management, automated damage detection provides decision-makers with actionable situational awareness. However, models trained on multi-disaster benchmarks often underperform in unseen geographic regions due to domain shift - a distributional mismatch between training and deployment data that undermines human trust in automated assessments. We explore a two-stage ensemble approach using supervised domain adaptation (SDA) for building damage classification across four severity classes. The pipeline adapts the xView2 first-place method to the Ida-BD dataset using SDA and systematically investigates the effect of individual augmentation components on classification performance. Comprehensive ablation experiments on the unseen Ida-BD test split demonstrate that SDA is indispensable: removing it causes damage detection to fail entirely. Our pipeline achieves the most robust performance using SDA with unsharp-enhanced RGB input, attaining a Macro-F1 of 0.5552. These results underscore the critical role of domain adaptation in building trustworthy automated damage assessment modules for HMS-integrated disaster response.

2603.13224 2026-05-12 cs.CV cs.AI

Visual-ERM: Reward Modeling for Visual Equivalence

Ziyu Liu, Shengyuan Ding, Xinyu Fang, Xuanlang Dai, Penghui Yang, Jianze Liang, Jiaqi Wang, Kai Chen, Dahua Lin, Yuhang Zang

AI总结 该研究针对视觉到代码任务中模型重建图表、表格和SVG等结构化视觉输入时的挑战,提出了一种新的奖励模型Visual-ERM,用于提供细粒度、可解释且任务无关的反馈。该模型直接在渲染的视觉空间中评估生成质量,有效解决了现有奖励信号对细微视觉差异感知不足的问题。实验表明,结合Visual-ERM的强化学习方法显著提升了模型在图表、表格和SVG解析任务中的表现,并在新构建的基准VC-RewardBench上展现出优越的性能。

Comments Project: https://github.com/InternLM/Visual-ERM

详情
英文摘要

Vision-to-code tasks require models to reconstruct structured visual inputs, such as charts, tables, and SVGs, into executable or structured representations with high visual fidelity. While recent Large Vision Language Models (LVLMs) achieve strong results via supervised fine-tuning, reinforcement learning remains challenging due to misaligned reward signals. Existing rewards either rely on textual rules or coarse visual embedding similarity, both of which fail to capture fine-grained visual discrepancies and are vulnerable to reward hacking. We propose Visual Equivalence Reward Model (Visual-ERM), a multimodal generative reward model that provides fine-grained, interpretable, and task-agnostic feedback to evaluate vision-to-code quality directly in the rendered visual space. Integrated into RL, Visual-ERM improves Qwen3-VL-8B-Instruct by +8.4 on chart-to-code and yields consistent gains on table and SVG parsing (+2.7, +4.1 on average), and further strengthens test-time scaling via reflection and revision. We also introduce VisualCritic-RewardBench (VC-RewardBench), a benchmark for judging fine-grained image-to-image discrepancies on structured visual data, where Visual-ERM at 8B decisively outperforms Qwen3-VL-235B-Instruct and approaches leading closed-source models. Our results suggest that fine-grained visual reward supervision is both necessary and sufficient for vision-to-code RL, regardless of task specificity.

2603.09103 2026-05-12 cs.LG eess.SP

Probabilistic Hysteresis Factor Prediction for Electric Vehicle Batteries with Graphite Anodes Containing Silicon

Runyao Yu, Viviana Kleine, Philipp Gromotka, Thomas Rudolf, Adrian Eisenmann, Gautham Ram Chandra Mouli, Peter Palensky, Jochen L. Cremer

AI总结 本文研究了含硅的石墨负极电动汽车电池中电压滞后因子的预测问题,该问题对电池荷电状态(SoC)估计构成了挑战。为解决现有方法在不确定性量化和计算效率方面的不足,作者提出了一种基于数据驱动的概率滞后因子预测方法,并设计了数据标准化框架以适应不同工况下的驾驶循环。实验表明,该方法在多种未见过的车辆模型中具有良好的泛化能力,有助于推动先进电池技术的应用。

Comments 11 pages, 5 figures, 6 tables

详情
英文摘要

Batteries with silicon-graphite-based anodes, which offer higher energy density and improved charging performance, introduce pronounced voltage hysteresis, making state-of-charge (SoC) estimation particularly challenging. Existing approaches to modeling hysteresis rely on exhaustive high-fidelity tests or focus on conventional graphite-based lithium-ion batteries, without considering uncertainty quantification or computational constraints. This work introduces a data-driven approach for probabilistic hysteresis factor prediction, with a particular emphasis on applications involving silicon-graphite anode-based batteries. A data harmonization framework is proposed to standardize heterogeneous driving cycles across varying operating conditions. Statistical learning and deep learning models are applied to assess performance in predicting the hysteresis factor with uncertainties while considering computational efficiency. Extensive experiments are conducted to evaluate the generalizability of the optimal model configuration in unseen vehicle models through retraining, zero-shot prediction, fine-tuning, and joint training. By addressing key challenges in SoC estimation, this research facilitates the adoption of advanced battery technologies. A summary page is available at: https://runyao-yu.github.io/Porsche_Hysteresis_Factor_Prediction/

2603.09007 2026-05-12 cs.SD cs.AI

Gender Fairness in Audio Deepfake Detection: Performance and Disparity Analysis

Aishwarya Fursule, Shruti Kshirsagar, Anderson R. Avila

AI总结 本文研究了音频深度伪造检测中的性别公平性问题,分析了现有检测模型在不同性别上的性能差异。作者基于ASVspoof 5数据集,采用ResNet-18分类器,并结合四种音频特征进行评估,同时与基线模型AASIST进行对比。通过引入五种公平性指标,研究发现即使整体误识率差异较小,模型在性别上的错误分布仍存在显著差异,强调了传统性能指标的局限性,突出了公平性评估在构建更公正、可靠音频深度伪造检测系统中的重要性。

Comments Paper Accepted to IEEE CAI Conference 2026

详情
英文摘要

Audio deepfake detection aims to detect real human voices from those generated by Artificial Intelligence (AI) and has emerged as a significant problem in the field of voice biometrics systems. With the ever-improving quality of synthetic voice, the probability of such a voice being exploited for illicit practices like identity thest and impersonation increases. Although significant progress has been made in the field of Audio Deepfake Detection in recent times, the issue of gender bias remains underexplored and in its nascent stage In this paper, we have attempted a thorough analysis of gender dependent performance and fairness in audio deepfake detection models. We have used the ASVspoof 5 dataset and train a ResNet-18 classifier and evaluate detection performance across four different audio features, and compared the performance with baseline AASIST model. Beyond conventional metrics such as Equal Error Rate (EER %), we incorporated five established fairness metrics to quantify gender disparities in the model. Our results show that even when the overall EER difference between genders appears low, fairness-aware evaluation reveals disparities in error distribution that are obscured by aggregate performance measures. These findings demonstrate that reliance on standard metrics is unreliable, whereas fairness metrics provide critical insights into demographic-specific failure modes. This work highlights the importance of fairness-aware evaluation for developing a more equitable, robust, and trustworthy audio deepfake detection system.

2603.05495 2026-05-12 cs.LG math.OC

Cheap Thrills: Effective Amortized Optimization Using Inexpensive Labels

Khai Nguyen, Petros Ellinas, Anvita Bhagavathula, Priya L. Donti

AI总结 该论文提出了一种高效的优化方法,旨在利用低成本标签提升代理模型的训练效果。研究通过分阶段策略,先使用低成本但不完美的标签进行监督预训练,再结合基于优势损失的终止方案,最后通过自监督学习进一步优化模型。实验表明,该方法在多个复杂领域中实现了更快的收敛速度和更高的精度,同时大幅降低了计算成本。

Comments in submission

详情
英文摘要

To scale optimization and simulation, prior work has explored training machine-learning surrogates that map problem parameters to solutions inexpensively at inference time. Unfortunately, commonly used approaches, including supervised and self-supervised learning with either soft or hard feasibility enforcement, face inherent challenges such as reliance on expensive high-quality labels or difficult optimization landscapes. To address their trade-offs, we propose a novel framework that collects "cheap" imperfect labels, performs supervised model pretraining with a merit loss-based termination scheme, and finally refines the model through self-supervised learning to improve final performance. Empirical validation across challenging domains -- including nonconvex constrained optimization, power-grid operation, and stiff dynamical systems -- shows that this three-stage strategy yields faster convergence; improved accuracy, feasibility, and optimality; and up to 59x reductions in total offline computational cost. We further analyze why and when our framework improves surrogate model training, finding that (i) merit loss is an informative signal and (ii) only small numbers of cheap, inexact labels are needed to place the model in a favorable regime for self-supervised learning.

2603.05301 2026-05-12 cs.AI

Uniform Inductive Spatio-Temporal Kriging

Lewei Xie, Haoyu Zhang, Yulong Chen, Liangjun You, Zongxian Yang, Yifan Zhang

AI总结 本文研究了在观测数据不完整的情况下进行归纳时空克里金插值的问题,提出了一种名为UniSTOK的通用框架。该方法引入了可靠性引导信号调节(RSR)和残差偏差校准(RBC)两个模块,分别用于增强可靠观测信号并校准预测偏差,从而提升时空信号插值的准确性。实验表明,UniSTOK在多个真实数据集上显著优于现有方法。

详情
英文摘要

Inductive spatio-temporal kriging infers signals at unobserved locations from observed sensors, but real-world observations are often incomplete and exhibit block-wise missingness caused by failures, interruptions, or maintenance. A common impute-then-krige pipeline suffers from objective mismatch: better reconstruction on observed sensors does not necessarily improve downstream kriging, and value-dependent imputation bias can be propagated to unobserved nodes. We propose UniSTOK, a plug-and-play framework for inductive spatio-temporal kriging under incomplete observations. We first introduce Reliability-guided Signal Regulation (RSR), which estimates entry-wise reliability from temporal continuity and spatial support, and uses it to regulate the input signals so that reliable observations are emphasized while long-gap or weakly supported entries are suppressed before spatial propagation. We further introduce Residual Bias Calibration (RBC), which estimates value-conditioned residual prototypes after the main predictor converges and learns context-correction amplitudes to adaptively calibrate systematic over- or under-estimation in final kriging predictions. Extensive experiments on real-world datasets show that UniSTOK consistently improves multiple kriging backbones.

2603.04415 2026-05-12 cs.CL cs.CV

Dual Tuning for Reasoning Efficacy-Driven Data Curation in Multimodal LLM Training

Ruobing Zheng, Tianqi Li, Jianing Li, Qingpei Guo, Yi Yuan, Jingdong Chen

AI总结 该研究探讨了如何通过数据筛选提升多模态大语言模型在推理任务中的训练效果。提出了一种名为Dual Tuning的框架,能够评估给定任务下推理训练是否有效,并确定哪些数据更适合用于推理训练或直接答案训练。该方法通过联合分析模型能力、任务特性及推理数据质量,为多模态模型的训练数据选择和后续训练策略匹配提供了定量依据。

Comments Project Page: https://digital-avatar.github.io/ai/ThinkingBoundary/

详情
英文摘要

Reasoning post-training improves Large Language Models (LLMs) on complex tasks such as mathematics and coding, but its benefits across diverse multimodal tasks remains uncertain. The trend of releasing parallel "Instruct" and "Thinking" models by leading teams is both resource-intensive and user-unfriendly. Prior work finds that the gains from reasoning training are influenced by multiple factors, such as base model capabilities, task characteristics, and Chain-of-Thought (CoT) data quality. However, principled criteria for determining when reasoning post-training is beneficial and which data should support it are still lacking. In this paper, we propose Dual Tuning, a reasoning efficacy-driven data curation framework for multimodal LLMs training. Given a target task and a base model, Dual Tuning jointly evaluates whether the training data is beneficial and whether reasoning training with current CoT content yields positive gains over non-reasoning alternatives. We apply Dual Tuning across spatial, mathematical, and multi-disciplinary tasks, and further analyze how reinforcement learning and thinking patterns affect reasoning efficacy. The Dual Tuning results guide data curation by identifying data that benefit reasoning training, data better suited to direct-answer training, and data that are detrimental under both training modes. Our work provides quantitative criteria for selecting appropriate training data and matching post-training strategies.