arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 2352
2603.23677 2026-05-13 cs.CV cs.AI

Prototype Fusion: A Training-Free Multi-Layer Approach to OOD Detection

Shreen Gul, Mohamed Elmahallawy, Ardhendu Tripathy, Sanjay Madria

AI总结 本文提出了一种无需训练的多层特征融合方法,用于检测模型输入是否超出训练分布(OOD)。不同于现有方法主要依赖网络最后一层激活值,该方法利用中间层丰富的表征信息,通过聚合多层卷积块的特征并计算类均值嵌入,构建紧凑的类别原型。实验表明,该方法在多种架构上均表现出优越的OOD检测性能,显著提升了检测准确率并降低了误报率。

详情
英文摘要

Deep learning models are increasingly deployed in safety-critical applications, where reliable out-of-distribution (OOD) detection is essential to ensure robustness. Existing methods predominantly rely on the penultimate-layer activations of neural networks, assuming they encapsulate the most informative in-distribution (ID) representations. In this work, we revisit this assumption to show that intermediate layers encode equally rich and discriminative information for OOD detection. Based on this observation, we propose a simple yet effective model-agnostic approach that leverages internal representations across multiple layers. Our scheme aggregates features from successive convolutional blocks, computes class-wise mean embeddings, and applies L_2 normalization to form compact ID prototypes capturing class semantics. During inference, cosine similarity between test features and these prototypes serves as an OOD score--ID samples exhibit strong affinity to at least one prototype, whereas OOD samples remain uniformly distant. Extensive experiments on state-of-the-art OOD benchmarks across diverse architectures demonstrate that our approach delivers robust, architecture-agnostic performance and strong generalization for image classification. Notably, it improves AUROC by up to 4.41% and reduces FPR by 13.58%, highlighting multi-layer feature aggregation as a powerful yet underexplored signal for OOD detection, challenging the dominance of penultimate-layer-based methods. Our code is available at: https://github.com/sgchr273/cosine-layers.git.

2603.22000 2026-05-13 cs.LG stat.ML

CRPS-Optimal Binning for Univariate Conformal Regression

Paolo Toccaceli

AI总结 本文提出了一种基于分箱的非参数条件分布估计方法,通过将排序后的协变量观测划分为连续区间,并使用区间内的经验CDF作为预测分布。该方法通过最小化留一法连续排名概率分数(LOO-CRPS)确定最优分箱边界,并采用动态规划以高效求解全局最优分箱数。实验表明,该方法在保持预测区间覆盖率接近名义水平的同时,能显著缩小预测区间,优于多种主流的分层确认回归方法。

详情
英文摘要

We propose a method for non-parametric conditional distribution estimation based on partitioning covariate-sorted observations into contiguous bins and using the within-bin empirical CDF as the predictive distribution. Bin boundaries are chosen to minimise the total leave-one-out Continuous Ranked Probability Score (LOO-CRPS), which admits a closed-form cost function with $O(n^2 \log n)$ precomputation and $O(n^2)$ storage; the globally optimal $K$-partition is recovered by a dynamic programme in $O(n^2 K)$ time. Minimisation of within-sample LOO-CRPS turns out to be inappropriate for selecting $K$ as it results in in-sample optimism. We instead select $K$ by $K$-fold cross-validation of test CRPS, which yields a U-shaped criterion with a well-defined minimum. Having selected $K^*$ and fitted the full-data partition, we form two complementary predictive objects: the Venn prediction band and a conformal prediction set based on CRPS as the nonconformity score, which carries a finite-sample marginal coverage guarantee at any prescribed level $\varepsilon$. The conformal prediction is transductive and data-efficient, as all observations are used for both partitioning and p-value calculation, with no need to reserve a hold-out set. On real benchmarks against split-conformal competitors (Gaussian split conformal, CQR, CQR-QRF, and conformalized isotonic distributional regression), the method produces substantially narrower prediction intervals while maintaining near-nominal coverage.

2603.21887 2026-05-13 cs.RO

IGV-RRT: Prior-Real-Time Observation Fusion for Active Object Search in Changing Environments

Wei Zhang, Ping Gong, Yujie Wang, Leilei Yao, Minghui Bai, Rongfeng Ye, Yinchuan Wang, Yachao Wang, Chen Sun, Chaoqun Wang

AI总结 本文提出了一种名为IGV-RRT的概率规划框架,用于解决动态室内环境中目标物体导航(ObjectNav)的问题。该方法结合了基于场景先验的不确定性感知模型和视觉语言模型(VLM)的在线目标相关性估计,通过双层语义地图模块和实时规划器实现高效搜索。实验表明,该方法在复杂室内环境中能有效应对物体重新摆放的影响,显著提升了搜索效率和成功率。

详情
英文摘要

Object Goal Navigation (ObjectNav) in temporally changing indoor environments is challenging because object relocation can invalidate historical scene knowledge. To address this issue, we propose a probabilistic planning framework that combines uncertainty-aware scene priors with online target relevance estimates derived from a Vision Language Model (VLM). The framework contains a dual-layer semantic mapping module and a real-time planner. The mapping module includes an Information Gain Map (IGM) built from a 3D scene graph (3DSG) during prior exploration to model object co-occurrence relations and provide global guidance on likely target regions. It also maintains a VLM score map (VLM-SM) that fuses confidence-weighted semantic observations into the map for local validation of the current scene. Based on these two cues, we develop a planner that jointly exploits information gain and semantic evidence for online decision making. The planner biases tree expansion toward semantically salient regions with high prior likelihood and strong online relevance (IGV-RRT), while preserving kinematic feasibility through gradient-based analysis. Simulation and real-world experiments demonstrate that the proposed method effectively mitigates the impact of object rearrangement, achieving higher search efficiency and success rates than representative baselines in complex indoor environments.

2603.17510 2026-05-13 cs.RO

Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation

Tharun Sethuraman, Subham Agrawal, Nils Dengler, Jorge de Heuvel, Teena Hassan, Maren Bennewitz

AI总结 本文研究了如何使机器人在共享人类环境中的多目标导航中适应人类的上下文相关偏好。为了解决人类偏好通常以自然语言表达且依赖环境上下文的问题,作者提出了一种结合基础模型与多目标强化学习导航策略的框架,实现了对高阶语义信息的理解与低层运动控制的整合。该方法通过视觉语言模型提取环境上下文,利用大语言模型将用户反馈转化为可解释的行为规则,并将其映射为数值偏好向量以实时调整导航策略,实验表明该方法在多种室内场景中有效提升了机器人行为的适应性与可控性。

详情
英文摘要

Robots operating in human-shared environments must not only achieve task-level navigation objectives such as safety and efficiency, but also adapt their behavior to human preferences. However, as human preferences are typically expressed in natural language and depend on environmental context, it is difficult to directly integrate them into low-level robot control policies. In this work, we present a pipeline that enables robots to understand and apply context-dependent navigation preferences by combining foundational models with a Multi-Objective Reinforcement Learning (MORL) navigation policy. Thus, our approach integrates high-level semantic reasoning with low-level motion control. A Vision-Language Model (VLM) extracts structured environmental context from onboard visual observations, while Large Language Models (LLM) convert natural language user feedback into interpretable, context-dependent behavioral rules stored in a persistent but updatable rule memory. A preference translation module then maps contextual information and stored rules into numerical preference vectors that parameterize a pretrained MORL policy for real-time navigation adaptation. We evaluate the proposed framework through quantitative component-level evaluations, a user study, and real-world robot deployments in various indoor environments. Our results demonstrate that the system reliably captures user intent, generates consistent preference vectors, and enables controllable behavior adaptation across diverse contexts. Overall, the proposed pipeline improves the adaptability, transparency, and usability of robots operating in shared human environments, while maintaining safe and responsive real-time control.

2603.16849 2026-05-13 cs.LG

GIST: Gauge-Invariant Spectral Transformers for Scalable Graph Neural Operators

Mattia Rigotti, Nicholas Thumiger, Thomas Frick

AI总结 本文提出了一种名为GIST的可扩展图神经算子,用于解决不规则网格上神经算子在几何建模中的基本矛盾。GIST通过使用高效近似谱嵌入的成对内积来保持规范不变性,从而在保证计算复杂度为$\mathcal{O}(N)$的同时,实现了对网格分辨率和结构变化的鲁棒性。该方法在多个大规模网格数据集和标准图数据集上均取得了最先进的性能。

详情
英文摘要

Neural operators on irregular meshes face a fundamental tension. Spectral positional encodings, the natural choice for capturing geometry, require cubic-complexity eigendecomposition and inadvertently break gauge invariance through numerical solver artifacts; existing efficient approximations sacrifice gauge symmetry by design. Both failure modes break discretization invariance: models fail to transfer across mesh resolutions of the same domain, and similarly across different graphs of related structure in inductive settings. We propose GIST (Gauge-Invariant Spectral Transformer), a scalable neural operator that resolves this tension by restricting attention to pairwise inner products of efficient approximate spectral embeddings. We prove these inner products estimate an exactly gauge-invariant graph kernel at end-to-end $\mathcal{O}(N)$ complexity, and establish a formal connection between gauge invariance and discretization-invariant learning with bounded mismatch error. To our knowledge, GIST is the first scalable graph neural operator with a provable discretization-mismatch bound. Empirically, GIST sets state-of-the-art on the AirfRANS, ShapeNet-Car, DrivAerNet, and DrivAerNet++ mesh benchmarks (up to 750K nodes), and additionally matches strong baselines on standard graph benchmarks (e.g., 99.50% micro-F1 on PPI).

2603.15759 2026-05-13 cs.RO cs.AI cs.LG

Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation

Jacob Levy, Tyler Westenbroek, Kevin Huang, Fernando Palafox, Patrick Yin, Shayegan Omidshafiei, Dong-Ki Kim, Abhishek Gupta, David Fridovich-Keil

AI总结 该研究提出了一种名为 Simulation Distillation(SimDist)的框架,旨在通过模拟器预训练世界模型,以提高机器人在真实环境中的快速适应能力。核心方法是利用物理模拟器生成大量动作条件化的数据,预训练世界模型,然后在真实世界中仅更新模型的动力学部分,从而减少对大量真实数据的依赖。该方法在复杂操作和四足机器人运动任务中表现出色,相比现有方法具有更快的适应速度和更稳定的性能提升。

Comments Robotics: Science and Systems 2026

详情
英文摘要

Robot learning requires adaptation methods that improve reliably from limited, mixed-quality interaction data. This is especially challenging in long-horizon, contact-rich tasks, where end-to-end policy finetuning remains inefficient and brittle. World models offer a compelling alternative: by predicting the outcomes of candidate action sequences, they enable online planning through counterfactual reasoning. However, training action-conditioned robotic world models directly in the real world requires diverse data at impractical scale. We introduce Simulation Distillation (SimDist), a framework that uses physics simulators as a scalable source of action-conditioned robot experience. During pretraining, SimDist distills structural priors from the simulator into a world model that enables planning from raw real-world observations. During real-world adaptation, SimDist transfers the encoder, reward model, and value function learned in simulation, and updates only the latent dynamics model using real-world prediction losses. This reduces adaptation to supervised system identification while preserving dense, long-horizon planning signals for online improvement. Across contact-rich manipulation and quadruped locomotion tasks, SimDist rapidly improves with experience, while prior adaptation methods struggle to make progress or degrade during online finetuning. Project website and code: https://sim-dist.github.io

2603.15423 2026-05-13 cs.CL

Invisible failures in human-AI interactions

Christopher Potts, Moritz Sudhof

AI总结 该研究分析了10万个人与AI的交互数据,发现79%的AI失败是“隐形”的,即用户并未明显察觉问题。研究识别出八类典型的隐形失败模式,并发现这些模式具有系统性关联,表明AI在满足用户需求方面存在深层次问题。通过构建对比数据集,研究还发现尽管AI能力提升使失败率下降,但大多数失败仍属隐形,且失败类型分布保持稳定,表明该分类体系对AI系统监控具有重要价值。

详情
英文摘要

AI systems fail silently far more often than they fail visibly. In an analysis of 100K human-AI interactions from the WildChat dataset, we find that 79% of AI failures are invisible: something went wrong but the user gave no overt indication that there was a problem. These invisible failures cluster into eight archetypes that help us characterize where and how AI systems are failing to meet users' needs. In addition, the archetypes show systematic co-occurrence patterns indicating higher-level failure types. To address the question of whether these archetypes will remain relevant as AI systems become more capable, we also created and annotated a counterfactual dataset in which WildChat's 2024-era responses are replaced by those from three present-day frontier LMs. This analysis indicates that failure rates have dropped substantially, but that the vast majority of failures remain invisible in our sense, and the distribution of failure archetypes seems stable. Finally, we illustrate how the archetypes help us to identify systematic and variable AI limitations across different usage domains. Overall, we argue that our invisible failure taxonomy can be a key component in reliable failure monitoring for product developers, scientists, and policy makers. Our code and data are available at https://github.com/bigspinai/bigspin-invisible-failure-archetypes

2603.13988 2026-05-13 cs.AI cs.LG

Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning

Halimat Afolabi, Zainab Afolabi, Elizabeth Friel, Jude Roberts, Antonio Ji-Xu, Lloyd Chen, Egheosa Ogbomo, Emiliomo Imevbore, Phil Eneje, Wissal El Ouahidi, Aaron Sohal, Alisa Kennan, Shreya Srivastava, Anirudh Vairavan, Laura Napitu, Katie McClure

AI总结 本文研究了封闭源大型语言模型(如ChatGPT和Gemini)在医疗推理任务中的解释可信度问题,指出其生成的解释可能看似合理但并不反映真实的推理过程。为此,作者设计了三种基于扰动的探测方法,包括因果消融、位置偏差和提示注入,评估模型推理过程与预测结果之间的关联性,并结合人类评估分析模型解释的可信度与用户信任之间的关系。研究发现,模型的推理步骤往往不直接影响预测结果,且容易受到外部提示的影响,强调在医疗场景中评估模型时,除了准确性,可信度也应成为核心考量。

详情
Journal ref
Proceedings of Machine Learning Research, Vol. 297, pp. 1562-1591, 2026
英文摘要

Closed-source large language models (LLMs), such as ChatGPT and Gemini, are increasingly consulted for medical advice, yet their explanations may appear plausible while failing to reflect the model's underlying reasoning process. This gap poses serious risks as patients and clinicians may trust coherent but misleading explanations. We conduct a systematic black-box evaluation of faithfulness in medical reasoning among three widely used closed-source LLMs. Our study consists of three perturbation-based probes: (1) causal ablation, testing whether stated chain-of-thought (CoT) reasoning causally influences predictions; (2) positional bias, examining whether models create post-hoc justifications for answers driven by input positioning; and (3) hint injection, testing susceptibility to external suggestions. We complement these quantitative probes with a small-scale human evaluation of model responses to patient-style medical queries to examine concordance between physician assessments of explanation faithfulness and layperson perceptions of trustworthiness. We find that CoT reasoning steps often do not causally drive predictions, and models readily incorporate external hints without acknowledgment. In contrast, positional biases showed minimal impact in this setting. These results underscore that faithfulness, not just accuracy, must be central in evaluating LLMs for medicine, to ensure both public protection and safe clinical deployment.

2603.05947 2026-05-13 cs.CV

LucidNFT: LR-Anchored Multi-Reward Preference Optimization for Flow-Based Real-World Super-Resolution

Song Fei, Tian Ye, Sixiang Chen, Zhaohu Xing, Jianyu Lai, Lei Zhu

AI总结 本文提出了一种名为LucidNFT的多奖励强化学习框架,用于基于流匹配的现实场景图像超分辨率任务。该方法通过引入一种对退化不变且对语义幻觉敏感的LR参考评估器LucidConsistency,以及解耦的奖励归一化策略和大规模真实退化图像集LucidLR,有效解决了现有方法在保持低分辨率输入真实性与提升视觉质量之间的平衡问题。实验表明,LucidNFT在多个基准上提升了感知质量,同时保持了对真实低分辨率输入的一致性。

详情
英文摘要

Generative real-world image super-resolution (Real-ISR) can synthesize visually convincing details from severely degraded low-resolution (LR) inputs, yet its stochastic sampling makes a critical failure mode hard to avoid: outputs may look sharp but be unfaithful to the LR evidence, exhibiting semantic or structural hallucinations. Preference-based reinforcement learning (RL) is a natural fit because each LR input yields a rollout group of candidate restorations. However, effective alignment in Real-ISR is hindered by three coupled challenges: (i) the lack of an LR-referenced faithfulness signal that is robust to degradation yet sensitive to localized hallucinations, (ii) a rollout-group optimization bottleneck where scalarizing heterogeneous rewards before normalization compresses objective-wise contrasts and weakens DiffusionNFT-style reward-weighted updates, and (iii) limited coverage of real degradations, which restricts rollout diversity and preference signal quality. We propose LucidNFT, a multi-reward RL framework for flow-matching Real-ISR. LucidNFT introduces LucidConsistency, a degradation-invariant and hallucination-sensitive LR-referenced evaluator trained with content-consistent degradation pools and original-inpainted hard negatives; a decoupled reward normalization strategy that preserves objective-wise contrasts within each LR-conditioned rollout group before fusion; and LucidLR, a large-scale collection of real-world degraded images for robust RL fine-tuning. Extensive experiments show that LucidNFT improves perceptual quality on strong flow-based Real-ISR baselines while generally maintaining LR-referenced consistency across diverse real-world scenarios.

2602.22347 2026-05-13 cs.CV cs.AI

Enabling clinical use of foundation models for computational pathology

Audun L Henriksen, Ole-Johan Skrede, Lisa van der Schee, Enric Domingo, Karolina Cyll, Sepp de Raedt, Ilyá Kostolomov, Jennifer Hay, Wanja Kildal, Joakim Kalsnes, Robert W Williams, Manohar Pradhan, John Arne Nesheim, Hanne Askautrud, Maria Isaksen, Karmele Saez de Gordoa, Miriam Cuatrecasas, Joanne Edwards, TransSCOT group, Arild Nesbakken, Neil A Shepherd, Ian Tomlinson, Daniel-Christoph Wagner, Rachel Kerr, Tarjei Sveinsgjerd Hveem, Knut Liestøl, Yoshiaki Nakamura, Marco Novelli, Masaaki Miyo, Sebastian Försch, David N Church, Miangela M Lacle, David J Kerr, Andreas Kleppe

AI总结 该研究探讨了如何使基础模型在计算病理学中更适用于临床场景,解决了现有模型因捕捉扫描仪和预分析变异而影响下游任务性能的问题。研究提出在下游模型训练中引入新的鲁棒性损失函数,以减少对技术变异的敏感性,并通过大量临床病理图像实验验证了该方法的有效性。该方法在不重新训练基础模型的前提下,提升了模型的鲁棒性和分类准确性,有助于开发更适用于真实临床环境的深度学习系统。

详情
英文摘要

Foundation models for computational pathology are expected to facilitate the development of high-performing, generalisable deep learning systems. However, in addition to biologically relevant features, current foundation models also capture pre-analytic and scanner-specific variation that bias the predictions made by downstream task-specific models trained on these features. Here we show that introducing novel robustness losses during downstream model training reduces sensitivity to technical variability. A purpose-designed comprehensive experimentation setup with 27,042 whole-slide images from 6,155 patients is used to train thousands of models from the features of eight well-known foundation models for computational pathology. In addition to a substantial improvement in robustness, our approach improves classification accuracy by focusing on biologically relevant features. It mitigates robustness limitations of foundation models for computational pathology without retraining the foundation models themselves, enabling development of models that are more suitable in real-world clinical use.

2602.21625 2026-05-13 cs.RO

Tacmap: Bridging the Tactile Sim-to-Real Gap via Geometry-Consistent Penetration Depth Map

Lei Su, Zhijie Peng, Renyuan Ren, Shengping Mao, Juan Du, Kaifeng Zhang, Xuezhou Zhu

AI总结 本文提出了一种名为Tacmap的高保真、计算高效的触觉仿真框架,旨在解决视觉触觉传感器在机器人操作中面临的仿真到现实的差距问题。该方法通过统一的形变图表示,将仿真与现实世界在几何空间中对齐,利用体积穿透深度计算仿真中的3D接触体积,并通过自动化数据采集装置在现实世界中学习触觉图像到真实深度图的映射。实验表明,Tacmap在多种接触场景中表现出与实际测量高度一致的性能,并成功实现了从仿真到物理机器人的零样本迁移。

Comments 8 pages

详情
英文摘要

Vision-Based Tactile Sensors (VBTS) are essential for achieving dexterous robotic manipulation, yet the tactile sim-to-real gap remains a fundamental bottleneck. Current tactile simulations suffer from a persistent dilemma: simplified geometric projections lack physical authenticity, while high-fidelity Finite Element Methods (FEM) are too computationally prohibitive for large-scale reinforcement learning. In this work, we present Tacmap, a high-fidelity, computationally efficient tactile simulation framework anchored in volumetric penetration depth. Our key insight is to bridge the tactile sim-to-real gap by unifying both domains through a shared deform map representation. Specifically, we compute 3D intersection volumes as depth maps in simulation, while in the real world, we employ an automated data-collection rig to learn a robust mapping from raw tactile images to ground-truth depth maps. By aligning simulation and real-world in this unified geometric space, Tacmap minimizes domain shift while maintaining physical consistency. Quantitative evaluations across diverse contact scenarios demonstrate that Tacmap's deform maps closely mirror real-world measurements. Moreover, we validate the utility of Tacmap through an in-hand rotation task, where a policy trained exclusively in simulation achieves zero-shot transfer to a physical robot.

2602.15473 2026-05-13 cs.LG

POP: Prior-Fitted First-Order Optimization Policies

Jan Kobiolka, Christian Frey, Gresa Shala, Arlind Kadra, Erind Bedalli, Josif Grabocka

AI总结 本文提出了一种基于强化学习的元学习优化策略 POP,用于预测梯度下降中的自适应学习率。该方法通过优化轨迹中的上下文信息进行学习,并引入了新的奖励函数、函数缩放策略和先验分布以生成大量合成优化问题。实验表明,POP 在包含 43 个不同复杂度优化函数的基准测试中显著优于传统梯度优化方法,且无需任务特定调参即可实现良好的泛化能力。

Comments Under Review

详情
英文摘要

Gradient-based optimizers are highly sensitive to design choices in their adaptive learning rate mechanisms. To address this limitation, we introduce POP, a meta-learned Reinforcement Learning (RL) policy that predicts adaptive learning rates for gradient descent, conditioned on the contextual information provided in the optimization trajectory. Our method introduces a novel RL reward formulation, a new function-scaling strategy for in-distribution generalization, and a novel prior that is used to sample millions of synthetic optimization problems. We evaluate POP on an established benchmark including 43 optimization functions of various complexity, where it significantly outperforms gradient-based methods. Our evaluation demonstrates strong generalization capabilities without task-specific tuning.

2602.13690 2026-05-13 cs.LG

Physics Aware Neural Networks: Denoising for Magnetic Navigation

Aritra Das, Yashas Shende, Muskaan Chugh, Reva Laxmi Chauhan, Arghya Pathak, Debayan Gupta

AI总结 本文研究了在GPS不可用情况下利用地磁异常进行导航时的去噪问题,提出了一种基于物理约束的神经网络方法。该方法引入了无散度矢量场和E(3)等变性两个物理约束,确保学习到的地磁场符合麦克斯韦方程并具有正确的空间变换特性。通过生成合成数据集和对比多种神经网络结构,实验表明该方法在预测精度和物理合理性方面优于传统方法。

详情
英文摘要

Magnetic-anomaly navigation, leveraging small-scale variations in the Earth's magnetic field, is a promising alternative when GPS is unavailable or compromised. Airborne systems face a key challenge in extracting geomagnetic field data: the aircraft itself induces magnetic noise. Although the classical Tolles-Lawson model addresses this, it inadequately handles stochastically corrupted magnetic data required for navigation. To handle stochastic noise, we propose using two physics-based constraints: divergence-free vector fields and E(3)-equivariance. These ensure the learned magnetic field obeys Maxwell's equation and that outputs transform correctly with sensor position and orientation. The divergence-free constraint is implemented by training a neural network to output a vector potential A, with the magnetic field defined as its curl. For E(3)-equivariance, we use tensor products of geometric tensors represented via spherical harmonics with known rotational transformations. Enforcing physical consistency and restricting the admissible function space acts as an implicit regularizer that improves spatiotemporal performance. We present ablation studies evaluating each constraint alone and jointly across CNNs, MLPs, LTCs, and Contiformers. Continuous-time dynamics and long-term memory are critical for modelling magnetic time series; the Contiformer, which provides both, outperforms existing methods. To mitigate data scarcity, we generate synthetic datasets using the World Magnetic Model (WMM) and time-series conditional GANs, producing realistic, temporally consistent magnetic sequences across varied trajectories and environments. Experiments show that embedding these constraints significantly improves predictive accuracy and physical plausibility, outperforming classical and unconstrained deep learning approaches. Acknowledgement: This work was done in collaboration with Dirac Labs.

2602.12139 2026-05-13 cs.LG

Oscillators Are All You Need: Irregular Time Series Modelling via Damped Harmonic Oscillators with Closed-Form Solutions

Yashas Shende, Aritra Das, Reva Laxmi Chauhan, Arghya Pathak, Debayan Gupta

AI总结 该论文提出了一种基于阻尼谐振子模型的新型时间序列建模方法,用于处理非均匀时间间隔的数据。通过将Transformer中的键值对建模为受驱阻尼振子,并以正弦基展开查询,该方法将注意力机制解释为共振现象,从而在保持模型表达能力的同时,避免了传统ODE求解器的计算开销。该方法具有理论保证,能够在非均匀时间序列任务上实现高精度且高效的表现。

详情
英文摘要

Transformers excel at time series modelling through attention mechanisms that capture long-term temporal patterns. However, they assume uniform time intervals and therefore struggle with irregular time series. Neural Ordinary Differential Equations (NODEs) effectively handle irregular time series by modelling hidden states as continuously evolving trajectories. ContiFormers arxiv:2402.10635 combine NODEs with Transformers, but inherit the computational bottleneck of the former by using heavy numerical solvers. This bottleneck can be removed by using a closed-form solution for the given dynamical system - but this is known to be intractable in general! We obviate this by replacing NODEs with a novel linear damped harmonic oscillator analogy - which has a known closed-form solution. We model keys and values as damped, driven oscillators and expand the query in a sinusoidal basis up to a suitable number of modes. This analogy naturally captures the query-key coupling that is fundamental to any transformer architecture by modelling attention as a resonance phenomenon. Our closed-form solution eliminates the computational overhead of numerical ODE solvers while preserving expressivity. We prove that this oscillator-based parameterisation maintains the universal approximation property of continuous-time attention; specifically, any discrete attention matrix realisable by ContiFormer's continuous keys can be approximated arbitrarily well by our fixed oscillator modes. Our approach delivers both theoretical guarantees and scalability, achieving state-of-the-art performance on irregular time series benchmarks while being orders of magnitude faster. Acknowledgement: This work was done in collaboration with Dirac Labs.

2602.11126 2026-05-13 cs.LG

The Offline-Frontier Shift: Diagnosing Distributional Limits in Generative Multi-Objective Optimization

Stephanie Holly, Alexandru-Ciprian Zăvoianu, Siegfried Silber, Sepp Hochreiter, Werner Zellinger

AI总结 本文研究了离线多目标优化中生成方法的分布限制问题,指出尽管生成模型在超体积指标上表现良好,但在其他关键指标如代际距离上却明显落后于进化算法。研究发现,这种性能差异源于离线数据集与帕累托前沿之间的偏移,即“离线前沿偏移”现象,这构成了离线多目标优化的根本性限制。作者提出通过目标空间中的分布外采样来应对这一限制,并指出生成方法在目标分布上趋于保守,难以有效突破数据分布的边界。

详情
英文摘要

Offline multi-objective optimization (MOO) aims to recover Pareto-optimal designs given a finite, static dataset. Recent generative approaches, including diffusion models, show strong performance under hypervolume, yet their behavior under other established MOO metrics is less understood. We show that generative methods systematically underperform evolutionary alternatives with respect to other metrics, such as generational distance. We relate this failure mode to the offline-frontier shift, i.e., the displacement of the offline dataset from the Pareto front, which acts as a fundamental limitation in offline MOO. We argue that overcoming this limitation requires out-of-distribution sampling in objective space (via an integral probability metric) and empirically observe that generative methods remain conservatively close to the offline objective distribution. Our results position offline MOO as a distribution-shift--limited problem and provide a diagnostic lens for understanding when and why generative optimization methods fail.

2602.09587 2026-05-13 cs.CV cs.AI

MieDB-100k: A Comprehensive Dataset for Medical Image Editing

Yongfan Lai, Wen Qian, Bo Liu, Hongyan Li, Hao Luo, Fan Wang, Bohan Zhuang, Shenda Hong

AI总结 针对医学图像编辑领域高质量数据稀缺的问题,本文提出MieDB-100k,一个大规模、高质量且多样化的文本引导医学图像编辑数据集。该数据集从感知、修改和转换三个视角分类编辑任务,兼顾理解和生成能力,并通过专家模型与规则合成方法构建,经过严格人工审核确保临床准确性。实验表明,基于该数据集训练的模型在性能和泛化能力上均优于现有开源和商业模型,为医学图像编辑研究提供了重要基础。

详情
英文摘要

The scarcity of high-quality data remains a primary bottleneck in adapting multimodal generative models for medical image editing. Existing medical image editing datasets often suffer from limited diversity, neglect of medical image understanding and inability to balance quality with scalability. To address these gaps, we propose MieDB-100k, a large-scale, high-quality and diverse dataset for text-guided medical image editing. It categorizes editing tasks into perspectives of Perception, Modification and Transformation, considering both understanding and generation abilities. We construct MieDB-100k via a data curation pipeline leveraging both modality-specific expert models and rule-based data synthetic methods, followed by rigorous manual inspection to ensure clinical fidelity. Extensive experiments demonstrate that model trained with MieDB-100k consistently outperform both open-source and proprietary models while exhibiting strong generalization ability. We anticipate that this dataset will serve as a cornerstone for future advancements in specialized medical image editing.

2602.09368 2026-05-13 cs.RO

Certified Gradient-Based Contact-Rich Manipulation via Smoothing-Error Reachable Tubes

Wei-Chen Li, Glen Chou

AI总结 该论文研究了如何在接触丰富的操作任务中,通过平滑动力学模型并补偿由此产生的误差,实现基于梯度的控制器优化与安全保证。核心方法是在可微分仿真中对接触动力学和平滑几何进行平滑处理,并通过集合值偏差量化模型失配,结合分析可达集优化时变仿射反馈策略,从而在原始非平滑动力学下实现闭环系统的鲁棒约束满足。该方法在多个接触密集任务中验证了其有效性,表现出更低的安全违规率和更小的目标误差。

Comments Robotics: Science & Systems (RSS) 2026

详情
英文摘要

Gradient-based methods can efficiently optimize controllers by leveraging differentiable simulation and physical priors. However, contact-rich manipulation remains challenging because hybrid contact dynamics often produce discontinuous or vanishing gradients. Although smoothing the dynamics can restore informative gradients, the resulting model mismatch can cause controller failures when deployed on real systems. We address this trade-off by planning with smoothed dynamics while explicitly quantifying and compensating for the induced error, providing formal guarantees on safety and task completion under the original nonsmooth dynamics. Our approach applies smoothing to both contact dynamics and contact geometry within a differentiable simulator based on convex optimization, allowing us to characterize the deviation from the nonsmooth dynamics as a set-valued discrepancy. We incorporate this discrepancy into the optimization of time-varying affine feedback policies through analytical reachable sets, enabling robust constraint satisfaction for the closed-loop hybrid system while relying solely on the informative gradients of the smoothed model. By bridging differentiable simulation with set-valued robust control, our method produces affine feedback policies that respect the unilateral nature of contact. We evaluate our method on several contact-rich tasks, including planar pushing, object rotation, and in-hand dexterous manipulation, achieving certified constraint satisfaction with lower safety violations and smaller goal errors than baseline approaches.

2602.08813 2026-05-13 cs.LG

Robust Policy Optimization to Prevent Catastrophic Forgetting

Mahdi Sabbaghi, George Pappas, Adel Javanmard, Hamed Hassani

AI总结 本文研究了大型语言模型在多阶段微调过程中因后续更新导致的“灾难性遗忘”问题,即早期学习的行为(如安全性)可能被破坏。为解决这一问题,作者提出了一种名为FRPO的鲁棒强化学习框架,通过在策略的KL散度邻域内优化奖励,确保策略在后续微调时仍能保持稳定。实验表明,该方法在多个基础模型和下游任务中有效减少了安全性能的下降,同时保持了任务性能。

详情
英文摘要

Large language models are commonly trained through multi-stage post-training: first via RLHF, then fine-tuned for other downstream objectives. Yet even small downstream updates can compromise earlier learned behaviors (e.g., safety), exposing a brittleness known as catastrophic forgetting. This suggests standard RLHF objectives do not guarantee robustness to future adaptation. To address it, most prior work designs downstream-time methods to preserve previously learned behaviors. We argue that preventing this requires pre-finetuning robustness: the base policy should avoid brittle high-reward solutions whose reward drops sharply under standard fine-tuning. We propose Fine-tuning Robust Policy Optimization (FRPO), a robust RLHF framework that optimizes reward not only at the current policy, but across a KL-bounded neighborhood of policies reachable by downstream adaptation. The key idea is to ensure reward stability under policy shifts via a max-min formulation. By modifying GRPO, we develop an algorithm with no extra computation, and empirically show it substantially reduces safety degradation across multiple base models and downstream fine-tuning regimes (SFT and RL) while preserving downstream task performance. We further study a math-focused RL setting, demonstrating that FRPO preserves accuracy under subsequent fine-tuning.

2602.05830 2026-05-13 cs.AI cs.LG

Learning Compact Boolean Networks

Shengpu Wang, Yuhao Mao, Yani Zhang, Martin Vechev

AI总结 本文研究了如何学习结构紧凑且精度高的布尔网络,以应对资源受限场景下的高效推理需求。为解决布尔网络离散结构带来的学习难题,作者提出了三种互补的方法:一种无需参数的有效连接学习策略、一种利用空间局部性的紧凑卷积布尔架构,以及一种降低连续网络离散化精度损失的自适应量化方法。实验表明,该方法在多个视觉任务中实现了更优的精度-计算量权衡,相比现有方法在布尔运算数量上减少了高达47倍,并在FPGA上实现了更高的精度与更低的推理延迟。

详情
英文摘要

Floating-point neural networks dominate modern machine learning but incur substantial inference costs, motivating emerging interest in Boolean networks for resource-constrained deployments. Since Boolean networks use only Boolean operations, they can achieve nanosecond-scale inference latency. However, learning Boolean networks that are both compact and accurate remains challenging because of their discrete, combinatorial structure. In this work we address this challenge via three novel, complementary contributions: (i) a new parameter-free strategy for learning effective connections, (ii) a novel compact convolutional Boolean architecture that exploits spatial locality while requiring fewer Boolean operations than existing convolutional kernels, and (iii) an adaptive discretization procedure that reduces the accuracy drop incurred when converting a continuously relaxed network into a discrete Boolean network. Across standard vision benchmarks, our method improves the Pareto frontier over prior state-of-the-art methods, achieving higher accuracy with up to $47\times$ fewer Boolean operations. This advantage also extends to other modalities. Further, on an FPGA, our model on MNIST achieves 99.38\% accuracy with 6.48 ns latency, surpassing the prior state-of-the-art in both accuracy and runtime, while generating a $7\times$ smaller circuit. Code and models are available at https://github.com/eth-sri/CompactLogic.

2602.04476 2026-05-13 cs.CV

Vision-aligned Latent Reasoning for Multi-modal Large Language Model

Byungwoo Jeon, Yoonwoo Jeong, Hyunseok Lee, Minsu Cho, Jinwoo Shin

AI总结 尽管多模态大语言模型在多种理解任务上取得了进展,但在需要多步骤推理的问题上仍存在不足,主要原因是视觉信息在长上下文生成过程中逐渐稀释。为此,本文提出了一种名为Vision-aligned Latent Reasoning(VaLR)的推理框架,通过在每一步推理前动态生成与视觉对齐的潜在标记,引导模型基于潜在空间中的感知线索进行推理。实验表明,VaLR在多个需要长上下文理解和精确视觉感知的基准测试中表现优异,并在VSI-Bench上将性能从33.0%提升至52.9%,显著优于现有模型。

Comments Published as conference proceeding for ICML 2026. Last two authors advised equally

详情
英文摘要

Despite recent advancements in Multi-modal Large Language Models (MLLMs) on diverse understanding tasks, these models struggle to solve problems which require extensive multi-step reasoning. This is primarily due to the progressive dilution of visual information during long-context generation, which hinders their ability to fully exploit test-time scaling. To address this issue, we introduce Vision-aligned Latent Reasoning (VaLR), a simple, yet effective reasoning framework that dynamically generates vision-aligned latent tokens before each Chain of Thought reasoning step, guiding the model to reason based on perceptual cues in the latent space. Specifically, VaLR is trained to preserve visual knowledge during reasoning by aligning intermediate embeddings of MLLM with those from vision encoders. Empirical results demonstrate that VaLR consistently outperforms existing approaches across a wide range of benchmarks requiring long-context understanding or precise visual perception, while exhibiting test-time scaling behavior not observed in prior MLLMs. In particular, VaLR improves the performance significantly from 33.0% to 52.9% on VSI-Bench, achieving a 19.9%p gain over Qwen2.5-VL.

2602.02282 2026-05-13 cs.LG

MoLF: Mixture-of-Latent-Flow for Pan-Cancer Spatial Gene Expression Prediction from Histology

Susu Hu, Stefanie Speidel

AI总结 该研究提出了一种名为MoLF的生成模型,用于从组织学图像预测跨癌症类型的基因表达空间分布。MoLF通过条件流匹配目标,结合专家混合架构,将噪声映射到基因潜在空间,从而有效处理不同癌症类型的异质性。实验表明,MoLF在跨癌症基准测试中优于现有方法,并能在跨物种数据上实现零样本泛化,揭示了其对保守组织分子机制的捕捉能力。

Comments Accepted at Proceedings 43rd International Conference on Machine Learning, Seoul, South Korea

详情
Journal ref
Proceedings 43rd International Conference on Machine Learning 2026
英文摘要

Inferring spatial transcriptomics (ST) from histology enables scalable histogenomic profiling, yet current methods are largely restricted to single-tissue models. This fragmentation fails to leverage biological principles shared across cancer types and hinders application to data-scarce scenarios. While pan-cancer training offers a solution, the resulting heterogeneity challenges monolithic architectures. To bridge this gap, we introduce MoLF (Mixture-of-Latent-Flow), a generative model for pan-cancer histogenomic prediction. MoLF leverages a conditional Flow Matching objective to map noise to the gene latent manifold, parameterized by a Mixture-of-Experts (MoE) velocity field. By dynamically routing inputs to specialized sub-networks, this architecture effectively decouples the optimization of diverse tissue patterns. Our experiments demonstrate that MoLF establishes a new state-of-the-art, consistently outperforming both specialized and foundation model baselines on pan-cancer benchmarks. Furthermore, MoLF exhibits zero-shot generalization to cross-species data, suggesting it captures fundamental, conserved histo-molecular mechanisms.

2602.00767 2026-05-13 cs.LG cs.AI

BLOCK-EM: Preventing Emergent Misalignment via Latent Blocking

Muhammed Ustaomeroglu, Guannan Qu

AI总结 该研究探讨了在对语言模型进行细调时可能出现的“新兴对齐偏差”问题,即模型在学习目标行为的同时,可能产生不良的领域外行为。研究提出了一种机制性方法,通过识别并限制控制偏差行为的少量内部特征,有效抑制这种偏差,且不损害模型性能。实验表明,该方法在多个细调任务中可使偏差减少达95%,并通过多种验证方式确认了其有效性与机制的针对性。

Comments Accepted to ICML 2026

详情
英文摘要

Emergent misalignment can arise when a language model is fine-tuned on a narrowly scoped supervised objective: the model learns the target behavior, yet also develops undesirable out-of-domain behaviors. We investigate a mechanistic approach to preventing emergent misalignment by identifying a small set of internal features that reliably control the misaligned behavior and then discouraging the model from strengthening these features during fine-tuning. Across six fine-tuning domains, blocking (i.e., constraining) a fixed set of features achieves up to 95\% relative reduction in emergent misalignment with no degradation in model quality or target-task performance. We strengthen validity with disjoint selection/evaluation splits, multiple independent judges, multiple random seeds for key settings, quality metrics, and extensive ablations demonstrating that the reduction in misalignment is specific to the identified mechanism. We also characterize a limiting regime in which misalignment re-emerges under prolonged fine-tuning, present evidence consistent with rerouting through alternative features or layers, and evaluate modifications that partially restore the misalignment-blocking effect. Overall, our results show that targeted training-time constraints on internal mechanisms can mitigate emergent misalignment without degrading target-task performance.

2602.00297 2026-05-13 cs.LG

From Observations to States: Latent Time Series Forecasting

Jie Yang, Yifan Hu, Yuante Li, Kexin Zhang, Kaize Ding, Philip S. Yu

AI总结 该论文研究了时间序列预测中的潜在表示悖论问题,即模型在预测准确的同时往往学习到时间无序的潜在表示。为此,作者提出了一种新的方法——潜在时间序列预测(LatentTSF),通过将观测数据映射到潜在状态空间并在此空间中进行预测,使模型能够学习到更结构化的时序动态。实验表明,该方法有效缓解了潜在混沌问题,在预测精度和表示质量上均取得显著提升。

Comments Accepted at ICML 2026

详情
英文摘要

Deep learning has achieved strong performance in Time Series Forecasting (TSF). However, we identify a critical representation paradox, termed Latent Chaos: models with accurate predictions often learn latent representations that are temporally disordered and lack continuity. We attribute this to the dominant observation-space forecasting paradigm, where minimizing point-wise errors on noisy and partially observed data encourages shortcut solutions instead of the recovery of underlying system dynamics. To address this, we propose Latent Time Series Forecasting (LatentTSF), a paradigm that shifts TSF from observation regression to latent state prediction. LatentTSF employs an AutoEncoder to project each observation into a learned latent state space and performs forecasting entirely in this space, allowing the model to focus on learning structured temporal dynamics. We provide an information-theoretic analysis showing that the latent objectives can be motivated as surrogates for maximizing mutual information between predicted and ground-truth latent states and future observations. Extensive experiments on widely-used benchmarks confirm that LatentTSF effectively mitigates latent chaos, yielding consistent improvements in both forecasting accuracy and representation quality. Our code is available at https://github.com/Muyiiiii/LatentTSF.

2601.12912 2026-05-13 cs.AI

Human Emotion Verification by Action Languages via Answer Set Programming

Andreas Brännström, Juan Carlos Nieves

AI总结 本文提出了一种基于答案集编程(ASP)和状态转移系统的动作语言C-MT,用于描述人类心智状态在可观测动作序列下的演变过程。该语言结合情绪评估理论等心理学理论,将情绪等心智状态形式化为多维配置,并引入新的因果规则以控制行为对心智状态的影响,从而实现对心智状态转移的精确建模与验证。该框架支持对不同心理原理下心智变化动态的比较分析,为情绪验证等应用提供了有力的逻辑编程工具。

Comments Under consideration in Theory and Practice of Logic Programming (TPLP)

详情
英文摘要

In this paper, we introduce the action language C-MT (Mind Transition Language). It is built on top of answer set programming (ASP) and transition systems to represent how human mental states evolve in response to sequences of observable actions. Drawing on well-established psychological theories, such as the Appraisal Theory of Emotion, we formalize mental states, such as emotions, as multi-dimensional configurations. With the objective to address the need for controlled agent behaviors and to restrict unwanted mental side-effects of actions, we extend the language with a novel causal rule, forbids to cause, along with expressions specialized for mental state dynamics, which enables the modeling of principles for valid transitions between mental states. These principles of mental change are translated into transition constraints, and properties of invariance, which are rigorously evaluated using transition systems in terms of so-called trajectories. This enables controlled reasoning about the dynamic evolution of human mental states. Furthermore, the framework supports the comparison of different dynamics of change by analyzing trajectories that adhere to different psychological principles. We apply the action language to design models for emotion verification. Under consideration in Theory and Practice of Logic Programming (TPLP).

2601.09448 2026-05-13 cs.SD cs.AI

One Prompt, Many Sounds: Modeling Listener Variability in LLM-Based Equalization

Ioannis Stylianou, Jon Francombe, Pablo Martinez-Nuevo, Sven Ewan Shepstone, Zheng-Hua Tan

AI总结 本文提出了一种基于大语言模型(LLM)的音频均衡方法,通过自然语言提示映射到均衡设置,实现了对声音系统的对话式控制。该方法利用受控听音实验收集的数据,结合上下文学习和参数高效微调技术,使模型能够可靠地对齐人群偏好的均衡设置。实验结果表明,与随机采样和静态预设基线相比,该方法在分布对齐方面有显著提升,展示了LLM作为“人工均衡器”的潜力,为更易用、上下文感知和专家级的音频调音方法提供了新方向。

Comments 13 pages, 15 figures, 2 tables, IEEE JSTSP submission

详情
英文摘要

Conventional audio equalization is a static process that requires manual and cumbersome adjustments to adapt to changing listening contexts (e.g., mood, location, or social setting). In this paper, we introduce a Large Language Model (LLM)-based alternative that maps natural language text prompts to equalization settings. This enables a conversational approach to sound system control. By utilizing data collected from a controlled listening experiment, our models exploit in-context learning and parameter-efficient fine-tuning techniques to reliably align with population-preferred equalization settings. Our evaluation methods, which leverage distributional metrics that capture users' varied preferences, show statistically significant improvements in distributional alignment over random sampling and static preset baselines. These results indicate that LLMs could function as "artificial equalizers," contributing to the development of more accessible, context-aware, and expert-level audio tuning methods.

2512.24985 2026-05-13 cs.CV cs.AI cs.LG cs.RO

DarkQA: Benchmarking Vision-Language Models on Visual-Primitive Question Answering in Low-Light Indoor Scenes

Yohan Park, Hyunwoo Ha, Wonjun Jo, Tae-Hyun Oh

AI总结 本文提出DarkQA,一个用于评估视觉语言模型在低光室内场景下视觉原语问答能力的开源基准。该基准通过多级光照控制生成9,400个可验证的问题-图像对,模拟真实光照下降和传感器噪声,揭示了现有模型在低光条件下的性能退化问题。研究还系统评估了多种视觉语言模型和低光图像增强方法,展示了DarkQA在分析模型鲁棒性方面的有效性。

Comments This work has been submitted to the IEEE for possible publication

详情
英文摘要

Vision Language Models (VLMs) are increasingly adopted as central reasoning modules for embodied agents. Existing benchmarks evaluate their capabilities under ideal, well-lit conditions, yet robust 24/7 operation demands performance under a wide range of visual degradations, including low-light conditions at night or in dark environments, a core necessity that has been largely overlooked. To address this underexplored challenge, we present DarkQA, an open-source benchmark for evaluating perceptual primitives under multi-level low-light conditions in embodied scenarios. DarkQA evaluates single-view egocentric observations across controlled degradation levels, isolating low-light perceptual failures before they are entangled with complex embodied tasks. The benchmark contains 9.4K deterministically generated and verifiable question-image pairs spanning five visual-primitive families. A key design feature of DarkQA is its physical fidelity: visual degradations are modeled in linear RAW space, simulating physics-based illumination drop and sensor noise followed by an ISP-inspired rendering pipeline; we further validate the synthesis against real paired low-light camera data. We evaluate representative VLMs and Low-Light Image Enhancement (LLIE) preprocessing methods. Results show consistent VLM degradation under low illumination and sensor noise, while LLIE provides severity-dependent but unstable recovery. We demonstrate the utility of DarkQA by evaluating a wide range of state-of-the-art VLMs and Low-Light Image Enhancement (LLIE) models, and systematically reveal VLMs' limitations when operating under these challenging visual conditions. Our code and benchmark dataset will be released upon acceptance. Project website: https://darkqa-benchmark.github.io

2512.20865 2026-05-13 cs.LG cs.SY eess.SY

Robustness Certificates for Neural Networks against Adversarial Attacks

Sara Taheri, Mahalakshmi Sabanayagam, Debarghya Ghoshdastidar, Majid Zamani

AI总结 随着机器学习在安全关键领域中的广泛应用,对抗性威胁,尤其是数据投毒攻击,带来的风险日益加剧。本文提出了一种基于离散时间动态系统建模的正式鲁棒性认证框架,将梯度训练过程形式化,并借鉴控制理论中的屏障证书概念,为模型在最坏情况下的鲁棒性提供形式化保证。该方法通过神经网络参数化屏障证书,并结合场景凸优化推导出泛化性保证,首次实现了对训练时和测试时攻击的统一形式化认证,实验表明其在多个数据集上具有良好的鲁棒性认证效果。

详情
Journal ref
IEEE Open Journal of Control Systems, 2026
英文摘要

The increasing use of machine learning in safety-critical domains amplifies the risk of adversarial threats, especially data poisoning attacks that corrupt training data to degrade performance or induce unsafe behavior. Most existing defenses lack formal guarantees or rely on restrictive assumptions about the model class, attack type, extent of poisoning, or point-wise certification, limiting their practical reliability. This paper introduces a principled formal robustness certification framework that models gradient-based training as a discrete-time dynamical system (dt-DS) and formulates poisoning robustness as a formal safety verification problem. By adapting the concept of barrier certificates (BCs) from control theory, we introduce sufficient conditions to certify a robust radius ensuring that the terminal model remains safe under worst-case ${\ell}_p$-norm based poisoning. To make this practical, we parameterize BCs as neural networks trained on finite sets of poisoned trajectories. We further derive probably approximately correct (PAC) bounds by solving a scenario convex program (SCP), which yields a confidence lower bound on the certified robustness radius generalizing beyond the training set. Importantly, our framework also extends to certification against test-time attacks, making it the first unified framework to provide formal guarantees in both training and test-time attack settings. Experiments on MNIST, SVHN, and CIFAR-10 show that our approach certifies non-trivial perturbation budgets while being model-agnostic and requiring no prior knowledge of the attack or contamination level.

2512.17637 2026-05-13 cs.AI cs.FL cs.LO

About Time: Model-free Reinforcement Learning with Timed Reward Machines

Rajarshi Roy, Anirban Majumdar, Ritam Raha, David Parker, Marta Kwiatkowska

AI总结 在强化学习中,奖励规范对指导智能体行为至关重要。为表达非马尔可夫奖励,已有研究引入奖励机,但传统奖励机难以建模精确的时间约束。本文提出了一种新的时间奖励机(TRM),将时间约束融入奖励结构,支持更丰富的奖励逻辑,例如对延迟施加惩罚或对及时动作给予奖励。研究基于无模型强化学习框架(如表格Q学习),通过时间自动机的抽象和反事实想象启发式方法,学习满足时间约束的最优策略,并在多个基准任务中验证了其有效性。

Comments Extended version of paper accepted at IJCAI 2026

详情
英文摘要

Reward specification plays a central role in reinforcement learning (RL), guiding the agent's behavior. To express non-Markovian rewards, formalisms such as reward machines have been introduced to capture dependencies on histories. However, traditional reward machines lack the ability to model precise timing constraints, limiting their use in time-sensitive applications. In this paper, we propose timed reward machines (TRMs), which are an extension of reward machines that incorporate timing constraints into the reward structure. TRMs enable more expressive specifications with tunable reward logic, for example, imposing costs for delays and granting rewards for timely actions. We study model-free RL frameworks (i.e., tabular Q-learning) for learning optimal policies with TRMs under digital and real-time semantics. Our algorithms integrate the TRM into learning via abstractions of timed automata, and employ counterfactual-imagining heuristics that exploit the structure of the TRM to improve the search. Experimentally, we demonstrate that our algorithm learns policies that achieve high rewards while satisfying the timing constraints specified by the TRM on popular RL benchmarks. Moreover, we conduct comparative studies of performance under different TRM semantics, along with ablations that highlight the benefits of counterfactual-imagining.

2512.11114 2026-05-13 cs.LG cs.AI stat.ML

In-Context Multi-Objective Optimization

Xinyu Zhang, Conor Hassan, Julien Martinelli, Daolang Huang, Samuel Kaski

AI总结 在多目标优化问题中,如何平衡多个竞争目标是一个普遍存在的挑战,尤其在药物设计和自主系统等领域。本文提出了一种名为TAMO的全摊销通用策略,利用Transformer架构实现对不同输入和目标维度的多目标黑盒优化,无需针对每个任务重新训练模型。通过强化学习预训练,TAMO能够在单次前向传播中快速生成优化方案,显著提升了计算效率,并在多个基准和实际任务中表现出优异的帕累托前沿质量。

详情
英文摘要

Balancing competing objectives is omnipresent across disciplines, from drug design to autonomous systems. Multi-objective Bayesian optimization is a promising solution for such expensive, black-box problems: it fits probabilistic surrogates and selects new designs via an acquisition function that balances exploration and exploitation. In practice, it requires tailored choices of surrogate and acquisition that rarely transfer to the next problem, is myopic when multi-step planning is often required, and adds refitting overhead, particularly in parallel or time-sensitive loops. We present TAMO, a fully amortized, universal policy for multi-objective black-box optimization. TAMO uses a transformer architecture that operates across varying input and objective dimensions, enabling pretraining on diverse corpora and transfer to new problems without retraining: at test time, the pretrained model proposes the next design with a single forward pass. We pretrain the policy with reinforcement learning to maximize cumulative hypervolume improvement over full trajectories, conditioning on the entire query history to approximate the Pareto frontier. Across synthetic benchmarks and real tasks, TAMO produces fast proposals, reducing proposal time by 50-1000x versus alternatives while matching or improving Pareto quality under tight evaluation budgets. These results show that transformers can perform multi-objective optimization entirely in-context, eliminating per-task surrogate fitting and acquisition engineering, and open a path to foundation-style, plug-and-play optimizers for scientific discovery workflows.

2512.07150 2026-05-13 cs.LG cs.AI cs.CV

FlowLPS: Langevin-Proximal Sampling for Flow-based Inverse Problem Solvers

Jonghyun Park, Jong Chul Ye

AI总结 本文提出了一种名为 FlowLPS 的训练-free 潜在流逆问题求解方法,基于朗之万-近端采样(Langevin-Proximal Sampling),旨在解决深度生成模型在图像逆问题中的有限步数权衡问题。该方法在每一步反向过程中使用少量朗之万更新对模型预测的干净估计进行扰动,以提供后验导向的随机初始化,随后通过局部 MAP 风格的近端优化快速提升测量一致性,并结合受控的 pCN 风格重噪声技术保持轨迹稳定性。实验表明,FlowLPS 在多个线性逆问题上实现了测量保真度与感知质量的良好平衡。

详情
英文摘要

Deep generative models are powerful priors for imaging inverse problems, but training-free solvers for latent flow models face a practical finite-step trade-off. Optimization-heavy methods quickly improve measurement consistency, but in highly nonlinear latent spaces, their results can depend strongly on where local refinement is initialized, often degrading perceptual realism. In contrast, stochastic sampling methods better preserve posterior exploration, but often require many iterations to obtain sharp, measurement-consistent reconstructions. To address this trade-off, we propose FlowLPS, a training-free latent flow inverse solver based on Langevin-Proximal Sampling. At each reverse step, FlowLPS uses a few Langevin updates to perturb the model-predicted clean estimate in posterior-oriented directions, providing stochastic initializations for local refinement. It then applies local MAP-style proximal refinement to rapidly improve measurement consistency from the Langevin-updated estimate. We additionally use controlled pCN-style re-noising to stabilize the reverse trajectory while retaining trajectory coherence. Experiments on FFHQ and DIV2K across five linear inverse problems show that FlowLPS achieves a strong balance between measurement fidelity and perceptual quality, with additional experiments on pixel-space inverse problems and phase retrieval.