arXivDaily arXiv每日学术速递 周一至周五更新
重置
全部学科分类 4046
2605.10575 2026-05-12 cs.CR cs.AI cs.LG

Acceptance Cards:A Four-Diagnostic Standard for Safe Fine-Tuning Defense Claims

Phongsakon Mark Konrad, Toygar Tanyel, Serkan Ayvaz

AI总结 该论文提出了一种名为“Acceptance Cards”的四维诊断标准,用于评估安全微调防御方法的有效性。研究指出,单纯依赖保留集差距缩小来判断防御效果可能不可靠,因此引入了包括统计可靠性、新语义泛化、机制对齐和跨任务迁移四个方面的评估体系。实验表明,SafeLoRA在该标准下未能通过全部诊断,揭示了现有安全微调方法在实际应用中的潜在缺陷。

详情
英文摘要

Safe fine-tuning defenses are often endorsed on the basis of a held-out gap reduction, but the same reduction can come from sampling noise, subject artifacts, capability loss, or a mechanism that does not transfer. We introduce Acceptance Cards: an evaluation protocol, a documentation object, an executable audit package, and a claim-specific evidential standard for safe fine-tuning defense claims. The protocol checks statistical reliability, fresh semantic generalization, mechanism alignment, and cross-task transfer before treating a gap reduction as a full-card pass. Re-scored under this installed-gap protocol, SafeLoRA fails the full-card pass on Gemma-2-2B-it: under strict mechanism-class coding it fails all four diagnostics, and under a permissive shrinkage relabel it still fails three of four. This is a narrow installed-gap audit on one model family, not a global judgment of SafeLoRA's effectiveness. In a 46-cell audit, no cell satisfies the strict conjunction. The closest family is a near miss that passes reliability and mechanism checks where the required data are available, but fails the fresh-subject threshold, lacks a strict transfer pass, and carries a measurable deployment-accuracy cost.

2605.10571 2026-05-12 eess.IV cs.CV

Set-Based Groupwise Registration for Variable-Length, Variable-Contrast Cardiac MRI

Yi Zhang, Yidong Zhao, Tijmen Toxopeus, Maša Božić-Iven, Sebastian Weingärtner, Qian Tao

AI总结 该研究针对可变长度、对比度不同的心脏MRI序列,提出了一种基于集合的群组配准方法\emph{\AnyTwoReg},以解决传统深度学习方法在跨协议配准中的泛化性不足问题。该方法将MRI序列视为无序集合,解耦了网络设计与序列长度和输入顺序的依赖关系,并通过共享编码器和相关性引导的特征聚合构建了排列不变的参考基准,实现了从图像到形变场的排列等变映射。实验表明,该方法在未见过的定量MRI数据集上表现出良好的零样本泛化能力,并有效提升了后续定量映射的质量。

Comments MICCAI 2026. Submitted Version

详情
英文摘要

Quantitative cardiac magnetic resonance imaging (MRI) enables non-invasive myocardial tissue characterization but relies on robust motion correction within these variable-length, variable-contrast image sequences. Groupwise registration, which simultaneously aligns all images, has shown greater robustness than pairwise registration for motion correction. However, current deep-learning-based groupwise registration methods cannot generalize across MRI sequences: the architecture typically encodes input data as a fixed-length channel stack, which rigidly couples network design to protocol-specific sequence length, input ordering, and contrast dynamics. At inference time, any change in imaging protocols will render the network unusable. In this work, we introduce \emph{\AnyTwoReg}, a new set-based groupwise registration framework that takes a quantitative MRI sequence as an unordered set. This set formulation fundamentally decouples network design from sequence length and input ordering. By utilizing a shared encoder and correlation-guided feature aggregation, \emph{\AnyTwoReg} constructs a permutation-invariant canonical reference for registration, and learns a permutation-equivariant mapping from images to deformation fields. Additionally, we extract contrast-insensitive image features from an existing foundation model to handle extreme contrast variations. Trained exclusively on a single public $T_1$ mapping dataset (STONE, sequence length $L=11$), \AnyTwoReg generalizes to two unseen quantitative MRI datasets (MOLLI, ASL) with variable lengths ($L \in [11, 60]$) and different contrast dynamics. It achieves strong cross-protocol generalization in a zero-shot manner, and consistently improves downstream quantitative mapping quality. Notably, while designed for quantitative MRI sequences, our framework is directly applicable to Cine MRI sequences for inter-cardiac-phase registration.

2605.10566 2026-05-12 stat.ML cs.LG cs.NA math.NA

Affine Tracing: A New Paradigm for Probabilistic Linear Solvers

Disha Hegde, Marvin Pförtner, Jon Cockayne

AI总结 本文提出了一种新的概率线性求解器框架——仿射追踪(Affine Tracing),旨在解决线性系统求解中的不确定性量化问题。研究指出,传统的贝叶斯概率线性求解器实际上是非平稳仿射概率迭代方法(PIMs)的一个特例,并证明了所有现实的仿射PIMs都是校准良好的。为了解决手动实现仿射PIMs的困难,作者引入了仿射追踪算法,该方法能够自动从标准仿射迭代方法的实现中构建概率迭代求解器,从而显著降低了实现难度,并通过实例展示了其在高斯过程近似中的应用效果。

详情
英文摘要

Probabilistic linear solvers (PLSs) return probability distributions that quantify uncertainty due to limited computation in the solution of linear systems. The literature has traditionally distinguished between Bayesian PLSs, which condition a prior on information obtained from projections of the linear system, and probabilistic iterative methods (PIMs), which lift classical iterative solvers to probability space. In this work we show this dichotomy to be false: Bayesian PLSs are a special case of non-stationary affine PIMs. In addition, we prove that any realistic affine PIM is calibrated. These results motivate a focus on (non-stationary) affine PIMs, but their practical adoption has been limited by the significant manual effort required to implement them. To address this, we introduce affine tracing, an algorithmic framework that automatically constructs a PIM from a standard implementation of an affine iterative method by passing symbolic tracers through the computation to build an affine computational graph. We show how this graph can be transformed to compute posterior covariances, and how equality saturation can be used to perform algebraic simplifications required for computation under specific prior choices. We demonstrate the framework by automatically generating a probabilistic multigrid solver and evaluate its performance in the context of Gaussian process approximation.

2605.10528 2026-05-12 cond-mat.stat-mech cs.CL cs.MA physics.soc-ph

Collective Alignment in LLM Multi-Agent Systems: Disentangling Bias from Cooperation via Statistical Physics

Cristiano De Nobili

AI总结 本文研究了基于大语言模型(LLM)的多智能体系统在二维网格中的集体动力学行为,提出了一种无需依赖具体模型的统计物理方法,用于区分社会从众行为与内在偏见,并计算临界指数以分析系统的集体行为和可能的相变。研究发现,所有测试模型均表现出温度驱动的有序-无序转变和磁化率峰值,但其有效临界指数与二维伊辛模型不一致,表明集体对齐主要由内在偏见驱动而非合作耦合,为评估多智能体系统的一致性和可靠性提供了定量诊断工具。

Comments 10 pages, 7 figures

详情
英文摘要

We investigate the emergent collective dynamics of LLM-based multi-agent systems on a 2D square lattice and present a model-agnostic statistical-physics method to disentangle social conformity from intrinsic bias, compute critical exponents, and probe the collective behavior and possible phase transitions of multi-agent systems. In our framework, each node of an $L\!\times\!L$ lattice hosts an identical LLM agent holding a binary state ($+1$/$-1$, mapped to yes/no) and updating it by querying the model conditioned on the four nearest-neighbor states. The sampler temperature $T$ serves as the sole control parameter. Across three open-weight models (llama3.1:8b, phi4-mini:3.8b, mistral:7b), we measure magnetization and susceptibility under a global-flip protocol designed to probe $\mathbb{Z}_2$ symmetry. All models display temperature-driven order-disorder crossovers and susceptibility peaks; finite-size scaling on even-$L$ lattices yields effective exponents $γ/ν$ whose values are model-dependent, close to but incompatible with the 2D Ising universality class ($γ/ν=7/4$). Our method enables the extraction of effective $β$-weighted couplings $\tilde{J}(T)$ and fields $\tilde{h}(T)$, which serve as a measure of social conformity and intrinsic bias. In the models we analyzed, we found that collective alignment is dominated by an intrinsic bias ($\tilde{h}\gg\tilde{J}$) rather than by cooperative neighbor coupling, producing field-driven crossovers instead of genuine phase transitions. These effective parameters vary qualitatively across models, providing compact collective-behavior fingerprints for LLM agents and a quantitative diagnostic for the reliability of multi-agent consensus and collective alignment.

2605.10515 2026-05-12 cs.CR cs.AI cs.DC

SoK: A Systematic Bidirectional Literature Review of AI & DLT Convergence

Ali Irzam Kathia, Yimika Erinle, Abylay Satybaldy, Paolo Tasca, Nikhil Vadgama, Marco Alberto Javarone

AI总结 本文系统性地回顾了2020年至2025年间发表的关于人工智能(AI)与分布式账本技术(DLT)融合的同行评审研究,从双向视角分析了AI增强DLT和DLT增强AI两个方向的技术架构与应用。研究发现,现有工作多集中于少数技术层级,且尚未在生产环境中得到验证,亟需跨层级协同设计与实际场景的实证研究以解决可扩展性、互操作性等关键问题。

Comments 18 pages, 1 figure, 5 tables

详情
英文摘要

The integration of Artificial Intelligence (AI) with Distributed Ledger Technology (DLT) has become a growing research area, yet contributions tend to cluster around specific application domains or examine only one direction of the integration, leaving the broader architectural interplay between the two technologies poorly understood. This work addresses that gap through a structured, bidirectional review of peer-reviewed studies published between 2020 and 2025. We classify contributions along two directions: AI-enhanced DLT, and DLT-enhanced AI. In the first case, we examine how AI techniques improve DLT systems across five layers: data, network, consensus, execution, and application layers. In the second case, we analyse how DLT supports AI systems across five layers: infrastructure, data, model, inference, and application layers, with particular attention to federated learning, model evaluation, and multi-agent coordination. The analysis reveals that most works concentrate on a small subset of layers: execution and consensus for AI-enhanced DLT, data and model for DLT-enhanced AI. Other layers remain comparatively neglected. Despite reported improvements in controlled settings, no study demonstrates deployment at production scale, and the field has not yet offered satisfying answers to fundamental questions around scalability, interoperability, and verifiable execution. We argue that progress will require cross-layer co-design and empirical validation in real-world settings.

2605.10475 2026-05-12 cs.GT cs.LG

Regret Minimization in Bilateral Trade With Perturbed Markets

Anna Lunghi, Matteo Castiglioni, Alberto Marchesi

AI总结 本文研究在存在预算平衡约束的双边交易中,如何通过重复交易最大化交易收益的问题。针对传统对抗性和随机性环境之间的性能差距,作者提出了一种适用于受对抗性干扰的随机市场环境的算法,该算法能够自适应地应对不同水平的干扰,并在保证预算平衡的前提下,实现了比现有方法更优的累积遗憾界。该工作填补了对抗与随机环境之间的理论空白,提升了交易收益的上限。

详情
英文摘要

We address the problem of maximizing Gain from Trade (GFT) in repeated buyer-seller exchanges subject to global budget balance constraints. While this problem is well-understood in purely adversarial and stochastic settings, these environments exhibit a sharp dichotomy: adversarial environments allow for no-regret learning against the best fixed-price mechanism, whereas stochastic environments allow for no-regret learning against the best distribution over prices that is budget balanced in expectation. This gap is significant, as policies balanced in expectation can increase the GFT by a multiplicative factor of two. In this work, we bridge these extremes by studying perturbed markets, where an underlying stochastic distribution is subject to an adversarial corruption $C$. We design an algorithm that adaptively scales with the level of corruption, achieving an $\tilde{\mathcal{O}}(T^{3/4}) + \mathcal{O}(C\log(T))$ regret bound against the best budget-balanced distribution over prices. Simultaneously, our algorithm maintains the worst-case $\tilde{\mathcal{O}}(T^{3/4})$ regret bound relative to a per-round budget-balanced baseline, ensuring optimality even in fully adversarial environments.

2605.10473 2026-05-12 quant-ph cs.AI

Cavity-Enhanced Collective Quantum Processing with Polarization-Encoded Qubits

Kamil Wereszczyński, Józef Cyran, Adam Brzezowski, Dawid Załużny, Robert Potoniec, Kasper Wiśniowski, Agnieszka Michalczuk

AI总结 本文提出了一种基于腔增强的光学架构,用于实现集体量子处理,其中逻辑量子比特被编码在腔内循环模式的偏振子空间中。通过分离物理载体与计算自由度,利用谐振腔束提供稳定的共振平台,并通过可编程的偏振变换实现单量子比特操作,同时在纠缠区域引入偏振选择性非线性相互作用,实现了可调的受控相位门,从而构建出通用的量子门集。研究显示,在厘米级腔体中使用实验可实现的固态非线性介质即可获得接近单位阶的条件相位,无需极端非线性系数、毫秒级光子寿命或亚赫兹激光稳定度,表明共振再循环为基于腔的集体量子架构提供了可行的物理平台。

详情
英文摘要

We introduce a cavity-enhanced optical architecture for collective quantum processing in which logical qubits are encoded in the polarization subspace of recirculating intracavity modes. The physical carrier and computational degree of freedom are explicitly separated: harmonic cavity bundles provide a stable resonant substrate, while programmable polarization transformations implement single-qubit operations. A polarization-selective nonlinear interaction in the entanglement region generates tunable controlled-phase gates, enabling a universal gate set. A parameter-scaling analysis shows that order-unity conditional phases are attainable in centimeter-scale cavities using experimentally accessible solid-state nonlinear media, without requiring extreme nonlinear coefficients, millisecond photon lifetimes, or sub-hertz laser stabilization. The results indicate that resonant recirculation provides a physically plausible platform for cavity based collective quantum architectures.

2605.10457 2026-05-12 cs.GR cs.PF cs.RO

Geometrically Approximated Modeling for Emitter-Centric Ray-Triangle Filtering in Arbitrarily Dynamic LiDAR Simulation

Rabin Gajmer, Joonas Haapala, Zoltan Beck

AI总结 本文提出了一种名为Gajmer射线投射算法(GRCA)的新方法,用于在动态场景中高效实现LiDAR射线-三角形过滤。该方法通过将问题从“每条射线击中什么”转变为“每个三角形可能被哪些射线击中”,利用几何近似技术对旋转的LiDAR发射器进行建模,从而无需加速结构即可实现高效剔除不可能相交的射线。实验表明,GRCA在高动态、高分辨率多传感器仿真中表现出显著的性能提升,相比传统GPU和CPU加速方案分别提升了数倍。

Comments 21 pages, 20 figures

详情
英文摘要

Real-time Light Detection And Ranging (LiDAR) simulation must find, per emitted ray, the closest intersecting triangle even in dynamic scenes containing large numbers of moving and deformable objects. Dominant acceleration-structure approaches require rebuilding each frame for dynamic geometry -- a cost that compounds directly with scene dynamics and cannot be amortized regardless of how little actually changed. This paper presents the Gajmer Ray-Casting Algorithm (GRCA), which inverts the question: instead of asking what does each ray hit? it asks which rays can each triangle possibly hit? GRCA geometrically models spinning LiDAR emitters as rotation-traced cones or planes and uses each triangle's emitter-centric apparent area to cull, per triangle, which channels and the rays within those channels can possibly reach it -- without any acceleration structure. GRCA is compute-based and vendor-agnostic by design, targeting highly dynamic, high-resolution simultaneous multi-sensor simulation. At its core, GRCA is a general-purpose ray-casting algorithm: the emitter-centric inversion applies to any setting where rays originate from a known position, not only LiDAR. Benchmarks evaluate 2-8 simultaneous 128x4096-ray LiDARs (360deg/180deg) over complex dynamic scenes -- with just two sensors casting ~1M rays per frame. With range culling inactive, GRCA reaches up to 7.97x over hardware-accelerated OptiX (GPU) and 14.55x over Embree (CPU). Two independent extensions further boost performance even in the most complex scene (~22M triangles, ~9M of which are dynamic, 8 LiDARs): range culling at realistic deployment ranges (10-100m) reaches up to 7.02x GPU and 9.33x CPU; a hybrid pipeline -- GRCA for dynamic geometry, OptiX/Embree for static -- reaches up to 10.5x GPU and 19.2x CPU.

2605.10447 2026-05-12 cs.MA cs.AI econ.GN q-fin.EC q-fin.ST

Statistical Model Checking of the Keynes+Schumpeter Model: A Transient Sensitivity Analysis of a Macroeconomic ABM

Stefano Blando, Giorgio Fagiolo, Mauro Napoletano, Tania Treibich, Andrea Vandin

AI总结 本文研究了如何利用统计模型检测(SMC)方法对一个宏观经济的基于智能体的模型(Keynes+Schumpeter模型)进行暂态敏感性分析。通过MultiVeStA工具,作者在不改变原有模拟器的前提下,实现了对模型参数变化影响的系统性分析,重点关注失业率和GDP增长率等宏观指标以及市场占有率等微观指标。研究结果表明,不同参数变化对模型动态的影响存在显著差异,展示了SMC在提高宏观经济ABM分析可重复性和透明度方面的潜力。

详情
英文摘要

Agent-based models (ABMs) are increasingly used in macroeconomics, but their analysis still often relies on ad hoc Monte Carlo campaigns with heterogeneous statistical effort across parameter settings. We show how statistical model checking (SMC), implemented through MultiVeStA, can provide a principled analysis layer for a realistic macroeconomic ABM without rewriting the simulator in a dedicated formalism. Our case study is the heuristic-switching Keynes+Schumpeter(K+S) model, analysed hrough a transient sensitivity campaign over one-parameter sweeps, two macro observables (unemployment and GDP growth), and one auxiliary micro-level probe (market share) on the post-warmup phase of a 600-step horizon. The analysis is driven by reusable temporal queries, observable-specific precision targets, and confidence-based stopping rules that automatically determine the simulation effort required by each configuration. Results show a clear contrast across parameter families: macro-financial and structural sweeps produce the strongest transient effects, whereas several heuristic-rule sweeps remain much weaker under the same precision policy. More broadly, the paper shows that SMC can support reproducible and informative quantitative analysis of substantively rich economic ABMs, while making uncertainty estimates and simulation cost explicit parts of the reported results.

2605.10436 2026-05-12 cs.CR cs.LG cs.NI

DRIFT: Drift-Resilient Invariant-Feature Transformer for DGA Detection

Chaeyoung Lee, Chaeri Jung, Seonghoon Jeong

AI总结 随着域名生成算法(DGA)不断进化以逃避检测,基于深度学习的检测方法在面对时间漂移时性能显著下降。为此,本文提出DRIFT,一种基于Transformer的框架,通过混合分词策略和多任务自监督预训练学习不变特征,有效提升了模型对新型DGA变体的鲁棒性。实验表明,该方法在长期检测任务中表现出更强的稳定性,显著优于现有方法。

Comments 14 pages, 7 figures, 8 tables. Accepted to appear in Proc. of the 56th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2026)

详情
英文摘要

Domain Generation Algorithms (DGAs) evolve continuously to evade botnet detection, posing a persistent challenge for dependable network defense. While deep learning-based detectors achieve strong performance under static conditions, they suffer severe degradation when facing temporal drift. Through a 9-year longitudinal study (2017-2025), we empirically show that state-of-the-art character- and word-based DGA classifiers rapidly lose effectiveness as new DGA variants emerge. To address this problem, we propose a drift-resilient Transformer-based framework that learns invariant representations through a hybrid tokenization strategy and multi-task self-supervised pre-training. The model integrates (i) character-level encoding to capture stochastic morphological patterns and (ii) subword-level encoding for word-based DGAs. Three pre-training tasks enable the model to learn robust structural and contextual features prior to supervised fine-tuning. Comprehensive evaluations demonstrate that our method significantly mitigates temporal degradation and consistently outperforms state-of-the-art baselines in forward-chaining experiments. The proposed approach offers a dependable foundation for long-term DGA defense in evolving threat landscapes. Our code is available at: https://github.com/snsec-net/2026-DSN-DRIFT.

2605.10429 2026-05-12 physics.chem-ph cs.AI

Physical probes expose and alleviate chemical-environment collapse in molecular representations

Jiebin Fang, Zidi Yan, Churu Mao, Yongjun Jiang, Xinyi Tang, Lei Miao, Dan Lu, Yun Huang, Wanjing Ding, Zhongjun Ma

AI总结 该研究针对分子表示学习中因化学环境信息丢失而导致的表示崩溃问题,提出了一种基于物理实验数据的解决方案。研究构建了高保真实验与计算结合的13C核磁共振数据集,并开发了CLAIM框架,通过层次化化学先验和跨层级对比学习,有效恢复了原子级别的化学分辨率,提升了分子-光谱检索性能。该方法在柔性分子和互变异构体系中表现出色,无需显式三维建模即可提高立体异构体区分能力,并可迁移至ADMET预测等分子属性任务,为实验驱动的分子表征学习提供了新思路。

详情
英文摘要

Nuclear magnetic resonance (NMR) spectroscopy provides an experimental readout of local chemical environments, but its use in molecular representation learning has been constrained by heterogeneous data and incomplete atom-level assignments. Here we construct complementary high-fidelity experimental and computational 13C NMR resources, which reveal a recurrent form of representational collapse: atoms that are equivalent in molecular topology can remain experimentally distinct in their real chemical environments, whereas explicit 3D descriptions are further limited by static conformations in dynamic regimes. To alleviate this bottleneck, we develop CLAIM (Contrastive Learning for Atom-to-molecule Inference of Molecular NMR), a framework that aligns efficient topological molecular inputs with atom-resolved NMR observables. Through hierarchical chemical priors and cross-level contrastive learning, CLAIM restores lost chemical resolution and markedly improves atom-level molecule-spectrum retrieval. CLAIM remains robust in flexible and tautomeric systems for 13C NMR prediction, improves stereoisomer discrimination without explicit 3D modelling, and transfers to broader molecular property tasks including ADMET prediction and fluorescence estimation. These results establish physically grounded spectral alignment as an effective strategy for alleviating chemical-environment collapse and for guiding experimentally grounded molecular representation learning.

2605.10425 2026-05-12 cs.CY cs.AI

Toward an Engineering of Science: Rebalancing Generation and Verification in the Age of AI

Jiaqi W. Ma

AI总结 随着AI技术的发展,生成科学成果(如论文、综述等)的成本大幅降低,导致科学系统面临“知识污染”的风险。本文认为,科学界应将这一问题视为工程问题,重新设计知识基础设施以平衡生成与验证的成本。为此,作者提出了一种名为“蓝图”的结构化研究框架,通过将研究内容分解为可验证的图组件,降低后续验证的难度和成本,并已在原型系统中进行了验证。

详情
英文摘要

AI systems can now cheaply generate plausible scientific artifacts such as papers, reviews, and surveys. This creates a risk of \emph{epistemic pollution} in our scientific systems, where unreliable but plausible-looking artifacts can accumulate faster than the system can filter them out. The problem is structural: the epistemic infrastructure of science was calibrated to a world where producing a plausible artifact required substantial expertise, labor, and time, so generation cost itself served as a rough filter; AI weakens that filter without comparably lowering verification cost. We argue that \textbf{AI-era science should treat this as an engineering problem: redesigning epistemic infrastructure to rebalance the costs of generation and verification}. The current paper-centered system makes verification expensive: papers compress long-context scientific logic into prose, forcing reviewers, human or AI, to reconstruct underlying argument structure before they can evaluate it. As one step in this direction, we propose \textbf{blueprints} as preliminary epistemic infrastructure: structured, decomposed research artifacts that represent claims, evidence, assumptions, and definitions as typed graph components. Blueprints are designed to trade an upfront generation cost for cheaper, more local, more distributed verification downstream. We have instantiated the proposal in a proof-of-concept prototype.

2605.10402 2026-05-12 math.GR cs.AI

Every finite group admits a just finite presentation

Marc Lackenby

AI总结 本文研究了有限群是否可以具有“仅有限”(just finite)的有限呈现的问题,即若从该呈现中去掉任何一条关系,所得群将变为无限群。该问题长期以来悬而未决。作者证明了每个有限群都存在这样的“仅有限”呈现,从而肯定地解决了这一猜想。

Comments 5 pages. Significant assistance was provided by the AI co-mathematician tool developed by Google DeepMind

详情
英文摘要

A finite presentation < X | R > of a finite group is called `just finite' if removing any relation from R results in a presentation for an infinite group. It has been an open question (Kourovka Notebook, Problem 21.10) whether every finite group admits such a presentation. We resolve this conjecture in the affirmative.

2605.10385 2026-05-12 stat.ML cs.LG

Regret Analysis of Guided Diffusion for Black-Box Optimization over Structured Inputs

Masaki Adachi, Anita Yang, Yakun Wang, Song Liu

AI总结 本文研究了引导扩散模型在结构化输入的黑箱优化中的遗憾行为,针对现有分析方法在现代扩散优化框架下不适用的问题,提出了一种基于证书的期望简单遗憾分析框架。核心方法围绕“质量提升”这一概念,衡量预训练生成器对近最优设计的概率质量增加,揭示了指数级收敛与多项式加速可能源自同一机制。研究还提供了从有限候选池估计搜索指数的实用诊断方法,并提出了一个完全认证的采样器构造方案。

Comments 48 pages, 12 figures

详情
英文摘要

Guided-diffusion black-box optimization (BO) has shown strong empirical performance on structured design problems such as molecules and crystals, but its regret behavior remains poorly understood. Existing BO regret analyses typically rely on maximum information gain, non-pretrained surrogate models, or exact acquisition maximization -- assumptions that break down in modern diffusion -- BO pipelines, where pretrained diffusion models serve as powerful priors over valid structures and acquisition maximization is replaced by approximate sampling over astronomically large discrete spaces. We develop a first certificate-based expected simple-regret framework for guided-diffusion BO that avoids maximum-information-gain bounds, RKHS assumptions, and exact acquisition maximization. The central quantity in our analysis is mass lift: the increase in probability mass assigned to near-optimal designs relative to the pretrained generator. This view explains how exponential-looking finite-budget convergence and polynomial acceleration can all arise from the same mechanism. We also give practical diagnostics for estimating search exponents from finite candidate pools and a proposal-corrected resampling construction that provides a fully certified sampler instance.

2605.10383 2026-05-12 stat.ML cs.LG

Multifidelity Gaussian process regression for solving nonlinear partial differential equations

Fatima-Zahrae El-Boukkouri, Josselin Garnier, Olivier Roustant

AI总结 本文提出了一种基于协同克里金法的多保真度高斯过程回归方法,用于求解非线性偏微分方程。该方法利用多保真度仿真数据,首先拟合一个可微的非平稳核函数,再结合估计的超参数构建高保真度核函数和均值函数,从而在高斯过程框架下求解PDE。实验在Burgers方程上验证了该方法的有效性,展示了其在物理信息引导下的优越性能。

Comments 31 pages, 20 figures

详情
英文摘要

Solving nonlinear partial differential equations (PDEs) using kernel methods offers a compelling alternative to traditional numerical solvers. However, the performance of these methods strongly depends on the choice of kernel. In this work, as the available information is inherently multifidelity, we propose a kernel learning approach based on cokriging, leveraging empirical information from multifidelity simulations. In the first step, we fit a differentiable non-stationary kernel to an empirical kernel obtained from low-fidelity simulations. In the second step, we derive a high-fidelity kernel with estimated hyperparameters, and construct a corresponding high-fidelity mean using the multifidelity framework. These components can then be used within a Gaussian process framework for solving PDEs. Finally, we demonstrate the performance of the proposed physics-informed method on the Burgers' equation.

2605.10373 2026-05-12 cs.DB cs.CL

Toward Multi-Database Query Reasoning for Text2Cypher

Makbule Gulcin Ozsoy

AI总结 本文研究了如何将自然语言查询转化为适用于多图数据库的Cypher查询,以解决现有Text2Cypher系统仅支持单一数据库的局限性。核心方法提出了一种三阶段的多数据库推理框架,包括数据库路由、多数据库分解和异构查询推理,旨在实现跨多个独立图数据库的信息检索与结果整合。该工作为构建更符合实际场景的自然语言数据库接口提供了结构化的方法和挑战分析。

详情
英文摘要

Large language models have significantly improved natural language interfaces to databases by translating user questions into executable queries. In particular, Text2Cypher focuses on generating Cypher queries for graph databases, enabling users to access graph data without query language expertise. Most existing Text2Cypher systems assume a single preselected graph database, where queries are generated over a known schema. However, real-world systems are often distributed across multiple independent graph databases organized by domain or system boundaries, where relevant information may span multiple sources. To address this limitation, we propose a shift from single-database query generation to multi-database query reasoning. Instead of assuming a fixed execution context, the system must reason about (i) relevant databases, (ii) how to decompose a question across them, and (iii) how to integrate partial results. We formalize this setting through a three-phase roadmap: database routing, multi-database decomposition, and heterogeneous query reasoning across database types and query languages. This work provides a structured formulation of multi-database reasoning for Text2Cypher and identifies challenges in source selection, query decomposition, and result integration, aiming to support more realistic and scalable natural language interfaces to graph databases.

2605.10330 2026-05-12 stat.ML cs.LG stat.ME

Fast Training of Mixture-of-Experts for Time Series Forecasting via Expert Loss Integration

Btissame El Mahtout, Florian Ziel

AI总结 本文提出了一种新的自适应专家混合(MoE)框架,用于时间序列预测,通过在训练过程中直接引入专家特定的损失信息,增强专家的专业化能力。该方法将基础预测损失与专家特定损失结合,使专家级别的预测误差能够与全局预测损失共同影响模型训练,并结合部分在线学习策略,实现对门控机制和专家参数的增量更新,从而显著降低计算成本。实验表明,该方法在多个经济、旅游和能源数据集上优于传统统计方法和先进神经网络模型,具有更高的预测精度和计算效率。

详情
英文摘要

We propose a novel adaptive Mixture-of-Experts (MoE) framework for time series forecasting that enhances expert specialization by incorporating expert-specific loss information directly into the training process. Notably, the overall objective comprises the base forecasting loss and expert-specific losses, allowing expert-level prediction errors to jointly shape training alongside the global forecasting loss. This framework is further combined with a partial online learning strategy, enabling incremental updates of both the gating mechanism and expert parameters. This approach significantly reduces computational cost by eliminating the need for repeated full model retraining. By integrating expert-level loss awareness with efficient online optimization, the proposed method achieves improved learning efficiency while maintaining strong predictive performance. Empirical results across economic, tourism, and energy datasets with varying frequencies demonstrate that the proposed approach generally outperforms both statistical methods and state-of-the-art neural network models, such as Transformers and WaveNet, in forecasting accuracy and computational efficiency. Furthermore, ablation studies confirm the effectiveness of the expert-specific loss integration strategy, highlighting its contribution to enhancing predictive performance.

2605.10327 2026-05-12 quant-ph cs.AI cs.SC

SCALAR: A Neurosymbolic Framework for Automated Conjecture and Reasoning in Quantum Circuit Analysis

Sean Feeney, Pooja Rao, Andreas Klappenecker, Reuben Tate, Yuri Alexeev, Stefano Mensa, Elica Kyoseva, Stephan Eidenbenz

AI总结 本文提出了一种名为SCALAR的神经符号框架,用于量子电路分析中的自动化猜想生成与推理,该框架基于CUDA-Q开源平台,结合了量子模拟、符号猜想生成和大语言模型解释。研究通过在MQLib基准数据集上的82个MaxCut实例以及2000个随机生成的图上进行评估,揭示了量子近似优化算法(QAOA)参数与图不变量之间的关系,并发现了图结构特征与优化景观属性之间的关联。实验还展示了SCALAR在处理多达77个量子比特的问题时的扩展能力,并探讨了生成猜想的准确性、通用性及局限性。

详情
英文摘要

In this paper, we present SCALAR (Symbolic Conjecture and LLM-Assisted Reasoning), a neurosymbolic framework for automated conjecture generation in quantum circuit analysis built on top of the CUDA-Q open source framework. The system integrates quantum simulation, symbolic conjecture generation, and LLM-based interpretation. We evaluate SCALAR on 82 MaxCut instances from the MQLib benchmark dataset and extend the analysis to 2,000 randomly generated graphs across four topologies: regular, Erdos-Renyi, Barabasi-Albert, and Watts-Strogatz. The framework generates conjectured bounds relating optimal QAOA parameters to graph invariants, including known relationships such as periodicity constraints on the phase separation parameter $γ$. SCALAR also recovers previously reported parameter transfer phenomena across structurally similar instances. Additionally, the system identifies correlations between graph structural features and optimization landscape properties, which we characterize through invariant-based descriptors. Using CUDA-Q tensor network simulator, we scale experiments to instances of up to 77 qubits. We discuss the accuracy, generality, and limitations of the generated conjectures, including sensitivity to graph class and quantum circuit depth.

2605.10291 2026-05-12 econ.GN cs.AI cs.ET q-fin.EC stat.AP

Generative AI Fuels Solo Entrepreneurship, but Teams Still Lead at the Top

Hyunso Kim, Hyo Kang, Jaeyong Song

AI总结 近年来生成式人工智能的发展正在改变创业者的参与方式,但并未改变高质量创业成果的分布格局。研究利用Product Hunt平台上超过16万次产品发布的数据发现,ChatGPT-3.5发布后,个人创业者进入创业领域的比例显著上升,尤其在以往更倾向于团队创业的领域更为明显。然而,这种增长主要体现在低投入、实验性创业活动上,而高质量成果仍由团队创业主导,表明生成式AI虽降低了个人创业的门槛,但团队在顶尖成果中仍具优势。

详情
英文摘要

Recent advances in generative artificial intelligence (AI) are reshaping who enters entrepreneurship, but not who reaches the top of the quality distribution. Using data on over 160,000 product launches on Product Hunt, we find that entrepreneurial entry increased sharply following the public release of ChatGPT-3.5, driven disproportionately by solo entrepreneurs. This shift toward solo entry is particularly pronounced in categories that historically favored team-based ventures. However, much of this growth reflects low-commitment, experimental entry and does not translate into greater representation among the highest-quality outcomes. Team-based ventures are increasingly dominant in the top tiers of platform rankings. These findings suggest that generative AI lowers barriers to solo entrepreneurship while reinforcing team-based advantages.

2605.10290 2026-05-12 stat.ML cs.LG math.ST stat.TH

Characterizing the Generalization Error of Random Feature Regression with Arbitrary Data-Augmentation

Lucas Morisset, Alain Durmus, Adrien Hardy

AI总结 本文研究了在协变量数量与样本数量成比例的场景下,数据增强对监督回归方法正则化效果的影响。通过仅依赖真实数据的总体统计量以及数据增强方案的一阶和二阶统计量,给出了测试误差(以均方误差衡量)的精确刻画。研究适用于任意网络结构,只要仅训练最后一层输出层,其余部分固定或随机初始化,并且在高斯数据情况下验证了所提出理论的紧致性。

详情
英文摘要

This paper aims at analyzing the regularization effect that data augmentation induces on supervised regression methods in the proportional regime, where the number of covariates grows proportionally to the number of samples. We provide a tight characterization of the test error, measured in mean squared error, in terms only of the population quantities of the true data, as well as first and second order statistics of the augmentation scheme. Our results are valid under misspecified feature maps, and for any network architecture where only the last readout layer is trained, and the rest of the network is either frozen or randomly initialized. We specify our results in the case of Gaussian data, and show that our asymptotic characterization is tight in this setting.

2605.10285 2026-05-12 stat.ML cs.LG

Scalable Gaussian process inference via neural feature maps

Anthony Stephenson

AI总结 本文提出了一种基于神经特征映射的理论支撑高斯过程框架,用于构建表达能力强的核函数。通过将学习到的特征映射解释为隐含再生核希尔伯特空间中格拉姆矩阵的最优低秩近似,建立了高斯过程后验的一致性。该方法还分析了所诱导核的谱特性,并引入乘积特征映射核以缓解过平滑问题,从而实现了快速、可扩展且准确的高斯过程推理,适用于回归和分类任务,并在多个基准数据集上表现出优越的性能。

Comments 27 pages

详情
英文摘要

We present a theoretically grounded Gaussian process framework that leverages neural feature maps to construct expressive kernels. We show that the learned feature map can be interpreted as an optimal low-rank approximation to a Gram matrix derived from an implied RKHS, from which we establish consistency of the GP posterior. We further analyse the spectral properties of the induced kernels and introduce product feature-map kernels to address oversmoothing. This simple yet powerful approach enables fast, scalable, and accurate exact GP inference with minimal upfront work. The flexibility of kernel design supports seamless application to both regression and classification tasks across diverse data modalities, including tabular inputs and structured domains such as images. On benchmark datasets, this approach surpasses pre-existing methods in terms of accuracy and training and prediction efficiency.

2605.10253 2026-05-12 cs.CR cs.AI

Knowledge Poisoning Attacks on Medical Multi-Modal Retrieval-Augmented Generation

Peiru Yang, Haoran Zheng, Tong Ju, Shiting Wang, Wanchun Ni, Jiajun Liu, Shangguang Wang, Yongfeng Huang, Tao Qi

AI总结 该研究探讨了医疗多模态检索增强生成(RAG)系统中知识投毒攻击的风险,提出了一种名为M³Att的攻击框架,仅需对底层数据库有限的分布知识即可实施攻击。其核心方法是在文本数据中注入隐蔽的错误信息,并利用配对的视觉数据作为查询无关的触发器,以操控检索过程。通过利用医学诊断的固有模糊性,该方法能够在不被模型自我纠正机制发现的情况下降低诊断准确性,实验表明其生成的医疗内容在临床看来合理但存在错误。

Comments Accepted by ACL 2026

详情
英文摘要

Retrieval-augmented generation (RAG) is a widely adopted paradigm for enhancing LLMs in medical applications by incorporating expert multimodal knowledge during generation. However, the underlying retrieval databases may naturally contain, or be intentionally injected with, adversarial knowledge, which can perturb model outputs and undermine system reliability. To investigate this risk, prior studies have explored knowledge poisoning attacks in medical RAG systems. Nevertheless, most of them rely on the strong assumption that adversaries possess prior knowledge of user queries, which is unrealistic in deployments and substantially limits their practical applicability. In this paper, we propose M\textsuperscript{3}Att, a knowledge-poisoning framework designed for medical multimodal RAG systems, assuming only limited distribution knowledge of the underlying database. Our core idea is to inject covert misinformation into textual data while using paired visual data as a query-agnostic trigger to promote retrieval. We first propose a unified framework that introduces imperceptible perturbations to visual inputs to manipulate retrieval probabilities. Besides, due to the prior medical knowledge in LLMs, naively poisoned medical content with explicit factual errors can be corrected during generation. Thus, we leverage the inherent ambiguity of medical diagnosis and design a covert misinformation injection strategy that degrades diagnostic accuracy while evading model self-correction. Experiments on five LLMs and datasets demonstrate that M\textsuperscript{3}Att consistently produces clinically plausible yet incorrect generations. Codes: https://github.com/ypr17/M3Att.

2605.10240 2026-05-12 cs.SE cs.CR cs.LG

MARGIN: Margin-Aware Regularized Geometry for Imbalanced Vulnerability Detection

Yuteng Zhang, Huifang Ma, Jiahui Wei, Qingqing Li, Yafei Yang

AI总结 软件漏洞检测对于保障软件安全和可靠性至关重要。面对现实世界中漏洞数据集存在的频率不平衡和难度不平衡问题,本文从嵌入几何视角重新解读这一挑战,提出了一种基于自适应边距度量学习和超球面原型建模的框架MARGIN,通过动态调整几何正则化策略,减少嵌入空间中的几何畸变,从而提升分类与检测性能。实验表明,MARGIN在多个公开漏洞数据集上显著优于现有方法,尤其在处理不平衡数据时表现出更强的鲁棒性和泛化能力。

Comments 12 pages.9 figures, 4 tables

详情
英文摘要

Software vulnerability detection is critical for ensuring software security and reliability. Despite recent advances in deep learning, real-world vulnerability datasets suffer from two severe challenges: frequency imbalance and difficulty imbalance. We reinterpret these challenges from an embedding geometry perspective, observing that such imbalances induce geometric distortions in hyperspherical representation space. To address this issue, we propose MARGIN, a metric-based framework that learns discriminative vulnerability representations through adaptive margin metric learning and hyperspherical prototype modeling. MARGIN dynamically adjusts geometric regularization according to the distribution structure estimated by the von Mises-Fisher concentration, aligning the probability mass of embedding distributions with their corresponding Voronoi cells, thereby reducing geometric distortion and yielding more stable decision boundaries. Extensive experiments on public vulnerability datasets show that MARGIN consistently outperforms strong baselines, achieving notable improvements in classification and detection, especially on challenging, imbalanced datasets. Further analysis demonstrates that MARGIN produces more structured embedding geometries, improving robustness, interpretability, and generalization.

2605.10206 2026-05-12 math.ST cs.LG stat.ML stat.TH

Extended Wasserstein-GAN Approach to Causal Distribution Learning: Density-Free Estimation and Minimax Optimality

Shu Tamano, Masaaki Imaizumi

AI总结 该论文研究了因果分布学习中的分布性因果推断问题,旨在估计干预后的结果分布,包括分位数和尾部风险等。为解决现有生成对抗网络(GAN)方法在理论保证和稳定性方面的不足,作者提出了GANICE方法,通过引入扩展的Wasserstein距离和单元批评机制,实现了对条件干预分布的精确估计,并在Besov空间理论基础上证明了其最小最大最优性。实验表明,GANICE在多个任务中优于现有方法。

详情
英文摘要

Distributional causal inference requires estimating not only average treatment effects but also interventional outcome distributions, including quantiles, tail risks, and policy-dependent uncertainty. As a method for distributional causal inference, generative adversarial network (GAN)-based counterfactual methods are flexible tools for this task. However, these methods have several limitations. First, the objectives of certain techniques do not coincide with the statistical risk of the identifiable causal target, and therefore provide limited theoretical guarantees regarding estimable counterfactual distributions or optimality. Second, they tend to rely on unstable density-based methods, such as density ratio estimation. In this paper, we propose GANICE (GAN for Interventional Conditional Estimation) with several advantages: it (i) clarifies the conditional interventional distribution for each treatment--covariate state as the causal estimation target; (ii) estimates the conditional distribution such that its averaged Wasserstein risk is minimized; (iii) establishes minimax optimality. GANICE achieves these advantages through the introduction of the extended Wasserstein distance, the incorporation of a cellwise critic in its dual, and an optimality proof based on Besov space theory. Our experiments demonstrate that GANICE consistently outperforms existing methods.

2605.10178 2026-05-12 q-bio.NC cs.LG cs.NE

Joint sparse coding and temporal dynamics support context reconfiguration

Qianqian Shi, Yue Che, Faqiang Liu, Hongyi Li, Mingkun Xu, Sandra Reinert, Pieter M. Goltstein, Rong Zhao, Luping Shi

AI总结 该研究探讨了大脑如何在切换不同情境时保持对先前经验的表征,从而实现适应性行为。研究发现,联合稀疏编码与时间动态特性在小鼠内侧前额叶皮层和计算网络中共同作用,有助于在情境转换过程中维持已有知识,减少跨情境干扰。这些机制不仅为理解大脑如何灵活适应新环境提供了理论框架,也为构建避免灾难性遗忘的持续学习系统提供了高效的架构原则。

Comments 37 pages, 6 figures, 6 extended data figures. Preprint version

详情
英文摘要

Adaptive behavior requires the brain to transition between distinct contexts while maintaining representations of prior experience. The ability to reconfigure neural representations without erasing previously acquired knowledge is central to learning in dynamic environments, yet the neural mechanisms that support this balance remain unclear. Understanding these mechanisms is also critical for addressing catastrophic forgetting in artificial systems designed for lifelong learning. Here, we identify joint sparse coding and temporal dynamics in both the mouse medial prefrontal cortex (mPFC) and computational networks as mechanisms that help preserve prior representations during context transitions. Specifically, sparsity in context-dependent representations reduces cross-context interference, whereas temporal dynamics within the network activity further enhance context separability across time. Strikingly, networks endowed with both properties, such as spiking neural networks, exhibit improved retention during lifelong learning without auxiliary heuristics. These findings establish joint sparse coding and temporal dynamics as a core mechanism supporting flexible context reconfiguration in lifelong learning and, through their activity constraining nature, as an energy-efficient architectural principle for stable adaptation. Together, they provide a mechanistic framework for understanding how the brain preserves prior knowledge while flexibly adapting to new contexts.

2605.10176 2026-05-12 cs.CR cs.AI

When Prompts Become Payloads: A Framework for Mitigating SQL Injection Attacks in Large Language Model-Driven Applications

Farzad Nourmohammadzadeh Motlagh, Mehrdad Hajizadeh, Mehryar Majd, Pejman Najafi, Feng Cheng, Christoph Meinel

AI总结 随着大型语言模型(LLM)的广泛应用,基于自然语言的数据库接口日益普及,但同时也带来了SQL注入攻击的新风险。本文提出了一种多层安全框架,用于检测和缓解LLM驱动应用中的SQL注入攻击,该框架结合了前端提示净化、行为与语义异常检测模型以及基于签名的攻击模式控制层。实验表明,该方法在多种攻击场景下具有高检测准确率和低误报率,有效提升了LLM驱动数据库应用的安全性。

Comments 11 pages

详情
Journal ref
ICAART 2026, 18th Int. Conf. on Agents and Artificial Intelligence, pp. 1380-1390, 2026
英文摘要

Natural language interfaces to structured databases are becoming increasingly common, largely due to advances in large language models (LLMs) that enable users to query data using conversational input rather than formal query languages such as SQL. While this paradigm significantly improves usability and accessibility, it introduces new security risks, particularly the amplification of SQL injection vulnerabilities through the prompt-to-SQL translation process. Malicious users can exploit these mechanisms by crafting adversarial prompts that manipulate model behavior and generate unsafe queries. In this work, we propose a multi-layered security framework designed to detect and mitigate LLM-mediated SQL injection attacks. The framework integrates a front-end security shield for prompt sanitization, an advanced threat detection model for behavioral and semantic anomaly identification, and a signature-based control layer for known attack patterns. We evaluate the proposed framework under diverse and realistic attack scenarios, including prompt injection, obfuscated SQL payloads, and context-manipulation attacks. To ensure robustness, we generate and curate a comprehensive benchmark dataset of adversarial prompts and assess performance across a fine-tuned LLM configuration. Experimental results demonstrate that the proposed approach achieves high detection accuracy while maintaining low false-positive rates, significantly improving the secure deployment of LLM-powered database applications.

2605.10163 2026-05-12 stat.ML cs.AI cs.LG

Coarsening Linear Non-Gaussian Causal Models with Cycles

Francisco Madaleno, Francisco C Pereira, Alex Markham

AI总结 本文研究了在存在循环的线性非高斯因果模型中,如何从高维数据中学习低维因果结构的问题。作者提出了一种方法,在不假设高维结构无环的前提下,仍能恢复出低维的有向无环图(DAG),并将其与现有可识别性结果联系起来。该方法具有较低的时间复杂度和明确的样本复杂度界,为高维因果模型的抽象提供了更广泛适用的解决方案。

详情
英文摘要

Recent work on causal abstraction, in particular graphical approaches focusing on causal structure between clusters of variables, aims to summarize a high-dimensional causal structure in terms of a low-dimensional one. Existing methods for learning such summaries from data assume that both the high- and low-dimensional structures are acyclic, which is helpful for causal effect identification and reasoning but excludes many high-dimensional models and thus limits applicability. We show that in the linear non-Gaussian (LiNG) setting, the high-dimensional acyclicity assumption can be relaxed while still allowing recovery of a low-dimensional causal directed acyclic graph (DAG). We further connect identifiability of this low-dimensional DAG to existing results: LiNG models with cycles are observationally identifiable only up to an equivalence class whose members differ by reversals of directed cycles; our low-dimensional DAG, which is invariant across all members of a given equivalence class, thus forms a natural representative of the class. While existing approaches for learning this observational equivalence class over high-dimensional variables have exponential time complexity, our low-dimensional summary is learned in worst-case cubic time and comes with explicit bounds on the sample complexity. We provide open source code and experiments on synthetic data to corroborate our theoretical results.

2605.10137 2026-05-12 stat.ML cs.LG

PFN-TS: Thompson Sampling for Contextual Bandits via Prior-Data Fitted Networks

Yan Shuo Tan, Kenyon Ng, Ruizhe Deng, Sumetha Loganathan, Qiong Zhang, Bibhas Chakraborty

AI总结 本文提出了一种基于先验数据拟合网络(PFN)的汤普森采样算法PFN-TS,用于上下文老虎机问题。该方法通过子采样预测中心极限定理,将PFN的后验预测分布转化为对奖励函数均值的采样,从而在保持不确定性估计的同时提升采样效率。相比传统方法,PFN-TS通过几何网格上的数据前缀估计后验方差,减少了计算复杂度,并复用TabICL的缓存表示以提高效率。实验表明,PFN-TS在多个基准测试中表现优异,具有较高的策略价值和竞争力。

详情
英文摘要

Thompson sampling is a widely used strategy for contextual bandits: at each round, it samples a reward function from a Bayesian posterior and acts greedily under that sample. Prior-data fitted networks (PFNs), such as TabPFN v2+ and TabICL v2, are attractive candidates for this purpose because they approximate Bayesian posterior predictive distributions in a single forward pass. However, PFNs predict noisy future rewards, while Thompson sampling requires uncertainty over the latent mean reward function. We propose PFN-TS, a Thompson sampling algorithm that converts PFN posterior predictives into mean-reward samples using a subsampled predictive central limit theorem. The method estimates posterior variance from a geometric grid of $O(\log n)$ dataset prefixes rather than the full $O(n)$ predictive sequence used in previous predictive-sequence approaches, and reuses TabICL's cached representations across rounds. We prove consistency of the subsampled variance estimator and give a Bayesian regret bound that decomposes PFN-TS regret into exact posterior-sampling regret under the PFN prior plus approximation terms. Empirically, PFN-TS achieves the best average rank across nonlinear synthetic and OpenML classification-to-bandit benchmarks, remains competitive on linear and BART-generated rewards, and attains the highest estimated policy value in an offline mobile-health evaluation. Code is available at https://anonymous.4open.science/r/PFN_TS-36ED/.

2605.10124 2026-05-12 cs.NI cs.DC cs.IT cs.LG math.IT

GELATO: Generative Entropy- and Lyapunov-based Adaptive Token Offloading for Device-Edge Speculative LLM Inference

Zengzipeng Tang, Yuxuan Sun, Wei Chen, Jianwen Ding, Bo Ai

AI总结 随着设备端大语言模型(LLM)推理的兴起,设备与边缘协同推理成为研究热点。本文提出了一种基于生成熵和李雅普诺夫函数的自适应令牌卸载框架GELATO,旨在在能量受限的设备-边缘协同系统中最大化解码吞吐量。该方法通过外层漂移加惩罚循环制定参考草案预算,管理长期的能耗与吞吐量权衡,并结合熵驱动的生成机制实现按令牌动态不确定性进行早期退出,理论分析证明了其长期吞吐量的性能界限,实验表明GELATO在令牌吞吐量和能耗方面均优于现有先进架构。

Comments This work has been submitted to the IEEE for possible publication

详情
英文摘要

The recent growth of on-device Large Language Model (LLM) inference has driven significant interest in device-edge collaborative LLM inference. As a promising architecture, Speculative Decoding (SD) is increasingly adopted where a lightweight draft model rapidly generates candidate tokens to be verified by a powerful target model. However, a fundamental challenge lies in achieving per-token resource scheduling to effectively adapt SD paradigm to resource-constrained edge environment. This paper proposes a Generative Entropy- and Lyapunov-based Adaptive Token Offloading framework, named GELATO, to maximize decoding throughput under energy constraints in a device-edge collaborative SD system. Specifically, an outer drift-plus-penalty loop makes online decisions to establish a reference drafting budget, managing long-term energy-throughput trade-off. Further, a nested entropy-driven generation mechanism executes early exiting to adapt to per-token dynamic generative uncertainty. Theoretical analysis establishes a rigorous performance bound on long-term throughput for GELATO. Extensive evaluations demonstrate that GELATO achieves a globally optimal tradeoff, outperforming state-of-the-art distributed SD architectures by 64.98% in token throughput and reducing energy consumption by 47.47% under resource-constrained environments, while preserving LLM decoding quality.

2605.07828 2026-05-12 math.NA cs.LG cs.NA

NSPOD: Accelerating Krylov solvers via DeepONet-learned POD subspaces

Francesc Levrero-Florencio, Youngkyu Lee, Jay Pathak, George Em Karniadakis

AI总结 该研究提出了一种名为NSPOD的新型预处理方法,旨在加速求解参数化偏微分方程的Krylov子空间迭代求解器。NSPOD基于深度算子网络,能够在无需重新训练的情况下,有效适应任意非结构网格,并显著减少求解迭代次数,其效果优于现有的代数多网格等先进预处理方法。研究通过固体力学方程在复杂CAD几何中的数值实验验证了该方法的有效性,为开发更高效的混合预处理技术提供了新思路。

Comments 17 pages, 9 figures, 3 tables

详情
英文摘要

The convergence of Krylov-based linear iterative solvers applied to parametric partial differential equations (PDEs) is often highly sensitive to the domain, its discretization, the location/values of the applied Dirichlet/Neumann boundary conditions, body forces and material properties, among others. We have previously introduced hybridization of classical linear iterative solvers with neural operators for specific geometries, but they tend to not perform well on geometries not previously seen during training. We partially addressed this challenge by introducing the deep operator network Geo-DeepONet and hybridizing it with Krylov-based iterative linear solvers, which, despite learning effectively across arbitrary unstructured meshes without requiring retraining, led to only modest reductions in iterations compared to state-of-the-art preconditioners. In this study we introduce Neural Subspace Proper Orthogonal Decomposition (NSPOD), a multigrid-like deep operator network-based preconditioner which can dramatically reduce the number of iterations needed for convergence in Krylov-based linear iterative solvers, even when compared to state-of-the-art methods such as algebraic multigrid preconditioners. We demonstrate its efficiency via numerical experiments on a linearized version of solid mechanics PDEs applied to unstructured domains obtained from complex CAD geometries. We expect that the findings in this study lead to more efficient hybrid preconditioners that can match, or possibly even surpass, the convergence properties of the current gold standard preconditioning methods for solid mechanics PDEs.